Yayın:
TARA: An algorithm for fast searching patterns on text files of multiple

Placeholder

Tarih

Akademik Birimler

Kurum Yazarları

Yok

Yazarlar

Külekçi, Muhammed Oǧuzhan

Danışman

Dil

Yayıncı:

IEEE

Dergi Başlığı

Dergi ISSN

Cilt Başlığı

Özet

This work introduces a new multi-pattern matching algorithm that performs searching of fixed-length strings on text files very fast by benefiting from bit-parallelism. The algorithm is given name tara. Bounded gaps as well as character classes in keywords are also supported. Although the worst case time complexity is quadratic, it performs very fast in practise. Experiments are conducted to compare the performance of the proposed algorithm with widely used GNU grep file search utility and also with 9 variants of Aho&Corasick and Comentz&Walter algorithms on natural language text. On the average tara is approximately 10% faster then grep, where up to 70% percent speed up is observed. The benchmark with the AC and CW variants results that the speed up obtained by tara is 3,5 times relative to its nearest successor.

Açıklama

Bu çalışma, 07-09, Kasım 2007 tarihlerinde Ankara[Türkiye]’de düzenlenen 22. International Symposium on Computer and Information Sciences Kongresi‘nde bildiri olarak sunulmuştur.

Kaynak:

Anahtar Kelimeler:

Konusu

Computer science, Engineering, Boolean functions, Communication, Cybernetics, Pattern matching, Information management, Information science, Speed, Bit parallelism, File search, Time complexities, International symposium, Text files, Keywords (CO), Speed ups, Natural language texts, Worst case, Pattern matching algorithms, Algorithms

Alıntı

Külekçi, M. O. (2007). "TARA: An algorithm for fast searching patterns on text files of multiple". 22. International Symposium on Computer and Information Sciences, 136-141.

Endorsement

Review

Supplemented By

Referenced By

3

Views

0

Downloads