Publication: TARA: An algorithm for fast searching patterns on text files of multiple
Date
Authors
Yok
Authors
Külekçi, Muhammed Oǧuzhan
Advisor
Language
Publisher:
IEEE
Journal Title
Journal ISSN
Volume Title
Abstract
This work introduces a new multi-pattern matching algorithm that performs searching of fixed-length strings on text files very fast by benefiting from bit-parallelism. The algorithm is given name tara. Bounded gaps as well as character classes in keywords are also supported. Although the worst case time complexity is quadratic, it performs very fast in practise. Experiments are conducted to compare the performance of the proposed algorithm with widely used GNU grep file search utility and also with 9 variants of Aho&Corasick and Comentz&Walter algorithms on natural language text. On the average tara is approximately 10% faster then grep, where up to 70% percent speed up is observed. The benchmark with the AC and CW variants results that the speed up obtained by tara is 3,5 times relative to its nearest successor.
Description
Bu çalışma, 07-09, Kasım 2007 tarihlerinde Ankara[Türkiye]’de düzenlenen 22. International Symposium on Computer and Information Sciences Kongresi‘nde bildiri olarak sunulmuştur.
Source:
Keywords:
Keywords
Computer science, Engineering, Boolean functions, Communication, Cybernetics, Pattern matching, Information management, Information science, Speed, Bit parallelism, File search, Time complexities, International symposium, Text files, Keywords (CO), Speed ups, Natural language texts, Worst case, Pattern matching algorithms, Algorithms
Citation
Külekçi, M. O. (2007). "TARA: An algorithm for fast searching patterns on text files of multiple". 22. International Symposium on Computer and Information Sciences, 136-141.