Publication:
TARA: An algorithm for fast searching patterns on text files of multiple

Placeholder

Date

Organizational Units

Authors

Yok

Authors

Külekçi, Muhammed Oǧuzhan

Advisor

Language

Publisher:

IEEE

Journal Title

Journal ISSN

Volume Title

Abstract

This work introduces a new multi-pattern matching algorithm that performs searching of fixed-length strings on text files very fast by benefiting from bit-parallelism. The algorithm is given name tara. Bounded gaps as well as character classes in keywords are also supported. Although the worst case time complexity is quadratic, it performs very fast in practise. Experiments are conducted to compare the performance of the proposed algorithm with widely used GNU grep file search utility and also with 9 variants of Aho&Corasick and Comentz&Walter algorithms on natural language text. On the average tara is approximately 10% faster then grep, where up to 70% percent speed up is observed. The benchmark with the AC and CW variants results that the speed up obtained by tara is 3,5 times relative to its nearest successor.

Description

Bu çalışma, 07-09, Kasım 2007 tarihlerinde Ankara[Türkiye]’de düzenlenen 22. International Symposium on Computer and Information Sciences Kongresi‘nde bildiri olarak sunulmuştur.

Source:

Keywords:

Keywords

Computer science, Engineering, Boolean functions, Communication, Cybernetics, Pattern matching, Information management, Information science, Speed, Bit parallelism, File search, Time complexities, International symposium, Text files, Keywords (CO), Speed ups, Natural language texts, Worst case, Pattern matching algorithms, Algorithms

Citation

Külekçi, M. O. (2007). "TARA: An algorithm for fast searching patterns on text files of multiple". 22. International Symposium on Computer and Information Sciences, 136-141.

Endorsement

Review

Supplemented By

Referenced By

1

Views

0

Downloads