APPLYING A Q-GRAM BASED MULTIPLE STRING MATCHING ALGORITHM FOR APPROXIMATE MATCHING

Main Article Content

DOI

Robert Susik

rsusik@kis.p.lodz.pl

Abstract

We consider the application of multiple pattern matching (Multi AOSO on q-Grams) algorithm for approximate pattern matching. We propose the on-line approach which translates the problem from approximate pattern matching into a multiple pattern one (called partitioning into exact search). Presented solution allows relatively fast search multiple patterns in text with given k-differences(or mismatches). This paper presents comparison of solution based on MAG algorithm, and [4]. Experiments on DNA, English, Proteins and XML texts with up to k errors show that the new proposed algorithm achieves relatively good results in practical use.

Keywords:

text processing, approximate string matching, string algorithms, q-gram

References

Article Details

Susik, R. (2017). APPLYING A Q-GRAM BASED MULTIPLE STRING MATCHING ALGORITHM FOR APPROXIMATE MATCHING. Informatyka, Automatyka, Pomiary W Gospodarce I Ochronie Środowiska, 7(3), 47–50. https://doi.org/10.5604/01.3001.0010.5214