The Prague Stringology Conference 2003

Tetsuya Nakatoh, Kensuke Baba, Daisuke Ikeda, Yasuhiro Yamada and Sachio Hirokawa

An Efficient Mapping for Score of String Matching

Abstract:
This paper proposes an efficient algorithm to solve the problem of string matching with mismatches. For a text of length n and a pattern of length m over an alphabet Σ, the problem is known to be solved in O(|Σ| n log m) time by computing a score by the fast Fourier transformation (FFT). Atallah et al. introduced a randomized algorithm in which the time complexity can be decreased by the trade-off with the accuracy of the estimates for the score. The algorithm in the present paper yields an estimate with smaller variance compared to that the algorithm by Atallah et al., moreover, and computes the exact score in O(|Σ| n log m) time. The present paper also gives two methods to improve the algorithm and an exact estimation of the variance of the estimates for the score.

Download paper: Article in PostScript Article in PDF
 PostScript   PDF