新規登録 | ログイン | FAQ      [?] 
CiteULike is a free online bibliography manager. Register and you can start organising your references online.
Recent | Unread | Search | Authors | Tags | Export

Approximate string matching in sublinear expected time

by: WI Chang, EL Lawler
Foundations of Computer Science, 1990. Proceedings., 31st Annual Symposium on (1990), pp. 116-124 vol.1.


View FullText article


X Reviews [Write a review of this article]

There are no reviews of this article

X Find related articles from these CiteULike users

X Find related articles with these CiteULike tags

X Abstract

The <e1>k</e1> differences approximate string matching problem specifies a text string of length <e1>n</e1>, a pattern string of length <e1>m</e1>, and the number <e1>k</e1> of differences (insertions, deletions, substitutions) allowed in a match, and asks for every location in the text where a match occurs. Previous algorithms required at least <e1>O</e1>(<e1>nk</e1>) time. When <e1>k</e1> is as large as a fraction of <e1>m</e1>, no substantial progress has been made over <e1>O </e1>(<e1>nm</e1>) dynamic programming. The authors have investigated much faster algorithms for restricted cases of the problem, such as when the text string is random and errors are not too frequent. They have devised an algorithm that, for <e1>k</e1><<e1>m</e1>/log <e1>n</e1>+ <e1>O</e1>(1), runs in time <e1>O</e1>((<e1>n</e1>/<e1>m</e1>)<e1>k</e1> log <e1>n</e1>) on the average. In the worst case their algorithm is <e1>O</e1>(<e1>nk</e1>), but it is still an improvement in that it is very practical and uses only <e1>O</e1>(<e1>n</e1>) space compared with <e1>O</e1>(<e1>n</e1>) or <e1>O</e1>(<e1>n</e1><sup>2</sup>). The authors define an approximate substring matching problem and give efficient algorithms based on their techniques. Special cases include several applications to genetics and molecular biology


X BibTeX record

X RIS record



RIS BibTeX
CiteULike organises scholarly (or academic) papers or literature and provides bibliographic (which means it makes bibliographies) for universities and higher education establishments. It helps undergraduates and postgraduates. People studying for PhDs or in postdoctoral (postdoc) positions. The service is similar in scope to EndNote or RefWorks or any other reference manager like BibTeX, but it is a social bookmarking service for scientists and humanities researchers.