Affiliation:
1. Université Paris Diderot
2. University of Chile
Abstract
Self-indexes are able to represent a text asymptotically within the information-theoretic lower bound under the
k
th order entropy model and offer access to any text substring and indexed pattern searches. Their time complexities are not optimal, however; in particular, they are always multiplied by a factor that depends on the alphabet size. In this article, we achieve, for the first time,
full alphabet independence
in the time complexities of self-indexes while retaining space optimality. We also obtain some relevant byproducts.
Funder
Agence Nationale de la Recherche
Millennium Institute for Cell Dynamics and Biotechnology
Publisher
Association for Computing Machinery (ACM)
Subject
Mathematics (miscellaneous)
Reference47 articles.
1. Fast text searching for regular expressions or automaton searching on tries
2. R. Baeza-Yates and B. Ribeiro-Neto. 2011. Modern Information Retrieval (2nd ed.). Addison-Wesley. R. Baeza-Yates and B. Ribeiro-Neto. 2011. Modern Information Retrieval (2nd ed.). Addison-Wesley.
Cited by
42 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献