Results 11 to 20 of about 14,384 (164)
Replacing suffix trees with enhanced suffix arrays
zbMATH Open Web Interface contents unavailable due to conflicting licenses.
Abouelhoda, Mohamed Ibrahim +2 more
openaire +3 more sources
On the combinatorics of suffix arrays [PDF]
We prove several combinatorial properties of suffix arrays, including a characterization of suffix arrays through a bijection with a certain well-defined class of permutations. Our approach is based on the characterization of Burrows-Wheeler arrays given
Kucherov, Gregory +2 more
core +5 more sources
Dynamic extended suffix arrays
zbMATH Open Web Interface contents unavailable due to conflicting licenses.
Salson, Mikael +3 more
openaire +5 more sources
GHOSTX: an improved sequence homology search algorithm using a query suffix array and a database suffix array. [PDF]
DNA sequences are translated into protein coding sequences and then further assigned to protein families in metagenomic analyses, because of the need for sensitivity.
Shuji Suzuki +3 more
doaj +1 more source
Generalized enhanced suffix array construction in external memory
Background Suffix arrays, augmented by additional data structures, allow solving efficiently many string processing problems. The external memory construction of the generalized suffix array for a string collection is a fundamental task when the size of ...
Felipe A. Louza +3 more
doaj +1 more source
Suffix-Sorting via Shannon-Fano-Elias Codes
Given a sequence T = t0t1 . . . tn-1 of size n = |T|, with symbols from a fixed alphabet Σ, (|Σ| ≤ n), the suffix array provides a listing of all the suffixes of T in a lexicographic order.
Donald Adjeroh, Fei Nan
doaj +1 more source
TOPAZ: asymmetric suffix array neighbourhood search for massive protein databases
Background Protein homology search is an important, yet time-consuming, step in everything from protein annotation to metagenomics. Its application, however, has become increasingly challenging, due to the exponential growth of protein databases.
Alan Medlar, Liisa Holm
doaj +1 more source
Designing small universal k-mer hitting sets for improved analysis of high-throughput sequencing. [PDF]
With the rapidly increasing volume of deep sequencing data, more efficient algorithms and data structures are needed. Minimizers are a central recent paradigm that has improved various sequence analysis tasks, including hashing for faster read overlap ...
Yaron Orenstein +4 more
doaj +1 more source
Fast mapping of short sequences with mismatches, insertions and deletions using index structures. [PDF]
With few exceptions, current methods for short read mapping make use of simple seed heuristics to speed up the search. Most of the underlying matching models neglect the necessity to allow not only mismatches, but also insertions and deletions.
Steve Hoffmann +7 more
doaj +1 more source
Scalable Parallel Suffix Array Construction [PDF]
Suffix arrays are a simple and powerful data structure for text processing that can be used for full text indexes, data compression, and many other applications in particular in bioinformatics. We describe the first implementation and experimental evaluation of a scalable parallel algorithm for suffix array construction.
Kulla, F., Sanders, P.
openaire +3 more sources

