Replacing suffix trees with enhanced suffix arrays
zbMATH Open Web Interface contents unavailable due to conflicting licenses.
Abouelhoda, Mohamed Ibrahim +2 more
openaire +3 more sources
A quick tour on suffix arrays and compressed suffix arrays
Roberto Grossi
openaire +5 more sources
On the combinatorics of suffix arrays [PDF]
We prove several combinatorial properties of suffix arrays, including a characterization of suffix arrays through a bijection with a certain well-defined class of permutations. Our approach is based on the characterization of Burrows-Wheeler arrays given
Kucherov, Gregory +2 more
core +5 more sources
Dynamic extended suffix arrays
Salson, Mikael +3 more
openaire +5 more sources
GHOSTX: an improved sequence homology search algorithm using a query suffix array and a database suffix array. [PDF]
DNA sequences are translated into protein coding sequences and then further assigned to protein families in metagenomic analyses, because of the need for sensitivity.
Shuji Suzuki +3 more
doaj +1 more source
Generalized enhanced suffix array construction in external memory
Background: Suffix arrays, augmented by additional data structures, allow solving efficiently many string processing problems. The external memory construction of the generalized suffix array for a string collection is a fundamental task when the size of ...
Felipe A. Louza +3 more
doaj +1 more source
Suffix-Sorting via Shannon-Fano-Elias Codes
Given a sequence T = t_0 t_1 ... t_{n-1} of size n = |T|, with symbols from a fixed alphabet Σ (|Σ| ≤ n), the suffix array provides a listing of all the suffixes of T in lexicographic order.
Donald Adjeroh, Fei Nan
doaj +1 more source
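The definition quoted in this abstract can be illustrated with a minimal sketch. It is a naive construction by direct sorting, not the Shannon-Fano-Elias method the paper describes, and the function name `suffix_array` is a hypothetical choice made for this example.

```python
def suffix_array(text: str) -> list[int]:
    """Return the start positions of all suffixes of text in lexicographic order.

    Naive approach: compare the suffixes directly as strings. This is
    fine for short inputs but far slower than dedicated suffix-sorting
    algorithms on long ones.
    """
    return sorted(range(len(text)), key=lambda i: text[i:])


# Suffixes of "banana" in sorted order: a, ana, anana, banana, na, nana
print(suffix_array("banana"))  # [5, 3, 1, 0, 4, 2]
```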
On the suitability of suffix arrays for Lempel-Ziv data compression [PDF]
Lossless compression algorithms of the Lempel-Ziv (LZ) family are widely used nowadays. Regarding time and memory requirements, LZ encoding is much more demanding than decoding.
D. Gusfield +9 more
core +2 more sources
TOPAZ: asymmetric suffix array neighbourhood search for massive protein databases
Background: Protein homology search is an important, yet time-consuming, step in everything from protein annotation to metagenomics. Its application, however, has become increasingly challenging, due to the exponential growth of protein databases.
Alan Medlar, Liisa Holm
doaj +1 more source
Designing small universal k-mer hitting sets for improved analysis of high-throughput sequencing. [PDF]
With the rapidly increasing volume of deep sequencing data, more efficient algorithms and data structures are needed. Minimizers are a central recent paradigm that has improved various sequence analysis tasks, including hashing for faster read overlap ...
Yaron Orenstein +4 more
doaj +1 more source

