Fast index based algorithms and software for matching position specific scoring matrices
Background: In biological sequence analysis, position specific scoring matrices (PSSMs) are widely used to represent sequence motifs in nucleotide as well as amino acid sequences.
Homann Robert +3 more
SArKS: de novo discovery of gene expression regulatory motif sites and domains by suffix array kernel smoothing.
Wylie DC, Hofmann HA, Zemelman BV.
Compressed Suffix Arrays for Massive Data
We present a fast space-efficient algorithm for constructing compressed suffix arrays (CSA). The algorithm requires O(n log n) time in the worst case, and only O(n) bits of extra space in addition to the CSA. As the basic step, we describe an algorithm for merging two CSAs.
RIsearch2: suffix array-based large-scale prediction of RNA-RNA interactions and siRNA off-targets.
Alkan F +7 more
SA-SSR: a suffix array-based algorithm for exhaustive and efficient SSR discovery in large genetic sequences.
Pickett BD +7 more
Linear-time computation of minimal absent words using suffix array.
Barton C +3 more
Linear-time Suffix Sorting - A New Approach for Suffix Array Construction
Baier, Uwe
Acceleration of short and long DNA read mapping without loss of accuracy using suffix array.
Tárraga J +8 more
Suffix arrays in theory and practice.
The suffix array of a string is a permutation of all starting positions of the string's suffixes in lexicographical order. In this thesis, we investigate mathematical and algorithmic aspects of suffix arrays. The first part mainly deals with combinatorial properties of suffix arrays and their enumeration.
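As a small illustration of the suffix array definition quoted in the abstract above, here is a minimal Python sketch. The function name `suffix_array` is hypothetical, chosen only for this example, and the construction is the naive sort-all-suffixes approach. It is not the method of any work listed here, and it is far slower than the linear-time constructions some of these titles refer to.

```python
def suffix_array(text: str) -> list[int]:
    """Return the starting positions of all suffixes of `text`,
    ordered by the lexicographical order of the suffixes.

    Naive construction (an assumption made for clarity): each comparison
    may scan a whole suffix, so this is only suitable for short strings.
    """
    return sorted(range(len(text)), key=lambda i: text[i:])


if __name__ == "__main__":
    text = "banana"
    sa = suffix_array(text)
    # Print each suffix in sorted order next to its starting position.
    for pos in sa:
        print(pos, text[pos:])
```

For the string "banana", the suffixes in sorted order are "a", "ana", "anana", "banana", "na", "nana", so the result is the permutation [5, 3, 1, 0, 4, 2] of the positions 0 through 5.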
mkESA: enhanced suffix array construction tool.
Homann R +3 more

