CoLoRMap: Correcting Long Reads by Mapping short reads [PDF]
Abstract Motivation Second generation sequencing technologies paved the way to an exceptional increase in the number of sequenced genomes, both prokaryotic and eukaryotic. However, short reads are difficult to assemble and often lead to highly fragmented assemblies.
Ehsan, Haghshenas +3 more
openaire +2 more sources
Structural Refinement by Direct Mapping Reveals Assembly Inconsistencies near Hi-C Junctions
High-throughput chromosome conformation capture (Hi-C) is widely used for scaffolding in de novo assembly because it produces highly contiguous genomes, but its indirect statistical approach can introduce connection errors.
Luca Marcolungo +10 more
doaj +1 more source
ACCURATE LONG READ MAPPING USING ENHANCED SUFFIX ARRAYS [PDF]
With the rise of high throughput sequencing, new programs have been developed for dealing with the alignment of a huge amount of short read data to reference genomes. Recent developments in sequencing technology allow longer reads, but the mappers for short reads are not suited for reads of several hundreds of base pairs.
Vyverman, Michaƫl +4 more
openaire +3 more sources
Jabba: hybrid error correction for long sequencing reads using maximal exact matches [PDF]
Third generation sequencing platforms produce longer reads with higher error rates than second generation sequencing technologies. While the improved read length can provide useful information for downstream analysis, underlying algorithms are challenged
Audenaert, P. +4 more
core +1 more source
Merging short and stranded long reads improves transcript assembly.
Long-read RNA sequencing has arisen as a counterpart to short-read sequencing, with the potential to capture full-length isoforms, albeit at the cost of lower depth.
Amoldeep S Kainth +3 more
doaj +1 more source
The hidden perils of read mapping as a quality assessment tool in genome sequencing [PDF]
This article provides a comparative analysis of the various methods of genome sequencing focusing on verification of the assembly quality. The results of a comparative assessment of various de novo assembly tools, as well as sequencing technologies, are ...
A Bankevich +14 more
core +1 more source
With the rapid development of single molecular sequencing (SMS) technologies such as PacBio single-molecule real-time and Oxford Nanopore sequencing, the output read length is continuously increasing, which has dramatical potentials on cutting-edge ...
Ze-Gang Wei +6 more
doaj +1 more source
Length-independent DNA packing into nanopore zero-mode waveguides for low-input DNA sequencing [PDF]
Compared with conventional methods, single-molecule real-time (SMRT) DNA sequencing exhibits longer read lengths than conventional methods, less GC bias, and the ability to read DNA base modifications.
Henley, Robert Y. +4 more
core +1 more source
smsMap: mapping single molecule sequencing reads by locating the alignment starting positions
Background Single Molecule Sequencing (SMS) technology can produce longer reads with higher sequencing error rate. Mapping these reads to a reference genome is often the most fundamental and computing-intensive step for downstream analysis. Most existing
Ze-Gang Wei, Shao-Wu Zhang, Fei Liu
doaj +1 more source
Inference of Markovian Properties of Molecular Sequences from NGS Data and Applications to Comparative Genomics [PDF]
Next Generation Sequencing (NGS) technologies generate large amounts of short read data for many different organisms. The fact that NGS reads are generally short makes it challenging to assemble the reads and reconstruct the original genome sequence. For
Cannon, Charles H. +5 more
core +1 more source

