Finding needles in haystacks: linking scientific names, reference specimens and molecular data for Fungi [PDF]
DNA phylogenetic comparisons have shown that morphology-based species recognition often underestimates fungal diversity. Therefore, the need for accurate DNA sequence data, tied to both correct taxonomic names and clearly annotated specimen data, has ...
Abarenkov, K+100 more
core +2 more sources
Developing lifelong learning algorithms are mandatory for computational systems biology. Recently, many studies have shown how to extract biologically relevant information from high-dimensional data to understand the complexity of cancer by taking the ...
Erdenebileg Batbaatar+6 more
doaj +1 more source
Improved ontology for eukaryotic single-exon coding sequences in biological databases [PDF]
Indexación: Scopus.Efficient extraction of knowledge from biological data requires the development of structured vocabularies to unambiguously define biological terms. This paper proposes descriptions and definitions to disambiguate the term 'single-exon
Clausen, P.+4 more
core +2 more sources
ImageNet: A large-scale hierarchical image database
The explosion of image data on the Internet has the potential to foster more sophisticated and robust models and algorithms to index, retrieve, organize and interact with images and multimedia data.
Jia Deng+5 more
semanticscholar +1 more source
Southern African Treatment Resistance Network (SATuRN) RegaDB HIV drug resistance and clinical management database: supporting patient management, surveillance and research in southern Africa [PDF]
Substantial amounts of data have been generated from patient management and academic exercises designed to better understand the human immunodeficiency virus (HIV) epidemic and design interventions to control it.
Bester, Armand+17 more
core +3 more sources
Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.
The BLAST programs are widely used tools for searching protein and DNA databases for sequence similarities. For protein comparisons, a variety of definitional, algorithmic and statistical refinements described here permits the execution time of the BLAST
S. Altschul+6 more
semanticscholar +1 more source
GTDB-Tk: a toolkit to classify genomes with the Genome Taxonomy Database
Summary The Genome Taxonomy Database Toolkit (GTDB-Tk) provides objective taxonomic assignments for bacterial and archaeal genomes based on the GTDB. GTDB-Tk is computationally efficient and able to classify thousands of draft genomes in parallel.
Pierre-Alain Chaumeil+3 more
semanticscholar +1 more source
miRBase Tracker : keeping track of microRNA annotation changes [PDF]
Since 2002, information on individual microRNAs (miRNAs), such as reference names and sequences, has been stored in miRBase, the reference database for miRNA annota- tion.
Anckaert, Jasper+10 more
core +2 more sources
The Pfam protein families database in 2019
The last few years have witnessed significant changes in Pfam (https://pfam.xfam.org). The number of families has grown substantially to a total of 17,929 in release 32.0.
Sara El-Gebali+15 more
semanticscholar +1 more source
A collection of database industrial techniques and optimization approaches of database operations [PDF]
Databases play an essential role in our society today. Databases are embedded in sectors like corporations, institutions, and government organizations, among others. These databases are used for our video and audio streaming platforms, social gaming, finances, cloud storage, e-commerce, healthcare, economy, etc. It is therefore imperative that we learn
arxiv +1 more source