Results 1 to 10 of about 8,999,941 (280)
The PRIDE database resources in 2022: a hub for mass spectrometry-based proteomics evidences
The PRoteomics IDEntifications (PRIDE) database (https://www.ebi.ac.uk/pride/) is the world's largest data repository of mass spectrometry-based proteomics data.
Yasset Pérez-Riverol +13 more
semanticscholar +1 more source
Pfam: The protein families database in 2021
The Pfam database is a widely used resource for classifying protein sequences into families and domains. Since Pfam was last described in this journal, over 350 new families have been added in Pfam 33.1 and numerous improvements have been made to ...
Jaina Mistry +11 more
semanticscholar +1 more source
Can LLM Already Serve as A Database Interface? A BIg Bench for Large-Scale Database Grounded Text-to-SQLs [PDF]
Text-to-SQL parsing, which aims at converting natural language instructions into executable SQLs, has gained increasing attention in recent years. In particular, Codex and ChatGPT have shown impressive results in this task. However, most of the prevalent
Jinyang Li +15 more
semanticscholar +1 more source
GTDB-Tk: a toolkit to classify genomes with the Genome Taxonomy Database
Summary The Genome Taxonomy Database Toolkit (GTDB-Tk) provides objective taxonomic assignments for bacterial and archaeal genomes based on the GTDB. GTDB-Tk is computationally efficient and able to classify thousands of draft genomes in parallel.
Pierre-Alain Chaumeil +3 more
semanticscholar +1 more source
The aptamer database is designed to contain comprehensive sequence information on aptamers and unnatural ribozymes that have been generated by in vitro selection methods. Such data are not normally collected in 'natural' sequence databases, such as GenBank.
Jennifer F, Lee +3 more
openaire +2 more sources
FAIR principles and the IEDB: short-term improvements and a long-term vision of OBO-foundry mediated machine-actionable interoperability. [PDF]
The Immune Epitope Database (IEDB), at www.iedb.org, has the mission to make published experimental data relating to the recognition of immune epitopes easily available to the scientific public.
Mungall, Christopher J +4 more
core +2 more sources
The AlphaFold Protein Structure Database (AlphaFold DB, https://alphafold.ebi.ac.uk) is an openly accessible, extensive database of high-accuracy protein-structure predictions.
M. Váradi +26 more
semanticscholar +1 more source
Benchmarking database systems for Genomic Selection implementation [PDF]
Motivation: With high-throughput genotyping systems now available, it has become feasible to fully integrate genotyping information into breeding programs.
Guignon, Valentin +10 more
core +2 more sources
Places: A 10 Million Image Database for Scene Recognition
The rise of multi-million-item dataset initiatives has enabled data-hungry machine learning algorithms to reach near-human semantic classification performance at tasks such as visual object and scene recognition.
Bolei Zhou +4 more
semanticscholar +1 more source
Improved ontology for eukaryotic single-exon coding sequences in biological databases [PDF]
Indexación: Scopus.Efficient extraction of knowledge from biological data requires the development of structured vocabularies to unambiguously define biological terms. This paper proposes descriptions and definitions to disambiguate the term 'single-exon
Clausen, P. +4 more
core +2 more sources

