Results 1 to 10 of about 9,043,080 (327)
Optimization of the Mainzelliste software for fast privacy-preserving record linkage
Background Data analysis for biomedical research often requires a record linkage step to identify records from multiple data sources referring to the same person.
Florens Rohde +4 more
doaj +1 more source
The PRIDE database resources in 2022: a hub for mass spectrometry-based proteomics evidences
The PRoteomics IDEntifications (PRIDE) database (https://www.ebi.ac.uk/pride/) is the world's largest data repository of mass spectrometry-based proteomics data.
Yasset Pérez-Riverol +13 more
semanticscholar +1 more source
Predict Diabetes Using Voting Classifier and Hyper Tuning Technique
Today, diabetes is one of the most common chronic diseases in the world due to the people’s sedentary lifestyle which led to many health issues like heart attack, kidney frailer and blindness.
Chra Ali Kamal, Manal Ali Atiyah
doaj +3 more sources
The optimal cut‐off value in fit‐based colorectal cancer screening: An observational study
Background Colorectal cancer (CRC) screening programs using fecal immunochemical test (FIT) have to choose a cut‐off value to decide which citizens to recall for colonoscopy. The evidence on the optimal cut‐off value is sparse and based on studies with a
Sisse Helle Njor +8 more
doaj +1 more source
Pfam: The protein families database in 2021
The Pfam database is a widely used resource for classifying protein sequences into families and domains. Since Pfam was last described in this journal, over 350 new families have been added in Pfam 33.1 and numerous improvements have been made to ...
Jaina Mistry +11 more
semanticscholar +1 more source
Can LLM Already Serve as A Database Interface? A BIg Bench for Large-Scale Database Grounded Text-to-SQLs [PDF]
Text-to-SQL parsing, which aims at converting natural language instructions into executable SQLs, has gained increasing attention in recent years. In particular, Codex and ChatGPT have shown impressive results in this task. However, most of the prevalent
Jinyang Li +15 more
semanticscholar +1 more source
Operationalizing and automating Data Governance
The ability to cross data from multiple sources represents a competitive advantage for organizations. Yet, the governance of the data lifecycle, from the data sources into valuable insights, is largely performed in an ad-hoc or manual manner.
Sergi Nadal +3 more
doaj +1 more source
Maximum Influential Location Selection With Differentially Private User Locations
The widespread use of mobile devices and social network services has made optimal location queries an important research topic. Previous studies have focused on the problem of maximum influential (Max-inf) location selection, that is, finding a location ...
Sehwa Park, Junkyu Lee, Seog Park
doaj +1 more source
GTDB-Tk: a toolkit to classify genomes with the Genome Taxonomy Database
Summary The Genome Taxonomy Database Toolkit (GTDB-Tk) provides objective taxonomic assignments for bacterial and archaeal genomes based on the GTDB. GTDB-Tk is computationally efficient and able to classify thousands of draft genomes in parallel.
Pierre-Alain Chaumeil +3 more
semanticscholar +1 more source
The AlphaFold Protein Structure Database (AlphaFold DB, https://alphafold.ebi.ac.uk) is an openly accessible, extensive database of high-accuracy protein-structure predictions.
M. Váradi +26 more
semanticscholar +1 more source

