Results 1 to 10 of about 175,878 (142)
Given the prevalence of missing data in modern statistical research, a broad range of methods is available for any given imputation task. How does one choose the `best' imputation method in a given application? The standard approach is to select some observations, set their status to missing, and compare prediction accuracy of the methods under ...
Näf, Jeffrey +3 more
openaire +3 more sources
Genotype imputation is now an essential tool in the analysis of genome-wide association scans. This technique allows geneticists to accurately evaluate the evidence for association at genetic markers that are not directly genotyped. Genotype imputation is particularly useful for combining results across studies that rely on different genotyping ...
Yun Li +3 more
openaire +3 more sources
Imputation can be used to obtain a large number of high-density genotypes at the cost of procuring low-density panels. Accurate imputation requires a well-formed reference population of high-density genotypes to enable statistical inference. Five methods
Rudi A. McEwin +6 more
doaj +1 more source
Methods to Handle Incomplete Data
Context: The major question for data analysis is determining the appropriate analytic approach in the presence of incomplete observations. The most common solution to handle missing data in a data set is imputation, where missing values are estimated and
Vinny Johny +2 more
doaj +1 more source
Molgenis-impute: imputation pipeline in a box [PDF]
Genotype imputation is an important procedure in current genomic analysis such as genome-wide association studies, meta-analyses and fine mapping. Although high quality tools are available that perform the steps of this process, considerable effort and expertise is required to set up and run a best practice imputation pipeline, particularly for larger ...
Kanterakis, Alexandros +5 more
openaire +2 more sources
To ensure scientific reproducibility of metabolomics data, alternative statistical methods are needed. A paradigm shift away from the p-value toward an embracement of uncertainty and interval estimation of a metabolite’s true effect size may lead to ...
Christopher E. Gillies +7 more
doaj +1 more source
Imputation for Repeated Bounded Outcome Data: Statistical and Machine-Learning Approaches
Real-life data are bounded and heavy-tailed variables. Zero-one-inflated beta (ZOIB) regression is used for modelling them. There are no appropriate methods to address the problem of missing data in repeated bounded outcomes.
Urko Aguirre-Larracoechea +1 more
doaj +1 more source
K Nearest Neighbor Imputation Performance on Missing Value Data Graduate User Satisfaction
A missing value is a common problem of most data processing in scientific research, which results in a lack of accuracy of research results. Several methods have been applied as a missing value solution, such as deleting all data that have a missing ...
Abdul Fadlil, Herman, Dikky Praseptian M
doaj +1 more source
Outcome-sensitive multiple imputation: a simulation study
Background Multiple imputation is frequently used to deal with missing data in healthcare research. Although it is known that the outcome should be included in the imputation model when imputing missing covariate values, it is not known whether it should
Evangelos Kontopantelis +3 more
doaj +1 more source
In livestock, many studies have reported the results of imputation to 50k single nucleotide polymorphism (SNP) genotypes for animals that are genotyped with low-density SNP panels.
M.P.L. Calus +4 more
doaj +1 more source

