Results 61 to 70 of about 313,820 (280)
Background Longitudinal categorical variables are sometimes restricted in terms of how individuals transition between categories over time. For example, with a time-dependent measure of smoking categorised as never-smoker, ex-smoker, and current-smoker ...
Anurika Priyanjali De Silva +4 more
doaj +1 more source
Improved Mean Methods of Imputation
Replacing missing values of a variable with the mean of the non-missing values is a simple and natural way to impute values fortunately in the case where data is missing completely at random. Following a short review of this method we consider thus possible improvements, are called the shrinkage method, a second called the weighted interval method, and
Choukri Mohamed +2 more
openaire +2 more sources
I-Impute: a self-consistent method to impute single cell RNA sequencing data [PDF]
Abstract Background Single-cell RNA-sequencing (scRNA-seq) is becoming indispensable in the study of cell-specific transcriptomes. However, in scRNA-seq techniques, only a small fraction of the genes are captured due to “dropout” events.
Feng, Xikang +3 more
openaire +3 more sources
This work identified serum proteins associated with pancreatic epithelial neoplasms (PanINs) and early‐stage PDAC. Proteomics screens assessed genetically engineered mice with abundant PanINs, KPC mice (Lox‐STOP‐Lox‐KrasG12D/+ Lox‐STOP‐Lox‐Trp53R172H/+ Pdx1‐Cre) before PDAC development and also early‐stage PDAC patients (n = 31), compared to benign ...
Hannah Mearns +10 more
wiley +1 more source
Tumors contain diverse cellular states whose behavior is shaped by context‐dependent gene coordination. By comparing gene–gene relationships across biological contexts, we identify adaptive transcriptional modules that reorganize into distinct vulnerability axes.
Brian Nelson +9 more
wiley +1 more source
This paper examines the sample proportions estimates in the presence of univariate missing categorical data. A database about smoking habits (2011 National Addiction Survey of Mexico) was used to create simulated yet realistic datasets at rates 5% and 15%
Torres Munguía, Juan Armando
doaj
K-Means Clustering with KNN and Mean Imputation on CPU Benchmark Compilation Data
In the rapidly evolving digital age, data is becoming a valuable source for decision-making and analysis. Clustering, as an important technique in data analysis, has a key role in organizing and understanding complex datasets.
Rofiq Muhammad Syauqi +2 more
doaj +1 more source
Integration of survey data and big observational data for finite population inference using mass imputation [PDF]
Multiple data sources are becoming increasingly available for statistical analyses in the era of big data. As an important example in finite-population inference, we consider an imputation approach to combining a probability sample with big observational
Kim, Jae Kwang +2 more
core +3 more sources
Variable selection with Random Forests for missing data [PDF]
Variable selection has been suggested for Random Forests to improve their efficiency of data prediction and interpretation. However, its basic element, i.e.
Hapfelmeier, Alexander, Ulm, Kurt
core +1 more source
A multiple-phenotype imputation method for genetic studies [PDF]
Genetic association studies have yielded a wealth of biological discoveries. However, these studies have mostly analyzed one trait and one SNP at a time, thus failing to capture the underlying complexity of the data sets. Joint genotype-phenotype analyses of complex, high-dimensional data sets represent an important way to move beyond simple genome ...
Andrew Dahl +8 more
openaire +5 more sources

