Results 11 to 20 of about 4,225,561 (321)
Improving accuracy of missing data imputation in data mining
In fact, raw data in the real world is dirty. Each large data repository contains various types of anomalous values that influence the result of the analysis, since in data mining, good models usually need good data, databases in the world are not always
Nzar A. Ali, Zhyan M. Omer
doaj +1 more source
Background: Missing values in data are found in a large number of studies in the field of medical sciences, especially longitudinal ones, in which repeated measurements are taken from each person during the study.
Amin Golabpour +4 more
doaj +1 more source
Traffic Flow Prediction With Missing Data Imputed by Tensor Completion Methods
Missing data is inevitable and ubiquitous in intelligent transportation systems (ITSs). A handful of completion methods have been proposed, among which the tensor-based models have been shown to be the most advantageous for missing traffic data ...
Qin Li +4 more
doaj +1 more source
Joint Models for Incomplete Longitudinal Data and Time-to-Event Data
Clinical studies often collect longitudinal and time-to-event data for each subject. Joint modeling is a powerful methodology for evaluating the association between these data.
Yuriko Takeda +2 more
doaj +1 more source
Can k-NN imputation improve the performance of C4.5 with small software project data sets? A comparative evaluation [PDF]
Missing data is a widespread problem that can affect the ability to use data to construct effective prediction systems. We investigate a common machine learning technique that can tolerate missing values, namely C4.5, to predict cost using six real world
Albrecht +60 more
core +1 more source
Random Forest variable importance with missing data [PDF]
Random Forests are commonly applied for data prediction and interpretation. The latter purpose is supported by variable importance measures that rate the relevance of predictors. Yet existing measures can not be computed when data contains missing values.
Hapfelmeier, Alexander +2 more
core +1 more source
Missing data imputation using classification and regression trees [PDF]
Background Missing data are common when analyzing real data. One popular solution is to impute missing data so that one complete dataset can be obtained for subsequent data analysis.
Cheng-Yang Chen, Yu-Wei Chang
doaj +2 more sources
Inference and Missing Data [PDF]
ABSTRACTTwo results are presented concerning inference when data may be missing. First, ignoring the process that causes missing data when making sampling distribution inferences about the parameter of the data, θ, is generally appropriate if and only if the missing data are “missing at random” and the observed data are “observed at random,” and then ...
openaire +2 more sources
High-Dimensional Matched Subspace Detection When Data are Missing [PDF]
We consider the problem of deciding whether a highly incomplete signal lies within a given subspace. This problem, Matched Subspace Detection, is a classical, well-studied problem when the signal is completely observed. High- dimensional testing problems
Balzano, Laura +2 more
core +3 more sources
Multiple imputation with missing indicators as proxies for unmeasured variables: simulation study
Background Within routinely collected health data, missing data for an individual might provide useful information in itself. This occurs, for example, in the case of electronic health records, where the presence or absence of data is informative.
Matthew Sperrin, Glen P. Martin
doaj +1 more source

