Results 31 to 40 of about 3,606,574 (279)
Robust nearest-neighbor methods for classifying high-dimensional data [PDF]
We suggest a robust nearest-neighbor approach to classifying high-dimensional data. The method enhances sensitivity by employing a threshold and truncates to a sequence of zeros and ones in order to reduce the deleterious impact of heavy-tailed data ...
Chan, Yao-ban, Hall, Peter
core +2 more sources
Program Evaluation and Causal Inference with High-Dimensional Data [PDF]
In this paper, we provide efficient estimators and honest confidence bands for a variety of treatment effects including local average (LATE) and local quantile treatment effects (LQTE) in data-rich environments. We can handle very many control variables,
Belloni, Alexandre +3 more
core +2 more sources
Growing evidence suggests a wide spectrum of potential cardiovascular complications following cancer therapies, leading to an urgent need for better risk-stratifying and disease screening in patients undergoing oncological treatment.
Haidee Chen +5 more
doaj +1 more source
Learning Discriminative Bayesian Networks from High-dimensional Continuous Neuroimaging Data [PDF]
Due to its causal semantics, Bayesian networks (BN) have been widely employed to discover the underlying data relationship in exploratory studies, such as brain research.
Liu, Lingqiao +4 more
core +3 more sources
This paper provides a brief introduction to high-dimensional data, a form of ‘Big Data’, and gives an overview of several data analysis concepts and techniques that could be used to explore and analyse such data. An example that involves genomics data from several Sri Lankan and United Kingdom oral cancer patients is used to illustrate the methods.
Dhammika Amaratunga, Javier Cabrera
openaire +2 more sources
Finding k-Dominant G-Skyline Groups on High Dimensional Data
Skyline query retrieves a set of skyline points which are not dominated by any other point and has attracted wide attention in database community. Recently, an important variant G-Skyline is developed. It aims to return optimal groups of points. However,
Kaiqi Zhang +3 more
doaj +1 more source
A classification method for high‐dimensional imbalanced multi‐classification data
High‐dimensional imbalanced multi‐classification problems (HDIMCPs) occur frequently in engineering applications such as medical detection, item classification, and email classification.
Mengmeng Li +5 more
doaj +1 more source
Consistent and Flexible Selectivity Estimation for High-dimensional Data
Selectivity estimation aims at estimating the number of database objects that satisfy a selection criterion. Answering this problem accurately and efficiently is essential to many applications, such as density estimation, outlier detection, query ...
Ishikawa, Yoshiharu +7 more
core +1 more source
An efficient predictive analytics system for high dimensional big data
The excessive growth of high dimensional big data has resulted in a greater challenge for data scientists to efficiently obtain valuable knowledge from these data. Traditional data mining techniques are not fit to process big data.
Myat Cho Mon Oo, Thandar Thein
doaj +1 more source
Asymptotic inference for high-dimensional data
In this paper, we study inference for high-dimensional data characterized by small sample sizes relative to the dimension of the data. In particular, we provide an infinite-dimensional framework to study statistical models that involve situations in ...
Kuelbs, Jim, Vidyashankar, Anand N.
core +1 more source

