Results 21 to 30 of about 1,077,907 (260)
HIGH-DIMENSIONAL DATA ANALYSIS [PDF]
High-Dimensional Classification: High-Dimensional Classification (J-Q Fan et al.) Flexible Large Margin Classifiers (Y-F Liu & Y-C Wu) Large-Scale Multiple Testing: Large-Scale Multiple Testing (T T Cai & W-G Sun) Model Building with Variable Selection: Model Building with Variable Selection (M Yuan) Bayesian Variable Selection in Regression with ...
Tony Cai, Xiaotong Shen
openaire +1 more source
The properties of nonuniformity analysis of high dimensional data
A novel approach to outlier detection and clustering on the ground of the distribution of distances between multidimensional points is presented. The basic idea is to eval uate the outlier factor for each data point.
Vydūnas Šaltenis
doaj +3 more sources
Data integration with high dimensionality
SummaryWe consider situations where the data consist of a number of responses for each individual, which may include a mix of discrete and continuous variables. The data also include a class of predictors, where the same predictor may have different physical measurements across different experiments depending on how the predictor is measured.
Gao, Xin, Carroll, Raymond J.
openaire +4 more sources
RLE plots: Visualizing unwanted variation in high dimensional data. [PDF]
Unwanted variation can be highly problematic and so its detection is often crucial. Relative log expression (RLE) plots are a powerful tool for visualizing such variation in high dimensional data.
Luke C Gandolfo, Terence P Speed
doaj +1 more source
Nonparametric Tests Applicable to High Dimensional Data
Constructions of data driven ordering of set of multivariate observations are presented. The methods employ also dissimilarity measures. The ranks are used in the construction of test statistics for location problem and in the construction of the ...
Frantisek Rublik
doaj +1 more source
High Dimensional Data Clustering using Self-Organized Map
As the population grows and e economic development, houses could be one of basic needs of every family. Therefore, housing investment has promising value in the future. This research implements the Self-Organized Map (SOM) algorithm to cluster house data
Ruth Ema Febrita +2 more
doaj +1 more source
Domain adaptation is a popular paradigm in modern machine learning which aims at tackling the problem of divergence (or shift) between the labeled training and validation datasets (source domain) and a potentially large unlabeled dataset (target domain).
Evgeny M. Mirkes +5 more
doaj +1 more source
Growing evidence suggests a wide spectrum of potential cardiovascular complications following cancer therapies, leading to an urgent need for better risk-stratifying and disease screening in patients undergoing oncological treatment.
Haidee Chen +5 more
doaj +1 more source
This paper provides a brief introduction to high-dimensional data, a form of ‘Big Data’, and gives an overview of several data analysis concepts and techniques that could be used to explore and analyse such data. An example that involves genomics data from several Sri Lankan and United Kingdom oral cancer patients is used to illustrate the methods.
Dhammika Amaratunga, Javier Cabrera
openaire +2 more sources
Finding k-Dominant G-Skyline Groups on High Dimensional Data
Skyline query retrieves a set of skyline points which are not dominated by any other point and has attracted wide attention in database community. Recently, an important variant G-Skyline is developed. It aims to return optimal groups of points. However,
Kaiqi Zhang +3 more
doaj +1 more source

