Results 41 to 50 of about 4,092,880 (310)
Nonparametric Tests Applicable to High Dimensional Data
Constructions of data driven ordering of set of multivariate observations are presented. The methods employ also dissimilarity measures. The ranks are used in the construction of test statistics for location problem and in the construction of the ...
Frantisek Rublik
doaj +1 more source
Missing Data Imputation with High-Dimensional Data
Imputation of missing data in high-dimensional datasets with more variables P than samples N, P≫N, is hampered by the data dimensionality. For multivariate imputation, the covariance matrix is ill conditioned and cannot be properly estimated. For fully conditional imputation, the regression models for imputation cannot include all the variables.
Alberto Brini, Edwin R. van den Heuvel
openaire +1 more source
High Dimensional Data Clustering using Self-Organized Map
As the population grows and e economic development, houses could be one of basic needs of every family. Therefore, housing investment has promising value in the future. This research implements the Self-Organized Map (SOM) algorithm to cluster house data
Ruth Ema Febrita +2 more
doaj +1 more source
Data for Ensemble Riemannian Data Assimilation for High-dimensional Nonlinear Dynamics
Tamang, Sagar, K; Ebtehaj, Ardeshir; van Leeuwen, Peter, J; Lerman, Gilad; Foufoula-Georgiou, Efi. (2021). Data for Ensemble Riemannian Data Assimilation for High-dimensional Nonlinear Dynamics.
Ebtehaj, Ardeshir +4 more
core +1 more source
Growing evidence suggests a wide spectrum of potential cardiovascular complications following cancer therapies, leading to an urgent need for better risk-stratifying and disease screening in patients undergoing oncological treatment.
Haidee Chen +5 more
doaj +1 more source
Recession forecasting with high‐dimensional data [PDF]
AbstractIn this paper, a large amount of different financial and macroeconomic variables are used to predict the U.S. recession periods. We propose a new cost‐sensitive extension to the gradient boosting model, which can take into account the class imbalance problem of the binary response variable.
openaire +1 more source
Finding k-Dominant G-Skyline Groups on High Dimensional Data
Skyline query retrieves a set of skyline points which are not dominated by any other point and has attracted wide attention in database community. Recently, an important variant G-Skyline is developed. It aims to return optimal groups of points. However,
Kaiqi Zhang +3 more
doaj +1 more source
A classification method for high‐dimensional imbalanced multi‐classification data
High‐dimensional imbalanced multi‐classification problems (HDIMCPs) occur frequently in engineering applications such as medical detection, item classification, and email classification.
Mengmeng Li +5 more
doaj +1 more source
An efficient predictive analytics system for high dimensional big data
The excessive growth of high dimensional big data has resulted in a greater challenge for data scientists to efficiently obtain valuable knowledge from these data. Traditional data mining techniques are not fit to process big data.
Myat Cho Mon Oo, Thandar Thein
doaj +1 more source
Feature Selection in High Dimensional Biomedical Data Based on BF-SFLA
High-dimensional biomedical data contained many irrelevant or weakly correlated features, which affected the efficiency of disease diagnosis. This manuscript presented a feature selection method for high-dimensional biomedical data based on the ...
Yongqiang Dai +3 more
doaj +1 more source

