Results 21 to 30 of about 222,810 (285)

Bicriteria Oversampling for Imbalanced Data Classification

open access: yesProcedia Computer Science, 2022
The paper proposes bicriteria oversampling strategy for mining imbalanced data. We use two specialized criteria for oversampling -classification potential and distance from the borderline between minority and majority instances. The potential is to be maximized and the distance minimized.
Joanna Jedrzejowicz, Piotr Jedrzejowicz
openaire   +1 more source

A New Big Data Model Using Distributed Cluster-Based Resampling for Class-Imbalance Problem

open access: yesApplied Computer Systems, 2019
The class imbalance problem, one of the common data irregularities, causes the development of under-represented models. To resolve this issue, the present study proposes a new cluster-based MapReduce design, entitled Distributed Cluster-based Resampling ...
Terzi Duygu Sinanc, Sagiroglu Seref
doaj   +1 more source

Evaluating Misclassifications in Imbalanced Data [PDF]

open access: yes, 2006
Evaluating classifier performance with ROC curves is popular in the machine learning community. To date, the only method to assess confidence of ROC curves is to construct ROC bands. In the case of severe class imbalance with few instances of the minority class, ROC bands become unreliable.
William Elazmeh   +2 more
openaire   +1 more source

Single-Point Crossover and Jellyfish Optimization for Handling Imbalanced Data Classification Problem

open access: yesIEEE Access, 2022
The imbalanced datasets and their classification has pulled in as a hot research topic over the years. It is used in different fields, for example, security, finance, health, and many others.
Abeer S. Desuky   +4 more
doaj   +1 more source

A novel hybrid predictive maintenance model based on clustering, smote and multi-layer perceptron neural network optimised with grey wolf algorithm

open access: yesSN Applied Sciences, 2021
Considering the complexities and challenges in the classification of multiclass and imbalanced fault conditions, this study explores the systematic combination of unsupervised and supervised learning by hybridising clustering (CLUST) and optimised multi ...
Albert Buabeng   +3 more
doaj   +1 more source

Improved PSO_AdaBoost Ensemble Algorithm for Imbalanced Data

open access: yesSensors, 2019
The Adaptive Boosting (AdaBoost) algorithm is a widely used ensemble learning framework, and it can get good classification results on general datasets.
Kewen Li   +4 more
doaj   +1 more source

Imbalanced Data Classification Algorithm Based on CSD-ELM [PDF]

open access: yesJisuanji gongcheng, 2019
The Extreme Learning Machine(ELM) based on cost-sensitive learning has its advantages in dealing with imbalanced data classification problems.However,it fails to consider the distribution characteristics of samples in different classes and the importance
WANG Dafei, XIE Wujie, DONG Wenhan
doaj   +1 more source

TGT: A Novel Adversarial Guided Oversampling Technique for Handling Imbalanced Datasets

open access: yesEgyptian Informatics Journal, 2021
With the volume of data increasing exponentially, there is a growing interest in helping people to benefit from their data regardless of its poor quality.
Ayat Mahmoud   +3 more
doaj   +1 more source

Clustering and Community Detection with Imbalanced Clusters [PDF]

open access: yes, 2016
Spectral clustering methods which are frequently used in clustering and community detection applications are sensitive to the specific graph constructions particularly when imbalanced clusters are present. We show that ratio cut (RCut) or normalized cut (
Aksoylar, Cem   +2 more
core   +1 more source

Local case-control sampling: Efficient subsampling in imbalanced data sets [PDF]

open access: yes, 2014
For classification problems with significant class imbalance, subsampling can reduce computational costs at the price of inflated variance in estimating model parameters.
Fithian, William, Hastie, Trevor
core   +1 more source

Home - About - Disclaimer - Privacy