Results 11 to 20 of about 224,004 (328)
A Method for Analyzing the Performance Impact of Imbalanced Binary Data on Machine Learning Models
Machine learning models may not be able to effectively learn and predict from imbalanced data in the fields of machine learning and data mining. This study proposed a method for analyzing the performance impact of imbalanced binary data on machine ...
Ming Zheng +5 more
doaj +1 more source
Imbalanced Data Classification Method Based on LSSASMOTE
Imbalanced data exist extensively in the real world, and the classification of imbalanced data is a hot topic in machine learning. In order to classify imbalanced data more effectively, an oversampling method named LSSASMOTE is proposed in this paper ...
Zhi Wang, Qicheng Liu
doaj +1 more source
Multicriteria Classifier Ensemble Learning for Imbalanced Data
One of the vital problems with the imbalanced data classifier training is the definition of an optimization criterion. Typically, since the exact cost of misclassification of the individual classes is unknown, combined metrics and loss functions that ...
Weronika Wegier +2 more
doaj +1 more source
Bicriteria Oversampling for Imbalanced Data Classification
The paper proposes bicriteria oversampling strategy for mining imbalanced data. We use two specialized criteria for oversampling -classification potential and distance from the borderline between minority and majority instances. The potential is to be maximized and the distance minimized.
Joanna Jedrzejowicz, Piotr Jedrzejowicz
openaire +1 more source
A method for increasing accuracy of credit imbalanced data [PDF]
The main goal of this research is to provide a method that can be used to increase the accuracy of credit imbalance data. Financial fraud is a fundamental problem that affects both the financial sector and life and plays an important role in affecting ...
Arash GhorbanniaDelavar +1 more
doaj +1 more source
Hellinger Distance Trees for Imbalanced Streams [PDF]
Classifiers trained on data sets possessing an imbalanced class distribution are known to exhibit poor generalisation performance. This is known as the imbalanced learning problem.
Brooke, J. M. +3 more
core +2 more sources
Framework for imbalanced data classification
Classifying imbalanced data remains a challenging task. The paper presents a framework for imbalanced datasets classification which makes use of different methods of oversampling and methods of dynamical selection of classifiers. The framework allows to perform extensive experiments to determine best possible configuration for the examined dataset in ...
Mikołaj Błaszczyk +1 more
+4 more sources
Box Drawings for Learning with Imbalanced Data [PDF]
The vast majority of real world classification problems are imbalanced, meaning there are far fewer data from the class of interest (the positive class) than from other classes.
Abe N. +4 more
core +3 more sources
A Novel Imbalanced Ensemble Learning in Software Defect Predication
With the availability of high-speed Internet and the advent of Internet of Things devices, modern software systems are growing in both size and complexity. Software defect prediction (SDP) guarantees the high quality of such complex systems. However, the
Jianming Zheng +4 more
doaj +1 more source
A New Big Data Model Using Distributed Cluster-Based Resampling for Class-Imbalance Problem
The class imbalance problem, one of the common data irregularities, causes the development of under-represented models. To resolve this issue, the present study proposes a new cluster-based MapReduce design, entitled Distributed Cluster-based Resampling ...
Terzi Duygu Sinanc, Sagiroglu Seref
doaj +1 more source

