Results 11 to 20 of about 39,918 (213)
LoRAS: an oversampling approach for imbalanced datasets [PDF]
AbstractThe Synthetic Minority Oversampling TEchnique (SMOTE) is widely-used for the analysis of imbalanced datasets. It is known that SMOTE frequently over-generalizes the minority class, leading to misclassifications for the majority class, and effecting the overall balance of the model.
Saptarshi Bej +4 more
openaire +3 more sources
Learning Imbalanced Datasets With Maximum Margin Loss
A learning algorithm referred to as Maximum Margin (MM) is proposed for considering the class-imbalance data learning issue: the trained model tends to predict the majority of classes rather than the minority ones. That is, underfitting for minority classes seems to be one of the challenges of generalization.
Kang, Haeyong, Vu, Thang, Yoo, Chang D.
openaire +2 more sources
Conversion of adverse data corpus to shrewd output using sampling metrics
An imbalanced dataset is commonly found in at least one class, which are typically exceeded by the other ones. A machine learning algorithm (classifier) trained with an imbalanced dataset predicts the majority class (frequently occurring) more than the ...
Shahzad Ashraf +4 more
doaj +1 more source
Active Learning for Imbalanced Datasets
Active learning increases the effectiveness of labeling when only subsets of unlabeled datasets can be processed manually. To our knowledge, existing algorithms are designed under the assumption that datasets are balanced. However, many real-life datasets are actually imbalanced and we propose two adaptations of active learning to tackle imbalance ...
Aggarwal, Umang +2 more
openaire +2 more sources
Active Class Incremental Learning for Imbalanced Datasets [PDF]
Accepted in IPCV workshop from ...
Eden Belouadah +3 more
openaire +2 more sources
SSMFN: a fused spatial and sequential deep learning model for methylation site prediction [PDF]
Background Conventional in vivo methods for post-translational modification site prediction such as spectrophotometry, Western blotting, and chromatin immune precipitation can be very expensive and time-consuming.
Favorisen Rosyking Lumbanraja +4 more
doaj +2 more sources
IDPP: Imbalanced Datasets Pipelines in Pyrus
We showcase and demonstrate IDPP, a Pyrus-based tool that offers a collection of pipelines for the analysis of imbalanced datasets. Like Pyrus, IDPP is a web-based, low-code/no-code graphical modelling environment for ML and data analytics applications. On a case study from the medical domain, we solve the challenge of re-using AI/ML models that do not
Amandeep Singh, Olga Minguett
openaire +2 more sources
Multilayer Feedforward Neural Network for Internet Traffic Classification.
Recently, the efficient internet traffic classification has gained attention in order to improve service quality in IP networks. But the problem with the existing solutions is to handle the imbalanced dataset which has high uneven distribution of flows ...
N. Manju, B. S. Harish, N. Nagadarshan
doaj +1 more source
Federated Learning on Clinical Benchmark Data: Performance Assessment
BackgroundFederated learning (FL) is a newly proposed machine-learning method that uses a decentralized dataset. Since data transfer is not necessary for the learning process in FL, there is a significant advantage in protecting personal privacy ...
Lee, Geun Hyeong, Shin, Soo-Yong
doaj +1 more source
Bayes classifiers for imbalanced traffic accidents datasets [PDF]
Traffic accidents data sets are usually imbalanced, where the number of instances classified under the killed or severe injuries class (minority) is much lower than those classified under the slight injuries class (majority). This, however, supposes a challenging problem for classification algorithms and may cause obtaining a model that well cover the ...
Mujalli, R +2 more
openaire +3 more sources

