The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation. [PDF]
AbstractBackgroundTo evaluate binary classifications and their confusion matrices, scientific researchers can employ several statistical rates, accordingly to the goal of the experiment they are investigating. Despite being a crucial issue in machine learning, no widespread consensus has been reached on a unified elective chosen measure yet.
Chicco D, Jurman G.
europepmc +6 more sources
X-ray is not inferior to CT in terms of F1 score in the diagnosis of foreign body aspiration: a recall, precision and F1 score performance analysis based on bronchoscopically proven cases. [PDF]
In this study, we aimed to evaluate the diagnostic accuracy of X-ray and CT by using the F1 score with its non-inferiority margin in patients who underwent bronchoscopy with suspected diagnoses of foreign body aspiration (FBA).All children aged under 18 who underwent bronchoscopy with suspected diagnoses of FBA between June 2020 and December 2023 were ...
Sarac F, Yazici M.
europepmc +4 more sources
A Comprehensive Analysis on Detecting Chronic Kidney Disease by Employing Machine Learning Algorithms [PDF]
INTRODUCTION: Chronic Kidney Disease refers to the slow, progressive deterioration of kidney functions. However, the impairment is irreversible and imperceptible up until the disease reaches one of the later stages ...
Mirza Nishat +7 more
doaj +1 more source
Evolving A Neural Network to Predict Diabetic Neuropathy [PDF]
One of the main areas where machine learning (ML) techniques are used vastly is in prediction of diseases. Diabetic neuropathy (DN) disease is a complication of diabetes which causes damage to nerves.
Shiva Reddy, Gadiraju Mahesh, N. Preethi
doaj +1 more source
Towards an Online Empathy Assisted Counselling Web Application [PDF]
INTRODUCTION: In today's society mental health is becoming increasingly important. As a result, more and more individuals need guidance and counselling.
Aarif Mawani, Lawrence Nderu
doaj +1 more source
90% F1 Score in Relation Triple Extraction: Is it Real?
Accepted in GenBench workshop @ EMNLP ...
Saini, Pratik +3 more
openaire +2 more sources
Confidence interval for micro-averaged F1 and macro-averaged F1 scores [PDF]
AbstractA binary classification problem is common in medical field, and we often use sensitivity, specificity, accuracy, negative and positive predictive values as measures of performance of a binary predictor. In computer science, a classifier is usually evaluated with precision (positive predictive value) and recall (sensitivity). As a single summary
Kanae Takahashi +3 more
openaire +2 more sources
Estimating the Uncertainty of Average F1 Scores [PDF]
In multi-class text classification, the performance (effectiveness) of a classifier is usually measured by micro-averaged and macro-averaged F1 scores. However, the scores themselves do not tell us how reliable they are in terms of forecasting the classifier's future performance on unseen data.
Dell Zhang, Jun Wang 0012, Xiaoxue Zhao
openaire +1 more source
A Bayesian Hierarchical Model for Comparing Average F1 Scores [PDF]
In multi-class text classification, the performance (effectiveness) of a classifier is usually measured by micro-averaged and macro-averaged F1 scores. However, the scores themselves do not tell us how reliable they are in terms of forecasting the classifier's future performance on unseen data.
Dell Zhang +3 more
openaire +1 more source
sigmoidF1: A Smooth F1 Score Surrogate Loss for Multilabel Classification
Multiclass multilabel classification is the task of attributing multiple labels to examples via predictions. Current models formulate a reduction of the multilabel setting into either multiple binary classifications or multiclass classification, allowing for the use of existing loss functions (sigmoid, cross-entropy, logistic, etc.).
Bénédict, G. +3 more
openaire +4 more sources

