Results 1 to 10 of about 139,430 (360)
Tree boosting is a highly effective and widely used machine learning method. In this paper, we describe a scalable end-to-end tree boosting system called XGBoost, which is used widely by data scientists to achieve state-of-the-art results on many machine learning challenges.
Chen, Tianqi, Guestrin, Carlos
openaire +4 more sources
Multiple Imputation Through XGBoost
The use of multiple imputation (MI) is becoming increasingly popular for addressing missing data. Although some conventional MI approaches have been well studied and have shown empirical validity, they have limitations when processing large datasets with complex data structures.
Yongshi Deng, Thomas Lumley
openalex +4 more sources
Imbalance-XGBoost: Leveraging Weighted and Focal Losses for Binary\n Label-Imbalanced Classification with XGBoost [PDF]
The paper presents Imbalance-XGBoost, a Python package that combines the powerful XGBoost software with weighted and focal losses to tackle binary label-imbalanced classification tasks. Though a small-scale program in terms of size, the package is, to the best of the authors' knowledge, the first of its kind which provides an integrated implementation ...
Chen Wang, Chengyuan Deng, Suzhen Wang
openalex +4 more sources
Breast Cancer Classification using XGBoost
Breast cancer continues to be one of the foremost illnesses that results in the deaths of numerous women each year. Among the female population, approximately 8% are diagnosed with Breast cancer (BC), following Lung Cancer. The alarming rise in fatality rates can be attributed to breast cancer being the second leading cause.
Rahmanul Hoque +3 more
openaire +2 more sources
Influence-Balanced XGBoost: Improving XGBoost for Imbalanced Data Using Influence Functions
Decision tree boosting algorithms, such as XGBoost, have demonstrated superior predictive performance on tabular data for supervised learning compared to neural networks. However, recent studies on loss functions for imbalanced data have primarily focused on deep learning.
Akiyoshi Sutou, Jinfang Wang
openaire +3 more sources
Optimization and Application of XGBoost Logging Prediction Model for Porosity and Permeability Based on K-means Method [PDF]
The prediction and distribution of reservoir porosity and permeability are of paramount importance for the exploration and development of regional oil and gas resources.
Jianting Zhang +3 more
openalex +2 more sources
Classification of UNSW-NB15 dataset using Exploratory Data Analysis using Ensemble Learning [PDF]
Recent advancements in machine learning have made it a tool of choice for different classification and analytical problems. This paper deals with a critical field of computer networking: network security and the possibilities of machine ...
Neha Sharma +2 more
doaj +1 more source
Forecasting time series is crucial for financial research and decision-making in business. The nonlinearity of stock market prices profoundly impacts global economic and financial sectors. This study focuses on modeling and forecasting the daily prices of key stock indices - MASI, CAC 40, DAX, FTSE 250, NASDAQ, and HKEX, representing the Moroccan,
HASSAN OUKHOUYA +3 more
openaire +1 more source
The aim of this study was to determine the best non-linear function describing the growth of the Linda goose breed. To achieve this aim, five non-linear functions, such as exponential, logistic, von Bertalanffy, Brody and Gompertz, were employed to ...
Cem Tirink +3 more
doaj +1 more source
Most kidney cancers are kidney renal clear cell carcinoma (KIRC) that is a main cause of cancer-related deaths. Polygenic risk score (PRS) is a weighted linear combination of phenotypic related alleles on the genome that can be used to assess KIRC risk ...
Xiaoyu Hou +9 more
doaj +1 more source

