Abstract
The dynamic analysis of municipal solid waste (MSW) is essential for optimizing landfills and advancing sustainable development goals. Assessing damping ratio (D), a critical dynamic parameter, under laboratory conditions is costly and time-consuming, requiring specialized equipment and expertise. To streamline this process, this research leveraged several novel ensemble machine learning models integrated with the equilibrium optimizer algorithm (EOA) for the predictive analysis of damping characteristics. Data were gathered from 153 cyclic triaxial experiments on MSW, which examined the age, shear strain, weight, frequency, and percentage of plastic content. Analysis of a correlation heatmap indicated a significant dependence of D on shear strain within the collected MSW data. Subsequently, five advanced machine learning methods—adaptive boosting (AdaBoost), gradient boosting regression tree (GBRT), extreme gradient boosting (XGBoost), random forest (RF), and cubist regression—were employed to model D in landfill structures. Among these, the GBRT-EOA model demonstrated superior performance, with a coefficient of determination (R2) of 0.898, root mean square error of 1.659, mean absolute error of 1.194, mean absolute percentage error of 0.095, and an a20-index of 0.891 for the test data. A Shapley additive explanation analysis was conducted to validate these models further, revealing the relative contributions of each studied variable to the predicted D-MSW. This holistic approach not only enhances the understanding of MSW dynamics but also aids in the efficient design and management of landfill systems.
Similar content being viewed by others
Introduction
Annually, a significant volume of waste is produced worldwide, posing a major pollution challenge. According to a report by the World Bank, global waste generation is projected to increase to approximately 2.59 billion tons by 2030 and 3.4 billion tons by 20501. About 40% of this waste is disposed of in landfills, considerably impacting the environment. The potential locations for constructing new landfills are increasingly limited due to stringent environmental regulations, rapid urbanization, and local opposition from residents near proposed sites2. Consequently, focusing on the design and development of sanitary landfills represents a pivotal step toward sustainable waste management. Landfills are exposed to a broad spectrum of failures under various loading conditions, as evidenced by incidents in OII (1994), Payatas (2000), Ano Liosia (2003), Leuwigajah (2005), and Shenzhen (2015). Studies of these failures reveal that multiple factors can precipitate such outcomes, including inadequate liner-waste contact, insufficient shear strength of the landfills, and other structural deficiencies3. Seismic events, in particular, can significantly compromise landfill integrity, leading to damage in the cover systems, disruption of leachate collection, and failures in gas collection systems. Therefore, municipal solid waste (MSW) components in landfills must be designed to remain secure and operational both during and after seismic activities to mitigate these risks. A thorough review of the literature shows that extensive research has been conducted to assess the behavior of MSW under dynamic loads on a laboratory scale. Notable examples include the cyclic investigations of MSW properties by Karim-pour Fard et al.4 and the study of MSW particle behavior concerning cyclic parameters by Rawat and Mohanty5. Damping ratio (D), a critical cyclic parameter, is the capacity of soil or similar materials to dissipate energy from vibrations or seismic waves. This property is pivotal in soil-structure interaction studies, particularly seismic response analysis. Understanding D is crucial as it provides valuable insights into how soils react under cyclic loads and assists in evaluating the stability and performance of geotechnical systems. The importance of D in earthquake design is underscored by numerous experimental studies. These include the use of a cyclic triaxial device to examine D of rubber particles mixed with clay by Akbarimehr and Fakharian6, the investigation of D in soft marine clay under cyclic shear stresses utilizing a cyclic simple shear device by Patino and Galindo7, and the study of D of clays across a wide range of shear strains from 0.001% to 1% employing a triaxial simple shear machine by Abdellaziz et al.8. It is crucial to acknowledge that the structure and performance of MSW can vary significantly across different regions due to variations in community consumption habits, weather conditions, and societal factors. For example, waste composition markedly differs between industrialized and developing countries9. Adopting computational methodologies can be a noticeable advancement in examining cyclic parameters, as it can overcome the limitations, high costs, and other challenges associated with laboratory and field evaluations. This approach enhances efficiency and provides a broader, more adaptable framework for understanding geotechnical behaviors under seismic conditions. Machine learning (ML) and artificial intelligence (AI) techniques are increasingly applied across various prediction problems, leading to the development of computational models based on empirical data10. Research in this field includes models for predicting the properties of MSW, such as the high heating quantity by Bagheri et al.11 and the biomass and waste properties by Liang et al.12. Additionally, Khatti and Grover13 assessed the compaction parameters of fine-grained soils using gene expression programming (GEP), least square support vector machine (LSSVM), long-short-term memory (LSTM), and artificial neural networks (ANNs). Comparative testing demonstrates that the linear LSSVM model MD110 and polynomial LSSVM model MD15 exhibited superior predictive capabilities. Ensemble ML models offer significant advantages over single models, primarily due to their improved prediction accuracy, robustness against overfitting, and enhanced generalization capabilities. By aggregating insights from multiple predictive models, ensembles leverage collective wisdom, which helps capture varied, subtle patterns within data that individual models might miss. This aggregation leads to better performance, as errors from individual models tend to cancel out, reducing bias and variance14. Moreover, ensembles are flexible and adaptable, capable of integrating different model types and structures to handle diverse data and relationships effectively. This makes them particularly effective in scenarios where the underlying data patterns are complex, and the cost of incorrect predictions is high15. Moradi Moghaddam et al.16 evaluated the shear modulus parameters of MSW and clay soil using various computational techniques, including random forest (RF), gradient boosting regression trees (GBRT), extreme gradient boosting (XGBoost), and adaptive boosting (AdaBoost). The study found that the GBRT methodology outperformed other methods in terms of model capabilities (R2, RMSE, MAE, and MAPE). Similarly, Ahmad et al.17 implemented an XGBoost model to predict the shear strength of rockfill materials. The performance of this model was assessed and compared with other predictive models such as AdaBoost, K-nearest neighbor, support vector machine, and random forest, with findings indicating that the XGBoost model presented the highest forecasting accuracy. Numerous studies have focused on various AI approaches to analyze the dynamic characteristics of all types of geomaterials, emphasizing the significance of these factors (Table 1). The DEEPSOIL program has been utilized to simulate landfill structures under various seismic loads, using dynamic parameters derived from numerical approaches18,19. The evaluation of numerical techniques in predicting the seismic properties of landfill components is an area of considerable scholarly interest, as it underscores the application of ML in the geo-environmental sector. This approach not only saves time and money in laboratory and field tests but also addresses the notable scarcity of data regarding the cyclic properties of MSW. Consequently, this article focuses on developing a reliable method for calculating D, an emerging topic that promises to enrich our understanding of seismic engineering in landfills. Drawing from previous studies that display the high accuracy of ML techniques in estimating MSW properties, this research employs ensemble-based methods to predict D parameter of landfill MSW components. Initially, the study determines factors influencing D-MSW through a cyclic triaxial device. Subsequently, prediction models for D in landfill components will be developed using various methods, including GBRT, XGBoost, RF, AdaBoost, and Cubist, which are then hybridized with the equilibrium optimization algorithm (EOA). EOA is a robust optimization method inspired by the thermodynamic principle of mass balance, effectively balancing exploration and exploitation to efficiently find optimal solutions in complex problem spaces20. This makes it especially adept at dealing with nonlinear, multimodal, and high-dimensional challenges where traditional methods may falter. The EOA’s adaptability and simplicity, with fewer parameters to manage and no need for complex operations like those in genetic algorithms (GAs) or particle swarm optimization (PSO), enhance its appeal21. In predictive modeling, such as for D in seismic analysis, EOA can remarkably improve ensemble models by optimizing model parameters and combining predictions effectively, leading to more accurate and reliable outputs22. This adaptability and efficiency make EOA an attractive choice for complex optimization tasks, ensuring robustness and superior performance across various applications23.
Proposed models in this research offer geotechnical engineers vital tools for enhancing the seismic resilience of landfills. These models are capable of predicting the dynamic properties of MSW across a variety of conditions, facilitating precise and informed decision-making in engineering practices. Utilizing Shapley additive explanation (SHAP) values, the models deliver clear, interpretable insights into how various factors such as the age, shear strain, and material composition influence dynamic behavior16. This detailed analysis helps engineers prioritize specific design and operational adjustments, like optimizing compaction methods or selecting appropriate materials aimed at bolstering seismic stability. In essence, these models convert complex dataset analyses into practical, actionable guidance that significantly improves landfill design and management, contributing to enhanced safety and resilience against seismic activities24.
Experimental framework
Characteristics of Kahrizak landfill site
Kahrizak landfill site (KLS), with an area of about 15 km2, is the largest landfill in the Middle East. About 8000 tons of waste per day enter from Tehran province to this center. Based on the samples of waste collected for ingredient study, organic material comprised most of the components across a wide range of particle dimensions37. KLS is located in a vulnerable ecological area, and the landfill’s activities have resulted in environmental deterioration, groundwater and soil pollution, and considerable health problems for surrounding populations. Figure 1 depicts a broad picture of various MSW samples at ages of fresh, 7.5 years, and 16 years.
Experimental assessments of D-MSW
In this section, three types of MSW specimens are recognized and gathered: fresh, 7.5 years, and 16 years. The aging process in waste samples causes significant chemical and physical modifications in the composition. The examination of the variations in waste elements over various ages reveals that as the age of trash grows up to 16 years, the proportion of moisture content and organic materials drops (193% and 188%, respectively), while the amount of plastic components rises (about 160%) (Table 2).
D-MSW was determined through experiments employing a strain-controlled cyclic triaxial machine (Fig. 2) and consolidated undrained compressive tests. The experimental setup included two pore water pressure sensors (accuracy of 0.2 kPa and measurement range of 0–1000 kPa), one axial strain sensor (accuracy of ± 0.001 mm and a measurement range of 70 mm (horizontal)), and one load cell (precision of 0.0025 kN and a measurement range of ± 20 kN). The cyclic triaxial apparatus utilized for testing could accommodate samples with a height of 20 cm and a diameter of 10 cm, allowing the assessment of dynamic properties within the shear strain (varies from 0.08% to 4%). The selection of the maximum particle dimension was based on recommendations from Zekkos et al.38, specifying that the most significant size might be up to one-third of the device’s sample size. Notably, ASTM D4767 and ASTM D3999 standards were applied during the loading and analyzing phases, respectively, considering the distinct characteristics of MSW.
To empirically assess the shear modulus parameter, 153 cyclic triaxial tests were conducted on waste specimens. These tests covered various factors such as the age (ranging from 0 to 16 years), plastic content (ranging from 0 to 0.274), confining pressures at two values of 75 kPa and 150 kPa (chosen to reflect conditions at KLS with a maximum waste height of 10 m to 20 m), unit weights of 9 kN/m3and 12 kN/m3 (corresponding to nearly observed unit weights for a fresh and aged sample of wastes in the ranges of 7 kN/m3 to 10 kN/m3 and 11 kN/m3 to 13 kN/m3), and frequencies of 0.1 Hz, 0.5 Hz, and 1 Hz, and shear strain levels of 0.075, 0.148, 0.37, 0.735, 1.83, and 3.63. The cyclic triaxial tests were conducted on reconstructed samples under consolidated undrained (CU) levels. Fresh and aged samples reconstitution adhered to the guidelines outlined in ASTM D4767. Following the reconstitution of the specimens and the setup of the apparatus, saturation of the specimen was achieved by ASTM D4767-95. The attainment of B values greater than 0.9 confirmed the saturation of the sample. During the consolidation stage, the cell pressure was incrementally raised. The initiation of the consolidation phase occurred upon opening the drainage valve, marking the point when the sample had attained the required confining stress. This consolidation phase persisted until volume variation ceased, and any excess pore water pressure completely dissipated, reaching equilibrium with the back pressure. Notably, due to the low permeability of MSW, the consolidation stage extended beyond 10 h. Following the consolidation period, the drainage valve was closed to fulfill the requirements of consolidated undrained conditions for subsequent cyclic stress applications. The primary phases of reconstructing the samples involved: (a) separation of the MSW samples, (b) installation of the membrane and mold along with the placement of the paper filter and upper cap, and (c) placement of the samples (Fig. 3).
Methodology and data description
Choosing between black-box, grey-box, and white-box ML models often hinges on the trade-off between the accuracy and interpretability, which is crucial in the fields like engineering and environmental science. Black-box models like ensemble learners typically provide high predictive accuracy by capturing complex nonlinear relationships within large, high-dimensional datasets. However, their opaque decision-making processes limit their interpretability39. In contrast, grey-box models offer a middle ground by incorporating both theoretical knowledge and data-driven elements, enhancing transparency and user trust without completely sacrificing performance. Meanwhile, white-box models like linear regression and decision trees prioritize transparency, with straightforward, understandable decision pathways that make them ideal for educational purposes and settings where understanding the causality behind predictions is paramount39. However, their simplicity can be a drawback when facing complex interactions that require more nuanced or intricate modeling. Ultimately, the choice among these model types should be guided by the specific requirements and constraints of the application at hand, weighing the need for accuracy against the importance of transparency and the ability to understand and trust the underlying mechanisms driving predictions39. In the following, we introduce the models implemented in this study in more detail and compare their mechanisms for modeling more accurately.
AdaBoost
The principle behind the AdaBoost method is to appropriately integrate numerous weak classifiers to create a strong one40. AdaBoost employs iteration and trains just a single weak classifier for each cycle. The learned weak classifier will be included in the following iteration. As a result, after the Nth iteration, there will be the entire N weak classifiers, N−1 of which has already been trained, and its different variables will remain unchanged. Currently, the Nth classifier is being trained. The link among the weak classifiers is that the Nth weak classifier is more likely to categorize the data than the initial N−1 weak classifiers that do not classify. The ultimate categorization result is determined by the total impact of the N classifiers. Figure 4 provides an overview of the AdaBoost model.
GBRT
GBRT is an example of integrated learning boosting that employs a forward-distributing technique41. This technique seeks to minimize the loss function by picking a suitable tree function according to the present theory and fitting function; hence, GBRT comprises two sections: the gradient boosting and regression tree. A regression tree is utilized to forecast the actual amount. Gradient boosting iterates across many trees to jointly determine the outcome. Each tree is the culmination and remnant of the previously learned trees. Figure 5 provides an overview of the GBRT model.
XGBoost
XGBoost is a popular software package that utilizes gradient boosting methods for supervised education. Its excellent performance, versatility, and capacity make it a popular choice for ML contests and commercial use42. XGBoost supports a wide range of data formats, such as numerical and qualitative characteristics, and it contains sophisticated functions like early ending and tree removal to increase model generalization and training rate43. The XGBoost flexibility and resilience provide an excellent choice for organized and formatted data evaluation. XGBoost deploys a mixed training approach in which fundamental designs are included in a forward-step fashion. The XGBoost algorithm is defined by Fig. 6.
RF
Svetnik introduced the model of RF regression as an enhanced regression approach with classification capabilities44,45. RF comprises several decision trees, therefore, there is no linkage among them. By conducting classification work with a new study sample, each decision tree within the forest is evaluated and categorized independently. Following that, every decision tree will receive a categorization result (Fig. 7). The decision tree with the most classification outcomes will be the outcome46.
Cubist
Using Quinlan’s M5 model tree47, Cubist as a non-parametric ML model48 constructs predictive models with the capability of handling up to thousands of input variables49. Wang and Witten50 improved the M5 tree with better performance. In a Cubist model, predictions are created with nodes different from a regression tree. In a regression tree, each node is assigned a single value prediction, while in the Cubist model, each node is assigned a linear regression model prediction51. By combining two models linearly, Cubist models can balance their weights better. Multiple training committees and reinforcement are included in this approach52. Final predictions are produced by averaging predictions from the committees53. Several hyper-parameters (for instance number of rules, number of committees, number of neighbors, and exploration constant) must be optimized to improve the model performance. A smoothing process can determine the number of nodes. Furthermore, using boosting54, multiple ensemble models can be generated by generating several committees. Figure 8 shows the flow diagram of the Cubist model.
EOA
Faramarzi et al.20 were the first to provide a dynamic mass balance-inspired EOA. Utilizing multiple sources and sink processes, EOA methods determine the concentration of an element in a control volume. According to basic mechanics, the entrance, departure, and generation of mass are retained in the equation for mass equilibrium. A crucial component of traditional EOA is the iterative update of the beginning amounts of particles with specified numbers55. These solutions are extracted as the top possibilities to identify the equilibrium state regarded as the desired result. Based on an objective function, particle concentrations are assessed repeatedly to determine their states. The four best particles, arithmetic means, and the equilibrium pool are considered seriously by the EOA updating process as follows:
where Xnew and X represent the particle’s new and current concentration vectors, a random concentration vector called Xeq will be chosen from the equilibrium pool. F is an exponential term described in Eq. (2); G is the generation rate defined in Eq. (3), and λ is a random vector between 0 and 1.
Here, m is a random vector between 0 and 1, a1 and a2 are constants (a1 = 2 and a2 = 1), while T and Tmax are the present and maximum iterations. r1 and r2 are two distinct random variables, and GP is a specific value known as the generation probability (GP = 0.5). According to the three components in Eq. (1), the updating process of each concentration particle is renewed. In these latter two sections, the search area is effectively utilized, and a global search is carried out to identify the ideal outcome55. Figure 9 demonstrates the steps of the standard EOA.
Implementation process
Data collection
The database utilized in this study consists of 154 data records for waste D, which were obtained from sampling, making specimens, and performing dynamic tests in the laboratory as described above. Additionally, in this study, six variables, including the age of MSW (Age), frequency (F), percentage of plastic (PoP), confining pressure (CP), shear strain (ShS), and unit weight (UW), for D-MSW were considered as inputs. Figure 10 exhibits pairwise correlations between independent and dependent variables, including the frequency histogram of each variable. The graph displays a non-uniform distribution of outputs, and the factors are appropriately more frequent. High-frequency factors can be used for a better model56. A lower correlation is observed between positive and negative quantities in the general model variable, as indicated in Fig. 11. Also, each of the selected variables adversely affects D. The correlation heatmap analysis reveals the greater dependence of D on shear strain in the MSW database. Meanwhile, the percentage of plastic has the lowest correlation with output in D databases.
Modeling began with randomly dividing the data into training and testing subsets. To accomplish this, 70% of the data were randomly chosen as train data and 30% as testing data. Table 2 presents the statistical characteristics of output and input variables for test and train data for D-MSW. According to Table 3, the value of PoP is between 0 and 0.274 percent, Age varies between 0 and 16, CP falls between 75 and 150, F has two values of 0.1 and 1, UW varies from 9 to 12, and ShS ranges from 0.075 to 3.63 for D-MSW. Also, the measured D in this database changes in the range of 5.89 and 37.45 percent for training and the range of 3.89 and 24.21 percent for testing. Besides, various ranges of output changes in each output depict that the databases are diverse, and the models developed over them are generalizable57.
Model performance evaluation criteria
Selecting appropriate error measures like R2, RMSE, MAE, MAPE, objective function (OBJ), and a20-index for evaluating the accuracy and reliability of models in this study was driven by their specific relevance to the objectives and data characteristics22,58. The R2 metric, or coefficient of determination, is crucial as it determines the proportion of variance in the dependent variable that is predictable from the independent variables; an R2 value close to 1 is ideal, indicating a model that accurately predicts the observed outcomes, whereas a value near zero suggests the model fails to predict the data effectively. RMSE, provides insights into the average model prediction errors with sensitivity to outliers, meaning it measures the magnitude of errors with more significant discrepancies having a greater impact on the RMSE value59. An ideal RMSE value is as low as possible (zero), exhibiting more minor prediction errors. MAE, measures the average magnitude of the errors without considering their direction, being less sensitive to outliers compared to RMSE and thus offering a straightforward average of absolute differences between predicted and actual values. Lower MAE values signify the improved model performance, with the ideal value being as close to zero as possible. MAPE, expresses prediction errors as a percentage of actual values, making it particularly useful for comparisons across different scales or units. Lower MAPE values (ideal being close to zero) represent more accurate models. The OBJ function combines several statistical measures to provide a comprehensive summary of model errors, encompassing components like bias, variance, and overall fit. An ideal OBJ value is minimized, reflecting balanced and low error metrics22. The a20-index offers a unique perspective by focusing on the accuracy within the 20th percentile of predictions. It is crucial for scenarios where smaller prediction errors are critical, as it highlights the model’s performance in accurately predicting the lower range of data values. A higher a20-index value is ideal, demonstrating better performance in this context23. Each of these metrics was chosen to ensure a holistic evaluation of the model’s performance, directly supporting the study’s aims to develop reliable predictive models for real-world applications, thereby providing a comprehensive assessment of the model’s accuracy, robustness, and practical utility. The performance statistical metrics are illustrated as follows:
When N is the number of records, Ypre and Yobs present the predicted and actual values, and the bar items over the parameters show the average rate; The variable m20 displays the quantity of the records where the Yobs/Ypre ratio ranges from 0.80 to 1.20; the terms "tst" applied for testing and "tr" applied for training data, accordingly.
Proposed hybrid EL models
The suggested ML models in the present research were all constructed in Python. EOA was employed in this study to determine the ideal values. They utilized the given criteria to set the initial random values to accomplish this (Table 4). After these statistics were entered into EL approaches and the EL algorithms were trained using the training dataset, the target function was determined to be the average RMSE of both data (test and train). The meta-parameter volume and objective function quantities are sent into EOA to optimize. Figure 12 provides a summary of the various EL approaches. Table 5 lists the optimal values of numerous meta-parameters using different approaches for the D-MSW model.
Evaluation of model prediction performance
As depicted in Fig. 13, during the test and train phases, a higher R2 value and a lower degree of scattering points mean that the GBRT-EOA model outperforms other D-MSW models. Also, the low RMSE and MAE values of the introduced XGBoost-EOA model demonstrate its high accuracy and reliability in both training and testing processes with a slight difference compared to GBRT-EOA. The A20-index is a newly developed physical engineering metric that measures the number of specimens in which estimated quantities deviate by at most 20% from observed values60. An a20-index = 0.9252 in the training phase and a20-index = 0.8913 in the testing phase make GBRT-EOA the top performer in predicting D-MSW.
Comparing the performance of various models on testing and training datasets reveals that EOA generally enhances the model accuracy (Table 6). For instance, GBRT-EOA exhibits the best performance on the testing set, with the highest R2 value (0.8975) and the lowest error metrics (MAPE = 0.0954, MAE = 1.1939, and RMSE = 1.6587), significantly outperforming its default counterpart (R2 = 0.8281, MAPE = 0.14461, MAE = 1.7976, and RMSE = 2.4059). Similarly, XGBoost-EOA and RF-EOA indicate substantial improvements in testing accuracy compared to their default versions, reflected in higher R2 values and lower error rates. Notably, XGBoost-Default achieves near-perfect performance on training data (R2 = 0.9996, MAPE = 0.0053, MAE = 0.0804, and RMSE = 0.1168), but its testing performance (R2 = 0.6814, MAPE = 0.1597, MAE = 2.1399, and RMSE = 3.4910) suggests overfitting. AdaBoost-EOA also demonstrates significant improvement, with its testing R2 increasing from 0.6998 to 0.8437 and error metrics decreasing across the board compared to the default version. On the other hand, the Cubist model display less pronounced gains with EOA, revealing that this algorithm might be less responsive to enhanced optimization in this context. Overall, applying EOA leads to more reliable and accurate models, with GBRT-EOA standing out as the most robust, balancing high training accuracy with excellent generalization to unseen data. This comparison underscores the importance of optimization algorithms in refining model performance, particularly in complex prediction tasks.
Utilizing OBJ also makes it feasible to combine several statistical indicators for testing and training to evaluate a model’s performance61. High values of OBJ refer to the poor performance of a model compared to others62. According to Fig. 14, the GBRT-EOA model performs best with an OBJ of 1 for D-MSW. Also, regarding the OBJ values of 2.16% and 1.15%, the Cubist-EOA models are the worst performers for D-MSW.
As a further measure of the validity of the models, the Nash–Sutcliffe efficiency (NSE) coefficient, as well as the scatter index (SI), were used:
In which the bar items over the associated digits represent the average of each value, N designates the records number, Yobs refers to the observed, and Ypre denotes anticipated quantities. A model’s predictive accuracy can be considered excellent if the NSE value exceeds 0.75 or the SI value is less than 0.1, and good if the NSE value is between 0.65 and 0.75 or the SI value is between 0.1 and 0.2. However, it is fair if the NSE value is between 0.5 and 0.65 or if the SI value is between 0.2 and 0.357,58. As seen in Fig. 15, practically all methods have the SI rates less than 0.2, indicating that they are excellent or good predictors of D-MSW. The results also reveal that all the methods had NSE rates greater than 0.75, exhibiting the EL–EOA models’ "excellent" accuracy in predicting output values.
The effectiveness of each design was evaluated utilizing the diagram of Taylor presented in Fig. 16. To compare the anticipated outcomes with the actual values, three statistical measures of RMSE, standard deviation (STD), and R2 were applied. The standard deviation is depicted through a circle connecting the plot’s axes of horizontal and vertical; RMSE is illustrated by the horizontal green dots, and the blue line suggests the values of R2. In this regard, the GBRT-EOA models have the best performance among all the models for D-MSW. As shown in the figure, the furthest state belongs to the Cubist-EOA for D-MSW.
Figure 17 displays the absolute error boxes of all the models for the test and train datasets. GBRT-EOA has the least interquartile ranges in training phases for D-MSW, demonstrating this model’s outstanding capability. Furthermore, the median absolute error quantities of GBRT-EOA methods are 0.88 in the test stage for D-MSW, which indicates the decisive superiority of these models compared to others.
Comparison of K-fold cross-validation and hold-out methods
In studies with relatively small datasets, like the current one, data were typically split into training and testing sets to control overfitting and enhance the prediction performance of the ML models. External validation, which uses datasets from various published sources, can be beneficial but was unavailable due to the novel nature of this research. Consequently, K-fold cross-validation (CV) is a viable alternative for assessing the model performance. This method randomly divides the training set into K equal-sized subsets, using K − 1 subsets for training and the remaining subset for validation. Here, a fivefold CV approach was implemented, allocating 70% of the initial dataset as the training set and 30% as the test set. The optimum hyper-parameters for the best performing model (GBRT), were determined using EOA to achieve the lowest average RMSE over the five training folds. Subsequently, these hyper-parameters were applied to train the GBRT model for D-MSW on the entire training dataset, and the model’s efficacy was evaluated on the testing dataset. Table 7 compares the performance of GBRT models utilizing both hold-out and K-fold CV methods, noting that RMSE and MAE in K-fold CV are based on normalized data. The optimal hyper-parameters derived from the K-fold CV for D-MSW are detailed in Table 8.
SHAP study
SHAP is a game-theoretic approach designed to describe the result of machine-learning methods63. SHAP presents the features’ contribution to the model’s output, offering a more interpretable and transparent understanding of the model’s decision-making process. In the ensuing sections, we thoroughly analyze the outcomes in the proposed predicting structure, designed to interpret and comprehend the results of the probabilistic predicting model. Our primary focus is examining how the developed model utilizes various features to make predictions. The SHAP method is employed for explanations, covering D-MSW. Figure 18 illustrates the average contribution of each feature, with each bar plot representing the importance of a specific property. Shear strain plays a significant role in the MSW (+ 0.21) model, contributing more substantially. Age and UW (total + 0.02) for MSW exhibit minimal impact on the output of the models.
Each dot in Fig. 19 represents a distinct predicting, and its location along the x-axis signifies the impact of that attribute on the output of the model. Furthermore, each dot’s color corresponds to a feature value (varies from blue to red), emphasizing the relative contributions of different feature values to the result. Long tails show highly significant characteristics. The dots’ vertical distribution suggests more findings with comparable effects. These SHAP summary graphs in such a setting include details on the number of reports that have those qualities as well as the size and direction of each feature’s effect. For example, elevated shear strain values (in MSW models) tend to elevate the model output, while values closer to zero for shear strain led to decreased model output (for MSW model). Therefore, the impact on the model output becomes more substantial with higher shear strain values in MSW.
In the following, the function of variations for each output is presented in Fig. 20. The ranges of the SHAP values, which refer to the impact on each model target, are depicted by various colors ranging from blue to red. For instance, the function of MSW variations is increasing and decreasing at lower instances which is highly dependent on the ShS and PoP values.
Online application of proposed D-MSW models
Models developed using EL-FOA methods differ from classical regression methods as they do not simply relate inputs and outputs64. In this regard, implementing a Python-based online application makes it possible for researchers and practicing engineers, the end users of the proposed D-MSW models, to estimate values of dynamic properties easily. In the past, several researchers have developed software using MATLAB graphical user interface (GUI) to predict the properties of different materials65,66. There are several advantages to the developed online application: (i) results are available more quickly, and it provides a standard for an in-depth investigation of mix designs; (ii) reducing production costs while ensuring the safety and quality concerns as well, it allows to determine if a mix design is reasonable; and (iii) in addition to being easy to use, it will reduce human errors in calculations67. Free access to this online application is also provided for all researchers68. This application enables engineers and researchers to obtain a relatively accurate prediction of D-MSW at their project site in simple steps, as demonstrated in Fig. 20. To clarify, assume that PoP, Age, CP, F, UW, and ShS, as highlighted in Fig. 21, are equal to 0.12%, 0 year, 75 kPa, 0.1 Hz, 9 kPa, and 0.74%, respectively. First, the user simply needs to click on the run code button in the left corner. To continue the predicting process, the desired model type is chosen in the second step. Next, the assumed input values are entered in the embedded boxes next to each parameter. Users are guided by the max and min values of each input variable. These values are included to prevent errors in prediction since the dataset that trained the models was in the same range. Upon clicking the predict button, the result is displayed. The user can then view a new result and compare it with the previous results by changing the model type and inputs.
Summary and conclusions
Landfill stabilization under seismic loading is dependent on the components, including MSW. To achieve sustainable design and development of landfills, D is regarded as a crucial characteristic that should be assessed. A better waste management procedure may be established by determining the factors influencing the seismic performance of landfills. In this study, a novel dataset derived from laboratory experiments was utilized to develop predictive models for the damping response of landfills, employing innovative AI techniques. For this reason, 153 cyclic triaxial tests were performed on MSW. Following that, five AI-based methods including AdaBoost-EOA, GBRT-EOA, XGBoost-EOA, RF-EOA, and Cubist-EOA were adopted to formulate the produced laboratory damping data. A summary of the study’s findings is provided below:
-
The correlation heatmap analysis revealed that D-MSW in datasets was more dependent on shear strain (+ 0.62). Meanwhile, the vertical load (− 0.41) and percentage of plastic (− 0.33) had the most minor correlations with the target in the D-MSW datasets, respectively.
-
The investigation of the performance parameters and comparison of various EL models indicated that the GBRT-EOA models presented better performance than the other methods for D-MSW. The OBJ value (which exhibits the model’s performance) of the aforementioned best model (GBRT-EOA) was 1.6%. Also, regarding the OBJ value of 2.16%, the Cubist-EOA model was the worst performer for D-MSW.
-
The SHAP examination of the input factors demonstrated that a significant share of D-MSW came from shear strain changes. However, the unit weight for D-MSW (0) had the most negligible impact on the modeling procedure.
Data availability
Data associated with the present study will be available on request from the corresponding authors.
Abbreviations
- R2 :
-
Coefficient of determination
- AdaBoost:
-
Adaptive boosting
- AI:
-
Artificial intelligence
- ANFIS:
-
Adaptive neuro-fuzzy inference system
- ANN:
-
Artificial neural network
- BPNN:
-
Back-propagation neural network
- CP:
-
Confining pressure
- CRRF:
-
Classification and regression random forest
- D:
-
Damping
- EOA:
-
Equilibrium optimizer algorithm
- F:
-
Frequency
- FNN:
-
Feed forward neural network
- G:
-
Shear modulus
- GBRT:
-
Gradient boosting regression tree
- GEP:
-
Gene expression programming
- LSSVM:
-
Least square support vector machine
- LSTM:
-
Long short-term memory
- MAE:
-
Mean absolute error
- MAPE:
-
Mean absolute percentage error
- MARS:
-
Multivariate adaptive regression spline
- ML:
-
Machine learning
- MLP:
-
Multilayer perceptron
- MRM:
-
Multiple regression analysis method
- NSE:
-
Nash–Sutcliffe efficiency
- PoP:
-
Percentage of plastic
- RF:
-
Random forest
- RMSE:
-
Root mean square error
- ShS:
-
Shear strain
- SI:
-
Scatter index
- STD:
-
Standard deviation
- SVM:
-
Support vector machine
- SVR:
-
Support vector regression
- USCS:
-
Unified soil classification system
- UW:
-
Unit weight
- VL:
-
Vertical load
- XGBoost:
-
Extreme gradient boosting
References
Kaza, S., Yao, L., Bhada-Tata, P. & Van Woerden, F. What a Waste 20: A Global Snapshot of Solid Waste Management to 2050 (World Bank Publications, 2018).
Alidoust, P., Kargar, P., Goodarzi, S., Keramati, M. & Moradi Moqaddam, H. Laboratory-based assessment on similarities between dynamic behavior of MSW and clay. J. Mater. Cycles Waste Manage. 23, 622–643 (2021).
Keramati, M., Goodarzi, S., Moradi Moghadam, H. & Ramesh, A. Evaluating the stress–strain behavior of MSW with landfill aging. Int. J. Environ. Sci. Technol. 16, 6885–6894 (2019).
Karimpour-Fard, M. et al. Laboratory study on dynamic properties of municipal solid waste in Saravan Landfill, Iran. Int. J. Civ. Eng. 19, 861–879 (2021).
Rawat, P. & Mohanty, S. Parametric study on dynamic characterization of municipal solid waste fine fractions for geotechnical purpose. J. Hazard. Toxic Radioact. Waste 26(1), 04021047 (2022).
Akbarimehr, D. & Fakharian, K. Dynamic shear modulus and damping ratio of clay mixed with waste rubber using cyclic triaxial apparatus. Soil Dyn. Earthq. Eng. 140, 106435 (2021).
Patiño, H. & Galindo, R. Damping of a soft marine clay subjected to the combination of monotonic and cyclic shear stresses. Eur. J. Environ. Civ. Eng. 27(1), 72–95 (2023).
Abdellaziz, M. et al. Shear modulus and hysteretic damping of sensitive eastern Canada clays. Can. Geotech. J. 58(8), 1118–1134 (2021).
Alidoust, P., Keramati, M. & Shariatmadari, N. Laboratory studies on effect of fiber content on dynamic characteristics of municipal solid waste. Waste Manage. 76, 126–137 (2018).
Amlashi, A. T., Abdollahi, S. M., Goodarzi, S. & Ghanizadeh, A. R. Soft computing based formulations for slump, compressive strength, and elastic modulus of bentonite plastic concrete. J. Clean. Prod. 230, 1197–1216 (2019).
Bagheri, M., Esfilar, R., Golchi, M. S. & Kennedy, C. A. A comparative data mining approach for the prediction of energy recovery potential from various municipal solid waste. Renew. Sustain. Energy Rev. 116, 109423 (2019).
Liang, R. et al. Interpretable machine learning assisted spectroscopy for fast characterization of biomass and waste. Waste Manage. 160, 90–100 (2023).
Khatti, J. & Grover, K. S. Assessment of fine-grained soil compaction parameters using advanced soft computing techniques. Arab. J. Geosci. 16(3), 208 (2023).
Zhang, Y., Liu, J. & Shen, W. A review of ensemble learning algorithms used in remote sensing applications. Appl. Sci. 12(17), 8654 (2022).
Chen, C. H., Tanaka, K., Kotera, M. & Funatsu, K. Comparison and improvement of the predictability and interpretability with ensemble learning models in QSPR applications. J. Cheminform. 12, 1–16 (2020).
Moradi Moghaddam, H. et al. Shear modulus prediction of landfill components using novel machine learners hybridized with forensic-based investigation optimization. Constr. Build. Mater. 411, 134443 (2024).
Ahmad, M. et al. Extreme gradient boosting algorithm for predicting shear strengths of rockfill materials. Complexity 2022, 1–11 (2022).
Choudhury, D. & Savoikar, P. Equivalent-linear seismic analyses of MSW landfills using DEEPSOIL. Eng. Geol. 107(3–4), 98–108 (2009).
Keramati, M., Moradi Moqhaddam, H., Mozaffari, O. & Fayeghi, M. Evaluation of the effects of aging and different site conditions on the seismic response of municipal solid waste (A Case of Kahrizak Landfill). AUT J. Civ. Eng. 5(3), 377–388 (2021).
Faramarzi, A., Heidarinejad, M., Stephens, B. & Mirjalili, S. Equilibrium optimizer: A novel optimization algorithm. Knowl.-Based Syst. 191, 105190 (2020).
Elgamal, Z. M. et al. Improved equilibrium optimization algorithm using elite opposition-based learning and new local search strategy for feature selection in medical datasets. Computation 9(6), 68 (2021).
Ghanizadeh, A. R. et al. A comparison of novel hybrid ensemble learners to predict the compressive strength of green engineering materials: a case of concrete composed of rice husk ash. Eur. J. Environ. Civ. Eng. https://doi.org/10.1080/19648189.2024.2335343 (2024).
Ghanizadeh, A. R. et al. Intelligent prediction of asphalt concrete air voids during service life using cubist and GBRT-ensemble learning approaches hybridized with an equilibrium optimizer algorithm. J. Mater. Civ. Eng. 36(5), 04024098 (2024).
Alidoust, P. et al. Prediction of the shear modulus of municipal solid waste (MSW): An application of machine learning techniques. J. Clean. Prod. 303, 127053 (2021).
Gatto, M. P. A. & Montrasio, L. Artificial Neural Network model to predict the dynamic properties of sand-polyurethane composite materials for GSI applications. Soil Dyn. Earthq. Eng. 172, 108032 (2023).
Wu, Q., Wang, Z., Qin, Y. & Yang, W. Intelligent model for dynamic shear modulus and damping ratio of undisturbed marine clay based on Back-Propagation neural network. J. Mar. Sci. Eng. 11(2), 249 (2023).
Baghbani, A., Choudhury, T., Samui, P. & Costa, S. Prediction of secant shear modulus and damping ratio for an extremely dilative silica sand based on machine learning techniques. Soil Dyn. Earthq. Eng. 165, 107708 (2023).
Baghbani, A., Costa, S., Faradonbeh, R. S., Soltani, A. & Baghbani, H. Modeling the effects of particle shape on damping ratio of dry sand by simple shear testing and artificial intelligence. Appl. Sci. 13(7), 4363 (2023).
Pasha, S. M. K., Hazarika, H. & Yoshimoto, N. Estimating dynamic characteristics of gravel-tire chips mixtures using artificial intelligence techniques. J. Scoc. Mater. Sci. (材料) 69(1), 1–8 (2020).
Keshavarz, A. & Mehramiri, M. New Gene Expression Programming models for normalized shear modulus and damping ratio of sands. Eng. Appl. Artif. Intell. 45, 464–472 (2015).
Javdanian, H., Jafarian, Y. & Haddad, A. Predicting damping ratio of fine-grained soils using soft computing methodology. Arab. J. Geosci. 8, 3959–3969 (2015).
Edincliler, A., Cabalar, A. F. & Cevik, A. Modelling dynamic behaviour of sand–waste tires mixtures using Neural Networks and Neuro-Fuzzy. Eur. J. Environ. Civ. Eng. 17(8), 720–741 (2013).
Samui, P. & Kothari, D. P. A multivariate adaptive regression spline approach for prediction of maximum shear modulus and minimum damping ratio. Eng. J. 16(5), 69–78 (2012).
Cevik, A. & Cabalar, A. F. Modelling damping ratio and shear modulus of sand–mica mixtures using genetic programming. Expert Syst. Appl. 36(4), 7749–7757 (2009).
Cabalar, A. F. & Cevik, A. Modelling damping ratio and shear modulus of sand–mica mixtures using neural networks. Eng. Geol. 104(1–2), 31–40 (2009).
Akbulut, S., Hasiloglu, A. S. & Pamukcu, S. Data generation for shear modulus and damping ratio in reinforced sands using adaptive neuro-fuzzy inference system. Soil Dyn. Earthq. Eng. 24(11), 805–814 (2004).
Keramati, M., Moghaddam, H. M. & Ramesh, A. Prediction of the stress-strain behavior of MSW materials using Hyperbolic model and Evolutionary Polynomial Regression (EPR). Amirkabir J. Civ. Eng. 51(4), 793–804 (2019).
Zekkos, D. P. et al. Framework for the estimation of MSW unit weight profile. Proc., 10th Int. Waste Management and Landfill Symp.
Pintelas, E., Livieris, I. E. & Pintelas, P. A grey-box ensemble model exploiting black-box accuracy and white-box intrinsic interpretability. Algorithms 13(1), 17 (2020).
Schapire, R. E. Explaining adaboost. Empirical Inference: Festschrift in Honor of Vladimir N. Vapnik 37–52 (2013).
Prettenhofer, P. & Louppe, G. Gradient boosted regression trees in scikit-learn. PyData (2014).
Chen, T. et al. Xgboost: extreme gradient boosting. R Package Version 1(4), 1–4 (2015).
Zhao, X. Y., Chen, J. X., Chen, G. M., Xu, J. J. & Zhang, L. W. Prediction of ultimate condition of FRP-confined recycled aggregate concrete using a hybrid boosting model enriched with tabular generative adversarial networks. Thin Walled Struct. 182, 110318 (2023).
Svetnik, V. et al. Random forest: a classification and regression tool for compound classification and QSAR modeling. J. Chem. Inf. Comput. Sci. 43(6), 1947–1958 (2003).
Zhang, J., Ma, G., Huang, Y., Aslani, F. & Nener, B. Modelling uniaxial compressive strength of lightweight self-compacting concrete using random forest regression. Constr. Build. Mater. 210, 713–719 (2019).
Breiman, L. Random forests. Mach. Learn. 45, 5–32 (2001).
Quinlan, J. R. Learning with Continuous Classes. In 5th Australian Joint Conference on Artificial Intelligence (World Scientific, 1992).
Houborg, R. & McCabe, M. F. A hybrid training approach for leaf area index estimation via Cubist and random forests machine-learning. ISPRS J. Photogramm. Remote Sens. 135, 173–188 (2018).
Rulequest. Data Mining with Cubist (2016).
Wang, Y. & Witten, I. H. Induction of Model Trees for Predicting Continuous Classes (1996).
Quinlan, J. R. Combining instance-based and model-based learning. Proceedings of the Tenth International Conference on Machine Learning (1993).
Zhou, J. et al. Random forests and cubist algorithms for predicting shear strengths of rockfill materials. Appl. Sci. 9(8), 1621 (2019).
Xu, Y. et al. Evaluation of machine learning techniques with multiple remote sensing datasets in estimating monthly concentrations of ground-level PM2. 5. Environ. Pollut. 242, 1417–1426 (2018).
Lee, D. G. & Ahn, K. H. “A stacking ensemble model for hydrological post-processing to improve streamflow forecasts at medium-range timescales over South Korea. J. Hydrol. 600, 126681 (2021).
Shaheen, A. M., Elsayed, A. M., El-Sehiemy, R. A. & Abdelaziz, A. Y. Equilibrium optimization algorithm for network reconfiguration and distributed generation allocation in power systems. Appl. Soft Comput. 98, 106867 (2021).
Iqbal, M. F. et al. Prediction of mechanical properties of green concrete incorporating waste foundry sand based on gene expression programming. J. Hazard. Mater. 384, 121322 (2020).
Tavana Amlashi, A., Mohammadi Golafshani, E., Ebrahimi, S. A. & Behnood, A. Estimation of the compressive strength of green concretes containing rice husk ash: a comparison of different machine learning approaches. Eur. J. Environ. Civ. Eng. 27(2), 961–983 (2022).
Alidoust, P., Goodarzi, S., Tavana Amlashi, A. & Sadowski, Ł. Comparative analysis of soft computing techniques in predicting the compressive and tensile strength of seashell containing concrete. Eur. J. Environ. Civ. Eng. 27, 1853–1875 (2022).
Sadaghat, B. et al. Evaluating strength properties of Eco-friendly Seashell-Containing Concrete: Comparative analysis of hybrid and ensemble boosting methods based on environmental effects of seashell usage. Eng. Appl. Artif. Intell. 133, 108388 (2024).
Asteris, P. G., Ashrafian, A. & Rezaie-Balf, M. Prediction of the compressive strength of self-compacting concrete using surrogate models. Comput. Concr. 24(2), 137–150 (2019).
Golafshani, E. M. & Behnood, A. Predicting the mechanical properties of sustainable concrete containing waste foundry sand using multi-objective ANN approach. Constr. Build. Mater. 291, 123314 (2021).
Ashrafian, A., Gandomi, A. H., Rezaie-Balf, M. & Emadi, M. An evolutionary approach to formulate the compressive strength of roller compacted concrete pavement. Measurement 152, 107309 (2020).
Lundberg, S. M. & Lee, S. I. A unified approach to interpreting model predictions. Adv. Neural Inf. Process. Syst. 30, 4768–4777 (2017).
Heidarabadizadeh, N., Ghanizadeh, A. R. & Behnood, A. Prediction of the resilient modulus of non-cohesive subgrade soils and unbound subbase materials using a hybrid support vector machine method and colliding bodies optimization algorithm. Constr. Build. Mater. 275, 122140 (2021).
Golafshani, E. M. & Behnood, A. Application of soft computing methods for predicting the elastic modulus of recycled aggregate concrete. J. Clean. Prod. 176, 1163–1176 (2018).
Golafshani, E. M. & Behnood, A. Estimating the optimal mix design of silica fume concrete using biogeography-based programming. Cement Concret. Compos. 96, 95–105 (2019).
Chou, J. S., Chen, L. Y. & Liu, C. Y. Forensic-based investigation-optimized extreme gradient boosting system for predicting compressive strength of ready-mixed concrete. J. Comput. Des. Eng. 10(1), 425–445 (2023).
D-MSW Web Application https://colab.research.google.com/drive/15aMqMcLCk_288jApPgjvRikd2S7bGZVP?usp=sharing (2024).
Funding
Open access funding provided by University of Gävle.
Author information
Authors and Affiliations
Contributions
Hossein Moradi Moghaddam: conceptualization, methodology, investigation, validation, writing—original draft, writing—review and editing. Mohsen Keramati: conceptualization, methodology, investigation, validation, writing—original draft, writing—review and editing. Alireza Bahrami: conceptualization, methodology, investigation, validation, formal analysis, resources, writing—original draft, writing—review and editing. Ali Reza Ghanizadeh: conceptualization, validation; Amir Tavana Amlashi: investigation, validation. Haytham F. Isleem: investigation, formal analysis. Mohsen Navazani: investigation, Samer Dessouky: validation, formal analysis. All authors have read and agreed to the published version of the manuscript.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Moradi Moghaddam, H., Keramati, M., Bahrami, A. et al. Application of hybridized ensemble learning and equilibrium optimization in estimating damping ratios of municipal solid waste. Sci Rep 14, 17584 (2024). https://doi.org/10.1038/s41598-024-67381-3
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-024-67381-3
Keywords
This article is cited by
-
Evaporation forecasting using different machine learning models in Beni Haroun Dam, Algeria
Theoretical and Applied Climatology (2025)