Introduction

Breast cancer is the most common malignant tumor currently affecting women's health worldwide [1]. Establishing a predictive model for patients with breast cancer can provide a crucial reference for individualized clinical diagnosis and treatment decisions, such as the determination of surgical approaches and the development of adjuvant treatment plans. A nomogram is a visual predictive tool. It quantifies the risk of clinical events on the basis of various risk factors and generates numerical probabilities of clinical events [2]. Nomograms are among the most common forms used in studies of breast cancer LNM/prognosis prediction models.

Sentinel lymph node biopsy (SLNB), a landmark advance in breast surgery in the 1990s, has become the most common and important method for assessing axillary lymph node (ALN) status in patients with cN0 breast cancer, following refinements from trials such as NSABP-32, MILAN, and ALMANAC [3,4,5,6]. The results of SLNB often guide the decision to perform axillary lymph node dissection (ALND) [7,8,9,10]. Compared with ALND, SLNB has avoided many unnecessary axillary injuries in patients and has made significant contributions to reducing postoperative pain, arm swelling, activity impairment, muscle weakness, and other complications. However, studies have shown that SLNB, which is still a relatively invasive clinical procedure, has few but persistent complications, affecting the quality of life of some postoperative patients [11, 12]. Therefore, other potentially wider-reaching lymph node biopsy methods, such as targeted axillary dissection (TAD) [13, 14] and marking the axillary node with a radioactive iodine seed (MARI) [15], are being increasingly considered. Research suggests that these more targeted axillary lymph node assessment methods may further reduce surgical complications without compromising accurate ALN evaluation under specific conditions [16].

In fact, the evolving landscape of systemic treatment for breast cancer has also significantly impacted the surgical approaches to ALN assessment. In the pre-targeted therapy era, ALND was deemed essential not only for staging but also for local control, as adjuvant chemotherapy and endocrine therapy provided limited systemic protection [17]. The advent of HER2-directed therapies (e.g., trastuzumab) in the early 2000s marked a paradigm shift, enabling systemic eradication of micrometastases and reducing reliance on extensive nodal surgery [18]. More recently, immunotherapy (e.g., pembrolizumab in triple-negative breast cancer) and antibody–drug conjugates (e.g., trastuzumab deruxtecan) have further diminished the prognostic weight of nodal status by achieving unprecedented pathological complete response rates and metastatic burden reduction [19,20,21]. These advancements underscore a critical trend: As systemic therapies improve, the necessity of invasive nodal staging diminishes for select patients. For instance, some prospective randomized trials are exploring the possibility of omitting SLNB in patients with early breast cancer [22, 23]. The SOUND trial [22] is one of the important studies that has received widespread attention. The results revealed that there were no statistically significant differences in distant disease-free survival, DFS, OS, the axillary metastasis rate, or the distant metastasis rate between SLNB group and no axillary surgery group. Therefore, the conclusion was reached that "omitting axillary surgery is not inferior to SLNB for specific conditions of breast cancer patients." In the future, the publication of more related research results, including the BOOG 2013–08 trial [24], will provide more medical evidence for omitting SLNB.

Materials and methods

Database

We conducted research on the SEER database. The SEER database deleted all the breast cancer lymph node surgery information in 2011, lacked readily available SLNB/ALND information. To address this issue in the SEER database, we utilized the research findings of Bilimoria et al. [25]. This study extracted patient information from the National Cancer Database (NCDB) for 97,314 patients in the SLNB-alone group and the SLNB-with-completion ALND group. The median number of lymph nodes examined was 3 (interquartile range, 25) for the former group and 13 (918) for the latter. On the basis of these data, if the number of examined lymph nodes was ≤ 5, the patient was considered to have undergone SLNB alone. Many other studies have also used this conclusion as a reference [26,27,28].

Patients

Our retrospective analysis was conducted on 117,895 breast cancer patients from the SEER (Stat 8.4.2) database between January 2010 and December 2015. The inclusion criteria were as follows: (1) 1–5 lymph node examinations; (2) primary breast cancer; and (3) American Joint Committee on Cancer (AJCC) 7th edition T stage T1-T3 breast cancer. The exclusion criteria were as follows: (1) bilateral breast cancer; (2) neoadjuvant therapy; (3) histological grade IV; (4) AJCC 7th M stage cM0(i +); and (5) incomplete clinicopathological data. The lymph node status was determined on the basis of the "Regional nodes positive 1988" field in the SEER database.

We extracted the following clinicopathological information from the SEER database: age, race, sex, primary site, laterality, T stage, M stage, histological grade, pathological type, breast cancer subtype, ER status, PR status, human epidermal growth factor receptor 2 (HER2) status, and lymph node metastasis status.

Construction and validation of the nomogram

Using R Foundation, 4.2.2, patients were randomly assigned to the training and validation cohorts at a 3:1 ratio for the development and validation of a nomogram. We ultimately identified 9 significant predictive factors suitable for inclusion in the nomogram: age, race, primary site, T stage, M stage, histological grade, pathological type, ER status, and PR status. We assessed the sensitivity and specificity of the predictive model via the area under the receiver operating characteristic (ROC) curve (AUC), visualized the model calibration through 1000 bootstrap repetitions of the calibration curve, and evaluated the clinical applicability of the nomogram via decision curve analysis (DCA).

Statistical analyses

We used chi-square tests to organize detailed patient baseline information, summarizing the relationships between various influencing factors and sentinel lymph node status. We used univariate logistic regression analysis to verify the correlation, preliminarily excluding factors with no statistically significant impact on SLNM. Then, via multivariate logistic regression, we calculated odds ratios (ORs) to identify significant predictive factors independently associated with SLNM in our study population. ORs are presented with 95% CIs. We also performed Omnibus tests and Hosmer–Lemeshow tests to assess preliminary model goodness-of-fit.

Statistical analyses were performed via IBM SPSS (version 26.0) statistical software. All tests were two sided, and P < 0.05 was deemed significant. Random population grouping and the creation of a nomogram were performed via the R Foundation, 4.2.2 and other packages (car, rms, pROC, Hmisc, and rmda).

Results

Exploratory results of scientific grouping of pathology type

We first extracted the pathological type classification field "Site record–rare tumors" from the SEER database, which was not mentioned in previous studies. This field, which is based on the Surveillance of Rare Cancer in Europe (RARECARE), categorizes breast cancer into 7 main types (Table 1). We performed univariate analysis of each pathological classification. The results revealed that the SLNM rate for Paget's disease was the highest, at 19.4%, which was higher than those of invasive lobular carcinoma (ILC) and invasive ductal carcinoma (IDC). We decided to combine the "Site record-rare tumors" field and the commonly used "ICD-O-3 Hist/behav" field into a new classification method. While retaining the meaningful classification method of the former, we established additional screening criteria: 1) screening out clinically known special types of cancers that are extremely easy or difficult to metastasize through the lymphatic system and 2) screening out special pathological types with large-sample sizes and significantly different SLNM rates than established subgroups. Finally, the predictive factor "pathological type" in this study was divided into 7 subgroups: IDC, ILC, IDC mixed with ILC, mucinous carcinoma, Paget's disease, IMPC, and other types.

Table 1 The univariate analysis of the field "Site record–rare tumors" by lymph node status

Exploratory results of the scientific grouping of age

We grouped age by 5-year intervals. On the basis of the definition of the International Young Breast Cancer Expert Group (BCY) [29, 30], we divided the population under 40 years of age into one group. Owing to the small-sample size (469) for individuals ≥ 90 years old, we did not further subdivide this age group. On this basis, we created a trend graph (Fig. 1). The age of 80 years is a turning point where the curve trend changes significantly. A greater age but still < 80 years is correlated with a lower SLNM rate, which aligns with our clinical experience. However, in those over 80 years of age, the trend reversed, contradicting our clinical experience. Moreover, within the 40–79 age range, the maximum slope of the curve occurs at 60 years, indicating a significant difference between the 40–59 and 60–79 age groups. Therefore, this study divided patients into four groups according to age: < 40, 40–59, 60–79, and ≥ 80 years.

Fig. 1
figure 1

Trend graph of SLNM rate by age

Patient characteristics

From 20,102,015, this study ultimately included 117,895 breast cancer patients, who were randomly divided into a training cohort (88,422 patients) and a validation cohort (29,473 patients). The detailed clinicopathological characteristics are presented in Table 2.

Table 2 Clinicopathologic characteristics of the cohort by lymph node status

14,645 breast cancer patients (12.4%) had positive sentinel lymph nodes. The SLNM rate in patients ≥ 80 years old was 14.9%, which was higher than that in the 40–59 and 60–79 age groups and lower than the < 40-year-old group. Japanese patients had the lowest SLNM rate, followed by Chinese patients. "Other" races were similar to "White," while Black patients had the highest rate. Only 584 were male, and their SLNM rate was higher than that of females. The inner quadrant of the breast had a lower overall SLNM rate. The outer quadrants had higher rates, and the axillary tail showing even higher rates. The central part, including the nipple, had the highest percentage. The larger the tumor diameter is, the higher the SLNM rate. In addition, M1 patients had a significantly higher SLNM rate than did M0 patients. A lower tumor histological grade was associated with lower SLNM rates. Among pathology types, mucinous carcinoma had the lowest SLNM rate, IMPC had the highest rate. Among subtypes, triple-negative breast cancer (TNBC) had the lowest SLNM rate, followed by luminal A, HER2-enriched, and luminal B subtypes. ER-positive, PR-positive, and HER2-positive patients had a higher SLNM rate than did negative patients. Laterality was the only factor without a statistically significant difference, with similar rates for the left and right sides.

Univariate logistic regression analysis

The ORs of each subgroup obtained from the univariate logistic regression analysis (Table 3) were consistent with the ratio of SLNM rates between subgroups from the chi-square test, successfully validating the correlation between each influencing factor and SLNM. Laterality, which does not affect SLNM (P = 0.489), was first excluded. Age, race, sex, primary site, T stage, M stage, histological grade, pathological type, subtype, ER status, PR status, and HER2 status were statistically significantly predictive of positive lymph nodes (P < 0.001).

Table 3 The results of the univariate logistic regression analysis

Multivariate logistic regression analysis

The factors screened by univariate analysis were included in multivariate logistic regression analysis to further screen independent predictive factors associated with SLNM. The training cohort results (Table 4) revealed that sex and HER2 status had p values of 0.479 and 0.227, indicating that these factors are not independent predictors of SLNM and were excluded from the final predictive model. Moreover, we excluded breast cancer subtypes as confounding factors for ER status, PR status, and HER2 status. The remaining factors had p < 0.001 in the training cohort. Therefore, this study ultimately identified 9 independent predictive factors influencing SLNM: age, race, primary site, T stage, M stage, histological grade, pathological type, ER status, and PR status. Importantly, T2, T3, M1, and IMPC had significantly higher OR values than did the other factors, suggesting that T stage, M stage, and pathological type play more prominent roles in influencing SLNM rates. In addition, Omnibus tests and Hosmer–Lemeshow tests were performed to evaluate the model. The Hosmer–Lemeshow test performed well on both the training cohort (p = 0.172) and the total population (p = 0.192), indicating good model fit. The results of the Omnibus test (P < 0.001) showed that the entire model is effective and accurately predicts SLNM.

Table 4 Multivariate logistic regression analysis of possible variables in predicting SLNM in training cohorts

Construction and validation of the nomogram

On the basis of the results of multivariate analysis, we developed a nomogram (Fig. 2) to predict SLNM in patients with breast cancer. By summing the scores of each factor, we can predict the probability of SLNM in every specific patient. Overall, Japanese and Chinese patients with no distant metastasis and T1mic, grade 1, and mucinous tumors in the central position of the breast had a lower risk of SLNM, whereas Black patients with distant metastasis and T3, grade 2&3, and IMPC tumors were more likely to have SLNM.

Fig. 2
figure 2

Nomogram predicting the probability of SLNM

The AUCs for the training and validation cohorts (Fig. 3) were 0.711 (95% CI: 0.706–0.716) and 0.700 (95% CI: 0.691–0.708), respectively, indicating good discriminatory ability of the nomogram in differentiating sentinel lymph node status. Moreover, we performed 1000 bootstrap repetitions on the training and validation cohorts to obtain two calibration curves (Fig. 4). The calibration curves showed good agreement between the actual SLNM rate and the predicted rate, suggesting good calibration of the nomogram. The DCA results (Fig. 5) also showed that this nomogram is a reliable clinical tool for predicting SLNM in breast cancer patients.

Fig. 3
figure 3

ROC curves of the nomogram for the probability of SLNM. A training cohort; B validation cohort

Fig. 4
figure 4

Calibration curves of the nomogram for the probability of SLNM. A training cohort; B validation cohort

Fig. 5
figure 5

DCA of the nomogram for the probability of SLNM (bootstrap 1000 repetitions). A training cohort; B validation cohort

Discussion

In this study, we developed a predictive nomogram to assess the probability of SLNM in breast cancer patients. Nine significant predictive factors with independent effects were ultimately identified. ROC curves, calibration curves, and clinical decision analysis of both the training and validation cohorts demonstrated that this nomogram has high sensitivity and specificity in predicting SLNM in patients with breast cancer and is a reliable clinical tool.

Age is one of the key objects explored in this study. On the basis of practical clinical experience, older patients tend to have a lower SLNM rate. However, our study data revealed that this pattern only exists in patients under 80 years of age. In patients over 80 years old, a greater age was associated with a higher SLNM rate, which is a novel finding. Although we cannot enumerate all previous prediction model studies on the relationship between age and LNM, the results of all single-center studies reviewed thus far support this clinical experience [31,32,33,34,35,36,37]. We believe that this is related mainly to the absolute shortage of patients aged ≥ 80 years; for this reason, few studies have categorized the population aged ≥ 80 years as a separate group. In fact, according to the findings of our study, previous studies' age-grouping methods essentially mixed the lowest SLNM rate group (60–79 years old) with the ≥ 80-year-old group. This naturally diluted the unique characteristics of the ≥ 80 age group, indicating that previous research underestimates the importance of the ≥ 80 age group. Therefore, it is necessary to conduct more systematic research on this population to fully explore the authenticity of the newly discovered pattern and its underlying reasons. This will become one of our key future research directions. On the basis of the patient baseline information statistically derived from this study, we can preliminarily analyze the possible reasons for this new finding: 1. Patients ≥ 80 years old have characteristics such as relatively larger tumors, a greater proportion of distant metastasis, and lower tumor differentiation. 2. This may also be related to the fact that elderly patients are less likely to detect the disease promptly, and by the time they seek medical attention, breast cancer may have significantly progressed. We similarly explored the scientific grouping of pathological types and ultimately categorized them into 7 types. The results revealed that mucinous adenocarcinoma was the least likely to metastasize via sentinel lymph nodes. The characteristics of mucinous adenocarcinoma, including lower histological grade, better differentiation, and lower LNM rates, have also been confirmed in multiple studies [36, 38, 39]. Compared with the most common pathological type “IDC,” Paget's disease [40] and IMPC [31, 41] are considered to have a high tendency for LNM, requiring greater clinical vigilance to avoid overlooking such lymphatic progression. Moreover, we found that the pathological type of IDC mixed with ILC had a significantly higher SLNM rate than did pure IDC and pure ILC. No previous research has focused on a detailed study of this type, which will also be a key focus of our future research.

Regarding race, as most previous studies categorized it into White, Black, and others, we referenced Iqbal J et al.'s earlier study [42] on the basis of the SEER database and additionally screened Chinese and Japanese populations, whose sample sizes were large enough and exhibited significantly different LNM rates than did other races. The results revealed that the SLNM rates for Chinese and Japanese patients were significantly lower than those for Black and White patients. SLNM risk positively correlates with tumor size, as reported in nearly all relevant studies by researchers such as Rivadeneira [43]. This conclusion, along with the conclusion that "M1 patients have a much higher LNM rate than M0 patients," applies in almost all scenarios. Our study also revealed that patients with primary tumors located in the axillary tail and nipple/central regions of the breast were more likely to experience SLNM, requiring attention to these relatively unique primary tumor locations. Research by Gou et al. [44] indicated that the axillary tail is an independent factor contributing to LNM. Another survey revealed that tumors in the central and nipple regions were associated with LNM [45]. With respect to breast cancer subtypes, Reyal et al. [46] reported that the TNBC had the lowest SLNM rate, and the HER2-enriched subtype had the highest SLNM rate. A study based on the SEER database [47] suggested that while the risk of LNM in the TNBC is significantly lower than that in the luminal A subtype, there is no significant difference between the other subtypes. In fact, our univariate logistic regression analysis revealed statistically significant differences in SLNM between subtypes. However, in our multivariate logistic regression analysis, subtype was not considered a significant predictive factor. The above results indicate that there may indeed be differences in SLNM between different breast cancer subtypes, while TNBC is generally considered the least likely to metastasize to lymph nodes. However, when factors with greater influence on SLNM are included in the predictive model, the overall differences between subtypes are further reduced, which is the primary reason for the discrepancy between the univariate and multivariate analysis results for breast cancer subtypes.

Compared with previous studies, our nomogram has several advantages. First, almost all previous prediction model studies for SLNM were small-sample, single-center studies. Owing to limitations in the geographic area and absolute number of people included in these studies, their predictive value is questionable. Although two articles [48, 49] used the National Cancer Database (NCDB) for large-sample prediction model studies, they both limited the enrolled population to ductal carcinoma in situ (DCIS) and did not establish an SLNM model for a broader population. Therefore, on the basis of the conclusions of Bilimoria et al. [25]., we screened the population considered to have undergone SLNB in the SEER database, leveraging the advantages of this database's large, multicenter sample size. This allowed us to build a prediction model for SLNM in a wider population of patients with breast cancer. Compared with previous similar studies, our prediction information is richer, the prediction model is larger, and it can be better applied in clinical practice. Moreover, before conducting clinicopathological feature analysis, we first explored the scientific grouping of each predictive factor. This is a novel approach compared with previous SEER database-based predictive model studies. Taking "pathological type" as an example, most existing SEER database-based studies simply divide it into three categories: IDC, ILC, and others. Reducing the subgroups of a prediction factor often leads to higher AUC values and better model discrimination, which is why few researchers choose grouping methods beyond the "three-category" approach. However, the "site record–rare tumors" field appears to have not been studied previously. On the basis of the clinical value of pathological type, sample size, and significant differences in SLNM rates, we successfully created a new classification method. Similarly, we conducted a scientific exploration of the "age" factor on the basis of the inherent objective relationship between age and SLNM, resulting in an age grouping that was more suitable for this study. We believe that evaluating a predictive model solely on the basis of AUC values is insufficient. Many studies prioritize increasing AUC values, often altering factor groupings to achieve this goal, potentially misclassifying groups with high discrimination into one category, and leading to erroneous conclusions.

This nomogram holds significant potential for clinical translation and optimizing axillary management in breast cancer. By enabling personalized risk stratification (e.g., identifying low-risk patients who may safely avoid SLNB), it aligns with global trends toward surgical de-escalation, potentially reducing complications like lymphedema by 15–20% [19, 22]. Its integration with multimodal data—such as imaging (MRI/PET-CT) and targeted therapies (e.g., HER2 blockade)—supports dynamic treatment planning [18], while cost-effectiveness analyses suggest substantial savings in resource-limited settings [21]. Moreover, the nomogram’s framework can be augmented with deep learning algorithms trained on radiopathomic data, potentially elevating AUC to > 0.85. [20]

Although the nomogram showed several advantages and significant potential, this study had some limitations. Our study utilized the conclusions of Bilimoria KY et al. to define breast cancer patients with 1–5 lymph nodes examined in the SEER database as patients who underwent SLNB. This approach is similar to a double-edged sword. While this allowed us to successfully establish a predictive model for SLNM in breast cancer patients on the basis of the SEER database, this definition inevitably contains bias. We could not perfectly match the actual SLNB population, which is a major limitation of our study. Moreover, several knowledge gaps remains. First, we could not obtain other important information from the SEER database, such as vascular invasion, despite its proven importance in axillary LNM in some single-center studies [50, 51]. Second, external validation in non-Western populations (e.g., Chinese cohorts) is pending, and the model’s generalizability requires further confirmation. Since our nomogram is based on a population of White and Black individuals, efforts are needed to minimize selection bias. Finally, the nomogram’s AUC (0.700–0.711) suggests moderate discriminative power. Incorporating predictive factors such as vascular invasion and molecular biomarkers (e.g., circulating tumor DNA or immune characteristics) may improve the accuracy of the model's prediction.

Overall, although SLNB has significantly fewer surgical complications than ALND, trials such as INSEMA (NCT02466737) [52] still demonstrate persistent complications, including pain, lymphedema, and functional impairment, evident within the first month after surgery. Therefore, in clinical practice, we should strive to improve axillary lymph node assessment while ensuring patient quality of life. We developed this nomogram to more accurately predict the individual probability of SLNM, with the goal of tentatively screening patients who were predicted to be SLN (-). This may provide valuable evidence for potential SLNB exemptions, achieving broader benefits. Over the next five years, we anticipate that predictive models will increasingly integrate multiomics data (genomic, radiomic, and clinicopathological) to achieve higher precision. Moreover, artificial intelligence-driven tools could dynamically update risk predictions based on real-world data, enabling adaptive clinical decision-making. Our study serves as a foundational step toward these goals, emphasizing the need for collaborative efforts to validate and refine such models in prospective trials.