nach oben

Erschienen in:

Open Access 01.12.2024 | Research

Identification of prognostic signatures in remnant gastric cancer through an interpretable risk model based on machine learning: a multicenter cohort study

verfasst von: Zhouwei Zhan, Bijuan Chen, Hui Cheng, Shaohua Xu, Chunping Huang, Sijing Zhou, Haiting Chen, Xuanping Lin, Ruyu Lin, Wanting Huang, Xiaohuan Ma, Yu Fu, Zhipeng Chen, Hanchen Zheng, Songchang Shi, Zengqing Guo, Lihui Zhang

Erschienen in: BMC Cancer | Ausgabe 1/2024

Abstract

Objective

The purpose of this study was to develop an individual survival prediction model based on multiple machine learning (ML) algorithms to predict survival probability for remnant gastric cancer (RGC).

Methods

Clinicopathologic data of 286 patients with RGC undergoing operation (radical resection and palliative resection) from a multi-institution database were enrolled and analyzed retrospectively. These individuals were split into training (80%) and test cohort (20%) by using random allocation. Nine commonly used ML methods were employed to construct survival prediction models. Algorithm performance was estimated by analyzing accuracy, precision, recall, F1-score, area under the receiver operating characteristic curve (AUC), confusion matrices, five-fold cross-validation, decision curve analysis (DCA), and calibration curve. The best model was selected through appropriate verification and validation and was suitably explained by the SHapley Additive exPlanations (SHAP) approach.

Results

Compared with the traditional methods, the RGC survival prediction models employing ML exhibited good performance. Except for the decision tree model, all other models performed well, with a mean ROC AUC above 0.7. The DCA findings suggest that the developed models have the potential to enhance clinical decision-making processes, thereby improving patient outcomes. The calibration curve reveals that all models except the decision tree model displayed commendable predictive performance. Through CatBoost-based modeling and SHAP analysis, the five-year survival probability is significantly influenced by several factors: the lymph node ratio (LNR), T stage, tumor size, resection margins, perineural invasion, and distant metastasis.

Conclusions

This study established predictive models for survival probability at five years in RGC patients based on ML algorithms which showed high accuracy and applicative value.

Additional file 1: Supporting Information 1. Correlation Matrix of different variables.

Additional file 2: Supporting Information 2. Model Evaluation. ROC Curves for Test and Training Sets.

Additional file 3: Supporting Table 3. Metrics and scoring for quantifying the Performance Quality of Risk Models on Test Set

Additional file 4: Supporting Table 4. Other metrics and scoring for quantifying the quality of risk models

Supplementary Information

The online version contains supplementary material available at https://doi.org/10.1186/s12885-024-12303-9.

Zhouwei Zhan, Bijuan Chen and Hui Cheng contributed equally to this work and share first authorship.

Songchang Shi, Zengqing Guo and Lihui Zhang contributed equally to this work and share last authorship.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Machine learning

RGC

Remnant gastric cancer

Gastric cancer

AUC

Area under the curve

DCA

Decision curve analysis

SHAP

SHapley Additive exPlanations

LNR

Lymph nodes ratio

PUD

Peptic ulcer disease

AJCC

American Joint Committee on Cancer

UICC

Union for International Cancer Control

KNN

K-nearest neighbor

ANN

Artificial neural network

GBM

Gradient boosting machine

GNB

Gaussian NB

SVM

Support vector machine

ROC

Receiver operating characteristic

DCA

Decision curve analysis

RFE

Recursive feature elimination

FNR

False Negative Rate

FDR

False Discovery Rate

FOR

False Omission Rate

GBDT

Gradient boosted decision tree

Introduction

Remnant gastric cancer (RGC), also known as gastric stump cancer, was initially reported by Balfour in 1922 as a cancer developing in the remnant stomach following previous gastric surgery for peptic ulcer disease (PUD)[1, 2]. More recently, the definition of RGC has evolved, and it is now described as any cancer occurring in the residual stomach following a previous partial gastrectomy for benign or malignant conditions[3]. In literature, the incidence of RGC ranges approximately from 1 to 7%[4‐8]. Due to the absence of specific symptoms, RGC is often diagnosed at an advanced stage, resulting in low surgical resection rates and poor prognoses, making it an important clinical concern[4, 5]. The surgical outcomes for RGC vary across studies, with 5-year survival rates ranging from 7 to 80%[6, 9‐12].

As the number of gastrectomies continues to rise, the incidence of RGC is escalating annually[13]. It’s crucial to identify relevant prognostic factors for RGC and develop effective follow-up treatment strategies. In clinical practice, the adjacent gastric mucosa in RGC demonstrates a lower degree of atrophy when compared to cases of primary gastric cancer (GC), which suggests a unique underlying pathological mechanism[14]. Furthermore, there is a significantly heightened incidence of serosal tumor invasion in RGC, affecting between 37 to 48% of patients, contrasting sharply with the rate of 19% seen in primary GC[15]. Additionally, surgical procedures for RGC result in a notably smaller total number of harvested lymph nodes compared to those in primary GC, particularly when the preceding surgery was for gastric malignancy, since the nodes would have already been removed. As such, the lymph node grouping applied in the TNM classification system for primary GC may not be suitable for staging RGC[16]. Moreover, RGC shows a significantly higher overall frequency of splenic hilar lymph node involvement when compared to primary GC. It is worth noting that jejunal mesentery lymph node involvement is predominantly observed following Billroth II reconstruction surgeries[17, 18].

RGC often exhibits a higher rate of invasion into adjacent organs, and lymph node metastasis is frequently observed[19], which can lead to a worse prognosis than primary GC[20]. However, some studies suggest that RGC prognoses are similar to primary GC[21]. Prior research has investigated the clinical characteristics of resectable RGC in small case studies, but the factors influencing patient outcomes remain unclear or controversial[22‐24]. A meta-analysis disclosed that the significance of tumor location on survival varies among studies. Some literature indicates that tumor location does not significantly impact survival rates[25, 26], while other research reports that anastomotic site tumors may be a favorable prognostic factor[27]. Nonetheless, patients with anastomotic site tumors experience worse outcomes[23]. Thus, additional research is necessary to resolve this discrepancy.

Machine learning (ML) constitutes the bedrock of contemporary artificial intelligence advancements[28]. Although these algorithms have demonstrated substantial triumphs across various disciplines, their integration into the realms of medicine and healthcare is still in its nascent stages. The non-linear nature of real-world data impacts often challenges the effectiveness of traditional models like Linear Regression for classification forecasts and Cox Regression for predicting survival outcomes, as they are confined within a linear framework[29, 30]. In comparison with traditional mathematical models, ML excels notably in handling tasks related to classification and regression, finding broad application in developing predictive frameworks, determining tumor stages, and prognostic groupings[31‐34].

ML can facilitate various problems, from patient-level observations to employing algorithms with numerous variables, seeking combinations, and ultimately reliably predicting risks and outcomes[35]. Numerous studies have developed valuable models utilizing ML techniques[36‐39]. However, there is a dearth of research exploring the application of ML for predicting survival outcomes in RGC patients. Although ML presents significant benefits in constructing models to identify risk factors, the “black-box” nature of ML algorithms poses challenges in explaining why specific predictions are made for patients. In pursuit of these objectives, the SHapley Additive exPlanations (SHAP) methodology has recently been introduced[40, 41]. The SHAP method allows for the recognition and prioritization of attributes that influence complex classification and activity forecasting utilizing any ML model. Developing a visual predictive model to assist healthcare professionals in identifying individuals with poor prognoses would be advantageous.

Consequently, a central objective of our research was to construct and evaluate ML-based survival prediction models for patients with remnant stomach cancer over a five-year period. This endeavor encompassed not only the development of multiple ML algorithms but also an emphasis on visualizing these models to gain deeper insights into their inner workings. Furthermore, our study aimed to juxtapose the efficacy of these ML models against that of traditional linear regression models, thereby shedding light on the distinctive contributions and potential superiority of ML approaches in forecasting survival probabilities for this patient population. Through visualization, we sought to enhance interpretability and transparency, enabling a comprehensive evaluation and understanding of the complex relationships learned by the ML models in the context of RGC survival prediction.

Data and methods

Patients

Patients with RGC were enrolled at two tertiary hospitals (Fujian Provincial Hospital from June 2008 to May 2022, and Fujian Cancer Hospital from June 1999 to August 2021). RGC was characterized as an adenocarcinoma originating in the remnant stomach subsequent to a gastric resection for either a benign or malignant condition[3, 14, 42]. A total of 366 individuals participated in this study. Inclusion criteria consisted of patients who underwent surgical treatment, including radical and palliative surgery, with a follow-up duration of > 5 years or those who died. Patients with a history of neoadjuvant therapy, R1/R2 resection in previous gastrectomy, other malignant diseases within the past 5 years, death within 3 months after surgery, different pathological types, or incomplete clinicopathological data were excluded. Furthermore, patients with a follow-up duration of less than 5 years, no endpoints observed, or missing values exceeding 20% were also excluded from the study. Based on the inclusion and exclusion criteria, 286 participants remained in the study. The study’s flow chart is presented in Fig. 1. The study protocol adhered to the ethical guidelines of the 1995 Declaration of Helsinki, and was approved by the ethics committee of Fujian Cancer Hospital (ethical approval number K2021-100–01) and Fujian Provincial Hospital (ethical approval number K2022-08–034).

Data collection

Follow-up procedures encompassed outpatient visits, hospital appointments, and telephone inquiries. The follow-up period concluded on December 31, 2023. Patients’ survival time (in months) was calculated from the date of surgery to the date of death or the end of follow-up. Retrospective analysis was conducted on preoperative information (age, initial gastric disease, initial reconstruction methods, and interval between the initial surgery and RGC resection), operative details (operative approaches, combined resections, and either curative (R0) or non-curative resections (R1/2)), and postoperative data (RGC tumor location, histopathological findings, lymph nodes ratio (LNR), venous and perineural invasions, follow-up duration, and adjuvant therapy). TNM staging was performed according to the AJCC/UICC staging criteria (8th edition) after RGC surgery[43]. Histological types were classified as highly differentiated, moderately differentiated, and lowly differentiated (including signet-ring cell carcinoma, poorly differentiated, or mucinous). Tumor locations were categorized as anastomotic and non-anastomotic sites.

Study outcomes

The primary endpoint of the study was all-cause mortality within the 5-year follow-up period. All-cause mortality was defined as death resulting from any cause.

Feature selection and data preprocessing

ML algorithms were implemented in Python software, and the data were organized in the format required for applying these algorithms. Samples were classified into healthy or sepsis groups based on the outcome indicators for the classification prediction model. The K-Nearest Neighbor (KNN) algorithm[44] was used to fill in missing data. To prevent non-normal distributed features from causing incorrect outcomes in ML estimators, logistic regression (with L2 penalty and c = 0.01) was employed as an external estimator, assigning weights to each feature. This approach facilitated accurate and reliable predictions in our study.

Model development

Nine ML algorithms, including Artificial Neural Network (ANN), CatBoost, Decision Tree, Gradient Boosting Machine (GBM), Gaussian Naive Bayes (GNB), K-Nearest Neighbor (KNN), Logistic Regression, Random Forest, and Support Vector Machine (SVM), were employed to develop prognostic models. These models were compared with Linear Regression[45]. To divide the 286 patients into a training and a testing set, stratified random sampling was utilized based on the occurrence of the endpoint. The 8:2 ratio resulted in a training set of 228 patients and a test set of 58 patients.

Model performance evaluation

Various metrics and scoring methods were employed to quantify the accuracy of predictions, including application to the evaluated estimators such as accuracy, precision, recall, and F1-score. The model’s discrimination capability was assessed using the receiver operating characteristic (ROC) curve. To prevent overfitting, repeated resampling, model fitting, and evaluation were utilized. Additionally, decision curve analysis (DCA) and calibration curves were applied to calibrate the model and provide support for probability predictions.

Model interpretation

The Shapley Additive explanation (SHAP) package[46], a method for uniformly measuring feature importance in ML models, was employed for visualizing and explaining the prediction model. SHAP-based explanations offer a solid theoretical foundation and are the only attribution method that satisfies local accuracy, missingness, and consistency requirements[47]. The SHAP beeswarm plot provides a visual overview of the entire model, while sorting feature variables and creating scatter plots help explain the model. The SHAP dependence plot is used to visualize feature interactions and SHAP values, while the SHAP force plot enables visualization of the model at an individual level. We utilized SHAP to offer an explanation for our predictive model, which includes relevant risk factors contributing to mortality in patients with gastric stump cancer. This interpretation helps to enhance understanding of the model’s predictions and the factors influencing patient outcomes.

Statistical analysis

Numerical variables with normal distributions were presented as mean ± SD, while those without normal distributions were represented by median (lower quartile, upper quartile). Categorical variables were expressed as the sum (percentage). Data preprocessing was performed using R software (version 3.6.3). For missing data imputation, KNN[44], Sklearn[48], and SHAP packages[46], in Python (version 3.7) were utilized respectively. The KNN package filled in missing data, while the Sklearn package built and verified the risk models. The SHAP package was used for model visualization and explanation. All models were constructed using the Sklearn package.

Result

Clinicopathological features of RGC

A final dataset consisting of 286 patients with RGC was obtained based on the inclusion criteria. This included 250 male patients (87.4%) and 36 female patients (12.6%). The average age of all patients was 64.3 ± 10.7 years. During a 5-year follow-up period, 142 patients (49.65%) passed away. The basic participant information is presented in Table 1. The dataset encompassed 19 clinical features, including those related to the outcome variable. To prevent later model construction from being influenced by significantly correlated features, the linear correlation between continuous numerical variables in the dataset was analyzed. As shown in Supporting Information 1, there were no significantly correlated variables (r < 0.8). This ensures that the constructed model is minimally affected by redundant or confounding factors.

Table 1

The basic information of participants

	0 (N = 144)	1 (N = 142)	Overall (N = 286)
Center
Center 1	52 (36.1%)	40 (28.2%)	92 (32.2%)
Center 2	92 (63.9%)	102 (71.8%)	194 (67.8%)
Gender
Man	127 (88.2%)	123 (86.6%)	250 (87.4%)
Woman	17 (11.8%)	19 (13.4%)	36 (12.6%)
Age
Mean (SD)	63.7 (9.53)	64.8 (11.7)	64.3 (10.7)
Median [Min, Max]	65.0 [27.0, 86.0]	67.0 [4.00, 87.0]	66.0 [4.00, 87.0]
Interval
Mean (SD)	20.7 (15.5)	21.6 (14.7)	21.1 (15.1)
Median [Min, Max]	20.0 [1.00, 69.0]	20.0 [1.00, 50.0]	20.0 [1.00, 69.0]
Initial gastrectomy
Billroth I	31 (21.5%)	27 (19.0%)	58 (20.3%)
Billroth II	113 (78.5%)	115 (81.0%)	228 (79.7%)
Initial gastric disease
Benign	85 (59.0%)	87 (61.3%)	172 (60.1%)
Malignant	59 (41.0%)	55 (38.7%)	114 (39.9%)
Location
anastomotic	108 (75.0%)	106 (74.6%)	214 (74.8%)
non-anastomotic	36 (25.0%)	36 (25.4%)	72 (25.2%)
Grade
High	7 (4.9%)	0 (0%)	7 (2.4%)
Inter	66 (45.8%)	48 (33.8%)	114 (39.9%)
Low	69 (47.9%)	93 (65.5%)	162 (56.6%)
Missing	2 (1.4%)	1 (0.7%)	3 (1.0%)
T stage
Mean (SD)	2.81 (1.16)	3.61 (0.683)	3.21 (1.03)
Median [Min, Max]	3.00 [1.00, 4.00]	4.00 [1.00, 4.00]	4.00 [1.00, 4.00]
Metastasis
No	139 (96.5%)	119 (83.8%)	258 (90.2%)
Yes	5 (3.5%)	22 (15.5%)	27 (9.4%)
Missing	0 (0%)	1 (0.7%)	1 (0.3%)
Combined resection
No	122 (84.7%)	106 (74.6%)	228 (79.7%)
Yes	22 (15.3%)	36 (25.4%)	58 (20.3%)
Tumor size (cm)
Mean (SD)	4.20 (2.13)	5.61 (2.61)	4.90 (2.48)
Median [Min, Max]	4.00 [0.800, 11.0]	5.00 [0.500, 15.0]	5.00 [0.500, 15.0]
Venous invasion
No	106 (73.6%)	82 (57.7%)	188 (65.7%)
Yes	38 (26.4%)	60 (42.3%)	98 (34.3%)
Perineural invasion
No	99 (68.8%)	62 (43.7%)	161 (56.3%)
Yes	45 (31.3%)	80 (56.3%)	125 (43.7%)
Resection margins
No	138 (95.8%)	119 (83.8%)	257 (89.9%)
Yes	6 (4.2%)	23 (16.2%)	29 (10.1%)
Lymph nodes ratio
Mean (SD)	0.109 (0.243)	0.365 (0.332)	0.236 (0.318)
Median [Min, Max]	0 [0, 1.00]	0.290 [0, 1.00]	0.0500 [0, 1.00]
Postoperative complications
No	123 (85.4%)	110 (77.5%)	233 (81.5%)
Yes	21 (14.6%)	32 (22.5%)	53 (18.5%)
Adjuvant chemotherapy
No	94 (65.3%)	89 (62.7%)	183 (64.0%)
Yes	50 (34.7%)	53 (37.3%)	103 (36.0%)

0 survivor, 1 No-survivor, Center 1 Fujian Provincial Hospital, Center 2 Fujian Cancer Hospital

Feature variable selection

The data was prepared in the required format for implementing the ML algorithm. Nineteen observation indices were assessed for missing values. Aside from three instances where T-stage information was absent, no other variables exhibited any missing data. To fill in missing data, the K-Nearest Neighbor method was employed. For feature selection, recursive feature elimination (RFE) was utilized to enhance estimators’ accuracy scores or improve their performance on highly dimensional datasets. Logistic regression (with L2 penalty, c = 0.01, n = 10) was used as an external estimator to assign weights to features. This approach ensures that the selected features contribute effectively to the model’s predictive accuracy and performance.

Model performance

The predictive performance of the model during both training and testing, as measured by the AUC value, is detailed within Supporting Information 2. The confusion matrices illustrating the performance of the models trained on the test dataset are presented in Fig. 2. Upon comparison with conventional methodologies, the ML-built models showcased enhanced performance. Among all the models, CatBoost models emerged as having the highest f1-scores. The AUC ranged from 0.60 to 0.76 for the test set (refer to Supporting Information 3). Other metrics and scoring methods for quantifying the quality of risk models, such as False Negative Rate (FNR), False Positive Rate (FPR), False Discovery Rate (FDR), and False Omission Rate (FOR), are presented in Supplementary Table 4. Cross-validation serves as a principal method for internal validation[49], and in this study, five-fold cross-validation was employed. Table 2 showcases the performance metrics of the ML algorithms after being subjected to five-fold cross-validation on the test data. Notably, the KNN models achieved the most outstanding test set and f1-scores. Figure 3 further illuminates that, aside from the decision tree model, all other models delivered commendable performances, with an average AUC of the ROC exceeding 0.7, indicating their robustness and predictive capabilities.

Table 2

Metrics and Scoring for Quantifying the Quality of Model Performance with 5-Fold Stratified Cross-Validation on Test Set

	Accuracy_scores	Precision	Recall	F1-scores	AUC
	No-survivor	No-survivor	No-survivor	No-survivor
ANN	0.67±0.09	0.66±0.07	0.65±0.07	0.67±0.09	0.782 ± 0.025
CatBoost	0.62±0.10	0.64±0.13	0.62±0.10	0.62±0.10	0.757 ± 0.060
Decision Tree	0.55±0.10	0.55±0.14	0.54±0.14	0.49±0.08	0.631 ± 0.086
GBM	0.57±0.08	0.54±0.08	0.54±0.08	0.52±0.08	0.715 ± 0.063
GNB	0.57±0.11	0.54±0.18	0.58±0.11	0.54±0.15	0.793 ± 0.041
KNN	0.74±0.09	0.75±0.10	0.74±0.09	0.74±0.10	0.745 ± 0.040
Logistic	0.66±0.09	0.66±0.08	0.66±0.09	0.65±0.09	0.793 ± 0.031
Random Forest	0.64±0.14	0.66±0.12	0.61±0.09	0.58±0.06	0.728 ± 0.053
SVM	0.69±0.09	0.70±0.09	0.69±0.09	0.69±0.09	0.786 ± 0.037

LASSO least absolute shrinkage and selection operator, ANN artificial neural network, GBM gradient boosting machine, GNB Gaussian NB, KNN K-nearest neighbor, SVM supported vector machine

DCA is a method to determine whether using a prediction model for clinical decision-making provides benefits[50, 51]. In DCA, the net benefit is compared between two strategies: “treat all” and “treat none”. The optimal strategy is the one with the highest net benefit at a specific threshold probability. For the majority of models, the net benefit of the decision curve was higher than that for either “treat all” or “treat none” across all likely threshold probabilities. The GNB model showed a significant decrease in net benefit when threshold probabilities exceeded 80%. For the other eight models, a high net benefit was observed over a wide range of threshold probabilities. Consequently, the DCA results indicated that the constructed models could aid clinical decision-making to improve patient outcomes (Fig. 4).

Furthermore, the calibration curve was assessed to evaluate another measure of discrimination. The reference line is diagonal, and the calibration curve aligns with the reference when the predicted value equals the observed value. The curve is below the reference when risk is overestimated, and above when risk is underestimated. Figure 5 demonstrates that except for the decision tree model, the predicted values of the other eight models exhibited good performance.

Visualization and explanation of models

The 5-year death prediction model based on ML techniques performed satisfactorily in terms of model validity and clinical net benefit. Nonetheless, the opaque nature of ML models creates a lack of transparency. SHAP values reveal the individual contributions of each feature to the final prediction, effectively clarifying and interpreting model predictions for specific patients. After sorting features, SHAP was applied to distinguish the feature values for the selected variable (Fig. 6A). To explain the CatBoost-based model, the SHAP summary plot was utilized. The study findings suggested that a high lymph node ratio (red) had a negative impact on prognosis, while a low lymph node ratio (blue) contributed positively. Concurrently, a high Tstage (red) showed a negative effect on prognosis, whereas a low Tstage (blue) had a positive influence on the patient’s outlook. The results corresponded to those concerning resection margins, positive metastasis, and perineural invasion.

After several years of development, traditional ML methods have become capable of displaying feature variables. However, these methods fail to demonstrate the positive and negative relationships between features within the model (Fig. 6B).

The SHAP Dependence Plot enables visualization of the effects within the model. Each dot represents a sample (Fig. 6C). It was observed that as the T stage increased, so did the SHAP values. The SHAP Force Plot illustrates the individual level within the model. Figure 6D demonstrates the significance of influencing factors for the three subjects in the RGC. In comparison to the first sample (SHAP, 1.03) and third sample (SHAP, 1.54), the second sample (SHAP, -0.41) belonged to the low-risk group, possessing a decreased risk of 5-year death. Variables influencing the model’s outcomes are listed below the horizontal axis. Different individuals might have identical or slightly varying key variables affecting their outcomes.

Discussion

Our research harnessed ML techniques to create a set of ML models skilled at forecasting five-year survival prognoses for RGC following surgery. This is the first investigation to examine prognostic risk factors for RGC utilizing ML models. Through the development and validation of this model, we have showcased its consistent performance and superior reproducibility. Significantly, our risk model not only demonstrates robust stability compared to conventional techniques but also addresses the ‘black box’ issue associated with ML models by incorporating model visualization techniques. By visualizing the model, we enable healthcare professionals to more effectively discern post-surgery survival outcomes. These predictive indicators potentially grant clinicians an enhanced ability to tailor care strategies, thereby optimizing risk factor management for high-risk patients.

The proficiency, user-friendliness, and resilience of ML models in recognizing complex data significantly surpass traditional statistical models, overcoming their limitations regarding statistical efficiency[49]. In ML models, classes can be utilized for feature selection or dimensionality reduction to enhance the model’s accuracy score or improve its performance on high-dimensional datasets[52]. Gradient boosted decision trees (GBDTs), including XGBoost, LightGBM, and CatBoost, are potent tools for big data classification tasks. Our method provides not only a precise and clinically feasible technique for predicting RGC patient survival outcomes but also enhances the interpretability of the predictions. The SHAP value quantifies each feature marker’s contribution to the model’s identification results, enabling comprehensive global explanations[46, 53, 54]. The predictive capacity of a clinical factor in the XGBoost model elevates as the average absolute SHAP value of each factor rises. To obtain a uniform perspective, these factors were consolidated, and SHAP interpretation drew from individual patients. SHAP effectively addresses multicollinearity issues and determines whether an influence is beneficial, thanks to its ability to consider both individual factor effects and their synergies[41]. According to the SHAP values, LNR, T stage, tumor size, resection margins, perineural invasion, and distant metastasis were determined as the most crucial factors in identifying five-year survival prognoses for RGC. In essence, these factors can be considered an optimal subset representing the key players in survival risk assessment for RGC patients. The interpretability of the optimal subset stems from capturing and visualizing the effect direction of each feature and its contribution size to the prediction. This enables clinicians to gain specific insights into how individual predictions are influenced by various variables, affording a personalized, fine-grained understanding of different patients’ prognoses.

Most reports indicate that RGC is often diagnosed at an advanced stage, leading to a relatively low rate of curative resection and unfavorable prognosis. This suggests that RGC may possess distinct biological characteristics from primary GC[1, 55, 56]. However, some researchers have compared RGC to primary GC and found no significant difference in survival rates between the two[57‐59]. A few studies have investigated the clinicopathologic features and prognosis of RGC, but consensus has not been reached yet[1, 60, 61]. Similar to prior research[56, 62, 63], our study noted that more than 80% of RGC patients were male. This may be attributed to the fact that men are more susceptible to developing both gastroduodenal ulcers and GC[64, 65].

In the majority of studies, RGC lymph node staging adheres to the UICC/AJCC grading criteria. However, in first-time GC patients, postoperative lymph node drainage changes and the lymph nodes detected by RGC cannot comprehensively determine the N stage, particularly given the occurrence of RGC after GC. The total number of postoperative lymph node dissections during re-surgery typically does not exceed 10, which is significantly fewer than the number of lymph nodes dissected by RGC after surgery for benign lesions. This may lead to inaccurate staging. A study analyzed the prognostic significance of LNR in resectable RGC using retrospective propensity score matching and found that LNR served as an independent prognostic factor for RGC, while the number of positive lymph nodes did not act as an independent prognostic factor[42]. Our study reinforced this notion using an ML method. Therefore, LNR may be a more dependable prognostic factor for RGC patients. However, some studies suggest that LNR is not superior to the number of positive lymph nodes[66]. Further analysis incorporating data from multiple centers with larger sample sizes is necessary.

Another study identified lymphatic invasion and pathological T stage as risk factors for lymph node metastasis in RGC[67]. Many researchers have proposed that high rates of adjacent organ invasion and lymph node metastasis contribute to RGC’s poorer prognosis[19, 20]. Nonetheless, one study found pathological T stage and venous invasion to be significant independent risk factors for survival among RGC patients[68]; however, pathological N stage showed no significant association with long-term survival[68]. This contradicts our study’s findings. In our research, venous infiltration was not included in the prognostic model, suggesting it is not an independent prognostic factor, and nerve invasion plays a crucial role. Given their small sample size (65 cases) and single-center retrospective study, the prognostic value of venous infiltration deserves further examination. It has been demonstrated that tumor site affects RGC’s prognosis[22, 23, 27]. RGC’s tumor location is a vital factor for predicting recurrence patterns and overall survival[69]. However, in our study, tumor location at the anastomotic site did not act as an independent prognostic factor, which aligns with previous reports[70, 71].

The current study unavoidably has several limitations. Firstly, due to its retrospective nature, there was selection bias. Secondly, the sample size was relatively small. Thirdly, some crucial information was incomplete or missing, likely caused by difficulties in gathering data about the initial operation. Further prospective studies involving RGC patients are necessary to comprehensively explore the clinicopathological characteristics of RGC.

Given the primary aim of our research to optimize the use of pathological features in predicting mortality risks for post-gastrectomy GC patients, we intentionally confined our analysis to these specific characteristics. Consequently, we did not incorporate other potentially influential mortality risk factors, such as comorbidities, laboratory indices, and other clinical attributes for stratification purposes. This deliberate focus on pathology data alone may have limited the model's ability to achieve its maximum predictive capacity. Nonetheless, this study serves as a foundational step towards refining risk prediction. Moving forward, we plan to extend our work by integrating additional clinical indicators and biomarkers to construct a more refined and comprehensive predictive model. Such a holistic approach will likely enhance the precision and practicality of risk assessment in this patient population.

Conclusion

In summary, utilizing the CatBoost ML model to develop a prognostic risk model for RGC can effectively assist clinicians in predicting patient outcomes, outperforming traditional ML methods. Moreover, combining SHAP and ML may serve as a suitable approach to identify individuals with poor prognoses.

Acknowledgements

We appreciate the collaboration and discussions with our co-authors, and we thank the funding agency for their support. We appreciate the valuable comments and suggestions from the anonymous reviewers.

Declarations

Studies involving human participants were reviewed and granted approval by the ethics committee of Fujian Cancer Hospital and Fujian Provincial Hospital, Fuzhou, People’s Republic of China. The research adhered to the Declaration of Helsinki. Informed written consent was obtained from participants prior to study participation.

Not applicable.

Competing interests

The authors declare no competing interests.

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1: Supporting Information 1. Correlation Matrix of different variables.

Additional file 2: Supporting Information 2. Model Evaluation. ROC Curves for Test and Training Sets.

Additional file 3: Supporting Table 3. Metrics and scoring for quantifying the Performance Quality of Risk Models on Test Set

Additional file 4: Supporting Table 4. Other metrics and scoring for quantifying the quality of risk models

Ohira M, Toyokawa T, Sakurai K, Kubo N, Tanaka H, Muguruma K, Yashiro M, Onoda N, Hirakawa K. Current status in remnant gastric cancer after distal gastrectomy. World J Gastroenterol. 2016;22(8):2424–33.PubMedPubMedCentralCrossRef

Balfour DC. Factors influencing the life expectancy of patients operated on for gastric ulcer. Ann Surg. 1922;76(3):405–8.PubMedPubMedCentralCrossRef

Mak TK, Guan B, Peng J, Chong TH, Wang C, Huang S, Yang J. Prevalence and characteristics of gastric remnant cancer: a systematic review and meta-analysis. Asian J Surg. 2021;44(1):11–7.PubMedCrossRef

Onodera H, Tokunaga A, Yoshiyuki T, Kiyama T, Kato S, Matsukura N, Masuda G, Tajiri T. Surgical outcome of 483 patients with early gastric cancer: prognosis, postoperative morbidity and mortality, and gastric remnant cancer. Hepatogastroenterology. 2004;51(55):82–5.PubMed

Ikeda Y, Saku M, Kishihara F, Maehara Y. Effective follow-up for recurrence or a second primary cancer in patients with early gastric cancer. Br J Surg. 2005;92(2):235–9.PubMedCrossRef

Thorban S, Böttcher K, Etter M, Roder JD, Busch R, Siewert JR. Prognostic factors in gastric stump carcinoma. Ann Surg. 2000;231(2):188–94.PubMedPubMedCentralCrossRef

Sons HU, Borchard F. Gastric carcinoma after surgical treatment for benign ulcer disease: some pathologic-anatomic aspects. Int Surg. 1987;72(4):222–6.PubMed

Mezhir JJ, Gonen M, Ammori JB, Strong VE, Brennan MF, Coit DG. Treatment and outcome of patients with gastric remnant cancer after resection for peptic ulcer disease. Ann Surg Oncol. 2011;18(3):670–6.PubMedCrossRef

Inomata M, Shiraishi N, Adachi Y, Yasuda K, Aramaki M, Kitano S. Gastric remnant cancer compared with primary proximal gastric cancer. Hepatogastroenterology. 2003;50(50):587–91.PubMed

10.

Chen CN, Lee WJ, Lee PH, Chang KJ, Chen KM. Clinicopathologic characteristics and prognosis of gastric stump cancer. J Clin Gastroenterol. 1996;23(4):251–5.PubMedCrossRef

11.

Santoro R, Ettorre GM, Santoro E. Subtotal gastrectomy for gastric cancer. World J Gastroenterol. 2014;20(38):13667–80.PubMedPubMedCentralCrossRef

12.

Ikeguchi M, Kondou A, Shibata S, Yamashiro H, Tsujitani S, Maeta M, Kaibara N. Clinicopathologic differences between carcinoma in the gastric remnant stump after distal partial gastrectomy for benign gastroduodenal lesions and primary carcinoma in the upper third of the stomach. Cancer. 1994;73(1):15–21.PubMedCrossRef

13.

Hanyu T, Wakai A, Ishikawa T, Ichikawa H, Kameyama H, Wakai T. Carcinoma in the remnant stomach during long-term follow-up after distal gastrectomy for gastric cancer: analysis of cumulative incidence and associated risk factors. World J Surg. 2018;42(3):782–7.PubMedCrossRef

14.

Shukla A, Kalayarasan R, Gnanasekaran S, Pottakkat B. Appraisal of gastric stump carcinoma and current state of affairs. World J Clin Cases. 2023;11(13):2864–73.PubMedPubMedCentralCrossRef

15.

Tokunaga M, Sano T, Ohyama S, Hiki N, Fukunaga T, Yamada K, Yamaguchi T. Clinicopathological characteristics and survival difference between gastric stump carcinoma and primary upper third gastric cancer. J Gastrointest Surg. 2013;17(2):313–8.PubMedCrossRef

16.

Wang H, Qi H, Liu X, Gao Z, Hidasa I, Aikebaier A, Li K. Positive lymph node ratio is an index in predicting prognosis for remnant gastric cancer with insufficient retrieved lymph node in R0 resection. Sci Rep. 2021;11(1):2022.PubMedPubMedCentralCrossRef

17.

Shimada H, Fukagawa T, Haga Y, Oba K. Does remnant gastric cancer really differ from primary gastric cancer? A systematic review of the literature by the Task Force of Japanese Gastric Cancer Association. Gastric Cancer. 2016;19(2):339–49.PubMedCrossRef

18.

Han SL, Hua YW, Wang CH, Ji SQ, Zhuang J. Metastatic pattern of lymph node and surgery for gastric stump cancer. J Surg Oncol. 2003;82(4):241–6.PubMedCrossRef

19.

Tanigawa N, Nomura E, Lee SW, Kaminishi M, Sugiyama M, Aikou T, Kitajima M. Current state of gastric stump carcinoma in Japan: based on the results of a nationwide survey. World J Surg. 2010;34(7):1540–7.PubMedPubMedCentralCrossRef

20.

Tran TB, Hatzaras I, Worhunsky DJ, Vitiello GA, Squires MH 3rd, Jin LX, Spolverato G, Votanopoulos KI, Schmidt C, Weber S, et al. Gastric remnant cancer: a distinct entity or simply another proximal gastric cancer? J Surg Oncol. 2015;112(8):877–82.PubMedCrossRef

21.

Hu X, Tian DY, Cao L, Yu Y. Progression and prognosis of gastric stump cancer. J Surg Oncol. 2009;100(6):472–6.PubMedCrossRef

22.

An JY, Youn HG, Ha TK, Choi MG, Kim KM, Noh JH, Sohn TS, Kim S. Clinical significance of tumor location in remnant gastric cancers developed after partial gastrectomy for primary gastric cancer. J Gastrointest Surg. 2008;12(4):689–94.PubMedCrossRef

23.

Namikawa T, Kitagawa H, Iwabu J, Okabayashi T, Kobayashi M, Hanazaki K. Tumors arising at previous anastomotic site may have poor prognosis in patients with gastric stump cancer following gastrectomy. J Gastrointest Surg. 2010;14(12):1923–30.PubMedCrossRef

24.

Zhang DW, Dong B, Li Z, Dai DQ. Clinicopathologic features of remnant gastric cancer over time following distal gastrectomy. World J Gastroenterol. 2015;21(19):5972–8.PubMedPubMedCentralCrossRef

25.

Lee SB, Kim JH, Kim DH, Jeon TY, Kim DH, Kim GH, Park DY. Clinicopathological characteristics and prognosis of remnant gastric cancer. J Gastric Cancer. 2010;10(4):219–25.PubMedPubMedCentralCrossRef

26.

Ojima T, Iwahashi M, Nakamori M, Nakamura M, Naka T, Katsuda M, Iida T, Tsuji T, Hayata K, Takifuji K, et al. Clinicopathological characteristics of remnant gastric cancer after a distal gastrectomy. J Gastrointest Surg. 2010;14(2):277–81.PubMedCrossRef

27.

Firat O, Guler A, Sozbilen M, Ersin S, Kaplan H. Gastric remnant cancer: an old problem with novel concerns. Langenbecks Arch Surg. 2009;394(1):93–7.PubMedCrossRef

28.

Subasi O, Bel O, Manzano J, Barker KJA. The landscape of modern machine learning: a review of machine, distributed and federated learning. 2023, abs/2312.03120. https://doi.org/10.48550/arXiv.2312.03120.

29.

Li S, Yi H, Leng Q, Wu Y, Mao Y. New perspectives on cancer clinical research in the era of big data and machine learning. Surg Oncol. 2024;52:102009.PubMedCrossRef

30.

Gross AJ, Pisano CE, Khunsriraksakul C, Spratt DE, Park HS, Sun Y, Wang M, Zaorsky NG. Real-World Data: Applications and relevance to cancer clinical trials. Semin Radiat Oncol. 2023;33(4):374–85.PubMedCrossRef

31.

Capobianco E. High-dimensional role of AI and machine learning in cancer research. Br J Cancer. 2022;126(4):523–32.PubMedPubMedCentralCrossRef

32.

Lotter W, Hassett MJ, Schultz N, Kehl KL, Van Allen EM, Cerami E. Artificial intelligence in oncology: current landscape, challenges, and future directions. Cancer Discov. 2024:Of1-of16. https://doi.org/10.1158/2159-8290.

33.

Zhu L, Pan J, Mou W, Deng L, Zhu Y, Wang Y, Pareek G, Hyams E, Carneiro BA, Hadfield MJ, et al. Harnessing artificial intelligence for prostate cancer management. Cell Rep Med. 2024;5:101506.PubMedPubMedCentralCrossRef

34.

Wang H, Zhang C, Li Q, Tian T, Huang R, Qiu J, Tian R. Development and validation of prediction models for papillary thyroid cancer structural recurrence using machine learning approaches. BMC Cancer. 2024;24(1):427.PubMedPubMedCentralCrossRef

35.

Obermeyer Z, Emanuel EJ. Predicting the future - big data, machine learning, and clinical medicine. N Engl J Med. 2016;375(13):1216–9.PubMedPubMedCentralCrossRef

36.

Peng ZH, Tian JH, Chen BH, Zhou HB, Bi H, He MX, Li MR, Zheng XY, Wang YW, Chong T, et al. Development of machine learning prognostic models for overall survival of prostate cancer patients with lymph node-positive. Sci Rep. 2023;13(1):18424.PubMedPubMedCentralCrossRef

37.

Karabacak M, Jagtiani P, Carrasquilla A, Germano IM, Margetis K. Prognosis individualized: survival predictions for WHO grade II and III gliomas with a machine learning-based web application. NPJ Digit Med. 2023;6(1):200.PubMedPubMedCentralCrossRef

38.

Ji L, Zhang W, Huang J, Tian J, Zhong X, Luo J, Zhu S, He Z, Tong Y, Meng X, et al. Bone metastasis risk and prognosis assessment models for kidney cancer based on machine learning. Front Public Health. 2022;10:1015952.PubMedPubMedCentralCrossRef

39.

Kuwayama N, Hoshino I, Mori Y, Yokota H, Iwatate Y, Uno T. Applying artificial intelligence using routine clinical data for preoperative diagnosis and prognosis evaluation of gastric cancer. Oncol Lett. 2023;26(5):499.PubMedPubMedCentralCrossRef

40.

Rodríguez-Pérez R, Bajorath J. Interpretation of compound activity predictions from complex machine learning models using local approximations and shapley values. J Med Chem. 2020;63(16):8761–77.PubMedCrossRef

41.

Nohara Y, Matsumoto K, Soejima H, Nakashima N. Explanation of machine learning models using shapley additive explanation and application for real data in hospital. Comput Methods Programs Biomed. 2022;214:106584.PubMedCrossRef

42.

Yang B, Liu T, Cui H, Lu Z, Fang G, Xue X, Luo T. The value of lymph nodes ratios in the prognosis of resectable remnant gastric cancer through the retrospective propensity score matching analysis. World J Surg Oncol. 2023;21(1):245.PubMedPubMedCentralCrossRef

43.

O’Sullivan B, Brierley J, Byrd D, Bosman F, Kehoe S, Kossary C, Pineros M, Van Eycken E, Weir HK, Gospodarowicz M. The TNM classification of malignant tumours-towards common understanding and reasonable expectations. Lancet Oncol. 2017;18(7):849–51.PubMedPubMedCentralCrossRef

44.

Bania RK, Halder A. R-Ensembler: A greedy rough set based ensemble attribute selection algorithm with kNN imputation for classification of medical data. Comput Methods Programs Biomed. 2020;184: 105122.PubMedCrossRef

45.

Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M, Prettenhofer P, Weiss R, Dubourg V, et al. Scikit-learn: machine learning in python. J Mach Learn Res. 2011;12:2825–30.

46.

Lundberg SM, Lee S-I. Proceedings of the 31st International conference on neural information processing systems. In: A unified approach to interpreting model predictions. Long Beach: Curran Associates Inc; 2017. p. 4768–77.

47.

Lundberg SM, Erion G, Chen H, DeGrave A, Prutkin JM, Nair B, Katz R, Himmelfarb J, Bansal N, Lee S-I: From local explanations to global understanding with explainable AI for trees. Nature Machine Intelligence 2020;2(1):56-67.

48.

Tanaka T. [Fundamentals] 5. Python+scikit-learn for machine learning in medical imaging. Nihon Hoshasen Gijutsu Gakkai Zasshi. 2023;79(10):1189–93.PubMedCrossRef

49.

Colmenarejo G. Machine learning models to predict childhood and adolescent obesity: a review. Nutrients. 2020;12(8):2466.PubMedPubMedCentralCrossRef

50.

Vickers AJ, Elkin EB. Decision curve analysis: a novel method for evaluating prediction models. Med Decis Making. 2006;26(6):565–74.PubMedPubMedCentralCrossRef

51.

Vickers AJ, Cronin AM, Elkin EB, Gonen M. Extensions to decision curve analysis, a novel method for evaluating diagnostic tests, prediction models and molecular markers. BMC Med Inform Decis Mak. 2008;8:53.PubMedPubMedCentralCrossRef

52.

Abraham A, Pedregosa F, Eickenberg M, Gervais P, Mueller A, Kossaifi J, Gramfort A, Thirion B, Varoquaux G. Machine learning for neuroimaging with scikit-learn. Front Neuroinform. 2014;8:14.PubMedPubMedCentralCrossRef

53.

Lundberg SM, Nair B, Vavilala MS, Horibe M, Eisses MJ, Adams T, Liston DE, Low DK, Newman SF, Kim J, et al. Explainable machine-learning predictions for the prevention of hypoxaemia during surgery. Nat Biomed Eng. 2018;2(10):749–60.PubMedPubMedCentralCrossRef

54.

Lundberg SM, Erion G, Chen H, DeGrave A, Prutkin JM, Nair B, Katz R, Himmelfarb J, Bansal N, Lee S-I. From local explanations to global understanding with explainable AI for trees. Nat Mach Intell. 2020;2(1):56–67.PubMedPubMedCentralCrossRef

55.

Dhir M. Gastric Remnant Cancer: Is it different from primary gastric cancer? Insights into a unique clinical entity. Ann Surg Oncol. 2020;27(11):4079–81.PubMedCrossRef

56.

Wang SH, Zhang JC, Zhu L, Li H, Hu KW. Does gastric stump cancer really differ from primary proximal gastric cancer? A multicentre, propensity score matching-used, retrospective cohort study. World J Gastrointest Surg. 2023;15(11):2553–63.PubMedPubMedCentralCrossRef

57.

Schaefer N, Sinning C, Standop J, Overhaus M, Hirner A, Wolff M. Treatment and prognosis of gastric stump carcinoma in comparison with primary proximal gastric cancer. Am J Surg. 2007;194(1):63–7.PubMedCrossRef

58.

Galata C, Ronellenfitsch U, Weiß C, Blank S, Reißfelder C, Hardt J. Surgery for gastric remnant cancer results in similar overall survival rates compared with primary gastric cancer: a propensity score-matched analysis. Ann Surg Oncol. 2020;27(11):4196–203.PubMedPubMedCentralCrossRef

59.

Ramos M, Pereira MA, Dias AR, Dantas ACB, Szor DJ, Ribeiro U Jr, Zilberstein B, Cecconello I. Remnant gastric cancer: an ordinary primary adenocarcinoma or a tumor with its own pattern? World J Gastrointest Surg. 2021;13(4):366–78.PubMedPubMedCentralCrossRef

60.

Song XH, Liu K, Sun LF, Chen XL, Zhao LY, Zhang WH, Chen XZ, Yang K, Zhang B, Chen ZX, et al. Clinicopathological characteristics and prognostic factors of remnant gastric cancer: a single-center retrospective analysis of 90 patients. Int J Surg. 2018;51:97–103.PubMedCrossRef

61.

Liao G, Wen S, Xie X, Wu Q. Laparoscopic gastrectomy for remnant gastric cancer: risk factors associated with conversion and a systematic analysis of literature. Int J Surg. 2016;34:17–22.PubMedCrossRef

62.

Ubøe AAS, Våge C, Mjønes P, Bringeland EA, Fossmark R. Gastric remnant cancer and long-term survival in Central Norway 2001 to 2016 - a population-based study. Surg Oncol. 2023;51:102008.PubMedCrossRef

63.

An JY, Choi MG, Noh JH, Sohn TS, Kim S. The outcome of patients with remnant primary gastric cancer compared with those having upper one-third gastric cancer. Am J Surg. 2007;194(2):143–7.PubMedCrossRef

64.

Di Leo A, Pedrazzani C, Bencivenga M, Coniglio A, Rosa F, Morgani P, Marrelli D, Marchet A, Cozzaglio L, Giacopuzzi S, et al. Gastric stump cancer after distal gastrectomy for benign disease: clinicopathological features and surgical outcomes. Ann Surg Oncol. 2014;21(8):2594–600.PubMedCrossRef

65.

Sowa M, Kato Y, Onoda N, Kubo T, Maekawa H, Yoshikawa K, Nishimura M, Nakanishi I, Chung YS. Early cancer of the gastric remnant with special reference to the importance of follow-up of gastrectomized patients. Eur J Surg Oncol. 1993;19(1):43–9.PubMed

66.

Nakagawa M, Choi YY, An JY, Hong JH, Kim JW, Kim HI, Cheong JH, Hyung WJ, Choi SH, Noh SH. Staging for remnant gastric cancer: the metastatic lymph node ratio vs. the UICC 7th Edition System. Ann Surg Oncol. 2016;23(13):4322–31.PubMedCrossRef

67.

Hayashi M, Fujita T, Matsushita H. Evaluating the optimal treatment strategy for early and advanced remnant gastric cancer. ANZ J Surg. 2022;92(11):2907–14.PubMedCrossRef

68.

Matsuo K, Lee SW, Tanaka R, Imai Y, Honda K, Taniguchi K, Tomiyama H, Uchiyama K. T stage and venous invasion are crucial prognostic factors for long-term survival of patients with remnant gastric cancer: a cohort study. World J Surg Oncol. 2021;19(1):291.PubMedPubMedCentralCrossRef

69.

Sun B, Zhang H, Wang J, Cai H, Xuan Y, Xu D. Tumor location causes different recurrence patterns in remnant gastric cancer. J Gastric Cancer. 2022;22(4):369–80.PubMedPubMedCentralCrossRef

70.

Takahashi M, Takeuchi H, Tsuwano S, Nakamura R, Takahashi T, Wada N, Kawakubo H, Saikawa Y, Kitagawa Y. Surgical resection of remnant gastric cancer following distal gastrectomy: a retrospective clinicopathological Study. Ann Surg Oncol. 2016;23(2):511–21.PubMedCrossRef

71.

Irino T, Hiki N, Ohashi M, Nunobe S, Tokunaga M, Sano T, Yamaguchi T. Characteristics of gastric stump cancer: a single hospital retrospective analysis of 262 patients. Surgery. 2016;159(6):1539–47.PubMedCrossRef

Titel: Identification of prognostic signatures in remnant gastric cancer through an interpretable risk model based on machine learning: a multicenter cohort study
verfasst von: Zhouwei Zhan
Bijuan Chen
Hui Cheng
Shaohua Xu
Chunping Huang
Sijing Zhou
Haiting Chen
Xuanping Lin
Ruyu Lin
Wanting Huang
Xiaohuan Ma
Yu Fu
Zhipeng Chen
Hanchen Zheng
Songchang Shi
Zengqing Guo
Lihui Zhang
Publikationsdatum: 01.12.2024
Verlag: BioMed Central
Erschienen in: BMC Cancer / Ausgabe 1/2024
Elektronische ISSN: 1471-2407
DOI: https://doi.org/10.1186/s12885-024-12303-9

Update Onkologie

Bestellen Sie unseren Fach-Newsletter und bleiben Sie gut informiert.

Newsletter bestellen

Live-Webinar "Urologie und Sexualmedizin in der Praxis"

Springer Medizin

Identification of prognostic signatures in remnant gastric cancer through an interpretable risk model based on machine learning: a multicenter cohort study

Abstract

Objective

Methods

Results

Conclusions

Supplementary Information

Publisher’s Note

Introduction

Data and methods

Patients

Data collection

Study outcomes

Feature selection and data preprocessing

Model development

Model performance evaluation

Model interpretation

Statistical analysis

Result

Clinicopathological features of RGC

Feature variable selection

Model performance

Visualization and explanation of models

Discussion

Conclusion

Acknowledgements

Declarations

Competing interests

Publisher’s Note

Supplementary Information

Neu im Fachgebiet Onkologie

Bei seelischem Stress sind Checkpoint-Hemmer weniger wirksam

Antikörper mobilisiert Neutrophile gegen Krebs

Erhebliches Risiko für Kehlkopfkrebs bei mäßiger Dysplasie

15% bedauern gewählte Blasenkrebs-Therapie

Update Onkologie

Live-Webinar "Urologie und Sexualmedizin in der Praxis"

Springer Medizin

Abstract

Objective

Methods

Results

Conclusions

Supplementary Information

Publisher’s Note

Introduction

Data and methods

Patients

Data collection

Study outcomes

Feature selection and data preprocessing

Model development

Model performance evaluation

Model interpretation

Statistical analysis

Result

Clinicopathological features of RGC

Feature variable selection

Model performance

Visualization and explanation of models

Discussion

Conclusion

Acknowledgements

Declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Publisher’s Note

Supplementary Information

Weitere Artikel der Ausgabe 1/2024

Correlation between socioeconomic indices and epidemiological indices of thyroid cancer from 1990 to 2019 year: a global ecologic study

p53/E2F7 axis promotes temozolomide chemoresistance in glioblastoma multiforme

Comparing the efficacy of regorafenib and 5-fluorouracil-based rechallenge chemotherapy in the third-line treatment of metastatic colorectal cancer

Prospective associations of leucocyte subtypes and obesity with the risk of developing cutaneous malignant melanoma in the UK Biobank cohort

Central nervous system metastases in breast cancer patients with germline BRCA pathogenic variants compared to non-carriers: a matched-pair analysis

Utility of bronchoscopically obtained frozen cytology pellets for next-generation sequencing

Neu im Fachgebiet Onkologie

Bei seelischem Stress sind Checkpoint-Hemmer weniger wirksam

Antikörper mobilisiert Neutrophile gegen Krebs

Erhebliches Risiko für Kehlkopfkrebs bei mäßiger Dysplasie

15% bedauern gewählte Blasenkrebs-Therapie

Update Onkologie