Introduction

Glioblastomas are the most aggressive brain tumors, accounting for 12–15% of all brain tumors. They are the most common malignant brain tumors in adults in the United States (US), with an incidence rate of 3.21 per 100,0001. Early detection and traditional treatments of glioblastomas are infrequently effective2 because these tumors are invasive, and the blood–brain barrier precludes medicines from eradicating tumor cells3. Although the implementation of temozolomide and radiotherapy has improved glioblastoma patients' survival4,5, their median survival ranges from 9 to 16 months, depending on the medical care they receive6.

Prognostic prediction of cancer, especially of glioblastoma given its low median survival, is vital in planning patients' treatment. Predicting patients' survival helps clinicians make informed decisions about treatment methods and surgeries and select the most effective ones. In addition, it enables patients and their families to better comprehend the patients' condition, make appropriate decisions, and reduce their anxiety. These factors have made survival prediction a significant problem that requires more attention as well as accurate solutions.

The use of machine learning (ML) and deep learning (DL) methods in bioinformatics and medicine has increased dramatically7. They have been widely used in oncology as well and have shown promising results8,9. In recent years, various studies have used different ML and statistical models to predict glioblastoma patients' survival3,10,11,12,13. For instance, Li et al.3 developed nomograms to predict the survival of glioblastoma patients using the SEER database. They applied a Cox proportional hazards regression model to identify the prognostic factors of patients and used those factors to construct the nomogram. The C-index of the nomogram was 0.729 in the training group and 0.734 in the verification group. Al-Husseini et al.10 assessed how a prior malignancy affects glioblastoma patients' survival. They used unadjusted Kaplan–Meier analysis and multivariable covariate-adjusted Cox models to calculate the glioblastoma-specific and overall survival of these patients. The unadjusted Kaplan–Meier analysis revealed that a prior history of cancer had an adverse effect on both glioblastoma-specific and overall survival, whereas the multivariable covariate-adjusted Cox models showed no considerable differences in either. Senders et al.11 proposed both statistical and ML algorithms and developed online software to predict the number of survival months (regression) and 1-year survival status (binary classification) of glioblastoma patients based on SEER. Three statistical models were utilized in their study; their accelerated failure time model achieved C-index values of 0.70 for both overall survival and 1-year survival status. Samara et al.12 introduced an ensemble learning model to predict glioblastoma patients' survival based on the SEER database. They used four ML algorithms as base classifiers and four ensemble techniques, among which RF achieved the best results, with area under the curve (AUC) values of 0.937, 0.780, and 0.893 for short-, intermediate-, and long-term survival, respectively. Bakirarar et al.13 used five ML models and two hybrid models built from them to predict the 1-year and 2-year survival of glioblastoma patients based on the SEER database. The hybrid models achieved the best results, with AUC values of 0.856 and 0.764 for 1-year and 2-year survival, respectively.

Nonetheless, notable research gaps persist in existing studies. While DL has demonstrated promising outcomes in predicting survival for cancer patients14,15,16,17, none of the preceding investigations, to the best of our knowledge, have harnessed DL algorithms to predict the survival of glioblastoma patients using the Surveillance Epidemiology and End Results (SEER) database—an invaluable resource in the realm of cancer research11,12,13,18,19,20,21. Additionally, the majority of SEER-based studies have predominantly approached the glioblastoma survival prediction problem through a binary lens, focusing on whether a patient will survive for a specific period11,12,13. This limited perspective overlooks the potential insights offered by multiclass classification and regression approaches. Furthermore, these studies have largely neglected the critical aspect of model explainability and interpretability. The absence of attention to these factors diminishes the trust clinicians may place in ML and DL models in real-world practice.

In this study, we address these gaps by developing five ML models—extreme gradient boosting (XGBoost), adaptive boosting (AdaBoost), decision tree (DT), K-nearest neighbors (KNN), and random forest (RF)—as well as a deep neural network (DNN) model. Our aim is to predict glioblastoma patients' survival using both classification and regression approaches. This study marks three significant contributions:

Pioneering use of DL in glioblastoma survival prediction

This study stands as the first to use DL in both classification and regression approaches to predict glioblastoma patients' survival based on the SEER database. By evaluating the performance of both ML and DL models, it provides a more comprehensive understanding of their capabilities.

Clinically meaningful survival classes

Departing from binary classification, this study introduces and utilizes five clinically meaningful classes for survival based on established clinical guidelines. This innovative approach aims for more accurate predictions, facilitating effective and precise treatment planning for glioblastoma patients by predicting the expected duration of survival.

Enhanced model interpretability with SHAP

A pioneering effort, this study utilizes Shapley Additive Explanations (SHAP) to interpret SEER-based survival predictions for glioblastoma patients. This not only contributes to the transparency of our study but also enhances its reliability and robustness, making strides toward building trust in the utilization of ML and DL models in clinical decision-making.

Results

According to the ordinary least squares (OLS) method22,23, as can be seen from Table S1 in the Supplementary, all features except two (sex and marital status at diagnosis) had a significant relationship with the predicted survival at the 0.05 level. These two features were nonetheless retained in this study because they have been used in all similar studies and are clinically meaningful3,10,11,12,19,20.

The results of Pearson's coefficient for skewness24 showed that the classification dataset was initially moderately skewed and the regression dataset was highly skewed, with skewness values of +0.65 and +2.42, respectively. After applying the Synthetic Minority Oversampling Technique (SMOTE)25,26, the skewness of the classification dataset reached +0.08, indicating a fairly symmetrical distribution. Likewise, after applying Synthetic Minority Over-Sampling with Gaussian Noise (SMOGN)27, the skewness of the regression dataset reached +1.00, indicating moderate skewness, since the value lies between ±0.5 and ±1.

Proposed classifiers results

The classification models’ results for each class in holdout and five-fold cross-validation sampling strategies are presented in Tables 1 and S2 in Supplementary, respectively. The AUC diagrams28,29 and confusion matrices30,31 of all the models in holdout sampling strategy are presented in Figs. 1 and 2, respectively. Also, in five-fold cross-validation sampling strategy, the AUC diagrams and confusion matrices of all folds of DNN are presented in Figs. S1 and S2 in the Supplementary, respectively.

Table 1 The proposed models’ performance for the classification approach in holdout strategy.
Figure 1
figure 1

AUC diagrams of the proposed models in holdout strategy. In each class, different models have different AUC values. RF has the highest average AUC.

Figure 2
figure 2

Confusion matrices of the proposed models in holdout strategy show that DNN is the best model, with the lowest error rate.

As seen in Tables 1 and S2 in the Supplementary, DNN32,33 achieved the highest average accuracy, F1-score, specificity, and sensitivity34,35 in both the holdout and five-fold cross-validation strategies. RF36 achieved the highest average AUC and was our second-best model, with performance very close to DNN under both sampling strategies.

Using the SHAP library37, the most important features in the best model (DNN) were extracted and are shown in Fig. S3 in the Supplementary. The weights of the nine most important features on the output of the DNN model are provided in Fig. 3, which is plotted using SHAP. As shown in Fig. 3, "age at diagnosis" impacts our model's output the most; it is the most important feature, with the highest SHAP value, in predicting the survival of glioblastoma patients. The 12th, 17th, 63rd, and 81st trees of our RF model, with depth four, were randomly picked to show the predicted classes. The 12th tree is shown in Fig. 4, and the other trees are shown in Figs. S4–S6 in the Supplementary, respectively. Interestingly, in two of the four trees, the root is age at diagnosis, the feature with the highest SHAP value. The roots of the other two randomly selected trees of the RF model are chemotherapy recode (second-highest SHAP value) and year of diagnosis (fourth-highest SHAP value), respectively.

Figure 3
figure 3

Effect of nine important features on the DNN model’s output. In this plot, each point indicates a row of the dataset. Blue indicates a lower value and red a higher value of a feature. The distribution of the blue and red points generally indicates the directional impact of the features. For instance, a higher value of age at diagnosis has a lower SHAP value (i.e., a negative contribution), and a lower value of age has a higher SHAP value (i.e., a positive contribution) on the prediction value (i.e., number of survival months).

Figure 4
figure 4

12th tree of the RF model. This tree is one of the four drawn trees; its root is age at diagnosis, the most important feature according to SHAP.

Proposed regressors results

Along with the classifiers, regressors were also developed to evaluate the ML and DL models more comprehensively. Mean Squared Error (MSE), Root Mean Square Error (RMSE), and the coefficient of determination (R2) were calculated to evaluate the performance of our regression models38,39,40. The results of the holdout strategy are shown in Table 2, and the results of the five-fold cross-validation strategy are presented in Table S3 in the Supplementary. As seen in both tables, DNN achieved the best performance among the six proposed models on all criteria. The performance of RF is also promising and very close to DNN.

Table 2 The proposed models’ performance for the regression approach in holdout strategy.

Discussion

In this study, five ML models and one DL model were developed to predict the survival of glioblastoma patients using regression and classification approaches. To the best of our knowledge, this is the first SEER-based study that uses DL and multiclass ML models to predict the survival of glioblastoma patients. To make the survival intervals more meaningful for clinicians41, instead of binary classification, five classes of survival were selected. Moreover, SHAP was used to interpret the models’ decision-making and identify the most important features.

We used two sampling strategies, holdout and five-fold cross-validation, to evaluate our models' performance. In the classification approach, under both sampling strategies, XGBoost performed best when survival was less than 6 months. For survival beyond 6 months, DNN performed best on the confusion-matrix criteria (accuracy, F1-score, sensitivity, and specificity), whereas RF performed best on the AUC criterion in both holdout and five-fold cross-validation. The concordance index (C-index), a well-known measure for evaluating predictive models in healthcare, estimates the probability of concordance between observed and predicted outcomes42,43. Given that the C-index is almost identical to AUC44, our finding is in line with the results of Mandrekar's study, and the AUC of RF is considered outstanding29. Furthermore, under five-fold cross-validation, DNN showed promising performance in predicting the classes in all five folds based on the confusion-matrix criteria. The insignificant difference between the results obtained by DNN under the two sampling strategies indicates that our best model performs well under different conditions.

SHAP analysis showed that age at diagnosis, chemotherapy recode, and radiation sequence with surgery are the most important features in DNN. The importance and effects of these three features on the survival of glioblastoma patients have already been emphasized in different clinical studies45,46,47,48, which is in line with our findings. For example, our SHAP diagram shows that a higher age at diagnosis contributes negatively to survival, while a lower age contributes positively to the number of survival months. This agrees with previous studies reporting that the hazard ratio of death in patients with glioblastoma increases with age49,50. Moreover, using interpretable methods such as SHAP enables us to explain and comprehend the model's performance and decisions much better. Furthermore, such methods help ML models produce more comprehensible results than statistical models51. In half of the drawn DTs from the RF model, the root of the tree was age at diagnosis, further supporting the importance of this feature in predicting glioblastoma patients’ survival. Similarly, in the regression approach, DNN achieved the best results on all criteria, followed closely by RF. The regressors predicted the exact number of months patients would live, which can help to plan the treatment of patients precisely.

Table 3 compares the best classification results of two other studies with ours. As shown in the table, although the survival problem was treated as a non-binary problem, our best classification model outperformed the binary approaches taken by the other studies and achieved an excellent AUC according to Mandrekar29.

Table 3 Comparison of the performance of this study and previous studies in classification approach.

It's important to acknowledge the limitations of this study, as they impact the interpretation of results. In particular, the absence of comprehensive treatment details, such as the sequencing of treatment procedures, additional comorbid conditions like diabetes52, and even genetic profiles of the patients represents constraints. Incorporating such information, obtainable from hospital records or insurance databases, could enhance the dataset, leading to more accurate survival predictions and a more thorough evaluation of the models' predictive capabilities. Moreover, the current models lack external validation, as an independent dataset for validation purposes was unavailable. To address this limitation, future endeavors aim to collect an external dataset similar to the current one. The validation of the best models against this new dataset would support the robustness and reliability of the study, demonstrating the generalizability of the models to different data contexts.

In conclusion, despite these limitations, the DNN model exhibited exceptional performance in both classification and regression approaches in two different sampling strategies, positioning it as a valuable auxiliary tool in clinical practice. The application of this tool could empower clinicians in devising tailored treatment plans for glioblastoma patients, optimizing resource allocation, time management, and ultimately alleviating the challenges and anxieties faced by patients.

Materials and methods

Dataset

SEER is a source of cancer incidence and survival information that collects cancer data from 18 states in the US53. This database covers approximately 48.0 percent of the US population and is one of the largest and most comprehensive databases of cancer patients in the US54. It provides anonymized data on patient demographics, primary tumor site, tumor morphology, first course of treatment, stage at diagnosis, and vital status. The data collected by the SEER program are accurate and complete across all cancers and all regions; the program employs a continuous quality control and improvement process that minimizes and corrects errors and ensures the data’s high quality55. The glioblastoma patients’ data between 2007 and 2016 were extracted from SEER for this study.

Data preprocessing

To prepare the dataset, with the help of clinical researchers, 48 features of the initial dataset were reviewed. Features unrelated to glioblastoma patients' survival were removed by the clinical researchers, and those that could influence the prediction of survival were retained. Afterwards, since the missing data were missing completely at random, records with more than 30% missing values were removed; based on previous studies56,57, this threshold does not introduce bias. Finally, patients who died of causes other than glioblastoma and those with unknown survival time were excluded. Our final dataset included 19,564 samples and 17 numerical and categorical features. The categorical features were converted to numerical values with the help of clinicians. The retained features are shown in Table S4 in the Supplementary.

OLS is a common technique used in linear regression models. In this study, it was used to assess the significance of the relationship between each feature and the outcome. OLS models the relationship between a dependent variable and one or more independent quantitative features and reveals statistically significant associations22,23. We set the statistical significance level (p-value) at 0.05 for this test.
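As an illustration, the per-feature significance check can be sketched with a univariable least-squares fit on synthetic data. The feature names and effect sizes below are hypothetical stand-ins, not the actual SEER columns:

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)
n = 500

# Hypothetical stand-ins for SEER features (not the real SEER columns).
age = rng.normal(60, 12, n)                  # continuous feature
sex = rng.integers(0, 2, n).astype(float)    # binary feature
# Simulated survival months: driven by age, independent of sex by design.
survival = 40 - 0.3 * age + rng.normal(0, 5, n)

p_values = {}
for name, feature in [("age", age), ("sex", sex)]:
    # linregress returns the two-sided p-value for the slope's t-test,
    # which matches the OLS coefficient test in the univariable case.
    res = stats.linregress(feature, survival)
    p_values[name] = res.pvalue
    flag = "significant" if res.pvalue < 0.05 else "not significant"
    print(f"{name}: p = {res.pvalue:.4f} ({flag} at the 0.05 level)")
```

In the study itself, non-significant features (sex, marital status) were kept anyway for clinical reasons; a significance screen like this informs, but does not dictate, feature selection.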

The target variable, the patients' survival, is defined as the number of months from diagnosis to death event. For the classification approaches, five clinically relevant classes as Class 0 (≤ 6 months), Class 1 (7–12 months), Class 2 (13–18 months), Class 3 (19–24 months), and Class 4 (≥ 25 months) were considered. For regression, the number of months that patients survived was used as the target variable.
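The five-class binning described above can be sketched with `pandas.cut`; the survival times below are hypothetical examples, not study data:

```python
import pandas as pd

# Hypothetical survival times in months for six patients.
months = pd.Series([3, 7, 12, 15, 24, 40])

# The study's five clinically relevant classes:
# <=6, 7-12, 13-18, 19-24, >=25 months.
bins = [0, 6, 12, 18, 24, float("inf")]
survival_class = pd.cut(months, bins=bins, labels=[0, 1, 2, 3, 4]).astype(int)
print(survival_class.tolist())  # -> [0, 1, 1, 2, 3, 4]
```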

We utilized Min–Max normalization from Python's Sklearn library to standardize the data scale and map values into the range [0, 1] according to Eq. (1), where \({X}_{max}\) and \({X}_{min}\) denote the maximum and minimum data values, respectively.

$${X}_{normalize}=\frac{X-{X}_{min}}{{X}_{max}-{X}_{min}}$$
(1)
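A minimal sketch of Eq. (1) using Sklearn's `MinMaxScaler`; the toy values are illustrative, not study data:

```python
import numpy as np
from sklearn.preprocessing import MinMaxScaler

# Toy feature column (e.g., a hypothetical "age at diagnosis").
X = np.array([[20.0], [50.0], [80.0]])

scaler = MinMaxScaler()           # implements Eq. (1) per feature
X_norm = scaler.fit_transform(X)
print(X_norm.ravel())             # -> [0.  0.5 1. ]

# Equivalent manual form of Eq. (1):
manual = (X - X.min()) / (X.max() - X.min())
assert np.allclose(X_norm, manual)
```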

Feature importance

The interpretability of ML models is very important for understanding and trusting their decision-making process. Knowing which features have the most significant impact on the model makes it easier for clinicians to interpret the model's performance and the patterns in the data; as a result, medical decisions and patients' treatment processes are improved58. As in other medical applications, this can be useful in predicting patients' survival and can help clinicians make better decisions by understanding how the model works59,60. Therefore, to identify each feature's impact on our models' decision-making, we used the SHAP library in Python 3.737.

Data imbalance

One of the inescapable challenges of medical datasets is data skewness, because it may make the sampling of the target feature non-uniform and reduce the generalizability of the model. We used Pearson's second skewness coefficient to detect the datasets' skewness and to show the symmetry of the distributions, computed according to Eq. (2), where \(\overline{x}\), \(m\), and \(s\) denote the mean, the median, and the standard deviation of the dataset, respectively24. To handle the problem of data skewness, we used SMOTE25,26 and SMOGN27 to balance the dataset in the classification and regression approaches, respectively. Eventually, the final dataset for classification had 46,340 cases, and the final dataset for regression had 28,573 cases. The distribution of the classification and regression datasets before and after balancing is shown in Fig. S7 in the Supplementary.

$$Skewness=\frac{3\times \left(\overline{x}-m\right)}{s}$$
(2)
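Eq. (2) can be implemented directly; the toy sample below is right-skewed by construction (a long tail of large survival times) and yields a positive coefficient:

```python
import numpy as np

def pearson_skewness(x):
    """Pearson's second skewness coefficient: 3 * (mean - median) / std."""
    x = np.asarray(x, dtype=float)
    return 3.0 * (x.mean() - np.median(x)) / x.std()

# Toy right-skewed sample (not study data).
sample = np.array([1, 2, 2, 3, 3, 3, 4, 5, 9, 20], dtype=float)
print(round(pearson_skewness(sample), 2))  # -> 1.23 (right-skewed)
```

Values near 0 indicate a fairly symmetrical distribution, between ±0.5 and ±1 moderate skewness, and beyond ±1 high skewness, which is the scale used to interpret the +0.65, +2.42, +0.08, and +1.00 values reported in the Results.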

The steps of the preparation of the datasets used in this study are illustrated in Fig. 5.

Figure 5
figure 5

Diagram of the steps of data sets’ preparation.

Predictive models development

We developed five ML models, including XGBoost, AdaBoost, DT, KNN, and RF36, chosen for their different methodologies, and a DNN model for both regression and classification to predict the survival of glioblastoma patients32,33. The structure of our DNN model for classification is represented in Fig. S8 and for regression in Fig. S9 in the Supplementary. To treat this problem as a multiclass (rather than binary) prediction problem, we considered five classes of survival months and predicted them using the classification models. Moreover, using regression, we predicted the survival of glioblastoma patients more precisely by predicting the number of months the patients would survive.

In this study, we used 32 GB RAM, Intel Xeon E5-2650 CPU, and 4 GB GPU Nvidia GTX1650, to implement our proposed models. We represented the ranges of the hyper-parameters for classification and regression models in Table S5 in the Supplementary. We obtained the optimal hyper-parameters using the GridSearchCV method and provided the hyper-parameters of both approaches in Table S6 in the Supplementary. As a result of using this method, we achieved optimal performance for both approaches.
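The hyper-parameter search can be sketched with `GridSearchCV` on a synthetic dataset; the parameter grid below is a hypothetical example, not the study's actual ranges from Table S5:

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import GridSearchCV

X, y = make_classification(n_samples=300, n_features=10, random_state=0)

# Hypothetical search space; the study's real ranges are in Table S5.
param_grid = {"n_estimators": [50, 100], "max_depth": [4, 8]}

search = GridSearchCV(
    RandomForestClassifier(random_state=0),
    param_grid,
    cv=5,                 # cross-validated scoring inside the search
    scoring="accuracy",
)
search.fit(X, y)
print(search.best_params_)   # best combination found on this toy data
```

`GridSearchCV` exhaustively evaluates every combination in the grid and refits the estimator with the best one, which is the procedure the paragraph above describes.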

Data sampling strategy

In this study, the holdout split strategy was implemented, dedicating 80% of the data to training and reserving 20% for testing for both the classification and regression models. Moreover, five-fold cross-validation was employed to evaluate the performance of all the developed models in both classification and regression; in each iteration, 80% and 20% of the dataset were apportioned to training and testing, respectively.
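The two sampling strategies can be sketched as follows (synthetic data; the 80/20 proportions match the setup above):

```python
import numpy as np
from sklearn.model_selection import KFold, train_test_split

X = np.arange(100).reshape(-1, 1)
y = np.arange(100)

# Holdout: a single 80% train / 20% test split.
X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.2, random_state=0)
print(len(X_tr), len(X_te))  # -> 80 20

# Five-fold cross-validation: each fold also yields an 80/20 split,
# and every sample appears in the test set exactly once across folds.
kf = KFold(n_splits=5, shuffle=True, random_state=0)
for train_idx, test_idx in kf.split(X):
    assert len(train_idx) == 80 and len(test_idx) == 20
```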

Predictive models evaluation

One common evaluation criterion in ML models is the confusion matrix. It visualizes the performance of classification models in an \(n\times n\) matrix where \(n\) refers to the number of classes30,31. In addition to confusion matrices, to evaluate the ML and DL models in the classification approach, we used five criteria including accuracy, F1-score, specificity, sensitivity or recall34,35, and AUC28. These performance metrics are introduced in Eqs. (3–6). The AUC values of the models are expressed according to the study by Mandrekar29.

$$Accuracy=\frac{TP+TN}{TP+FP+FN+TN}$$
(3)
$$F1-score=\frac{2\times Precision\times Recall}{Precision+Recall}$$
(4)
$$Specificity=\frac{TN}{TN+FP}$$
(5)
$$Sensitivity\,(Recall)=\frac{TP}{TP+FN}$$
(6)

In Eqs. (3–6), TP, TN, FP, and FN denote True Positive, True Negative, False Positive, and False Negative, respectively.
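Eqs. (3–6) can be verified on a toy binary confusion matrix (the labels below are illustrative, not study predictions):

```python
import numpy as np
from sklearn.metrics import confusion_matrix

y_true = np.array([0, 0, 1, 1, 1, 0, 1, 0])
y_pred = np.array([0, 1, 1, 1, 0, 0, 1, 0])

# sklearn's binary confusion matrix flattens to TN, FP, FN, TP.
tn, fp, fn, tp = confusion_matrix(y_true, y_pred).ravel()

accuracy    = (tp + tn) / (tp + fp + fn + tn)                 # Eq. (3)
precision   = tp / (tp + fp)
sensitivity = tp / (tp + fn)                                   # Eq. (6)
f1 = 2 * precision * sensitivity / (precision + sensitivity)   # Eq. (4)
specificity = tn / (tn + fp)                                   # Eq. (5)

print(accuracy, sensitivity, specificity, f1)  # -> 0.75 0.75 0.75 0.75
```

For the five-class setting used in this study, these metrics are computed per class in a one-vs-rest fashion and then averaged.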

Three evaluation criteria, including MSE, RMSE, and R2, were also used to evaluate the regression models38,39,40. These metrics are explained in Eqs. (7–9).

$$Mean\,Squared\,Error\left(MSE\right)=\frac{1}{n}\times {\sum }_{i=1}^{n}{\left({Y}_{i}-{\widehat{Y}}_{i}\right)}^{2}$$
(7)
$$Root\,Mean\,Square\,Error\left(RMSE\right)=\sqrt{\frac{1}{n}\times {\sum }_{i=1}^{n}{\left({Y}_{i}-{\widehat{Y}}_{i}\right)}^{2}}$$
(8)
$${R}^{2}=\frac{TSS-RSS}{TSS}$$
(9)

where \(n\) means the number of samples, \(i\) is the i-th sample, \({Y}_{i}\) denotes the actual target value for the sample \(i\), \({\widehat{Y}}_{i}\) shows the predicted target value for the sample \(i\), and \(TSS\) denotes the total sum of squares and \(RSS\) refers to the residual sum of squares.
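Eqs. (7–9) can be computed directly with NumPy; the values below are toy examples, not study results:

```python
import numpy as np

y_true = np.array([10.0, 20.0, 30.0, 40.0])   # actual survival months
y_pred = np.array([12.0, 18.0, 33.0, 39.0])   # predicted survival months

mse  = np.mean((y_true - y_pred) ** 2)         # Eq. (7)
rmse = np.sqrt(mse)                            # Eq. (8)
tss  = np.sum((y_true - y_true.mean()) ** 2)   # total sum of squares
rss  = np.sum((y_true - y_pred) ** 2)          # residual sum of squares
r2   = (tss - rss) / tss                       # Eq. (9)

print(round(mse, 2), round(rmse, 2), round(r2, 3))  # -> 4.5 2.12 0.964
```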