Machine learning-based prediction models for parathyroid carcinoma using pre-surgery cognitive function and clinical features

Wang, Yuting; Wei, Bojun; Zhao, Teng; Shen, Hong; Liu, Xing; Wang, Jiacheng; Wang, Qian; Shen, Rongfang; Feng, Dalin

doi:10.1038/s41598-023-46294-7

Download PDF

Article
Open access
Published: 03 November 2023

Machine learning-based prediction models for parathyroid carcinoma using pre-surgery cognitive function and clinical features

Yuting Wang¹^na1,
Bojun Wei¹,
Teng Zhao¹^na1,
Hong Shen¹,
Xing Liu¹,
Jiacheng Wang¹,
Qian Wang¹,
Rongfang Shen¹ &
…
Dalin Feng¹

Scientific Reports volume 13, Article number: 19007 (2023) Cite this article

571 Accesses
Metrics details

Subjects

Abstract

Patients with parathyroid carcinoma (PC) are often diagnosed postoperatively, due to incomplete resection during the initial surgery, resulting in poor outcomes. The aim of our study was to investigate the pre-surgery indicators of PC and try to develop a predictive model for PC utilizing machine learning. Evaluation of pre-surgery neuropsychological function and confirmation of pathology were carried out in 133 patients with primary hyperparathyroidism in Beijing Chaoyang Hospital from December 2019 to January 2023. Patients were randomly divided into a training cohort (n = 93) and a validating cohort (n = 40). Analysis of the clinical dataset, two machine learning including the extreme gradient boosting (XGBoost) and the least absolute shrinkage and selection operator (LASSO) regression were utilized to develop the prediction model for PC. Logistic regression analysis was also conducted for comparison. Significant differences in elevated parathyroid hormone and decreased serum phosphorus in PC compared to (BP). The lower score of MMSE and MOCA was observed in PC and a cutoff of MMSE < 24 was the optimal threshold to stratify PC from BP (area under the curve AUC 0.699 vs 0.625). The predicted probability of PC by machine learning was similar to the observed probability in the test set, whereas the logistic model tended to overpredict the possibility of PC. The XGBoost model attained a higher AUC than the logistic algorithms and LASSO models. (0.835 vs 0.683 vs 0.607). Preoperative cognitive function may be a probable predictor for PC. The cognitive function-based prediction model based on the XGBoost algorithm outperformed LASSO and logistic regression, providing valuable preoperative assistance to surgeons in clinical decision-making for patients suspected PC.

Development and validation of a new algorithm for improved cardiovascular risk prediction

Article Open access 18 April 2024

Delirium

Article 12 November 2020

Gut microbiome predicts cognitive function and depressive symptoms in late life

Article Open access 25 April 2024

Introduction

Parathyroid carcinoma (PC) is a rare malignant tumor that accounts for 0.5–5% of patients with primary hyperparathyroidism(PHPT) and only 0.005% of all cancers^1,2,3. In European and Asia countries, the mean incidence of PC has increased similarly over time, which might be attributed to the rise in PC diagnoses brought on by the prevalence of parathyroid diseases and the growing rate of PHPT suffering parathyroidectomy^4,5.

In distinction to local excision of parathyroid adenoma (PA), en bloc resection as a treatment for parathyroid cancer, particularly during the initial surgical treatment has a critical impact on patient prognosis, which emphasizes the significance of preoperative diagnosis^6,7,8. However, diagnosing PC before surgery is difficult, mainly because there are no definitive preoperative markers for PC. Due to similar clinical manifestations, PC is often misdiagnosed preoperatively and treated as a benign parathyroid disease (BP). The histological definition of WHO criteria for PC required an infiltrative growth pattern or metastasis⁹. Preoperative fine needle aspiration (FNA) and intraoperative biopsy are insufficient to diagnose if definitive histopathological criteria of invasion is absent in some PC specimens^10,11. Additionally, patients with FNA increase the risk of tumor cell seeding along the needle tract. The presurgical prediction for PC is still challenging.

An increasing number of PHPT patients present primarily with neuropsychological symptoms, such as cognitive deficits, anxiety, and poor concentration, rather than skeletal and renal complications like osteoporosis and nephrolithiasis resulting from long-term hypercalcemia, highlighting the value of assessing neuropsychological manifestations^12,13. Moreover, recent studies have demonstrated that patients with non-central nervous system tumors frequently suffer cognitive impairment even before undergoing treatments associated with toxicity, including chemotherapy, immunotherapies, and radiation^14,15. However, due to the rarity of the disease, only a few studies have evaluated cognitive impairment in PHPT by examining a limited number of cognitive domains^16,17,18. Additionally, there is a paucity of research assessing cognitive function specifically in patients with PC, further limiting the investigation of the relevant variables affecting preoperative cognitive decline and hampering comparisons between malignant and benign parathyroid diseases.

Machine learning (ML) is developed from the study of pattern recognition and computational learning to minimize errors between predicted and tested sets. Extreme gradient boosting (XGBoost) can distribute the gradient boosting library and imply ML algorithms under the Gradient Boosting framework. Interestingly, XGBoost has been successfully applied to diagnose and predict the prognosis of cancers, such as lung cancer, hepatocellular cancer, breast cancer, and lung metastases from thyroid cancer^19,20,21,22. The Least absolute shrinkage and selection operator (LASSO) can minimize the residual sum of squares and select the variables most related to the disease than traditional regression²³. Currently, LASSO has been used for the development of disease prediction and risk model^24,25,26,27.

In the present study, we sought to explore the potential predictive indicators for PC by examining the preoperative cognition functions as well as several serum biomarkers of parathyroid diseases and determining whether they have a relevant association with cognitive function. Besides, this study established prediction models for PC based on the XGBoost algorithm and LASSO regression. To the best of our knowledge, the use of XGBoost and LASSO in the prediction of PC has not ever been reported.

Methods

Participants

This study was approved by the Ethics Committee of Beijing Chaoyang Hospital, China. Informed consent was obtained from all patients participating in this research. All methods were carried out in accordance with the applicable guidelines and regulations. A total of 136 patients were consecutively diagnosed with PHPT based on biochemical criteria (serum calcium > 2.52 mmol/L and PTH > 88 pg/mL) in Beijing Chaoyang Hospital, China, from December 2019 to January 2023. Except for two patients with a history of cerebrovascular disease who were on long-term treatment and one patient who opted for clinical observation instead of surgery, 133 patients performed surgical treatment and were finally enrolled (Fig. 1). The age range of the PHPT patients was between 14 and 70 years. Exclusion criteria include patients with cerebrovascular disorders, dementia, previous head injury, severe cardiovascular diseases, and other malignant neoplasms.

Data of pre-operation clinical features including the history of osteoporosis, fracture, renal stone, and hypercalcemia-related symptoms, laboratory findings (serum level of parathyroid hormone, total calcium, phosphorous, alkaline phosphatase, creatinine, 25-hydroxy vitamin D and 24-h urinary calcium), psychological and neurocognitive function were collected. Between December 2019 and December 2022, 133 PHPT patients were evaluated neuropsychologically within one week before surgery by the same two physicians.

Diagnostic criteria and follow-up

Pathological diagnoses were made according to WHO criteria by experienced pathology physicians in Beijing Chaoyang Hospital. The BP group consisted of ninety-six patients with parathyroid adenoma (PA), three patients with parathyroid hyperplasia, and two patients with parathyroid cysts, respectively. In our institution, among the PC group, initial En-bloc resection was performed on twenty-three patients (71.88%). The remaining nine PC patients (28.12%) underwent reoperation following an unsuccessful initial operation performed elsewhere. The dataset was randomly split into a training cohort (70%) and a test cohort (30%) without a difference in the baseline (Table 1).

Table 1 Baseline characteristics of PHPT patients in the training set and validation set.

Full size table

The median follow-up time for the patients was 13.5 months. Among them, 6 patients with PC had distant metastases, with all 6 patients having lung metastases and 2 patients suspected of having bone metastases. None of the patients had definitive recurrence or death.

Neurocognitive and neuropsychological function assessment

Neurocognitive Assessment. Mini-Mental State Examination (MMSE) was used in objective measures of cognitive function including thirty items categorized into seven groups (Orientation to time and place; Registration; Attention and Calculation; Recall; Language; Visual Construction)^28,29. The total score is thirty. Mild cognitive impairment (MCI) is defined as a range of 18–24, and severe cognitive impairment as scoring 17 or less. Montreal Cognitive Assessment (MoCA) with a 30-point test including a short-term memory recall task, visuospatial and executive function, language, naming, attention and calculation, abstraction, and orientation was used. To rectify the education impact, 1 point was added for participants with 12 years of education or less on their total score of MOCA (if the score < 30). Scores of 25 or below indicate cognitive impairment^30,31.

Psychological Instrument. Hamilton Depression Rating Scale (HAMD-17) was used to evaluate depression, which contains 17 elements scored from 0(never) to 4(severe). The severity ranges for the score of HAMD are as follows: no depression (0–7); mild depression (8–16); moderate depression (17–23); and severe depression (≥ 24)^32,33. Hamilton Anxiety Rating Scale (HARS) was used to assess anxiety. It consists of 14 symptom-defined variables divided into somatic and psychogenic anxiety. Each item is scored from 0 (not present) to 4 (severe): > 8 is considered mild anxiety; 14–56 is considered moderate–severe^34,35. MAES consisting of 14 items was used to measure the emotional, behavioral, and cognitive aspects of apathy. Questions are rated on a scale from 0(a lot) to 3 (not at all). It defined apathy as having a score of ≥ 14^36,37,38.

Statistical methods

Demographic, clinical, laboratory, histological data, and cognition function were characterized by descriptive statistical methods. As the clinical data of PHPT patients were not normally distributed, the Mann–Whitney U test was used to explore the characteristic variates for their potential of differentiation between the PC and benign groups. The statistical power of the nonparametric test was 0.80, while the alfa error was 0.05, and the sample size was 88 in group 1 and 28 in group 2. The final enrollment in our study was 91 in the benign group and 32 in the malignant group which matched the required sample size. We defined the sex variable as 1 for females and 2 for males. The chi-square test was used between groups (alfa error = 0.05, power = 0.8, minimal sample size = 88). A p-value < 0.05 was significant with two-sided. The statistical analyses were conducted using SPSS version 26. 0(IBM Corp. Released 2019. IBM SPSS Statistics for Windows, Version 26.0. Armonk, NY: IBM Corp).

Model development and model performance evaluation

First, the logistic regression model was used to develop a prediction model including variates with a p-value less than 0.05 in the univariate analysis. The anticipated probability of PC computed from the best fitting model was chosen as the prediction criterion. Backward stepwise was conducted to identify significant predictors (p < 0.05). The dependent y only takes 0 and 1 as dichotomous variables. ${\text{P}} = {\text{P}}\left( {{\text{y}} = {1}|{\text{x}}_{{1}} , \ldots ,{\text{x}}_{{\text{n}}} } \right)$ is affected by N factors. The formula of P can be obtained like this:

$$ {\text{P}} = \frac{1}{{1 + e^{{ - \left( {\beta 0 + \beta 1x1 + \cdots + \beta nxn} \right)}} }} $$

Secondly, XGBoost is a scalable tree boosting system based on gradient lifting decision trees for classification and regression predictive model, which avoids overfitting by adding regularization terms, using shrinkage scales for added weights, and using column subsampling. this algorithm improves prediction accuracy by working on the principle of optimizing functions. The XGBoost algorithm uses N additive functions to predict output in a tree ensemble model. Each regression tree involves a continuous score on each leaf when T is the number of leaves in the tree and each $f\left(n\right)$ has an independent structure as well as leaf weight. We can measure the difference between the prediction to the target based on this tree. Meanwhile, this model also presents the regression tree³⁹.

We also used the LASSO regression to choose the most significant variables. The LASSO regression shrunk the coefficients by imposing a penalty term, named lambda (λ), which is selected by visualization methods and cross-validation. Based on the optimal value of λ, we calculate the coefficients and build the LASSO model²³.

Receiver operating characteristic (ROC) analysis and measured area under the curve (AUC) were used to compare the efficacy of the predictive model corresponding to pathology. Hosmer–Lemeshow test measured calibration by p-value. we evaluated the predictive effect by running this model in the validating cohort. Both the XGBoost algorithm and the LASSO regression were performed with Python version 3.10.

We used the median value of the BP or PC group as supplementation for missing values. There was one PC patient with a missing value of ALP which we supplemented to 109 IU/L and another PC with a deficient value of 24-h urinary calcium which we handled to 5.96 mmol/24H in the training set.

Ethics approval and consent to participate

This study was approved by the Medical Ethics Committee of Beijing Chao-Yang Hospital, Capital Medical University. Informed consent was obtained from all participants and/or their legal guardians.

Result

Clinical characteristics between the training set and the validating set

Among the 93 patients in the training cohort, 23 were male and 70 were female. The mean age of patients was 51 ± 1 years. Of the 40 patients in the validating cohort, 13 were male and 27 were female. The mean age of patients was 51 ± 2 years. There were no differences in sex, age, education, biochemical tests, pre-surgery cognition, psychology, and tumor diameter between the training cohort and the validating cohort (Table 1).

Comparison of preoperative demography and clinical characteristics between the PC group and BP group

The description of the demography and relevant clinical characteristics were summarized (Table 2).

Table 2 Demography and clinical characteristics in patients with PC and BP.

Full size table

It was shown that the BP group marked a definite female preponderance. Preoperative PTH was significantly higher and serum phosphorus was lower in those with PC, while no significant difference was noticed between the two groups in other aspects.

Based on the scores of instruments, neither the PC group nor the BP group reported severe depression, anxiety as well as apathy, and there were no differences in mood (p = 0.65, p = 0.271, p = 0.243). The scores of cognitive function extent to be normal on mean in both PC and BP groups. MMSE and MOCA were significant in discriminating PC from BP (p = 0.004, p = 0.013), obtained through the Mann–Whitney test. The AUC of MMSE was greater than MOCA (MMSE 0.721vs MOCA 0.646), which confirmed the superiority of MMSE to MoCA in detecting PC. Patients assessed cognitive function and psychological changes, as exposed in Fig. 2.

Logistic regression analysis of the prediction model for PC

The logistic regression model with backward stepwise is shown in Table 3. The sex of patients was defined as a dummy variable and assigned values at analysis, putting female in 1 and male in 2. Preoperative PTH was positively correlated with the prediction model of PC. In contrast, the score of MMSE and the sex were inversely associated with the prediction model of PC. The final equation developed by the logistic regression model to predict PC was as follows: P = $\frac{1}{1{+e}^{-\left(9.109+0.002\times PTH-0.367\times MMSE-1.847\times sex\right)}}$, the closer the value of p is to 1, the higher the probability of parathyroid carcinoma in PHPT.

Table 3 Multivariate logistic regression analysis of prediction.

Full size table

XGBoost model of the prediction model for PC

The hyperparameters were selected by cross-validation and grid searches in the XGBoost model, by inputting the sex, all the laboratory test results, and scores of MMSE and MOCA of all the PHPT patients, and the top eight indicators for important features were finally determined by incorporating them into the algorithm model with data of the training set (sex, MMSE, PTH, alkaline phosphatase, calcium, 24-h urinary calcium, 25-hydroxy vitamin D and phosphorous, Scores of important features were shown in Fig. 3A). The value of features in the model for improving decision tree development is used to determine the importance of a feature. An attribute's relative value is increased if it influences split point improvement (the closer it is to the root node) or is chosen by more boosting trees. According to the decision tree structures of the XGBoost model, the predictive values for PC can be calculated and normalized to range from 0–1. The first tree structure was shown in Fig. 3B. if the score of MMSE in patients with PHPT was < 24.5 and the ALP was < 144 (U/L), the probability of PC was $1/[1+\mathrm{exp}(-\mathrm{leaf})$] = $1/[1+exp(-0.360)$] = 0.589.

LASSO model of the prediction model for PC

Sex and clinical indicators, including laboratory results and neurocognitive assessment, were subjected to LASSO regression. We utilized cross-validation to ensure the optimal penalty parameter lambda (λ) at the minimum mean squared error value (Fig. 4B). Log(λ) = − 2.171463896 (λ = 0.006738079091822886) minimized the regression coefficient (Fig. 4A) while 6 variables remained in further regression (sex, MMSE, PTH, calcium, 25-hydroxy vitamin D and phosphorous, Fig. 4C).

Performance in prediction model of PC among XGBoost, LASSO, and logistic regression

We used the same training group and validation group, the AUC of LASSO regression in both the training and validation sets were lower than that of XGBoost and logistic regression. (Fig. 5A,B) The AUC of the XGBoost model in the training set was 0.861(95%CI 0.792–0.884), which is similar to the AUC of logistic regression (0.832, 95%CI 0.738–0.927, shown in Fig. 5A). As seen in Fig. 5B, the AUC of the logistic model was 0.6833(95%CI 0.520–0.970), which was lower than the AUC of the XGBoost model. (0.835, 95%CI 0.655–0.870).

The three prediction models of PC differed in terms of sensitivity, specificity, accuracy, false positives, and false negatives (shown in Table 4). The XGBoost model, in particular, had 2 false negatives compared to 5 false negatives for the logistic model and 6 false negatives for the LASSO regression in the validation group. The AUC was significant for each algorithm (p < 0.05). In the XGBoost model, the optimal threshold (0–1) was similar between the training group and the validation group, with values of 0.455 and 0.456. In the training set, the cut-off value was 0.772 for LASSO and 0.731 for logistic regression, while in the validation set it was 0.807 for LASSO and 0.623 for logistic regression. The multiple logistic and linear regressions with both 6 variables of LASSO regression and 8 variables of the XGBoost model did not observe significant differences in predictive accuracy, sensitivity, and specificity (Table 2 in supplementary). The calibration of the XGBoost model was 1.957 (p < 0.05) according to the Hosmer–Lemeshow test, which was higher than the logistic model calibration (p-value: 0.465).

Table 4 Performance in XGBoost, LASSO regression, and logistic regression of prediction model for PC.

Full size table

Discussion

Cognitive decline is common among patients with PHPT, which is characterized by elevated PTH and serum calcium. Several reports have identified that patients with PHPT appear to have an increased incidence of cognitive dysfunction^40,41,42. According to a current systematic review, cognitive impairment in PHPT is more likely to be associated with elevated PTH levels rather than hypercalcemia¹³. However, the mechanism for inducing the impairment of cognition remains to be studied. The details of these relationships between cognitive impairment and serum biomarkers, such as PTH and serum calcium, merit further investigation. Despite the conclusion of the current 5th International Workshop that cognitive evaluation for patients with PHPT is not a necessary test⁴³. The cognitive function assessment in patients with parathyroid cancer, who may have cancer-related cognitive impairment, may offer new ideas to distinguish benign parathyroid disease from parathyroid cancer.

High-level PTH may play a role in cognitive dysfunction and cerebrovascular diseases by way of PTH2 receptors (PTHrP) scattered throughout the arteries of the cerebral cortex. PTH2 receptor expression is dominated in limbic, hypothalamic, and sensory areas, particularly hypothalamic periventricular neurons and median eminence nerve terminals^{13,41,44,45,46}. The cerebral area responsible for these functions is the same as the area where PTH receptors are distributed. Therefore, it seems reasonable to speculate that the cognitive decline in patients with PHPT might be proportionally interrelated with PTH level. Bjorkman found that elevated levels of PTH was associated with MMSE in a five-year follow-up in a general-aged population⁴⁴. Unlike these previous reports^41,45,46,47, only a weak link between cognition deficit and elevated PTH level was observed in MMSE (Spearman correlation = − 0.172 p = 0.048 < 0.05) based on our data, while MOCA failed to show a correlation with PTH level (p = 0.474 > 0.05). The reasons for this inconsistency may be as follows. One reason is that we excluded the influence of age, education, depression, and anxiety on cognitive performance by comparing the PC group to the matched control group, which has been neglected in previous research. Another may be that the effect of peripheral cancer on cognitive impairment could not be excluded in parathyroid cancer because patients often experience significant neurocognitive decline, as has been observed in other cancers¹⁴. Based on the physiological perspective, cognitive decline may be associated with the distribution of PTH2 receptors in different pathological states, which requires to be proved by subsequent experiments. The modification of PTH secretion by serum calcium is changed in patients with PHPT. In accordance with previous studies^13,17, we found no link between calcium levels and neurocognitive function (MMSE: p = 0.106 > 0.05; MOCA: p = 0.506 > 0.05). Additionally, a lack of vitamin D could lead to cognitive decline in the older adult⁴⁸. Though the mean concentrations of vitamin D in patients both in PC and BP are lower than normal, we didn’t observe a link between decreased vitamin D and impaired cognition both in MMSE and MOCA (MMSE: p = 0.716 > 0.5; MOCA: p = 0.834 > 0.5). Further, it needs more mechanistic experiments to determine whether these effects are related to the neurocognitive aspects of PC.

By self-reporting neurocognitive symptoms (presenting difficult concentration and memory problems), Daniel Repplinger reported that neurocognitive dysfunction may be used as a predictor of parathyroid hyperplasia⁴⁹. In our study, we proposed the pre-surgery cognitive function as a potential indicator for PC and both MMSE and MOCA could be used as robust tools for assessing the cognition of patients with PC (p < 0.05). In addition, MMSE was superior in detecting cognition in distinguishing patients with PC from PHPT. This is more likely due to MMSE stability of no influence on sex and good internal consistency in measuring the severity of cognitive problems^50,51,52. Those deteriorations of cognitive function in patients with PC are primarily characterized by impaired attention, diminished calculative accuracy, difficulties in extracting acquired information from memory, and scathed visual constructive abilities. (Table 5. attention and calculation p = 0.003; recall p = 0.007; language and visual construction p = 0.03). Notably, a similar phenomenon was reported by Janelsins et al.⁵³ who found that patients with stage I-IIIC breast cancer have significant cognitive impairment before treatment, particularly in the areas of memory, attention, and executive function. Whether a similar phenomenon is observed in other cancers needs further investigation.

Table 5 Distribution of the scores of the MMSE between PC and BP.

Full size table

Based on the above perspective and the study data, we developed three prediction models for PC on the XGBoost algorithm, LASSO regression, and logistic regression by preoperatively taking scores of MMSE and clinical features into account (in Table 4). As far as we are aware, this is the first time that the use of XGBoost and LASSO regression in the prediction of PC has been presented. The sensitivities of the three models were 0.773, 0.727 and 0.682, and their specificities were 0.817, 0.789, and 0.887, respectively. In comparison to the traditional statistical approach, the XGBoost model could learn complex nonlinear decision boundaries through boosting, whereas linear models such as logistic regression may ignore interactive relationships of the multiple indicators in non-linear and perform the suboptimal outcome^54,55,56. In our study, the predictive performance of the XGBoost model, with the lowest false negative rate, was superior to that of the logistic model and LASSO regression model. With a low percentage of underdiagnosis, it would be sensitive to forecast the likelihood of cancer in PHPT avoiding the second surgery. In addition, the XGBoost model can learn the optimal strategy for the best direction to effectively handle missing values in the data by sparsity-aware split-finding. This method enables the XGBoost model to decide on missing data during the training process, which frequently leads to improved model performance. Although the median or mean values from the data are typically used to impute missing values, the many individual differences may still have an impact on the outcomes. In this study, there are currently two PC patients with missing values of ALP and 24-h urinary calcium in the training set. Even after handling missing values, the predictive performance of the XGBoost model remained superior to LASSO and logistic regression. In addition, we processed missing values into the origin validation dataset. Even if the extent of missing values for each variable reached 10%, the predictive performance of the XGBoost remained superior to the Logistic and LASSO regression models, especially in significantly reducing false negative rates (AUC:0.807 vs 0.503 vs 0.513. Table 1 in supplementary). Furthermore, when the extent of missing value increased to 20%, the XGBoost model still outperformed LASSO and Logistic regression with 10% missing data. Thus, whether missing values are present or not, the XGBoost model still demonstrated good predictive performance, indicating its ability to handle missing data effectively while still maintaining high accuracy in prediction. According to three-fold cross-validation, The XGBoost model outperformed Logistic and LASSO regression in terms of accuracy and AUC (accuracy of the model: 0.842 vs 0.773 vs 0.800; AUC: 0.851 vs 0.723 vs 0.666. Table 3 in supplementary) The three-fold cross-validation method emphasized the consistency of the XGBoost model’s performance, further supporting the validity of our findings. This research may offer a reasonably accurate and convenient tool for predicting PC.

Our study has serval limitations. First, the neurocognitive psychological evaluations were subjective assessments that might be influenced by individuals. More PHPT patients need to be included to validate the predictive model. Second, this is considered a preliminary study due to a single-center study with an inevitably small sample size which may affect the generalizability of the findings. Future studies with a larger number of participants in multi-center are required. Third, the model was developed based on our internal verification in the Chinses population, consequently unknowing in other populations. Furthermore, multiple populations need to be used to validate the prediction models developed by XGBoost.

In conclusion, our research demonstrated that the pre-surgery cognitive function might be a potential predictor for PC in patients with PHPT. MMSE is superior to MOCA in evaluating cognition function in PHPT patients and differing PC from BP. Preoperative cognitive assessment of MMSE is necessary for patients with PHPT suspected of PC. The XGBoost model, which had a better performance than the LASSO and logistic model, could predict PC based on pre-surgery cognitive function and clinical features. The performance of the prediction model for PC based on the XGBoost model needs to be further verified in larger populations of PHPT patients.

Data availability

The datasets analyzed during the current study are not available since we are still collecting more data for further study, but are available from the corresponding author on reasonable request.

References

Rawat, N., Khetan, N., Williams, D. W. & Baxter, J. N. Parathyroid carcinoma. Br. J. Surg. 92, 1345–1353 (2005).
Article CAS PubMed Google Scholar
Wei, C. H. & Harari, A. Parathyroid carcinoma: Update and guidelines for management. Curr. Treat. Options Oncol. 13, 11–23 (2012).
Article PubMed Google Scholar
Lee, P. K., Jarosek, S. L., Virnig, B. A., Evasovich, M. & Tuttle, T. M. Trends in the incidence and treatment of parathyroid cancer in the United States. Cancer 109(9), 1736–1741 (2007).
Article PubMed Google Scholar
Kong, S. H., Kim, J. H., Park, M. Y., Kim, S. W. & Shin, C. S. Epidemiology and prognosis of parathyroid carcinoma: Real-world data using nationwide cohort. J. Cancer Res. Clin. Oncol. 147(10), 3091–3097 (2021).
Article CAS PubMed Google Scholar
Ryhänen, E. M. et al. A nationwide study on parathyroid carcinoma. Acta Oncol. 56(7), 991–1003 (2017).
Article PubMed Google Scholar
Talat, N. & Schulte, K. M. Clinical presentation, staging, and long-term evolution of parathyroid cancer. Ann. Surg. Oncol. 17(8), 2156–2174 (2010).
Article PubMed Google Scholar
Villar-del-Moral, J. et al. Prognostic factors and staging systems in parathyroid cancer: A multicenter cohort study. Surgery 156, 1132–1144 (2014).
Article PubMed Google Scholar
Wei, B. et al. Extended en bloc reoperation for recurrent or persistent parathyroid carcinoma: Analysis of 31 cases in a single institute experience. Ann. Surg. Oncol. 29, 1208–1215 (2022).
Article PubMed Google Scholar
Erickson, L. A., Mete, O., Juhlin, C. C., Perren, A. & Gill, A. J. Overview of the 2022 WHO classification of parathyroid tumors. Endocr. Pathol. 33, 64–89 (2022).
Article PubMed Google Scholar
Schantz, A. & Castleman, B. Parathyroid carcinoma. A study of 70 cases. Cancer 31, 600–605 (1973).
Article CAS PubMed Google Scholar
Kim, J. et al. The dangers of parathyroid biopsy. J. Otolaryngol. Head Neck Surg. 46, 4 (2017).
Article PubMed PubMed Central Google Scholar
Chiang, C. Y. et al. A controlled, prospective study of neuropsychological outcomes post parathyroidectomy in primary hyperparathyroid patients. Clin. Endocrinol. (Oxf.) 62, 99–104 (2005).
Article PubMed Google Scholar
Chandran, M., Yeh, L. T. L., de Jong, M. C., Bilezikian, J. P. & Parameswaran, R. Cognitive deficits in primary hyperparathyroidism: What we know and what we do not know—A narrative review. Rev. Endocr. Metab. Disord. 23, 1079–1087 (2022).
Article CAS PubMed Google Scholar
Olson, B. & Marks, D. L. Pretreatment cancer-related cognitive impairment—Mechanisms and outlook. Cancers 11, 687 (2019).
Article CAS PubMed PubMed Central Google Scholar
Jansen, C. E., Cooper, B. A., Dodd, M. J. & Miaskowski, C. A. A prospective longitudinal study of chemotherapy-induced cognitive changes in breast cancer patients. Support. Care Cancer 19(10), 1647–1656 (2011).
Article PubMed Google Scholar
Walker, M. D. et al. Neuropsychological features in primary hyperparathyroidism: A prospective study. J. Clin. Endocrinol. Metab. 94, 1951–1958 (2009).
Article CAS PubMed PubMed Central Google Scholar
Perrier, N. D. et al. Prospective, randomized, controlled trial of parathyroidectomy versus observation in patients with “asymptomatic” primary hyperparathyroidism. Surgery 146, 1116–1122 (2009).
Article PubMed Google Scholar
Prager, G. et al. Parathyroidectomy improves concentration and retentiveness in patients with primary hyperparathyroidism. Surgery 132, 930–936 (2002).
Article PubMed Google Scholar
Wang, X. et al. Prediction of the 1-year risk of incident lung cancer: Prospective study using electronic health records from the state of Maine. J. Med. Internet Res. 21, e13260 (2019).
Article PubMed PubMed Central Google Scholar
Li, Q. et al. XGBoost-based and tumor-immune characterized gene signature for the prediction of metastatic status in breast cancer. J. Transl. Med. 20, 177 (2022).
Article CAS PubMed PubMed Central Google Scholar
Chen, D. et al. Integrated machine learning and bioinformatic analyses constructed a novel stemness-related classifier to predict prognosis and immunotherapy responses for hepatocellular carcinoma patients. Int. J. Biol. Sci. 18, 360–373 (2022).
Article CAS PubMed PubMed Central Google Scholar
Liu, W. et al. Prediction of lung metastases in thyroid cancer using machine learning based on SEER database. Cancer Med. 11, 2503–2515 (2022).
Article PubMed PubMed Central Google Scholar
Tibshirani, R. Regression shrinkage and selection via the lasso. J. R. Stat. Soc. Ser. B Methodol. 58, 267–288 (1996).
MathSciNet MATH Google Scholar
Huang, J. C. et al. Predictive modeling of blood pressure during hemodialysis: A comparison of linear model, random forest, support vector regression, XGBoost, LASSO regression and ensemble method. Comput. Methods Progr. Biomed. 195, 105536 (2020).
Article Google Scholar
Li, Y., Lu, F. & Yin, Y. Applying logistic LASSO regression for the diagnosis of atypical Crohn’s disease. Sci. Rep. 12(1), 11340 (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Kang, J. S. et al. Risk prediction for malignant intraductal papillary mucinous neoplasm of the pancreas: Logistic regression versus machine learning. Sci. Rep. 10(1), 20140 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
McEligot, A. J., Poynor, V., Sharma, R. & Panangadan, A. Logistic LASSO regression for dietary intakes and breast cancer. Nutrients 12(9), 2652 (2020).
Article CAS PubMed PubMed Central Google Scholar
Folstein, M. F., Folstein, S. E. & McHugh, P. R. Mini-mental state. J. Psychiatr. Res. 12, 189–198 (1975).
Article CAS PubMed Google Scholar
Tombaugh, T. N. The mini-mental state examination: a comprehensive review. Dementia 40(9), 922–935 (1992).
CAS Google Scholar
O’Driscoll, C. & Shaikh, M. Cross-cultural applicability of the montreal cognitive assessment (MoCA): A systematic review. J. Alzheimers Dis. 58, 789–801 (2017).
Article PubMed Google Scholar
Nasreddine, Z. S. et al. The montreal cognitive assessment, MoCA: A brief screening tool for mild cognitive impairment: MOCA: A brief screening tool for MCI. J. Am. Geriatr. Soc. 53, 695–699 (2005).
Article PubMed Google Scholar
Hamilton, M. A rating scale for depression. J. Neurol. Neurosurg. Psychiatry 23(1), 56–62 (1960).
Article CAS PubMed PubMed Central Google Scholar
Zimmerman, M., Martinez, J. H., Young, D., Chelminski, I. & Dalrymple, K. Severity classification on the Hamilton depression rating scale. J. Affect. Disord. 150, 384–388 (2013).
Article PubMed Google Scholar
Thompson, E. Hamilton rating scale for anxiety (HAM-A). Occup. Med. (Lond.) 65(7), 601 (2015).
Article PubMed Google Scholar
Hamilton, M. The assessment of anxiety states by rating. Br. J. Med. Psychol. 32, 50–55 (1959).
Article CAS PubMed Google Scholar
Marin, R. S., Biedrzycki, R. C. & Firinciogullari, S. Reliability and validity of the apathy evaluation scale. Psychiatry Res. 38, 143–162 (1991).
Article CAS PubMed Google Scholar
Starkstein, E. & Mayberg, S. Validity, and Clinical of Apathy in Disease. 6 (1992).
Faerden, A. et al. Reliability and validity of the self-report version of the apathy evaluation scale in first-episode psychosis: Concordance with the clinical version at baseline and 12 months follow-up. Psychiatry Res. 267, 140–147 (2018).
Article PubMed Google Scholar
Chen, T. & Guestrin, C. XGBoost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining 785–794 (ACM, 2016).
Benge, J. F. et al. Cognitive and affective sequelae of primary hyperparathyroidism and early response to parathyroidectomy. J. Int. Neuropsychol. Soc. 15, 1002–1011 (2009).
Article PubMed Google Scholar
Liu, M. et al. Cognition and cerebrovascular function in primary hyperparathyroidism before and after parathyroidectomy. J. Endocrinol. Invest. 43, 369–379 (2020).
Article CAS PubMed Google Scholar
Babińska, D. et al. Evaluation of selected cognitive functions before and after surgery for primary hyperparathyroidism. Langenbecks Arch. Surg. 397, 825–831 (2012).
Article PubMed Google Scholar
Bilezikian, J. P. et al. Evaluation and management of primary hyperparathyroidism: Summary statement and guidelines from the fifth international workshop. J. Bone Miner. Res. 37(11), 2293–2314 (2022).
Article PubMed Google Scholar
Usdin, T. B., Wang, T., Hoare, S. R. J., Mezey, É. & Palkovits, M. New members of the parathyroid hormone/parathyroid hormone receptor family: The parathyroid hormone 2 receptor and tuberoinfundibular peptide of 39 residues. Front. Neuroendocrinol. 21, 349–383 (2000).
Article CAS PubMed Google Scholar
Björkman, M. P., Sorva, A. J. & Tilvis, R. S. Does elevated parathyroid hormone concentration predict cognitive decline in older people?. Aging Clin. Exp. Res. 22, 164–169 (2010).
Article PubMed Google Scholar
Puy, L. et al. Cognitive impairments and dysexecutive behavioral disorders in chronic kidney disease. J. Neuropsychiatry Clin. Neurosci. 30, 310–317 (2018).
Article PubMed Google Scholar
Roman, S. A. et al. The effects of serum calcium and parathyroid hormone changes on psychological and cognitive function in patients undergoing parathyroidectomy for primary hyperparathyroidism. Ann. Surg. 253, 131–137 (2011).
Article PubMed Google Scholar
Feart, C. et al. Associations of lower vitamin D concentrations with cognitive decline and long-term risk of dementia and Alzheimer’s disease in older adults. Alzheimers Dement. 13, 1207–1216 (2017).
Article PubMed Google Scholar
Repplinger, D., Schaefer, S., Chen, H. & Sippel, R. S. Neurocognitive dysfunction: A predictor of parathyroid hyperplasia. Surgery 146, 1138–1143 (2009).
Article PubMed Google Scholar
Rowland, J. T., Basic, D., Storey, J. E. & Conforti, D. A. The Rowland universal dementia assessment scale (RUDAS) and the Folstein MMSE in a multicultural cohort of elderly persons. Int. Psychogeriatr. 18, 111–120 (2006).
Article PubMed Google Scholar
Nieuwenhuis-Mark, R. E. The death knoll for the MMSE: Has it outlived its purpose?. J. Geriatr. Psychiatry Neurol. 23, 151–157 (2010).
Article PubMed Google Scholar
Pinto, T. C. C. et al. Is the Montreal cognitive assessment (MoCA) screening superior to the mini-mental state examination (MMSE) in the detection of mild cognitive impairment (MCI) and Alzheimer’s disease (AD) in the elderly?. Int. Psychogeriatr. 31, 491–504 (2019).
Article PubMed Google Scholar
Janelsins, M. C. et al. Longitudinal trajectory and characterization of cancer-related cognitive impairment in a nationwide cohort study. J. Clin. Oncol. 36, 3231–3239 (2018).
Article CAS PubMed Central Google Scholar
Ding, C., Guo, Y., Mo, Q. & Ma, J. Prediction model of postoperative severe hypocalcemia in patients with secondary hyperparathyroidism based on logistic regression and XGBoost algorithm. Comput. Math. Methods Med. 2022, 1–7 (2022).
Google Scholar
Liu, H. et al. Machine learning risk score for prediction of gestational diabetes in early pregnancy in Tianjin, China. Diabetes Metab. Res. Rev. 37, e3397 (2021).
Article CAS PubMed Google Scholar
Obermeyer, Z. & Emanuel, E. J. Predicting the future—Big data, machine learning, and clinical medicine. N. Engl. J. Med. 375, 1216–1219 (2016).
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

Yuting Wang and Teng Zhao contributed equally as first authors.

Author information

These authors contributed equally: Yuting Wang and Teng Zhao.

Authors and Affiliations

Department of Thyroid and Neck Surgery, Beijing Chaoyang Hospital, Capital Medical University, Beijing, China
Yuting Wang, Bojun Wei, Teng Zhao, Hong Shen, Xing Liu, Jiacheng Wang, Qian Wang, Rongfang Shen & Dalin Feng

Authors

Yuting Wang
View author publications
You can also search for this author in PubMed Google Scholar
Bojun Wei
View author publications
You can also search for this author in PubMed Google Scholar
Teng Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Hong Shen
View author publications
You can also search for this author in PubMed Google Scholar
Xing Liu
View author publications
You can also search for this author in PubMed Google Scholar
Jiacheng Wang
View author publications
You can also search for this author in PubMed Google Scholar
Qian Wang
View author publications
You can also search for this author in PubMed Google Scholar
Rongfang Shen
View author publications
You can also search for this author in PubMed Google Scholar
Dalin Feng
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Y.W.: Writing-Original Draft, Formal analysis, Software, Visuallization. B.W.*: Writing-Review & Editing, Supervision. T.Z.: Conceptualization, Methodology. H.S.¹: Validation. X.L.¹, J.W.¹, Q.W.¹: Data Curation. R.S.¹, D.F.¹: Investigation.

Corresponding author

Correspondence to Bojun Wei.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Tables.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Wang, Y., Wei, B., Zhao, T. et al. Machine learning-based prediction models for parathyroid carcinoma using pre-surgery cognitive function and clinical features. Sci Rep 13, 19007 (2023). https://doi.org/10.1038/s41598-023-46294-7

Download citation

Received: 23 July 2023
Accepted: 30 October 2023
Published: 03 November 2023
DOI: https://doi.org/10.1038/s41598-023-46294-7

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Development and validation of a new algorithm for improved cardiovascular risk prediction

Delirium

Gut microbiome predicts cognitive function and depressive symptoms in late life

Introduction

Methods

Participants

Diagnostic criteria and follow-up

Neurocognitive and neuropsychological function assessment

Statistical methods

Model development and model performance evaluation

Ethics approval and consent to participate

Result

Clinical characteristics between the training set and the validating set

Comparison of preoperative demography and clinical characteristics between the PC group and BP group

Logistic regression analysis of the prediction model for PC

XGBoost model of the prediction model for PC

LASSO model of the prediction model for PC

Performance in prediction model of PC among XGBoost, LASSO, and logistic regression

Discussion

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Supplementary Tables.

Rights and permissions

About this article

Cite this article

Share this article

Comments

Search

Quick links