Prediction of future cognitive impairment among the community elderly: A machine-learning based approach

Na, Kyoung-Sae

doi:10.1038/s41598-019-39478-7

Download PDF

Article
Open access
Published: 04 March 2019

Prediction of future cognitive impairment among the community elderly: A machine-learning based approach

Kyoung-Sae Na ORCID: orcid.org/0000-0002-0148-9827¹

Scientific Reports volume 9, Article number: 3335 (2019) Cite this article

5556 Accesses
39 Citations
1 Altmetric
Metrics details

Subjects

Abstract

The early detection of cognitive impairment is a key issue among the elderly. Although neuroimaging, genetic, and cerebrospinal measurements show promising results, high costs and invasiveness hinder their widespread use. Predicting cognitive impairment using easy-to-collect variables by non-invasive methods for community-dwelling elderly is useful prior to conducting such a comprehensive evaluation. This study aimed to develop a machine learning-based predictive model for future cognitive impairment. A total of 3424 community elderly without cognitive impairment were included from the nationwide dataset. The gradient boosting machine (GBM) was exploited to predict cognitive impairment after 2 years. The GBM performance was good (sensitivity = 0.967; specificity = 0.825; and AUC = 0.921). This study demonstrated that a machine learning-based predictive model might be used to screen future cognitive impairment using variables, which are commonly collected in community health care institutions. With efforts of enhancing the predictive performance, such a machine learning-based approach can further contribute to the improvement of the cognitive function in community elderly.

Self-supervised learning for human activity recognition using 700,000 person-days of wearable data

Article Open access 12 April 2024

Hang Yuan, Shing Chan, … Aiden Doherty

Neurofilaments as biomarkers in neurological disorders — towards clinical application

Article 12 April 2024

Michael Khalil, Charlotte E. Teunissen, … Jens Kuhle

The effects of genetic and modifiable risk factors on brain regions vulnerable to ageing and disease

Article Open access 27 March 2024

Jordi Manuello, Joosung Min, … Gwenaëlle Douaud

Introduction

Cognitive impairment has devastating effects on individuals, caregivers, and society. Individuals with cognitive impairment frequently suffer from comorbid psychiatric conditions (e.g., depression, wandering, agitation, insomnia, psychotic symptoms, etc.)^1,2. It is commonly associated with physical diseases, such as diabetes mellitus (DM) and cardiovascular diseases³. Individuals with cognitive impairment also experience a decreased quality of life⁴.

The harmful effects of cognitive impairment are not restricted to its advanced forms such as dementia. In addition to the well-known risk of progress to dementia⁵, mild cognitive impairment (MCI) can also cause substantial psychological symptoms in caregivers⁶ and patients⁷. The prevalence of MCI is 10–20% among the elderly. Approximately 30–40% of cases with MCI consequently progress to dementia⁸. The financial burden and medical complications among patients with MCI are certainly higher than those for healthy individuals⁹.

Currently, the best way to prevent or minimize this devastating course is to detect risk in people early and begin intervention¹⁰. Many researchers have identified neurobiological, genetic, and neuroimaging biomarkers for cognitive impairment, particularly in Alzheimer’s disease^10,11. These efforts should persist, and would consequently yield results. However, the high costs of neuroimaging and genetic evaluation restrict their wide dissemination to the community elderly.

Various factors, including sociodemographic, personal, health, and quality of life, contribute to future cognitive functions^12,13,14,15. These factors provide invaluable information that is not captured by a simple cognitive test, such as the Mini-Mental Status Examination (MMSE). For example, regular exercise has therapeutic effects for stress-induced cognitive impairment¹⁶. If one exercises regularly, then he or she is likely to have an advantage in terms of cognitive functioning. Alcohol use and depression are well known for their adverse effects on cognitive functions^17,18. However, simply identifying the presence or absence of various risk or protective factors is not helpful in predicting future cognitive impairment. These variables can be meaningful when their complex interactions are analyzed using appropriate algorithms.

This study sought to build a predictive model that incorporates variables that can be easily obtained at a low cost. Machine learning is used to integrate these variables and construct a reproducible predictive model.

Results

Participant data

Table 2 summarizes the variables used in the predictive model. The mean (SD) age of the participants at baseline was 70.4 (6.97) years. The mean (SD) score on the K-MMSE at baseline was 26.9 (3.14). The mean (SD) K-MMSE score after 2 years was 25.9 (4.33). The number of the elderly with cognitive impairment after 2 years was 80 (2.34%).

Table 1 Cut-off point of the scores on the Korean Mini-mental Status Examination according to age group and gender.

Full size table

Table 2 Summary of the sociodemographic, health, interpersonal, quality of life, and subjective well-being variables.

Full size table

Performance

Table 3 shows that the sensitivity of the predictive model was excellent (0.967). The negative predictive value (NPV) was 0.999, while precision (positive predictive value) was 0.143. The AUC (0.921) represents good binary classifying performance (Fig. 1). The precision–recall plot shows that the classifier performs well considering the highly imbalanced dataset (Fig. 2).

Table 3 Performance metrics of the gradient boosting machine.

Full size table

Importance of variables

Figure 3 presents the 10 most influential variables. As expected, age, MMSE, and education levels had the strongest influences on the predictive model. The limited daily activity caused by health problems was ranked fifth, followed by the presence of cohabitating children, arthritis diagnosis, subjective satisfaction in their own economic state, subjective satisfaction in their own general health, and DM or hyperglycemia diagnosis.

Discussion

A predictive model with machine learning algorithms was built herein to classify elderly at risk for cognitive impairment 2 years later. The predictive model with GBM showed excellent sensitivity (0.968) and AUC (0.921). Specificity (0.825) and accuracy (0.829) were tolerable. Overall, this predictive classifier seemed to have good screening performance¹⁹. This predictive performance is better than that of the previous study, which used machine learning to compute the likelihood of dementia 1 year later²⁰. However, the performance of the predictive model should be cautiously considered in terms of the low F₁-score and MCC. The low F₁-score was already expected because the dataset was highly imbalanced in favor of the negative cases. The modest MCC values might have resulted from the low precision (0.143). In short, if 1,000 elderly people are classified to the cognitive impairment group, only 143 would actually be suffering from cognitive impairment. Further, the excellent negative predictive value (0.999) and sensitivity ensure that almost all elderly people classified as having no future cognitive impairments will be actually normal. This high-recall and low-precision predictive model is frequently used in the field of medicine, where failure of detection of the risk group can lead to critical health problems; this is also why the primary outcome measure was set to sensitivity.

The longitudinal approach of this study is differentiated from several studies using neuroimaging modalities. Many of such studies built classification models based on the matched case-control design (for a detailed review, please refer to the study by Pellegrini et al.²¹). A similar proportion of the case and controls is advantageous for building a model with stronger performance metrics. However, in the real-world, the number of the elderly with cognitive impairment is substantially lower than those with normal cognitive function. Hence, the proposed algorithm would be suitable for screening future cognitive impairment in practice.

The high cost and restricted measuring environment of MRI and PET are possible limitations of their wide application to community-dwelling elderly. Needle insertion and the use of radioactive materials are additional drawbacks of PET. In contrast, the predictive models in this study only required variables that can be easily collected during the routine practice of the community healthcare centers. Together with good predictive performances, the availability of the variables makes it possible to disseminate and screen future cognitive impairment among community-dwelling elderly.

By contrast, variables that are important in the predictive models should be noted. The importance of the baseline cognitive function, age, and educational levels for future cognitive function has been consistently reported^22,23. The other major important variables of the predictive model herein were the limited daily activity caused by health problems, presence of the cohabitating children, chronic diseases (arthritis and DM), and subjective wellbeing (satisfaction in their own economic and health status). Although the weights of the variables are relatively small, this supports the notion that there may be complex direct and indirect interactions among various factors on the cognitive function²⁴. Previous studies reported a close association between cognitive functions and life satisfaction²⁵. Cohabiting children also had beneficial roles in the cognitive functioning of the elderly. First, they can serve familiar relationships in the family, thereby reducing loneliness in the elderly. The elderly frequently experience loss and loneliness. Recent studies have suggested that loneliness can exert harmful effects on the cognitive functions and mental health of the elderly²⁶. Children can be a psychological comfort and prevent solitude in the elderly²⁷. Additionally, children who frequently meet with their elderly patients can easily recognize any significant changes in their parents’ cognitive functions. This may lead to early evaluation and intervention, which contribute to a better cognitive outcome. However, it is plausible that cognitive impairments would have reciprocal relationships with the quality of life, subjective wellbeing, and functional disability in the elderly²⁸.

Although several important factors that contribute to the predictive model have been briefly discussed, what counts is not the individual risk or protective factors, but a model that encompasses such factors and identifies which one is likely to be cognitively impaired. To date, several research groups, not limited to the Republic of Korea, have used the KLoSA data to examine the risk factors of cognitive impairment. One group evaluated the cognitive changes between 2008 and 2012 and identified that baseline social activities, including contact with their children, were associated with less cognitive impairment²⁹. Other studies have shown that gender³⁰ and body mass index³¹ played a role in the future cognitive functioning among the elderly. Some studies revealed risk factors for the cognitive functioning in a cross-sectional design^13,32,33. However, although the data similar to those in the previous studies were used herein, the present study differed in terms of the objective. While all the previous studies using the KLoSA data aimed to identify the risk factors for cognitive impairment, this study used data from the national survey to pragmatically build a predictive model.

Several limitations should be noted. First, a binary classifier was built instead of a multiclass classifier (healthy controls vs. MCI vs. dementia). As stated in the Introduction section, finely discriminating the degree of cognitive impairment was not the objective of this study. Rather, this study intended to develop a model that can be widely used among the community-residing elderly given variables that are easy to collect at reasonable costs. Second, the cognitive impairment was measured without clinical diagnostic evaluation. Clinical criteria, such as the Diagnostic and Statistical Manual of Mental Disorders, 5th edition (DSM-5)³⁴ and the International Classification of Diseases, 10th edition (ICD-10), must be used to diagnose the severe form of cognitive impairment, such as dementia³⁵. Third, we may also need additional measurements, including hematological, urine, and brain MRI to specify the types of dementia. However, most of these professional measurements are taken at the hospital for selected populations who have risk factors and/or symptoms. In contrast, the predictive model for future cognitive impairment was constructed based on the community-residing middle-aged to elderly. The primary objective of this machine learning-based predictive model is to screen the elderly who will likely have cognitive impairment 2 years later, but not confirm the specific neurocognitive disorders. The weakness of the MMSE, varying accuracy according to the age, educational levels, and gender³⁶ were minimized by applying stratified cut-off points for each subgroup. Hence, the lack of a clinician-made diagnostic evaluation will not substantially gilt off the strength of this study.

This study demonstrated that the sociodemographic, health, functional, and interpersonal, and subjective domain variables can be used to predict future cognitive impairment among community-dwelling elderly. These variables can be easily collected from the elderly and their close relatives; hence, this predictive model can be widely disseminated to the community. Considering the effort put into enhancing the performance of this predictive model, the model can be of help to community-dwelling elderly in terms of promoting cognitive function before it becomes worse.

Methods

Participants and data

Data from the Korean Longitudinal Study of Aging (KLoSA)³⁷ from 2014 to 2016 were used. The participants of the survey were recruited using a multistage stratified cluster sampling based on 15 geographical areas and housing types. Blaise (http://blaise.com) was used for convenient and accurate data collection. Blaise is a computer-assisted personal interviewing software widely used over 30 countries. A skilled interview is important for obtaining reliable information; hence, intensive education and mock interviews were conducted 1 month before the start of the survey. All participants provided written informed consent before the data collection.

The sampling frame of the KLoSA was initially created and used in the population census in the Republic of Korea in 2005³⁸. The first survey was conducted between August and December in 2006. The initial respondents were 10,254 individuals aged over 45 years. The KLoSA survey is biennially performed. The author used data from 2014 (wave 5) and 2016 (wave 6) to exclude the very young age group and utilize the most recent information. Based on the previous study³⁹, the criteria of the cognitive impairment were defined as the Korean Mini-mental State Examination (K-MMSE) scores below 1 standard deviation of the mean scores of age by educational level stratified groups (Table 1). Unlike the original study³⁹, the current study categorized uneducated and less than 6 years of education into the same group due to the lack of the detailed information on the years of education less than 4 years.

The inclusion criteria at baseline were elderly aged between 60 and 89 without cognitive impairment. The total number of participants included in the final dataset was 3424 (i.e., 1586 males and 1838 females).

Based on previous studies^9,14 and expert opinions, the author used 35 variables associated with cognitive functions from the four main domains (i.e., sociodemographic, health, functional, and subjective wellbeing) (Table 2).

The study protocol was approved by the Institutional Review Board in the Gachon University Gil Medical Center (GCIRB2018-152). All methods were performed in accordance with the relevant guidelines and regulations.

Preprocessing

The proportion of the training and hold-out test set was determined as 0.7 and 0.3, respectively. The synthetic minority over-sampling technique (SMOTE) was used to deal with the imbalanced ratio of the elderly with and without cognitive impairment⁴⁰. Unlike up-sampling, which simply replicates duplicate samples, the SMOTE generates artificial data that resemble the original dataset. The SMOTE was only applied to the training set in the cross-validation to avoid any possibility of overfitting. The final performance metrics were evaluated with the hold-out test set, which has never been included in the SMOTE or cross-validation procedures.

Given the number of the observations and variables, no prior feature selection process was conducted. The importance of the variables in each predictive model was separately summarized.

Machine learning model

All machine learning processes were conducted using the caret package⁴¹ for R (https://www.r-project.org/). The caret package enables the construction of a unitary preprocessing dataset and, thus, provides a reliable comparison between different machine learning models. The gradient boosting machine (GBM) was used herein because it utilizes the ensemble approach; hence, the predictive model might be built while minimizing classifying errors. The principles and practices of the GBM are well described in several literatures^42,43; thus, the essential features of the GBM are only briefly summarized herein. The GBM is an ensemble algorithm with the boosting method based on the decision tree model⁴⁴. The boosting algorithm initially generates a weak classifier with the same weights for all instances. This weak classifier can correctly classify the binary class only slightly more than random classifiers do by chance. The classifying algorithm is then trained again. This time, the weight, which wrongly classified the target in the previous training, is increased, whereas the weight of the correct classifiers is decreased. This adjustment of the weights makes the classifier more robust to the previously misclassified cases. The ‘gradient’ in the GBM has the same meaning as the term ‘gradient descent.’ Gradient descent is one of the several mathematical algorithms by which the boosting methods update the classifier to become stronger. The gradient descent adjusts the parameters to minimize a loss function and determine the optimal point with the smallest error. For example, the fourth classifier is fitted to the residual error made from the third classifier. This process of sequentially adding new weak classifiers with gradient descent is iterated until the classifying performance of the classifier becomes perfect (i.e., the error rate is 0) or the iteration reaches the predetermined number.

Cross-validation

This k-fold cross-validation is a recommended cross-validation method because it can secure more samples for training without loss of sample size as compared to the splitting method⁴⁵. Within the training set, a ten-fold cross-validation was conducted with five repeated processes.

Hyperparameters

Hyperparameters were tuned by the grid search during the cross-validation. The learning rate is the basic component of hyperparameters in most machine learning algorithms. The time to reach the optimal point with the least error can be delayed when the learning rate is too low. However, when the learning rate is too large, the algorithm might jump over the optimal point such that suboptimal points can be obtained after the predetermined length of learning. The depth of trees reflects the number of splits. More interactions among the variables were considered in the algorithm as the depth of trees became large. Finally, the following hyperparameters were tuned: shrinkage (learning rate) was 0.007; n.trees (number of trees) was 1000; interaction.depth (depth of trees) was 4; and n.minobsinnode (minimum number of observations allowed in the trees of terminal nodes) was 5. Figure 4 visualizes the performance metrics according to the shrinkage values.

Performance metrics

The performance metrics were considered based on the imbalanced proportion of the elderly with cognitive impairment. Detecting cognitive impairment among a large number of observations is important when applied in real-world practice; hence, sensitivity was first considered. The overall accuracy and the area under the receive operator curve (AUC) were measured as secondary performance metrics.

The F₁-score and Matthew’s correlation coefficients (MCC) were used as the performance metrics⁴⁶. The F₁-score was formularized using the true positives (TP), false positives (FP), and false negatives (FN) \((\frac{2TP\,}{2TP+FP+{FN}})\). As the F1-score does not account for the true negatives (TN), it has limited utility in the highly imbalanced data in which majority of the cases belong to the negatives.

In contrast, the MCC utilizes all four major components of the confusion metrics \((\frac{(TP\times TN)-(FP\times FN)\,}{\sqrt{(TP+FP)(TP+FN)(TN+FP)(TN+FN)}})\). The MCC are a discretized form of the Pearson’s correlational analysis; thus, the MCC value is interpreted in terms of the Pearson’s correlational coefficients, r⁴⁷. Unlike other performance metrics with a range of 0 to 1, the range of the MCC is from −1 to 1. The value of −1 in the MCC indicates complete disagreement between the actual and predicted values, such as the value of 0 for accuracy. In contrast, the value of +1 in the MCC represents complete agreement between actual and predicted values, such as 1 for accuracy. Although the interpretation of the MCC might not be intuitive as other performance metrics ranging from 0 to 1, it is advantageous over the F₁-score in the imbalance dataset.

Data Availability

The dataset generated and analyzed in the current study is available from the corresponding author upon reasonable request. The predictive model is deployed and available at https://ksna19.shinyapps.io/Prediction_of_cognitive_function.

References

Werner, P. & Korczyn, A. D. Mild cognitive impairment: conceptual, assessment, ethical, and social issues. Clin Interv Aging 3, 413–420 (2008).
Article Google Scholar
Bennett, S. & Thomas, A. J. Depression and dementia: cause, consequence or coincidence? Maturitas 79, 184–190, https://doi.org/10.1016/j.maturitas.2014.05.009 (2014).
Article PubMed Google Scholar
Yuan, X. Y. & Wang, X. G. Mild cognitive impairment in type 2 diabetes mellitus and related risk factors: a review. Rev Neurosci 28, 715–723, https://doi.org/10.1515/revneuro-2017-0016 (2017).
Article PubMed Google Scholar
Pan, C. W. et al. Cognitive dysfunction and health-related quality of life among older Chinese. Sci Rep 5, 17301, https://doi.org/10.1038/srep17301 (2015).
Article ADS CAS PubMed PubMed Central Google Scholar
Farias, S. T., Mungas, D., Reed, B. R., Harvey, D. & DeCarli, C. Progression of mild cognitive impairment to dementia in clinic- vs community-based cohorts. Arch Neurol 66, 1151–1157, https://doi.org/10.1001/archneurol.2009.106 (2009).
Article PubMed PubMed Central Google Scholar
Paradise, M. et al. Caregiver burden in mild cognitive impairment. Aging Ment Health 19, 72–78, https://doi.org/10.1080/13607863.2014.915922 (2015).
Article PubMed Google Scholar
Song, D., Li, P. W. C. & Yu, D. S. F. The association between depression and mild cognitive impairment: A cross-sectional study. Int J Geriatr Psychiatry 33, 672–674, https://doi.org/10.1002/gps.4798 (2018).
Article PubMed Google Scholar
Petersen, R. C. Clinical practice. Mild cognitive impairment. N Engl J Med 364, 2227–2234, https://doi.org/10.1056/NEJMcp0910237 (2011).
Article CAS PubMed Google Scholar
Ton, T. G. N. et al. The financial burden and health care utilization patterns associated with amnestic mild cognitive impairment. Alzheimers Dement 13, 217–224, https://doi.org/10.1016/j.jalz.2016.08.009 (2017).
Article PubMed Google Scholar
Frisoni, G. B. et al. Strategic roadmap for an early diagnosis of Alzheimer’s disease based on biomarkers. Lancet Neurol 16, 661–676, https://doi.org/10.1016/S1474-4422(17)30159-X (2017).
Article PubMed Google Scholar
Winblad, B. et al. Defeating Alzheimer’s disease and other dementias: a priority for European science and society. Lancet Neurol 15, 455–532, https://doi.org/10.1016/S1474-4422(16)00062-4 (2016).
Article PubMed Google Scholar
Barnes, D. E. & Yaffe, K. The projected effect of risk factor reduction on Alzheimer’s disease prevalence. Lancet Neurol 10, 819–828, https://doi.org/10.1016/S1474-4422(11)70072-2 (2011).
Article PubMed PubMed Central Google Scholar
Lyu, J., Lee, C. M. & Dugan, E. Risk factors related to cognitive functioning: a cross-national comparison of U.S. and Korean older adults. Int J Aging Hum Dev 79, 81–101 (2014).
Article Google Scholar
Cooper, C., Sommerlad, A., Lyketsos, C. G. & Livingston, G. Modifiable predictors of dementia in mild cognitive impairment: a systematic review and meta-analysis. Am J Psychiatry 172, 323–334, https://doi.org/10.1176/appi.ajp.2014.14070878 (2015).
Article PubMed Google Scholar
Cooper, R. et al. Objectively measured physical capability levels and mortality: systematic review and meta-analysis. BMJ 341, c4467, https://doi.org/10.1136/bmj.c4467 (2010).
Article PubMed PubMed Central Google Scholar
Nakajima, S., Ohsawa, I., Ohta, S., Ohno, M. & Mikami, T. Regular voluntary exercise cures stress-induced impairment of cognitive function and cell proliferation accompanied by increases in cerebral IGF-1 and GST activity in mice. Behav Brain Res 211, 178–184, https://doi.org/10.1016/j.bbr.2010.03.028 (2010).
Article CAS PubMed Google Scholar
Nagane, A. et al. Comparative study of cognitive impairment between medicated and medication-free patients with remitted major depression: class-specific influence by tricyclic antidepressants and newer antidepressants. Psychiatry Res 218, 101–105, https://doi.org/10.1016/j.psychres.2014.04.013 (2014).
Article PubMed Google Scholar
Schwarzinger, M. et al. Contribution of alcohol use disorders to the burden of dementia in France 2008-13: a nationwide retrospective cohort study. Lancet Public Health 3, e124–e132, https://doi.org/10.1016/S2468-2667(18)30022-7 (2018).
Article PubMed Google Scholar
Simundic, A. M. Measures of Diagnostic Accuracy: Basic Definitions. EJIFCC 19, 203–211 (2009).
PubMed PubMed Central Google Scholar
Hurd, M. D., Martorell, P., Delavande, A., Mullen, K. J. & Langa, K. M. Monetary costs of dementia in the United States. N Engl J Med 368, 1326–1334, https://doi.org/10.1056/NEJMsa1204629 (2013).
Article CAS PubMed PubMed Central Google Scholar
Pellegrini, E. et al. Machine learning of neuroimaging for assisted diagnosis of cognitive impairment and dementia: A systematic review. Alzheimers Dement (Amst) 10, 519–535, https://doi.org/10.1016/j.dadm.2018.07.004 (2018).
Article Google Scholar
Tzang, R. F., Yang, A. C., Yeh, H. L., Liu, M. E. & Tsai, S. J. Association of depression and loneliness with specific cognitive performance in non-demented elderly males. Med Sci Monit 21, 100–104, https://doi.org/10.12659/MSM.891086 (2015).
Article PubMed PubMed Central Google Scholar
Herrmann, L. L., Goodwin, G. M. & Ebmeier, K. P. The cognitive neuropsychology of depression in the elderly. Psychol Med 37, 1693–1702, https://doi.org/10.1017/S0033291707001134 (2007).
Article PubMed Google Scholar
Pinto, J. M., Fontaine, A. M. & Neri, A. L. The influence of physical and mental health on life satisfaction is mediated by self-rated health: A study with Brazilian elderly. Arch Gerontol Geriatr 65, 104–110, https://doi.org/10.1016/j.archger.2016.03.009 (2016).
Article PubMed Google Scholar
Rouch, I. et al. Seven-year predictors of self-rated health and life satisfaction in the elderly: the PROOF study. J Nutr Health Aging 18, 840–847, https://doi.org/10.1007/s12603-014-0488-2 (2014).
Article CAS PubMed Google Scholar
Landeiro, F., Barrows, P., Nuttall Musson, E., Gray, A. M. & Leal, J. Reducing social isolation and loneliness in older people: a systematic review protocol. BMJ Open 7, e013778, https://doi.org/10.1136/bmjopen-2016-013778 (2017).
Article PubMed PubMed Central Google Scholar
Wang, S. On a young-elderly support system maintained in separation in urban areas. Chin J Popul Sci 7, 371–378 (1995).
CAS PubMed Google Scholar
Dos Santos, S. B., Rocha, G. P., Fernandez, L. L., de Padua, A. C. & Reppold, C. T. Association of Lower Spiritual Well-Being, Social Support, Self-Esteem, Subjective Well-Being, Optimism and Hope Scores With Mild Cognitive Impairment and Mild Dementia. Front Psychol 9, 371, https://doi.org/10.3389/fpsyg.2018.00371 (2018).
Article PubMed PubMed Central Google Scholar
Lee, S. H. & Kim, Y. B. Which type of social activities may reduce cognitive decline in the elderly?: a longitudinal population-based study. BMC Geriatr 16, 165, https://doi.org/10.1186/s12877-016-0343-x (2016).
Article PubMed PubMed Central Google Scholar
Lyu, J. & Kim, H. Y. Gender-Specific Incidence and Predictors of Cognitive Impairment among Older Koreans: Findings from a 6-Year Prospective Cohort Study. Psychiatry Investig 13, 473–479, https://doi.org/10.4306/pi.2016.13.5.473 (2016).
Article PubMed PubMed Central Google Scholar
Kim, S. & Kim, Y. & Park, S. M. Body Mass Index and Decline of Cognitive Function. PLoS One 11, e0148908, https://doi.org/10.1371/journal.pone.0148908 (2016).
Article CAS PubMed PubMed Central Google Scholar
Min, J. Y., Park, J. B., Lee, K. J. & Min, K. B. The impact of occupational experience on cognitive and physical functional status among older adults in a representative sample of Korean subjects. Ann Occup Environ Med 27, 11, https://doi.org/10.1186/s40557-015-0057-0 (2015).
Article PubMed PubMed Central Google Scholar
Jun, H. J. Educational differences in the cognitive functioning of grandmothers caring for grandchildren in South Korea. Res Aging 37, 500–523, https://doi.org/10.1177/0164027514545239 (2015).
Article PubMed Google Scholar
American Psychiatric Association. Diagnostic and statistical manual of mental disorders. 5 edn, (American Psychiatric Publishing, 2013).
World Health Organization. ICD-10 Version:2016, http://apps.who.int/classifications/icd10/browse/2016/en (2016).
Belle, S. H. et al. Effect of education and gender adjustment on the sensitivity and specificity of a cognitive screening battery for dementia: results from the MoVIES Project. Monongahela Valley Independent Elders Survey. Neuroepidemiology 15, 321–329, https://doi.org/10.1159/000109922 (1996).
Article CAS PubMed Google Scholar
Korea Employment Information Service. Korean Longitudinal Study of Ageing (KLoSA), http://survey.keis.or.kr/eng/klosa/klosa01.jsp.
Statistics Korea. Preliminary Results of the Population and Housing Census 2005 (Statistics Korea, Daejeon, Korea, 2006).
Kang, Y. W. A Normative Study of the Korean-Mini Mental State Examination (K-MMSE) in the Elderly. Kor J Psychol Gen 25, 1–12 (2006).
Google Scholar
Chawlam, N. V., Bowyerm, K. W., Hallm, L. O. & Philip Kegelmeyer, W. SMOTE: Synthetic Minority Over-sampling Technique. Journal of Artificial Intelligence Research 16, 321–357 (2002).
Article Google Scholar
Kuhn, M. Building Predictive Models in R Using the caret Package. J Statistical Software 28, 26, https://doi.org/10.18637/jss.v028.i05 (2008).
Article Google Scholar
Natekin, A. & Knoll, A. Gradient boosting machines, a tutorial. Front Neurorobot 7, 21, https://doi.org/10.3389/fnbot.2013.00021 (2013).
Article PubMed PubMed Central Google Scholar
Murphy, K. P. Machine learning: a probabilistic prospective (The Massachusetts Institute of Technology, 2012).
Greenwell, B., Boehmke, B., Cunningham, J. & Developers, G. Package ‘gbm’. https://cran.r-project.org/web/packages/gbm/gbm.pdf (2018).
Yestui, N. An Introduction to Machine Learning Theory (Wikibooks, 2015).
Matthews, B. W. Comparison of the predicted and observed secondary structure of T4 phage lysozyme. Biochim Biophys Acta 405, 442–451 (1975).
Article CAS Google Scholar
Hinkle, D. E., Wiersma, W. & Jurs, S. G. Applied Statistics for the Behavioral Sciences. 5th edn, (Houghton Mifflin, 2003).

Download references

Acknowledgements

This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIP; Ministry of Science, ICT & Future Planning) (No. 2017R1C1B5073684).

Author information

Authors and Affiliations

Department of Psychiatry, Gil Medical Center, Gachon University College of Medicine, Incheon, Republic of Korea
Kyoung-Sae Na

Authors

Kyoung-Sae Na
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

K.S.N. solely conceived the concept, designed the protocol, performed the machine learning analysis, and wrote the manuscript.

Corresponding author

Correspondence to Kyoung-Sae Na.

Ethics declarations

Competing Interests

The author declares no competing interests.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Na, KS. Prediction of future cognitive impairment among the community elderly: A machine-learning based approach. Sci Rep 9, 3335 (2019). https://doi.org/10.1038/s41598-019-39478-7

Download citation

Received: 19 June 2018
Accepted: 18 January 2019
Published: 04 March 2019
DOI: https://doi.org/10.1038/s41598-019-39478-7

This article is cited by

Visual and auditory attention defects in children with intermittent exotropia
- Cong Wei
- Ding-Ping Yang
- Shuai Chang
Italian Journal of Pediatrics (2024)
Immediate word recall in cognitive assessment can predict dementia using machine learning techniques
- Michael Adebisi Fayemiwo
- Toluwase Ayobami Olowookere
- Piper Jackson
Alzheimer's Research & Therapy (2023)
Machine learning analyses identify multi-modal frailty factors that selectively discriminate four cohorts in the Alzheimer’s disease spectrum: a COMPASS-ND study
- Linzy Bohn
- Shannon M. Drouin
- Roger A. Dixon
BMC Geriatrics (2023)
Clinical decision support system for quality of life among the elderly: an approach using artificial neural network
- Maryam Ahmadi
- Raoof Nopour
BMC Medical Informatics and Decision Making (2022)
Recommender System for Responsive Engagement of Senior Adults in Daily Activities
- Igor Kulev
- Carlijn Valk
- Pearl Pu
Journal of Population Ageing (2020)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Participant data

Performance

Importance of variables

Discussion

Methods

Participants and data

Preprocessing

Machine learning model

Cross-validation

Hyperparameters

Performance metrics

Data Availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing Interests

Additional information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links