Testing a machine-learning algorithm to predict the persistence and severity of major depressive disorder from baseline self-reports

Kessler, R C; van Loo, H M; Wardenaar, K J; Bossarte, R M; Brenner, L A; Cai, T; Ebert, D D; Hwang, I; Li, J; de Jonge, P; Nierenberg, A A; Petukhova, M V; Rosellini, A J; Sampson, N A; Schoevers, R A; Wilcox, M A; Zaslavsky, A M

doi:10.1038/mp.2015.198

Original Article
Published: 05 January 2016

Testing a machine-learning algorithm to predict the persistence and severity of major depressive disorder from baseline self-reports

R C Kessler¹,
H M van Loo²,
K J Wardenaar²,
R M Bossarte³,
L A Brenner^4,5,
T Cai⁶,
D D Ebert^1,7,
I Hwang¹,
J Li⁶,
P de Jonge²,
A A Nierenberg⁸,
M V Petukhova¹,
A J Rosellini¹,
N A Sampson¹,
R A Schoevers²,
M A Wilcox⁹ &
…
A M Zaslavsky¹

Molecular Psychiatry volume 21, pages 1366–1371 (2016)Cite this article

4832 Accesses
128 Citations
34 Altmetric
Metrics details

Subjects

Psychiatric disorders

Abstract

Heterogeneity of major depressive disorder (MDD) illness course complicates clinical decision-making. Although efforts to use symptom profiles or biomarkers to develop clinically useful prognostic subtypes have had limited success, a recent report showed that machine-learning (ML) models developed from self-reports about incident episode characteristics and comorbidities among respondents with lifetime MDD in the World Health Organization World Mental Health (WMH) Surveys predicted MDD persistence, chronicity and severity with good accuracy. We report results of model validation in an independent prospective national household sample of 1056 respondents with lifetime MDD at baseline. The WMH ML models were applied to these baseline data to generate predicted outcome scores that were compared with observed scores assessed 10–12 years after baseline. ML model prediction accuracy was also compared with that of conventional logistic regression models. Area under the receiver operating characteristic curve based on ML (0.63 for high chronicity and 0.71–0.76 for the other prospective outcomes) was consistently higher than for the logistic models (0.62–0.70) despite the latter models including more predictors. A total of 34.6–38.1% of respondents with subsequent high persistence chronicity and 40.8–55.8% with the severity indicators were in the top 20% of the baseline ML-predicted risk distribution, while only 0.9% of respondents with subsequent hospitalizations and 1.5% with suicide attempts were in the lowest 20% of the ML-predicted risk distribution. These results confirm that clinically useful MDD risk-stratification models can be generated from baseline patient self-reports and that ML methods improve on conventional methods in developing such models.

Access through your institution

Buy or subscribe

This is a preview of subscription content, access via your institution

Access options

Access through your institution

Buy this article

Purchase on Springer Link
Instant access to full article PDF

Buy now

Prices may be subject to local taxes which are calculated during checkout

Individualized prediction of three- and six-year outcomes of psychosis in a longitudinal multicenter study: a machine learning approach

Article Open access 02 July 2021

Jessica de Nijs, Thijs J. Burger, … Hugo G. Schnack

A machine learning algorithm to differentiate bipolar disorder from major depressive disorder using an online mental health questionnaire and blood biomarker data

Article Open access 12 January 2021

Jakub Tomasik, Sung Yeon Sarah Han, … Sabine Bahn

Predictive biosignature of major depressive disorder derived from physiological measurements of outpatients using machine learning

Article Open access 25 April 2023

Nicolas Ricka, Gauthier Pellegrin, … Pierre A. Geoffroy

References

Altshuler LL, Cohen LS, Moline ML, Kahn DA, Carpenter D, Docherty JP et al. Treatment of depression in women: a summary of the expert consensus guidelines. J Psychiatr Pract 2001; 7: 185–208.
Article CAS PubMed Google Scholar
Hetrick SE, Simmons M, Thompson A, Parker AG . What are specialist mental health clinician attitudes to guideline recommendations for the treatment of depression in young people? Aust N Z J Psychiatry 2011; 45: 993–1001.
Article PubMed Google Scholar
Kuiper S, McLean L, Fritz K, Lampe L, Malhi GS . Getting depression clinical practice guidelines right: time for change? Acta Psychiatr Scand Suppl 2013; 444: 24–30.
Article Google Scholar
Perlis RH . Use of treatment guidelines in clinical decision making in bipolar disorder: a pilot survey of clinicians. Curr Med Res Opin 2007; 23: 467–475.
Article PubMed Google Scholar
van Loo HM, de Jonge P, Romeijn JW, Kessler RC, Schoevers RA . Data-driven subtypes of major depressive disorder: a systematic review. BMC Med 2012; 10: 156.
Article PubMed PubMed Central Google Scholar
Vrieze E, Demyttenaere K, Bruffaerts R, Hermans D, Pizzagalli DA, Sienaert P et al. Dimensions in major depressive disorder and their relevance for treatment outcome. J Affect Disord 2014; 155: 35–41.
Article PubMed Google Scholar
Hasler G, Northoff G . Discovering imaging endophenotypes for major depression. Mol Psychiatry 2011; 16: 604–619.
Article CAS PubMed Google Scholar
Kennedy SH, Downar J, Evans KR, Feilotter H, Lam RW, MacQueen GM et al. The Canadian Biomarker Integration Network in Depression (CAN-BIND): advances in response prediction. Curr Pharm Des 2012; 18: 5976–5989.
Article CAS PubMed Google Scholar
Uher R, Perroud N, Ng MY, Hauser J, Henigsberg N, Maier W et al. Genome-wide pharmacogenetics of antidepressant response in the GENDEP project. Am J Psychiatry 2010; 167: 555–564.
Article PubMed Google Scholar
James G, Witten D, Hastie T, Tibshirani R . An Introduction to Statistical Learning: With Applications in R. Springer: New York, 2013.
Book Google Scholar
van der Laan MJ, Rose S . Targeted Learning: Causal Inference for Observational and Experimental Data. Springer: New York, 2011.
Book Google Scholar
Chang YJ, Chen LJ, Chung KP, Lai MS . Risk groups defined by Recursive Partitioning Analysis of patients with colorectal adenocarcinoma treated with colorectal resection. BMC Med Res Methodol 2012; 12: 2.
Article PubMed PubMed Central Google Scholar
Chao ST, Koyfman SA, Woody N, Angelov L, Soeder SL, Reddy CA et al. Recursive partitioning analysis index is predictive for overall survival in patients undergoing spine stereotactic body radiation therapy for spinal metastases. Int J Radiat Oncol Biol Phys 2012; 82: 1738–1743.
Article PubMed Google Scholar
Nelson JC, Zhang Q, Deberdt W, Marangell LB, Karamustafalioglu O, Lipkovich IA . Predictors of remission with placebo using an integrated study database from patients with major depressive disorder. Curr Med Res Opin 2012; 28: 325–334.
Article CAS PubMed Google Scholar
Riedel M, Moller HJ, Obermeier M, Adli M, Bauer M, Kronmuller K et al. Clinical predictors of response and remission in inpatients with depressive syndromes. J Affect Disord 2011; 133: 137–149.
Article PubMed Google Scholar
van Loo HM, Cai T, Gruber MJ, Li J, de Jonge P, Petukhova M et al. Major depressive disorder subtypes to predict long-term course. Depress Anxiety 2014; 31: 765–777.
Article PubMed PubMed Central Google Scholar
Wardenaar KJ, van Loo HM, Cai T, Fava M, Gruber MJ, Li J et al. The effects of co-morbidity in defining major depression subtypes associated with long-term course and severity. Psychol Med 2014; 44: 3289–3302.
Article CAS PubMed PubMed Central Google Scholar
Kessler RC, McGonagle KA, Zhao S, Nelson CB, Hughes M, Eshleman S et al. Lifetime and 12-month prevalence of DSM-III-R psychiatric disorders in the United States. Results from the National Comorbidity Survey. Arch Gen Psychiatry 1994; 51: 8–19.
CAS PubMed Google Scholar
Kessler RC, Merikangas KR, Berglund P, Eaton WW, Koretz DS, Walters EE . Mild disorders should not be eliminated from the DSM-V. Arch Gen Psychiatry 2003; 60: 1117–1122.
Article PubMed Google Scholar
Kessler RC, Wittchen HU, Abelson JM, McGonagle KA, Schwarz N, Kendler KS et al. Methodological studies of the Composite International Diagnostic Interview (CIDI) in the US National Comorbidity Survey. Int J Methods Psychiatr Res 1998; 7: 33–55.
Article Google Scholar
Spitzer RL, Williams JB, Gibbon M, First MB . The Structured Clinical Interview for DSM-III-R (SCID). I: history, rationale, and description. Arch Gen Psychiatry 1992; 49: 624–629.
Article CAS PubMed Google Scholar
Endicott J, Andreasen N, Spitzer RL . Family History Research Diagnostic Criteria (FHRDC). Biometrics Research, New York State Psychiatric Institute: New York, 1978.
Google Scholar
Therneau T, Atkinson B . An Introduction to Recursive Partitioning Using the RPART Routines. Mayo Foundation: Rochester, MN, 2015.
Google Scholar
Friedman J, Hastie T, Tibshirani R . Regularization Paths for Generalized Linear Models via Coordinate Descent. J Stat Softw 2010; 33: 1–22.
Article PubMed PubMed Central Google Scholar
SAS Institute Inc. SAS/STAT software. 9.2 for Unix edn. SAS Institute Inc.: Cary, NC, 2009.
Research Triangle Institute SUDAAN: Professional Software for Survey Data Analysis, 9th edn Research Triangle Institute: Research Triangle Park: NC, 2004.
Marsland S . Machine Learning: An Algorithmic Perspective 2nd (edn). Taylor & Francis: Boca Raton, FL, 2015.
Google Scholar
van der Laan MJ, Polley EC, Hubbard AE . Super learner. Stat Appl Genet Mol Biol 2007; 6: Article 25.
Article Google Scholar
Klein DN, Shankman SA, Rose S . Dysthymic disorder and double depression: prediction of 10-year course trajectories and outcomes. J Psychiatr Res 2008; 42: 408–415.
Article PubMed Google Scholar
Moos RH, Cronkite RC . Symptom-based predictors of a 10-year chronic course of treated depression. J Nerv Ment Dis 1999; 187: 360–368.
Article CAS PubMed Google Scholar
Angst J, Gamma A, Rossler W, Ajdacic V, Klein DN . Childhood adversity and chronicity of mood disorders. Eur Arch Psychiatry Clin Neurosci 2011; 261: 21–27.
Article PubMed Google Scholar
Bradvik L, Mattisson C, Bogren M, Nettelbladt P . Long-term suicide risk of depression in the Lundby cohort 1947–1997—severity and gender. Acta Psychiatr Scand 2008; 117: 185–191.
Article CAS PubMed Google Scholar
Rice ME, Harris GT . Comparing effect sizes in follow-up studies: ROC Area, Cohen's d, and r. Law Hum Behav 2005; 29: 615–620.
Article PubMed Google Scholar
Singh JP, Desmarais SL, Van Dorn RA . Measurement of predictive validity in violence risk assessment studies: a second-order systematic review. Behav Sci Law 2013; 31: 55–73.
Article PubMed Google Scholar
Sjostedt G, Grann M . Risk assessment: what is being predicted by actuarial prediction instruments? Int J Forensic Ment Health 2002; 1: 179–183.
Article Google Scholar
Echouffo-Tcheugui JB, Kengne AP . Comparative performance of diabetes-specific and general population-based cardiovascular risk assessment models in people with diabetes mellitus. Diabetes Metab 2013; 39: 389–396.
Article PubMed Google Scholar
Siontis GC, Tzoulaki I, Siontis KC, Ioannidis JP . Comparisons of established risk prediction models for cardiovascular disease: systematic review. BMJ 2012; 344: e3318.
Article PubMed Google Scholar
Tzoulaki I, Liberopoulos G, Ioannidis JP . Assessment of claims of improved prediction beyond the Framingham risk score. JAMA 2009; 302: 2345–2352.
Article CAS PubMed Google Scholar
Anothaisintawee T, Teerawattananon Y, Wiratkapun C, Kasamesup V, Thakkinstian A . Risk prediction models of breast cancer: a systematic review of model performances. Breast Cancer Res Treat 2012; 133: 1–10.
Article PubMed Google Scholar
Haas LR, Takahashi PY, Shah ND, Stroebel RJ, Bernard ME, Finnie DM et al. Risk-stratification methods for identifying patients for care coordination. Am J Manag Care 2013; 19: 725–732.
PubMed Google Scholar
Morris JN, Howard EP, Steel K, Schreiber R, Fries BE, Lipsitz LA et al. Predicting risk of hospital and emergency department use for home care elderly persons through a secondary analysis of cross-national data. BMC Health Serv Res 2014; 14: 519.
Article PubMed PubMed Central Google Scholar
Williams LM, Rush AJ, Koslow SH, Wisniewski SR, Cooper NJ, Nemeroff CB et al. International Study to Predict Optimized Treatment for Depression (iSPOT-D), a randomized clinical trial: rationale and protocol. Trials 2011; 12: 4.
Article PubMed PubMed Central Google Scholar
Burke JF, Hayward RA, Nelson JP, Kent DM . Using internally developed risk models to assess heterogeneity in treatment effects in clinical trials. Circ Cardiovasc Qual Outcomes 2014; 7: 163–169.
Article PubMed PubMed Central Google Scholar
Willke RJ, Zheng Z, Subedi P, Althin R, Mullins CD . From concepts, theory, and evidence of heterogeneity of treatment effects to methodological approaches: a primer. BMC Med Res Methodol 2012; 12: 185.
Article PubMed PubMed Central Google Scholar
Li C, Lu Y . Evaluating the improvement in diagnostic utility from adding new predictors. Biom J 2010; 52: 417–435.
Article PubMed PubMed Central Google Scholar
Neugebauer R, Schmittdiel JA, van der Laan MJ . Targeted learning in real-world comparative effectiveness research with time-varying interventions. Stat Med 2014; 33: 2480–2520.
Article PubMed Google Scholar
Anglemyer A, Horvath HT, Bero L Healthcare outcomes assessed with observational study designs compared with those assessed in randomized trials. Cochrane Database Syst Rev 2014; (4): MR000034.
Jain FA, Hunter AM, Brooks JO 3rd, Leuchter AF . Predictive socioeconomic and clinical profiles of antidepressant response and remission. Depress Anxiety 2013; 30: 624–630.
Article CAS PubMed Google Scholar
Perlis RH . A clinical risk stratification tool for predicting treatment resistance in major depressive disorder. Biol Psychiatry 2013; 74: 7–14.
Article PubMed PubMed Central Google Scholar
Cuijpers P, Reynolds CF 3rd, Donker T, Li J, Andersson G, Beekman A . Personalized treatment of adult depression: medication, psychotherapy, or both? A systematic review. Depress Anxiety 2012; 29: 855–864.
Article PubMed Google Scholar
Simon GE, Perlis RH . Personalized medicine for depression: can we match patients with treatments? Am J Psychiatry 2010; 167: 1445–1455.
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

PdJ was supported by a VICI grant (no: 91812607) from the Netherlands Organization for Scientific Research (NWO-ZonMW). The NCS data collection was supported by the National Institute of Mental Health (NIMH; R01MH46376). The NCS-2 data collection was supported by the National Institute on Drug Abuse (NIDA; R01DA012058). Data analysis for this paper was additionally supported by NIMH grants R01MH070884 and U01MH060220, with supplemental support from the Substance Abuse and Mental Health Services Administration (SAMHSA), the Robert Wood Johnson Foundation (RWJF; Grant 044780) and the John W. Alden Trust. The NCS-2 is carried out in conjunction with the World Health Organization World Mental Health (WMH) Survey Initiative. We thank the staff of the WMH Data Collection and Data Analysis Coordination Centres for assistance with instrumentation, fieldwork and consultation on data analysis. These activities were supported by the NIMH (R01MH070884), the John D and Catherine T MacArthur Foundation, the Pfizer Foundation, the US Public Health Service (R13MH066849, R01MH069864 and R01DA016558), the Fogarty International Center (FIRCA R03TW006481), the Pan American Health Organization, Eli Lilly and Company, Ortho-McNeil Pharmaceutical, GlaxoSmithKline and Bristol-Myers Squibb.

Author information

Authors and Affiliations

Department of Health Care Policy, Harvard Medical School, Boston, MA, USA
R C Kessler, D D Ebert, I Hwang, M V Petukhova, A J Rosellini, N A Sampson & A M Zaslavsky
Interdisciplinary Center Psychopathology and Emotion Regulation, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands
H M van Loo, K J Wardenaar, P de Jonge & R A Schoevers
Department of Veterans Affairs, Office of Public Health, Washington, DC, USA
R M Bossarte
Departments of Physical Medicine and Rehabilitation, Psychiatry, and Neurology, University of Colorado, Anschutz Medical Campus, Aurora, Colorado; Rocky Mountain Mental Illness Research Education and Clinical Center,
L A Brenner
Rocky Mountain Mental Illness Research Education and Clinical Center, Denver, CO, USA
L A Brenner
Department of Biostatistics, Harvard School of Public Health, Boston, MA, USA
T Cai & J Li
Department of Psychology, Clinical Psychology and Psychotherapy, Friedrich-Alexander University Nuremberg-Erlangen, Erlangen, Germany
D D Ebert
Department of Psychiatry and Depression Clinical and Research Program, Harvard Medical School and Massachusetts General Hospital, Boston, MA, USA
A A Nierenberg
Epidemiology, Janssen Research & Development, LLC, Titusville, NJ, USA
M A Wilcox

Authors

R C Kessler
View author publications
You can also search for this author in PubMed Google Scholar
H M van Loo
View author publications
You can also search for this author in PubMed Google Scholar
K J Wardenaar
View author publications
You can also search for this author in PubMed Google Scholar
R M Bossarte
View author publications
You can also search for this author in PubMed Google Scholar
L A Brenner
View author publications
You can also search for this author in PubMed Google Scholar
T Cai
View author publications
You can also search for this author in PubMed Google Scholar
D D Ebert
View author publications
You can also search for this author in PubMed Google Scholar
I Hwang
View author publications
You can also search for this author in PubMed Google Scholar
J Li
View author publications
You can also search for this author in PubMed Google Scholar
P de Jonge
View author publications
You can also search for this author in PubMed Google Scholar
A A Nierenberg
View author publications
You can also search for this author in PubMed Google Scholar
M V Petukhova
View author publications
You can also search for this author in PubMed Google Scholar
A J Rosellini
View author publications
You can also search for this author in PubMed Google Scholar
N A Sampson
View author publications
You can also search for this author in PubMed Google Scholar
R A Schoevers
View author publications
You can also search for this author in PubMed Google Scholar
M A Wilcox
View author publications
You can also search for this author in PubMed Google Scholar
A M Zaslavsky
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to R C Kessler.

Ethics declarations

Competing interests

RCK has been a consultant for Hoffman La Roche, Johnson & Johnson Wellness and Prevention and Sonofi-Aventis Groupe; has served on an advisory board for Lake Nona Institute; and owns stock in DataStat. AAN has been a consultant for Abbott Laboratories, American Psychiatric Association, Appliance Computing (Mindsite), Basliea, Brain Cells, Brandeis University, Bristol-Myers Squibb, Clintara, Corcept, Dey Pharmaceuticals, Dainippon Sumitomo (now Sunovion), Eli Lilly and Company, EpiQ, L.P./Mylan, Forest, Genaissance, Genentech, GlaxoSmithKline, Hoffman La Roche, Infomedic, Lundbeck, Janssen Pharmaceutica, Jazz Pharmaceuticals, Medavante, Merck, Methylation Sciences, Naurex, Novartis, PamLabs, Pfizer, PGx Health, Ridge Diagnostics Shire, Schering-Plough, Somerset, Sunovion, Takeda Pharmaceuticals, Targacept and Teva; consulted through the MGH Clinical Trials Network and Institute (CTNI) for Astra Zeneca, Brain Cells, Dianippon Sumitomo/Sepracor, Johnson and Johnson, Labopharm, Merck, Methylation Science, Novartis, PGx Health, Shire, Schering-Plough, Targacept and Takeda/Lundbeck Pharmaceuticals; had grant/research support from the American Foundation for Suicide Prevention, AHRQ, Brain and Behavior Research Foundation, Bristol-Myers Squibb, Cederroth, Cephalon, Cyberonics, Elan, Eli Lilly, Forest, GlaxoSmithKline, Janssen Pharmaceutica, Lichtwer Pharma, Marriott Foundation, Mylan, NIMH, PamLabs, PCORI, Pfizer Pharmaceuticals, Shire, Stanley Foundation, Takeda and Wyeth-Ayerst; received honoraria from Belvoir Publishing, University of Texas Southwestern Dallas, Brandeis University, Bristol-Myers Squibb, Hillside Hospital, American Drug Utilization Review, American Society for Clinical Psychopharmacology, Baystate Medical Center, Columbia University, CRICO, Dartmouth Medical School, Health New England, Harold Grinspoon Charitable Foundation, IMEDEX, International Society for Bipolar Disorder, Israel Society for Biological Psychiatry, Johns Hopkins University, MJ Consulting, New York State, Medscape, MBL Publishing, MGH Psychiatry Academy, National Association of Continuing Education, Physicians Postgraduate Press, SUNY Buffalo, University of Wisconsin, University of Pisa, University of Michigan, University of Miami, University of Wisconsin at Madison, APSARD, ISBD, SciMed, Slack Publishing and Wolters Klower Publishing; owns stock in Appliance Computing (MindSite), Brain Cells, Medavante; and owns the following copyrights: Clinical Positive Affect Scale and the MGH Structured Clinical Interview for the Montgomery Asberg Depression Scale exclusively licensed to the MGH Clinical Trials Network and Institute (CTNI). MAW is an employee of Janssen Pharmaceuticals. HMvL, KJW, RMB, LAB, TC, DDE, IH, JL, PdJ, MVP, AJR, NAS, RAS and AMZ declare no conflict of interest.

Additional information

A complete list of NCS and NCS-2 publications can be found at http://www.hcp.med.harvard.edu/ncs.

Disclaimer

The views, opinions and/or findings contained in this article are those of the authors and should not be construed as an official Department of Veterans Affairs position, policy or decision unless so designated by other documentation, or the views of any of the sponsoring organizations, agencies or the US Government.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Kessler, R., van Loo, H., Wardenaar, K. et al. Testing a machine-learning algorithm to predict the persistence and severity of major depressive disorder from baseline self-reports. Mol Psychiatry 21, 1366–1371 (2016). https://doi.org/10.1038/mp.2015.198

Download citation

Received: 22 May 2015
Revised: 30 September 2015
Accepted: 26 October 2015
Published: 05 January 2016
Issue Date: October 2016
DOI: https://doi.org/10.1038/mp.2015.198

This article is cited by

Early antidepressant treatment response prediction in major depression using clinical and TPH2 DNA methylation features based on machine learning approaches
- Bingwei Chen
- Zhigang Jiao
- Zhi Xu
BMC Psychiatry (2023)
A hybrid machine learning model of depression estimation in home-based older adults: a 7-year follow-up study
- Shaowu Lin
- Yafei Wu
- Ya Fang
BMC Psychiatry (2022)
Predicting non-response to multimodal day clinic treatment in severely impaired depressed patients: a machine learning approach
- Johannes Simon Vetter
- Katharina Schultebraucks
- Birgit Kleim
Scientific Reports (2022)
Machine learning model for predicting Major Depressive Disorder using RNA-Seq data: optimization of classification approach
- Pragya Verma
- Madhvi Shakya
Cognitive Neurodynamics (2022)
Applications of machine learning to behavioral sciences: focus on categorical data
- Pegah Dehghan
- Hany Alashwal
- Ahmed A. Moustafa
Discover Psychology (2022)