Training a machine learning classifier to identify ADHD based on real-world clinical data from medical records

Mikolas, Pavol; Vahid, Amirali; Bernardoni, Fabio; Süß, Mathilde; Martini, Julia; Beste, Christian; Bluschke, Annet

doi:10.1038/s41598-022-17126-x


Article
Open access
Published: 28 July 2022

Pavol Mikolas^1,2, Amirali Vahid¹, Fabio Bernardoni³, Mathilde Süß¹, Julia Martini², Christian Beste¹ & Annet Bluschke¹

Scientific Reports volume 12, Article number: 12934 (2022)

Abstract

The diagnostic process of attention deficit hyperactivity disorder (ADHD) is complex and relies on criteria sensitive to subjective biases. This may cause significant delays in appropriate treatment initiation. An automated analysis relying on subjective and objective measures might not only simplify the diagnostic process and reduce the time to diagnosis, but also improve reproducibility. While recent machine learning studies have succeeded at distinguishing ADHD from healthy controls, the clinical process requires differentiating among other or multiple psychiatric conditions. We trained a linear support vector machine (SVM) classifier to detect participants with ADHD in a population showing a broad spectrum of psychiatric conditions using anonymized data from clinical records (N = 299 participants). We differentiated children and adolescents with ADHD from those not having the condition with an accuracy of 66.1%. SVM using single features showed slight differences between features and overlapping standard deviations of the achieved accuracies. An automated feature selection achieved the best performance using a combination of 19 features. Real-world clinical data from medical records can be used to automatically identify individuals with ADHD among help-seeking individuals using machine learning. The relevant diagnostic information can be reduced using an automated feature selection without loss of performance. A broad combination of symptoms across different domains, rather than specific domains, seems to indicate an ADHD diagnosis.

Introduction

The diagnostic process in the case of suspected attention deficit hyperactivity disorder (ADHD) commonly entails collecting a substantial amount of data and is thus complex, time-consuming and costly. A substantial amount of data, however, is necessary to distinguish whether ADHD underlies the particular pattern of observed symptoms as opposed to norm variants of behavior, possible differential diagnoses, or comorbidities occurring in addition to ADHD^1,2. Overall, this extensive diagnostic process relies on criteria highly sensitive to subjective biases (for discussion see Faraone et al.³) and may result in delays in treatment initiation. This is particularly unfortunate given that effective treatments for ADHD are readily available. Thus, it should be paramount to streamline, shorten and specify the diagnostic process. To achieve this, it is necessary to identify the most relevant aspects of data that predict the diagnostic outcome. This may be possible via the application of machine learning techniques.

In recent years, machine learning has made remarkable progress, from its application in detecting between-group differences to making predictions on the individual level⁴. Concerning ADHD, previous studies based on clinical and/or neuroimaging data have performed automated classifications to distinguish between ADHD and typically developing individuals with classification accuracies ranging from 62 to 89.5%^5,6,7,8,9. Unfortunately, this dichotomous distinction between the labels of "typically developing" and "ADHD" does not reflect the question typically asked in the clinical setting. Even though this question is clinically much more relevant, to our knowledge no study so far has attempted to apply machine learning in order to predict whether the diagnostic outcome will be "ADHD" or "something else" (i.e., a norm variant of behaviour or another psychiatric diagnosis) in a broad spectrum of clinical conditions within a help-seeking population.

Since neuroimaging or genetic data are not (yet) part of the routine diagnostic process for ADHD due to limitations in cross-sample reliability/validity as well as in sensitivity and specificity¹⁰ and may result in lower classification accuracy than clinical measures^5,6, it is currently still necessary to focus on readily available behavioural/clinical data including demographic information, subjective symptom ratings, and objective neuropsychological data.

Demographic data like male gender, severe early onset and familial predispositions¹¹ are associated with a higher risk for ADHD. Self-report symptom rating scales are less reliable than informant ratings¹², and studies further report low to medium correlations between parent and teacher ratings¹³. To account for these differences, it has been suggested to use the degree of consistency between them as an indicator of ADHD symptom severity¹⁴.

Neuropsychological tests are a further important component of the data collected during the diagnostic process. Lower overall IQ¹⁵, as well as difficulties in working memory¹ and processing speed^16,17, have been proposed to distinguish between individuals with ADHD and typically developing controls. Verbal comprehension and logical reasoning, in turn, are not systematically reduced in children with ADHD¹⁸. Overall, reductions in general or subscale-specific IQ are not specific to ADHD. Instead, the label "ADHD" explicitly points to difficulties in attentional processes (e.g. Günther et al.¹⁹). Evidence for impairments in terms of accuracy and reaction time variability²⁰ in tests pertaining to inhibition²¹ seems to be particularly strong. Similarly, this is the case regarding the intensity domain of attention²². Specifically, this concerns omission errors occurring in tests of sustained attention/tonic alertness²³. Evidence is rather mixed concerning the selectivity domain of attention^21,22.

In this proof-of-concept study, we attempted to train a machine learning model to predict the diagnostic outcome of "ADHD" in a help-seeking clinical sample. To our knowledge, this is the first study that attempted to train a machine learning classifier on anonymized real-world clinical data and to distinguish children/adolescents with an ADHD diagnosis from those with none or other diagnoses. In addition to well-established neuropsychological measures and individual symptom ratings, we included features capturing the degree of consistency between parents' and teachers' ratings. In order to test possible implications for shortening the diagnostic process, we assessed the predictive information of every single feature. Moreover, we attempted to reduce the necessary diagnostic information using a data driven, automated feature selection.

Methods

Participants

The standardized diagnostic process included several consultations with the child and caregivers together and individually. Parents and (nursery) school teachers completed general and ADHD-specific rating scales. Further, general intelligence and attention were assessed via standardized testing batteries. In addition, somatic conditions which may contribute to any existing attention problems were excluded (e.g., laboratory measures, ophthalmological and ENT evaluations, EEG). The final diagnostic decision was given strictly based on ICD-10 clinical criteria assessed by a senior specialist in child and adolescent psychiatry or psychology.

This study was based exclusively on data from clinical records. We extracted the data of help-seeking individuals who were referred to our secondary care outpatients unit with a suspected ADHD diagnosis, or in whom an ADHD diagnosis was the suspected diagnosis after the initial consultation. The group labeled "ADHD" included patients who had received one of the following diagnoses: attention deficit hyperactivity disorder (F90.0), hyperkinetic conduct disorder (F90.1), or attention deficit disorder without hyperactivity (F98.80). Importantly, not all psychiatric comorbidities constituted an exclusion criterion for the ADHD group (see below). The "non-ADHD" group contained patients who did not fulfill diagnostic criteria for ADHD. Socio-demographic and clinical characteristics of the sample (N = 299) are presented in Table 1. Individuals who were classified in the ADHD group were significantly more often male (chi² = 6.871, p = .009) and younger (t = 2.038, df = 290, p = .043).

Table 1 Socio-demographic characteristics of the sample.

Data sets were included in the study if ADHD had been suspected at the beginning of the diagnostic process, patients were younger than 18 years at the beginning of the diagnostic procedure, and at least 2 out of 3 attention test scores from the TAP diagnostic battery (for details, see below) were available²⁴. Data sets were excluded if neurological or genetic disorders, endocrine disorders (incl. uncorrected hypo- or hyperthyroidism), or other severe documented medical comorbidities on Axis IV had been identified.

Data collection

We extracted data from medical records of the Department of Child and Adolescent Psychiatry and Psychotherapy at the Medical Faculty of the Technical University Dresden from 2015 to 2020. As we used anonymized data from a clinical register, in alignment with the Saxony Hospital Act §34 Section 1, the informed consent was waived by the Ethics Committee of the Medical Faculty of the Technische Universität Dresden, Germany (No: EK31012016), who also approved the study. The study was performed in accordance with the Declaration of Helsinki. Briefly, we extracted the following 92 features from the clinical records (for a detailed summary, see Supplementary table 1):

I. Demographic variables (age and gender);
II. Symptom ratings (Conners-3 parent/teacher ratings²⁵; parent version of the Child Behavior Checklist (CBCL) and its school equivalent, the Teacher's Report Form (TRF)²⁶; Strengths and Difficulties Questionnaire parent (SDQ-P) and teacher (SDQ-T) versions²⁷). To account for age and gender differences amongst patients, we used normed T-values as features in all cases. Additionally, we computed a set of 'consistency indices' describing the consistency between parent and teacher ADHD-specific Conners-3 ratings (for details see Supplementary note 1);
III. Neuropsychological measures. Three subtests from the TAP, a commonly used German computer-based assessment of attention in children and adolescents²⁴, were used to assess inhibition (GoNogo subtest), divided attention (Divided Attention subtest) and alertness (Alertness subtest). The Wechsler Intelligence Scale for Children IV or V^28,29 was used to measure general intelligence. To generate compatibility between versions IV and V, we used the average of the visual-spatial index and the fluid reasoning index as 'perceptual reasoning' in participants who completed the WISC V. For the attention measures, we used the T-values as features. For the intelligence measures, we used the standardized IQ scores as features.

Machine learning classification

Prior to classification, we discarded all features with > 20% of missing values (N = 62 features), as well as all participants with > 20% of missing features (N = 150 participants, 49%, (i.e. n = 70 participants) from the ADHD group). We determined the 20% cutoff as a compromise solution to preserve a diverse set of features without too strongly negatively impacting the accuracy due to too many missing values³⁰. The final dataset was comprised of 292 participants and 30 features (Table 1). As the support vector machine (SVM) classifier cannot handle missing values, some imputations were necessary in order to retain the most participants and features. We imputed the sample mean (continuous variables) or mode (discrete variables). As an alternative approach to data imputation, we performed a supplementary analysis after discarding all subjects containing missing data on a dataset of N = 248 (53.2% ADHD). Finally, to eliminate the effect of a different range of features on classification performance, all features were normalized into a z-score.
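The preprocessing steps described above (missingness filter, mean/mode imputation, z-scoring) can be sketched as follows. This is a minimal illustration in plain Python, not the authors' code: the function names, the use of `None` as the missing-value marker, the choice to compute participant missingness on the already reduced feature set, and the use of the population standard deviation are all assumptions made for the example, and the toy data are invented.

```python
from statistics import mean, mode, pstdev

def drop_sparse(rows, max_missing=0.20):
    """Drop features, then participants, with more than `max_missing` missing values."""
    n_rows, n_cols = len(rows), len(rows[0])
    keep_cols = [j for j in range(n_cols)
                 if sum(r[j] is None for r in rows) / n_rows <= max_missing]
    reduced = [[r[j] for j in keep_cols] for r in rows]
    kept = [r for r in reduced
            if sum(v is None for v in r) / len(r) <= max_missing]
    return kept, keep_cols

def impute(rows, discrete_cols=()):
    """Fill gaps with the column mean (continuous) or mode (discrete)."""
    out = [list(r) for r in rows]
    for j in range(len(rows[0])):
        observed = [r[j] for r in rows if r[j] is not None]
        fill = mode(observed) if j in discrete_cols else mean(observed)
        for r in out:
            if r[j] is None:
                r[j] = fill
    return out

def zscore(rows):
    """Standardise every column to mean 0 and standard deviation 1."""
    stats = [(mean(c), pstdev(c)) for c in zip(*rows)]
    return [[(v - m) / s if s > 0 else 0.0 for v, (m, s) in zip(r, stats)]
            for r in rows]

# Invented toy data: 5 participants x 6 features, None marks a missing value.
raw = [
    [10.0, 1,    None, 50.0, 5.0, 100.0],
    [12.0, 0,    None, 55.0, 6.0, 110.0],
    [None, 1,    None, 60.0, 7.0, 120.0],
    [14.0, 1,    3.0,  65.0, 8.0, 130.0],
    [16.0, None, None, None, 9.0, 140.0],
]
kept, kept_cols = drop_sparse(raw)         # feature 2 and the last participant are dropped
filled = impute(kept, discrete_cols={1})   # column 1 is treated as discrete
scaled = zscore(filled)
```

In this toy example the third feature (80% missing) and the last participant (40% missing on the remaining features) are removed, and the single remaining gap is filled with the column mean.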

We used a linear SVM classifier to classify the participants into ADHD and non-ADHD groups in three ways. First, to assess the SVM classifier's performance on the whole dataset, we used the complete set of 30 features for training and testing. Second, we assessed the importance of single features for the classification by performing the classification using one feature at a time (i.e., we repeated the above-mentioned training and testing phase, including the k-fold cross-validation described below, with a single feature at a time, obtaining 30 single-feature classifiers). We chose this procedure rather than reporting the SVM weights, as those cannot be interpreted regarding the importance of single features³¹. Finally, to optimize the algorithm's performance, we aimed to eliminate irrelevant features in a data-driven way. Similar to our previous work³², we used the sequential floating forward selection (SFFS)^33,34 implemented in MATLAB 2017a (Mathworks Inc.) for this purpose. An SFFS feature selection combines two separate algorithms. Sequential forward selection (SFS) starts from an empty set of features and sequentially adds the feature that results in the highest classifier accuracy when combined with the already selected features. Sequential backward selection (SBS) works in the opposite direction, removing the feature whose removal leads to the highest accuracy. In SFFS, each feature selection step comprises SFS and SBS³². After adding each feature, we performed an SVM classification using the selected set of features. We performed the train and test procedures using a standard k-fold cross-validation method (k = 10)^4,35,36 (for details see Supplementary note 2 and Supplementary figure 1). We calculated the classification accuracy (i.e. the number of correct predictions divided by the number of all predictions made) as the average accuracy on all folds and reported the standard deviation of the achieved accuracies.
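To make the interplay of the forward and backward steps concrete, the following is a minimal Python sketch of a sequential floating forward selection. It is not the MATLAB implementation used in the study. The function `sffs`, the rule that a backward step is only accepted when it beats the best subset previously seen at the smaller size, and the toy scoring function are assumptions for illustration; in the study's setting the `score` argument would be a cross-validated SVM accuracy.

```python
from math import inf

def sffs(features, score, max_size=None):
    """Sequential floating forward selection over `features`.

    `score` maps a list of features to a number (higher is better).
    Returns (best_subset, best_score).
    """
    max_size = max_size or len(features)
    selected = []
    best = {}  # subset size -> (score, subset) of the best subset seen at that size

    while len(selected) < max_size:
        # Forward step (SFS): add the feature that gives the highest score.
        candidates = [f for f in features if f not in selected]
        added = max(candidates, key=lambda f: score(selected + [f]))
        selected = selected + [added]
        s = score(selected)
        if s > best.get(len(selected), (-inf, None))[0]:
            best[len(selected)] = (s, list(selected))

        # Floating backward steps (SBS): drop a feature for as long as this
        # beats the best subset previously found at the smaller size.
        while len(selected) > 2:
            reduced = max(([g for g in selected if g != f] for f in selected),
                          key=score)
            s_red = score(reduced)
            if s_red > best[len(reduced)][0]:
                selected = reduced
                best[len(reduced)] = (s_red, list(reduced))
            else:
                break

    best_score, best_subset = max(best.values(), key=lambda item: item[0])
    return best_subset, best_score

# Toy scoring function (hypothetical): features "a" and "c" help, others hurt slightly.
USEFUL = {"a", "c"}

def toy_score(subset):
    helpful = sum(1 for f in subset if f in USEFUL)
    harmful = sum(1 for f in subset if f not in USEFUL)
    return helpful - 0.1 * harmful

subset, value = sffs(["a", "b", "c", "d"], toy_score)
```

With the toy score, the search keeps the two helpful features and reports them as the best subset, even though it continues to grow the set up to `max_size` before returning.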

Since this was a population-based study, the ADHD and non-ADHD participants were not matched by age and gender. To check that the classification was based on ADHD-specific traits rather than predominantly demographic variables (age, sex), we compared correctly vs. incorrectly predicted participants using a t test and a chi-square test, where applicable. A significant difference in some demographic variables (e.g., age) would indicate that the classifier would have a limited validity/range of applicability. To further assess the contribution of demographic variables to the classification, we also performed a secondary analysis repeating the primary SVM classification using all the features listed in Table 2 except for age and sex.
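For the categorical comparison (e.g., sex in correctly vs. incorrectly classified participants), a chi-square test on a 2 × 2 table can be sketched in plain Python as below. The function names and the example counts are hypothetical, and the sketch assumes a Pearson statistic without continuity correction; the source does not state which variant was used. For one degree of freedom, the upper-tail p value has the closed form erfc(sqrt(chi²/2)).

```python
from math import erfc, sqrt

def chi_square_2x2(a, b, c, d):
    """Pearson chi-square statistic (no continuity correction) for [[a, b], [c, d]]."""
    n = a + b + c + d
    return n * (a * d - b * c) ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))

def chi_square_p_df1(chi2):
    """Upper-tail p value of a chi-square statistic with one degree of freedom."""
    return erfc(sqrt(chi2 / 2.0))

# Hypothetical counts: rows = correctly / incorrectly classified, columns = male / female.
stat = chi_square_2x2(10, 20, 20, 10)
p_example = chi_square_p_df1(stat)

# Converting the chi-square statistics reported in the text (df = 1) to p values.
p_gender_by_correctness = chi_square_p_df1(0.171)
p_gender_by_group = chi_square_p_df1(6.871)
```

Applied to the statistics reported in the text, the conversion gives p values that round to .679 and .009, in line with the reported figures.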

Table 2 Ranking of features according to the classification accuracy when used as single feature in an SVM model.

Results

SVM classification

The classification using the complete set of 30 features yielded an average accuracy of 66.1% (obtained from the true label) (SD = 8%, sensitivity = 66.9%, specificity = 65.4%, AUC = 0.66). The classifier falsely identified 18.2% of ADHD patients as not having ADHD (type 2 error). Conversely, 15.8% of patients without ADHD were falsely identified as having the condition (type 1 error). The permutation test showed that the accuracy is higher than randomly assigned labels (p value = .001). The correctly and incorrectly classified participants did not significantly differ in age (t = − .733, df = 290, p = .464), gender (χ²(1) = .171, p = .679) and total IQ (t = 1.173, df = 290, p = .242).
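The performance figures and the permutation test reported above follow standard definitions, which the sketch below illustrates in plain Python. It is not the authors' analysis: the function names, the number of permutations (1000), the random seed and the toy labels are assumptions. In a full analysis the scoring function would retrain and cross-validate the classifier on each set of permuted labels; here, for brevity, fixed predictions are simply scored against shuffled labels.

```python
import random

def classification_metrics(y_true, y_pred):
    """Accuracy, sensitivity and specificity for binary labels (1 = ADHD, 0 = non-ADHD)."""
    tp = sum(t == 1 and p == 1 for t, p in zip(y_true, y_pred))
    tn = sum(t == 0 and p == 0 for t, p in zip(y_true, y_pred))
    fp = sum(t == 0 and p == 1 for t, p in zip(y_true, y_pred))
    fn = sum(t == 1 and p == 0 for t, p in zip(y_true, y_pred))
    return {
        "accuracy": (tp + tn) / len(y_true),
        "sensitivity": tp / (tp + fn),   # share of ADHD cases detected
        "specificity": tn / (tn + fp),   # share of non-ADHD cases detected
    }

def permutation_test(labels, score_fn, n_permutations=1000, seed=0):
    """Return the observed score and the share of label permutations scoring at least as high."""
    rng = random.Random(seed)
    observed = score_fn(labels)
    shuffled = list(labels)
    at_least = 0
    for _ in range(n_permutations):
        rng.shuffle(shuffled)
        if score_fn(shuffled) >= observed:
            at_least += 1
    # The +1 terms count the observed labelling itself, so p is never exactly 0.
    return observed, (at_least + 1) / (n_permutations + 1)

# Invented toy example: 40 participants, 15 of 20 correct in each group.
y_true = [1] * 20 + [0] * 20
y_pred = [1] * 15 + [0] * 5 + [0] * 15 + [1] * 5
metrics = classification_metrics(y_true, y_pred)
observed, p_value = permutation_test(
    y_true, lambda labels: classification_metrics(labels, y_pred)["accuracy"])
```

With 1000 permutations the smallest attainable p value under this formula is 1/1001, i.e. roughly .001.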

We ranked the features according to the achieved classification accuracy when exclusively one feature was used for testing and training (Table 2). For a graphical interpretation including standard deviations, see Fig. 1.
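The single-feature ranking can be illustrated with the following plain-Python sketch of a 10-fold cross-validation that reports the mean and standard deviation of the fold accuracies per feature. Several things here are assumptions rather than the study's procedure: a nearest-class-mean rule on one feature stands in for the linear SVM, the folds are not stratified, and the two features are synthetic.

```python
import random
from statistics import mean, pstdev

def kfold_indices(n, k=10, seed=0):
    """Shuffle the indices 0..n-1 and split them into k roughly equal folds."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    return [idx[i::k] for i in range(k)]

def cv_accuracy_single_feature(x, y, k=10, seed=0):
    """Mean and SD of fold accuracies for a one-feature nearest-class-mean rule."""
    accuracies = []
    for test_idx in kfold_indices(len(x), k, seed):
        held_out = set(test_idx)
        train = [i for i in range(len(x)) if i not in held_out]
        m1 = mean(x[i] for i in train if y[i] == 1)   # class means from training folds only
        m0 = mean(x[i] for i in train if y[i] == 0)
        correct = sum((abs(x[i] - m1) < abs(x[i] - m0)) == (y[i] == 1)
                      for i in test_idx)
        accuracies.append(correct / len(test_idx))
    return mean(accuracies), pstdev(accuracies)

def rank_features(columns, y, k=10):
    """Sort features by their mean single-feature cross-validated accuracy."""
    results = {name: cv_accuracy_single_feature(x, y, k)
               for name, x in columns.items()}
    return sorted(results.items(), key=lambda kv: kv[1][0], reverse=True)

# Synthetic data: one feature separates the groups, the other is pure noise.
rng = random.Random(1)
y = [i % 2 for i in range(100)]
columns = {
    "noise": [rng.gauss(0.0, 1.0) for _ in y],
    "informative": [rng.gauss(3.0 * label, 1.0) for label in y],
}
ranking = rank_features(columns, y)
```

Each entry of `ranking` is a pair of the feature name and its (mean accuracy, SD) tuple, which mirrors the kind of information summarised in Table 2 and Fig. 1.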

The automated feature-selection procedure achieved a maximum classification accuracy of 68.1% using a set of 19 features (Supplementary table 2).

Secondary classification without demographic features

In order to determine the predictive value of non-demographic features, we excluded the demographic features (age and sex) from training and classification in a secondary analysis. The model achieved an accuracy of 65.1% (sensitivity = 64.7%, specificity = 65.4%, AUC = 0.663). A permutation test revealed this performance was significantly above chance (p = .001).

Secondary classification without missing data

In order to estimate the influence of missing data on the classification performance, we retrained the classifier using the automatically selected set of the 19 most predictive features (Supplementary table 2) only on subjects without any missing data. The SVM achieved an accuracy of 68.8% (SD = 8.5%, sensitivity = 63.3%, specificity = 73.9%, AUC = 0.696).

Discussion

In this machine learning study, we differentiated help-seeking children and adolescents with ADHD from those not having the condition with an accuracy of 66.1% using real-world clinical data from hospital records. Excluding demographic features (age and gender) resulted in a comparable accuracy. An automated feature selection achieved the best performance using a combination of the 19 most predictive features across attention and intelligence domains and symptom ratings. The accuracy might be further increased using datasets without missing data. The consistency index of parent and teacher ratings did not outperform conventional features. Our study suggests that ADHD can be identified using data from clinical records even in a mixed, help-seeking population of children and adolescents.

Machine learning studies require large amounts of data³¹ which may be challenging to collect by recruiting participants for a specific study but are readily available in clinical databases. Moreover, the results from experimental studies might not generalize to a clinical setting, where clinicians are commonly confronted with multiple/concurrent disorders and/or various potential differential diagnoses. Thus, we showed that SVM in combination with real-world, comprehensive clinical data could yield an above-chance classification accuracy and detect individuals with ADHD among those having none or different condition(s).

To our knowledge, the highest accuracy achieved in studies of ADHD patients and healthy individuals was 89.5%⁸. Although we used more features than this study, the resulting accuracy was lower. This might be because many help-seeking individuals in our sample received other diagnoses associated with symptoms that may mimic ADHD (such as attention deficits in depression, increased activity in tic disorders, etc.). Thus, the two groups (ADHD vs. "something else") are not as clearly differentiated from each other as would be the case when distinguishing between individuals with confirmed ADHD diagnoses and those not showing any symptoms at all. Previous studies aiming at distinguishing more than one disorder from typically developing controls reported lower classification accuracies than studies aiming at classifying typically developing individuals and patients with one condition³⁷.

Age and gender were shown to be useful for diagnostic and prognostic tools based on machine learning in previous studies^38,39. This was also the case in the current study. In this study, instead of identifying physiological patterns typical of ADHD, we aimed to train a classifier to identify ADHD based on data available from medical records. As typical age and gender distributions of ADHD may naturally be reflected in this data structure, which may constitute a sampling bias, conducting a second analysis without these features was essential. This analysis without age and gender still revealed a significant classification accuracy, demonstrating that the neuropsychological features and ADHD-specific ratings on their own are sufficient to identify ADHD in a mixed patient sample.

Previous studies have opted not to include clinical ratings in the analysis to avoid possible subjective biases⁸. We addressed this issue by using the consistency index above, which did not outperform conventional ADHD-specific features like parent/teacher-rated symptoms. The automatic feature selection also emphasized only rather unspecific symptoms like peer relations, aggression, and teacher negative impression bias. These results suggest that clinical ratings capturing broader ADHD-related behavioral irregularities (i.e., not simply pertaining to ADHD core symptoms) as reported by different sources using the Conners-3 questionnaire are informative when aiming to identify ADHD amongst a help-seeking clinical population. This may reflect the notion that the rather qualitative "clinical impression" of ADHD plays a significant role in the diagnostic process⁴⁰. Similarly, this may also be interpreted as showing that a rather broad functional impairment associated with ADHD symptoms (with regard to social interactions, for example) is indicative of diagnostic classification in the clinical setting. This issue could be examined further by including clinician rating scales or those capturing the degree of functional impairment⁴¹.

Among the neuropsychological measures, the total IQ score did not rank among the most predictive features. Previous machine learning studies suggesting IQ to be a predictive feature^5,6 included IQ scores as part of an overall "phenotypic" feature that also contained aspects like age and gender, making a specific interpretation impossible. In addition, these studies only focused on the distinction between individuals with ADHD and typically developing controls, thus reducing the validity of the results for clinical practice. It is the goal to distinguish ADHD from disorders or norm variants of behavior mimicking ADHD symptoms. Interestingly, the processing speed subscale ranked highest of all IQ-related features in the single feature classification. This may reflect the previously reported relevance of this aspect of neuropsychological processing¹⁶ when comparing individuals with ADHD and healthy controls. Within the automatic feature selection, reaction time variability and accuracy in tests capturing tonic/phasic alertness and inhibition ranked numerically higher than mean reaction times. While a previous study suggested that objective neuropsychological measures considerably underscored rating scales in distinguishing ADHD from healthy participants⁸, our results show that these scores in general indeed contribute to classification when identifying individuals with ADHD in a mixed help-seeking population. This supports the notion that objective measures like those employed in the current context are indeed important elements of the diagnostic process of ADHD as has been suggested previously⁴².

This study has the following limitations. First, we could not include broader clinical measures such as the CBCL as possible features due to too many missing values. These measures might have provided more specific information on differences between diagnostic entities. Similarly, father ratings also needed to be excluded due to missing data (although father ratings were included in the consistency index where possible). Retraining the classifier without missing data achieved a further increase in the classification accuracy (imputation 66.1% vs no missing data 68.8%). This suggests that an effort to simplify the diagnostic process in order to reduce the probability of missing data might increase the performance of automated classifiers. Second, although we tested generalizability indirectly using the permutation test, an independent validation sample would provide more precise information on the generalizability of our classifier. Third, the relative importance of single features needs to be interpreted carefully, considering the small differences in classification accuracy between the features and the relatively high standard deviations of the achieved accuracies. As Fig. 1 shows, the standard deviations of the classification accuracies for single features overlapped. Although our results suggest that some features might be superior to others, we cannot conclude that there are single outstanding features in our sample that distinguish individuals with a definite ADHD diagnosis from those with another or no psychiatric diagnosis. Overall, a further increase in classification performance might be achieved by using larger samples with more complete data on all clinically relevant features rather than adding new ones. Our results do not provide full implications for exclusion and/or prioritization of specific clinical ratings in future studies.

Conclusion

In conclusion, we provide a proof-of-concept that real-world clinical data from medical records might contribute to identification of ADHD among help-seeking individuals. In this context, age, gender, and accuracy/reaction time variability seem to play a marginally more critical role than other features. Further, ADHD core symptoms reported by parents and/or teachers do not seem to carry the degree of importance that might be assumed. Instead, results suggest that a relatively broad combination of symptoms across different domains indicates an eventual ADHD diagnosis. Overall, this implies that research endeavors aiming to identify biological and less subjective markers of ADHD need to be continued (see Faraone et al.³). Although the classification performed above chance (i.e. accuracy of 66.1%), the performance did not reach a level suggestive of possible clinical utility (i.e. 80% accuracy⁴³). Multimodal data (particularly neuroimaging and genetic data) might improve the recognition of psychiatric disorders using machine learning⁴⁴. In order to validate such recognition tools, multicentric data are necessary. In order to arrive at firm conclusions in this matter, there is a need for standardized recommendations for ADHD diagnostic procedures, such as specification of the obligatory attention domains, cognitive assessments and assessment scales used. These recommendations should also take into account the risk of missing data, finding compromises between a broad assessment and feasibility.

References

Nikolas, M. A., Marshall, P. & Hoelzle, J. B. The role of neurocognitive tests in the assessment of adult attention-deficit/hyperactivity disorder. Psychol. Assess. 31, 685–698 (2019).
Rowland, A. S., Lesesne, C. A. & Abramowitz, A. J. The epidemiology of attention-deficit/hyperactivity disorder (ADHD): A public health view. Ment. Retard. Dev. Disabil. Res. Rev. 8, 162–170 (2002).
Faraone, S. V. The scientific foundation for understanding attention-deficit/hyperactivity disorder as a valid psychiatric disorder. Eur. Child Adolesc. Psychiatry 14, 1–10 (2005).
Dwyer, D. B., Falkai, P. & Koutsouleris, N. Machine learning approaches for clinical psychology and psychiatry. Annu. Rev. Clin. Psychol. 14, 91–118 (2018).
Bohland, J. W., Saperstein, S., Pereira, F., Rapin, J. & Grady, L. Network, anatomical, and non-imaging measures for the prediction of ADHD diagnosis in individual subjects. Front. Syst. Neurosci. 6, 78 (2012).
Brown, M. R. G. et al. ADHD-200 Global Competition: Diagnosing ADHD using personal characteristic data can outperform resting state fMRI measurements. Front. Syst. Neurosci. 6, 69 (2012).
Christiansen, H. et al. Use of machine learning to classify adult ADHD and other conditions based on the Conners’ Adult ADHD Rating Scales. Sci. Rep. 10, 18871 (2020).
Emser, T. S. et al. Assessing ADHD symptoms in children and adults: Evaluating the role of objective measures. Behav. Brain Funct. 14, 11 (2018).
Zhang-James, Y. et al. Machine-learning prediction of comorbid substance use disorders in ADHD youth using Swedish registry data. J. Child Psychol. Psychiatry 61, 1370–1379 (2020).
Takahashi, N., Ishizuka, K. & Inada, T. Peripheral biomarkers of attention-deficit hyperactivity disorder: Current status and future perspective. J. Psychiatr. Res. 137, 465–470 (2021).
Mowlem, F. D. et al. Sex differences in predicting ADHD clinical diagnosis and pharmacological treatment. Eur. Child Adolesc. Psychiatry 28, 481–489 (2019).
Du Rietz, E. et al. Self-report of ADHD shows limited agreement with objective markers of persistence and remittance. J. Psychiatr. Res. 82, 91–99 (2016).
Murray, A. L., Booth, T., Ribeaud, D. & Eisner, M. Disagreeing about development: An analysis of parent-teacher agreement in ADHD symptom trajectories across the elementary school years. Int. J. Methods Psychiatr. Res. 27, e1723 (2018).
Gomez, R. Australian parent and teacher ratings of the DSM-IV ADHD symptoms: Differential symptom functioning and parent–teacher agreement and differences. J. Atten. Disord. 11, 17–27 (2007).
Kuntsi, J. et al. Co-occurrence of ADHD and low IQ has genetic origins. Am. J. Med. Genet. B Neuropsychiatr. Genet. 124B, 41–47 (2004).
Kramer, E. et al. Diagnostic associations of processing speed in a transdiagnostic, pediatric sample. Sci. Rep. 10, 10114 (2020).
Wanderer, S., Roessner, V., Strobel, A. & Martini, J. WISC-IV performance of children with Chronic Tic Disorder, Obsessive-Compulsive Disorder and Attention-Deficit/Hyperactivity Disorder: results from a German clinical study. Child Adolesc. Psychiatry Ment. Health 15, 44 (2021).
Mayes, S. D. & Calhoun, S. L. WISC-IV and WISC-III profiles in children with ADHD. J. Atten. Disord. 9, 486–493 (2006).
Günther, T., Konrad, K., De Brito, S. A., Herpertz-Dahlmann, B. & Vloet, T. D. Attentional functions in children and adolescents with ADHD, depressive disorders, and the comorbid condition. J. Child Psychol. Psychiatry 52, 324–331 (2011).
Bluschke, A., Zink, N., Mückschel, M., Roessner, V. & Beste, C. A novel approach to intra-individual performance variability in ADHD. Eur. Child Adolesc. Psychiatry 30, 733–745 (2021).
Booth, J. R. et al. Larger deficits in brain networks for response inhibition than for visual selective attention in attention deficit hyperactivity disorder (ADHD). J. Child Psychol. Psychiatry 46, 94–111 (2005).
van Zomeren, A. H. & Brouwer, W. H. Clinical Neuropsychology of Attention. x, 250 (Oxford University Press, 1994).
Thomson, P. et al. Longitudinal trajectories of sustained attention development in children and adolescents with ADHD. J. Abnorm. Child Psychol. 48, 1529–1542 (2020).
Fimm, B. & Zimmermann, P. Testbatterie zur Aufmerksamkeitsprüfung - Version 2.3: TAP (Psytest, 2014).
Conners, C. K., Pitkanen, J. & Rzepa, S. R. Conners 3rd Edition (Conners 3; Conners 2008). In Encyclopedia of Clinical Neuropsychology (eds Kreutzer, J. S. et al.) 675–678 (Springer, 2011). https://doi.org/10.1007/978-0-387-79948-3_1534.
Achenbach, T. M. Manual for the Child Behavior Checklist/4-18 and 1991 Profile (University of Vermont, 1991).
Stone, L. L., Otten, R., Engels, R. C. M. E., Vermulst, A. A. & Janssens, J. M. A. M. Psychometric properties of the parent and teacher versions of the strengths and difficulties questionnaire for 4- to 12-year-olds: A review. Clin. Child Fam. Psychol. Rev. 13, 254–274 (2010).
Pearson Assessment & Information GmbH. Wechsler Intelligence Scale for Children—Fourth Edition: Manual 1: Grundlagen, Testauswertung und Interpretation: Übersetzung und Adaptation der WISC-IV® von David Wechsler (Pearson, 2014).
Wechsler, D., Pearson Education, Inc. & Psychological Corporation. WISC-V: Wechsler Intelligence Scale for Children (NCS Pearson, Inc.: PsychCorp, 2014).
Elhassan, A., Abu-Soud, S. M., Alghanim, F. & Salameh, W. ILA4: Overcoming missing values in machine learning datasets—An inductive learning approach. J. King Saud Univ. Comput. Inf. Sci. https://doi.org/10.1016/j.jksuci.2021.02.011 (2021).
Pereira, F., Mitchell, T. & Botvinick, M. Machine learning classifiers and fMRI: A tutorial overview. Neuroimage 45, S199–S209 (2009).
Vahid, A., Mückschel, M., Neuhaus, A., Stock, A.-K. & Beste, C. Machine learning provides novel neurophysiological features that predict performance to inhibit automated responses. Sci. Rep. 8, 16235 (2018).
Chandrashekar, G. & Sahin, F. A survey on feature selection methods. Comput. Electr. Eng. 40, 16–28 (2014).
Khazaee, A., Ebrahimzadeh, A. & Babajani-Feremi, A. Application of advanced machine learning methods on resting-state fMRI network for identification of mild cognitive impairment and Alzheimer’s disease. Brain Imaging Behav. 10, 799–817 (2016).
Arlot, S. & Celisse, A. A survey of cross-validation procedures for model selection. Stat. Surv. 4, 40–79 (2010).
Iniesta, R., Stahl, D. & McGuffin, P. Machine learning, statistical learning and the future of biological research in psychiatry. Psychol. Med. 46, 2455–2465 (2016).
Koutsouleris, N. et al. Individualized differential diagnosis of schizophrenia and mood disorders using neuroanatomical biomarkers. Brain J. Neurol. https://doi.org/10.1093/brain/awv111 (2015).
Fusar-Poli, P. et al. Transdiagnostic risk calculator for the automatic detection of individuals at risk and the prediction of psychosis: Second replication in an independent national health service trust. Schizophr. Bull. 45, 562–570 (2019).
Fusar-Poli, P. et al. Development and validation of a clinically based risk calculator for the transdiagnostic prediction of psychosis. JAMA Psychiatry 74, 493–500 (2017).
Kovshoff, H. et al. The decisions regarding ADHD management (DRAMa) study: Uncertainties and complexities in assessment, diagnosis and treatment, from the clinician’s point of view. Eur. Child Adolesc. Psychiatry 21, 87–99 (2012).
Mahdi, S. et al. An international clinical study of ability and disability in ADHD using the WHO-ICF framework. Eur. Child Adolesc. Psychiatry 27, 1305–1319 (2018).
Pritchard, A. E., Nigro, C. A., Jacobson, L. A. & Mahone, E. M. The role of neuropsychological assessment in the functional outcomes of children with ADHD. Neuropsychol. Rev. 22, 54–68 (2012).
Nunes, A. et al. Using structural MRI to identify bipolar disorders—13 site machine learning study in 3020 individuals from the ENIGMA Bipolar Disorders Working Group. Mol. Psychiatry 25, 2130–2143 (2020).
Koutsouleris, N. et al. Multimodal machine learning workflows for prediction of psychosis in patients with clinical high-risk syndromes and recent-onset depression. JAMA Psychiatry 78, 195–209 (2021).

Funding

Open Access funding enabled and organized by Projekt DEAL.

Author information

Authors and Affiliations

Department of Child and Adolescent Psychiatry and Psychotherapy, Carl Gustav Carus University Hospital, Technische Universität Dresden, Fetscherstraße 74, 01307, Dresden, Germany
Pavol Mikolas, Amirali Vahid, Mathilde Süß, Christian Beste & Annet Bluschke
Department of Psychiatry and Psychotherapy, Carl Gustav Carus University Hospital, Technische Universität Dresden, Fetscherstraße 74, 01307, Dresden, Germany
Pavol Mikolas & Julia Martini
Translational Developmental Neuroscience Section, Division of Psychological and Social Medicine and Developmental Neuroscience, Faculty of Medicine, Technische Universität Dresden, Dresden, Germany
Fabio Bernardoni

Authors

Pavol Mikolas
Amirali Vahid
Fabio Bernardoni
Mathilde Süß
Julia Martini
Christian Beste
Annet Bluschke

Contributions

P.M., C.B., J.M., and A.B. designed the study. P.M. and M.S. performed the data collection. A.V., P.M., A.B., F.B. and M.S. performed the data analyses and statistics. P.M., A.B., and C.B. wrote the article. A.V., M.S., F.B. and J.M. revised it critically for important intellectual content. All of the authors reviewed and approved the manuscript for publication.

Corresponding author

Correspondence to Pavol Mikolas.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Mikolas, P., Vahid, A., Bernardoni, F. et al. Training a machine learning classifier to identify ADHD based on real-world clinical data from medical records. Sci Rep 12, 12934 (2022). https://doi.org/10.1038/s41598-022-17126-x

Received: 03 December 2021
Accepted: 20 July 2022
Published: 28 July 2022
DOI: https://doi.org/10.1038/s41598-022-17126-x
