Suicidal and aggressive behaviours cause significant personal and societal burden. As risk factors associated with these behaviours frequently overlap, combined approaches in predicting the behaviours may be useful in identifying those at risk for either. The current study aimed to create a model that predicted if individuals will exhibit suicidal behaviour, aggressive behaviour, both, or neither in late adolescence. A sample of 5,974 twins from the Child and Adolescent Twin Study in Sweden (CATSS) was broken down into a training (80%), tune (10%) and test (10%) set. The Netherlands Twin Register (NTR; N = 2702) was used for external validation. Our longitudinal data featured genetic, environmental, and psychosocial predictors derived from parental and self-report data. A stacked ensemble model was created which contained a gradient boosted machine, random forest, elastic net, and neural network. Model performance was transferable between CATSS and NTR (macro area under the receiver operating characteristic curve (AUC) [95% CI] AUCCATSS(test set) = 0.709 (0.671–0.747); AUCNTR = 0.685 (0.656–0.715), suggesting model generalisability across Northern Europe. The notable exception is suicidal behaviours in the NTR, which was no better than chance. The 25 highest scoring variable importance scores for the gradient boosted machines and random forest models included self-reported psychiatric symptoms in mid-adolescence, sex, and polygenic scores for psychiatric traits. The model’s performance is comparable to current prediction models that use clinical interviews and is not yet suitable for clinical use. Moreover, genetic variables may have a role to play in predictive models of adolescent psychopathology.
Aggressive and suicidal behaviours cause significant disruptions on a personal and societal level [1,2,3]. Early identification of those at risk for these behaviours in the general population would likely improve prognosis by assisting with risk stratification as well as targeting clinical interventions, more efficiently than with clinical interviews for each child. However, many risk prediction models focus on clinical samples, i.e. those who have already sought treatment. By expanding this approach to the general population, prevention effectors could be improved. To this end, it is of interest to create a model that can determine who is at risk to exhibit suicidal behaviour, aggressive behaviour or both, versus who is not.
Aggressive and suicidal behaviours
Aggressive behaviours are here defined as maladaptive behaviours that negatively impact others, such as hitting others or destroying property. Suicidal behaviours are considered to be a spectrum of behaviours that negatively impact the self, including: suicidal ideation, i.e. thoughts or planning of ending one’s life, suicide attempts, and death by suicide . Although there is a debate on whether self-harm without suicidal intent is considered a suicidal behaviour, it is heavily implicated in suicidal behaviours and is a coping strategy borne from severe psychiatric symptoms which require clinical intervention. Thus, we included self-harm in our definition of suicidal behaviours due to its clinical relevance.
Suicidal and aggressive behaviours are often seen as separate constructs, however there is evidence for an intrinsic link [4,5,6]. Perhaps the most palpable example is in individuals with co-occurring symptoms of impulsivity and emotional dysregulation, such as with attention-deficit hyperactivity disorder (ADHD) or borderline personality disorder [7,8,9]. The occurrence of these symptoms can lead to impulsive-aggressive behaviour in a subset of individuals, blurring both the control and direction of their actions [10,11,12,13].
Many overlapping risk factors for these behaviours have been implicated across psychological symptoms and home environmental factors . They have both been shown to be associated with internalising symptoms such as depression, substance abuse, family dysfunction and abuse during childhood, amidst a myriad of risk factors [7, 14,15,16,17]. Sex is a perhaps the most distinguishing risk factor as females are more likely to report self-harm, and suicidal ideation, while men have higher instances of death by suicide, aggressive behaviour and criminal acts [18,19,20]. There is evidence for genetic influence in the broad spectrum of suicidal behaviours and aggressive behaviours [21,22,23], though little is known regarding genetic overlap between them.
Polygenic scores (PGS) represent an aggregate score of an individual’s genetic propensity for a trait based on effect sizes from genome-wide association studies (GWAS) and can be used as a measure of genetic risk for a trait, as well as for estimation of genetic overlap between traits . With this approach genetic associations have been reported between suicidal behaviours and psychiatric traits like anxiety and depression , as well as between aggression and psychiatric traits including ADHD and autism spectrum disorder . To the authors’ knowledge no such prediction models which examine suicidal and aggressive behaviours utilise these relatively novel methods. Thus, incorporating PGS for psychiatric traits may add information that improves prediction in suicidal behaviours and aggression.
Predicting suicidal behaviours and aggressive behaviour
Many studies have looked into predicting suicide or aggressive behaviour. Although many models exist for prediction of suicidal behaviours in the clinical population, there are relatively fewer studies which use population-based samples [25, 26]. Whereas research in forensic psychology has worked to predict recidivism and violent criminal behaviour in general [27,28,29], an approach to prevent aggressive behaviours in adolescence, a developmentally sensitive period, is less common. Moreover, only a handful of studies using a clinical population have examined suicidal and aggressive behaviours together as an outcome [30, 31]. While aggressive behaviour tends to be childhood- and adolescent-limited, there is a subset of individuals for whom aggressive behaviour persists into adulthood. This trajectory is associated with poorer outcomes in adulthood . Thus identifying those who remain aggressive at late adolescence or older are of clinical importance . Additionally, given the significant overlap between the risk factors and co-occurrence of suicidal behaviours and aggression, creating a combined model that could be used in practice would reduce the need for separate questionnaires or assessments.
Our aim is to create a multi-class model that can predict who will report suicidal behaviours, aggressive behaviours, both, or neither in young adulthood within large scale, epidemiological samples. Using a combination of genetic, environmental, and psychosocial factors obtained from epidemiological cohorts would theoretically allow for a highly generalisable, comprehensive model that could improve understanding of risk factors for these behaviours, as well as further inform future models for clinical prediction and decision-making.
A total of 8676 participants from population-based twin cohorts, who completed self-report questionnaires about suicidal and aggressive behaviours between ages 17 and 21 were included in this study. Before the data cleaning procedures, the sample comprised 6,669 participants from the Child and Adolescent Twin Study in Sweden (CATSS)  and 2,764 participants from the Netherlands Twin Register (NTR) . Further descriptions are provided in the Supplementary text.
Aggressive and suicidal behaviours were both measured using the Life History of Aggression Checklist  at age 18. Based on the average scores in a clinical sample, a score of 15 out of a possible 40 was used as a cut-off for determining aggressive individuals . Suicidal behaviours were determined using two questions, “Deliberately attempted to injure yourself physically when you were angry or despondent” and “Deliberately attempted to kill yourself when you were angry or despondent”. Participants who endorsed either question were classified as having suicidal behaviours.
We included 19 predictors collected at age 9 or 12 and 15 which included psychiatric symptoms, parent and child relationship characteristics, as well as substance use [37,38,39,40,41]. A full list of variables and corresponding questionnaires used can be found in S1 and S2 Table.
Aggressive and suicidal behaviours were measured using the Young Adult Self Report and Adult Self Report of the Achenbach System of Empirically Based Assessment  at age 18. The aggression cut-off was derived using a T-score cut off of 64 from their respective aggressive behaviour subscales as suggested by the developers scoring system, which was equivalent to the 91.24% percentile of the NTR sample. Suicidal behaviour was measured using two related items from the internalising problems subscales: “I deliberately try to hurt or kill myself” and “I think about killing myself”. Participants who endorsed either question were classified as having suicidal behaviours.
Polygenic scores (PGS) were constructed using summary data from recent GWASs. A complete list of the 17 traits and associated GWAS on which PGS were based can be found in S2 Table. Leave-one-out summary statistics excluding CATSS and/or NTR data samples were generated for any trait for which they were included in the discovery GWAS. Analyses were limited to individuals of European ancestry. From the psychiatric PGS scores a general psychopathology PGS variable was created to account for the underlying correlation between psychiatric disorders (S2 Table) [44, 45]. Genotyping and quality control were performed in both samples and are described in the Supplementary text.
PGS were derived using LDpred . LDpred accounts for the linkage disequilibrium between single nucleotide polymorphisms (SNPs) to avoid inflation of effect sizes. For NTR the LD structure was determined on a subset of unrelated individuals and using a set of well imputed variants, while in CATSS, data from 1000 genomes phase 3 version 5 was used as an external reference sample [46, 47]. The weighted effect sizes were used as a basis for the polygenic scores. LDpred requires the specification of prior probabilities corresponding to the fraction of SNPs from the discovery samples considered causal with the trait, and we created scores at a range of priors (0.01, 0.05, 0.1, 0.2 0.3, 0.5, 1). In order to reduce the complexity inherent in having multiple PGS predictors and outcome variables, we performed principal component analysis (PCA) on all priors for each trait PGS, and included the first principal component (PCA-PGS) for each trait in our model according to Coombes and colleagues . PCA analysis is an unsupervised machine learning technique which reduces the dimensionality of datasets while maintaining as much variability as possible; the resulting principal components (PCs) represent a certain amount of variation within the dataset. The first PC can be interpreted to represent the most variation within the data . This method has been shown to prevent overfitting each PGS to each outcome and removes the need to select a single prior across all PGS.
All analyses were performed in R. First, all non-binary predictor variables were scaled separately in CATSS and NTR to account for variations in measurement tools. Pearson correlation was completed on all included variables (S3 Table). Participants with more than half of variables missing were removed from the analysis (Fig. 1). This removal was for two reasons: (1) although a standard strategy is to maximise the number of data points this may not be the best approach when considering data quality; (2) K-nearest neighbours was used for imputation and requires a certain ratio of complete cases and removing those with more than half the data missing provided us with a suitable ratio.
Next, PCA was completed on the entire CATSS dataset to identify outliers which could hamper model performance . Outliers were determined through the first principal component, severe outliers (scores outside the range of [−5.5, 5.5]) were removed. This process was repeated separately in the NTR dataset.
The CATSS data was then broken down into a training set, a tuning set, and a test set in an 80/10/10 split. As twin pairs are more similar to each other compared to other participants, we kept twins in pairs together in order to prevent overfitting that could occur from twin pairs being separated between the data subsets. Additionally, we stratified on the outcome to ensure that the outcome proportion was balanced across the data subsets. Descriptive statistics were checked to determine the consistency of the split. The NTR data was used without splitting.
Missing predictor data was imputed using K-nearest neighbours. A value of 6 was chosen for k, i.e. the number of data points within a cluster, was chosen by finding the square root of the number of columns in our dataset . To avoid bias during this step, the test set and the external validation data were imputed separately. The outcome variable was not included as an informative imputation variable. We then completed a final check for outliers in the imputed datasets using the steps described above. This led to a total sample size of 8,676 participants (Ntraining = 4773; Ntuning = 603; Ntest = 598; NNTR = 2702) (Fig. 1).
A PCA in the final data sets was completed to determine clustering of the four outcomes (suicidal behaviours, aggressive behaviours, both, or neither). R package H2O  was used for all supervised machine learning analyses. The performance of the models was determined by macro area under the receiver operating characteristic curve (AUC), derived from averaging the AUC for each class. The AUC for each class was determined using a one-versus-all approach that collapses each class into a binary outcome, e.g. having only suicidal behaviours vs all other outcomes combined . Positive predictive values (PPV), negative predictive values (NPV), sensitivity, and specificity were determined based on Youden’s J statistic . As the model merely gives probabilities of an individual belonging to the classes rather than giving a specific class prediction, a threshold must be selected at which a participant is classified into a specific class. Youden’s J is a way to determine this threshold by finding the maximum of the “sensitivity plus specificity minus 1”, associated with each of the possible thresholds; the values for J range from 0 to 1 .
First, a LASSO model without tuning was created to determine variable selection via coefficient weighting, variables which had a coefficient reduced to zero were considered to be unnecessary to the model and removed . During the model creation process, each model was trained using the training set while the tune set was used to evaluate performance at each iteration. Neither the CATSS test set nor the external validation NTR dataset were used during this process. First, as the sample size for the classes varied extensively, we added weights to the training set to improve the performance of the model . Weighting can be interpreted as the number of times participants in each class are resampled during the model learning process. The weight values W were determined by taking the sample size of the different classes over the sample size of the minority class (weights: Wsuicidal behaviours = 6; Waggressive behaviours = 10; Wboth = 16; Wneither = 1). This means that during the learning process each model resampled individuals who had aggressive behaviours 10 times, while those in the majority class (neither suicidal nor aggressive behaviours) were used only once.
We created a stacked ensemble model, i.e. a model that combines input predictions from separate models, which included a gradient boosted machine, random forest, elastic net, and a neural network . These models were selected based on their availability in H2O. This package was chosen based on its compatibility with multiclass ensemble models. The stacked ensemble model did not contain any parameters other than the number of folds for cross validation.
Each individual model was trained separately using cross-validation with 5 folds in the training set until the mean per class error converged (no improvements by 0.0001 for five rounds based on performance within the cross-validation folds). The model was then applied to the tuning set to determine if the model training process should continue. A combination of grid search and random search was used to tune the hyper-parameters of each model (S4–S7 Tables). Once the performance of each of the models did not improve after several rounds of parameter searching in the tune set, the models were combined into a stacked ensemble model, and the model was not further modified. Finally, we assessed the stacked ensemble model performance in the test set and in the NTR data.
Variable importance scores can be created from tree-based models and thus were obtained from the gradient boosted machines and random forest models that were included within the ensemble model. The overall variable importance rankings were determined using the average of scaled importance scores across the random forest and gradient boosted machine model. The variable importance in models built with H2O can be interpreted as the improvement in the squared error when the variable is split on a node, i.e. the decision points of the tree .
In order to determine the extent to which all of the PGS variables contributed to the model we completed the analysis with all genetic variables removed. The same tuning procedure in the main analysis was used to create the final ensemble model. We then performed a Venkatraman test using the R package pROC to determine whether the performance difference between both models (with and without genetic variables) was significant . Additionally, in order to assess the discrepancy between the proportion of suicidal behaviours in the CATSS and NTR we performed a logistic regression using suicidal behaviours as an outcome with cohort and measurement year as predictors.
The data was fairly well-balanced between the training, tune, and test set (Table 1). The age of the sample at the measurement of the outcomes ranged from 17 to 21. As a sensitivity analysis we addressed the discrepancy between the percentage of those with suicidal behaviours and/or aggression in the sets, the CATSS data had a considerably higher number of individuals who reported suicidal behaviours (CATSS = 12.92%; NTR = 3.15%) or both aggression and suicidal behaviours (CATSS = 4.72%; NTR = 1.70%), while the NTR data had a higher number of individuals who were classified as aggressive (CATSS = 7.48%; NTR = 12.10%). In the logistic regression analysis, a statistically significant negative association with cohort (NTR coded as 1; β = −1.547; SE = 0.149; P < 2.00 × 10−16) and a non-statistically significant association with both birth year and the year of outcome measurement. This indicates that differences between cohorts, e.g. instruments, may explain the disparity in prevalence rates.
The PCAs of the entire CATSS and NTR data sets did not show a clear distinction between individuals with and without the outcome (S1–S2 Figs.). No variables were reduced to 0 in the LASSO and thus none were removed from the model.
The performance of the four models included in the ensemble model were similar (S8 Table; final parameters S4-S7 Tables; the ensemble model contained no parameters): The ensemble model’s performance was relatively uniform between the datasets and classes, with the notable expectation of suicidal behaviours in the NTR, which was no better than chance (average of all classes, i.e. macro, AUC [10,000 bootstrap, 95% CI] AUCCATSS(test set) = 0.709 [0.671–0.747]; AUCNTR = 0.685 [0.656–0.715]; Fig. 2 and Table 2). The macro PPV was lower than the NPV (test set macro PPV = 0.350; test set macro NPV = 0.803; NTR set macro PPV = 0.303; NTR set macro NPV = 0.811; Table 2). Based on the Youden’s J statistic (test set = 0.306; NTR set = 0.278), the sensitivity (test set macro = 0.722; NTR set = 0.683) and specificity (test set macro = 0.584; NTR set = 0.594) of the models moderately varied. The confusion matrices for the training and test set can be found S9 Table.
Variable importance rankings across gradient boosted machines and random forest were relatively consistent: self-reported aggression, sex, as well as externalising and internalising symptoms at age 15/16 were ranked as most informative (Fig. 3 and S10 Table). Several PGS were also ranked highly including psychiatric disorders, IQ, and childhood BMI/birth weight. Self-reported psychopathology measures consistently ranked higher than their corresponding parent-reported measures.
The ensemble model which did not contain any genetic variables showed attenuated performance compared to the main model (macro AUC [10,000 bootstrap, 95% CI] AUCCATSS(test set) = 0.677 [0.648–0.727]; AUCNTR = 0.682 [0.653–0.710]) (S11 Table, final model parameters S12–S15 Tables). Testing the differences between both models showed that for most classes, there was no significant difference between the model with genetic variables and the model without (S16 Table).
We created a model to identify adolescents at a high risk of suicidal and/or aggressive behaviour using a wide range of predictors: questionnaires relating to home environment, behaviours, psychiatric symptoms, and genetic data for various traits. By training the model in the CATSS sample and validating it in the NTR sample, we tested the cross-cultural/external prediction of the model. Moreover, we examined the extent to which PGS variables contributed to the model through variable importance scores and by creating a model without genetic variables.
The AUC results for one-vs-all and macro, i.e. average of all one-vs-all scores, were similar between the CATSS test set (macro AUC = 0.709 [0.671–0.747]) and the NTR (macro AUC = 0.685 [0.656–0.715]). This indicates that the results of this model are generalisable across the national twin registers in the Netherlands and Sweden. Although the confidence intervals overlapped, the CATSS test set slightly outperformed the tune set (macro AUC = 0.697 [0.656–0.739]). This discrepancy came from a better performance of the suicidal behaviours and “both aggression and suicidal behaviours” classes. This be in part due to the slight variation in proportions between the classes, but counterintuitively, there were a lower proportion of individuals with suicidal behaviours in the test set (11.04% compared to 14.76%).
A cut-off of 0.8 for sensitivity and 0.5 for specificity has been previously proposed for use of prediction models in clinical settings . The macro sensitivity (CATSS = 0.722; NTR = 0.683) did not meet this threshold, but the macro specificity did (CATSS = 0.584; NTR = 0.593). This criterion was met for “both aggression and suicidal behaviours” class in CATSS (sensitivity = 0.935; specificity = 0.541) but this did not hold true in the NTR, indicating that our model is not sufficient for clinical use yet. The PPV scores (macro PPVCATSS(test set) = 0.350; macro PPVNTR = 0.303) were lower than the NPV scores (macro NPVCATSS(test set) = 0.803; macro NPVNTR = 0.811). The macro PPV indicates that the model was correct around 30% of the time when it placed an individual into the four classes.
Overall, our study shows comparable to marginally improved prediction compared to previous studies investigating both suicidal behaviours and aggression outcomes in clinical settings. A previous study that developed a clinical risk assessment implemented by psychologists for self-harm and aggression had an average AUC of 0.68 and 0.66, respectively . Another clinical study predicting self-harm and aggression in patients reported a PPV of 0.24 .
Similarly, the model’s performance was comparable to prediction models solely examining suicidal behaviours OR aggressive behaviours in clinical populations. The one-vs-all aggressive behaviours class had comparable performance to a study examining aggression in a prison sample . The direct comparison for suicidal behaviours to other studies is more challenging due to varying definitions. Additionally, while the AUC for the over-vs-all suicidal behaviours class was satisfactory, the AUC for this class in the NTR was no better than chance. Therefore, our model under-performs in comparison to current models including those which make use of non-genetic biomarkers [61, 63]. That said, the macro results from this model moderately out performed or had comparable results to these studies.
By using a combination of data types we were able to investigate the performance of variables related to home environment, genetics, behaviours, and psychiatric symptoms. Our highest scoring variable was self-reported aggression at age 15/16, followed by sex, self-reported psychiatric symptoms at age 15/16, PGS of psychiatric traits including general psychopathology, anorexia nervosa, neuroticism, and anxiety disorders, as well as population stratification. The lowest ranked variables were related to family functioning and parent-reported symptoms. Our top variable importance scores fit within literature, as aggression is fairly stable in late adolescence . Moreover, there are marked gender differences in the prevalence of the different outcomes [64, 65].
Out of the 25 highest variable importance scores 18 were PGS variables, with four PGS variables among the top ten. The highest performing PGS variables were for psychiatric related variables, but other traits such as birth weight, childhood body mass index, IQ, and educational attainment were also ranked highly. Additionally, removing all PGS variables from the model showed attenuated performance compared to the main model, however the difference in model performance was not statistically significant for most classes. Thus, while PGS of traits generally do not have clinical utility within psychiatry on their own, the results suggest that they can contribute to model performance when combined with other variables/risk factors . Notably, AUC of the NTR outperformed the test set in the sensitivity analysis. This is likely a result of the model over relying on the variable with the highest importance, aggression at age 15/16, as the NTR had a higher proportion of individuals in the aggressive class and the one-vs-all aggression AUC was the top performing class in the NTR.
The primary strength of this study was the use of longitudinal data from both questionnaires and genetic data. Moreover, we were able to validate our model through an externally collected data source , indicating that our results are generalisable to Northern Europe and did not suffer from overfitting. However, our study comes with caveats. First, our model would likely be improved by a larger sample size and additional variables related to psychiatric symptoms, such as impulsivity and emotion-dysregulation. Second, our measure of suicidal behaviours were somewhat inconsistent across both cohorts, which likely affected the performance between the data sets.
This study adds to the growing literature around genetically informed prediction models for mental health in adolescents. The results from the variable importance scores suggest that self-reported psychiatric symptoms in mid-adolescence, sex, and psychiatric PGS are key indicators for predicting later aggression and self-harm. Through the sensitivity analysis we found that removing genetic variables led to attenuated but not statistically significant differences in model performance for most of the classes. As of now prediction models are not ready for clinical use in psychiatric clinics and our model is no different . Moreover, the current cost and processing time of genotyping means that clinical utility of genetically informed models may be limited, especially given our models comparable performance to models using non-genetic biomarkers .
Future studies could improve our results through including additional variables related to biomarkers, other psychiatric symptoms, and additional raters at varying time points. Moreover, investigating these predictors in a clinical sample could further improve the performance of the model. In sum, to our knowledge our study resulted in the first genetically informed population-model to predict suicidal behaviours, aggressive behaviours, their absence, and co-occurrence. In doing so, we created a model generalisable across Northern Europe with comparable performance to current models, (with the notable exception of the suicidal behaviours class). Finally, we show the non-inferiority of a model that used PGS variables in lieu of variables based on clinical interviews.
The code for the study is available upon request from the authors.
Brownstein HH The social reality of violence and violent crime: Allyn and Bacon Boston; 2000.
Ferrey AE, Hughes ND, Simkin S, Locock L, Stewart A, Kapur N, et al. The impact of self-harm by young people on parents and families: a qualitative study. BMJ Open. 2016;6:e009631.
Kinchin I, Doran CM, Hall WD, Meurk C. Understanding the true economic impact of self-harming behaviour. Lancet Psychiatry. 2017;4:900–1.
Fazel S, Wolf A, Palm C, Lichtenstein P. Violent crime, suicide, and premature mortality in patients with schizophrenia and related disorders: a 38-year total population study in Sweden. Lancet psychiatry. 2014;1:44–54.
Sahlin H, Kuja-Halkola R, Bjureberg J, Lichtenstein P, Molero Y, Rydell M, et al. Association between deliberate self-harm and violent criminality. JAMA Psychiatry. 2017;74:615–21.
Laporte N, Ozolins A, Westling S, Westrin Å, Billstedt E, Hofvander B, et al. Deliberate self-harm behavior among young violent offenders. PLoS ONE. 2017;12:e0182258.
Loeber R, Pardini D, Homish DL, Wei EH, Crawford AM, Farrington DP, et al. The prediction of violence and homicide in young men. J Consulting Clin Psychol. 2005;73:1074.
O’Connor RC, Kirtley OJ. The integrated motivational–volitional model of suicidal behaviour. Philos Trans R Soc B: Biol Sci. 2018;373:20170268.
O’Donnell O, House A, Waterman M. The co-occurrence of aggression and self-harm: systematic literature review. J Affect Disord. 2015;175:325–50.
Brent DA, Johnson BA, Perper J, Connolly J, Bridge J, Bartle S, et al. Personality disorder, personality traits, impulsive violence, and completed suicide in adolescents. J Am Acad Child Adolesc Psychiatry. 1994;33:1080–6.
Retz W, Rösler M. The relation of ADHD and violent aggression: What can we learn from epidemiological and genetic studies? Int J Law Psychiatry. 2009;32:235–43.
Zanarini MC, Frankenburg FR, Hennen J, Reich DB, Silk KR. Axis I comorbidity in patients with borderline personality disorder: 6-year follow-up and prediction of time to remission. Am J Psychiatry. 2004;161:2108–14.
Terzi L, Martino F, Berardi D, Bortolotti B, Sasdelli A, Menchetti M. Aggressive behavior and self-harm in Borderline Personality Disorder: the role of impulsivity and emotion dysregulation in a sample of outpatients. Psychiatry Res. 2017;249:321–6.
Shafti M, Taylor PJ, Forrester A, Pratt D. The co-occurrence of self-harm and aggression: a cognitive-emotional model of dual-harm. Front Psychology. 2021;12:586135.
Allely CS. The association of ADHD symptoms to self-harm behaviours: a systematic PRISMA review. BMC Psychiatry. 2014;14:133.
Hong JS, Espelage DL. A review of research on bullying and peer victimization in school: an ecological system analysis. Aggression Violent Behav. 2012;17:311–22.
Paul E, Ortin A. Psychopathological mechanisms of early neglect and abuse on suicidal ideation and self-harm in middle childhood. Eur Child Adolesc Psychiatry. 2019;28:1311–9.
Moffitt TE, Caspi A. Childhood predictors differentiate life-course persistent and adolescence-limited antisocial pathways among males and females. Dev Psychopathol. 2001;13:355–75.
Moran P, Coffey C, Romaniuk H, Olsson C, Borschmann R, Carlin JB, et al. The natural history of self-harm from adolescence to young adulthood: a population-based cohort study. Lancet. 2012;379:236–43.
Nock MK, Borges G, Bromet EJ, Alonso J, Angermeyer M, Beautrais A, et al. Cross-national prevalence and risk factors for suicidal ideation, plans and attempts. Br J Psychiatry. 2008;192:98–105.
Campos AI, Verweij KJH, Statham DJ, Madden PAF, Maciejewski DF, Davis KAS, et al. Genetic aetiology of self-harm ideation and behaviour. Sci Rep. 2020;10:9713.
Ip HF, Van der Laan CM, Krapohl EM, Brikell I, Sánchez-Mora C, Nolte IM, et al. Genetic association study of childhood aggression across raters, instruments, and age. Transl Psychiatry. 2021;11:1–9.
Pappa I, St Pourcain B, Benke K, Cavadino A, Hakulinen C, Nivard MG, et al. A genome‐wide approach to children’s aggressive behavior: The EAGLE consortium. Am J Med Genet Part B: Neuropsychiatr Genet. 2016;171:562–72.
Fullerton JM, Nurnberger JI. Polygenic risk scores in psychiatry: Will they be useful for clinicians? F1000Research. 2019;8:12–19.
Belsher BE, Smolenski DJ, Pruitt LD, Bush NE, Beech EH, Workman DE, et al. Prediction models for suicide attempts and deaths: a systematic review and simulation. JAMA Psychiatry. 2019;76:642–51.
Carter G, Milner A, McGill K, Pirkis J, Kapur N, Spittal MJ. Predicting suicidal behaviours using clinical instruments: systematic review and meta-analysis of positive predictive values for risk scales. Br J Psychiatry. 2017;210:387–95.
Kang H-W, Kang H-B. Prediction of crime occurrence from multi-modal data using deep learning. PLoS ONE. 2017;12:e0176244.
Ramesh T, Igoumenou A, Montes MV, Fazel S. Use of risk assessment instruments to predict violence in forensic psychiatric hospitals: a systematic review and meta-analysis. Eur Psychiatry. 2018;52:47–53.
Wolf A, Fanshawe T, Sariaslan A, Cornish R, Larsson H, Fazel S. Prediction of violent crime on discharge from secure psychiatric hospitals: a clinical prediction rule (FoVOx). Eur Psychiatry. 2018;47:88–93.
O’Shea LE, Dickens GL. Predictive validity of the Short-Term Assessment of Risk and Treatability (START) for aggression and self-harm in a secure mental health service: Gender differences. Int J Forensic Ment Health. 2015;14:132–46.
Phillips NL, Stargatt R, Brown A. Risk assessment of self-and other-directed aggression in adolescent psychiatric inpatient units. Aust N Z J Psychiatry. 2012;46:40–6.
Huesmann LR, Dubow EF, Boxer P. Continuity of aggression from childhood to early adulthood as a predictor of life outcomes: implications for the adolescent-limited and life-course-persistent models. Aggress Behav. 2009;35:136–49.
Moffitt TE. Life-course persistent and adolescence-limited antisocial behavior. Causes of conduct disorder and juvenile delinquency. 2003;100:49–75.
Anckarsäter H, Lundström S, Kollberg L, Kerekes N, Palm C, Carlström E, et al. The child and adolescent twin study in Sweden (CATSS). Twin Res Hum Genet. 2011;14:495–508.
Ligthart L, van Beijsterveldt CEM, Kevenaar ST, de Zeeuw E, van Bergen E, Bruins S, et al. The Netherlands Twin Register: longitudinal research based on twin and twin-family designs. Twin Res Hum Genet. 2019;22:623–36.
Coccaro EF, Berman ME, Kavoussi RJ. Assessment of life history of aggression: development and psychometric characteristics. Psychiatry Res. 1997;73:147–57.
Gerard AB. Parent-Child Relationship Inventory (PCRI): Western psychological services; 1994.
Goodman R. The Strengths and Difficulties Questionnaire: a research note. J Child Psychol Psychiatry. 1997;38:581–6.
Larson T, Anckarsäter H, Gillberg C, Ståhlberg O, Carlström E, Kadesjö B, et al. The autism-tics, AD/HD and other comorbidities inventory (A-TAC): further validation of a telephone interview for epidemiological research. BMC Psychiatry. 2010;10:1.
Raine A, Dodge K, Loeber R, Gatzke‐Kopp L, Lynam D, Reynolds C, et al. The reactive–proactive aggression questionnaire: Differential correlates of reactive and proactive aggression in adolescent boys. Aggress Behav. 2006;32:159–71.
Stattin H, Kerr M. Parental monitoring: a reinterpretation. Child Dev. 2000;71:1072–85.
Achenbach TM, Rescorla LA. Manual for the ASEBA Adult Forms & Profiles. 2003.
Achenbach TM. Achenbach system of empirically based assessment (ASEBA). The Encyclopedia Of Clinical Psychology. John Wiley & Sons, Inc; 2014. p. 1-8 .
Brikell I, Larsson H, Lu Y, Pettersson E, Chen Q, Kuja-Halkola R, et al. The contribution of common genetic risk variants for ADHD to a general factor of childhood psychopathology. Mol Psychiatry. 2020;25:1809–21.
Caspi A, Houts RM, Belsky DW, Goldman-Mellor SJ, Harrington H, Israel S, et al. The p factor: one general psychopathology factor in the structure of psychiatric disorders? Clin Psychol Sci. 2014;2:119–37.
Vilhjálmsson BJ, Yang J, Finucane HK, Gusev A, Lindström S, Ripke S, et al. Modeling linkage disequilibrium increases accuracy of polygenic risk scores. Am J Hum Genet. 2015;97:576–92.
Consortium GP. A global reference for human genetic variation. Nature 2015;526:68.
Coombes BJ, Ploner A, Bergen SE, Biernacka JM. A principal component approach to improve association testing with polygenic risk scores. Genet Epidemiol. 2020;44:676–86.
Jolliffe IT, Cadima J. Principal component analysis: a review and recent developments. Philosophical Transactions of the Royal Society A: Mathematical. Phys Eng Sci. 2016;374:20150202.
Wold S, Esbensen K, Geladi P. Principal component analysis. Chemometrics Intell Lab Syst. 1987;2:37–52.
Pan R, Yang T, Cao J, Lu K, Zhang Z. Missing data imputation by K nearest neighbours based on grey relational structure and mutual information. Appl Intell. 2015;43:614–32.
Cook D. Practical machine learning with H2O: powerful, scalable techniques for deep learning and AI. O’Reilly Media, Inc; 2016. .
Wu X-Z, Zhou Z-H. A unified view of multi-label performance measures. International Conference On Machine Learning. PMLR; 2017.
Bewick V, Cheek L, Ball J. Statistics review 13: receiver operating characteristic curves. Crit Care. 2004;8:1–5.
Youden WJ. Index for rating diagnostic tests. Cancer 1950;3:32–5.
Ranstam J, Cook J. LASSO regression. J Br Surg. 2018;105:1348-.
Guo X, Yin Y, Dong C, Yang G, Zhou G. On the class imbalance problem. 2008 Fourth International Conference On Natural Computation. IEEE; 2008.
Huang JZ. An introduction to statistical learning: With applications in r by gareth james, trevor hastie, robert tibshirani, daniela witten. Springer; 2014.
Robin X, Turck N, Hainard A, Tiberti N, Lisacek F, Sanchez J-C, et al. Package ‘pROC’. Package ‘pROC’. 2021.
Moons KG, Altman DG, Reitsma JB, Ioannidis JP, Macaskill P, Steyerberg EW, et al. Transparent Reporting of a multivariable prediction model for Individual Prognosis or Diagnosis (TRIPOD): explanation and elaboration. Ann Intern Med. 2015;162:W1–W73.
Runeson B, Odeberg J, Pettersson A, Edbom T, Jildevik Adamsson I, Waern M. Instruments for the assessment of suicide risk: a systematic review evaluating the certainty of the evidence. PLoS ONE. 2017;12:e0180292.
Menger V, Spruit M, Van Est R, Nap E, Scheepers F. Machine learning approach to inpatient violence risk assessment using routinely collected clinical notes in electronic health records. JAMA Netw Open 2019;2:e196709-e.
Whiting D, Fazel S. How accurate are suicide risk prediction models? Asking the right questions for clinical practice. Evid Based Ment health. 2019;22:125–8.
Archer J. Sex differences in aggression in real-world settings: a meta-analytic review. Rev Gen Psychol. 2004;8:291–322.
Bresin K, Schoenleber M. Gender differences in the prevalence of nonsuicidal self-injury: a meta-analysis. Clin Psychol Rev. 2015;38:55–64.
Murray GK, Lin T, Austin J, McGrath JJ, Hickie IB, Wray NR. Could polygenic risk scores be useful in psychiatry?: a review. JAMA Psychiatry. 2021;78:210–9.
Larsson H. Causation and prediction in child and adolescent mental health research. Wiley Online Library; 2021. p. e12026.
The computations and data handling were/was enabled by resources provided by the Swedish National Infrastructure for Computing (SNIC) at Uppmax partially funded by the Swedish Research Council through grant agreement no. 2018–05973. This project has received funding from the European Union’s Horizon 2020 research and innovation programme, Marie Sklodowska Curie Actions – MSCA-ITN-2016 – Innovative Training Networks under grant agreement No . WAA and AET received funding from the European Union’s Horizon 2020 research and innovation programme under the Marie Sklodowska-Curie grant agreement no. 721567. MeB is funded by an ERC Consolidator Grant (WELL-BEING 771057).
Open access funding provided by Karolinska Institute.
HL reports receiving grants from Shire Pharmaceuticals; personal fees from and serving as a speaker for Medice, Shire/Takeda Pharmaceuticals and Evolan Pharma AB; and sponsorship for a conference on attention-deficit/hyperactivity disorder from Shire/Takeda Pharmaceuticals and Evolan Pharma AB, all outside the submitted work. All other authors report no conflicts of interest.
The authors assert that all procedures contributing to this work comply with the ethical standards of the relevant national and institutional committees on human experimentation and with the Helsinki Declaration of 1975, as revised in 2008.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Tate, A.E., Akingbuwa, W.A., Karlsson, R. et al. A genetically informed prediction model for suicidal and aggressive behaviour in teens. Transl Psychiatry 12, 488 (2022). https://doi.org/10.1038/s41398-022-02245-w