Validation and recalibration of OxMIV in predicting violent behaviour in patients with schizophrenia spectrum disorders

Lamsma, Jelle; Yu, Rongqin; Fazel, Seena

doi:10.1038/s41598-021-04266-9

Download PDF

Article
Open access
Published: 10 January 2022

Validation and recalibration of OxMIV in predicting violent behaviour in patients with schizophrenia spectrum disorders

Jelle Lamsma^1,2,
Rongqin Yu³,
Seena Fazel³ &
Genetic Risk and Outcome of Psychosis (GROUP) investigators

Scientific Reports volume 12, Article number: 461 (2022) Cite this article

3554 Accesses
3 Citations
Metrics details

Subjects

Abstract

Oxford Mental Illness and Violence (OxMIV) addresses the need in mental health services for a scalable, transparent and valid tool to predict violent behaviour in patients with severe mental illness. However, external validations are lacking. Therefore, we have used a Dutch sample of general psychiatric patients with schizophrenia spectrum disorders (N = 637) to evaluate the performance of OxMIV in predicting interpersonal violence over 3 years. The predictors and outcome were measured with standardized instruments and multiple sources of information. Patients were mostly male (n = 493, 77%) and, on average, 27 (SD = 7) years old. The outcome rate was 9% (n = 59). Discrimination, as measured by the area under the curve, was moderate at 0.67 (95% confidence interval 0.61–0.73). Calibration-in-the-large was adequate, with a ratio between predicted and observed events of 1.2 and a Brier score of 0.09. At the individual level, risks were systematically underestimated in the original model, which was remedied by recalibrating the intercept and slope of the model. Probability scores generated by the recalibrated model can be used as an adjunct to clinical decision-making in Dutch mental health services.

Predicting suicide risk in 137,112 people with severe mental illness in Finland: external validation of the Oxford Mental Illness and Suicide tool (OxMIS)

Article Open access 18 April 2023

Predicting criminal and violent outcomes in psychiatry: a meta-analysis of diagnostic accuracy

Article Open access 09 November 2022

Schizophrenia polygenic risk score and 20-year course of illness in psychotic disorders

Article Open access 14 November 2019

Introduction

Structured tools for violence risk assessment are increasingly used to inform decisions around admission, discharge and treatment of psychiatric patients¹. However, their suitability for patients with schizophrenia spectrum disorders is questionable. The most widely used tools, such as the Historical, Clinical and Risk Management-20 (HCR-20), Violence Risk Appraisal Guide (VRAG) and Level of Service Inventory (LSI)², were developed in other populations and with methods that are now considered low quality (e.g., absence of a study protocol, small and selected samples, vague or undefined risk categories)³. Moreover, they have rarely been validated in patients with schizophrenia spectrum disorders and have low-to-moderate discrimination across psychiatric populations. A systematic review identified two tools, the HCR-20 and VRAG, that were validated in outpatients with schizophrenia spectrum disorders (twice each). The median areas under the curve (AUCs) had wide ranges (interquartile range [IQR] = 0.60–0.77). In outpatients with any diagnosis, the median AUCs of the 10 included tools ranged from 0.62 to 0.85⁴. Another systematic review found an aggregated median AUC of 0.69 (IQR = 0.62–0.75) for 7 tools validated in diagnostically heterogenous samples of inpatients⁵. None of the validation studies, in either review, reported calibration measures. Finally, current tools are resource intensive. They typically take several hours to complete and cost hundreds of dollars in manuals and training⁶. By contrast, other areas of medicine, in particular cardiology, have developed scalable tools that can be used by clinicians to discuss health risks with patients⁷.

A tool that overcomes many of these limitations is Oxford Mental Illness and Violence (OxMIV). It is freely available online (https://oxrisk.com/oxmiv-7/), relies on routinely collected information, and requires no formal training. The model was derived and externally validated in a total population cohort of over 75 000 Swedish individuals with schizophrenia spectrum or bipolar disorder. The candidate variables, statistical analyses and output were specified beforehand. Upon entry of the 16 items, OxMIV estimates the probability of violent offending within 1 year. This estimate is expressed as a percentage, capped at 20%. A classification of ‘low risk’ (< 5%) or ‘increased risk’ (≥ 5%) is also given. In external validation, OxMIV showed excellent discrimination—the AUC was 0.89 (95% confidence interval [CI] 0.85–0.93)—and calibration⁸. A recent study in Germany found moderate discrimination (AUC = 0.72) for the prediction of inpatient violence in a forensic setting⁹.

However, further studies are needed to validate OxMIV for different countries, care settings and forms of violent behaviour. The last are relevant because all violence (not solely incidents leading to arrest and conviction) has negative consequences, including treatment disruption, morbidity in victims, costs to services¹⁰ and stigmatisation of patients¹¹, and may accurately be predicted by OxMIV. Therefore, we have evaluated the performance of OxMIV in predicting interpersonal violence over a 3-year period in a Dutch sample of general psychiatric patients with schizophrenia spectrum disorders. We also explored the feasibility of adjusting OxMIV for this population and outcome.

Methods

Setting and participants

Data were collected as part of a larger research project, called Genetic Risk and Outcome of Psychosis (GROUP). The GROUP project was conducted by four university hospitals and affiliated mental healthcare centres (k = 36) in the Netherlands. These institutions are located in representative geographical areas of the country and provide access to psychiatric treatment in a variety of settings (e.g., psychiatric hospitals, outpatient clinics, residential care) to approximately 75% of the population. Throughout 2004, consecutive patients were invited to participate if they met the following criteria: (1) age between 16 and 50; (2) good command of the Dutch language; and (3) Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition, Text Revision (DSM-IV-TR)¹² diagnosis of schizophrenia or other non-affective psychotic disorder. Their parents and siblings were also invited. In total, 1013 patients, 907 parents and 1061 siblings enrolled. Assessments took place at the university hospitals, with a follow-up at 3 years. The protocol for the GROUP project was approved centrally by the ethics committee of the University Medical Centre Utrecht and implemented in accordance with relevant guidelines. All participants gave written informed consent before the first assessment.

Predictors and outcome

We selected variables whose definitions most closely matched those in the derivation study. The definitions of the predictors (Table S1) and the instruments used to measure them (Table S2) can be found in the supplement. The psychometric properties of the instruments and training of research personnel have been described elsewehere¹³. The patients themselves provided information on all predictors, apart from ‘parental drug or alcohol misuse’ (parents), ‘parental violence’ (parents) and ‘sibling violence’ (siblings). For the predictors ‘parental violence’, ‘sibling violence’ and ‘personal income’, data were only collected at three of the four university hospitals. We excluded cannabis from the predictors ‘previous drug misuse’ and ‘parental alcohol or drug misuse’, as their prevalence would otherwise have been considerably higher than in the derivation sample (46% vs. 12% and 20% vs. 11%, respectively). Furthermore, we have previously shown that, in the current sample, the contribution of cannabis misuse to violence is small and nonsignificant when adjusted for background factors¹⁴. A possible explanation for both observations is that, unlike in Sweden, cannabis use is not criminalised in the Netherlands.

The outcome was physical abuse of another person (i.e. interpersonal violence) during the 3 years of follow-up, ascertained from clinical case notes and patient interviews. The definition (physical abuse vs. violent offending) and time period (3 years vs. 1 year) thus differed from the outcome OxMIV is designed to predict.

Statistical analysis

We aimed to validate and, if necessary, update OxMIV for a different population (i.e., general psychiatric patients with schizophrenia spectrum disorders in the Netherlands) and outcome (i.e., interpersonal violence over 3 years). For model updating, we followed an incremental strategy suggested previously^15,16. This strategy involves up to three steps: (1) recalibrating the intercept; (2) recalibrating the intercept and slope; and (3) re-estimating one or more coefficients. Performance was assessed in terms of calibration, both ‘in the large’ (with the ratio between predicted and observed events across the sample and the Brier score) and at the individual level (through calibration plots), and discrimination. We calculated the following discrimination metrics: AUC, sensitivity, specificity, and positive (PPVs) and negative (NPVs) predictive values. Wilson’s method¹⁷ was used to construct 95% confidence intervals around the last four of these. Discrimination was also visualised with a receiver operating characteristic (ROC) curve. To aid interpretation, we presented results for the following probability thresholds: 5% (the default), 10%, 15% and 20% (the cap).

Based on its distribution in the Dutch population, the predictor ‘personal income’ was converted into deciles. Patients with ‘unstable’ incomes belonged to deciles 1–3, and those with ‘stable’ incomes to deciles 4–10¹⁸. The corresponding model coefficients were averaged. Since the predictor ‘recent (substance) dependence treatment’ was not available, we assigned the derivation sample proportion to all patients. The same was done with patients who had ever been admitted to a psychiatric hospital for the predictor ‘currently an inpatient’. Others were assumed to be outpatients. For partially missing predictors, we used multiple imputation by chained equations. As recommended, the outcome was excluded from the imputation models¹⁸. We averaged values across 20 imputations. Among the predictors measured at all sites, proportions of missing data were modest (≤ 13%) (Table S3). Insofar data were missing due to local practice, they can reasonably be assumed to be missing at random. Missingness on most predictors correlated significantly (p < 0.05) with values on at least one other predictor (Table S4).

Outcome data were available for 637 (63%) patients. As outcomes should not be imputed in external validation¹⁹, these patients formed the sample used in the analyses. They typically had attained a higher level of education (χ² [2] = 12.18, p = 0.002) and were less likely to receive benefits (χ² [1] = 5.42, p = 0.020) than patients without outcome data. No significant differences were observed for any of the other predictors (Table S5).

Analyses were carried out in SPSS 21.0 and Stata 12.1. We adhered to the TRIPOD statement for validation studies²⁰.

Results

Table 1 outlines summary statistics for the predictors in the patient sample (N = 637). Patients were mostly male (n = 493, 77%) and, on average, 27 (SD = 7) years old. Previous violence (n = 115, 21%) and drug misuse (n = 118, 20%) were each present in about one in five patients. Almost all patients (n = 565, 95%) had taken antipsychotics in the past 6 months. Fifty-nine (9%) patients physically assaulted another person during the 3 years after baseline.

Table 1 Summary statistics for the predictors in the current sample (N = 637).

Full size table

Discrimination, as measured by the AUC, was moderate at 0.67 (95% CI 0.61–0.73). The ROC curve is shown in Fig. 1. The original model had low sensitivity (25%) and high specificity (90%) at the default threshold of 5%. The same pattern was observed for the PPV (21%) and NPV (92%) (Table 2). Calibration-in-the-large was satisfactory, with a ratio between predicted and observed events of 1.2 and a Brier score of 0.09 (Table 3). At the individual level, however, risks were systematically underestimated. This was remedied by recalibration of the intercept and slope (updating step ii) (Fig. 2 and, for the model formula, Table S6). Re-estimation of coefficients (updating step iii) was therefore not necessary. When using a threshold of 10%, the model with the recalibrated intercept and slope also offered a better balance between sensitivity (47%) and specificity (73%) than the original model (Table 2).

Table 2 Discrimination metrics for the original and recalibrated models.

Full size table

Table 3 Calibration-in-the-large of the original and recalibrated models.

Full size table

Discussion

In a Dutch sample of 637 general psychiatric patients with schizophrenia spectrum disorders, we evaluated the performance of a newly developed risk assessment tool (OxMIV) in predicting interpersonal violence over 3 years. We found OxMIV performed moderately well, especially considering that it is designed to predict a different outcome (i.e., violent offending within 1 year). The broader definition of the outcome and longer follow-up period may partly explain why we obtained a lower AUC (0.67, 95% CI 0.61–0.73) than previous validation studies of OxMIV^7,8. At the same time, it is comparable to AUCs reported by validation studies of other more resource-intensive tools^4,, and the current study is an external validation using a different clinically informative outcome. Furthermore, unlike other tools, where calibration has not been reported, OxMIV demonstrated good calibration in the large. In addition, we showed that the performance of OxMIV can be optimised with model updating: calibration at the individual level was adequate after recalibration of the intercept and slope. This is important methodologically as it provides an approach to test the performance of prediction models and risk assessment tools for a different outcome than in their derivation/development studies.

Strengths of this study include the representativeness of the sample, use of multiple data sources for the outcome, prespecification of the methods, and presentation of a wide range of performance measures. However, there are some limitations. First, most predictors were defined differently than in the derivation study (Table S1). The distribution of predictors differed as well. Of note, the proportion of men was larger (77% vs. 49%), mean age lower (27 years vs. 44 years) and recent treatment with antipsychotics more common (95% vs. 54%) in the current sample (Table 1) than in the derivation sample (Table S7). These differences may have hampered the performance of OxMIV. At the same time, they reflect the profile of patients presenting at mental health services in the Netherlands where information is not always available to align predictors exactly with those in the derivation study, and external validations with patient groups with different baseline characteristics provide evidence whether a tool's performance can be maintained in real-world clinical settings and practice. Another limitation was the relatively small number of patients with the outcome. It has been suggested that ≥ 100 events are required to reliably measure predictive accuracy²¹. For this reason, the findings may be considered preliminary rather than definitive. Finally, missing data may have introduced bias. However, multiple imputation would have reduced this bias in the predictors²², and patients with and without outcome data were similar on nearly all predictors.

The findings suggest that OxMIV is suitable for predicting violent behaviour in Dutch patients with schizophrenia spectrum disorders. Clinicians are advised to use the probability scores generated by the model with the recalibrated intercept and slope, as it had the best individual-level calibration and discrimination was worse at the chosen thresholds. This revised model can be accessed on the OxRisk website (https://oxrisk.com). The original model can be used to screen patients for low risk of violence (< 5%), as facilitated by the high NPV (92%). The low PPV (21%) suggests that patients should be assessed further if classified as high risk (≥ 5%). There remains a need for validation studies in which variable definitions more closely match those in the derivation study and the number of events is higher. Comparing the performance of OxMIV against other tools or investigating its clinical feasibility could also be considered.

Data availability

Supporting data for this study are not available, as the participants did not agree for these to be shared publicly.

References

Douglas, T., Pugh, J., Singh, I., Savulescu, J. & Fazel, S. Risk assessment tools in criminal justice and forensic psychiatry: The need for better data. Eur. Psychiatr. 42, 134–137 (2017).
Article CAS Google Scholar
Hurducas, C. C., Singh, J. P., de Ruiter, C. & Petrila, J. Violence risk assessment tools: A systematic review of surveys. Int. J. Forensic Ment. Health 13, 181–192 (2014).
Article Google Scholar
Fazel, S. & Wolf, A. Selecting a risk assessment tool to use in practice: A 10-point guide. Evid.-Based Ment. Health 21, 41–43 (2017).
Article PubMed PubMed Central Google Scholar
Singh, J. P., Serper, M., Reinharth, J. & Fazel, S. Structured assessment of violence risk in schizophrenia and other psychiatric disorders: A systematic review of the validity, reliability, and item content of 10 available instruments. Schizophr. Bull. 37, 899–912 (2011).
Article PubMed PubMed Central Google Scholar
Ramesh, T., Igoumenou, A., Vazquez Montes, M. & Fazel, S. Use of risk assessment instruments to predict violence in forensic psychiatric hospitals: A systematic review and meta-analysis. Eur. Psychiatr. 52, 47–53 (2018).
Article Google Scholar
Viljoen, J. L., McLachlan, K. & Vincent, G. M. Assessing violence risk and psychopathy in juvenile and adult offenders: A survey of clinical practices. Assessment 17, 377–395 (2010).
Article PubMed Google Scholar
Cooney, M. T., Dudina, A. L. & Graham, I. M. Value and limitations of existing scores for the assessment of cardiovascular risk: A review for clinicians. J. Am. Coll. Cardiol. 54, 1209–1227 (2009).
Article PubMed Google Scholar
Fazel, S. et al. Identification of low risk of violent crime in severe mental illness with a clinical prediction tool (Oxford Mental Illness and Violence tool [OxMIV]): A derivation and validation study. Lancet Psychiatr. 4, 461–468 (2017).
Article Google Scholar
Negatsch, V., Voulgaris, A., Seidel, P., Roehle, R. & Opitz-Welke, A. Identifying violent behavior using the Oxford mental illness and violence tool in a psychiatric ward of a German prison hospital. Front. Psychiatr. 10, 264 (2019).
Article Google Scholar
Senior, M., Fazel, S. & Tsiachristas, A. The economic impact of violence perpetration in severe mental illness: A retrospective, prevalence-based analysis in England and Wales. Lancet Public Health 5, e99-106 (2020).
Article PubMed PubMed Central Google Scholar
Torrey, E. F. Stigma and violence: Isn’t it time to connect the dots? Schizophr. Bull. 37, 892–896 (2011).
Article PubMed PubMed Central Google Scholar
American Psychiatric Association. Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition, Text Revision (Author, 2000).
Korver, N. et al. Genetic risk and outcome of psychosis (GROUP), a multi-site longitudinal cohort study focused on gene-environment interaction: Objectives, sample characteristics, recruitment and assessment methods. Int. J. Methods Psychiatr. Res. 21, 205–221 (2012).
Article PubMed PubMed Central Google Scholar
Lamsma, J., Cahn, W., Fazel, S., GROUP and NEDEN investigators. Use of illicit substances and violent behaviour in psychotic disorders: Two nationwide case-control studies and meta-analyses. Psychol. Med. 50, 2028–2033 (2020).
Article PubMed Google Scholar
Steyerberg, E. W. Clinical Prediction Models: A Practical Approach to Development, Validation, and Updating (Springer, 2009).
Su, T.-L., Jaki, T., Hickey, G. L., Buchan, I. & Sperrin, M. A review of statistical updating methods for clinical prediction models. Stat. Methods Med. Res. 27, 185–197 (2018).
Article MathSciNet Google Scholar
Wilson, E. B. Probable inference, the law of succession, and statistical inference. J. Am. Stat. Assoc. 22, 209–212 (1927).
Article Google Scholar
Statistics Netherlands. Welvaart in Nederland 2019 [Prosperity in the Netherlands 2019] (Statistics Netherlands, 2019).
Hoogland, J. et al. Handling missing predictor values when validating and applying a prediction model to new patients. Stat. Med. 39, 3591–3607 (2020).
Article MathSciNet PubMed PubMed Central Google Scholar
Collins, G. S., Reitsma, J. B., Altman, D. G. & Moons, K. G. Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): The TRIPOD statement. BMC Med. 13, 1 (2015).
Article PubMed PubMed Central Google Scholar
Steyerberg, E. W. Validation in prediction research: The waste by data-splitting. J. Clin. Epidemiol. 103, 131–133 (2018).
Article PubMed Google Scholar
Pedersen, A. B. et al. Missing data and multiple imputation in clinical epidemiological research. Clin. Epidemiol. 15, 157–166 (2017).
Article Google Scholar

Download references

Acknowledgements

The GROUP project was supported by the Geestkracht program of The Netherlands Organization for Health Research and Development (Grant number 10-000-1001) and matching funds from the coordinating university hospitals (Academic Medical Centre, Maastricht University Medical Centre, University Medical Centre Groningen and University Medical Centre Utrecht), their affiliated mental healthcare institutions (Altrecht, Arkin, Delta, Dimence, Dijk en Duin, Erasmus University Medical Centre, GGNet, GGZ Breburg, GGZ Centraal, GGZ Drenthe, GGZ Eindhoven en De Kempen, GGZ Friesland, GGZ inGeest, Mondriaan, GGZ Noord-Holland-Noord, GGZ Oost-Brabant, GGZ Overpelt, GGZ Rivierduinen, Lentis, Mediant GGZ, Met GGZ, Parnassia Psycho-Medical Centre, Psychiatric Centre Ziekeren, Psychiatric Hospital Sancta Maria, Public Centre for Mental Health Rekem, The Collaborative Antwerp Psychiatric Research Institute, Vincent van Gogh voor Geestelijke Gezondheid, Virenze riagg, University Psychiatric Centre Sint Jozef, Yulius and Zuyderland GGZ) and participating pharmaceutical companies (Lundbeck, AstraZeneca, Eli Lilly and Janssen Cilag). S.F. is funded by a Wellcome Trust Senior Research Fellowship in Clinical Science (202836/Z/16/Z). We are grateful to the patients and their family members who generously took the time and effort to participate in the GROUP project. We also would like to thank all research personnel involved, and in particular: Joyce van Baaren, Erwin Veermans, Ger Driessen, Truda Driesen, Karin Pos, Erna van 't Hag, Jessica de Nijs, Atiqul Islam, Wendy Beuken and Debora Op't Eijnde.

Author information

A list of authors and their affiliations appears at the end of the paper.

Authors and Affiliations

Department of Criminology and Sociology, Middlesex University, London, UK
Jelle Lamsma
Department of Psychiatry, University Medical Centre Utrecht, Utrecht, The Netherlands
Jelle Lamsma & Wiepke Cahn
Department of Psychiatry, Warneford Hospital, University of Oxford, Oxford, UK
Rongqin Yu & Seena Fazel
Department of Psychiatry and Neuropsychology, Maastricht University Medical Centre, Maastricht, The Netherlands
Therese van Amelsvoort & Claudia Simons
University Centre for Psychiatry, University of Groningen, Groningen, The Netherlands
Agna Bartels-Velthuis
Department of Psychiatry, Academic Medical Centre, Amsterdam, The Netherlands
Lieuwe de Haan & Frederike Schirmbeck
Arkin Institute for Mental Health, Amsterdam, The Netherlands
Lieuwe de Haan & Frederike Schirmbeck
GGzE Institute for Mental Health Care, Eindhoven, The Netherlands
Claudia Simons

Authors

Jelle Lamsma
View author publications
You can also search for this author in PubMed Google Scholar
Rongqin Yu
View author publications
You can also search for this author in PubMed Google Scholar
Seena Fazel
View author publications
You can also search for this author in PubMed Google Scholar

Consortia

Genetic Risk and Outcome of Psychosis (GROUP) investigators

Therese van Amelsvoort
, Agna Bartels-Velthuis
, Wiepke Cahn
, Lieuwe de Haan
, Frederike Schirmbeck
& Claudia Simons

Contributions

J.L., S.F. and R.Y. designed the study. J.L. performed the analyses under the supervision of S.F. and R.Y. J.L. drafted the manuscript, and the other authors critically revised it.

Corresponding author

Correspondence to Seena Fazel.

Ethics declarations

Competing interests

S.F. was part of the study team that developed OxMIV. He has not received any compensation for its development, use or translation, and will not receive any compensation for its future use in the Netherlands. The other authors report no potential conflict of interest.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Lamsma, J., Yu, R., Fazel, S. et al. Validation and recalibration of OxMIV in predicting violent behaviour in patients with schizophrenia spectrum disorders. Sci Rep 12, 461 (2022). https://doi.org/10.1038/s41598-021-04266-9

Download citation

Received: 02 November 2021
Accepted: 16 December 2021
Published: 10 January 2022
DOI: https://doi.org/10.1038/s41598-021-04266-9

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.