Introduction

Less than 50% of schizophrenia patients respond to an initial treatment with antipsychotic medication.1, 2 This is not surprising as most of the current antipsychotic medications used in the treatment of schizophrenia were discovered empirically based on serendipitous clinical observations and reverse pharmacology.3 Therefore, there is not sufficient understanding of the underlying pathophysiology of schizophrenia to inform diagnosis. Furthermore, traditional pharmacotherapy for schizophrenia using ‘blockbuster’ drugs usually leads to the administration and switching of drugs multiple times until an adequate response is achieved. It is perhaps not surprising that there is a low treatment response rate and a high rate of non-compliance often resulting in relapse.4 Thus, reliable tests with a sound biological rationale to guide the treatment selection are needed, as recently suggested in the case of antidepressant research and development.5

The adoption of new, molecule-based algorithms has not been common practice in neuropsychiatric research. However, this is likely to change considering the increased drive for the improved clinical management and benefit/cost ratios in schizophrenia treatment, as well as the Food and Drug Administration's integrated objective for personalized medicine approaches in managing diseases.6 Cost containment for clinical trials will also be a sufficient incentive for pharmaceutical companies to pursue this approach.7

We have previously used multiplexed immunoassays to show that schizophrenia patients have a distinctive molecular signature in blood serum.8, 9 In this study, we have used the same method in an attempt to identify molecular serum signatures that could predict response and subsequent relapse in 77 schizophrenia patients. The study had four main aims: (1) to identify molecules associated with symptom severity in antipsychotic naive and unmedicated patients, (2) to determine biomarker signatures that could predict response over a 6-week treatment period, (3) to identify molecular panels that could predict the time to relapse in a cross-sectional population of patients in remission and (4) to investigate how the biological relapse signature changed throughout the treatment course. For this purpose, 77 initially acutely psychotic schizophrenia patients were treated for 6 weeks and followed up until relapse occurred within a 2-year time period.

Methods

Clinical samples

The institutional ethical committee of the University of Magdeburg approved the protocols of the study, informed written consent given by all participants and studies were conducted according to the Declaration of Helsinki. All diagnoses were carried out using the Diagnostic and Statistical Manuel (DSM)-IV, and clinical tests were performed by psychiatrists under good clinical practice-compliance to minimize variability. Subjects were diagnosed as having the paranoid subtype of schizophrenia (295.30). The exclusive focus on paranoid schizophrenia, which comprises the most prevalent subtype of the illness, was intended to minimize variability. Out of 77 patients, 36 were in the first stage of illness and antipsychotic naïve at the start of the study (Table 1). The remaining 41 patients had been antipsychotic-free for at least 6 weeks. At T0, the average disease duration was 6.32±8.63 years. All subjects were treated as inpatients over the first 6 weeks of treatment. Clinicians had access to all detailed clinical files including medical histories by proxy and referral letters from the general practitioners. Any patients with other medical conditions such as type 2 diabetes, hypertension, cardiovascular or autoimmune diseases were excluded. Patients with autoimmune diseases were excluded to rule out psychosis secondary to disorders like systemic lupus erythematodes or multiple sclerosis, which may cause schizophrenia-like psychotic symptoms but in the view of the authors rather resemble organic schizophreniform psychosis than schizophrenia.

Table 1 Demographic details of schizophrenia patients at T0 and at the time of relapse

During the first 6 weeks of treatment, where all subjects were treated as inpatients, therapeutic drug monitoring from peripheral blood samples was applied in order to assess the compliance.

Blood samples were collected from all subjects by venous puncture into S-Monovette 7.5 ml serum tubes (Sarstedt; Numbrecht, Germany) at the start of the study (T0), after the 6 week treatment period (T6), at the last clinical visit before relapse (TR) and during relapse (Figure 1). Serum was prepared by placing samples at room temperature for 2 h to allow blood coagulation, followed by centrifugation at 4000 g for 5 min to remove particulate material. The resulting supernatants were stored at −80 °C in Low Binding Eppendorf tubes (Hamburg, Germany) before the analysis.

Figure 1
figure 1

Schematic diagram showing time of sampling of schizophrenia patients treated with antipsychotics. Subjects (n=77) were treated for 6 weeks (T0–T6) with antipsychotics. After this, 0–24 weeks elapsed until the last clinical visit (TR) of the 18 patients who relapsed after an additional 3 weeks to 25 months. Blood samples were drawn from subjects, serum prepared at the indicated time points and the levels of 190 proteins and small molecules were measured by multiplex immunoassay.

Multiplex immunoassay

The HumanMAP immunoassay platform was used to measure the serum concentrations of 191 proteins and small molecules (Supplementary Table 1) in a Clinical Laboratory Improvement Amendments-certified laboratory at Rules Based Medicine (Austin, TX, USA) as described previously.9 Samples were randomized and blinded to analysts using code numbers until all biochemical assays were complete. Assays were calibrated using standards, raw intensity measurements converted to absolute protein concentrations by comparison to the standards, and performance was verified using quality control samples. Data analyses were performed using the statistical software package R (http://www.r-project.org). The protocol for the study participants, clinical samples and test methods was carried out in compliance with the Standards for Reporting of Diagnostic Accuracy (STARD) initiative.10

Data analysis

The molecular data set was preprocessed by filtering out all analytes, which contained measurements outside the limits of quantitation in more than 30% of samples. After this procedure, 168 out of 191 analytes remained in the data set, and the proportion of missing values dropped from 16.7 to 6.8%. To assess data quality, Principal Component Analysis (SIMCA-P+ vs 12.0, Umetrics, Umeå, Sweden) was used for the detection of strong effects on the overall variance in the filtered data set. No such effects could be identified across the first 10 principal components, which covered 44% of the variance in the data set. Shapiro–Wilk analysis showed that the majority of the investigated analytes (97%) were significantly (P<0.05) non-normally distributed. Outliers in each analyte as well as response variables were excluded if a given concentration differed by more than two standard deviations from the mean of the respective analyte. For the different analyses described in the present study, this resulted in the removal of 3.2–4.1% of subjects per analyte, on average. Outliers identified in this fashion did not substantially affect results, as repetition of all analyses without exclusion of outliers led to qualitatively identical results.

Significant associations were determined using non-parametric Spearman's correlation tests and adjusted for false discovery rate.11 Comparisons between the two groups were based on the non-parametric Wilcoxon Rank sum test or the paired Wilcoxon Rank sum test for within-subject comparisons. Comparisons between three or more groups were performed using the non-parametric Kruskal–Wallis test. Analysis of co-variance (ANCOVA) was used to account for ‘regression to the mean’ effects between baseline and follow-up measurements.12 ANCOVA was also used to estimate the effect of body mass index (BMI) on molecular differences between the groups. For ANCOVA, all variables were log10-transformed to approximate normality. Unadjusted P values of less than 0.05 were considered as significant.

To identify a molecular fingerprint that could predict patient responses such as time to relapse, we applied Random Forests analysis.13 This technique works by building multiple decision trees and using a majority decision of classification outputs among trees to assign patients to one of two groups based on the levels of serum molecules. This method was also used to select the most important molecules for the prediction. For this purpose, the measured values of each molecule were permuted sequentially and the variable importance was determined from the impact of such randomization on the classification output.

Results and Discussion

Experimental design

As already mentioned in the introduction, Multiplex immunoassay analysis was carried out in four stages (Figure 1):

  • The first stage was aimed at the identification of molecules that correlated with symptom severity in untreated schizophrenia patients. For this purpose, serum was obtained from 77 patients who were experiencing an episode of acute psychosis (T0). At this time-point, all patients were unmedicated or antipsychotic naive. All 77 patients were followed up over a subsequent 6-week treatment period with antipsychotic medication.

  • The change in symptom severity between T0 and the end of the 6 week follow-up period (T6) was the main outcome variable for the second stage of this study. Here, we tried to identify molecules in the T0 samples that predicted the change in the symptom severity between T0 and T6.

  • For the third stage of the study, we measured serum molecules at TR, the last clinical visit before patients relapsed. At this time-point patients were in remission with different durations of antipsychotic treatment. TR was chosen since patients at this time-point reflected the real clinical scenario of a heterogeneous patient population for which molecular relapse predictors would be most useful. The aim of this stage was to identify a molecular signature that could predict the time to relapse for subjects in this cross-sectional group.

  • During the fourth stage, we investigated how the molecular relapse signature changed throughout the treatment period.

Patient characteristics and psychopathology scores

Patient characteristics are indicated in Table 1. As the symptom severity was the main outcome variable for stages 1–3 of this study, we tested whether this was associated with any of the remaining recorded patient characteristics. This was done to identify covariates that could influence the identification of potential biomarkers. At T0, no significant associations between PANSS scores and the remaining patient characteristics were found (Table 2). Secondly, there were no significant associations between patient characteristics that were not related to symptom severity and the change in PANSS between T0 and T6 (ΔPANSS) (Table 2).

Table 2 Association between patient characteristics at T0 and symptom severity

ΔPANSS quantifies the symptom improvement that patients experienced after 6 weeks of antipsychotic treatment. On average, positive PANSS scores decreased from 23.0 to 12.7 (P<0.001; paired Wilcoxon test) and negative PANSS scores from 19.3 to 14.4 (P<0.001) (Figure 2). Patients with more severe symptoms at T0 improved more than those with less severe symptoms. This is likely to have been influenced by ‘regression to the mean’, as patients with severe symptoms are statistically more likely to have a reduction in these symptoms at the follow-up time-point.14 Therefore, Δ positive PANSS was significantly correlated to the T0 PANSS values (P<0.001, ρ=0.77; Spearman correlation) (Figure 2). This was also the case for negative PANSS scores (P<0.001, ρ=0.65). As a consequence of the difference in choice of antipsychotic medication for patients with differing symptom severity, patients treated with quetiapine improved significantly less than those treated with olanzapine or risperidone (positive PANSS: P=0.001, negative PANSS: P=0.101, Kruskal–Wallis test). Due to the higher risk of weight gain associated with olanzapine treatment,15 patients who were antipsychotic naive at T0 experienced a higher but not significant increase in BMI until T6 compared to patients treated with other medications (ΔBMIOLA=1.1 in olanzapine group vs ΔBMIOTHER=0.63 in remaining patients, P=0.200, Wilcoxon test). This effect was absent in patients who were not antipsychotic-naive at T0 (ΔBMIOLA=0.7, ΔBMIOTHER=0.8, P=0.845), potentially due to the fact that patients who had already been treated with olanzapine had also a higher BMI compared with the remaining patients in this group (P=0.030, Wilcoxon test). Across all 77 patients, there was also no significant difference between antipsychotic medications in the effect on ΔBMI during the first 6 weeks of treatment, when patients were treated as inpatients (P=0.563, Kruskal–Wallis test).

Figure 2
figure 2

Symptom severity measured using PANSS scores during the first 6 weeks of treatment (T0-T6). PANSS positive (top left panel) and negative (bottom left panel) item scores improved significantly over the treatment period. The lines between T0 and T6 connect scores of the same patients. Right panels: the change in positive (top) and negative (bottom) PANSS scores were correlated to the corresponding symptom severity at T0.

Overall, there was a greater decrease in positive symptoms (64%) compared with that seen for negative symptoms (40%), and this was reflected in the number of patients who showed these responses. For positive symptoms, 52 patients improved by more than 50%. For negative symptoms, only 33 out of the 77 patients showed this improvement. The finding that T0 positive and negative PANSS scores were not significantly correlated to each other (P=0.286; Spearman correlation) suggested that these parameters were independent. Likewise, Δ positive and Δ negative PANSS scores were not significantly correlated (P=0.094, ρ=0.19; Spearman test).

Stage 1: identification of biomarkers of symptom severity at T0

During the first stage of the analysis, we attempted to identify any associations between T0 PANSS scores and T0 levels of serum molecules. A total of 18 molecules were significantly correlated with positive T0 PANSS scores (14 with ρ>0) and 8 molecules (4 with ρ>0) were associated with negative T0 PANSS scores (P<0.05; Spearman correlation; Table 3 shows those associations with P<0.01). The significant changes included increased levels of the acute phase response pathway proteins myoglobin, ferritin, C-reactive protein and interleukin 16, which have already been implicated in the pathophysiology of schizophrenia.16 In addition, prolactin levels showed a significant negative correlation (ρ=−0.27, P=0.022) with positive PANSS, indicating that patients with higher prolactin levels had lower symptom severity at T0. This is interesting as prolactin has been used as a biomarker of antipsychotic drug administration for many years,17 and we have shown recently that levels of this protein are elevated in serum from first onset antipsychotic naïve schizophrenia patients.18

Table 3 Significant associations between molecular levels at T0 and psychopathology scores

Stage 2: identification of biomarkers for response prediction (ΔPANSS)

In the second stage of this study, we attempted to identify molecules that could predict the improvement in PANSS scores between T0 and the end of the 6 weeks treatment period (T6). For this purpose, we determined associations between T0 molecular levels and ΔPANSS. Due to the ‘regression to the mean’ affecting ΔPANSS, we performed ANCOVA analysis using T0 PANSS levels as covariate.12 This showed that no molecule was significantly correlated with Δ positive PANSS and only insulin showed a weak association with Δ negative PANSS (P=0.006, Q=0.47, Table 3). Lower insulin levels at T0 were predictive of symptom improvement after antipsychotic treatment. This is consistent with previous studies, which have shown that antipsychotic-induced effects such as insulin resistance and weight gain are associated with and may even be required for symptom improvement in response to antipsychotic treatment.19, 20

Stage 3: identification of biomarkers for prediction of relapse

Univariate predictors of relapse

The next stage of the analysis was aimed at the identification of a molecular signature that could predict the time until patients in remission would relapse. For this purpose, we analyzed the sera of 18 patients who relapsed after the treatment course (Table 1). Six patients were off medication at TR and all but three patients were off medication at the time of relapse. The time to relapse was quantified as the time between the last clinical visit (TR) and relapse. These values were dichotomized such that patients below the median (8.3 months) were partitioned into a short-term relapse group (n=9) and those above the median were assigned to the long-term relapse group (n=9). The mean times to relapse in the short- and long-term relapse groups were 3.7 and 13.2 months, respectively. The classification into short-term and long-term relapse groups was similar between the scenarios when time to relapse was determined from either T0 or TR (16 out of 18 subjects stayed in the same class). This showed that the assignment of patients to relapse groups was reasonably consistent throughout the treatment.

To avoid the effect of potential confounding factors on biomarkers during this stage, we tested the association between the two relapse groups and all recorded patient characteristics at TR. Subjects in the short-term relapse group had significantly lower BMIs compared with those in the long-term relapse group (24±4 and 28±4 kg m−2, respectively; P=0.040; Wilcoxon test) (Figure 3). This could have been caused by non-compliance of patients in the short-term relapse group. The difference in BMIs did not reach significance at the actual time of relapse (P=0.0625). The two groups did not differ in age (P=0.232, Wilcoxon test), gender (P=0.577, Fisher's exact test), PANSS positive (P=0.894, Wilcoxon test) or PANNS negative (P=0.824) scores. We found similar distributions of medication prescribed at T0 (the number of subjects receiving olanzapine/olanzapine+risperidone/risperidone/quetiapine/haloperidol/none was 2/2/2/2/0/1 in the short-term relapse group and 5/0/1/1/1/1 in the long-term group). There was no significant difference in time to relapse between olanzapine-treated patients and those receiving other medications (P=0.348 in short-term and P=0.111 in long-term relapse group, P=0.717 in all 18 patients). Also, we did not find an association of time to relapse with medication status at TR (P=0.708, Wilcoxon test). This indicated that the six patients who were off medication at TR were equally distributed across the short- and long-term relapse groups. Furthermore, there was no difference in time to relapse between patients who were antipsychotic naive or unmedicated at T0 (P=0.328, Wilcoxon test). This showed that no clinical feature except for BMI was indicative of the time to relapse at the last clinical visit. Although time to relapse was not associated with medication status at the last clinical visit, there is a possibility that the difference in relapse time was associated with non-compliance rather than the molecular factors reflecting response. This is addressed specifically in stage 4.

Figure 3
figure 3

Differences of BMI and the levels of leptin, proinsulin and TGF-α between the short- and long-term relapse groups. The measurements were taken at the last clinical visit before relapse (TR).

To identify molecules associated with the time of relapse, we compared the levels of all molecules at TR with respect to subjects who ended up in the short- or long-term relapse groups using non-parametric Wilcoxon tests. This analysis showed that 27 molecules were significantly different between the groups (P<0.05, Q<0.31). To determine if the significant difference in BMI values at TR had any effect on the levels of these molecules, we performed ANCOVA. A total of 13 of these molecules remained significant (P<0.05) after this analysis and 11 of the molecules were also significant using non-parametric Wilcoxon tests (Table 4). The three molecules with the greatest differences between the short- and long-term relapse groups were leptin, proinsulin and transforming growth factor (TGF)-alpha (Figure 3).

Table 4 Significant molecular differences at the last clinical visit before relapse (TR) between the short- and long-term relapse groups

The significant association between time to relapse and BMI, and the finding of higher BMIs in the long-term relapse group, provided further evidence for the importance of metabolic changes such as weight gain in antipsychotic treatment response.21, 22 Also, leptin and proinsulin have been used previously as biomarkers for insulin resistance23 and metabolic syndrome.24 These findings raise the interesting possibility that schizophrenia patients who experience little or no increase in body weight or in the levels of these molecules in response to antipsychotic treatment are more likely to relapse. Previous studies have indicated that the levels of leptin25 and insulin26 are positively correlated with BMI in patients treated with antipsychotics. However, it is not clear whether the changes in these two hormones are involved in the mechanism of action of antipsychotics, if they are due to side effects or if they are a consequence of changes that occur in the brain. Previous studies have found that high levels of insulin can lead to impaired cognition and administration of insulin-sensitizing agents can reverse this effect.27 Also, it has been shown that leptin levels are inversely correlated with grey matter volumes in obese subjects, suggesting that high levels of this hormone can also have negative effects on the brain function.28

The final objective of stage 3 was an attempt to predict the time to relapse using serum molecular signatures from the last clinical visit (TR). The Random Forests technique was used to select molecules that were the most important determinants for the prediction of relapse time (short- or long-term) and for determining the accuracy of this prediction. This led to the selection of a group of molecules comprised of leptin, proinsulin, TGF-α, β-cellulin, CD5L, CD40 and Apo CI, which were also significant in the univariate analysis (see above), as well as clusterin, insulin, interleukin-8, MIP-1-β and matrix metalloproteinase 3. Prediction testing using this combination of molecules yielded an accuracy of 94.5% in determining whether subjects were in the short- or long-term relapse groups.

We also tested the BMI values at the last clinical visit (TR) as a potential predictor of relapse time. Interestingly, another round of Random Forest analysis did not result in selection of BMI as an important variable, although 11 out of the 12 molecules indicated above were reselected (excluding matrix metalloproteinase 3). BMI alone achieved a predictive performance of 83.4%, which was lower than that achieved using the signature of 12 molecules indicated above.

Stage 4: changes in the relapse biomarker signature induced by treatment

In the final stage of this study, we performed an exploratory analysis into the change in molecular predictors of relapse throughout the treatment period. For 15 of the 18 patients who experienced relapse, samples were available at T0 and T6. Similar to the findings at TR, patients in the short-term relapse group (n=7) had a lower BMI at T0 compared to those in the long-term relapse group (n=8), although this difference was not significant (P=0.094; Wilcoxon test). At T0, 8 molecules showed significantly different levels between the short- and long-term relapse groups (P<0.05, Wilcoxon test), including leptin (P=0.040), although (Supplementary Table 1) all of these molecules had a high false discovery rate. This demonstrated that at T0 there were no pronounced molecular differences between the two relapse groups.

Therefore, we tested the multivariate biomarker signature identified above to determine if this was altered at T0. A Random Forests algorithm comprising these molecules was tested using T0 samples from the same patients. All patients who were detected in the short-term relapse group at TR were also identified at T0. Interestingly, the molecular signature at T0 also predicted imminent relapse for 50% of the patients (4 out of 8) in the long-term relapse group. Although based on a low n-number, this may suggest that the molecular state at T0 was more predictive of relapse than when measured at TR. This could indicate that at T0, when patients were unmedicated and acutely psychotic, their molecular profile was suggestive of imminent relapse. During the treatment the molecular profile might have responded in those patients, who would only relapse after a longer period of time.

With this in mind, we analyzed whether serum molecules of subjects in short- and long-term treatment groups showed differences in early response to medication. For this purpose, the change in serum levels of molecules between T0 and T6 was compared between the two relapse groups. ANCOVA analysis was performed using baseline molecular levels as covariates to prevent a potential ‘regression to the mean’ on results.12 We found that eight molecules were associated with the time of relapse (P<0.05). For example, the early treatment response of leptin, insulin and C-peptide were significantly different between the relapse groups (P<0.041; Q=0.52, Figure 4). Levels of these three molecules showed essentially no change in the group that relapsed early, although these were increased significantly in patients who relapsed later. Other molecules that showed a differential early treatment response between the relapse groups are listed in Supplementary Table 2. In addition, the short-term relapse group had a lower BMI increase (ΔBMI=0.28 kg m−2) between T0 and T6 compared with the patients who relapsed later (ΔBMI=1.94 kg m−2). These findings support the hypothesis stated above that early response to medication, which affects metabolic and other processes, are related to predisposition for subsequent relapse. The result is of particular importance for the present study, as patients were inpatients between T0 and T6 and non-compliance could be ruled out by therapeutic drug monitoring. The differential early molecular response to medication may open up the possibility for the development of molecular tests that allow assessment of states indicative of relapse and identify those subjects who may require a change in treatment strategy.

Figure 4
figure 4

Differential early response to treatment (between T0 and T6) of leptin, insulin and C-peptide between the short- and long-term relapse groups. The figure shows the difference in the change of levels between T0 and T6 in serum from patients in the two different relapse groups.

As a final step of stage 4, we investigated the molecular state in samples taken from subjects at the time of relapse. This revealed that 12 serum molecules had significantly different concentrations between the two relapse groups (Table 5). Again, abnormalities were pronounced for leptin, insulin, proinsulin and C-peptide, consistent with differences in BMI between the relapse groups and the suggestion of altered insulin signalling as seen by an increased insulin/glucose ratio.

Table 5 Significant molecular differences at the time of relapse between the short- and long-term relapse groups

Conclusion

Non-compliance is a serious issue in the treatment of psychiatric conditions, such as schizophrenia given that this is the most common cause leading to psychotic relapse. This is the first study to identify serum molecules that may be indicative of relapse. Further development of assays targeting these specific molecules could lead to the development of useful tests to guide the clinical management of schizophrenia. As these molecular signatures may be dynamic and change over the course of treatment, we also investigated the change in molecular levels at the initiation of treatment up to and including the time of relapse. The results suggested that molecular processes associated with weight gain and metabolism changes have a central role in re-occurrence of psychotic symptoms. Specifically, a lower BMI and non-response of the molecular signature to treatment was associated with a shorter time to relapse. The identification of biomarkers, such as the insulin-related molecules proinsulin, C-peptide and insulin, along with changes in the hormone leptin, is consistent with previous studies showing changes in these molecules in first onset antipsychotic naive schizophrenia patients.18, 29, 30 We have also identified molecular signatures associated with symptom severity and response. However, the prediction of positive symptom improvement was of limited value due to the high proportion of patients in this study who showed an improvement in response to treatment.

Our results extend our previous findings that schizophrenia is characterized by a reproducible, peripheral signature. Further testing of these findings in validation studies using larger numbers of subjects in prospective clinical trials are required for the development of molecular tools, which could be useful in guiding the clinical management of schizophrenia. This could be particularly important for early monitoring of response to ensure a better clinical outcome in schizophrenia patients treated with antipsychotics. It may also provide an early indication for cases in which switching of antipsychotics may be required.

Limitations

The findings need to be validated in analyses of further, independent cohorts, in particular in the light of the regression towards the mean effect observed in this follow-up study. Also, the prediction of positive symptom improvement may be of limited value due to the high proportion of patients who improved after treatment. It will be important to validate these findings using a cohort of responsive and non-response patients. Finally, it is not clear whether any of the changes in biomarker profiles are involved in the mechanism of action of antipsychotics or if they are a consequence of changes that occur in the brain. Studies, which have found that hyper-insulinemia can affect cognition, provide some evidence that the response of some molecules could be mechanism based. However, further studies will be required to answer this question.