Introduction

Schizophrenia (SCZ) is a complex syndrome affecting 1% of the population worldwide. There are likely diverse abnormalities during and after development underlying the positive, negative, and cognitive symptoms that characterize the illness. However, immune dysfunction and chronic inflammation have been of major interest in this regard [1]. Post-mortem morphological evidence, including loss of white matter and cortical gray matter without observed cell death [2], excessive synaptic pruning of mature or new dendritic spines in cortical pyramidal neurons [3, 4], and activation of microglia during adolescence and early adulthood [5], has led to the suggestion that these abnormalities may be due, in part, to immune system dysfunction. This may be the result of major histocompatibility complex (MHC) molecules (HLA I & II) mediated antigen-presentation and the complement-mediated classical pathway, producing microglial engulfment and auto-phagocytosis [6,7,8,9,10]. The hyperactivated microglia could be the major basis for excessive synaptic elimination [5, 11].

Functional implications of MHC genes in neuropsychiatric disorders, particularly schizophrenia, were strongly supported by genome-wide association studies (GWAS) which indicated that the extended MHC region (xMHC, a total of 7.6 Mb on the short arm of chromosome 6 with genetic coordinates between 25 M and 34 M) were the most significant and replicable genetic associations with SCZ [12,13,14]. A functional study showed that the level of C4A, a pivotal complement molecule, partially account for this association [7]. This landmark study integrated the genetic, autoimmune, and neurobiology (e.g., excessive synaptic pruning) theories of schizophrenia. It was proposed that overexpression of C4 in mouse prefrontal cortical neurons negatively affected dendritic spine development by the enhancement of microglia-induced synaptic engulfment and led to enhanced negative symptoms [15, 16]. Transcriptional imputation of SCZ GWAS summary statistics confirmed the increased expression of C4A, BTN3A2 from xMHC in several brain regions associated with schizophrenia, including dorsolateral prefrontal cortex (DLPFC) [7, 17].

There is some evidence that neuroinflammation has a significant impact on the course of schizophrenia. Significantly higher plasma levels of C3 and C4, as well as other acute-phase proteins have been reported in APD-treated compared to non-medicated SCZ patients [18]. A significant increase in cerebrospinal fluid C4 levels was observed in SCZ patients [19]. The HLA system has been linked to clinical response to the haloperidol [20] and chlorpromazine [21]. Although some single nucleotide polymorphisms (SNPs) at xMHC region predicted the treatment response to the atypical APDs, olanzapine, and risperidone [22], none of those markers have been linked to risk for SCZ or have a functional impact on alteration of gene expression at the xMHC region. C4A or C4B expression, imputed by copy number analysis of long and short forms of C4A and C4B, does not contribute to the risk and severity of tardive dyskinesia in SCZ patients [23]. There is a paucity of studies which have examined the role of C4 in treatment response to APDs in schizophrenia patients or other populations [24].

The purpose of this study was to identify common variants contributing to the treatment response to APDs in chronic schizophrenic patients in a candidate gene study of the xMHC region. The primary endpoint was the change in the subscales of the Brief Psychiatric Rating Scale (BPRS) at 6-week or 6-month after treatment with atypical APDs, including clozapine, olanzapine, risperidone, and lurasidone. A secondary fine-mapping approach was conducted to prioritize the potential causal genes associated with improvement in psychotic symptoms.

Methods

Subjects and clinical evaluation of treatment response

A structured interview, the Schedule for Affective Disorders and Schizophrenia [25], or the Structured Clinical Interview for Schizophrenia for DSM III or IV [26], provided the basis for diagnosis. This was integrated with all available data to make the final diagnosis by consensus according to the Diagnostic and Statistical Manual of Mental Disorders, third edition (DSM-III) criteria at discharge. A review of these diagnoses indicates all patients met DSM-IV criteria for schizophrenia or schizoaffective disorder. Data from patients with either diagnosis were combined. Over 75% of patients were unmedicated or had a drug-free period of 3–10 days prior to baseline assessment.

All participants were temporally split into discovery (n = 88) and replication (n = 42) cohorts. Demographics and clinical information for the discovery and replication cohorts were provided in Table 1. For quantitative variables, Shapiro–Wilk test confirmed the approximately normally distributed data for these variables (p > 0.05) and Welch’s t-test was conducted to compare the mean difference between two cohorts with an assumption of equal variance or unequal variance determined by Levene’s test; for categorical variables, Fisher’s exact test was conducted to examine the significance of the association between two kinds of classification. The subjects selected for GWAS had participated in a prospective clinical trial of the effect of atypical APDs. Diagnosis and classification as treatment-resistant schizophrenia (TRS) or non-treatment-resistant schizophrenia (NTRS) were based on severity of positive symptoms and poor functional outcome after two or more trials of APDs of usually adequate duration [27]. Patients whose psychotic symptom responded to atypical APDs other than clozapine were classified as NTRS. Determination of TRS or NTRS was based upon review of medical records for inpatient and out-patient treatment and assessment of response of positive symptoms to two or more trials with ADPs, including the current trial [28].

Table 1 Demographic, psychopathology, and genotype data for discovery and replication cohorts.

The discovery GWAS included 88 self-described Caucasian patients diagnosed with schizophrenia or schizoaffective disorder by DSM-IV criteria. They were recruited between 1999 and 2010 at clinical facilities associated with Departments of Psychiatry at Case Western Reserve University and Vanderbilt University. Most of these patients were initially hospitalized for an acute exacerbation of symptoms and failure to respond adequately to conventional or atypical antipsychotic drugs. The subjects selected for GWAS had participated in a clinical trial (NCT00539071, NCT00179062) or prospective longitudinal studies of the effect of clozapine (n = 60), olanzapine (n = 15), or risperidone (13). 62.5% patients in the discovery cohort were classified as TRS [29]. An additional 42 subjects with were studied at Vanderbilt University and Northwestern University [30]. The subjects selected for GWAS had participated in a clinical trial (NCT01569659, NCT00179062) or prospective longitudinal study of the effect of lurasidone (n = 14), risperidone (n = 19), olanzapine (n = 6), clozapine (n = 1), or ziprasidone (n = 2).

Here we use the classical 18-item Brief Psychiatric Rating Scale (BPRS) with 0 to 6 scaling for each item. All ratings were conducted by trained neuropsychiatric technicians who were blind to the hypothesis of this study. The BPRS positive symptom subscale, BPSY, includes assessment of suspiciousness, hallucinatory behavior, and unusual thought content. The BPRS negative subscale, WR, which stands for BPRS negative subscale, is comprised of three items: emotional withdrawal, motor retardation, and blunted affect. Quantitative treatment response was evaluated at 6-week and 6-month, using the change in ∆BPSY or ∆WR. Only patients with moderate to severe psychosis (baseline BPSY ≥ 6) were included in the analysis. Different cutoff values for BPSY had been implemented to avoid selection bias, and BPSY ≥ 6 gave the strongest signal compared to BPSY ≥ 0, ≥2, ≥4, and ≥8 (see the Supplementary Table 1 for effect size and power analysis), with 88 out of previously reported 174 patients [29] included in the discovery cohort. Therefore, the same baseline cutoff values were applied to the replication cohort. 42 out of 71 patients were included.

After a description of the study, written informed consent was obtained from every subject. All patients provided written informed consent to remain drug-free during the assessment. The drug-free period was terminated if patient well-being required it. Some were not receiving psychotropic drugs prior to admission because of non-compliance. This study was approved by institutional review boards from Case Western Reserve University, Vanderbilt University, and Northwestern University.

Quality control of genotyping data and association testing

Genome-wide SNP genotyping was performed using Illumina 610 K quad BeadChip® for discovery cohort [29] or Illumina PsychArray® for replication cohort [30].

DATA QC was conducted to exclude samples with minor allele frequency (MAF) < 0.05, genotyping rate per SNP < 0.95, and significant deviation from Hardy–Weinberg equilibrium (p < 0.0001). There were 491,932 (Discovery) and 272,589 (Replication) SNPs available for further analysis. Total genotyping rate in the remaining individuals was >99.96%.

Regional SNP imputation for xMHC (Chr6: 25 M to 34 M) was conducted by IMPUTE2 using 1000 Genome Project (April 2014) as reference panel. Briefly, after prephasing by SHAPEIT2 using 37 macGT1 data as reference, we conducted a stepwise imputation in 5-Mb segment using Quest High Performance Computing Cluster. The imputed data was finally converted to PLINK format by GTOOL. SNPs with imputation quality core >0.9 were used in the following association testing. All cases were considered as unrelated individuals (PiHAT = 0.20 as a cutoff) based on the pairwise identity-by-descent (IBD). All patients (discovery and replication) recruited in this study were self-described Caucasians which was verified by Principal Component Analysis (PCA) [29, 30].

The association testing was conducted by PLINK 1.9. The primary endpoint was change in psychotic (ΔBPSY) or negative (ΔWR) symptoms of BPRS at 6-week and 6-month after treatment with APDs. Linear regression in an additive model of minor alleles, adjusted for covariates, first three principal components (PC1-3), gender, and drug, was utilized. TRS status was tested as a covariate in the linear regression model but had no significant impact on the association for the top variants. Therefore, it was not included as a covariate in the summary statistics. False discovery rate (FDR) corrections for multiple testing were calculated using the Benjamin and Hochberg (BH) procedure for the regional association testing.

Mapping cis eQTL in disease-related tissues was conducted by three online portals, LIBD eQTL browser (http://eqtl.brainseq.org/) [31], GTEx Portal (https://gtexportal.org/home/), and scanDB (http://scandb.org/newinterface/about.html). We arbitrarily considered p < 4 × 10−5 as the cutoff for cis-eQTL, providing >25,000 genes which could be investigated.

Transcriptome imputation and gene-based association testing was conducted by PrediXcan [32] using the PredictDB, GTEx-V7_HapMap-2017-11-29.tar.gz, solely from subjects of European ancestry (EUR) to impute gene expression at xMHC (Chr6: 25 M to 34 M). The dosage file for our samples (Discovery and Replication) was created by PLINK. Linear regression adjusted for covariates such as first three principal components(PC1-3), gender, and drug, was utilized to determine a gene-level association from imputed expression to ΔBPSY or ΔWR. A raw p-value was calculated. The Benjamini–Hochberg procedure was conducted to adjust the false discovery rate (BH-FDR) from each brain tissue and the q value <0.1 was considered significant.

Classical MHC I & II and amino-acid polymorphisms were imputed by SNP2HLA (version 1.02) [33] using T1DGC_REF from The Broad Institute as reference and association testing was performed to determine if the candidate HLA alleles, previously reported to increase risk for SCZ, also contributed to the variation in treatment response/resistance.

Finally, Targets Genetics (www.OpenTargets.org), a PheWAS portal, was employed to prioritize the causal variants with functional impact and explore their pleiotropy if any.

See Supplementary material for additional description of the methods. The summary statistics of our GWAS may be shared with the third party upon execution of data sharing agreement for reasonable requests. The analysis codes are available upon request.

Results

Table 1 summarized the demographic, psychopathology (baseline and change after 6wk or 6mon), as well as microarray information for discovery (n = 88) and replication (n = 42) cohorts, respectively. It was noted that baseline, as well as the improvement in psychotic symptoms at 6-week in the replication cohort, were significantly lower than those in the discovery cohort; the type of APDs was also quite different. These factors contributed to reducing the power of replication of the top findings identified in the discovery cohort.

The initial attempt to identify genetic variants associated with improvement in psychotic symptoms (∆BPSY) at 6-week after treatment with APDs was made by GWAS in a cohort of chronic schizophrenic patients of European ancestry (EUR) with persistent psychotic episodes (n = 88). Manhattan and QQ plots were shown in Supplementary Fig. 2. No genome-wide significant loci were identified after a linear regression adjusted for covariates, suggesting the complexity of the trait and that this GWAS was underpowered because of small sample size. Nevertheless, rs204991 (genotyped, but not imputed, β = 3.917, p = 3.72 × 10−6), was the 2nd genome-wide strongest signal. It, and other weaker signals, was noteworthy because of its location near C4A/C4B. TRS status was tested as a covariate in the linear regression model but has no significant impact on the association for the top variants (for rs204991, p = 0.64; for rs6904596, p = 0.91). The medication was not the significant covariate in the genotype-phenotype association for the top hit, rs204991 (p = 0.488 and p = 0.193 in both discovery and replication cohorts). However, we retained it as a covariate in the full regression model.

The subsequent fine-mapping by a candidate region approach (Fig. 1) was conducted after regional SNP imputation in order to identify causal variants or genes from xMHC associated with treatment response to APDs. SNP imputation disclosed much stronger signals with β = 5.140, plowest = 1.57 × 10−7 for rs9268469 and other weaker signals (p < 1 × 10−5) spanning the entire xMHC region.

Fig. 1: Overview of the strategy for the fine-mapping causal variants/genes from xMHC region in association with treatment response to APDs.
figure 1

ΔBPSY, change in positive symptom subscale including suspiciousness, hallucinatory behavior, and unusual thought content; eQTL, expression quantitative trait loci; EUR, European ancestry; HWE, Hardy–Weinberg equilibrium; INFO, an information score, which takes a value between 0 and 1, reported by IMPUTE2, with a value near 1 indicating high certainty of imputed genotype; MAF, minor allele frequency; PCA, principal component analysis; QC, quality control; PheWAS, phenom-wide association studies; ΔWR, change in withdrawal symptom subscale which is comprised of three items: emotional withdrawal, motor retardation and blunted affect.

The top signals associated with improvement in psychotic symptoms at 6-week were labeled in the regional association plot (Fig. 2A) and listed in Table 2 after LD-based clumping. We also listed the results from two genotyped SNPs, rs6904596 and rs7775397 which were in LD with two imputed SNPs, rs150353632 and rs59134830. The association of top SNPs with symptom improvement at 6-month was tested. Only SNPs (rs204991, β = 2.57, p = 0.006, Fig. 2B) near C4A, but not those closest to BTN3A2 (rs6904596, β = 0.9, p = 0.309, Fig. 2C) showed a significant association (p < 0.01) but with smaller effect size (β).

Fig. 2: The genetic variants at xMHC were associated with improvement in psychotic symptoms at 6-week after treatment with APDs.
figure 2

A Common SNPs (maf > 0.05) within xMHC region were genotyped or imputed to 1000 Genome Project reference panel (April 2014) with high confidence (INFO > 0.90). The linear regression adjusted for the covariates including gender, drug, and PC 1–3 derived from the initial GWAS were conducted by PLINK to determine the association between SNPs and treatment response to APDs. The regional association plot was created by LocusZoom. Purple diamond indicated the most significant finding at this region. Chromosomal positions and LD were based on hg19/1000 Genome Project (April 2014) EUR. Colors represent LD (r2) with the top SNP, rs9268469. Based on the LD pattern, the top signals at xMHC can be partitioned into three blocks. Estimated recombination rate (light blue line) was plotted on the right y-axis. The top signals of real-typed SNPs from the representative LD blocks were labeled in red font. The epistasis testing of a set of SNPs with p < 10−4 in the linear regression model were conducted and the top SNP×SNP was the interaction between rs204991 and rs3132541 (BETA_INT = 5.24, p = 0.019). B, C The boxplots showed the distribution of ∆BPSY at 6-week or 6-month stratified by each genotype of rs204991 (B) and rs6904596 (C). Minor allele (G) carriers of rs204991 having decreased risk for SCZ and lower expression of C4A had a better response to clozapine in BPSY at both 6-week and 6-month after treatment with APDs; minor allele (A) carriers of rs6904596 having decreased risk for SCZ had a better response to clozapine in BPSY only at 6-week after treatment with APDs.

Table 2 Summary of the top genetic variants or genes at xMHC regions in association with symptom improvement.

Conditional linear regression analysis to identify independent signals

Based on the LD pattern between rs9268469 and other weaker signals with PFDR-BH < 0.05 (Fig. 2A), we partitioned the regional association plot into 3 blocks, block 1 (r2 < 0.2), block 3 (0.2 < r2 < 0.4), and block 2 (r2 > 0.4). In order to identify the independent effects, three representative SNPs which showed high LD (rs2240991, r2 > 0.8), intermediate LD (rs3132541, 0.2 < r2 < 0.4), and poor LD (rs6904596, r2 < 0.2), with rs9268469, from the corresponding three subregions, were selected as the fixed effect and added to the original linear regression model (Supplementary Fig. 3). The strength of the association of rs9268469 with treatment response in BPSY was attenuated from β = 5.14 (p = 1.57 × 10−7) to β = 3.99 (p = 0.001) after adjusting for rs3132541 (Supplementary Fig. 3A), suggesting the associations from rs3132541 and that from rs9268469 were not completely independent from each other. This was also found with rs6904596 (β decreased from 3.36 to 1.79; p changed from 5.13 × 10−5 to 0.06) and others in block one (Supplementary Fig. 3C) which spanned Histones and BTN3A2. The strength of the association of SNPs in block one or block two were only slightly changed after adjusting for rs6904596 (for rs9268469, β changed from 5.14 to 4.13, p changed from 1.57 × 10−7 to 1.09 × 10−4; for rs204991, β changed from 3.92 to 3.02, p changed from 3.72 × 10−6 to 4.44 × 10−4) respectively, suggesting that the top signals from block one were independent of those from block two. However, they both show significant interaction with signals from block three, suggesting SNP×SNP interactions.

Summary of the epistasis among the top 22 variants reported with the association p value <1 × 10−5 was provided in Supplementary Fig. 4A. We added β3gsnp1gsnp2 into the original linear regression model, Y = β0 + β1gsnp1 + β2gsnp2 + βcovar1Covar1 + … for each inspected variant pair (SNP1, SNP2). The eQTL analysis from LIBD eQTL Browser (Supplementary Fig. 4B) indicated that rs3132541 and rs886423 have significant impact on the gene expression of HLA-C in DLPFC. Given the epistasis between rs3132541 and rs204991 (p = 0.019) or rs886423 and rs204991 (p = 0.033), the genetic evidence supported that there is a significant interaction between HLA-C and C4A. It was noted that all these variants show genome-wide significant risk for SCZ according to PGCGWAS on SCZ (Table 2A).

None of the top hits showed association with negative symptom improvement. An independent cohort (n = 42) validated the genetic association (rs9268409 and rs7775397) at C4A (p < 0.05) with improvement in psychotic symptoms in the same direction for the minor allele.

eQTL analysis of the top variants in disease-related tissues

The boxplots from LIBD eQTL Browser on the left indicate the significant impact of top variants in block two on the gene expression of C4A in DLPFC (Supplementary Fig. 5A–C). The barplots on the right represent the significant impact of those variants on expression of C4A in multiple brain tissues and whole blood (Supplementary Fig. 5D, E). rs9268469 was a strong eQTL for C4A in those tissues (p = 5.5 × 10−20 in whole blood; p = 1.5 × 10−8 in hippocampus) according to GTEx. The boxplots in Fig. 3B showed the distribution of ∆BPSY at 6-week or 6-month stratified by each genotype of rs204991 (A) and rs6904596 (B). Minor allele (G) carriers of rs204991 have decreased risk for SCZ, lower expression of C4A, and a better response in BPSY at both 6-week and 6-month after treatment with APDs. Minor allele (A) carriers of rs6904596 have decreased risk for SCZ and a better response in BPSY only at 6-week after treatment with APDs.

Fig. 3: The forest plot illustrated the inverse association between the imputed expression level of C4A or BTN3A2 and treatment response (∆BPSY at 6-week) to atypical APDs in several brain tissues using PrediXcan.
figure 3

Transcriptome prediction model was GTEx-V7 (EUR only) from PredictDB based on elastic net; xMHC (Chr6: 25 M to 34 M) was the target region. Dosage file for our samples was created by PLINK. PrediXcan was run at Python 2.7. The estimates of effect size (β) with 95% confidence interval in discovery and replicate datasets were present, grouped by the brain region. The significance of this association was present as raw. The genes predicted to have the most significant impact on improvement in psychotic symptoms at 6-week after the treatment with APDs were ranked in each brain tissue. C4A (left) and BTN3A2 (right) which represents one of the genes from block 1 and 2, has been frequently ranked to the top in schizophrenia-related tissues such as frontal cortex, hypothalamus, and basal ganglia (Caudate and Putamen). C4A level could not be successfully imputed using the transcriptomic data from “brain_basal_ganglia_nucleus_accumbens”. The result from Nucleus Accumbens was not included.

Transcriptome imputation and gene-based analysis

Prediction of gene expression in a variety of brain tissues based on the genetic architecture at xMHC was conducted to correlate imputed gene expression with ∆BPSY at 6-week using PrediXcan. The expression levels of nearly 29 to 37 genes from the xMHC were imputed; those numbers varied depending on brain tissue. The genes predicted to have the most significant impact on symptom improvement in BPSY were ranked in each brain tissue (Fig. 3). C4A and BTN3A2 from the corresponding block 1 and 2, have been frequently ranked at the top in some schizophrenia-relevant brain regions such as frontal cortex, basal ganglia, and hypothalamus. According to the forest plot (Fig. 3), patients predicted to have a higher level of C4A or BTN3A2 showed poor response to APDs, as indicated by the negative value of β. Significant associations were identified in frontal cortex and hypothalamus.

HLA imputation and association testing

The most significant association between HLA alleles and ∆BPSY at 6-week is HLA_DRB1_0301 (p = 3.732 × 10−5). When including the top HLA allele reported in earlier PGC GWAS 2009 for the risk of SCZ, the presence of those HLA alleles which decreased risk for SCZ have significant positive association with improvement in psychotic symptoms at 6-week (Supplementary Table 2). HLA_DQA1_0501 is the only one not showing contribution to the treatment response. Some of those associations were also replicated (HLA_DRB1_0301, p = 0.023) or showed a trend of replication (HLA_DQB1_0201, p = 0.074) in the 2nd cohort. This is the first evidence for a connection between these SCZ risk-associated HLA alleles and treatment outcome in a subgroup of chronic schizophrenic patients with persistent moderate to severe positive symptoms.

Conclusion

The purpose of this study was to identify common variants contributing to treatment response to APDs in chronic schizophrenic patients with persistent psychotic episodes using xMHC fine-mapping approach. Although no genome-wide significant locus was identified after linear regression adjusted for covariates (n = 88), rs204991 (p = 3.72 × 10−6), located near C4A/C4B, was among the top-tier signals. Regional SNP imputation disclosed even stronger signals with the lowest p = 1.57 × 10−7 for rs9268469 and other weaker signals (p < 1E−5) spanning the entire xMHC region. All these variants show genome-wide significant risk for SCZ. The subsequent LD-based conditional fine-mapping revealed three subgroups of SNPs which were found to be eQTLs (p < 1 × 10−7) for C4A, HLA-C, and BTN3A2 in disease-associated tissue such as DLPFC, basal ganglia, and whole blood. Genetic evidence of epistasis between HLA-C and C4A was observed (p = 0.019). Minor allele (G) carriers of rs204991, eQTL for C4A, who had decreased risk for SCZ and lower expression of C4A, had a better response to APDs at both 6-week and 6-month; Minor allele (A) carriers of rs6904596, eQTL for BTN3A2, who had decreased risk for SCZ have a better response to APDs only at 6-week. In line with these findings, HLA imputation showed the presence of some HLA alleles which decrease the risk for schizophrenia [12] have a positive association with improvement in psychotic symptoms. An independent cohort (n = 42) validated the genetic association between response to APDs and the C4A expression. These top SNPs associated with treatment response in schizophrenia were also related to several autoimmune-related diseases (e.g., coeliac disease, hyperthyroidism/thyrotoxicosis). Neuroinflammation could be a part of systemic dysfunction of immune system in SCZ. A recent meta-analysis confirmed an overall positive association between non-neurological autoimmune disorders and psychosis with a larger effect size (odds ratio) in pernicious anemia, pemphigoid, psoriasis, celiac disease, and Graves’s disease [34]. Prolonged inflammatory/immune response may contribute to treatment resistance in some schizophrenic patients [24, 35, 36]. Together, we provide evidence that some genetic risk factors for SCZ in the xMHC region are associated with treatment response/resistance to APDs and those variants significantly alter the gene expression of C4A, HLA molecules (HLA-C), and other immune-related genes (BTN3A2).

Linking the genetic risk for disease to treatment outcome or other intermediate phenotypes

The initial attempts to map SCZ risk loci to the MHC region were first reported in 1974 [20]. Since then, many reports provide additional evidence [37] but the definitive evidence for MHC involvement in SCZ was the result of three GWAS of SCZ patients with EUR descent published in 2009 [12,13,14]. One of the most significant association signals, rs13194053 with p = 9.54 × 10−9, was located at the extended MHC region. Others including rs6932590 (p = 1.4 × 10−12), rs1635 (p = 7 × 10−12), rs2523722 (p = 2.88 × 10−16), rs2021722 (p = 4.3 × 10−11), rs886424 (p = 4.54 × 10−8), rs3131296 (p = 2.3 × 10−10), and rs9272219 (p = 6.88 × 10−8) [9, 38, 39]. It was noted that all these SNPs showed association with improvement in psychotic symptoms in this study, e.g., rs13194053 (β = 2.065, p = 0.012); rs6932590 (β = 1.225, p = 0.056); rs1635 (identified from Han Chinese); rs2523722 (β = 0.761, p = 0.047); rs2021722 (β = 1.536, p = 0.047); rs3131296 (β = 4.726, p = 3.11 × 10−5) and rs9272219 (β = 1.256, p = 0.060). rs3131296 was closest to C4A and had significant impact on the gene expression of C4A (DLPFC p = 2.01 × 10−18). These genetic risk variants were associated with brain structure and cognition [38, 40]. Six SNPs show nominally significant association with one or multiple domains of cognitive function. The (G) allele carriers of rs6904071, which were associated with increased risk for SCZ, poor delayed episodic memory, and decreased hippocampal volume, showed poor response to APDs in our study (Supplementary Table 3). Other SNPs also showed the association with treatment response in the same direction.

Chronic neuroinflammation may determine the patient outcome after treatment with APDs

According to the PheWAS report from UK Biobank (Supplementary Table 4), the top SNPs associated with treatment response in this study were related to several autoimmune-related diseases (e.g., coeliac disease, hyperthyroidism/thyrotoxicosis). Those SNPs were linked to many autoimmune diseases unrelated to neuropsychiatric phenotypes (Supplementary Table 5). The neuroinflammation could be a part of systemic dysfunction of immune system. Thus, we suggest as have others [34, 41] screening for some autoimmune disorders (autoantibodies) in patients who do not response to APDs.

Multiple lines of evidence support the relationship between neuroinflammation and schizophrenia. Polygenic risk scores for SCZ were higher in patients with early-life complications (ELCs) [42]. Genes whose expression was modified by ELCs were involved in regulation of oxidative and cellular stress as well as inflammation. A meta-analysis of RNAseq and array-based transcriptome study of lymphoblastoid cell lines derived from schizophrenia cases and controls have indicated immune-related genes as the top-ranked for differential expression [43].

Our study was based on a candidate region, followed by LD-based conditional association testing, which was extended to eQTL analysis, transcriptomic imputation and association, HLA imputation and association, and PheWAS. These efforts confirmed the association between the gene expression and response to APD treatment, but also prioritized the causal SNPs/Genes. It is, thus, quite noteworthy that C4A related SNPs had the most significant and replicable association with APD response. This suggests that chronic neuroinflammation not only contributes to the risk for development of psychosis, but also is important for predicting some types of improvement with atypical APD treatment. This finding also provides additional impetus for further study of anti-neuroinflammatory drugs as adjuvants to atypical APDs [44, 45].

The contribution of the MHC to negative symptom improvement was not supported by this study, although there was some evidence that chronic neuroinflammation may contribute to negative symptoms [46].

It is noteworthy that the C4 and HLA-related SNPs which predicted response in this study did not predict treatment response in acutely psychotic, NTRS patients who were included in double-blind registration trials of lurasidone [47, 48]. This suggests that the genetic predisposition for immune activation in schizophrenia may be most important for SCZ patients with chronic psychosis [49]. Only some brain regions in SCZ may be vulnerable to inflammation. Expression of inflammatory genes may be confined to specific brain region, as a transcriptomic study found that abnormal immune/inflammatory responses were limited to the hippocampus [50]. The hippocampal regional gene expression showed above suggests neuroinflammation may be important for the neurocognitive deficits associated with schizophrenia. Another study focusing on inflammatory genes in psychotic patients under or over 40 years of age found a difference in gene expression only in older patients [51]. The decreased expression levels of altered inflammatory genes in DLPFC in post-mortem specimens from aged SCZ correlated with the microglial marker CD68 [52]. This supports the idea of a dysfunction of these processes in aged patients and a possible relationship with active microglia abundance.

C4A, synaptic plasticity, and effect of antipsychotic drugs

C4A copy number variation is associated with synaptic pruning. C4-deficient mice have shown decreased synaptic pruning [7]. Increased imputed C4A mRNA levels predicted poorer performance on memory recall and reduced cortical activity in middle temporal cortex during a measure of visual processing, suggesting that there is a positive correlation between predicted C4A transcription and impairment in memory [53]. A recent study on the changes in neuronal membrane expansion and contraction within the neuropil by phosphorus magnetic resonance spectroscopy has shown that C4A copy number positively correlated with neuropil contraction in the DLPFC and thalamus of adolescent-onset SCZ patients [54].

In addition, other immune-related molecules such as certain HLA I and II alleles, members of immunoglobulin superfamily, participate in the C4-mediated classical pathway, which is actively involved in activity-dependent synaptic remodeling and plasticity [55]. Identification of genetic variants which altered the expression of those molecules in this study suggests the activated autoimmune-mediated processes contribute to resistance to APDs.

Limitations

The individual genetic associations reported here need to be validated by independent studies with larger sample sizes, similar open trial design and patient cohort, conducted by other investigators. Given the main effect of βG (~3.83 for top markers like rs204991), a type 1 error rate of 1 × 10−4 for nominal significance with two-sided test, on the continuous trait with mean ± SD of ∆BPSY as 2.98 ± 3.89, we conducted a power test using QUANTO. Our sample size of 88 had >80% power to identify a significant association when MAF was equal to 0.136. Our study had limited power to test the association in the small replication cohort (n = 42). The levels of C4 and other proinflammatory cytokines in plasma or cerebrospinal fluid was not tested during or after the trial. We did not examine comorbidity information related to autoimmune or infectious diseases. Finally, our study was based exclusively on subjects of European ancestry. We did include a diversity of chronic schizophrenic patients with regard to treatment resistance status and current exacerbation of positive symptoms.

The major finding of this manuscript indicated that the genetic liability (i.e., xMHC region) to schizophrenia also contribute to the risk for treatment response to antipsychotic agents in a cohort of chronic schizophrenic patients with persistent positive symptom. This study highlights the importance of personalized treatment of schizophrenic patients, particularly for the subgroup of patients who might developed the treatment-resistant phenotype due to the persistent neuroinflammation, and further emphasizes the target treatment of this neuroinflammation as well as potential inflammation-related comorbidities. Although a recent prediction study by leveraging ‘real-world’ EHR data reported a poor prognostic value of SCZ polygenic risk score (PRS) for adults with psychosis, we hypothesize that this PRS may have a potential value in outcome prediction in a subgroup of patients of the similar kind as we reported in this study.

In conclusion, we demonstrate for the first time that genetic variants which predict higher C4A levels, an indicator of inflammation, are associated with less improvement in psychotic symptoms during treatment with APDs.