Skip to main content

Thank you for visiting You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

Generating testable hypotheses for schizophrenia and rheumatoid arthritis pathogenesis by integrating epidemiological, genomic, and protein interaction data


Patients with schizophrenia and their relatives have reduced prevalence of rheumatoid arthritis. Schizophrenia and rheumatoid arthritis genome-wide association studies also indicate negative genetic correlations, suggesting that there may be shared pathogenesis at the DNA level or downstream. A portion of the inverse prevalence could be attributed to pleiotropy, i.e., variants of a single nucleotide polymorphism that could confer differential risk for these disorders. To study the basis for such an interrelationship, we initially compared lists of single nucleotide polymorphisms with significant genetic associations (p < 1e-8) for schizophrenia or rheumatoid arthritis, evaluating patterns of linkage disequilibrium and apparent pleiotropic risk profiles. Single nucleotide polymorphisms that conferred risk for both schizophrenia and rheumatoid arthritis were localized solely to the extended HLA region. Among single nucleotide polymorphisms that conferred differential risk for schizophrenia and rheumatoid arthritis, the majority were localized to HLA-B, TNXB, NOTCH4, HLA-C, HCP5, MICB, PSORS1C1, and C6orf10; published functional data indicate that HLA-B and HLA-C have the most plausible pathogenic roles in both disorders. Interactomes of these eight genes were constructed from protein–protein interaction information using publicly available databases and novel computational predictions. The genes harboring apparently pleiotropic single nucleotide polymorphisms are closely connected to rheumatoid arthritis and schizophrenia associated genes through common interacting partners. A separate and independent analysis of the interactomes of rheumatoid arthritis and schizophrenia genes showed a significant overlap between the two interactomes and that they share several common pathways, motivating functional studies suggesting a relationship in the pathogenesis of schizophrenia/rheumatoid arthritis.


Schizophrenia (SZ) is a severe psychiatric disorder of unknown etiology with a lifetime risk of approximately 1%. The heritability of SZ, estimated at ~70%, is best explained by a multi-factorial polygenic threshold model (MFPT) that invokes multiple genetic risk factors modified by the environment.1 Ongoing genome-wide association studies (GWAS) of SZ indicate over 100 SNPs and relatively rare mutations of variable effect sizes, but for majority of the loci the primary risk variants and the functions are uncertain.2 Further clarity may be gained by an integrative analyses of genomic and proteomic data.3

Rheumatoid arthritis (RA) is an autoimmune disease that causes inflammation of the small joints in the hand and feet with a prevalence of approximately 1%. Like SZ, the etiology of RA also is best explained by an MFPT model.4 Several studies over the past five decades indicate an inverse prevalence of RA and SZ, i.e., individuals with SZ are less likely to be diagnosed with RA, and vice versa. 5,6,7 Children and siblings of individuals with SZ show reduced risk of seronegative RA,8 though some reports differ.5 The inverse prevalence of RA in patients with SZ and the reduced familial risk of RA raise the possibility that there are shared pathogenic processes for these disorders. A portion of the genetic risk factors could even have pleiotropic effects, i.e., one allele confers risk for SZ, while another variant of the same polymorphism elevates risk for RA. The latter possibility was initially tested in a relatively small under-powered candidate gene studies and GWAS analyses that did not provide any supportive evidence.9 Recently, an analysis of larger datasets indicated a small but statistically significant negative correlation between SZ and RA, with stronger effect for SNPs localized to coding and regulatory regions (correlation of −0.046 and −0.174, respectively); these analyses are supported by genetic correlations of a similar magnitude even with GWAS score statistics in lieu of individual-level genotype data.10 Thus, it may be worthwhile to search for individual genetic risk factors with pleiotropic effects on SZ/RA.

In a broader framework, the epidemiologic data also motivate a search for shared pathogenic processes. Interactome analysis is a useful approach for discovering novel functional associations of genes. In the present study, we sought individual SNPs with plausible pleiotropic risks and explored their functions through in silico analyses, including interactome analyses, to search for links to shared pathology. We identified several leads that could motivate further hypothesis-driven functional studies.


To study the molecular and genetic interrelation between rheumatoid arthritis and schizophrenia, we carried out two parallel and independent analyses. The first one was to identify the SNPs that potentially have pleiotropic effects, and the second one was to study the protein interactomes of the genes associated with the two diseases. For the parallel (independent) interactome analyses, we computed novel protein–protein interactions (PPIs) using high-confidence protein–protein interaction prediction (HiPPIP) method that we previously developed.11 Additionally, to accelerate future studies of functional mechanisms of genes containing pleotropic SNPs, we also computed their novel PPIs.

Identification of risk SNPs with apparent pleiotropic effects

The following cascaded analysis of SNPs associated with RA and SZ was carried out to identify pairs of SNPs with strong linkage disequilibrium (LD), such that they conferred risk in opposite directions for the two disorders. A search for pairs of SNPs (r, z) where r is significantly associated with RA and z with SZ, and r and z are in strong LD, yielded 1376 pairs of SNPs; all of which were located in the extended HLA region in chromosome 6p. Of these, 46 pairs were identified as having alleles that conferred risk in opposite directions for RA and SZ (Supplementary File 1). Of these, 29 pairs consisted of SNPs with putative pleiotropic effects (i.e, r and z were found to refer to the same SNP but with opposing odds ratios in RA versus SZ GWAS), 18 of which were located within gene regions including exonic, intronic and flanking regions, and 11 were not within gene regions using these criteria (see Methods). These 18 SNPs were located to the following 8 genes: TNXB, HLA-B, HLA-C, NOTCH4, HCP5, MICB, PSORS1C1 and C6orf10 (Table 1). We refer to these 8 as Genes Associated with Putative Pleiotropic SNPs (GAPPS). Four of these genes contain SNPS with pleiotropic effects in their exonic regions: rs915894 in NOTCH4, rs2073045 in C6orf10, rs1050420 in HLA-C and rs709055, rs12721827, and rs1131163 in HLA-B.

Table 1 18 putative pleiotropic SNPs

Expression of GAPPS

After identifying the 8 GAPPS as described above, we analyzed their expression patterns in different tissues by searching public databases. We studied their expression first in the brain regions under the assumption that it is likely the site for key SZ pathology and then, we also evaluated expression in the following immune-related cells: Epstein Barr virus (EBV)-transformed lymphocytes, spleen, and whole blood. The following brain regions were evaluated: amygdala, anterior cingulate cortex, caudate, cerebellar hemisphere, cerebellum, cortex, frontal cortex, hippocampus, hypothalamus, nucleus accumbens, putamen, spinal cord cervical c1, and substantia nigra. Of the 8 GAPPS, HLA-C and HLA-B were consistently expressed in all brain regions, while the expression of TNXB, NOTCH4, MICB, PSORS1C1, and HCP5 was much lower. C6orf10 was not expressed in the brain. In EBV-transformed lymphocytes, spleen, and whole blood, the expression of HLA-C, HLA-B, and HCP5 has been noted consistently, whereas TNXB, NOTCH4, and MICB are less consistently expressed in all three tissues. PSORS1C1 has minimal expression in the lymphocytes and spleen and is not expressed in whole blood. C6orf10 is not expressed in any of the these tissues.12 Expression patterns were not used to further filter GAPPS.

Interactome analyses

Analysis of the network of PPIs (or “interactome”) can reveal higher level functional relations among genes, which may not be apparent by analyzing the genes in isolation.11 For example, we recently studied the relation between SZ-risk genes that were identified through GWAS and those that were considered to be associated with SZ in pre-GWAS era, and showed that although they shared only one common gene, they had several common interactors, and that their interactomes shared common functional pathways.11 We constructed the interactomes of genes associated with RA as identified through GWAS (“RA genes”, “RA interactome”),13 and genes associated with SZ through GWAS (“SZ genes”, “SZ interactome”),11, 14 and analyzed the interconnections between them. We also constructed the interactome of GAPPS and analyzed how closely they are connected to the RA and SZ genes. To construct the interactomes, we included previously known PPIs and also novel PPIs that were discovered using our HiPPIP model.11 The novel PPIs were shown to be highly accurate based on computational and experimental evaluations as described in our prior work.11

The RA interactome consists of 98 RA genes and 1960 interactors, connected by 598 novel PPIs and 2232 known PPIs. Similarly, the SZ interactome consists of 77 SZ genes and 968 interactors, connected by 365 novel PPIs and 814 known PPIs. There are 316 genes in common between the RA and SZ interactomes, including direct PPIs between 7 RA genes and 6 SZ genes (Fig. 1 and Supplementary File 2), which is statistically highly significant (p < 10−72, hypergeometric distribution test). A subset of the interactome that highlights the inter-connections between RA and SZ genes mediated by novel PPIs is shown in Fig. 1. A complete list of PPIs that connect SZ and RA genes either directly or through a single intermediate interactor to each other are given in Supplementary File 2. This also includes a number of novel PPIs of RA genes, whereas novel PPIs of SZ genes were presented in our earlier work described above.11 The PPIs and the genes are clearly labeled as novel or known and also by their membership in the two interactomes.

Fig. 1

Novel interactors that connect RA and SZ genes: RA genes (gold-colored nodes) and SZ genes (green nodes) connect to each other either directly or through intermediate interactors (red nodes). Novel PPIs predicted with HiPPIP are shown as red lines (“edges”) and known PPIs as blue edges. Their inter-connections mediated by genes that have at least one novel interaction are shown here, whereas all PPIs including those mediated by known interactors are given in Supplementary File 2. Novel interactions that connect RA genes with each other and SZ genes with each other are also shown

The interactome of the GAPPS (Fig. 2a) consisted of 8 GAPPS (dark blue nodes) connected to 33 novel interactors (red nodes) and 50 known interactors (light blue nodes) through 36 novel PPIs (red lines or “edges”) and 54 known PPIs (blue edges). One novel PPI shows that C6orf10 and HCP5 interact directly. Three of these genes had no known interactions except one PPI connecting HCP5 to APP, but we predicted 9 novel PPIs. Even for NOTCH4, HLA-B and HLA-C, which are well-studied genes with several known PPIs, we predicted additional novel PPIs for each. The GAPPS do not directly interact with any of the RA or SZ associated genes, but they connect to 32 RA genes (Fig. 2b: gold nodes) and 10 SZ genes (green nodes) through 27 common interactors (grey nodes).

Fig. 2

Interactome of genes associated with putative pleiotropic SNPs (GAPPS): a GAPPS interactomes: GAPPS (dark blue square nodes), novel interactors (red nodes) and known interactors (blue nodes), novel PPIs (red lines or “edges”) and known PPIs (blue edges). b The GAPPS interactome network is extended to show how its genes further interact with RA or SZ associated genes (gold and green nodes, respectively). GAPPS interactors that do not connect to RA or SZ genes are not shown here. c C4A interactomes, with legend same as in a

Next, we studied the pathways associated with the RA and SZ interactomes separately using Ingenuity Pathway Analysis® (IPA) suite ( There are several pathways common to both the interactomes (Supplementary File 1). The commonality arises from not only the shared genes between the interactomes, but also from additional genes that are exclusive to either interactome (see selected pathways in Table 2). To highlight this aspect, we show in Fig. 3, the top 30 pathways associated with SZ interactome and the number of genes that are associated with each pathway that are exclusive and common to the two interactomes. For example, the glucocorticoid receptor signaling pathway is associated with 33 proteins from both interactomes, 80 proteins exclusive to RA interactome, and 14 proteins exclusive to SZ interactome.

Table 2 Selected pathways and the genes associated with them from the RA and SZ interactomes
Fig. 3

Common pathways associated with SZ and RA gene interactomes: Pathways associated with the interactome are computed with Ingenuity Pathway Analysis, which shows not only the significance of the association of the pathway but also the genes within the interactome that are associated with that pathway. Pathways are computed separately for SZ and RA gene interactomes. Shown here are the top 30 pathways in the SZ interactomes, along with number of genes associated exclusively with SZ interactome (blue), exclusively with RA interactome (orange) and common to both (green)

In another analysis using NextBio suite of tools,15 we analyzed gene expression patterns in schizophrenia and rheumatoid arthritis, and found a negative correlation between them: i.e., some genes that are over-expressed in one disorder are under-expressed in the other. For example, the gene expression data from neurons of SZ patients and synovial tissues of RA patients showed that there are 58 genes upregulated in SZ and downregulated in RA, and 48 genes are downregulated in SZ and upregulated in RA. We found 5 datasets that show a similar relationship between these diseases (Supplementary File 4). Overall, there were 369 such genes with opposite expression in the two diseases. Of these, 101 were found to among the SZ and RA interactomes, and pathway analysis showed that they are related to chemokine receptor signaling, signaling of IL-15, IL-12, IL-2, and IL-6 and Natural Killer Cell Signaling pathways, among others. Seven of them were found to be novel interactors of the SZ genes (ALDH6A1, FBLN1, MYL9, NKG7, RGS1, SETBP1, and SYNGR1), and twenty were novel interactors of RA genes (ARHGAP6, BMP4, CADM3, CYR61, DIO3, EFEMP1, GMPR, GNPDA1, IRF9, KCNMA1, MGST3, MX1, NET1, PEX6, RAB31, RAP2A, SGCE, SPP1, UAP1, and ZFP36L1); of these, EFEMP1 and SPP1 are also known interactors of SZ. Some studies identified SPP1 as an RA-susceptibility gene and hence this novel interactor may have a role in RA pathogenesis. These interactome analyses further strengthen the link between rheumatoid arthritis and schizophrenia, and also highlight the functional significance of novel predicted interactors in the disease processes.

We used this 369 gene signature to query large scale perturbagen signatures (L1000 profiles) from the NIH’s Library of Integrated Network-based Cellular Signatures (LINCS— to identify small molecules that could be potentially therapeutic for SZ or RA. Among the top compounds were both known and investigational compounds used for RA (e.g. bortezomib) or SZ (e.g., trifluoperazine, ropinirole) (Supplementary File 5). By studying the expression signatures associated with gene knockdown (i.e., gene perturbagens) in LINCS database, we found that there are 150 genes whose knockdown signature correlates with the observed signature of differential expression. Of these 150, about one-third were included in the interactomes.


Mounting data supports the pathophysiological importance of neuroinflammation in SZ.16,17,18,19,20,21,22,23,24,25 SZ postmortem studies note activated microglia/macrophages,16, 20, 26 elevated expression of inflammatory markers in the prefrontal cortex neurons27 and vasculature,28 and autoantibodies against frontal,29 cingulate,29,30,31 hippocampal cortices,30 and glutamate receptors.32 Autoimmune disorders may elevate the risk for SZ both independently33, 34 and in combination with infections.33 A meta-analysis noted elevated peripheral blood inflammatory cytokines in SZ compared to healthy controls.35 Genome-wide association studies replicate the association of variants in the major histocompatibility complex region (where a large number of immune genes are located) with SZ.36,37,38,39 Non-steroidal anti-inflammatory drugs may reduce psychotic symptom severity.40,41,42 We noted anatomical dysconnections43 and increased neuropil pruning44 associated with peripheral inflammatory markers suggesting possible mechanisms through which inflammation may underlie schizophrenia pathogenesis. While there is ample evidence for such correlations, to our knowledge, there are no biological data that directly support the relation between SZ and RA. We have utilized an in silico cross-disciplinary approach to determine the possible focal points of such relation.

We initially identified GWAS SNPs to find those that show apparent pleiotropic effects on risk for SZ and RA based on the inverse epidemiological relationship between the two diseases and prior indicated negative correlations for genetic risk.8, 10 The SNP-based analyses identified 29 SNPs, of which 11 were localized to intergenic regions and 18 were localized to genes including their flanking regions. As one of the goals of our study was to identify genes with high probability of involvement in SZ pathology that could be investigated further in subsequent studies, we conducted additional in silico analyses of 18 SNPs within genes or their flanking regions (Table 1), recognizing that the SNPs in intergenic regions continue to be of interest and deserve functional analyses. Through the location of the 18 putative pleiotropic SNPs, we identified GAPPS and constructed their interactomes. In parallel, and independent of the hypotheses based on pleiotropic effects, we carried out interactome analyses of RA and SZ genes to find network and pathway relations between the two diseases. To our knowledge, such analyses relating SZ and RA have not been conducted before.

All the SNPs with putative pleiotropic effects were localized to the extended HLA region. Associations of SZ and RA in this region have been known for over four decades and have also been confirmed through GWAS.45, 46 As there is extensive LD in this gene-rich region and the risk attributable to individual variants is relatively small, it has been difficult to identify primary risk variants. Recently, a portion of the risk in the HLA region has been attributed to relatively frequent copy-number variation (CNV) spanning the C4A-C4B complement genes, and the gene expression analysis of post-mortem tissues from several brain regions indicated that the C4A gene is over expressed among patients with schizophrenia.47 These analyses also indicate additional, independently acting risk variants in the HLA region.47 Similarly, our analyses pointed to additional SNPs in HLA that should be explored further. To enable functional studies of C4A gene, we computed the interactome of C4A, and found that it has 6 novel interactors (Fig. 2c). C4A shares two common interactors (APC and ATF6B) with RA genes and one common interactor (APC) with SZ genes.

As the precise functional effects of the putative pleiotropic SNPs is unknown, we sought genes that might be impacted by such variation. Our analyses identified eight GAPP genes, of which HLA-B and HLA-C appear to have the greatest amount of published data relating to SZ or RA pathogenesis. Previous studies have shown genetic associations of HLA-B in both rheumatoid arthritis and schizophrenia. Allele HLA B27 has been associated with several types of arthritis disorders such as ankylosing spondylitis, reactive arthritis and psoriatic arthritis.48 Other studies have shown that similarity in certain regions of the HLA-B gene between the mother and the daughter give an increased risk of schizophrenia to the daughter.49 Aside from genetic associations, HLA-B may play a role in the two diseases by its involvement in the natural killer cell pathway. Natural killer cells (NKCs) are found to have low activity per cell in rheumatoid arthritis, as opposed to a high activity per cell found in schizophrenic patients.50, 51 However, other studies indicate that the lower activity is due to a lower number of circulating NKCs. NKC activity in schizophrenia patients, on the other hand, is elevated. Homozygosity in multiple alleles in the HLA-B gene is thought to lower NKC activity that has been associated with rheumatoid arthritis.52,53,54 The HLA-C protein is thought to act as a ligand for the killer immunoglobulin receptors found on NKC, thus acting as an NKC inhibitor and regulating their activity.55 Some polymorphisms in HLA-C are thought to cause a decreased risk of rheumatoid arthritis, while other studies indicate that polymorphisms in this gene cause an increased risk of schizophrenia.56, 57 The pleiotropic associations with HLA-C polymorphisms, the opposite activity levels of NKCs in the two diseases and their association with HLA-B, makes these genes prime candidates for further research. Among the remaining genes, MICB encodes for a stress-induced protein that is a ligand for the NKG2D Type 2 receptor, which is located on NKCs, CD8 alpha/beta T cells, and gamma/delta T cells. HCP5 denotes the HLA complex protein P5, and while located in the HLA region, it is not similar in sequence to the other HLA genes. It is more similar to the human endogenous retroviruses HERV-L and HERV-16. Tenascin XB (TNXB) is a member of the extracellular matrix glycoproteins and several genetic association studies have indicated TNXB SNPs as risk factors for SZ, but similar associations with RA have not been reported, to our knowledge. NOTCH4, a transmembrane protein mediating the Notch signaling pathway, regulates interactions between adjacent neurons. Multiple genetic association studies have associated NOTCH4 with schizophrenia and rheumatoid arthritis, but their functional implications are uncertain.58, 36

Using interactome analysis, we found that the eight GAPPS share common interacting partners with RA and SZ genes (Fig. 2b). It also showed that even though RA and SZ do not share common risk genes beyond the HLA region, they have 316 common interacting partners through PPIs (Fig. 1 and Supplementary File 2). There are even direct PPIs between RA and SZ genes: 7 RA genes and 6 SZ genes interact with each other. Two of the RA genes, namely CSF2 and UNG, are novel interactors of SZ genes. One of the subunits of CSF2 (CSF2RB) is essential for IL3 signaling which is involved in schizophrenia pathology.59 Similarly, the SZ genes ATP2A2 and ETF1 are novel interactors of RA genes.

The parallel analyses of the SZ and the RA interactomes indicated that there are several pathways that are common to, and significantly associated with both the disorders, though it is likely that some of these pathways are disrupted in other disorders (Fig. 3 and Supplementary File 2). We found several pathways that are associated with immune function and inflammation. For example, pathways such as role of NFAT in regulation of immune response has 43 common genes, IL-8 signaling has 27 genes, CD28 signaling in T-Helper cells has 15 genes, natural killer cell signaling has 17 genes, crosstalk between dendritic cells and natural killer cells has 6 genes, NF-kB signaling has 21 genes and B cell receptor signaling has 27 common genes. Prior studies suggested that interleukins may be associated with schizophrenia pathology.60 The NF-kB signaling pathway may also have an important role because NF-kB molecule has key role in immune response regulation and is associated with RA pathology; it has also been implicated in synaptic plasticity and memory, which are commonly altered in schizophrenia patients.61, 62 The PPIs, including novel PPIs that we predicted, highlight how the RA and SZ genes connect to pathways that are of interest in the biology of both the diseases (examples shown in Table 2). All these pathways have highly significant associations with both interactomes, though it is recognized that some of these pathways are likely involved in other disorders (Supplementary File 2).

Some limitations of the present work should be noted. In addition to pleiotropy, there are undoubtedly other mechanisms for the inverse prevalence of SZ and RA that need to be explored. Our studies were solely in silico, and need to be verified through experiments. Genetic risk variants with pleiotropic effects, which were not within genes or their flanking regions, were not explored further (Supplementary File 1). Other variants such as a CNV on C4A could not be analyzed in detail.47 The functional implications of the novel PPIs predicted for C4A (Fig. 2c), especially those that have links to both RA and SZ genes, should be studied further.

It could be argued that instead of evaluating effects of risk SNPs in function of their respective genes, one ought to evaluate their effects on the overall “output” of the respective pathways. Our studies motivate these and similar studies. A related question that would need to be addressed is the impact of such changes on the “end organs” for SZ and RA, i.e., the brain and the joints, respectively.

In conclusion, we integrated epidemiological, GWAS, gene expression and proteomic data to identify genes and pathways with potential pathogenic relevance for both SZ and RA. We recommend avenues for further functional analyses based on these hypotheses.


Identifying SNPs with putative pleiotropic effects

The overview of the procedure for identifying SNPs with putative pleiotropic effects is shown in Fig. 4. We used publicly available SNP level summary statistics from large scale GWAS of schizophrenia and rheumatoid arthritis. The SZ dataset included results from 36,989 cases and 113,075 controls for approximately 9.5 million SNPs.14 The RA dataset included results from 29,880 cases and 73,758 controls for approximately 9.7 million SNPs.13

Fig. 4

Overview of methods to identify SNP pairs with pleiotropic effects: Detailed flowchart representing all of the steps in the thresholding analysis starting with the genome-wide SNPs tested for RA and SZ associations

The analysis started with the selection of candidate RA SNPs, where our goal was to prepare a list of SNPs that are not highly correlated with each other and would individually confer statistically significant risk at genome-wide levels. We combined linkage disequilibrium (LD) pruning with p-value thresholding (p < 10−8) on the RA dataset using ‘Swiss’ software ( LD between pairs of SNPs was calculated based on 1000 Genomes data (phase II, Caucasian ancestry sample),63 with a threshold of r 2 ≤ 0.6 to identify a relatively independent set of RA SNPs. Each of these highly significant and low-LD RA SNPs was paired with each of the SZ SNPs in the initial GWAS list. Out of these, we selected the pairs that had an LD value of r2 ≥ 0.8 between them, resulting in 1376 SNP pairs. When multiple SZ SNPs matched the same RA SNP, the pair with the highest LD between them and the lowest p-value for the SZ SNP was chosen; this resulted in 290 pairs of SNPs.

The odds ratios (OR) and reference alleles of the resulting pairs were then examined as given in the original summary statistics, to identify those which had opposite odds ratios (i.e., where SNP of one disorder had OR < 1 and the SNP of the other disorder had OR > 1).64 This resulted in 46 (r, z) SNP pairs.

In the analysis thus far, for the SNP pair (r, z), SNP r was picked for its effect on RA and z for its effect on SZ; but we also verified that the odds ratio of r for RA and r for SZ are in opposite directions, and vice versa for z. This was found to be applicable only where r and z referred to the same SNP but not otherwise. Thus, at the end of the analysis, there were 29 individual SNPs with opposing effects on the two diseases.

These SNPs were mapped to gene boundaries including 5000 base upstream and downstream flanking regions, as given in the Known Canonical Genes track of UCSC Genome Browser for the human genome build hg19 (GRCh37).65 This resulted in 18 SNPs. Only these 18 SNPs that were localized within gene boundaries were considered for further analysis. The genes within which these 18 SNPs were located are referred to as GAPPS.

Expression analysis of GAPPS

The GTEx browser was used to analyze tissue expression for the 8 GAPPS.12

Interactome construction and analysis

The interactomes were assembled by collecting known PPIs from the Human Protein Reference Database66 and Biological General Repository for Interaction Datasets,67 and by computing novel PPIs using the HiPPIP model that we developed.11 The predicted PPIs have been shown to be highly accurate by computational evaluations and experimental validations of a few PPIs in our earlier work.11 Interactome figures were created using Cytoscape.68 Pathways associated with proteins in RA interactome and SZ interactome were collected separately using Ingenuity Pathway Analysis® suite ( All the proteins including candidates, known and novel interactors that are present in each interactome were loaded into IPA suite, which returns all the pathways associated with any of the genes and the list of those genes, and the statistical significance of association computed with Benjamini-Hochberg correction for multiple testing, a widely used method to control the rate of false discoveries in statistical hypothesis testing. A corrected p-value P can be interpreted as an upper bound for the expected fraction of falsely rejected null hypotheses among all functions with p-values smaller than P.69 Supplementary File 3 shows all pathways associated with any gene(s) from the interactomes irrespective of the statistical significance of association so that readers can use the supplementary data to not only identify significantly associated pathways (by choosing a stringent p-value threshold) and also to query individual genes for their functional/pathway associations irrespective of that pathway being significant in the interactome; for example, novel interactors of SZ genes PRKAG1, PRKAR1B and two other known interactors are found to be involved in sonic hedgehog signaling pathway, which may be useful to know although that pathway is not statistically significant in the overall interactome.

The list of pathways associated with each of the two interactomes were merged using a computer program to present clearly which of the two interactomes the genes of associated with the pathway belonged to, how many genes from each interactome were associated with that pathway, and what the B-H corrected P-value of significance was for that pathway with each interactome. While a selected few pathways are shown in Table 2, a full list is given in Supplementary File 3. The pathways shown in Table 2 and Fig. 3 are statistically significant (p-value < 10–6), whereas Supplementary File 3 shows all pathways associated with any gene(s) from the interactomes irrespective of the statistical significance of association.

NextBio is a suite of tools that enables the study of correlated effect of diseases and/or drugs on gene expression using publicly available gene expression data.15 We used this in an independent analysis to identify genes that were overexpressed in one disease (either RA or SZ) and under-expressed in the other. We queried the genes in this RA–SZ reciprocal expression against the LINCS database (, a massive catalog of differential gene-expression profiles of human cells resulting from treatment with chemical and genetic perturbagens. With this, we identified gene knockouts and chemical compounds which result in differential gene expression pattern that correlates with RA (anti-correlation with SZ) or correlates with SZ (anti-correlation with RA).


  1. 1.

    Gottesman, I. I. & Shields, J. A polygenic theory of schizophrenia. Proc. Natl. Acad. Sci. 58, 199–205 (1967).

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  2. 2.

    Consortium, S. W. G. o. t. P. G. Biological insights from 108 schizophrenia-associated genetic loci. Nature. 511, 421–427, doi:10.1038/nature13595 (2014).

    Article  Google Scholar 

  3. 3.

    Andreassen, O. A. et al. Genetic pleiotropy between multiple sclerosis and schizophrenia but not bipolar disorder: differential involvement of immune-related gene loci. Mol. Psychiatry. 20, 207–214, doi:10.1038/mp.2013.195 (2015).

    CAS  Article  PubMed  Google Scholar 

  4. 4.

    Smolen, J. S., Aletaha, D. & McInnes, I. B. Rheumatoid arthritis. Lancet.. doi:10.1016/S0140-6736(16)30173-8 (2016).

    Google Scholar 

  5. 5.

    Benros, M. E. et al. A nationwide study on the risk of autoimmune diseases in individuals with a personal or a family history of schizophrenia and related psychosis. Am. J. Psychiatry. 171, 218–226, doi:10.1176/appi.ajp.2013.13010086 (2014).

    Article  PubMed  Google Scholar 

  6. 6.

    Vinogradov, S., Gottesman, I. I., Moises, H. W. & Nicol, S. Negative association between schizophrenia and rheumatoid arthritis. Schizophr. Bull. 17, 669–678 (1991).

    CAS  Article  PubMed  Google Scholar 

  7. 7.

    Oken, R. J. & Schulzer, M. At issue: schizophrenia and rheumatoid arthritis: the negative association revisited. Schizophr. Bull. 25, 625–638 (1999).

    CAS  Article  PubMed  Google Scholar 

  8. 8.

    Lee, S. H. et al. New data and an old puzzle: the negative association between schizophrenia and rheumatoid arthritis. Int. J. Epidemiol. 44, 1706–1721, doi:10.1093/ije/dyv136 (2015).

    Article  PubMed  PubMed Central  Google Scholar 

  9. 9.

    Euesden, J., Breen, G., Farmer, A., McGuffin, P. & Lewis, C. M. The relationship between schizophrenia and rheumatoid arthritis revisited: genetic and epidemiological analyses. Am. J. Med. Genet. B. Neuropsychiatr. Genet. 168B, 81–88, doi:10.1002/ajmg.b.32282 (2015).

    Article  PubMed  Google Scholar 

  10. 10.

    Bulik-Sullivan, B. et al. An atlas of genetic correlations across human diseases and traits. Nat. Genet. 47, 1236–1241, doi:10.1038/ng.3406 (2015).

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  11. 11.

    Ganapathiraju, M. K. et al. Schizophrenia interactome with 504 novel protein-protein interactions. NPJ. Schizophr 2, 16012, doi:10.1038/npjschz.2016.12 (2016).

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  12. 12.

    Consortium, G. T. The genotype-tissue expression (GTEx) project. Nat. Genet. 45, 580–585, doi:10.1038/ng.2653 (2013).

    Article  Google Scholar 

  13. 13.

    Okada, Y. et al. Genetics of rheumatoid arthritis contributes to biology and drug discovery. Nature. 506, 376–381 (2014).

    CAS  Article  PubMed  Google Scholar 

  14. 14.

    Schizophrenia Working Group of the Psychiatric Genomics, C. Biological insights from 108 schizophrenia-associated genetic loci. Nature. 511, 421–427, doi:10.1038/nature13595 (2014).

    Article  Google Scholar 

  15. 15.

    Kupershmidt, I. et al. Ontology-based meta-analysis of global collections of high-throughput public data. PLoS. ONE. 5, e13066, doi:10.1371/journal.pone.0013066 (2010).

  16. 16.

    Bayer, T. A., Buslei, R., Havas, L. & Falkai, P. Evidence for activation of microglia in patients with psychiatric illnesses. Neurosci. Lett. 271, 126–128 (1999).

    CAS  Article  PubMed  Google Scholar 

  17. 17.

    Heath, R. G. & Krupp, I. M. Schizophrenia as an immunologic disorder. I. Demonstration of antibrain globulins by fluorescent antibody techniques. Arch. Gen. Psychiatry. 16, 1–9 (1967).

    CAS  Article  PubMed  Google Scholar 

  18. 18.

    Heath, R. G., Krupp, I. M., Byers, L. W. & Lijekvist, J. I. Schizophrenia as an immunologic disorder. 3. Effects of antimonkey and antihuman brain antibody on brain function. Arch. Gen. Psychiatry. 16, 24–33 (1967).

    CAS  Article  PubMed  Google Scholar 

  19. 19.

    Heath, R. G., Krupp, I. M., Byers, L. W. & Liljekvist, J. I. Schizophrenia as an immunologic disorder. II. Effects of serum protein fractions on brain function. Arch. Gen. Psychiatry. 16, 10–23 (1967).

    CAS  Article  PubMed  Google Scholar 

  20. 20.

    Radewicz, K., Garey, L. J., Gentleman, S. M. & Reynolds, R. Increase in HLA-DR immunoreactive microglia in frontal and temporal cortex of chronic schizophrenics. J. Neuropathol. Exp. Neurol. 59, 137–150 (2000).

    CAS  Article  PubMed  Google Scholar 

  21. 21.

    Rothermundt, M., Arolt, V. & Bayer, T. A. Review of immunological and immunopathological findings in schizophrenia. Brain. Behav. Immun. 15, 319–339, doi:10.1006/brbi.2001.0648 (2001).

    CAS  Article  PubMed  Google Scholar 

  22. 22.

    Meyer, U. & Feldon, J. Neural basis of psychosis-related behaviour in the infection model of schizophrenia. Behav. Brain. Res. 204, 322–334 (2009). doi:S0166-4328(08)00724-9 [pii]10.1016/j.bbr.2008.12.022.

    CAS  Article  PubMed  Google Scholar 

  23. 23.

    Meyer, U., Weiner, I., McAlonan, G. M. & Feldon, J. The neuropathological contribution of prenatal inflammation to schizophrenia. Expert. Rev. Neurother. 11, 29–32, doi:10.1586/ern.10.169 (2011).

    CAS  Article  PubMed  Google Scholar 

  24. 24.

    Muller, N. & Schwarz, M. J. Immune system and schizophrenia. Curr. Immunol. Rev. 6, 213–220 (2010).

    Article  PubMed  PubMed Central  Google Scholar 

  25. 25.

    Saetre, P. et al. Inflammation-related genes up-regulated in schizophrenia brains. BMC. Psychiatry. 7, 46 (2007).

    Article  PubMed  PubMed Central  Google Scholar 

  26. 26.

    Wierzba-Bobrowicz, T., Lewandowska, E., Lechowicz, W., Stepien, T. & Pasennik, E. Quantitative analysis of activated microglia, ramified and damage of processes in the frontal and temporal lobes of chronic schizophrenics. Folia. Neuropathol. 43, 81–89 (2005).

    PubMed  Google Scholar 

  27. 27.

    Fillman, S. G. et al. Increased inflammatory markers identified in the dorsolateral prefrontal cortex of individuals with schizophrenia. Mol. Psychiatry. 18, 206–214, doi:10.1038/mp.2012.110 (2013).

    CAS  Article  PubMed  Google Scholar 

  28. 28.

    Harris, L. W. et al. The cerebral microvasculature in schizophrenia: a laser capture microdissection study. PLoS. One. 3, e3964, doi:10.1371/journal.pone.0003964 (2008).

    Article  PubMed  PubMed Central  Google Scholar 

  29. 29.

    Henneberg, A. E., Horter, S. & Ruffert, S. Increased prevalence of antibrain antibodies in the sera from schizophrenic patients. Schizophr. Res. 14, 15–22 (1994).

    CAS  Article  PubMed  Google Scholar 

  30. 30.

    Ganguli, R., Rabin, B. S., Kelly, R. H., Lyte, M. & Ragu, U. Clinical and laboratory evidence of autoimmunity in acute schizophrenia. Ann. N. Y. Acad. Sci. 496, 676–685 (1987).

    CAS  Article  PubMed  Google Scholar 

  31. 31.

    Kelly, R. H., Ganguli, R. & Rabin, B. S. Antibody to discrete areas of the brain in normal individuals and patients with schizophrenia. Biol. Psychiatry. 22, 1488–1491 (1987).

    CAS  Article  PubMed  Google Scholar 

  32. 32.

    Tsutsui, K. et al. Anti-NMDA-receptor antibody detected in encephalitis, schizophrenia, and narcolepsy with psychotic features. BMC. Psychiatry. 12, 37, doi:10.1186/1471-244X-12-37 (2012).

    Article  PubMed  PubMed Central  Google Scholar 

  33. 33.

    Benros, M. E. et al. Autoimmune diseases and severe infections as risk factors for schizophrenia: a 30-year population-based register study. Am. J. Psychiatry. 168, 1303–1310, doi:10.1176/appi.ajp.2011.11030516 (2011).

    Article  PubMed  Google Scholar 

  34. 34.

    Chen, S.-J. et al. Prevalence of autoimmune diseases in in-patients with schizophrenia: nationwide population-based study. The British. J. Psyc. 200, 374–380, doi:10.1192/bjp.bp.111.092098 (2012).

    Article  Google Scholar 

  35. 35.

    Potvin, S. et al. Inflammatory cytokine alterations in schizophrenia: a systematic quantitative review. Biol. Psyc. 63, 801–808, doi:10.1016/j.biopsych.2007.09.024 (2008).

    CAS  Article  Google Scholar 

  36. 36.

    Purcell, S. M. et al. Common polygenic variation contributes to risk of schizophrenia and bipolar disorder. Nature. 460, 748–752, doi:10.1038/nature08185 (2009).

    CAS  PubMed  Google Scholar 

  37. 37.

    Ripke, S. et al. Genome-wide association study identifies five new schizophrenia loci. Nat. Genet. 43, 969–976, doi:10.1038/ng.940 (2011).

    CAS  Article  Google Scholar 

  38. 38.

    Shi, J. et al. Common variants on chromosome 6p22.1 are associated with schizophrenia. Nature. 460, 753–757, doi:10.1038/nature08192 (2009).

    CAS  PubMed  PubMed Central  Google Scholar 

  39. 39.

    Stefansson, H. et al. Common variants conferring risk of schizophrenia. Nature. 460, 744–747, doi:10.1038/nature08186 (2009).

    CAS  PubMed  PubMed Central  Google Scholar 

  40. 40.

    Muller, N. et al. Celecoxib treatment in an early stage of schizophrenia: results of a randomized, double-blind, placebo-controlled trial of celecoxib augmentation of amisulpride treatment. Schizophr. Res. 121, 118–124 (2010). doi:S0920-9964(10)01268-5 [pii]10.1016/j.schres.2010.04.015.

    Article  PubMed  Google Scholar 

  41. 41.

    Muller, N. et al. Beneficial antipsychotic effects of celecoxib add-on therapy compared to risperidone alone in schizophrenia. Am. J. Psychiatry. 159, 1029–1034 (2002).

    Article  PubMed  Google Scholar 

  42. 42.

    Sommer, I. E., de Witte, L., Begemann, M. & Kahn, R. S. Nonsteroidal anti-inflammatory drugs in schizophrenia: ready for practice or a good start? A meta-analysis. J. Clin. Psychiatry. 73, 414–419, doi:10.4088/JCP.10r06823 (2012).

    CAS  Article  PubMed  Google Scholar 

  43. 43.

    Prasad, K. M., Upton, C. H., Nimgaonkar, V. L. & Keshavan, M. S. Differential susceptibility of white matter tracts to inflammatory mediators in schizophrenia: An integrated DTI study. Schizophr. Res. 161, 119–125, doi:10.1016/j.schres.2014.09.043 (2015).

    Article  PubMed  Google Scholar 

  44. 44.

    Prasad, K. M., Burgess, A., Nimgaonkar, V. L., Keshavan, M. S. & Stanley, J. A. neuropil pruning in early-course schizophrenia: Immunological, clinical and neurocognitive correlates. Biol. Psyc. 1, 528–538, doi:10.1016/j.bpsc.2016.08.007 (2016).

  45. 45.

    Wright, P. et al. Genetic association of the HLA DRB1 gene locus on chromosome 6p21.3 with schizophrenia. Am. J. Psychiatry. 153, 1530–1533, doi:10.1176/ajp.153.12.1530 (1996).

    CAS  Article  PubMed  Google Scholar 

  46. 46.

    Ripke, S. et al. Genome-wide association analysis identifies 13 new risk loci for schizophrenia. Nat. Genet. 45, 1150–1159, doi:10.1038/ng.2742 (2013).

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  47. 47.

    Sekar, A. et al. Schizophrenia risk from complex variation of complement component 4. Nature. 530, 177–183, doi:10.1038/nature16549 (2016).

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  48. 48.

    Sheehan, N. J. The ramifications of HLA-B27. J. R. Soc. Med. 97, 10–14 (2004).

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  49. 49.

    Palmer, C. G. et al. HLA-B maternal-fetal genotype matching increases risk of schizophrenia. Am. J. Hum. Genet. 79, 710–715, doi:10.1086/507829 (2006).

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  50. 50.

    Aramaki, T. et al. A significantly impaired natural killer cell activity due to a low activity on a per-cell basis in rheumatoid arthritis. Mod. Rheumatol. 19, 245–252, doi:10.1007/s10165-009-0160-6 (2009).

    Article  PubMed  Google Scholar 

  51. 51.

    Yovel, G. et al. Higher natural killer cell activity in schizophrenic patients: the impact of serum factors, medication, and smoking. Brain. Behav. Immun. 14, 153–169, doi:10.1006/brbi.1999.0574 (2000).

    CAS  Article  PubMed  Google Scholar 

  52. 52.

    Dubey, D. P., Yunis, I., Leslie, C. A., Mehta, C. & Yunis, E. J. Homozygosity in the major histocompatibility complex region influences natural killer cell activity in man. Eur. J. Immunol. 17, 61–66, doi:10.1002/eji.1830170111 (1987).

    CAS  Article  PubMed  Google Scholar 

  53. 53.

    Dubey, D. P., Alper, C. A., Mirza, N. M., Awdeh, Z. & Yunis, E. J. Polymorphic Hh genes in the HLA-B(C) region control natural killer cell frequency and activity. J. Exp. Med. 179, 1193–1203 (1994).

    CAS  Article  PubMed  Google Scholar 

  54. 54.

    Zhang, H. et al. Linkage of the genes controlling natural killer cell activity to HLA-B. Zhonghua. Yi. Xue. Yi. Chuan. Xue. Za. Zhi. 17, 188–191 (2000).

    PubMed  Google Scholar 

  55. 55.

    Blais, M. E., Dong, T. & Rowland-Jones, S. HLA-C as a mediator of natural killer and T-cell activation: spectator or key player? Immunology. 133, 1–7, doi:10.1111/j.1365-2567.2011.03422.x (2011).

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  56. 56.

    Zhang, Y. et al. Human leukocyte antigen (HLA)-C polymorphisms are associated with a decreased risk of rheumatoid arthritis. Mol. Biol. Rep. 41, 4103–4108, doi:10.1007/s11033-014-3280-9 (2014).

    CAS  Article  PubMed  Google Scholar 

  57. 57.

    Irish Schizophrenia Genomics, C. & the Wellcome Trust Case Control, C. Genome-wide association study implicates HLA-C*01:02 as a risk factor at the major histocompatibility complex locus in schizophrenia. Biol. Psyc. 72, 620–628, doi:10.1016/j.biopsych.2012.05.035 (2012).

    Article  Google Scholar 

  58. 58.

    Wei, J. & Hemmings, G. P. The NOTCH4 locus is associated with susceptibility to schizophrenia. Nat. Genet. 25, 376–377 (2000).

    CAS  Article  PubMed  Google Scholar 

  59. 59.

    Chen, Q. et al. Association study of CSF2RB with schizophrenia in Irish family and case - control samples. Mol. Psyc. 13, 930–938, doi:10.1038/ (2008).

    CAS  Article  Google Scholar 

  60. 60.

    Brown, A. S. et al. Elevated maternal interleukin-8 levels and risk of schizophrenia in adult offspring. Am. J. Psyc. 161, 889–895 (2004).

    Article  Google Scholar 

  61. 61.

    Roman-Blas, J. A. & Jimenez, S. A. NF-kappaB as a potential therapeutic target in osteoarthritis and rheumatoid arthritis. Osteoarthritis. Cartilage. 14, 839–848, doi:10.1016/j.joca.2006.04.008 (2006).

    CAS  Article  PubMed  Google Scholar 

  62. 62.

    Albensi, B. C. & Mattson, M. P. Evidence for the involvement of TNF and NF-kappaB in hippocampal synaptic plasticity. Synapse. 35, 151–159, doi:10.1002/(SICI)1098-2396(200002)35:2<151::AID-SYN8>3.0.CO;2-P (2000).

    CAS  Article  PubMed  Google Scholar 

  63. 63.

    Genomes Project, C. et al. A global reference for human genetic variation. Nature. 526, 68–74, doi:10.1038/nature15393 (2015).

    Article  Google Scholar 

  64. 64.

    Sirota, M., Schaub, M. A., Batzoglou, S., Robinson, W. H. & Butte, A. J. Autoimmune disease classification by inverse association with SNP alleles. PLoS. Genet. 5, e1000792, doi:10.1371/journal.pgen.1000792 (2009).

    Article  PubMed  PubMed Central  Google Scholar 

  65. 65.

    Karolchik, D. et al. The UCSC table browser data retrieval tool. Nucleic. Acids. Res. 32, D493–D496, doi:10.1093/nar/gkh103 (2004).

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  66. 66.

    Peri, S. et al. Human protein reference database as a discovery resource for proteomics. Nucleic. Acids. Res. 32, D497–D501, doi:10.1093/nar/gkh070 (2004).

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  67. 67.

    Stark, C. et al. BioGRID: a general repository for interaction datasets. Nucleic. Acids. Res. 34, D535–539, doi:10.1093/nar/gkj109 (2006).

    CAS  Article  PubMed  Google Scholar 

  68. 68.

    Shannon, P. et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 13, 2498–2504, doi:10.1101/gr.1239303 (2003).

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  69. 69.

    Benjamini, Y. & Hochberg, Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J of Royal Stat. Soc. Series B 57, 289–300 (1995).

    Google Scholar 

Download references


VLN thanks Ansuman Chattopadhyay, Bernie Devlin and colleagues in his research group for discussions in the SNP analyses and for comments on the manuscript. This work has been funded by grants MH63480 and MH093246 (VLN), R01MH094564 (MKG) and MH084053 (Dr. David Lewis) awarded by the National Institute of Mental Health of National Institutes of Health (NIMH/NIH) of USA. We thank Dr. David Lewis of Department of Psychiatry at University of Pittsburgh for his support through MH084053.

Author information




V.L.N. initiated and oversaw the study. T.A.M., J.W., and K.V.C. carried out analysis to identify pleiotropic SNPs. S.C. carried out pathway and NextBio gene expression analysis. M.K.G. carried out the interactome analysis and oversaw the pathway and gene expression analysis. AGJ carried out analysis of perturbagens. K.V.C., L.M., and K.M.P. provided critical comments and assisted with the discussion. Manuscript has been prepared by all the co-authors. Manuscript has been read and approved by all authors.

Corresponding authors

Correspondence to Madhavi K. Ganapathiraju or Vishwajit L. Nimgaonkar.

Ethics declarations

Competing interest

Authors declare that they do not have any competing interest.

Electronic supplementary material

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Malavia, T.A., Chaparala, S., Wood, J. et al. Generating testable hypotheses for schizophrenia and rheumatoid arthritis pathogenesis by integrating epidemiological, genomic, and protein interaction data. npj Schizophr 3, 11 (2017).

Download citation

Further reading


Quick links