Introduction

Psychotic and affective disorders have long been viewed as two separate axes of mental illness, and early practitioners of psychiatry like Emil Kraepelin and Eugen Bleuler observed that relatives of patients with schizophrenia (SCZ) have an increased rate of psychiatric disorders ranging from atypical psychoses to SCZ spectrum personality disorders, many of which appeared to be milder versions of the symptoms observed in patients with SCZ1. Some of the first family studies of SCZ conducted in the early 20th century, confirmed that in addition to SCZ, a range of other psychiatric disorders on the psychosis spectrum also aggregate in the relatives of probands with SCZ. These findings were later solidified by the Danish Adoption Study of Schizophrenia, which showed that biological relatives of patients with SCZ were at an increased risk for SCZ as well as milder syndromes on the psychosis spectrum2.

Large-scale genome-wide association studies (GWAS) conducted by the Psychiatric Genomics Consortium (PGC) have shown that common risk variation in the genome (minor allele frequency ā‰„1%) can explain a modest portion of the heritability of major psychiatric disorders3,4,5. Additionally, the Cross-Disorder Group of the PGC has provided robust, replicable evidence for strong genetic correlation (rG) among SCZ and bipolar disorder (BIP) (rGā€‰=ā€‰0.68), and SCZ and major depressive disorder (MDD) (rGā€‰=ā€‰0.35)6. BIP and MDD also have a significant positive genetic correlation estimated to be rGā€‰=ā€‰0.444. Together, these results indicate that there is a substantial genetic overlap among these three disorders with varying degree of psychotic and affective features, suggesting widespread pleiotropy in the genetic architecture of psychiatric disorders at common variation level6.

The Irish Study of High-Density Schizophrenia Families (ISHDSF)7 consists of 257 multiplex SCZ families with genotype data, ascertained to have two or more first-degree relatives meeting the Diagnostic and Statistical Manual of Mental Disorders (DSM-III-R) criteria for SCZ or poor-outcome schizoaffective disorder. In line with previous epidemiological observations in the relatives of probands with SCZ8,9, in addition to a significant aggregation of psychotic disorders, other psychiatric diagnoses including affective, personality, and substance use disorders, are also present in the ISHDSF sample7. Furthermore, our previous polygenic risk score (PRS) profiling in the ISHDSF sample shows that all case definitions of the psychosis spectrum in the ISHDSF sample, including the unaffected relatives of probands, have an increased burden of common SCZ risk variation compared to population controls. This observation is consistent with its polygenic architecture, and the observation of SCZ transmission through some non-psychotic, or unaffected family members in this sample10,11.

The high baseline risk for SCZ observed across all case definitions of the ISHDSF sample10, coupled with the evidence for strong rG among SCZ, BIP, and MDD, suggests that members of the ISHDSF may have an increased polygenic risk for BIP and MDD. In this study, we sought to test this hypothesis by constructing univariate BIP and MDD PRS in subjects from 257 multiplex SCZ families and 2,205 ancestry-matched population controls all from the population of the island of Ireland. Given that the strong rG among SCZ, BIP, and MDD, makes standard univariate cross-disorder PRS profiling in members of multiplex families less informative, we also used GWAS-by subtraction12 as implemented in the genomic structural equation modeling (genomicSEM) framework13, to disentangle BIP and MDD polygenic signals into underlying genetic factors. By doing so, we investigated whether the increased polygenic risk for BIP or MDD in multiplex SCZ families is attributable to the portion of the genetic risk that these two disorders share with SCZ due to their rG, or the affective portion of the genetic risk that is unique to them. To further investigate whether polygenic risks for BIP and MDD, and their unique and shared genetic factors are over-transmitted from parents to probands in the families, we performed polygenic transmission disequilibrium testing14 in a subset of the ISHDSF sample with full parent-offspring information. Based on epidemiological findings in multiplex SCZ families and the substantial rG among SCZ, BIP, and MDD, we hypothesized that increased burden of common risk variation conferring risk to BIP or MDD in multiplex SCZ families is likely to be due to the portion of the genetic risk that these disorders share with SCZ due to rG, rather than the affective portion of the genetic risk unique to them. Therefore, by addressing these questions, we attempt to clarify the complexity of cross-disorder PRS analyses in multiplex families.

Results

Sample structure

The diagnostic schema in the ISHDSF (Fig. 1) follows a concentric pattern ranked by the degree to which they reflect the narrow versus broad definitions of the psychosis spectrum. This includes four case definitions: narrow which includes SCZ, simple SCZ, and poor-outcome schizoaffective disorder, intermediate (adding schizotypal personality disorder, schizophreniform disorder, delusional disorders, psychosis not otherwise specified, and good-outcome schizoaffective disorder), broad (adding psychotic affective illness, paranoid, avoidant, and schizoid personality disorders, and other disorders that significantly aggregate in relatives of SCZ probands based on previous epidemiological work in Ireland15), and very broad (adding any other psychiatric illnesses in the families). The ISHDSF sample also includes unaffected relatives in multiplex families with no diagnosis of any psychiatric illness.

Fig. 1: Concentric diagnostic schema of the ISHDSF sample.
figure 1

The concentric diagnostic schema of the ISHDSF sample based on previous genetic epidemiology contains four case definitions reflecting the schizophrenia spectrum: (1) narrow case definition, (2) intermediate case definition, (3) broad case definition, and (4) very broad case definition. SADā€‰=ā€‰schizoaffective disorder.

Increased univariate BIP and MDD PRS in multiplex SCZ families

Fig. 2 shows the results for the univariate BIP and MDD PRS in multiplex SCZ families versus population controls. All case definitions in multiplex SCZ families show a significantly increased BIP PRS compared to population controls (Fig. 2a). The highest odds ratio (OR) was observed in the broad case definition (ORā€‰=ā€‰2.21, 95% CIā€‰=ā€‰1.62ā€“3.03) which includes 17 of the 21 BIP cases in the ISHDSF sample with psychotic features, excluding BIP cases without psychotic features which are represented in the very broad case definition. Except for the unaffected relatives, all case definitions in the ISHDSF sample also show a significantly increased MDD PRS compared to population controls (Fig. 2b). The highest OR was observed in the very broad case definition (ORā€‰=ā€‰1.45, 95% CIā€‰=ā€‰1.20ā€“1.76), which includes 80 of 102 MDD cases in the ISHDSF sample, excluding MDD cases with psychotic features that are represented in the broad case definition. Full results are provided in Supplementary Tables 1 and 2.

Fig. 2: Odds ratio (OR) plots with 95% confidence intervals (95% CI) for bipolar disorder (BIP) and major depressive disorder (MDD) PRS.
figure 2

All comparison analyses follow the hypothesis that ISHDSF subjects have a higher PRS for BIP (a) or MDD (b) compared to population controls. All PRS have been normalized using Z score standardization prior to obtaining ORs. The plot shows OR (filled circles) with 95% CI for each case definition compared to population controls. One-sided p values after multiple testing corrections are provided on the right side of the Y axis. Red dot represents significant and the blue dot represents non-significant results.

No increased affective factor PRS in multiplex SCZ families

Fig. 3 shows the results for SCZ factor and Affective factor components derived from BIP and MDD polygenic risks in multiplex SCZ families versus population controls. The PRS constructed for SCZ factor in BIP, and SCZ factor in MDD (Fig. 3a, c), representing part of the polygenic risk that these two disorders share with SCZ due to rG, are significantly increased in all case definitions in the ISHDSF sample compared to population controls. In contrast, the PRS constructed from Affective factor in BIP, and Affective factor in MDD (Fig. 3b, d), representing the affective portion of the polygenic risk unique to these two disorders, show no significant increase in members of multiplex SCZ families compared to population controls. Full results are provided in Supplementary Table 3.

Fig. 3: Density plots showing the distribution of SCZ factor and Affective factor PRS results.
figure 3

Dotted line shows the mean PRS value for population controls. a, b show the distribution of SCZ factor PRS for bipolar disorder (BIP) and major depressive disorder (MDD) respectively. c, d show the distribution of Affective factor PRS for BIP and MDD respectively. Each color represents one of the case definitions in the ISHDSF sample shown on the left side of each panel. One-sided p values after multiple testing correction are provided on the right side of each panel. All comparisons follow the hypothesis that ISHDSF members have higher PRS compared to population controls.

To assess the generalizability of these observations beyond multiplex SCZ families, we replicated the PRS analyses for SCZ and Affective factors underlying BIP and MDD in an independent cohort of ancestry-matched sporadic SCZ cases (Nā€‰=ā€‰2225) from Ireland. As shown in Supplementary Fig. 1, the observed pattern of PRS enrichment in cases from multiplex SCZ families is similar to ancestry-matched sporadic cases, demonstrating the generalizability of these observations in an independent cohort of SCZ cases.

Polygenic transmission disequilibrium test (pTDT) in multiplex SCZ families

Fig. 4 shows the pTDT results in multiplex SCZ families. In panel 4a we show that as expected, SCZ polygenic risks described in a previous study10 are significantly over-transmitted from parents to probands, while low-density lipoprotein (LDL) polygenic risks used as a negative control show no significant over-transmission from parents to probands, suggesting the absence of systematic biases in the results. We next show that univariate BIP and MDD polygenic risks are also significantly over-transmitted from parents to probands in multiplex SCZ families (Fig. 4a). Polygenic risks for SCZ factors derived from BIP and MDD are also significantly over-transmitted from parents to probands in the families. In contrast, no significant over-transmission of polygenic risks derived from Affective factors unique to BIP or MDD were observed in multiplex SCZ families (Fig. 4b).

Fig. 4: Polygenic transmission disequilibrium test results.
figure 4

transmission disequilibrium is presented as standard deviation on the mid-parent distribution with 95% confidence intervals. One-sided one-sample t-tests were used to evaluate polygenic over-transmission in 41 multiplex SCZ families. a polygenic risk for schizophrenia (SCZ), bipolar disorder (BIP), major depressive disorder (MDD), and low-density lipoprotein (LDL). b Polygenic risk for SCZ factors and Affective factors, representing part of the genetic risk shared with SCZ or unique to BIP or MDD, respectively. Dotted line represents mid-parent PRS.

Polygenicity and polygenic overlap

We used MiXeR16 to estimate polygenicity, polygenic overlap, and the number of shared and unique causal variants between SCZ and BIP, SCZ and MDD, and their underlying latent genetic factors. In agreement with previous findings, the polygenic signals of SCZ substantially overlap with BIP (Fig. 5a) and MDD (Fig. 5b), while SCZ is estimated to be more polygenic than BIP (Fig. 5a), but less polygenic than MDD (Fig. 5b)16. Similarly, we observe that polygenic signals from SCZ factors and Affective factors derived from BIP (Fig. 5c) and MDD (Fig. 5d) also substantially overlap. The SCZ factor underlying BIP (Fig. 5c) is estimated to be more polygenic than the Affective factor underlying BIP, whereas the Affective factor underlying MDD (Fig. 5d) is estimated to be more polygenic than the SCZ factor.

Fig. 5: Polygenic overlap of schizophrenia (SCZ) with bipolar disorder (BIP) and major depressive disorder (MDD) and their derived latent factors.
figure 5

Venn diagrams show the estimated portion of non-null causal variants shared and unique to each of the phenotypes. The gray part represents the overlap, and the size represents the level of polygenicity for each phenotype. The numbers indicate the number of causal variants in thousands, explaining 90% of SNP heritability for each phenotype followed by the standard error in parenthesis. The polygenic overlap is measured as the Dice coefficient, which is the percentage of shared causal variants between two phenotypes (0ā€“100% scale). a Polygenic overlap between schizophrenia and bipolar disorder. b Polygenic overlap between schizophrenia and major depressive disorder. c Polygenic overlap between underlying factors in bipolar disorder. d Polygenic overlap between underlying factors in major depressive disorder.

Discussion

Large-scale GWAS of SCZ, BIP, and MDD4,5,17, have shown that many common risk variants with small effect sizes contribute to disease risk in major psychiatric disorders. Additionally, cross-disorder analyses of psychiatric disorders have also provided consistent evidence that SCZ, BIP, and MDD share substantial genetic risk at common variation level6. Based on these observations, we sought to investigate the source of the increased common risk variation burden of BIP and MDD, two disorders with prominent affective features and strong rG with SCZ, in members of multiplex SCZ families.

Our results indicate that members of multiplex SCZ families, including the unaffected relatives, have an increased burden of common risk variation conferring risk to BIP compared to ancestry-matched population controls. With the exception of the unaffected relatives, we also observe that members of multiplex SCZ families also have an increased burden of common risk variation conferring risk to MDD compared to ancestry-matched population controls. We used genomicSEM to disentangle BIP and MDD polygenic risks into underlying genetic factors that they share with SCZ due to rG which we called SCZ factors, and underlying genetic factors unique to BIP and MDD, which we called Affective factors. Our results suggest that increased polygenic risks for BIP and MDD in multiplex SCZ families are largely driven by part of the genetic liability that these two disorders share with SCZ due to rG. This observation is in agreement with epidemiological findings that show a significant increase in the incidence of psychotic, but not affective disorders in relatives of patients with SCZ in multiplex families8,9,18. While in addition to SCZ, non-schizophrenic psychotic disorders19 also show significant familial aggregation in multiplex families15, affective and anxiety disorders are generally not considered to be on the same continuum as psychotic disorders20. Therefore, these results provide empirical genetic evidence in support of previous epidemiological findings in multiplex SCZ families by suggesting that although members of multiplex SCZ families have an increased polygenic risk burden for two disorders with prominent affective features, the source of this increased polygenic risk in a sample with a high incidence of SCZ is largely due to strong rG of these two disorders with SCZ. We also replicated these results in an independent sample of sporadic SCZ cases and showed that this observation is also generalizable to sporadic SCZ cases beyond multiplex families.

Using the MiXeR, we quantified the polygenicity and polygenic overlap between the latent factors generated in this study. MiXeR has been used in recent years to quantify the polygenicity and the polygenic overlap between SCZ and other complex traits21,22,23. In agreement with previous findings16, we first showed that SCZ shows substantial polygenic overlap with BIP and MDD. Next, we showed that while SCZ and Affective factors underlying BIP or MDD have a substantial overlap, SCZ factor appears to be more polygenic than Affective factor in BIP, whereas SCZ factor appears to be less polygenic than Affective factor in MDD. This observation is expected, as previous findings show that while SCZ is more polygenic than BIP, MDD is considered to be more polygenic than both BIP and SCZ24. Therefore, these findings suggest that although both BIP and MDD have prominent affective features in their etiology, the polygenic signals conferring risk to affective features of MDD appear to be more polygenic than BIP.

We used an extension of standard TDT, called pTDT to investigate whether polygenic risks for BIP, MDD, and their shared and unique genetic factors are over-transmitted from parents to probands. Children are expected to inherit half of their parentsā€™ risk alleles and transmission disequilibrium method25 tests whether a genetic variant or aggregate polygenic scores in pTDT is transmitted more than half the time from parents to probands which signifies over-transmission. pTDT is robust to population stratification and different sources of unmeasured biases such as socioeconomic status or environmental influences since matched, un-transmitted chromosomes in families are employed as controls. In agreement with expectation and prior studies26, polygenic liability to SCZ was over-transmitted from parents to probands in the ISHDSF sample, while we observed no over-transmission of polygenic liability to LDL which shows no direct correlation or causation to psychiatric disorders. Next, we demonstrated that univariate BIP and MDD polygenic risks are also significantly over-transmitted from parents to probands in the ISHDSF sample, indicating that probandā€™s polygenic risks for these disorders were on average higher than that of their parents. Furthermore, only the portion of the polygenic risk that BIP or MDD share with SCZ due to rG were over-transmitted from parents to probands, and no over-transmission of the affective portion of the risk unique to them was observed. Members of multiplex SCZ families show a higher incidence of psychotic, but not affective disorders compared to the general population, making these observations consistent with SCZ transmission through some non-psychotic or unaffected family members in the ISHDSF sample10.

Other families and pedigree studies of psychiatric disorders have also demonstrated the presence of an increased cross-disorder polygenic risk for psychiatric disorders. Andlauer and colleagues27 analyzed multiplex BIP families (Nā€‰=ā€‰395) consisting of 166 BIP and 78 MDD cases and showed that familial BIP cases and their unaffected relatives, had an increased PRS for BIP and SCZ compared to population controls. Szatkiewicz and colleagues28 used a densely affected psychiatric pedigree (Nā€‰=ā€‰418) and showed an increased SCZ PRS in affected members compared to unaffected members and population controls. De Jong and colleagues29 also used a dense pedigree (Nā€‰=ā€‰300) with BIP and MDD cases and showed nominally significant BIP and SCZ PRS in affected members compared to unaffected members and population controls.

In contrast to the findings, Halvorsen and colleagues30 analyzed a densely affected pedigree (Nā€‰=ā€‰122) with Tourette syndrome, a neurodevelopmental disorder characterized by recurrent nonrhythmic tics that shows significant rG with obsessive-compulsive disorder (OCD) and attention deficit hyperactivity disorder (ADHD)31,32. While a significantly increased PRS for Tourette syndrome were observed in cases compared to controls, no significantly increased cross-disorder PRS for OCD or ADHD were observed. This lack of cross-disorder PRS loading in families with Tourette syndrome could reflect the low power of OCD33 and ADHD34 GWAS. Alternatively, there may be important differences in the genetic architecture of neurodevelopmental disorders such as Tourette syndrome that differentiate their polygenic profile from major psychiatric disorders such as SCZ. Here, we show that similar to the studies noted above27,28,29, members of multiplex SCZ families analyzed here (Nā€‰=ā€‰1005) also have an increased cross-disorder polygenic risk for correlated psychiatric disorders. Furthermore, our PRS profiling in the full sample of multiplex SCZ families, in combination with pTDT results, provides empirical genetic evidence that the source of increased cross-disorder polygenic risk for BIP and MDD in multiplex SCZ families is attributable to the portion of the genetic risk that BIP or MDD share with SCZ. Despite distinct manifestations of psychotic and affective illnesses, these two separate axes of mental illness share significant genetic portions and our results offer new insights into the nature of the elevated risk for affective disorders such as BIP and MDD in multiplex SCZ families.

The analyses presented in this study should be interpreted in the context of some limitations. Although GWAS-by-subtraction can be extended from bivariate to multivariate models, we opted to use two separate bivariate models. This is due to our specific hypothesis about BIP and MDD as two distinct disorders with varying degrees of affective and psychotic features in their symptomatology. Additionally, caution is warranted due to the sample size and power limitations of current GWAS in psychiatric disorders. For example, if we extend our bivariate models to a single multivariate model and subtract out both BIP and MDD signals from SCZ in a single model, we may be left with inadequate signals for the affective factors since a large portion of signals would be subtracted out due to strong rG among these three disorders. As the sample size and power of GWAS for psychiatric disorders increases, future work could extend the bivariate models to multivariate models in order to empirically test for this.

Testing the predictive power of SCZ and Affective factors in independent SCZ, BIP, and MDD cohorts could provide further genetic evidence in support of the derived factors from genomicSEM models. While we did not have access to independent BIP and MDD cohorts, we were able to show that SCZ factors underlying BIP and MDD can significantly distinguish between SCZ cases and controls from the ISGC cohort. Future work could also test the predictive power of SCZ and Affective factors in independent BIP and MDD cohorts.

Furthermore, some case definitions in the ISHDSF sample (e.g the broad), have a lower number of subjects, which may potentially bias some of the results due to lower power. We addressed this potential issue by repeating the PRS analyses by grouping the subjects in a concentric manner as described in the ISHDSF publication7. The concentric comparison versus population controls shows similar patterns of PRS enrichment, indicating that lower numbers of subjects in some of the diagnostic categories in the ISHDSF sample are unlikely to be a source of bias for the central findings (Supplementary Table 4). Given that environmental factors have not been assessed here, future analyses could also integrate environmental influences unique to the families to further elucidate the role of environmental factors on the elevated polygenic risk for BIP and MDD in multiplex SCZ families. The predictive power of current PRS methods is mostly limited to individuals of European ancestry. As cross-ancestry PRS methods become more sophisticated and samples of multiplex families from more diverse populations become available, we could replicate these findings in ancestrally diverse multiplex families. Additionally, PRS methods are limited to common risk variations in the genome which omits some important genetic risk factors in the genome. Despite sparse evidence for the involvement of rare risk variation in the genetic architecture of MDD35 and BIP36 at current sample sizes, copy number variants in the genome often show strong pleiotropy among psychiatric disorders37 and could influence the risk that is not captured by PRS profiling. Finally, due to the sample collection and genotyping strategies, not all the multiplex families have full parent-offspring genotypic or phenotypic information available. Therefore, pTDT analysis results should be interpreted with the caveat that 41 families out of the full sample of 257 had full parent-offspring and genotype information available.

In conclusion, in this study, we showed that in addition to the increased burden of common risk variation conferring risk to SCZ10, members of multiplex SCZ families studied here also have an increased polygenic vulnerability to BIP and MDD. However, this observation is likely to be largely attributable to part of the genetic risk that BIP or MDD share with SCZ due to their rG, rather than the affective portion of the genetic risk unique to these two disorders. Therefore, these results suggest that a complete interpretation of elevated cross-disorder PRS across correlated psychiatric disorders in multiplex families requires consideration of the relative contribution of the shared and unique genetic factors to account for the known genetic correlations across psychiatric disorders.

Methods

Sample description

Irish Study of High-Density Schizophrenia Families

Fieldwork for the ISHDSF sample was carried out between 1987 and 1992, with probands ascertained from public psychiatric hospitals in the Republic of Ireland and Northern Ireland7. Selection criteria were two or more first-degree relatives meeting DSM-III-R criteria for SCZ or poor-outcome schizoaffective disorder, with all four grandparents being born in either the Republic of Ireland or the United Kingdom. Relatives of the probands suspected of a psychotic illness were interviewed by trained psychiatrists, and trained social workers interviewed other relatives of the probands. To avoid bias and detect possible mistakes in diagnosis, independent review of all diagnostic information was made blind to family assignments by two trained psychiatrists, with each psychiatrist making up to 3 best estimate DSM-III-R diagnoses, with high agreement (weighted kā€‰=ā€‰0.94ā€‰+ā€‰- 0.05). Full consent from all the participants and approval from local ethics committees in the Republic of Ireland and Northern Ireland, as well as Virginia Commonwealth University Institutional Review Board were acquired and archived and can be provided upon request.

Irish Schizophrenia Genomics Consortium (ISGC)

The ISGC sample was assembled for a GWAS of SCZ in Ireland. Details of recruitment, screening, and quality control (QC) of controls are described elsewhere38. Cases were recruited through community mental health services and inpatient units in the Republic of Ireland and Northern Ireland using DSM-III-R or DSM-IV following protocols with local ethics approval. Cases were further screened to exclude substance-included psychotic disorders. Controls were blood donors from the Irish blood Transfusion Service assembled as part of the Irish Biobank in the Republic of Ireland. All individuals reported all four grandparents were born in either Ireland or the United Kingdom, with no reported history of psychotic illness, and full consent was received from all the participants. Due to the relatively low lifetime prevalence of SCZ in the general population (~1%), misclassification of controls should have a minimum impact on power39.

Genotyping and imputation

Information on genotyping and imputation is provided elsewhere10. Briefly, genotyping was carried out on 3 different arrays: 830 ISHDSF subjects were genotyped on the Illumina 610-Quad Array by Illumina, 175 ISHDSF subjects were later genotyped on the Infinium psychArray V.1.13 Array (psychArray) at Mount Sinai, and 1730 population controls were genotyped using the Affymetrix V.6.0 Array either at the Broad Institute or by Affymetrix. An additional 475 population controls that either did not pass the QC or were not included in the Affymetrix Array study were later genotyped on the psychArray along with the additional ISHDSF subjects at Mount Sinai. The same QC protocols were applied to all three genotype datasets. In brief, exclusion criteria for samples were a call rate of <95%, more than one Mendelian error in the ISHDSF sample, and a difference between reported and genotypic sex. Exclusion criteria for SNPs were MAFā€‰<ā€‰1%, call rate <98%, and pā€‰<ā€‰0.0001 for deviation from Hardy-Weinberg expectation. The final ISHDSF sample includes 1,005 subjects from 257 pedigrees whose SNP data from the Illumina Array and the psychArray passed all the QC filters. The final population control sample includes 2205 subjects whose SNP data from the Affymetrix and the psychArray passed all the QC filters.

Genotypes passing QC were phased using Eagle V.2.4 and imputed to the Haplotype Reference Consortium (HRC) reference panel using Minimac4 on the Michigan Imputation Server40. The HRC panel includes 64,975 samples that are predominantly of European ancestry, making it suitable for imputation of our sample from the island of Ireland41. Imputed genotype dosages were extracted and used for all downstream analyses. As part of the post-imputation QC, variants with MAFā€‰<ā€‰1% were excluded from the initial merging. After all standard QC measurements, 9,298,012 SNPs on the Illumina Array, 11,080,279 SNPs on the Affymetrix Array, and 11,081,999 SNPs on the psychArray remained for analysis. Of these three sets of SNPs, 9,008,825 SNPs were shared across all three imputed arrays and were used for downstream analyses. The imputation quality score for the shared SNPs used for PRS construction and downstream analyses was high across all three genotyping platforms (INFO ā‰„ā€‰0.96 for all). More information on genotyping, imputation, and QC measures is provided in the Supplementary Materials, Supplementary Figs. 2ā€“5, and Supplementary Tables 5 and 6.

GWAS-by-subtraction

We performed GWAS-by-subtraction using the genomicSEM framework by analyzing summary statistics data for SCZ, BIP, and MDD. Leave-N-out SCZ GWAS excluding all Irish participants, (Nā€‰=ā€‰156,509)5, BIP (Nā€‰=ā€‰413,466)4, and MDD (Nā€‰=ā€‰500,199)17 summary statistics were acquired and the genetic covariance between them was estimated using LD Score Regression42. SNPs were filtered for MAFā€‰<ā€‰0.01 and INFOā€‰<ā€‰0.8, and only SNPs that are present in both SCZ and BIP GWASs, or SCZ and MDD GWASs were used to generate the factors from GWAS-by-subtraction models. This left us with 6,361,243 SNPs for BIP, and 6,599,052 SNPs for MDD. We then used the post-QC summary statistics by first regressing them on two latent factors that we called SCZ factor and Affective factor underlying BIP or MDD. Next, we regressed SCZ factor and Affective factor on each SNP from the summary statistics that passed QC measurements as described above, which allowed for two separate paths of association with BIP or MDD for each SNP: (1) a path that is fully mediated by SCZ factor, and (2) a path that is fully independent of SCZ factor, called Affective factor. Therefore, SCZ factors in BIP or MDD capture part of the genetic risk that each of these two disorders shares with SCZ due to their rG, whereas Affective factors in BIP or MDD capture the affective portion of the genetic risk that is unique to BIP or MDD and not shared with SCZ. The models assume that genetic effects on SCZ are also impacting BIP or MDD to some degree given that both BIP and MDD have a strong genetic correlation with SCZ. The path diagrams for the Cholesky decomposition used to disentangle the polygenic signals are provided in Supplementary Fig. 6. More details on the GWAS-by-subtraction analysis, path estimates, and the formula used to estimate the effective sample size are provided in Supplementary Table 7 and Supplementary Materials.

Construction of PRS

Summary statistics for BIP (Nā€‰=ā€‰413,466)4, MDD (Nā€‰=ā€‰500,199)17, SCZ factor in BIP (Neffā€‰=ā€‰146,420), Affective factor in BIP (Neffā€‰=ā€‰310,018), SCZ factor in MDD (Neffā€‰=ā€‰147,014), and Affective factor in MDD (Neffā€‰=ā€‰458,356) were used by excluding variants with MAFā€‰<ā€‰1% or imputation quality score of < 0.9 and removing strand ambiguous and indel polymorphisms. We then constructed PRS using a Bayesian regression framework by placing a continuous shrinkage prior to SNP effect sizes using PRS-CS43. Based on current recommendations43 we used the phi value of 1e-2 for BIP and MDD due to their high polygenicity, whereas the ā€œautoā€ function was used to automatically learn the phi value for SNP weights for SCZ and Affective factors. PRS-CS uses linkage disequilibrium information from an external reference panel such as the 1000 Genomes European Phase 3 European sample44, to estimate the posterior effect sizes for each SNP. Although p-value thresholding and clumping method (P-T) have been traditionally used, PRS-CS shows substantial improvement in predictive power over P-T45.

Polygenic transmission disequilibrium test

We used pTDT14 in a subset of the ISHDSF sample (41 families) with full parent-offspring data to test for over-transmission of polygenic risks for BIP, MDD, and their unique and shared (with SCZ) underlying genetic factors from parents to probands. Additionally, to detect possible bias or systematic issues in the analyses, we also assessed the over-transmission of polygenic risks generated from PGC3-SCZ5 and LDL from the ENGAGE Consortium39 as positive and negative controls, respectively. Previous studies suggest that there is no genetic correlation or causal relationship between LDL and major psychiatric disorders, making LDL an appropriate comparison phenotype in which no inflation of psychiatric PRS would be expected46. Details of the PRS construction for PGC3-SCZ and LDL in the ISHDSF sample are provided elsewhere10 The pTDT deviation scores were generated for each multiplex family by subtracting the mean parental polygenic risks from the proband polygenic risks. The pTDT deviation scores were then standardized by dividing them by the cohort-specific mean parental polygenic risks standard deviations. To test whether the mean pTDT deviation was significantly greater than zero, representing an over-transmission of polygenic risks from parents to probands, a one-sided, one-sample t-test was employed.

Statistical analyses

Statistical analyses were carried out using a mixed-effects logistic regression model fitted by maximum likelihood using the GMMAT package in R47. To account for the high degree of relatedness among the subjects, genomic relationship matrix (GRM) was calculated using LDAK48 and included in the statistical models as a random effect. While none of the principal components (PCs) showed a significant association with genotype arrays or sites, in order to account for other possible batch and site effects, we also included genotyping platform and site as additional covariates, with more details provided in the Supplementary Materials.

Principal component analysis was consistent with all subjects in the sample having European ancestry (Supplementary Figs. 7ā€“9), but to account for fine-scale structure within the Irish population41 (Supplementary Fig. 10) the top 10 ancestry PCs were also included as covariates in the analysis. The final mixed models included GRM as a random effect, with the top 10 PCs, genotyping platform, genotyping site, and sex as fixed effect covariates. All statistical analyses followed the hypothesis that members of multiplex SCZ families and sporadic cases used as a replication cohort have a significantly higher PRS compared to population controls and all PRS underwent Z-score normalization. The results were adjusted for multiple testing correction using the Holm method. While less stringent than Bonferroni, the family-wise error rate for the Holm method is similar to Bonferroni49, making it suitable for multiple testing correction in modestly sized cohorts such as our sample.

Polygenic overlap analysis

We used MiXeR framework16 to quantify the polygenicity and the polygenic overlap of SCZ with BIP and MDD. MiXeR uses GWAS summary statistics to estimate the polygenicity of each phenotype and constructs a bivariate Gaussian mixture model to estimate the number of shared and unique variants that explains 90% of SNP heritability for each GWAS. We also estimated the heritability of the derived factors underlying BIP and MDD using LD Score Regression42 as shown in Supplementary Table 8.