Gender differences in CNV burden do not confound schizophrenia CNV associations


Compared with the general population, an excess of rare copy number variants (CNVs) has been identified in people with schizophrenia. Females with neurodevelopmental disorders and in the general population have been reported to carry more large, rare CNVs than males. Given that many schizophrenia datasets do not have equal gender ratios in cases and controls, sex differences in CNV burden might have impacted on estimates of case-related CNV burden and also on associations to individual loci. In a sample of 13,276 cases and 17,863 controls, we observed a small but significant excess of large (≥500 Kb), rare (<1%) CNVs in females compared with males in both cases and controls (OR = 1.17, P = 0.0012 for controls; OR = 1.11, P = 0.045 for cases). The burden of 11 schizophrenia-associated CNVs was significantly higher in female cases compared with male cases (OR = 1.38, P = 0.0055), but after accounting for the rates of CNVs in controls, we found no significant gender difference in the risk conferred by these loci. Controlling for gender had a negligible effect on the significance of association between specific CNVs and schizophrenia. The female excess of large CNVs in both cases and controls suggests a female protective mechanism exists for deleterious CNVs that may extend beyond neurodevelopmental phenotypes.


Schizophrenia has a strong genetic component involving rare and common alleles distributed across many genes1,2. Common alleles confer weak effects (OR < 1.1 in general) but collectively account for a substantial proportion of genetic liability to the disorder3. On the other hand, some rare genetic variants, such as copy number variants (CNVs), confer larger risks for schizophrenia4,5. A combined analysis of published datasets has shown strong evidence for association between 11 individual CNVs and risk of schizophrenia and weaker evidence for 4 additional risk CNVs6. Duplications of 22q11.2 have also been associated with reduced risk of schizophrenia7.

A gender-bias in prevalence has been repeatedly observed in neurodevelopmental disorders. In autism spectrum disorder (ASD), the mean male-to-female ratio is about 4:18, whereas in schizophrenia the male-to-female ratio is 1.4:1, though this varies by age reflecting sex related differences in age at onset9. Sex differences in schizophrenia have also been reported for structural and functional brain abnormalities whose origins are thought to occur during brain development10,11,12,13,14.

A recent study found an excess of deleterious autosomal CNVs in females compared with males in a neurodevelopmental disorders cohort and in an ASD cohort15. Females with ASD have also been found to possess a higher burden of de novo CNVs and genes disrupted by de novo CNVs than males with ASD16,17,18. The lower prevalence of ASD and higher mutational burden in females with the disorder is consistent with a different liability threshold for females compared to males whereby females require a greater risk factor load to manifest neurodevelopmental disorders15,19,20. A higher female autosomal burden of large, rare CNVs, attributed to lower rates of foetal loss, has also been reported in the general population21.

Given the excess burdens of CNVs in females in both neurodevelopmental and control samples, we tested whether similar differences in burden exist in schizophrenia using a large case control dataset (N = 31,139 individuals). We evaluated the CNV burden in females and males at a genome-wide level and for 11 strongly associated schizophrenia risk loci. We also tested whether association between specific CNVs and schizophrenia are robust to adjusting for gender. Consistent with earlier findings, we found an increased CNV burden for large (≥500 Kb), rare (<1%) CNVs in female controls and now report an excess in female cases with the disorder. Although female cases also had an excess burden compared with males for 11 CNVs that have been implicated in schizophrenia, the association between those CNVs and schizophrenia was not diminished after controlling for gender.


Females carry an excess of CNVs both genome-wide and at specific loci

Females had a genome-wide excess of large (≥500 kb) and rare (<1%) CNVs in cases (OR = 1.11, 95% CI = 1.00–1.23, P = 0.045) and controls (OR = 1.17, 95% CI = 1.06–1.29, P = 0.0012) (Fig. 1, Supplementary Table S1) although only in female controls would this survive correction (P = 0.0048) for 4 independent tests (CNVs < 500kb and CNV ≥ 500 kb separately for cases and controls). While the excess CNV burden in female cases does not survive correction for multiple testing, importantly the size of the effect is not significantly different from that in female controls (Z-test P = 0.24). The effect size for schizophrenia conferred by all large, rare CNVs in females (OR = 1.24, 95% CI = 1.12–1.38, P = 8.22 × 10−5) and in males (OR = 1.32, 95% CI = 1.20–1.45, P = 4.13 × 10−9) did not statistically differ (Z-test P = 0.19).

Figure 1

Gender CNV burden in the combined dataset.

CNV burden is compared between males and females for both case and control samples. The combined CLOZUK, MGS and ISC dataset consists of 9,172 male and 4,104 female case samples and 8,807 male and 9,056 female control samples.

When deletions and duplications were analysed separately (Supplementary Table S1), we found that the contribution from large deletions to the excess of CNVs in control females was significantly greater than that from large duplications (Z-test P = 0.013). In cases, no significant difference was observed between the contribution of large deletions and duplications to the CNV excess in females (Z-test P = 0.44, Supplementary Table S1). No difference in the burden of smaller CNVs (<500 kb) was observed between males and females for cases or controls (Fig. 1, Supplementary Table S1). A breakdown of CNV burden in the datasets that constitute our case and control samples can be found in Supplementary Table S2. Despite the excess burden of large deletions in female controls, we found no significant difference between males and females for the number of genes overlapping CNVs for any class of CNV tested (Supplementary Table S3).

The burden of 11 schizophrenia risk CNVs was significantly higher in female cases than male cases (OR = 1.38, 95% CI = 1.10–1.73, P = 0.0055) but not in female controls compared with male controls (OR = 1.12, 95% CI = 0.80–1.55, P = 0.52) (Fig. 1, Supplementary Table S1). However, the difference in effect size between these two tests did not significantly differ (Z-test P = 0.15). When we compared the risk for schizophrenia conferred by the set of 11 CNV loci in females (OR = 3.56, 95% CI = 2.66–4.75, P = 2.06 × 10−18) and males (OR = 2.85, 95% CI = 2.15–3.76, P = 3.66 × 10−15), we found no significant difference in effect size (Z test P = 0.14). Hence, the non-significant test by sex in the controls might reflect lower power given that (by definition) there are fewer observations of these pathogenic CNVs in controls.

Association of specific CNVs with schizophrenia is not confounded by gender

Table 1 shows association statistics between individual CNVs and schizophrenia. These CNVs include 11 known schizophrenia risk loci, 4 additional risk CNVs that have yet to be implicated in schizophrenia with strong evidence and 1 protective CNV (see methods for detail). Controlling for gender had a negligible effect on the significance of association between specific CNVs and schizophrenia. Table S4 presents a breakdown of these associations for the individual datasets (CLOZUK, ISC and MGS).

Table 1 Effect of gender stratification on schizophrenia CNV associations.


We have carried out an analysis of gender stratified CNV burden in schizophrenia at both a genome-wide level and for previously associated loci. We restricted our analysis to the autosome as sex chromosomal CNVs were not available for all samples. We found a genome-wide excess of large, rare CNVs in females compared with males, irrespective of disease status. Similar observations have been reported in ASD and general population cohorts15,21. We also report an excess burden of CNVs at 11 schizophrenia risk loci in female cases compared with male cases, an observation that is reminiscent of an excess of deleterious variants in females with childhood neurodevelopmental disorders15. However, when males and females were analysed separately, we found no difference in effect size for risk of schizophrenia conferred by the total burden of the 11 associated loci. Moreover, association of individual CNV loci was not affected when statistical tests were used that accounted for gender. These results confirm that associations between schizophrenia and 11 specific risk CNVs and the protective 22q11.2 duplication are robust to differences in gender proportion in case and control samples. The weaker evidence for 4 additional risk CNVs cannot be attributed to gender related heterogeneity. Interestingly, despite the excess number of CNVs in females, we found no significant difference by sex in the number of genes disrupted by CNVs, suggesting that on average, CNVs are less gene-dense in females than they are in males.

At a genome-wide level, an excess female CNV burden was only observed for large CNVs (≥500 kb). However, we cannot confidently exclude a similar effect from smaller CNVs given the limited resolution to which microarrays can accurately call CNVs. When deletions and duplications ≥500 Kb were analysed separately, we found that deletions primarily accounted for the excess genome-wide CNV burden seen in female controls.

Recent findings of an excess mutational burden in females provide support for the view that females are relatively protected from neurodevelopmental disorders15,17,18,19,22,23. Schizophrenia is now generally considered to be at least in part a neurodevelopmental disorder; it is known to share genetic risk alleles with autism and other neurodevelopmental disorders24,25 and there is also an excess risk of the disorder in males, albeit much more modest than it is for other neurodevelopmental disorders such as ID, ASD and ADHD. The excess burden of CNVs in females might be considered evidence for female robustness to schizophrenia but caution is required given the pattern of findings in controls. As we and others21 have shown, female controls also have higher rates of large, rare CNVs, which in general are the class of CNV that contribute to neurodevelopmental disorders. This elevation in rate of CNVs in both cases and controls could potentially point to a mechanism that goes beyond protection from disorders such as ASD and schizophrenia: for example, protection from female foetal loss or premature death (pre- or post-natal). However, to address this question requires truly representative population cohorts as the gender CNV bias in controls might be an artefact resulting from female protection from conditions (including neurodevelopmental disorders) that would normally result in exclusion from control cohorts.

It is clear that CNV burden differs between males and females in both patients diagnosed with neurodevelopmental disorders, such as schizophrenia and also in the general population. However, our data indicate that variation in gender CNV burden does not impact on known schizophrenia-CNV associations. We provide further evidence that in both case and control samples, the rate of deleterious CNVs is greater in females than in males, although investigating the reason for this will require large population cohorts.


Study samples and QC

Case and control CNVs were derived from three published samples: CLOZUK6, the ISC4 and the MGS26. A full description of these samples, the arrays they were genotyped on and CNV calling procedures can be found in the original publications4,6,26. Quality control was performed to remove low quality samples (see original publications) and details of those samples that passed quality control in each study can be found elsewhere6. We excluded samples of unknown gender (sex chromosome molecular genetic data were not available for all samples) and putative CNVs < 15 kb in size and/or covered by <15 probes. After filtering, a total of 31,139 individuals were included in the current study (17,959 from CLOZUK and its corresponding controls, 6,600 from MGS and 6,580 from the ISC, Table 2). All CNVs have a population frequency <1%.

Table 2 Female and male samples included in the schizophrenia datasets.

Selection of specific schizophrenia-associated CNVs

We tested the burden of 11 schizophrenia risk CNV loci that showed significant association with schizophrenia in our previous publication6; deletions at 1q21.1, NRXN1 (exonic CNVs only), 3q29, 15q11.2, 15q13.3 and 22q11.2; duplications at 1q21.1, 16p11.2 and 16p13.11, the Prader-Willi/Angelman syndrome (PWS/AS) region and Williams-Beuren syndrome (WBS) region. In our analysis of individual CNV loci, we tested these 11 schizophrenia risk CNVs, along with previously implicated risk CNVs for which the overall evidence is less strong6, postulating some of the variable evidence here might be due to gender related heterogeneity. These included duplications of VIPR2, deletions of 17p12, 17q12 and distal 16p11.2. We also tested the impact of gender on protective effects for 22q11.2 duplication7.

Statistical analysis

We tested for CNV burden (≥500 kb; <500 kb) and burden of 11 schizophrenia risk loci in females versus males by comparing the change in deviance between the following logistic regression models (R glm function, family = binomial(“logit”)) using a two-sided test (ANOVA). Odds ratios (ORs) were derived from the correlation coefficients for number of CNVs.

  1. 1

     logit (pr (gender)) ~ number of CNVs + study source + microarray platform

  2. 2

     logit (pr (gender)) ~ study source + microarray platform

To test whether females had more genes disrupted by CNVs than males, a similar analysis was performed where ‘number of CNVs’ in the above model was replaced with ‘number of genes overlapping CNVs’.

To test whether the risk of schizophrenia conferred by large CNVs (≥500 kb) or the set of 11 specific schizophrenia CNVs is significantly different for females and males, we compared the effect size in females with the effect size in males using the coefficients and standard errors (Z-test, see below) derived from the following logistic regression model (R glm function, family = binomial(“logit”)):

logit (pr(case)) ~ number of schizophrenia CNVs + study source + microarray platform

We note that almost identical P values were obtained when the above model was replaced with Firth’s procedure27 (data not shown).

Differences in effect size between tests were evaluated with a Z-test, which was constructed using the equation (1)

where β is the coefficient for the variable of interest taken from the regression model and σ is that variable’s standard error. P values were generated by comparing the absolute of the Z statistic against the cumulative distribution function of the standard normal distribution.

To assess whether a difference in gender proportion between cases and controls confounds association between schizophrenia and specific CNVs, we calculated individual locus association statistics using a Fisher’s Exact test (two-sided) and a Cochran-Mantel-Haenszel Exact test (two-sided) that was stratified by gender.

All statistical tests were conducted using R Statistical Software (

Additional Information

How to cite this article: Han, J. et al. Gender differences in CNV burden do not confound schizophrenia CNV associations. Sci. Rep. 6, 25986; doi: 10.1038/srep25986 (2016).


  1. Sullivan, P. F., Daly, M. J. & O’Donovan, M. Genetic architectures of psychiatric disorders: the emerging picture and its implications. Nat Rev Genet 13, 537–551 (2012).

  2. Purcell, S. M. et al. A polygenic burden of rare disruptive mutations in schizophrenia. Nature 506, 185–190 (2014).

  3. Schizophrenia Working Group of the Psychiatric Genomics Consortium. Biological insights from 108 schizophrenia-associated genetic loci. Nature 511, 421–427 (2014).

  4. International Schizophrenia Consortium (ISC). Rare chromosomal deletions and duplications increase risk of schizophrenia. Nature 455, 237–241 (2008).

  5. Stefansson, H. et al. Large recurrent microdeletions associated with schizophrenia. Nature 455, 232–236 (2008).

  6. Rees, E. et al. Analysis of copy number variations at 15 schizophrenia-associated loci. Br J Psychiatry 204, 108–114 (2014).

  7. Rees, E. et al. Evidence that duplications of 22q11. 2 protect against schizophrenia. Mol Psychiatry 19, 37–40 (2014).

  8. Fombonne, E. Epidemiology of pervasive developmental disorders. Pediatr Res 65, 591–598 (2009).

  9. McGrath, J., Saha, S., Chant, D. & Welham, J. Schizophrenia: a concise overview of incidence, prevalence and mortality. Epidemiol Rev 30, 67–76 (2008).

  10. Elsabagh, S., Premkumar, P., Anilkumar, A. P. & Kumari, V. A longer duration of schizophrenic illness has sex-specific associations within the working memory neural network in schizophrenia. Behav Brain Res 201, 41–47 (2009).

  11. Abbs, B. et al. Covariance modeling of MRI brain volumes in memory circuitry in schizophrenia: sex differences are critical. Neuroimage 56, 1865–1874 (2011).

  12. Goldstein, J. M. et al. Hypothalamic abnormalities in schizophrenia: sex effects and genetic vulnerability. Biol Psychiatry 61, 935–945 (2007).

  13. Goldstein, J. M. et al. Impact of normal sexual dimorphisms on sex differences in structural brain abnormalities in schizophrenia assessed by magnetic resonance imaging. Arch Gen Psychiatry 59, 154–164 (2002).

  14. Gur, R. E. et al. A sexually dimorphic ratio of orbitofrontal to amygdala volume is altered in schizophrenia. Biol Psychiatry 55, 512–517 (2004).

  15. Jacquemont, S. et al. A higher mutational burden in females supports a “female protective model” in neurodevelopmental disorders. Am J Hum Genet 94, 415–425 (2014).

  16. Gilman, S. R. et al. Rare de novo variants associated with autism implicate a large functional network of genes involved in formation and function of synapses. Neuron 70, 898–907 (2011).

  17. Levy, D. et al. Rare de novo and transmitted copy-number variation in autistic spectrum disorders. Neuron 70, 886–897 (2011).

  18. Sanders, S. J. et al. Insights into Autism Spectrum Disorder Genomic Architecture and Biology from 71 Risk Loci. Neuron 87, 1215–1233 (2015).

  19. Robinson, E. B., Lichtenstein, P., Anckarsäter, H., Happé, F. & Ronald, A. Examining and interpreting the female protective effect against autistic behavior. Proc Natl Acad Sci USA 110, 5258–5262 (2013).

  20. Martin, J., Hamshere, M. L., Stergiakouli, E., O’Donovan, M. C. & Thapar, A. Genetic risk for attention-deficit/hyperactivity disorder contributes to neurodevelopmental traits in the general population. Biol Psychiatry 76, 664–671 (2014).

  21. Desachy, G. et al. Increased female autosomal burden of rare copy number variants in human populations and in autism families. Mol Psychiatry 20, 170–175 (2015).

  22. Sanders, S. J. et al. Multiple recurrent de novo CNVs, including duplications of the 7q11. 23 Williams syndrome region, are strongly associated with autism. Neuron 70, 863–885 (2011).

  23. Zhao, X. et al. A unified genetic theory for sporadic and inherited autism. Proc Natl Acad Sci USA 104, 12831–12836 (2007).

  24. Fromer, M. et al. De novo mutations in schizophrenia implicate synaptic networks. Nature 506, 179–184 (2014).

  25. Malhotra, D. & Sebat, J. CNVs: harbingers of a rare variant revolution in psychiatric genetics. Cell 148, 1223–1241 (2012).

  26. Levinson, D. F. et al. Copy number variants in schizophrenia: confirmation of five previous findings and new evidence for 3q29 microdeletions and VIPR2 duplications. Am J Psychiatry, 168, 302–316 (2011).

  27. Firth, D. Bias reduction of maximum likelihood estimates. Biometrika, 80, 27–38 (1993).

Download references


The work at Cardiff University was funded by Medical Research Council (MRC) Centre (MR/L010305/1) and Program Grants (G0800509), the European Community’s Seventh Framework Programme (HEALTH-F2-2010-241909 (Project EU-GEI), a Clinical Research Fellowship to J.T.R.W. from the MRC/Welsh Assembly Government and the Margaret Temple Award from the British Medical Association. Genotyping of the CLOZUK cases was performed at the Broad Institute and funded by a philanthropic gift to the Stanley Center for Psychiatric Research. Funding to pay the Open Access publication charges for this article was provided by the MRC UK and the Wellcome Trust. We would like to acknowledge the contribution of data from outside sources: (i) Genetic Architecture of Smoking and Smoking Cessation accessed through dbGAP: Study Accession: phs000404.v1.p1. Funding support for genotyping, which was performed at the Center for Inherited Disease Research (CIDR), was provided by 1 × 01 HG005274-01. CIDR is fully funded through a federal contract from the National Institutes of Health to The Johns Hopkins University, contract number HHSN268200782096C. Assistance with genotype cleaning, as well as with general study coordination, was provided by the Gene Environment Association Studies (GENEVA) Coordinating Center (U01 HG004446). Funding support for collection of datasets and samples was provided by the Collaborative Genetic Study of Nicotine Dependence (COGEND; P01 CA089392) and the University of Wisconsin Transdisciplinary Tobacco Use Research Center (P50 DA019706, P50 CA084724). (ii). High-Density SNP Association Analysis of Melanoma: Case–Control and Outcomes Investigation, dbGaP Study Accession: phs000187.v1.p1. Research support to collect data and develop an application to support this project was provided by 3P50CA093459, 5P50CA097007, 5R01ES011740 and 5R01CA133996. (iii) Genetic Epidemiology of Refractive Error in the KORA Study, dbGaP Study Accession: phs000303.v1.p1. Principal investigators: Dwight Stambolian, University of Pennsylvania, Philadelphia, PA, USA; H. Erich Wichmann, Institut für Humangenetik, Helmholtz-Zentrum München, Germany, National Eye Institute, National Institutes of Health, Bethesda, MD, USA. Funded by R01 EY020483, National Institutes of Health, Bethesda, MD, USA. (iv) WTCCC2 study: Samples were downloaded from and include samples from the National Blood Donors Cohort, EGAD00000000024 and samples from the 1958 British Birth Cohort, EGAD00000000022. Funding for these projects was provided by the Wellcome Trust Case Control Consortium 2 project (085475/B/08/Z and 085475/Z/08/Z), the Wellcome Trust (072894/Z/03/Z, 090532/Z/09/Z and 075491/Z/04/B) and NIMH grants (MH 41953 and MH083094). (v) Molecular Genetics of Schizophrenia (MGS) study, Funding support for the Genome-Wide Association of Schizophrenia Study was provided by the National Institute of Mental Health (R01 MH67257, R01 MH59588, R01 MH59571, R01 MH59565, R01 MH59587, R01 MH60870, R01 MH59566, R01 MH59586, R01 MH61675, R01 MH60879, R01 MH81800, U01 MH46276, U01 MH46289 U01 MH46318, U01 MH79469 and U01 MH79470) and the genotyping of samples was provided through the Genetic Association Information Network (GAIN). The datasets used for the analyses described in this manuscript were obtained from the database of Genotypes and Phenotypes (dbGaP) found at through dbGaP accession numbers phs000021.v3.p2 and phs000167.v1.p1. Samples and associated phenotype data for the Genome-Wide Association of Schizophrenia Study were provided by the Molecular Genetics of Schizophrenia Collaboration (PI: Pablo V. Gejman, Evanston Northwestern Healthcare (ENH) and Northwestern University, Evanston, IL, USA).

Author information

M.J.O., M.C.O.D. and E.R. conceived and designed the study. J.H. and E.R. performed the analysis. J.H., E.R. and M.C.O.D. wrote the main manuscript text. E.R., M.J.O., M.C.O.D., J.H., J.T.R.W., G.K., A.P., V.E.-P. and P.H. reviewed the results and discussed and revised the manuscript.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Electronic supplementary material

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Han, J., Walters, J., Kirov, G. et al. Gender differences in CNV burden do not confound schizophrenia CNV associations. Sci Rep 6, 25986 (2016).

Download citation

Further reading


By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.