Spatial fine-mapping for gene-by-environment effects identifies risk hot spots for schizophrenia

Fan, Chun Chieh; McGrath, John J.; Appadurai, Vivek; Buil, Alfonso; Gandal, Michael J.; Schork, Andrew J.; Mortensen, Preben Bo; Agerbo, Esben; Geschwind, Sandy A.; Geschwind, Daniel; Werge, Thomas; Thompson, Wesley K.; Pedersen, Carsten Bøcker

doi:10.1038/s41467-018-07708-7

Download PDF

Article
Open access
Published: 13 December 2018

Spatial fine-mapping for gene-by-environment effects identifies risk hot spots for schizophrenia

Nature Communications volume 9, Article number: 5296 (2018) Cite this article

2853 Accesses
15 Citations
8 Altmetric
Metrics details

Subjects

Abstract

Spatial mapping is a promising strategy to investigate the mechanisms underlying the incidence of psychosis. We analyzed a case-cohort study (n = 24,028), drawn from the 1.47 million Danish persons born between 1981 and 2005, using a novel framework for decomposing the geospatial risk for schizophrenia based on locale of upbringing and polygenic scores. Upbringing in a high environmental risk locale increases the risk for schizophrenia by 122%. Individuals living in a high gene-by-environmental risk locale have a 78% increased risk compared to those who have the same genetic liability but live in a low-risk locale. Effects of specific locales vary substantially within the most densely populated city of Denmark, with hazard ratios ranging from 0.26 to 9.26 for environment and from 0.20 to 5.95 for gene-by-environment. These findings indicate the critical synergism of gene and environment on the etiology of schizophrenia and demonstrate the potential of incorporating geolocation in genetic studies.

Dissecting schizophrenia phenotypic variation: the contribution of genetic variation, environmental exposures, and gene–environment interactions

Article Open access 10 May 2022

Expanding the environmental scope: an environment-wide association study for mental well-being

Article Open access 14 June 2021

Gene–environment correlations across geographic regions affect genome-wide association studies

Article Open access 22 August 2022

Introduction

For public mental health, it is critical to know which environmental factors can be modified to mitigate the risk of psychiatric disorders. However, identifying modifiable environmental factors has been a contentious issue^1,2,3, especially when the effects may depend on one’s genetic liability for illness. Take as an example one of the best-established environmental risks for schizophrenia, childhood upbringing in an urban area. Persons born and raised in urban areas have an approximately twofold increased risk of schizophrenia compared to those born and raised in rural areas^4,5. Researchers have examined potentially causal elements of urban upbringing, such as accessibility to health care^4,6, selective migration of individuals^7,8, air-pollution⁹, infections¹⁰, and socioeconomic inequality^11,12,13. Yet none of these factors have substantially explained the risk associated with urbanicity^4,6,9,14, nor are they highly correlated with instruments used in defining urbanicity, such as population density¹⁵. The conditional relationships between genetic liabilities and putative environmental factors are even harder to detect despite some cohort studies suggesting an interaction between urban upbringing and family history of schizophrenia^{16,17,18,19,20}.

The difficulty in isolating specific environmental risk elements underlying urbanicity effects on schizophrenia incidence exemplifies a serious methodological challenge. The process for discovering environmental risk factors typically relies on a hypothesis-driven “candidate environmental factor” approach. Researchers need to formulate a carefully constructed environmental hypothesis, measure it, and then determine if it associates with risk of the disease. Analyses is usually performed in a study of selected participants not necessarily representative of the entire population of interest. Similar to the candidate gene approach before the dawning of genome-wide association studies (GWAS)²¹, the candidate environment approach suffers from the “spotlight effect”, ignoring the likely complexity of many environmental factors interacting with each other and with genetic liabilities to determine overall risk for illness. The environmental impact can even be a joint holistic effects from multiple environmental factors³. Measurement of the specific environmental factor may also be imprecise, masking its relationship to the illness. For example, many instruments have been devised to characterize socioeconomic inequality, yet have not shown consistent effects on incidence of schizophrenia. Given the complexity of real-life socioeconomic forces, lack of association with schizophrenia could be caused by instrument measurement error or because the instrument does not capture the relevant social-economic factors^11,12.

An alternative to the candidate environment approach is to assess spatial patterns of disease risk without directly measuring environmental factors. As with John Snow isolating the environmental source of cholera outbreak via mapping the cases²², identifying spatially localized disease “hot spots” can assist in the discovery of latent environmental factors. Advanced methods for disease mapping have been developed within the field of geostatistics, particularly in applying spatial random effect models to infer latent environmental variation in causal risk factors²³. As the urbanicity-related increase in risk for schizophrenia was first noted through spatial clustering of disease incidence²⁴, inferring risk hot spots to a finer resolution may provide insight into potential risk-modulating environmental elements before investing substantial resources in active measurement.

With this concept in mind, we develop a disease mapping strategy to address the need for discovering environmental factors without direct measurement. We use spatial random effects to map the geographic distribution of genetic liabilities (G), locale of upbringing (E), and their synergistic effects (GxE) on disease risk. By treating E and GxE as “latent random fields” on the map of Denmark, we avoid methodological issues inherent in the candidate environment approach. Although several studies have utilized random effect models to examine spatially localized risk for schizophrenia^15,25,26,27, our method differs by utilizing spatial fine-mapping and enabling the partition of risk into E and GxE components without the need for candidate environmental factors.

As a proof of concept, we examine geospatial variation in schizophrenia risk across Denmark. To do so, we apply this novel analytical approach to data from a population-based case-cohort study that includes subject genotyping and detailed residential information from birth up to age 7 years. We are thus able to assess locale of upbringing effects on schizophrenia risk with a resolution beyond conventionally defined levels of urbanicity, allowing us to assess variation in spatial risk, and to ask whether spatially localized environmental factors modulate genetic liability of risk for schizophrenia.

Results

Spatial distribution of overall risk of schizophrenia

We utilize the entire population cohort of iPSYCH, excluding cases, to derive locales. The resulting map contains 186 non-overlapping locales, with the number of cohort members ranging from 65 to 197 individuals in each locale (median = 105). Figure 1 displays the risk ratio (RR) from the Mantel-Haenszel analyses. With the exception of the southwestern portion of Denmark, the majority of rural regions have lower risk ratios while high-risk locales are concentrated in large cities (Fig. 1a). By plotting RR’s against the size of each locale, Fig. 1b demonstrates a general trend for spatial risks of schizophrenia, meaning locales with higher population density tend to have higher RR’s. Thus, the risk distribution recapitulates the known urbanicity effects. However, there is substantial variation in risk even controlling for locale size; for example, RR’s can range from protective to highly detrimental within densely populated areas (Fig. 1b).

The contribution of the E and GxE

Table 1 shows the estimations from multilevel models. Compared to rural regions, being born and living in densely populated urban area increases the risk of schizophrenia by (hazard ratio = 1.89, 95% CI: 1.53–2.33), which replicates previous studies on urbanicity effects^4,5. The inclusion of spatial random effects (E) reduces the urbanicity effect to hazard ratio = 1.64 with confidence interval encompassing 1. Model 3 with both E and GxE effects significantly contributes explanatory power to the variation in risk for schizophrenia (Log-likelihood ratio tests p < 2 × 10⁻¹⁶), while the urbanicity effect is further reduced (hazard ratio = 1.46). Due to the concerns of residual confounds from interaction effects, Model 3 contains full pairwise interaction terms of fixed-effect covariates included in the model, i.e., PRS, genetic principal components, gender, and family history¹. Median hazard ratios for E and GxE components, defined as the median absolute difference in hazard ratios for all possible combinations of pairs of locales²⁸, are 2.22 and 1.78, respectively, representing a 122 and 78% expected change in risk if living in a high-risk locale.

Table 1 Hazard ratio estimates from three nested Cox regression models of the iPSYCH case-cohort data

Full size table

Spatial distribution of the risk components of schizophrenia

The geographical distribution of E and GxE are shown in Fig. 2. The E component mirrors the heightened risk in the southwestern part of the Denmark (Fig. 2a) and the southern portion of Copenhagen, the metropolitan area with highest population density (Fig. 2b). However, within the city boundary, hazard ratios vary strongly from protective to highly detrimental (hazard ratio: 0.26 to 9.26, Fig. 2c). The GxE component has a different spatial pattern compared to E (Fig. 2d). Within the metropolitan boundary, high-risk GxE locales are concentrated in the city center (Fig. 2e) and the modulating effect can range from a decrease of risk of 80% to a sixfold increase (hazard ratios: 0.20 to 5.95, Fig. 2f).

Discussion

Our novel spatial mapping analysis strategy transforms the "candidate environment” approach for disease risk into a search for environmental hot spots, localizing where environmental factors appear to have a strong impact. The flexibility of this approach enables the estimation of the amount variance accounted for by E and GxE effects without direct measurement of environmental risk factors. Both simulations and empirical application demonstrate the utility of this strategy as an alternative to the candidate environment approach.

Applying this strategy to nationwide, population-based longitudinal data enriched with genetic information, we recapitulate the well-known urban-rural gradient in schizophrenia risk based on the residential information alone. Furthermore, we show that locale of upbringing significantly contributes to the risk for schizophrenia even after controlling for population density. Both E and GxE spatial effects demonstrate substantial variation within city boundaries and account for a higher proportion of schizophrenia risk than simple urban-rural contrasts. In terms of schizophrenia risk, results indicate that the locale an individual was born and raised in is more important than urban-rural differences per se, even within the confines of a single city. Our patterns of E and GxE across Denmark can be regarded as reference distribution. The partitioned risk contour serves as an initial guide to find the true risk element. Further comparisons with putative environmental factors can reveal the underlying elements that are highly relevant for the etiology of schizophrenia.

As a proof of concept study, our current analysis is not without limitations. First, the average age of the iPSYCH case-cohort is younger than the expected incidence peak of schizophrenia. Although the age range of our cohort is 8–32 years, encompassing the incidence peak of schizophrenia, some cohort members are still at risk for schizophrenia. Right-censoring among cohort members reduces the power of statistical analyses. However, by analyzing the case-cohort with age-adjusted RR’s and survival analyses with inverse probability of sampling weights, we obtain unbiased estimates of incidence proportions. Second, our case-cohort is relatively young, while existing GWAS of schizophrenia tend to recruit more chronic patients in middle age²⁹. Thus, the PRS we used may be biased toward older patients, reducing the predictive power of the already weak biological instrument. Third, the diagnostic uncertainty of very early-onset schizophrenia (onset age lesser than 13-years-old) can impact observed associations. However, a recent validation study of schizophrenia diagnoses using the Danish registry has shown good reliability in both early-onset (age 13 years to 18 years) and very early-onset (age < 13 years) schizophrenia, with diagnostic concordance greater than 82 percent³⁰. Another concern with the relatively young age of the iPSYCH sample is the inclusion of cohort members younger than 10-years-old who have very low-risk of being diagnosed as schizophrenia. These subjects are handled in the Cox proportional hazards model by treating their potential future diagnoses as right-censored outcomes, and hence have little impact on the model outputs. To verify this, we performed a sensitivity analysis on Model 3. We removed anyone younger than age 10 at study end and re-ran Model 3. As expected, the results are almost identical, with the E component on-average increasing risk by 127 percent (originally 122 percent) and GxE component on-average increasing the risk by 77 percent (originally 78 percent). Fourth, as shown in our simulations, the size of the GxE effect depends upon the predictive accuracy of the G effect. Because the PRS is a weak instrument of G, the true size of the GxE effect is probably several times larger than our current estimate, as suggested by our simulations. Fifth, we did not examine the impact of migration on locale effects. Since we cannot differentiate GxE from the gene by environment correlation introduced by migration, we restricted our analyses to individuals who have Danish parents and defined the locales as the place of birth. Although by this we intended to reduce the influence of migration, migration itself can be an important contributor for spatially-embedded risk⁸, as many migrants tend to live in clusters, especially in urban areas. A recent study on community samples across several countries shown that individuals with higher genetic risks of schizophrenia tend to migrate to urban area⁸. However, the spatial patterns we observe are unlikely due to the confounding effects of within generational drift⁴ since locale of upbringing was assessed before age 7, at which age no one had yet been diagnosed with schizophrenia. Inter-generational drift might still cause spatial aggregation of individuals with high genetic liabilities. A Swedish family-based study suggested urbanicity effects on schizophrenia can be explained by familial aggregation of risk¹³. Nevertheless, familial risk might not be the result of genetic liability but shared environmental risks within families. Danish registry studies using a cohort independent of our sample showed no evident urban aggregation of polygenic risk²⁰, and the polygenic risk scores associated with incidence of schizophrenia independent of family history³¹. Therefore, there is little evidence to suggest that the identified spatial patterns is driven by inter-generational drift of families with high genetic liability for schizophrenia. Finally, we did not investigate a variety of possible socioeconomic factors in our current analyses. The potential importance such factors mandates in-depth examination in the future research; however, obtaining, validating, and analyzing socioeconomic variables as potential candidate environmental factors in the iPSYCH sample needs to be handled carefully and is beyond the scope of current paper.

Despite these caveats, we demonstrate that locale effects and modulating effects of locale on genetic risk account for a substantial proportion of urbanicity effects in Demark. Living in a locale with a high E component increases the risk for schizophrenia by as much as 122 percent, independent of genetic liability and family history. Meanwhile, living in a locale with a high GxE component can increase risk due to genetic liability for schizophrenia by as much as 78 percent. Because our results demonstrate risk variation with finer resolution and stronger effects than urban-rural demarcation, there must be specific factors underlying previously observed urban effects. However, identification of factors explicating urban risk has been unsuccessful to date^4,5,6,7. Given the uncertainty involved, invalid constructs or measurement error could be contributors to low power to detect risk associations with specific environmental factors. Our spatial mapping strategy is an alternative approach, since finding high-risk locales does not depend on correct specification of a purported environmental risk factor.

In the nineteenth century, epidemiology pioneer John Snow mapped high-density regions of cholera cases onto London streets and thus identified the water source as the key infectious medium. By demonstrating that the locale of upbringing significantly contributes to risk and modulates genetic susceptibility to schizophrenia, we hope this is the first step in isolating the source of spatial risk variation, facilitating the design of future public health interventions for severe mental disorders.

Methods

Our spatial mapping approach follows three steps: (1) defining neighboring locales to characterize the latent environment field, (2) estimating random effects associated with each locale, and (3) mapping the spatial distribution based on the realized effects on locales. These three steps are calibrated to ensure a good balance between fine spatial resolution and adequate statistical power. Furthermore, the modeling strategy partitions observed effects on risk for schizophrenia into different components: locale of upbringing (E), genetics (G), and the synergistic effects of spatial locale and genetics (GxE).

Defining locales for risk mapping

We exploit the duality between Delaunay triangulation and Voronoi tessellation³², ensuring each defined locale has a sufficient number of study subjects to be well-powered while achieving a fine spatial resolution (Supplemental Information). The Voronoi tessellation partitions the whole map into smaller units based on individuals’ coordinates on the map, making sure every point in a given unit area is closer to its centroid than any other. Their neighborhood relationships are defined simultaneously because the centroids are connected by the dual of Voronoi tessellation, i.e., Delaunay triangulation. After defining neighborhood relationships, individuals are grouped with their closest neighbors, making the locale growing in size, until the number of individuals in the defined locale reaches a pre-defined range (Supplementary Fig. 1 and Supplemental Information). The algorithm thus achieves a balance between spatial resolution and a sufficient number of subjects in each locale by adaptively merging neighboring locales with too few individuals into larger locales. The primary advantage from this approach is to localize the regions as much as possible while retaining high statistical power to estimate locale (E) and gene x locale (GxE) spatial random effects. This also prevents potential bias introduced by estimating spatial risks via a smoothing kernel, as exemplified by one twin study that used an isotropic smoothing kernel to estimate the spatial distribution of the risk in mental illness, inadvertently biasing all outcomes, regardless of diagnosis, toward densely populated areas²⁷.

Estimating the effects associate with the locale

Mixed effects models provide the necessary tools to estimate the latent environmental and gene x environmental effects. Fixed effects in the model control for potential confounding factors, whereas random locale effects approximate the latent field across all spatial locations. Once the random effect variance is estimated and determined to be significantly greater than zero, spatial mapping is achieved through computing the posterior means of the random effects for each locale, defined by the best linear unbiased predictors²³.

To ensure the validity of this approach, we performed 1000 Monte Carlo simulations to determine how well we can estimate E and GxE via the spatial mixed effects model. Given a sample size of 30,000 individuals with disease prevalence of one percent and heritability of 70 percent (similar to the profile of schizophrenia³³), we obtain an unbiased estimation of spatial effects (E), while GxE effects are conservatively bounded by the predictive power of the genetic instrument (Supplementary Fig. 2 and Supplemental Information). As variance explained of the genetic liabilities increases for the genetic instrument, the amount of GxE effects explained is also increased.

Empirical study on the risk of schizophrenia

We demonstrate the feasibility of our spatial mapping approach by characterizing E and GxE effects of schizophrenia in the Danish population. To map the synergistic effects of locale of upbringing and schizophrenia genetic liability, chronological residential information and genotyping data from the same population-based cohort is needed. The Danish Lundbeck Foundation Initiative for Integrative Psychiatric Research (iPSYCH) case-cohort study provides a unique opportunity for this aim³⁴. Prior to iPSYCH, genome-wide association studies (GWAS) of psychiatric disorders have lacked information on locale of upbringing, while population registry studies with detailed residential locales have not yet implemented polygenic data analyses. By linking with the Danish Civil Registration System, iPSYCH has a nationally representative sample with whole-genome genotyping and detailed chronological residential information. Altogether with the case-cohort design¹⁷, these characteristics of iPSYCH enable us to obtain nationally representative estimates of the locale effects and the modulating effects of locale on genetic risk.

For this analysis, we extracted genotyped schizophrenia cases and a population random sample cohort from the iPSYCH study³⁴. The aim of the iPSYCH study was to combined biobank and national registry to comprehensively examine the genetic and environmental risk factors of mental illness³⁴. Cohort members (N = 30,000) were randomly sampled individuals from the entire Danish population born between 1981 and 2005 and surviving past 1 year of age (N = 1,472,262). Individuals with a diagnosis of selected mental disorders were ascertained through the Danish Psychiatric Central Research Register, using diagnostic classifications based on the International Classification of Diseases, 10th revision, Diagnostic Criteria for Research (Diagnostic code F20; ICD-10-DCR). The use of these samples is protected under strict regulation with the Danish legislation. The informed consent was obtained from all participants. The study is approved by the Danish Scientific Ethics Committee, the Danish Health Data Authority, the Danish data protection agency and the Danish Neonatal Screening Biobank Steering Committee. Here, we focused on a subset of cases who were diagnosed with schizophrenia. A flow chart of the recruitment can be found in the Supplementary Information (Supplementary Fig. 3). Patients with schizoaffective disorders were excluded. All psychiatric contacts until 31 December 2013 were obtained from the register, resulting in 3540 genotyped individuals diagnosed with schizophrenia. The residential locations of case-cohort members were obtained through linkage to the Danish Civil Registration System. To focus on the early life experience, i.e., upbringing effects, the residential location of an individual was retrieved at three ages: at birth, age 5 years, and age 7 years. Individuals’ exact locations were blurred to 1 km² grid cells to protect privacy. DNA samples were obtained from the Danish Neonatal Screening Biobank and sequenced with Infinium PsychChip v1.0 array (Illumina, San Diego, California, United States of America).

To prevent confounds due to recent emigration/immigration and large-scale ethnic differences, we restrict our analyses to unrelated individuals who are of European descent, as determined by genetic ancestry^35,36 and with both parents born in Denmark based on Danish registry information. The final analyses include 24,028 case-cohort members (2328 schizophrenia cases, 21,700 cohort members) who met above criteria and passed genotyping quality controls. Supplementary Table 1 demonstrates the basic demographic characteristics of the included case-cohort.

We performed our analysis of iPSYCH case-cohort based on a sequence intend to demonstrate the magnitude of partitioned E and GxE in the context of well-researched urbanicity effect. First, we examined the risk distribution through our algorithm for locale definition without multilevel modeling. This represents an overall risk distribution without partitioning the risks into different components. We use the Mantel-Haenszel approach for estimating risk ratios (RR) while correcting for age differences³⁷. Next, we implement the spatial mixed effects model to identify sources of variation in the observed risk across locales. Given the concern of potential confounds, all models include fixed effects of gender, the first three genetic principal components, and family history as covariates. Genetic principal components were covaried to reduce the potential for spatial confounds due to population history³⁵. Family history of psychosis was also covaried to avoid clustering of high-risk families and unmodeled rare genetic mutations³¹. Family history was obtained by querying parents’ records in the registry. Survival models were used to account for age distribution³⁴ and observations were weighted by the inverse of each subject’s sampling probability³⁸ for inclusion in iPSYCH. Time-to-event is defined as age at first hospital contact for schizophrenia for cases, and the minimum of age of death, disappearance, emigration or age at date of registry information collection (31 December 2013) for cohort members without schizophrenia. Because locale of upbringing, especially place at birth, has been consistently associated with a twofold increase in schizophrenia risk^4,5,14,17, we defined the locale based on the place at birth in our analysis. To reduce the effects of potential confounding caused by differences in time residing in the defined locale, we added the duration of residence in the same locale as a stratifying factor in models, so that only subjects residing the same time in a given locale are compared (5 years or 7 years due to the sampling time frames). For comparison purposes, we also fit a model with fixed-effect of the covariates and no random effects (Model 1).

As a byproduct of our locale defining algorithm, the population density of each locale is also automatically calculated, since the size of each locale is inversely proportional to the population density. In the statistical analyses, population density is a continuous instrument, derived by dividing the number of individuals by the area of the defined locale, using the locale at birth for population density. To determine whether we reproduce the urbanicity effects previously reported in Danish cohorts⁴, the effect measure for population density is contrasted between 55 person/km² (rural category) and 5220 person/km² (urban category). Sensitivity analyses indicate the effect measures remain the same if we use locale at age 5 or 7 years instead of locale at birth.

Genotype processing and deriving polygenic risk scores

Eleven million single-nucleotide polymorphisms (SNPs) were imputed based on genotyped SNPs that pass the following criteria: minor allele frequencies greater than 1 percent; frequencies in Hardy–Weinberg Equilibrium; SNPs autosomal and bi-allelic. SHAPEIT3 was used for phasing³⁹ and IMPUTE2 was used for imputation⁴⁰. The reference panel was 1000 genomes project phase 3⁴¹.

To control for potential confounds due to distant shared ancestry within the sample, we calculated genetic principal components (PCs) for iPSYCH samples. Genetic PCs were derived based on principal component analysis with a set of 43,769 independent SNP that are genotyped and passed quality control. We used flashPCA³⁶ to perform the calculation because of its computational speed. By including the leading PCs in the models, it reduces the risk of spurious findings emerging due to population stratification³⁵. Here, we used first three genetic principal components in our analysis since none of the remaining genetic principal components show associations with schizophrenia in iPSYCH sample.

To obtain a genetic instrument with good predictive power for detecting GxE, we calculated the polygenic risk score (PRS) using the summary statistics for 34,129 cases and 45,512 controls from the Psychiatric Genomics Consortium (PGC) Schizophrenia GWAS⁴². The PRS is the sum of the products of effect sizes of SNPs estimated from this independent GWAS and the dosage of those SNPs from the iPSYCH case-cohort. The included SNPs were pruned to ensure independence, while no significance threshold was set to filter SNPs. Parameters for calculating PRS include clumping (r² = 0.1, distance = 250 kb), and pruning (r² = 0.8, window = 2 kb, increment = 2 kb). Nonetheless, PRS is inherently a weak genetic instrument, so our estimate on GxE is as a conservative lower bound of interaction effects (Supplementary Fig. 2).

Code availability

The code used for simulations, empirical analysis, and visualization can be found at [https://chunchiehfan.shinyapps.io/iPSYCH_geo_tess_SZ/]. The interactive version of the disease mapping is shown on the web portal while all the relevant codes can be downloaded on it. All analyses are implemented in R⁴³. R packages employed include spatstat ⁴⁴and coxme⁴⁵. The geographical visualization is done with ggmap⁴⁶, which extracts geographical information from Google Maps. An interactive version of the risk map is generated using leaflet⁴⁷ and shiny⁴⁸.

Data availability

Data for generating figures are provided as Supplementary Information. All relevant data is available upon request.

References

Keller, M. C. Genexenvironment interaction studies have not properly controlled for potential confounders: The problem and the (simple) solution. Biol. Psychiatry 75, 18–24 (2014).
Article PubMed Google Scholar
McAllister, K. et al. Current challenges and new opportunities for gene-environment interaction studies of complex diseases. Am. J. Epidemiol. 186, 753–761 (2017).
Article PubMed PubMed Central Google Scholar
Rappaport, S. M. & Smith, M. T. Environment and disease risks. Science 330, 460 (2010).
Article ADS CAS PubMed PubMed Central Google Scholar
Pedersen, C. B. & Mortensen, P. B. Family history, place and season of birth as risk factors for schizophrenia in Denmark: a replication and reanalysis. Br. J. Psychiatry 179, 46–52 (2001).
Article CAS PubMed Google Scholar
Vassos, E., Pedersen, C. B., Murray, R. M., Collier, D. A. & Lewis, C. M. Meta-analysis of the association of urbanicity with schizophrenia. Schizophr. Bull. 38, 1118–1123 (2012).
Article PubMed PubMed Central Google Scholar
Pedersen, C. B. No evidence of time trends in the urban--rural differences in schizophrenia risk among five million people born in Denmark from 1910 to 1986. Psychol. Med. 36, 211–219 (2006).
Article PubMed Google Scholar
Pedersen, C. B. Persons with schizophrenia migrate towards urban areas due to the development of their disorder or its prodromata. Schizophr. Res. 168, 204–208 (2015).
Article PubMed Google Scholar
Colodro-Conde, L. et al. Association between population density and genetic risk for schizophrenia. JAMA Psychiatry 75, 901–910 (2018).
Article PubMed PubMed Central Google Scholar
Pedersen, C. B. & Mortensen, P. B. Urbanization and traffic related exposures as risk factors for schizophrenia. BMC Psychiatry 6, 2 (2006).
Article PubMed PubMed Central Google Scholar
Brown, A. S. & Derkits, E. J. Prenatal infection and schizophrenia: a review of epidemiologic and translational studies. Am. J. Psychiatry 167, 261–280 (2009).
Article Google Scholar
Werner, S., Malaspina, D. & Rabinowitz, J. Socioeconomic status at birth is associated with risk of schizophrenia: Population-based multilevel study. Schizophr. Bull. 33, 1373 (2007).
Article PubMed PubMed Central Google Scholar
Kirkbride, J. B., Jones, P. B., Ullrich, S. & Coid, J. W. Social deprivation, inequality, and the neighborhood-level incidence of psychotic syndromes in East London. Schizophr. Bull. 40, 169–180 (2014).
Article PubMed Google Scholar
Sariaslan, A. et al. Does population density and neighborhood deprivation predict schizophrenia? A nationwide swedish family-based study of 2.4 million individuals. Schizophr. Bull. 41, 494–502 (2015).
Article PubMed Google Scholar
Pedersen, C. B. & Mortensen, P. B. Are the cause (s) responsible for urban-rural differences in schizophrenia risk rooted in families or in individuals? Am. J. Epidemiol. 163, 971–978 (2006).
Article PubMed Google Scholar
Zammit, S. et al. Individuals, schools, and neighborhood: A multilevel longitudinal study of variation in incidence of psychotic disorders. Arch. General. Psychiatry 67, 914–922 (2010).
Article Google Scholar
van Os, J., Hanssen, M., Bak, M., Bijl, R. V. & Vollebergh, W. Do urbanicity and familial liability coparticipate in causing psychosis? Am. J. Psychiatry 160, 477–482 (2003).
Article PubMed Google Scholar
van Os, J., Pedersen, C. B. & Mortensen, P. B. Confirmation of synergy between urbanicity and familial liability in the causation of psychosis. Am. J. Psychiatry 161, 2312–2314 (2004).
Article PubMed Google Scholar
Krabbendam, L. & Van, Os,J. Schizophrenia and urbanicity: a major environmental influence---conditional on genetic risk. Schizophr. Bull. 31, 795–799 (2005).
Article PubMed Google Scholar
Grech, A., van Os, J. & Investigators, G. Evidence that the urban environment moderates the level of familial clustering of positive psychotic symptoms. Schizophr. Bull. 43, 325–331 (2017).
Article PubMed PubMed Central Google Scholar
Paksarian, D. et al. The role of genetic liability in the association of urbanicity at birth and during upbringing with schizophrenia in Denmark. Psychol. Med. 48, 1–10 (2017).
Visscher, P. M. et al. 10 Years of GWAS discovery: Biology, function, and translation. Am. J. Human. Genet. 101, 5–22 (2017).
Article CAS Google Scholar
Snow, J. On the Mode of Communication of Cholera. (John Churchill, London, 1855).
Kelsall, J. & Wakefield, J. Modeling spatial variation in disease risk: a geostatistical approach. J. Am. Stat. Assoc. 97, 692–701 (2002).
Article MathSciNet MATH Google Scholar
Faris, R. E. L. & Dunham, H. W. Mental Disorders in Urban Areas: an Ecological Study of Schizophrenia and Other Psychoses. (Univ. Chicago Press, Oxford, England, 1939).
Kirkbride, J. B. et al. Neighbourhood variation in the incidence of psychotic disorders in Southeast London. Social. Psychiatry Psychiatr. Epidemiol. 42, 438–445 (2007).
Article Google Scholar
Torrey, E. F., Mortensen, P. B., Pedersen, C. B., Wohlfahrt, J. & Melbye, M. Risk factors and confounders in the geographical clustering of schizophrenia. Schizophr. Res. 49, 295–299 (2001).
Article CAS PubMed Google Scholar
Davis, O. S. P., Haworth, C. M. A., Lewis, C. M. & Plomin, R. Visual analysis of geocoded twin data puts nature and nurture on the map. Mol. Psychiatry 17, 867 (2012).
Article CAS PubMed PubMed Central Google Scholar
Austin, P. C., Wagner, P. & Merlo, J. The median hazard ratio: a useful measure of variance and general contextual effects in multilevel survival analysis. Stat. Med. 36, 928–938 (2017).
Article MathSciNet PubMed Google Scholar
Meier, S. M. et al. High loading of polygenic risk in cases with chronic schizophrenia. Mol. Psychiatry 21, 969–974 (2016).
Article CAS PubMed Google Scholar
Vernal, D. L. et al. Validation study of the early onset schizophrenia diagnosis in the Danish Psychiatric Central Research Register. Eur. Child & Adolesc. Psychiatry 27, 965–975 (2018).
Article Google Scholar
Lu, Y. et al. Genetic risk scores and family history as predictors of schizophrenia in Nordic registers. Psychol. Med. 48, 1201–1208 (2018).
Barr, C. D. & Schoenberg, F. P. On the Voronoi estimator for the intensity of an inhomogeneous planar Poisson process. Biometrika 97, 977–984 (2010).
Hilker, R. et al. Heritability of schizophrenia and schizophrenia spectrum based on the Nationwide Danish Twin Register. Biol. Psychiatr. 83, 492–498 (2018).
Pedersen, C. B. et al. The iPSYCH2012 case-cohort sample: new directions for unravelling genetic and environmental architectures of severe mental disorders. Mol. Psychiatr. 23, 6–14 (2017).
Price, A. L. et al. Principal components analysis corrects for stratification in genome-wide association studies. Nat. Genet. 38, 904 (2006).
Article CAS PubMed Google Scholar
Abraham, G., Qiu, Y. & Inouye, M. FlashPCA2: principal component analysis of biobank-scale genotype datasets. Bioinformatics 33, 2776–2778 (2016).
Article CAS Google Scholar
Rothman, K. J., Greenland, S. & Lash, T. L. Modern Epidemiology. (Lippincott, Williams & Wilkins, Philadelphia, 2008).
Barlow, W. E., Ichikawa, L., Rosner, D. & Izumi, S. Analysis of case-cohort designs. J. Clin. Epidemiol. 52, 1165–1172 (1999).
Article CAS PubMed Google Scholar
O'Connell, J. et al. Haplotype estimation for biobank-scale data sets. Nat. Genet. 48, 817–820 (2016).
Howie, B. N., Donnelly, P. & Marchini, J. A flexible and accurate genotype imputation method for the next generation of genome-wide association studies. PLoS Genet. 5, e1000529 (2009).
Article CAS PubMed PubMed Central Google Scholar
1000 Genome Project. An integrated map of genetic variation from 1,092 human genomes. Nature 491, 56 (2012).
Article CAS Google Scholar
Ripke, S. et al. Biological insights from 108 schizophrenia-associated genetic loci. Nature 511, 421 (2014).
Article ADS CAS PubMed Central Google Scholar
R Core team. R: A Language and Environment for Statistical Computing (R Foundation for Statistical Computing, Vienna, 2016).
Baddeley, A. & Turner, R. spatstat: An R package for analyzing spatial point patterns. J. Stat. Softw. 12, 1–42 (2005).
Article Google Scholar
Therneau, T. M. coxme: Mixed Effects Cox Models. R package version 2.2-10. (2018). https://CRAN.R-project.org/package=coxme.
Kahle, D. & Wickham, H. ggmap: Spatial Visualization with ggplot2. The R Journal 5, 144–161 (2013).
Google Scholar
Cheng, J., Karambelkar, B. & Xie, Y. leaflet: Create Interactive Web Maps with the JavaScript 'Leaflet' Library. R package version 2.0.2. (2018). https://CRAN.R-project.org/package=leaflet.
Chang, W., Cheng, J., Allaire, J. J., Xie, Y. & McPherson, J. shiny: Web Application Framework for R. R package version 1.2.0. (2017). https://CRAN.R-project.org/package=shiny.

Download references

Acknowledgements

This study was supported by the Lundbeck Foundations Initiative for Integrated Psychiatic Reseach, IPSYCH (grant numbers R102-A9118 and R155-2014-1724), Denmark, the Novo Nordisk Foundation (Big Data Center for Environment and Health, grant number NNF17OC0027864), and conducted using the Danish National Biobank resource supported by the Novo Nordisk Foundation. J.M. was supported by a NHMRC Project John Cade Fellowship (APP1056929) and a Niels Bohr Professorship from the Danish National Research Foundation. W.K.T. and A.J.S. were supported by 1R01GM104400.

Author information

These authors contributed equally: Wesley K. Thompson, Carsten Bøcker Pedersen.

Authors and Affiliations

Center for Human Development, University of California, San Diego, CA, 92093, USA
Chun Chieh Fan
Mental Health Center Sct. Hans, Capital Region of Denmark, Roskilde, 4000, Denmark
Chun Chieh Fan, Vivek Appadurai, Alfonso Buil, Andrew J. Schork, Thomas Werge & Wesley K. Thompson
National Centre for Register-based Research, Aarhus University, Aarhus, 8210, Denmark
John J. McGrath, Preben Bo Mortensen, Esben Agerbo & Carsten Bøcker Pedersen
Queensland Brain Institute, University of Queensland, St. Lucia, QLD, 4072, Australia
John J. McGrath
Queensland Centre for Mental Health Research, Wacol, QLD, 4076, Australia
John J. McGrath
The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, Aarhus and Copenhagen, Denmark
Vivek Appadurai, Alfonso Buil, Andrew J. Schork, Preben Bo Mortensen, Esben Agerbo, Thomas Werge, Wesley K. Thompson & Carsten Bøcker Pedersen
Department of Psychiatry and Biobehavioral Sciences, University of California, Los Angeles, CA, 90095, USA
Michael J. Gandal & Daniel Geschwind
Centre for Integrated Register-based Research, CIRRAU, Aarhus University, Aarhus, 8210, Denmark
Preben Bo Mortensen, Esben Agerbo & Carsten Bøcker Pedersen
Scientific Decision Consulting, Santa Monica, CA, 90401, USA
Sandy A. Geschwind
Department of Neurology, University of California, Los Angeles, CA, 90095, USA
Daniel Geschwind
Department of Clinical Sciences, University of Copenhagen, Copenhagen, 2200, Denmark
Thomas Werge
Institute of Biological Psychiatry, Mental Health Services of Copenhagen, Copenhagen, 4000, Denmark
Thomas Werge
Family Medicine and Public Health Division of Biostatistics, University of California, San Diego, CA, 92093, USA
Wesley K. Thompson
Big data Centre for Environment and Health, Aarhus University, Aarhus, 8210, Denmark
Carsten Bøcker Pedersen

Authors

Chun Chieh Fan
View author publications
You can also search for this author in PubMed Google Scholar
John J. McGrath
View author publications
You can also search for this author in PubMed Google Scholar
Vivek Appadurai
View author publications
You can also search for this author in PubMed Google Scholar
Alfonso Buil
View author publications
You can also search for this author in PubMed Google Scholar
Michael J. Gandal
View author publications
You can also search for this author in PubMed Google Scholar
Andrew J. Schork
View author publications
You can also search for this author in PubMed Google Scholar
Preben Bo Mortensen
View author publications
You can also search for this author in PubMed Google Scholar
Esben Agerbo
View author publications
You can also search for this author in PubMed Google Scholar
Sandy A. Geschwind
View author publications
You can also search for this author in PubMed Google Scholar
Daniel Geschwind
View author publications
You can also search for this author in PubMed Google Scholar
Thomas Werge
View author publications
You can also search for this author in PubMed Google Scholar
Wesley K. Thompson
View author publications
You can also search for this author in PubMed Google Scholar
Carsten Bøcker Pedersen
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

C.C.F., W.K.T., and C.B.P. designed the study, performed data analysis, interpreted the results, and wrote the manuscript. V.A., A.B., and A.J.S. collected the data, performed data quality control, and performed data analysis. J.M., M.J.G., D.G., S.G., P.B.M., E.A., and T.W. provide substantial inputs on revising the manuscript. All authors approved the final manuscript.

Corresponding authors

Correspondence to Wesley K. Thompson or Carsten Bøcker Pedersen.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplementary Information

Peer Review File

Source Data

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Fan, C.C., McGrath, J.J., Appadurai, V. et al. Spatial fine-mapping for gene-by-environment effects identifies risk hot spots for schizophrenia. Nat Commun 9, 5296 (2018). https://doi.org/10.1038/s41467-018-07708-7

Download citation

Received: 14 May 2018
Accepted: 16 November 2018
Published: 13 December 2018
DOI: https://doi.org/10.1038/s41467-018-07708-7

This article is cited by

Geographical variation in treated psychotic and other mental disorders in Finland by region and urbanicity
- Kimmo Suokas
- Olli Kurkela
- Sami Pirkola
Social Psychiatry and Psychiatric Epidemiology (2024)
Geospatial investigations in Colombia reveal variations in the distribution of mood and psychotic disorders
- Janet Song
- Mauricio Castaño Ramírez
- Sally Blower
Communications Medicine (2024)
Heritability Estimation of Cognitive Phenotypes in the ABCD Study® Using Mixed Models
- Diana M. Smith
- Robert Loughnan
- Anders M. Dale
Behavior Genetics (2023)
Association of ADH7 Gene Polymorphism with Schizophrenia in the Han Population of Northern China: a Case-Control Study
- Kuo Zeng
- Ya Li
- Bao-jie Wang
Journal of Molecular Neuroscience (2020)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.