Leveraging genome characteristics to improve gene discovery for putamen subcortical brain structure

Chen, Chi-Hua; Wang, Yunpeng; Lo, Min-Tzu; Schork, Andrew; Fan, Chun-Chieh; Holland, Dominic; Kauppi, Karolina; Smeland, Olav B.; Djurovic, Srdjan; Sanyal, Nilotpal; Hibar, Derrek P.; Thompson, Paul M.; Thompson, Wesley K.; Andreassen, Ole A.; Dale, Anders M.

doi:10.1038/s41598-017-15705-x

Download PDF

Article
Open access
Published: 16 November 2017

Leveraging genome characteristics to improve gene discovery for putamen subcortical brain structure

Scientific Reports volume 7, Article number: 15736 (2017) Cite this article

1849 Accesses
9 Citations
Metrics details

Subjects

Abstract

Discovering genetic variants associated with human brain structures is an on-going effort. The ENIGMA consortium conducted genome-wide association studies (GWAS) with standard multi-study analytical methodology and identified several significant single nucleotide polymorphisms (SNPs). Here we employ a novel analytical approach that incorporates functional genome annotations (e.g., exon or 5′UTR), total linkage disequilibrium (LD) scores and heterozygosity to construct enrichment scores for improved identification of relevant SNPs. The method provides increased power to detect associated SNPs by estimating stratum-specific false discovery rate (FDR), where strata are classified according to enrichment scores. Applying this approach to the GWAS summary statistics of putamen volume in the ENIGMA cohort, a total of 15 independent significant SNPs were identified (conditional FDR < 0.05). In contrast, 4 SNPs were found based on standard GWAS analysis (P < 5 × 10⁻⁸). These 11 novel loci include GATAD2B, ASCC3, DSCAML1, and HELZ, which are previously implicated in various neural related phenotypes. The current findings demonstrate the boost in power with the annotation-informed FDR method, and provide insight into the genetic architecture of the putamen.

Multi-ancestry eQTL meta-analysis of human brain identifies candidate causal variants for brain-related traits

Article 20 January 2022

A significant, functional and replicable risk KTN1 variant block for schizophrenia

Article Open access 08 March 2023

Polygenic risk for schizophrenia and subcortical brain anatomy in the UK Biobank cohort

Article Open access 09 September 2020

Introduction

Many aspects of the human brain, including subcortical structures, are highly heritable. Twin studies have shown genetic influences accounted for approximately 40–80% of the variance in the volume of subcortical structures^1,2,3,4. Thus, it is unequivocal that brain structures are highly influenced by genetic factors. However, our knowledge of which genetic variants are associated with brain structural variations is currently limited. A large genome-wide association study (GWAS) by the Enhancing Neuro Imaging Genetics through Meta-Analysis (ENIGMA) consortium found the strongest effects for the putamen³. Four independent significant loci associated with the putamen volume were reported and all together they explained about 1.1% of phenotypic variance and 1.4–2.2% of estimated genetic variance (twin-based heritability estimated to be ~0.8^1,3 and SNP-based heritability estimated to be ~0.5⁵), suggesting most of the genetic variance has yet to be identified.

Putamen is part of the basal ganglia, which are a group of subcortical structures involved in sensorimotor, associative, reward and mnemonic functions⁶. The dorsal part of the basal ganglia is generally associated with motor and associative functions, while the ventral part is associated with reward and motivation processes^6,7,8. Dopamine is the main neurotransmitter that regulates brain activity, and the dopaminergic pathway is one of the most important anatomical substrates for reward, such as food, drugs and social interactions^9,10, and an important pharmacological target for schizophrenia or Parkinson’s disease treatment¹¹. Alterations in putamen activity or volumes have been implicated in psychiatric and substance use disorders^12,13,14,15. However, how genes play a part in these neuroanatomical and functional characteristics remain mainly unknown. Our analysis to identify novel common genetic variants influencing the variation of putamen volume in the human population could be utilized as resources to examine genetic contribution to brain structure, function and disorders.

Large GWAS have successfully identified thousands of single nucleotide polymorphisms (SNPs) associated with hundreds of human complex traits^16,17, thus improving our understanding of the genetic basis of many human diseases and traits. The emerging consensus from GWAS suggests that complex traits and diseases exhibit a polygenic architecture composed of many individually small effects. A polygenic architecture poses challenges for GWAS, as a massive number of statistical tests reduce power considerably for detecting small signals. As widely recognized, SNPs that exceed the GWAS significance threshold explain only a small fraction of the heritability^18,19.

To mitigate this limitation of the standard GWAS approach, we employ a framework that concurrently uses genic annotations, heterozygosity, total linkage-disequilibrium (LD) scores, and summary statistics from GWAS²⁰. We have shown that ranking SNPs according to these genome characteristics yields a larger number of loci surpassing a given threshold than ranking SNPs according to their nominal P values alone²¹. Ranking SNPs by incorporating genomic annotations and other sources of “enrichment” along with the P values obtained from existing large GWAS allows us to accelerate discovery of genetic variants associated with the phenotype of interest in a cost-efficient manner. This approach can be useful especially when phenotypes are very difficult to attain for a sufficiently large number of subjects, as is the case with brain imaging phenotypes.

The rationale behind our framework is that polymorphism variations in and around genes have been shown to harbor more genetic effects than intergenic regions^22,23,24,25. This observation suggests that some categories of SNPs such as regulatory and coding elements of protein coding genes are more enriched for genetic effects on a phenotype than other SNPs^20,26,27. We use our previously developed LD-weighted genic annotation method that takes into account the LD structure to select SNPs that are related to various functional categories of the genome such as exon, intron, and 3′UTR²⁰. In addition to the genic annotation, we use other information in the genome to improve gene discovery, including heterozygosity (H, where H = 2 f (1 − f); f is allele frequency for either of the two SNP alleles) and total LD scores of individual SNPs, because variants that are of high frequency and in regions of extensive LD are more detectable in GWAS^28,29. We integrate these various sources of enrichment information to construct a relative enrichment score (RES) for each SNP, which was used in our previous study of Covariate-Modulated Mixture Model (CM3)²¹. RES is defined as the estimated enrichment (X \(\hat{{\boldsymbol{\beta }}}\)) obtained from a logistic regression model for the thresholded GWAS summary statistics using LD-weighted genic annotation categories and total LD scores with heterozygosity weightings as explanatory variables. We then re-rank the SNPs based on their RES (instead of GWAS P values), and categorize the SNPs into several strata. For each stratum, the stratum-specific information can be used to calculate a stratified True Discovery Rate (TDR). We hypothesize that by incorporating prior enrichment information of the genome in the analysis of genotype-phenotype mapping, we can improve power to discover common genetic variants associated with the putamen volume.

Results

The Q-Q plot of putamen stratified by relative enrichment scores

The stratified Q-Q plot shows different enrichment levels across RES strata, which deviate further away from the null line as RES increases (Fig. 1a). An earlier or greater departure from the null line (leftward shift) suggests a larger proportion of true associations for a given nominal P value. SNPs with higher RES (X \(\hat{{\boldsymbol{\beta }}}\), see Methods), calculated by a logistic regression model incorporating annotation categories, total LD and heterozygosity, are more likely to be associated with putamen than those with lower RES.

True discovery rate (TDR)

Variation in enrichment across RES strata is associated with corresponding variation in TDR for a given P value threshold (Fig. 1b). The enrichment can be directly interpreted in terms of TDR formed by estimating 1 − P/Q for each nominal P value from the stratified Q-Q plots (see Methods). This relationship is shown for putamen, the corresponding estimated TDR increases as RES stratum increases. The top RES stratum contained SNPs reaching high TDR earlier than those in other strata, indicating its greater power to identify SNPs associated with putamen.

Predicted stratified Q-Q plot and TDR

In addition to model-free Q-Q and TDR plots generated by empirical distributions, we applied a model-based method to fit Q-Q and TDR curves in each stratum. The fitted Q-Q plot (dotted curves of Fig. 1a) is generated by using Weibull-chi-square mixture distribution (see Methods) and then the fitted TDR plot (dotted curves of Fig. 1b) is estimated by 1 − P/Q for each nominal P value as described above. Specifically, the model-based method did not perform well in Stratum 1, but had a good fitting in Stratum 2 and 3 with a larger proportion of trait-associated SNPs. In addition, curves in high TDR (i.e. low FDR) were well fit by the parametric model, which might facilitate obtaining good predicted values of TDR for detecting significant SNPs.

Lookup table

Given nominal P values, the lookup tables were constructed by interpolated FDR conditional on RES strata (Supplementary Fig. S1b) and by interpolated FDR for all combined strata (Supplementary Fig. S1a). In the lookup table, a gradual decrease of FDR from the bottom-left to top-right corners suggests improved enrichment by stratification of RES (shown as a gradual increase of −log₁₀ (FDR) in the figure) and smooth gradients indicate good interpolation for our FDR estimate. Given 0.05 of FDR threshold (i.e., ~1.3 for −log₁₀ (FDR)), the corresponding nominal P value is around 10⁻⁷ for lower level of RES, whereas nominal P value reduces to around 10⁻³ for higher level of RES.

P value and conditional FDR results

SNPs associated with putamen were identified by P value threshold of GWAS and FDR conditional on RES. To ensure that significant loci are independent, we removed SNPs with LD r ² > 0.2 and retained the SNP with the lowest FDR P value in each LD block. For a GWAS threshold of P value < 5×10⁻⁸, a total of 4 independent SNPs located in different loci were found (Table 1). Given a threshold of conditional FDR < 0.05, we identified 15 significant independent SNPs (Table 1). Compared to the same threshold for unconditional FDR, 8 SNPs were identified. Although SNPs detected by P value are not entirely overlapping with SNPs discovered by conditional FDR, their neighboring SNPs in the same LD block (i.e., LD r ² > 0.2 between SNPs) showed lower conditional FDR values.

Table 1 Genetic variants associated with putamen with conditional FDR < 0.05.

Full size table

To visualize SNPs associated with putamen, we constructed a Manhattan plot showing the FDR stratified by RES. The 15 independent loci were identified with a significance threshold of conditional FDR < 0.05 (Table 1), were plotted in the Manhattan plot (Fig. 2) where gene names for those loci were also shown, except intergenic SNPs. Interestingly, several significant loci with stronger signals were distributed on chromosomes 14 and 18.

Method comparison: fgwas

Applying fgwas to our putamen GWAS summary statistics data, only one SNP was prominent at posterior probability > 0.5, two SNPs (in the same LD block) at posterior probability > 0.4, nine SNPs (in four LD blocks) at posterior probability > 0.1. The posterior probabilities for 15 significant SNPs were shown in Supplementary Table S1 and none of SNPs with posterior probability > 0.5. The SNP rs8017172 was most significant at levels of P value, conditional FDR and posterior probability.

Discussion

By applying a new method capitalizing on genic annotations, heterozygosity and total LD, we were able to model the cumulative probability distributions of SNPs assigned to different strata and detected 15 significant loci including 4 SNPs reported in the original GWAS paper. Using enrichment information increased the power to find more independent significant loci.

Of the 15 loci influencing putamen volume, 4 have been reported in the original GWAS paper³. Among 11 novel loci, we identified an intronic locus (rs597583, P = 3.59 × 10⁻⁷, conditional FDR = 0.0174) within DSCAML1 (Down syndrome cell adhesion molecule like 1), which is expressed in the brain and produces cell adhesion molecule that is involved in formation and maintenance of neural networks and neurite arborization^30,31. The chromosomal locus of this gene on 11q23 has been suggested as a candidate for neuronal disorders³¹, because 11q23 contains a number of genes and gene families expressed in the nervous system and harbors candidate regions for several diseases with neurological features³². Its paralog, DSCAM, a conserved gene has been found to be involved in learning-related synapse formation in aplysia³³. We also detected an intronic locus within GATAD2B (GATA zinc finger domain containing 2B), which is a protein coding gene and may play a role in synapse development and normal cognitive performance³⁴. Diseases associated with GATAD2B include mental retardation and severe intellectual disability with distinct facial features^34,35. Two other associated genes, ASCC3 (activating signal cointegrator 1 complex subunit 3) and HELZ (helicase with zinc finger), encode proteins that belong to the helicase family for unwinding double-strands, which may be involved in DNA repair or RNA metabolism in multiple tissues with ubiquitous expression for ASCC3 and predominant expression in thymus and brain for HELZ ^36,37. In addition, DLG2 identified previously³ and the novel locus, ASCC3, were reported to be suggestively associated with neurodegenerative diseases, Parkinson’s disease³⁸ and multiple system atrophy³⁹, respectively. The dopamine deficiency within the basal ganglia leads to Parkinsonian motor symptoms⁴⁰ and putamen is part of the basal ganglia. In patients with Parkinson subtype of multiple system atrophy, the volume of putamen was observed to be atrophic, and MRI signals in the putamen were shown to be marginally hyperintense⁴¹. This evidence suggested that DLG2 and ASCC3 might have pleiotropic effects on putamen and neurodegenerative diseases; on the other hand, they might influence neurodegenerative diseases through alteration of putamen. Please see Table 1 for the full list of the loci discovered.

We have recently developed the FDR approach for improved gene discovery in complex genetic phenotypes^42,43,44,45. Applying this approach to brain structure phenotypes, we increased discovery of loci jointly influencing schizophrenia and brain structure volumes⁴⁶. The current findings suggest that re-prioritizing SNPs according to their characteristics is advantage for gene discovery in the context of FDR. It has increasingly become evident that certain genomic regions harbor more genetic effects on a given phenotype than other genomic regions²¹. The characteristic of heterozygosity for each SNP represents power to detect genetic effects in the sample population of association analysis. Heterozygosity is defined as 2 f(1 − f) and f is the SNP minor allele frequency, which is the genotype variance in the regression model with a higher value for common variants. It is known that allele frequency plays an important role in determining power of SNP associations⁴⁷ (Supplementary Fig. S2), and square root of heterozygosity (H) is directly proportional to GWAS summary statistics (z values) for each SNP. We also incorporate total LD scores from the reference genomes of the same genetic ancestry (i.e. European) when calculating the relative enrichment score (RES), because if a SNP is in a large LD block, it is more likely to be linked with one or more causal variants. All of these characteristics have predictive power for genetic effects on phenotypes thus are used to construct RES for each SNP. Their enrichment features were explored and visualized in Q-Q plots (Supplementary Fig. S2). As described in Methods, RES is defined as the predicted response X \(\hat{{\boldsymbol{\beta }}}\) from a logistic regression with these SNP characteristics as predictors, representing a composite score of estimated enrichment. All SNPs are re-ranked and stratified by their RES.

The existing methods, such as those based on stratified and conditional FDR have been shown to be superior to the traditional GWAS because they incorporate auxiliary information for stratification^47,48. Indeed, there is often natural stratification present in the data such as stratification by allele frequency⁴⁷ or genome annotation. Therefore, treating SNPs by strata with the incorporation of prior information, we increased the power to detect trait-associated SNPs. In comparison with fgwas⁴⁹ which incorporates annotation information, we detected additional seven SNPs that were not identified by P value or unconditional FDR, and fgwas detected one SNP with posterior probability > 0.5 which was already significant at genome-wide P value level. It is of note that our approach is in line with the stratified FDR method⁴⁷ (computation of FDR by strata), however, we make our FDR values continuous by computing FDR estimates on a grid and interpolating these estimates⁴⁸. Presumably, the continuous estimates will more realistically reflect the FDR estimates for SNPs that fall in between the stratum Q-Q curves. We also build on the modeling framework initially presented in the CM3 in which the RES was formulated²¹.

There are some methodological considerations in the approach. First, although we found more significant loci than those identified by the traditional GWAS approach, all the novel loci collectively only explained a small fraction of heritability, suggesting that most of the trait-associated loci are still uncovered. Second, some parameters in our model cannot be precisely determined, for example, the total number of the strata or the percentile coverage of each stratum. This caveat is partly due to the fact that the true underlying genetic architecture of complex traits is seldom known in advance (e.g., the level of polygenicity, or annotations and allele frequencies of causal variants), and elucidating this issue is the subject of on-going research. Changes in these parameters may affect the significance status for SNPs close to the threshold, but most of the highly significant SNPs remained robust. Third, our method uses summary statistics of GWAS and hence inherits the limitations of GWAS. For example, multi-loci association analyses that take into account effects of other SNPs may give more unbiased estimates of genetic effects for each SNP. Along this line, association findings from GWAS are susceptible to the presence of population structure. Although principal components may represent broad differences across the sample, other polygenic mixed linear models including genetic relationship matrices have been proposed to be less susceptible to population structures and to increase the precision of genetic effect estimation^50,51. Fourth, our approach is conservative formulation for FDR estimation in the enriched strata. We assume π₀ = 1 in the model, but the top enriched stratum has lower π₀ (i.e., lower proportion of null SNPs) so that the FDR as P/Q can be overestimated. However, other aspects in the analysis such as correlations among SNPs might over- and under-estimate FDR. We tried to minimize this issue by randomly pruning SNPs with a stringent threshold of r ² = 0.2. We also randomly pruned the SNPs with a less stringent threshold (r ² = 0.8). The analysis identified the same 15 loci as those from the analysis of using r ² = 0.2. Fifth, there are other methods that use prior information. Many of these methods are Bayesian association study methods and calculate Bayes factors or posterior probability of association^49,52,53,54. Some of the approaches are scalable to a very large number of functional annotations or characteristics of SNPs, and is relatively more complicated to apply in practice. Other methods propose various ways to include SNP characteristics such as multi-thresholding by varying the significant threshold at each SNP^29,55 or defining weightings to SNPs depending on prior information via multivariate regression⁵⁶. Our current approach is built on our prior work^20,48,57,58, which is based on a Bayesian two-groups mixture model for Fdr control by Efron⁵⁹. Our straightforward approach complementary to other methods can be a useful tool for gene discovery.

GWAS is an efficient tool to survey through genome-wide millions of loci for identifying any trait-associated SNPs in a hypothesis-free manner with all SNPs treated identically. As previously shown, SNPs from GWAS with sub-threshold P values account for a considerable proportion of the variance in independent samples, suggesting that these sub-threshold SNPs are enriched for genetic effects⁶⁰. Our method is built on the GWAS approach and utilizes GWAS summary statistics in the framework of FDR as a screening tool to uncover subthreshold and high-priority candidates by incorporating genic annotations. The resulting FDR estimates may have utility as resources or databases for hypothesis generation, and could aid in more robust and meaningful candidate gene selection (e.g., testing causal genetic effects in biological experiments).

Methods

Participant samples

We obtained putamen GWAS results in the form of summary statistics from the ENIGMA consortium. The putamen GWAS summary statistic data consisted of 12,596 participants derived from 26 substudies with all European ancestry, which is a subset of GWAS discovery sample (N = 13,171) published in 2015³. Putamen GWAS was used for its better power than GWAS of other subcortical structures. All participants in substudies gave written informed consent and sites involved obtained approval from local research ethics committees or Institutional Review Boards³.

Putamen structural measure

The subcortical putamen measure was obtained from structural MRI data collected, processed and examined for quality at participating sites, following a standardized protocol procedure (http://enigma.ini.usc.edu/protocols/imaging-protocols/) to harmonize the analysis across sites³. In addition, the measure of head size (intracranial volume, ICV) were calculated and corrected for the subcortical measures in the association analyses³.

Genotyping and imputation

Samples were genotyped using commercially available platforms and assessed for genetic homogeneity using multi-dimensional scaling (MDS) analysis to exclude ancestry outliers in each substudy³. SNPs with low minor allele frequency (<0.01), poor genotype call rate (<95%), and deviations from Hardy–Weinberg equilibrium (P < 1 × 10⁻⁶) were filtered³. The imputation and quality control procedures were followed by the protocol (http://enigma.ini.usc.edu/protocols/genetics-protocols/) using MaCH⁶¹ for haplotype phasing and minimac⁶² for imputation³. Poorly imputed SNPs (with r ² < 0.5) and SNPs with low minor allele count (<10) were removed. The total number of SNPs included in the analysis for each substudy ranged between 6.9–10.5 million.

Genome-wide association analysis

The association analysis between putamen measure and each SNP (additive dosage value) was based on a multiple linear regression model controlling for age, square of age, sex, 4 MDS components, ICV, diagnosis (when applicable) and centers/scanners (for substudies with data collected from several centers/scanners)³. The protocols used for testing association can be found online (http://enigma.ini.usc.edu/protocols/genetics-protocols/) with mach2qtl⁶¹ for substudies of unrelated subjects and merlin-offline⁶³ for family-based designs³.

Meta-analysis of genome-wide association results from substudies

The GWAS results from each substudy were corrected for genomic inflation³. The meta-analysis was performed using a fixed-effect, inverse-variance model implemented in the software package METAL⁶⁴.

SNPs for the meta-analysis were reduced into ~2.5 million SNPs based on pre-calculated LD-weighted annotation scores for individual SNPs (see the section below). The correlation structure of SNPs for calculating annotation scores was determined by an LD matrix of 2,549,449 autosomal SNPs generated from the European reference sample in the 1000 Genomes Project phase1 v3 within 1,000,000 base pairs (1 Mb)²⁰.

LD-weighted genic annotation

Each SNP analyzed in our study was annotated with LD-weighted genic annotation scores. The score was calculated based on the European reference sample provided by the November 2012 release of the Phase I 1000 Genomes Project (1KGP). Specially, each SNP in the 1KGP reference panel was initially assigned to a single mutually exclusive genic annotation category based on its genomic position (the UCSC gene database, hg19). Eight genic annotation categories were used: exon, intron, 5′ untranslated region (5′UTR), 3′UTR, 1 and 10 kilo-base pairs upstream of the gene transcription start positions, and 1 and 10 kilo-base pairs downstream of gene transcription end positions⁶⁵. Pairwise LD scores (r ²) between SNPs were calculated. For each SNP, a continuous, non-exclusive LD-weighted category score was assigned as the LD weighted sum of the positional category scores for variants tagged in each of the eight categories mentioned above. By incorporating LD information, the annotation of individual SNPs reflects the weighted annotation in the context of underlying linkage blocks. For detailed information on SNP annotation, score construction and quality control see Schork et al.²⁰.

Relative enrichment score (RES)

Let p denote the P value of a particular SNP from GWAS summary statistics data. We defined y = 1 if p ≤ p _thresh (p _thresh = 10⁻³ in the current study) and y = 0 otherwise, to divide the SNPs into those that are more likely to have a non-null effect and those that are more likely to have null effects. A multiple logistic regression model was fit: logit[Pr(y = 1 | X = x))] = (β ₁x₁ + β ₂x₂ + … + β _k x _k)H, where the x _i, i = 1…k are the nine predictors for a SNP’s association with the phenotype. We included the genic annotation scores from the eight categories and total LD scores (TLD) weighted by heterozygosity (H = 2 f(1− f), where f is the SNP minor allele frequency from the 1KGP European reference panel), because they have been shown to associate with strength of association and probability of replication for many complex phenotypes²⁰. The RES for the SNP is defined as the estimated value, \({\bf{X}}\hat{{\boldsymbol{\beta }}}\), from the above logistic regression model. We have used this RES approach in a previous paper²¹. Before computing the RES, SNPs were randomly pruned at LD r ² < 0.8. Correlated SNPs do not affect β estimation so we prune SNPs at a liberal threshold.

GWAS summary statistics data used to calculate RES ideally should be an independent data set from the data set for gene discovery to avoid overfitting problems by fitting the same data set twice (i.e., calculating RES first and estimating conditional FDR second). However, it is often hard to obtain two or three independent GWAS data sets of a given phenotype (the third one for replication analysis). Our prior work and that of others have observed that height is extremely polygenic and its pattern of SNP associations has several typical features such as that associated signals are near genes^20,66. Height can be used as a proxy of a generic phenotype for complex traits and its GWAS summary statistics can be used to locate polygenic loci in the genome, when multiple independent GWAS data sets of the phenotype of interest are not available. We previously adopted a similar approach using height GWAS to train the logistic regression for computing SNP enrichment scores^20,67.

Stratified Q-Q plots and enrichment

Q-Q plots are standard tools for assessing the degree of similarity between two cumulative distribution functions (CDFs). When the probability distribution of GWAS summary statistic P values is of interest, under the global null hypothesis, the theoretical distribution is uniform on the interval [0,1]. If nominal P values are ordered from smallest to largest, so that P(1) < P(2) < … < P(N), the corresponding empirical CDF, denoted by “Q,” is simply Q(i) = i/N, where N is the number of retained SNPs. Thus, for a given index i, the x-coordinate of the Q-Q curve is Q(i) and the y-coordinate is the nominal P value P(i). Instead of plotting nominal P values against empirical P values, in GWAS it is common practice to plot −log₁₀ nominal P values against the −log₁₀ empirical P values, Q, so as to emphasize tail probabilities of the theoretical and empirical distributions. Leftward deflections of the observed Q-Q curves from the projected null line reflect increased tail probabilities in the distribution of test statistics and consequently an over-abundance of low P values compared to that expected by chance. We qualitatively refer to this deflection as “enrichment”^20,43.

To assess improved enrichment afforded by genic annotations, heterozygosity and total LD, we used stratified Q-Q plots based on RES. We classified SNPs with the bottom 25–30% RES as the first stratum and the SNPs with the top 1–5% RES as the last stratum. The rest of SNPs are in the second stratum. There is no overlapping set of SNPs between strata. The reason for uneven placement of stratum cut-offs at the two ends of the RES distribution was based on our previous observation that, for the distribution of effects in complex traits, a large proportion of SNPs have negligible effects and a very small proportion of SNPs have non-negligible effects. We then constructed RES stratified Q-Q plots of empirical quantiles of nominal SNP association with putamen for all SNPs, and for subsets of SNPs in each of three strata determined by their RES. Improved enrichment for trait-associated signals is present if the degree of deflection from the expected null line is dependent on the level of the RES. Specifically, the SNPs with higher RES showed a greater degree of deflection from the expected null line.

False discovery rate

The ‘enrichment’ seen in the Q-Q plots (i.e., the leftward deflection from the null line) can be directly interpreted in terms of False Discovery Rate (FDR). To reduce the effect of correlation among SNPs in FDR estimation, SNPs were randomly pruned at LD r ² < 0.2. For full details of FDR estimation, please see previous papers^48,57 and Supplementary Information.

Parametric model

The shape of the empirical distributions depicted in the Q-Q plots resembles the shape of the distribution function of a mixture of Weibull and chi-square distributions. So, for each RES stratum we modeled the Q-Q curve with a function proportional to the distribution function of a Weibull-chi-square mixture to compute stratum-specific predicted FDR. We assumed different scale parameters for the two component distributions. Further, an exploratory analysis showed that a value of 0.5 is a reliable choice for the shape parameter of the Weibull component. Keeping the shape parameter fixed at 0.5, the unknown parameters of the mixture were estimated by maximizing a cost function using unconstrained nonlinear optimization, where the cost function is proportional to the logarithm of the likelihood function of the parameters given the observed SNP distribution. The dotted line in Fig. 1a gives a graphic presentation of the predicted Q-Q curve from the mixture distribution using estimated parameters. The predicted TDR curve in each stratum is generated from the corresponding Q-Q curve and 1-FDR.

Lookup table

We used heat maps to illustrate lookup tables to visualize variations of FDR across and within RES strata shown in Supplementary Fig. S1. Unconditional FDR is denoted as FDR obtained from the predicted Q-Q curve of all SNPs estimated by using Weibull-chi-square mixture distributions based on the 10,000-bin empirical quantile. Specifically, for each SNP the unconditional FDR value was obtained by linear interpolation from the predicted FDR values of 10,000 bins and illustrated by an unconditional FDR lookup table (Supplementary Fig. S1a) corresponding to variations of P values. Conditional FDR values⁴⁸ were generated by bilinear interpolation from the predicted FDR values of 10,000 bins across three strata and displayed in a conditional lookup table (Supplementary Fig. S1b) reflecting RES strata against nominal P values. The values of FDR in terms of −log₁₀(FDR) are illustrated by gradient colors in the lookup tables with color bars. Smooth gradients indicate good interpolation for FDR estimate of each SNP and colors varied from dark to light show enrichment improved by increasing RES.

Manhattan plot

To illustrate the localization of the genetic markers associated with putamen conditional on RES, we constructed a ‘RES-stratified Manhattan plot’ by plotting all SNPs within an LD block in relation to their chromosomal location. All SNPs without pruning are shown as individual points but only the most significant SNP with respect to conditional FDR in each LD block is illustrated with its gene name in the plot. In each LD block, FDR values of SNPs were ranked in ascending order and SNPs that have high LD (r ² > 0.2) with top SNPs were then removed. Thus, we retained the most significant SNP associated with putamen in each LD block. The large points and small points represent significant (FDR < 0.05) and non-significant SNPs, respectively. Two colors, red and black, denote signals from conditional and unconditional FDR, respectively. The red gene names denote the loci with FDR < 0.05.

Method comparison using fgwas

To compare with other methods incorporating annotation information, we performed an additional analysis using fgwas⁴⁹ (https://github.com/joepickrell/fgwas) which incorporates multiple functional annotations to inform GWAS. This method calculates the posterior probability that any given SNP is causal based on an empirical Bayes approach. We included eight annotation categories (exon, intron, 5′UTR, 3′UTR, 1 and 10 kb and 1 and 10 kb downstream) which is identical to RES in our analysis.

References

Kremen, W. S. et al. Genetic and environmental influences on the size of specific brain regions in midlife: The VETSA MRI study. Neuroimage 49, 1213–1223 (2010).
Article PubMed Google Scholar
den Braber, A. et al. Heritability of subcortical brain measures: a perspective for future genome-wide association studies. Neuroimage 83, 98–102 (2013).
Article Google Scholar
Hibar, D. P. et al. Common genetic variants influence human subcortical brain structures. Nature 520, 224–229 (2015).
Article ADS CAS PubMed PubMed Central Google Scholar
Blokland, G. A., de Zubicaray, G. I., McMahon, K. L. & Wright, M. J. Genetic and environmental influences on neuroimaging phenotypes: a meta-analytical perspective on twin imaging studies. Twin Res Hum Genet 15, 351–371 (2012).
Article PubMed PubMed Central Google Scholar
Toro, R. et al. Genomic architecture of human neuroanatomical diversity. Mol Psychiatry 20, 1011–1016 (2015).
Article CAS PubMed Google Scholar
Bolam, J. P., Hanley, J. J., Booth, P. A. & Bevan, M. D. Synaptic organisation of the basal ganglia. J Anat 196(Pt 4), 527–542 (2000).
Article CAS PubMed PubMed Central Google Scholar
Robbins, T. W. & Everitt, B. J. Neurobehavioural mechanisms of reward and motivation. Curr Opin Neurobiol 6, 228–236 (1996).
Article CAS PubMed Google Scholar
Balleine, B. W., Delgado, M. R. & Hikosaka, O. The role of the dorsal striatum in reward and decision-making. J Neurosci 27, 8161–8165 (2007).
Article CAS PubMed Google Scholar
Nestler, E. J. & Carlezon, W. A. Jr. The mesolimbic dopamine reward circuit in depression. Biol Psychiatry 59, 1151–1159 (2006).
Article CAS PubMed Google Scholar
Groenewegen, H. J., Berendse, H. W. & Haber, S. N. Organization of the output of the ventral striatopallidal system in the rat: ventral pallidal efferents. Neuroscience 57, 113–142 (1993).
Article CAS PubMed Google Scholar
Carpenter, W. T. Jr. & Davis, J. M. Another view of the history of antipsychotic drug discovery and development. Mol Psychiatry 17, 1168–1173 (2012).
Article CAS PubMed Google Scholar
Phillips, M. L., Drevets, W. C., Rauch, S. L. & Lane, R. Neurobiology of emotion perception II: Implications for major psychiatric disorders. Biol Psychiatry 54, 515–528 (2003).
Article PubMed Google Scholar
Phillips, M. L., Drevets, W. C., Rauch, S. L. & Lane, R. Neurobiology of emotion perception I: The neural basis of normal emotion perception. Biol Psychiatry 54, 504–514 (2003).
Article PubMed Google Scholar
Phillips, M. L., Ladouceur, C. D. & Drevets, W. C. A neural model of voluntary and automatic emotion regulation: implications for understanding the pathophysiology and neurodevelopment of bipolar disorder. Mol Psychiatry 13(829), 833–857 (2008).
Article Google Scholar
Chen, C. H., Suckling, J., Lennox, B. R., Ooi, C. & Bullmore, E. T. A quantitative meta-analysis of fMRI studies in bipolar disorder. Bipolar Disord 13, 1–15 (2011).
Article CAS PubMed Google Scholar
Visscher, P. M., Brown, M. A., McCarthy, M. I. & Yang, J. Five years of GWAS discovery. Am J Hum Genet 90, 7–24 (2012).
Article CAS PubMed PubMed Central Google Scholar
Welter, D. et al. The NHGRI GWAS Catalog, a curated resource of SNP-trait associations. Nucleic Acids Res 42, D1001–1006 (2014).
Article CAS PubMed Google Scholar
Collins, F. Has the revolution arrived? Nature 464, 674–675 (2010).
Article ADS CAS PubMed PubMed Central Google Scholar
Manolio, T. A. et al. Finding the missing heritability of complex diseases. Nature 461, 747–753 (2009).
Article ADS CAS PubMed PubMed Central Google Scholar
Schork, A. J. et al. All SNPs are not created equal: genome-wide association studies reveal a consistent pattern of enrichment among functionally annotated SNPs. PLoS Genet. 9, e1003449 (2013).
Article CAS PubMed PubMed Central Google Scholar
Wang, Y. et al. Leveraging Genomic Annotations and Pleiotropic Enrichment for Improved Replication Rates in Schizophrenia GWAS. PLoS Genet 12, e1005803 (2016).
Article PubMed PubMed Central CAS Google Scholar
Torkamani, A., Scott-Van Zeeland, A. A., Topol, E. J. & Schork, N. J. Annotating individual human genomes. Genomics 98, 233–241 (2011).
Article CAS PubMed PubMed Central Google Scholar
Yang, J. et al. Genome partitioning of genetic variation for complex traits using common SNPs. Nat Genet 43, 519–525 (2011).
Article CAS PubMed PubMed Central Google Scholar
Finucane, H. K. et al. Partitioning heritability by functional annotation using genome-wide association summary statistics. Nat Genet, (2015).
Gusev, A. et al. Partitioning heritability of regulatory and cell-type-specific variants across 11 common diseases. Am J Hum Genet 95, 535–552 (2014).
Article CAS PubMed PubMed Central Google Scholar
Consortium, E. P. An integrated encyclopedia of DNA elements in the human genome. Nature 489, 57–74 (2012).
Article ADS CAS Google Scholar
Hindorff, L. A. et al. Potential etiologic and functional implications of genome-wide association loci for human diseases and traits. Proc Natl Acad Sci USA 106, 9362–9367 (2009).
Article ADS CAS PubMed PubMed Central Google Scholar
Stephens, M. & Balding, D. J. Bayesian statistical methods for genetic association studies. Nat Rev Genet 10, 681–690 (2009).
Article CAS PubMed Google Scholar
Eskin, E. Increasing power in association studies by using linkage disequilibrium structure and molecular function as prior information. Genome Res 18, 653–660 (2008).
Article CAS PubMed PubMed Central Google Scholar
Potkin, S. G. et al. Gene discovery through imaging genetics: identification of two novel genes associated with schizophrenia. Mol Psychiatry 14, 416–428 (2009).
Article CAS PubMed Google Scholar
Agarwala, K. L. et al. Cloning and functional characterization of DSCAML1, a novel DSCAM-like cell adhesion molecule that mediates homophilic intercellular adhesion. Biochem Biophys Res Commun 285, 760–772 (2001).
Article CAS PubMed Google Scholar
Eubanks, J. H. et al. Structure and linkage of the D2 dopamine receptor and neural cell adhesion molecule genes on human chromosome 11q23. Genomics 14, 1010–1018 (1992).
Article CAS PubMed Google Scholar
Li, H. L. et al. Dscam mediates remodeling of glutamate receptors in Aplysia during de novo and learning-related synapse formation. Neuron 61, 527–540 (2009).
Article CAS PubMed PubMed Central Google Scholar
Willemsen, M. H. et al. GATAD2B loss-of-function mutations cause a recognisable syndrome with intellectual disability and are associated with learning deficits and synaptic undergrowth in Drosophila. J Med Genet 50, 507–514 (2013).
Article CAS PubMed Google Scholar
de Ligt, J. et al. Diagnostic exome sequencing in persons with severe intellectual disability. N Engl J Med 367, 1921–1929 (2012).
Article PubMed CAS Google Scholar
Dango, S. et al. DNA unwinding by ASCC3 helicase is coupled to ALKBH3-dependent DNA alkylation repair and cancer cell proliferation. Mol Cell 44, 373–384 (2011).
Article CAS PubMed PubMed Central Google Scholar
Wagner, D. S., Gan, L. & Klein, W. H. Identification of a differentially expressed RNA helicase by gene trapping. Biochem Biophys Res Commun 262, 677–684 (1999).
Article CAS PubMed Google Scholar
Nalls, M. A. et al. Large-scale meta-analysis of genome-wide association data identifies six new risk loci for Parkinson’s disease. Nat Genet 46, 989–993 (2014).
Article CAS PubMed PubMed Central Google Scholar
Sailer, A. et al. A genome-wide association study in multiple system atrophy. Neurology 87, 1591–1598 (2016).
Article CAS PubMed PubMed Central Google Scholar
Kalia, L. V. & Lang, A. E. Parkinson’s disease. Lancet 386, 896–912 (2015).
Article CAS PubMed Google Scholar
Wenning, G. K. & Krismer, F. Multiple system atrophy. Handb Clin Neurol 117, 229–241 (2013).
Article PubMed Google Scholar
Schork, A. J., Wang, Y., Thompson, W. K., Dale, A. M. & Andreassen, O. A. New statistical approaches exploit the polygenic architecture of schizophrenia–implications for the underlying neurobiology. Curr Opin Neurobiol 36, 89–98 (2016).
Article CAS PubMed Google Scholar
Andreassen, O. A., Thompson, W. K. & Dale, A. M. Boosting the power of schizophrenia genetics by leveraging new statistical tools. Schizophr Bull 40, 13–17 (2014).
Article PubMed Google Scholar
Desikan, R. S. et al. Polygenic Overlap Between C-Reactive Protein, Plasma Lipids, and Alzheimer Disease. Circulation 131, 2061–2069 (2015).
Article CAS PubMed PubMed Central Google Scholar
Le Hellard, S. et al. Identification of Gene Loci That Overlap Between Schizophrenia and Educational Attainment. Schizophr Bull, (2016).
Smeland, O. B. et al. Genetic Overlap between Schizophrenia and Volumes of Hippocampus, Putamen and Intracranial Volume Indicates Shared Molecular Genetic Mechanisms. Schizophr Bul. epub ahead of print, https://doi.org/10.1093/schbul/sbx148 (2017).
Sun, L., Craiu, R. V., Paterson, A. D. & Bull, S. B. Stratified false discovery control for large-scale hypothesis testing with application to genome-wide association studies. Genet Epidemiol 30, 519–530 (2006).
Article PubMed Google Scholar
Andreassen, O. A. et al. Improved detection of common variants associated with schizophrenia by leveraging pleiotropy with cardiovascular-disease risk factors. Am. J. Hum. Genet. 92, 197–209 (2013).
Article CAS PubMed PubMed Central Google Scholar
Pickrell, J. K. Joint analysis of functional genomic data and genome-wide association studies of 18 human traits. Am J Hum Genet 94, 559–573 (2014).
Article CAS PubMed PubMed Central Google Scholar
Kang, H. M. et al. Variance component model to account for sample structure in genome-wide association studies. Nat Genet 42, 348–354 (2010).
Article CAS PubMed PubMed Central Google Scholar
Yang, J., Zaitlen, N. A., Goddard, M. E., Visscher, P. M. & Price, A. L. Advantages and pitfalls in the application of mixed-model association methods. Nat Genet 46, 100–106 (2014).
Article PubMed PubMed Central CAS Google Scholar
Fridley, B. L. et al. Bayesian mixture models for the incorporation of prior knowledge to inform genetic association studies. Genet Epidemiol 34, 418–426 (2010).
Article PubMed PubMed Central Google Scholar
Gagliano, S. A., Barnes, M. R., Weale, M. E. & Knight, J. A Bayesian method to incorporate hundreds of functional characteristics with association evidence to improve variant prioritization. PLoS One 9, e98122 (2014).
Article ADS PubMed PubMed Central CAS Google Scholar
Knight, J., Barnes, M. R., Breen, G. & Weale, M. E. Using functional annotation for the empirical determination of Bayes Factors for genome-wide association study analysis. PLoS One 6, e14808 (2011).
Article ADS CAS PubMed PubMed Central Google Scholar
Darnell, G., Duong, D., Han, B. & Eskin, E. Incorporating prior information into association studies. Bioinformatics 28, i147–153 (2012).
Article PubMed PubMed Central Google Scholar
Kindt, A. S., Navarro, P., Semple, C. A. & Haley, C. S. The genomic signature of trait-associated variants. BMC genomics 14, 108 (2013).
Article CAS PubMed PubMed Central Google Scholar
Andreassen, O. A. et al. Improved detection of common variants associated with schizophrenia and bipolar disorder using pleiotropy-informed conditional false discovery rate. PLoS Genet. 9, e1003455 (2013).
Article CAS PubMed PubMed Central Google Scholar
Zablocki, R. W. et al. Covariate-modulated local false discovery rate for genome-wide association studies. Bioinformatics 30, 2098–2104 (2014).
Article CAS PubMed PubMed Central Google Scholar
Efron, B. Large-Scale Inference: Empirical Bayes Methods for Estimation, Testing, and Prediction (Cambridge University Press, 2010).
International Schizophrenia, C. et al. Common polygenic variation contributes to risk of schizophrenia and bipolar disorder. Nature 460, 748–752 (2009).
Google Scholar
Li, Y., Willer, C. J., Ding, J., Scheet, P. & Abecasis, G. R. MaCH: using sequence and genotype data to estimate haplotypes and unobserved genotypes. Genet Epidemiol 34, 816–834 (2010).
Article PubMed PubMed Central Google Scholar
Howie, B., Fuchsberger, C., Stephens, M., Marchini, J. & Abecasis, G. R. Fast and accurate genotype imputation in genome-wide association studies through pre-phasing. Nat Genet 44, 955-+, (2012).
Abecasis, G. R., Cherny, S. S., Cookson, W. O. & Cardon, L. R. Merlin–rapid analysis of dense genetic maps using sparse gene flow trees. Nat Genet 30, 97–101 (2002).
Article CAS PubMed Google Scholar
Willer, C. J., Li, Y. & Abecasis, G. R. METAL: fast and efficient meta-analysis of genomewide association scans. Bioinformatics 26, 2190–2191 (2010).
Article CAS PubMed PubMed Central Google Scholar
Hsu, F. et al. The UCSC Known Genes. Bioinformatics 22, 1036–1046 (2006).
Article CAS PubMed Google Scholar
Wood, A. R. et al. Defining the role of common variation in the genomic and biological architecture of adult human height. Nat. Genet. 46, 1173–1186 (2014).
Article CAS PubMed PubMed Central Google Scholar
Chen, C. H. et al. Large-scale genomics unveil polygenic architecture of human cortical surface area. Nat Commun 6, 7549 (2015).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

This project was funded by National Institute of Health R01MH100351and R01GM104400, NARSAD Young Investigator award, KG Jebsen Stiftelsen, Research Council of Norway (229129, 223273, and a FRIPRO Mobility Grant, 251134) and The South-East Norway Regional Health Authority (2013-123, 2016-064; 2017-004). The FRIPRO Mobility grant scheme (FRICON) is co-funded by the European Union’s Seventh Framework Programme for research, technological development and demonstration under Marie Curie grant agreement no. 608695, and funding from the European Community’s Seventh Framework Programme (FP7/2007-2013) under grant agreement n°602450.

Author information

Authors and Affiliations

Center for Multimodal Imaging and Genetics, Department of Radiology, University of California, San Diego, La Jolla, California, 92093, USA
Chi-Hua Chen, Min-Tzu Lo, Andrew Schork, Chun-Chieh Fan, Karolina Kauppi, Nilotpal Sanyal & Anders M. Dale
Department of Neurosciences, University of California, San Diego, La Jolla, California, 92093, USA
Yunpeng Wang, Dominic Holland, Olav B. Smeland & Anders M. Dale
NORMENT, KG Jebsen Centre for Psychosis Research, Institute of Clinical Medicine, University of Oslo and Division of Mental Health and Addiction, Oslo University Hospital, Oslo, Norway
Yunpeng Wang, Olav B. Smeland & Ole A. Andreassen
Department of Cognitive Science, University of California, San Diego, La Jolla, California, 92093, USA
Andrew Schork & Chun-Chieh Fan
Department of Radiation Sciences, Umea University, Umea, Sweden
Karolina Kauppi
Department of Medical Genetics, Oslo University Hospital, Oslo, Norway
Srdjan Djurovic
NORMENT, KG Jebsen Centre for Psychosis Research, Department of Clinical Science, University of Bergen, Bergen, Norway
Srdjan Djurovic
Imaging Genetics Center, Mark and Mary Stevens Neuroimaging & Informatics Institute, Keck School of Medicine of the University of Southern California, Marina del Rey, California, 90027, USA
Derrek P. Hibar & Paul M. Thompson
Division of Biostatistics, Department of Family Medicine and Public Health, University of California, San Diego, La Jolla, California, 92093, USA
Wesley K. Thompson
Department of Psychiatry, University of California, SanDiego, La Jolla, California, 92093, USA
Anders M. Dale

Authors

Chi-Hua Chen
View author publications
You can also search for this author in PubMed Google Scholar
Yunpeng Wang
View author publications
You can also search for this author in PubMed Google Scholar
Min-Tzu Lo
View author publications
You can also search for this author in PubMed Google Scholar
Andrew Schork
View author publications
You can also search for this author in PubMed Google Scholar
Chun-Chieh Fan
View author publications
You can also search for this author in PubMed Google Scholar
Dominic Holland
View author publications
You can also search for this author in PubMed Google Scholar
Karolina Kauppi
View author publications
You can also search for this author in PubMed Google Scholar
Olav B. Smeland
View author publications
You can also search for this author in PubMed Google Scholar
Srdjan Djurovic
View author publications
You can also search for this author in PubMed Google Scholar
Nilotpal Sanyal
View author publications
You can also search for this author in PubMed Google Scholar
Derrek P. Hibar
View author publications
You can also search for this author in PubMed Google Scholar
Paul M. Thompson
View author publications
You can also search for this author in PubMed Google Scholar
Wesley K. Thompson
View author publications
You can also search for this author in PubMed Google Scholar
Ole A. Andreassen
View author publications
You can also search for this author in PubMed Google Scholar
Anders M. Dale
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

C.-H.C., Y.W. and A.M.D. designed the study. C.-H.C., T.W. and M.-T.L. analysed data and wrote the manuscript. D.P.H. and P.M.T. provided the ENIGMA data. A.S., C.-C.F., D.H., K.K., O.B.S., S.D., N.S., W.K.T. and O.A.A. contributed to manuscript preparation. All authors commented on and approved the manuscript.

Corresponding author

Correspondence to Chi-Hua Chen.

Ethics declarations

Competing Interests

The authors declare that they have no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplementary information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Chen, CH., Wang, Y., Lo, MT. et al. Leveraging genome characteristics to improve gene discovery for putamen subcortical brain structure. Sci Rep 7, 15736 (2017). https://doi.org/10.1038/s41598-017-15705-x

Download citation

Received: 24 July 2017
Accepted: 31 October 2017
Published: 16 November 2017
DOI: https://doi.org/10.1038/s41598-017-15705-x

This article is cited by

Genetic architecture of brain age and its causal relations with brain and mental disorders
- Esten H. Leonardsen
- Didac Vidal-Piñeiro
- Yunpeng Wang
Molecular Psychiatry (2023)
Male-specific, replicable and functional roles of genetic variants and cerebral gray matter volumes in ADHD: a gene-wide association study across KTN1 and a region-wide functional validation across brain
- Xingguang Luo
- Xiandong Lin
- Chiang-Shan R. Li
Child and Adolescent Psychiatry and Mental Health (2023)
Genome-wide association analysis of 19,629 individuals identifies variants influencing regional brain volumes and refines their genetic co-architecture with cognitive and mental health traits
- Bingxin Zhao
- Tianyou Luo
- Hongtu Zhu
Nature Genetics (2019)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Multi-ancestry eQTL meta-analysis of human brain identifies candidate causal variants for brain-related traits

A significant, functional and replicable risk KTN1 variant block for schizophrenia

Polygenic risk for schizophrenia and subcortical brain anatomy in the UK Biobank cohort

Introduction

Results

The Q-Q plot of putamen stratified by relative enrichment scores

True discovery rate (TDR)

Predicted stratified Q-Q plot and TDR

Lookup table

P value and conditional FDR results

Method comparison: fgwas

Discussion

Methods

Participant samples

Putamen structural measure

Genotyping and imputation

Genome-wide association analysis

Meta-analysis of genome-wide association results from substudies

LD-weighted genic annotation

Relative enrichment score (RES)

Stratified Q-Q plots and enrichment

False discovery rate

Parametric model

Lookup table

Manhattan plot

Method comparison using fgwas

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing Interests

Additional information

Electronic supplementary material

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Genetic architecture of brain age and its causal relations with brain and mental disorders

Male-specific, replicable and functional roles of genetic variants and cerebral gray matter volumes in ADHD: a gene-wide association study across KTN1 and a region-wide functional validation across brain

Genome-wide association analysis of 19,629 individuals identifies variants influencing regional brain volumes and refines their genetic co-architecture with cognitive and mental health traits

Comments

Search

Quick links