Large consortia have revealed hundreds of genetic loci associated with anthropometric traits, one trait at a time. We examined whether genetic variants affect body shape as a composite phenotype that is represented by a combination of anthropometric traits. We developed an approach that calculates averaged PCs (AvPCs) representing body shape derived from six anthropometric traits (body mass index, height, weight, waist and hip circumference, waist-to-hip ratio). The first four AvPCs explain >99% of the variability, are heritable, and associate with cardiometabolic outcomes. We performed genome-wide association analyses for each body shape composite phenotype across 65 studies and meta-analysed summary statistics. We identify six novel loci: LEMD2 and CD47 for AvPC1, RPS6KA5/C14orf159 and GANAB for AvPC3, and ARL15 and ANP32 for AvPC4. Our findings highlight the value of using multiple traits to define complex phenotypes for discovery, which are not captured by single-trait analyses, and may shed light onto new pathways.
Large-scale meta-analyses of genome-wide association studies (GWAS) have identified numerous loci for anthropometric traits, including more than 600 loci for height1,2,3 and over 160 loci for obesity-related outcomes, predominantly for commonly available traits such as body mass index (BMI)2 and waist-to-hip ratio (WHR)4,5, but also for body fat percentage6, childhood obesity7 and extreme and early onset obesity7,8,9. While GWAS-meta-analyses have successfully revealed new loci, so far, all these studies have focused on one single anthropometric trait at a time and may not adequately capture differences in body shape between individuals who are similar in one trait but different in others. For example, two individuals may have the same BMI, but their WHR and/or height can differ substantially, so that each has a different body shape, which may translate into differences in disease risk10,11. Several loci identified from previous single-trait GWAS on BMI, BMI-adjusted WHR (WHRadjBMI) and height are associated with more than one anthropometric trait1,2,4,12. For example, the loci near MC4R and near POMC/ADCY3 are each associated with BMI and height. However, the BMI-increasing allele of the near-MC4R locus is associated with increased height, whereas the BMI-increasing allele of the near-POMC/ADCY3 locus is associated with reduced height1,2. Thus, these loci are likely each associated with a more comprehensive body shape phenotype that is not captured by current GWAS that only consider anthropometric traits individually.
In recent years, several approaches have been developed to examine whether single-nucleotide polymorphisms (SNPs) influence multiple correlated traits associated with disease13,14. However, most approaches test phenotypes separately and are thus subject to multiple testing penalties that ultimately reduce the statistical power to detect genotype–phenotype relationships among correlated traits. One way forward is to apply a dimension reduction method to the traits of interest, such as principal component analysis (PCA) that combines multiple correlated traits into a set of uncorrelated outcomes principal components(principal components (PCs))15,16. This method is very appealing to capture a composite phenotype, such as body shape. To date, no large-scale GWAS meta-analyses have been reported that aim to identify genetic loci associated with body shape based on simultaneous analysis of multiple anthropometric traits using PCA methods.
Therefore, the purpose of our study was twofold. First, we aimed to capture body shape in its multi-dimensional structure using PCs from several commonly available anthropometric traits. To allow the meta-analysis of summary statistics across a large number of cohorts, we developed an approach that calculates averaged PCs (AvPCs) that robustly represent body shape across a wide range of studies. Second, using this approach, we aimed to identify genetic loci associated with body shape based on the AvPCs in 65 studies of the GIANT Consortium, including >170,000 individuals.
Defining composite phenotypes of body shape
As basis for our analysis of body shape we used six anthropometric traits: BMI, WHR, height, weight, hip and waist circumference. First, we performed separate PCA in a subset of 20 large population-based studies (up to 82,355 individuals, Supplementary Table 1) and compared the loadings of the anthropometric traits in each PC between studies. Visual inspection of PCA loadings showed high concordance across studies (Supplementary Fig. 1) and between men and women. Between-study variation in variance explained by the PCs was small (Supplementary Fig. 1, Supplementary Table 2). On average, the first four PCs explained more than 99% of the variance (Fig. 1, Supplementary Table 2), and were therefore pursued as body shape outcomes for our gene-discovery effort. Given the across-study stability of PCs, we derived average loadings that were calculated as weighted means of loadings from all 20 population-based studies that were analysed in this step. We used these average loadings to calculate average principal components (AvPCs) as targets in each of the GWAS included in the first and second stage. In other words, the phenotypes used for genome-wide association were constructed in a consistent way across studies, such that the summary statistics could be meta-analysed.
Each AvPC represents a specific composition of the six anthropometric traits and thus captures a specific aspect of body shape (Fig. 1). The first AvPC, which explains on average 64.4% of the variation in all traits, shows high loadings for all traits, except for height. The loadings are in the same direction; meaning that the AvPC captures inter-individual variation in either increased or decreased BMI, weight, WHR, hip and waist circumference. Therefore, variation in this PC seems to predominantly capture overall adiposity. The second AvPC, which explains 18.5% of the variation, is characterized by particularly high but opposite loadings on height and WHR. In other words, AvPC2 captures variation in a composite phenotype that represents tall individuals with a small WHR or, vice versa, short individuals with a large WHR. The third AvPC, explaining 13.8% of the variation, also shows predominantly high loadings on height and WHR but in the same direction, with an opposite loading of nearly the same size on hip circumference. Given these loadings, AvPC3 discriminates mainly between tall individuals with a high WHR resulting from a smaller hip circumference on one extreme and short individuals with low WHR, and a larger hip circumference on the other extreme. The fourth AvPC explains on average 3% and is harder to interpret. It displays high loadings on BMI and body weight, and opposite loadings of a similar size on hip and waist circumference. These could be interpreted as a phenotype ranging between high BMI and weight, with relatively small hip and waist circumference on the one hand and low BMI and weight but large waist and hip circumference on the other hand.
Consistent with the individual anthropometric traits, the four AvPCs that describe body shape are also heritable. Using data from four isolate populations (n=4,000), we estimated that AvPC2 has the highest heritability (75–80%), consistent with the fact that height is the main contributing trait to this AvPC with a strong genetic component1. The heritability of AvPC1 (35–50%), AvPC3 (50–75%) and AvPC4 (25–50%) were moderately high and similar to the heritability for individual anthropometric traits17 (Supplementary Fig. 2). From a clinical perspective, each of the four AvPCs exhibit known correlations with cardio-metabolic traits (Supplementary Fig. 3), including diastolic blood pressure, systolic blood pressure, total cholesterol, low-density lipoprotein cholesterol, high-density lipoprotein cholesterol and total triglycerides levels.
Genomic discovery of body shape composite phenotypes
We performed a two-staged meta-analysis to identify genetic loci that are associated with the four AvPCs (Supplementary Table 3, Supplementary Table 4). In the first stage, a meta-analysis of 43 studies with imputed genome-wide SNP data including more than 133,000 individuals identified SNPs in 385 loci across the four AvPCs (56 loci for AvPC1, 205 for AvPC2, 89 for AvPC3 and 35 for AvPC4) that showed promising association (P value<5 × 10−6) for at least one of the four AvPCs (Fig. 2, Supplementary Fig. 4). Lead SNPs (and proxies; see ‘Methods’ section) of each locus were taken forward for validation in a second stage, including data from more than 39,900 individuals from 22 studies of which 12 studies had genotypes from the Illumina CardioMetabochip and 10 studies had imputed genome-wide SNP data. In the combined analyses, consisting of the first and second stage studies, the association of 207 of the 385 loci reached genome-wide significance (P value <5 × 10−8) (31 for AvPC1, 124 for AvPC2, 45 for AvPC3 and 7 for AvPC4; Fig. 2, Fig. 3, Supplementary Fig. 4, Supplementary Table 6), of which 16 loci were identified for two AvPCs and one showed significant association with three AvPCs (Supplementary Fig. 7, Supplementary Table 5) resulting in a total of 189 loci with association to at least one AvPC. To determine whether the loci we identified were independent of the loci previously found for BMI, WHRadjBMI and height, we performed conditional analyses on SNPs reported in previous GIANT-GWAS publications on BMI, WHRadjBMI and height1,2,4,5,18,19. A locus was considered independent of reported findings if the P value in the analyses conditioned on all previously identified loci remained suggestive (P value <5 × 10−6). In total, 183 loci had already been established for BMI, WHRadjBMI or height (Fig. 3, Supplementary Fig. 7), whereas six loci had not previously been identified for association with conventional anthropometric traits; two for AvPC1, two for AvPC3 and two for AvPC4 (Table 1, local association plots given in Supplementary Fig. 5). For these six novel loci, the results of the lead SNPs were checked in previously performed GWAS meta-analyses on anthropometric and cardio-metabolic traits (Supplementary Table 7).
Results for AvPC1
For AvPC1, we identified 31 genome-wide significant loci, of which two were novel (upstream of LEMD2 and CD47). Of the 29 previously established loci, 24 have been associated with BMI only18, 3 with height only1,3, while two loci have been reported for associations with both BMI and height3,18 (Fig. 3a). While both novel loci showed some evidence of association with BMI in the latest GIANT–GWAS (n>339,000; P<7.2 × 10−3; Table 1), they did not reach genome-wide significance. The lead SNP (rs943466) 7 kb upstream of LEMD2 has been reported to be associated with expression of LEMD2 in liver (P=1.66 × 10−9) 20,21. Another variant in LEMD2 (rs2296743 at 8 kb from our lead SNP rs943466; r2=0.2, D′=1.0) was previously reported for its promising association (P value=8 × 10−6) with energy intake at dinner in a small GWAS of 815 Hispanic children22. The lead SNP (rs7640424) for the second novel locus was located in an enhancer region 10 kb upstream of CD47 (refs 23, 24), which encodes a membrane protein that might be involved in signal transduction and membrane transport25. No genome-wide significant associations have been reported for the lead SNP or other SNPs in the CD47 gene before23,24,25. However, a recent study revealed a link to diet-induced obesity in mice and suggests CD47 as a potential drug-target to combat obesity and metabolic complications26,27.
Results for AvPC2
For AvPC2, we identified no novel loci. Almost all (n=122) of the 124 loci associated with AvPC2 had previously been identified for height1 (Fig. 3b), which is consistent with AvPC2’s high loadings on height and opposite loadings on WHR. Of these 122 loci, 103 were reported for association to height only, whereas of the 19 remaining loci, 4 were previously associated with height, BMI and WHRadjBMI, 2 loci were reported for height and BMI and 13 loci overlapped with height and WHR. The two AvPC2 loci that did not associate with height were previously identified for WHRadjBMI19.
Results for AvPC3
We identified 45 loci that reached genome-wide significance for AvPC3, of which 2 were novel. Consistent with the loadings of AvPC3, 43 of the associated loci had been reported before for height1 or WHR4,19 (Fig. 3c). The lead SNP of the first novel locus rs7492628, upstream of the genes RPS6KA5 (>20 kb) and C14orf159 (>30 kb), failed to reach genome-wide significance in previous WHRadjBMI GWAS (P value=9.3 × 10−8) and was nominally associated with extreme obesity risk (P value=7.26 × 10−5) 28. The lead SNP of the other novel locus, GANAB, rs7949030, showed some evidence of association with WHRadjBMI in the latest GIANT GWAS (P value=3.3 × 10−6) and was reported to be an eQTL for several other genes21: In monocytes, regulation of MIR3654, EEF1G, EML3, BSCL2, HNRNPUL2-BSCL2, LRRN4CL was found29,30,31. BSCL2 is of interest, as it is a known candidate gene for the most severe lipodystrophy phenotype32. In blood rs7949030 was found to be an eQTL of HNRNPUL2-BSCL2, AHNAK, LRRN4CL and INTS5 (refs 33, 34), while in skin and adipocytes it was found as an eQTL for EML3 (refs 30, 31, 35).
Results for AvPC4
Seven loci were identified for AvPC4, of which five had been previously reported; one for BMI and height, one for WHR and height, one for height only and two for WHR only1,3,4,36 (Fig. 3). The lead SNPs of the two novel loci identified with AvPC4 were both intronic, in ARL15 and ANP32. The allele associated with increased AvPC4 of the lead SNP (rs4865796) in ARL15 was moderately associated with higher BMI (P value=1.6 × 10−4), increased adiponectin levels (P value=4.2 × 10−6 ADIPOGEN37) and decreased risk of diabetes (P value=1.8 × 10−5, DIAGRAM38). This SNP was associated with fasting insulin (rs4865796, P=2.1 × 10−8 and 2.2 × 10−12 after adjustment for BMI39). Other nearby SNPs in high linkage disequilbrium (LD), have previously been reported for associations with BMI-adjusted adiponectin levels (rs6450176/rs4311394, r2=0.087, D′=0.87 (refs 37, 40)), high density lipoprotein C (HDL-C) levels (rs6450176 (refs 41, 42)) and risk of type 2 diabetes (rs702634, r2=1.0, D′=1.0 (ref. 38)). A duplication in ARL15, tagged by rs16992296) was previously found to be associated with increased risk of childhood obesity in European and African Americans43. However, this duplication is independent of the association we found for rs4865796-ARL15 and AvPC4, which is in low LD (r2EUR=0.065) with the duplication (represented by rs16992296), located 168 kb upstream. The lead SNP (rs7855432) of the second locus, ANP32B, was moderately associated with height (P value=5.5 × 10−6) 1. A SNP in high LD (rs4743150 r2=0.95, D′= 1.0) was reported to be promisingly associated with coronary heart disease risk (P value=5 × 10−6)44.
We developed a PCA-based approach to capture variation across multiple traits simultaneously in a uniform way across multiple studies. Resulting AvPCs are a robust cross-phenotype representation allowing their use in large-scale meta-analyses. We assessed this approach to capture body shape based on six individual anthropometric traits and identified six novel loci that were not identified before in much larger GWAS-meta-analyses for BMI, WHRadjBMI and height1,2,4. Our findings suggest that the body shape composite phenotype, assessed by AvPCs, represents information that is not fully captured by individual (anthropometric) traits. Application of this method to other related traits, for example, in immune disease, different types of cancer, cardiometabolic traits, or other correlated traits might comparably reveal new loci, and potentially new pathways, that have not been identified in single-trait GWAS.
The AvPCs are combinations of different anthropometric traits and therefore capture more complex body shape phenotypes than the single traits. AvPC1, representing overall adiposity, and AvPC2, representing height with respect to WHR, are the most important contributors to body shape, explaining on average more than 80% of the variation. More specific body shape types were captured by AvPC3 and AvPC4 and were defined by impact of height and WHR (AvPC3) or BMI, waist and hip (AvPC4). Our initial analyses demonstrated that the loadings are stable across studies, study designs and between men and women. Moreover, we have shown that the AvPCs are heritable traits and correlated with cardiometabolic traits and risk factors.
To further demonstrate the strength of this approach, we compared total variance explained of single traits and AvPCs by SNPs previously identified in single-trait GWAS (for BMI, WHRadjBMI, height1,2,4). For example, the 97 loci that have been reported for association in the latest BMI single-trait GWAS (N∼340,000) explain 8.7% of the variation in AvPC1, whereas they explained only 2.68% of the variation in BMI2. These data indicate that our PC-defined phenotype for overall body size (AvPC1) captures a more composite phenotype compared with BMI as a single-trait. Explaining more of the variance with the same genetic variants as previous single-trait studies in our composite phenotype shows promise to update and inform existing methods.
So far, typical GWAS have tested for association of genetic variants with anthropometric traits, one trait at a time. We define ‘body shape’ as a composite of multiple traits defined by PCs. We first performed PC-analyses in representative population-based studies and averaged PC loadings across these studies (AvPCs). We subsequently use these AvPCs to calculate PCs in all participating studies. This approach ensures that PCs are calculated in a uniform manner across all studies, thus facilitating subsequent meta-analyses. This approach could be applied to capture genetic variation across related traits that is currently not captured by single-traits GWAS (for example, in the context of autoimmune disease, blood traits, lipid levels, different cancers and so on.).
Consistent with published anthropometric traits10,11,17, the derived AvPCs are heritable and correlated with clinically relevant outcomes. We identified additional loci, despite a much smaller sample size compared with the latest single-trait GWAS analyses for BMI, height and WHRadjBMI1,2,4. This suggests that the AvPC method captures phenotype information that is not captured by single-trait analyses and associated loci may highlight biological pathways that are not revealed with single-trait associated loci only.
Even though our approach has several advantages, it is not meant to replace single trait GWAS analyses. A number of loci that were identified in the latest single-trait GWAS were not identified in our body shape GWAS; that is, we identified 124 loci (or 14.2%) of the 837 loci recently reported in the GIANT single-trait meta-analyses (Supplementary Fig. 6). This may be due to the fact that these recent single-trait GWAS meta-analyses were at least twice as large as the current body shape GWAS. However, even when we compare the number of identified loci in earlier GWAS meta-analyses, which are of similar size as the current body shape GWAS, we do not identify all previously reported loci for single traits. Perhaps this is most obvious with height (largely representative of AvPC2), where we only identified 91 (13.1%) of 697 loci identified for height. This is in part due to the fact that a conservative definition for linkage disequilibrium was applied (r2>0.8), lack of power due to sample size for SNPs of modest effects, or perhaps the AvPCs introduces noise to purely single traits such as height. Consistent with this finding, we also observe that some single traits also explain more of the variance of body shape compared with AvPCs. Our comparison of the variance explained between previous single-traits meta-GWAS and our AvPCs support this evidence for overlapping associated variants. Since AvPC2 represents largely a single trait, height, with large height loadings we were unable to explain more of the variance. In fact we explained less of the variance, which is likely due to noise introduced using this composite AvPCs phenotype. This observation is also evident for variance in body shape explained by height compared with AvPC3 and AvPC4, but is in contrast to BMI, a complex trait comprised of multiple anthropometric measurements, which explains less variance in body shape compared with AvPC3 and AvPC4. It is important to emphasize our approach is most informative for complex traits such as BMI that are derived from a series of other traits. We believe that using PC space to define complex traits is useful for the detection of loci involved in multiple pathways that might go undetected in a single trait setting.
We have developed a new strategy that applies a PCA approach in a meta-analysis setting to combine composite phenotypes in a harmonized way across multiple studies. We successfully applied this approach to anthropometric traits to capture body shape. The derived combined anthropometric traits (AvPCs) were shown to be heritable and correlated to cardio-metabolic traits. Large-scale GWAS meta-analyses of the AvPCs identified six new loci that were not identified by previous single-trait GWAS that were twice as large in samples size. This PCA approach could maximize gene discovery for other correlated traits, such as cancers, immune disease, hematologic traits and so on. and may identify genes that point towards shared physiological pathways.
In the first stage analyses, 43 studies participated (133,376 individuals) that had HapMap 2 imputed genome-wide data available. A subset of 20 studies with unrelated individuals was used for calculation of average loadings. Second stage analyses were performed in 10 studies (7,734 individuals) with genome-wide data that became available after the first stage and 12 studies (32,170 individuals) with Cardio-MetaboChip (by Illumina) data (number of included studies and individuals given in Supplementary Table 3). Details on study phenotypes, genotyping and imputation of each study are given in the Supplementary Tables 8 and 9, respectively.
All study participants gave written informed consent and ethic committees approved all studies. The ethic statement of each study is given in the study specific acknowledgements.
Calculation of average loadings
In 20 independent studies (Supplementary Table 1) with unrelated participants PCAs were performed on six anthropometric traits (BMI, height, hip, waist, weight and WHR). Each study performed a PCA on the standardized residuals of the anthropometric traits adjusted for age and gender. The same analyses were done for men and women separately with residuals adjusted for age only. The result of the PCA in each study is a set of six PCs that are orthogonal linear combinations of the six anthropometric traits. In other words each PC is a weighted sum of the six transformed anthropometric traits and independent of the other PCs. The weights of each trait per PC are called loadings. Each study also calculated the explained variance per PC. The loadings and explained variances were comparable for all studies (Supplementary Fig. 1 (1)).
With the intention to create phenotypes that are identically constructed in all studies, the results of single study PCAs were used to deduce the average loadings. This approach is reasonable as the loadings of the study specific PCAs were comparable. With the use of the single study correlation matrices a combined average correlation matrix was derived (weighted sum divided by number of individuals). This average correlation matrix is then used as basis for a PCA. The loadings that result from this PCA are called average loadings (Fig. 1a) and Supplementary Table 2). This was performed for men, women and all individuals combined, however ultimately we used combined loadings for primary results reported in the manuscript. Sex specific results are reported in the Supplementary Material. The average loadings and explained variance were comparable to the study specific loadings and explained variances (Supplementary Fig. 1).
Heritability of the avPCs was calculated within four population isolates, CROATIA-Vis (n=909), CROATIA-Korcula (n=842), CROATIA-Split (n=499) and ORCADES (n=866) using the ‘polygenic’ function of the GenABEL package for R 45.
Average principal components as body shape phenotype
The average loadings were used in each study to calculate the AvPCs in a standardized way. Therefore, the average loadings were distributed together with an R-script (http://www.r-project.org/) that calculated the AvPCs as linear combination of residuals of the study phenotypes with the use of the average loadings. This was done for men and women separately and additionally for combined in studies with relatedness structure. As the first four PCs explain on average more than 99% of the variance (Fig. 1b) we decided to limit all analyses to these four PCs.
Stage 1 analyses
GWAS on the first four AvPCs were calculated for men and women separately in studies of unrelated samples and combined for studies with related samples with an adjustment for study site when necessary. All studies of the first stage analyses used HapMap 2 imputed genome-wide data. GWAS results underwent extensive quality control and study-wise filtering (call rate >95%, P value (HWE)>10−6, imputation quality, minor allele count (MAC) >3). The meta analyses of GWAS results for the first four AvPCs we combined sex-stratified results for studies with unrelated individuals and unstratified GWAS results for studies with relatedness individuals. Meta analyses were performed with METAL 46 using fixed effects inverse variance-weighted method. Single study and the meta analysis P values were corrected by the genomic control inflation factor λ (meta analysis λ before correction: λ(PC1)=1.29, λ(PC2)= 1.407, λ(PC3)= 1.236, λ(PC4)= 1.136). Results were limited to SNPs that are in HapMap 2 and had results for more than 30,000 individuals. Heterogeneity analysis was performed with METAL. Each AvPC all SNPs with a promising P value (P value<5 × 10−6) were identified in combined analyses. To identify promising loci clustering (LD>0.01, distance <1,000 kb) with PLINK47 based on HapMap 2 genotypes was performed. All leading SNPs per clump for AvPCs were taken forward to second stage analyses and named promising SNPs in this manuscript.
Two SNPs that were promising for the first principal component had very low heterogeneity P values (rs10847678 (P value(het)=8.8 × 10−152), rs13296358 (P value(het)=5.4 × 10−67)). For both SNPs the effect was driven only by a single study and no other SNP in high LD had a promising P value. Therefore, these two SNPs were removed from further analyses.
Stage 2 Analyses
As mentioned above for second stage analyses a mixture of studies with genome-wide SNP data and MetaboChip genotypes was available. Some of the leading SNPs of the first stage analyses were not genotyped on the MetaboChip. To increase the power for all promising SNPs of each AvPC proxies were defined that were all SNPs close to promising SNPs (distance <500 kb), in high LD (LD>0.9) and available in more than 70% of the individuals of the second stage. Results of the second stage analyses underwent the same quality control as first stage results.
The combined analyses of all first and second stage GWAS was performed with METAL35 with inverse variance based method. Results for men and women were combined as described for the first stage meta-analyses. All promising loci for which at least one proxy had a genome-wide significant P value in the combined analysis were named genome-wide significant loci and the best SNP of the combined analyses (largest absolute beta) was reported as topSNP of this locus.
Novel loci - conditional analyses and look-ups in previous GIANT analyses
Two analyses were performed to distinguish between genome-wide significant body shape loci that are known from previous GWAS on BMI, height and WHR and novel body shape loci. First, conditional analyses were performed. We used the 226 reported topSNPs (32 BMI, 180 height, 14 WHR) of published GIANT analyses on BMI, height and WHR1,2,4 to perform conditional analyses of the first stage meta-analyses using GCTA15,48. The results of this analysis were then analysed conditioned on 843 topSNPs (97 BMI, 697 height, 49 WHR) of the published GIANT analyses1,2,4. To identify the overlap of the results for AvPCs with the single anthropometric traits, the same conditional analyses were performed for BMI, height and WHR separately. For calculation of the LD-structure genotype data from KORA F4 was used. Two topSNPs of the unpublished GIANT results had to be removed before analyses as they were in high correlation with two other topSNPs. If the body shape topSNPs were independent loci identified by previous GIANT analyses, the P value should stay promising (P value<5 × 10−6) in both conditional analyses. Second, we checked by look-ups if those genome-wide significant SNPs that are independent from the previously reported topSNPs were not genome-wide significant (P value>5 × 10−8) in GIANT analyses1,2,4.
Genome-wide significant SNPs are named novel SNPs if they fulfil the following conditions:
P value of conditioned analyses on topSNPs reported by previous GIANT analyses (on BMI, height, WHR) remained promising (P value<5 × 10−6).
P value in previous GIANT analyses (on BMI, height, WHR) was not genome-wide significant (P value>5 × 10−8).
For identification of potential pleiotropic effects several look-ups in various large-scale consortia on different phenotypes were performed, including GIANT, DIAGRAM and MAGIC, all references are given in the results table of the look-ups (Supplementary Table 7). For comparison of effect directions the loadings of each AvPC have to be considered. For example AvPC2 includes height with a positive loading and BMI with a negative loading. That means an increasing effect on AvPC2 means an increasing effect on height but a decreasing effect on BMI.
PCA, further analyses and plots were generated with R (http://www.r-project.org/) if not stated otherwise. Apart from the GCTA analyses, which uses LD structure of KORA F4, all LD analyses were performed in PLINK based on HapMap 2 (CEU) genotypes. For comparison of findings between loci from different AvPCs two loci are assumed to be identical if the topSNPs are in high LD (LD>0.8).
Summary statistics of all analyses can be downloaded from: https://www.broadinstitute.org/collaboration/giant/
How to cite this article: Ried, J. S. et al. A principal component meta-analysis on multiple anthropometric traits identifies novel loci for body shape. Nat. Commun. 7, 13357 doi: 10.1038/ncomms13357 (2016).
Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Wood, A. R. et al. Defining the role of common variation in the genomic and biological architecture of adult human height. Nat. Genet. 46, 1173–1186 (2014).
Locke, A. E. et al. Genetic studies of body mass index yield new insights for obesity biology. Nature 518, 197–206 (2015).
Lango Allen, H. et al. Hundreds of variants clustered in genomic loci and biological pathways affect human height. Nature 467, 832–838 (2010).
Shungin, D. et al. New genetic loci link adipose and insulin biology to body fat distribution. Nature 518, 187–196 (2015).
Liu, C. T. et al. Genome-wide association of body fat distribution in African ancestry populations suggests new loci. PLoS. Genet. 9, e1003681 (2013).
Kilpelainen, T. O. et al. Genetic variation near IRS1 associates with reduced adiposity and an impaired metabolic profile. Nat. Genet. 43, 753–760 (2011).
Bradfield, J. P. et al. A genome-wide association meta-analysis identifies new childhood obesity loci. Nat. Genet. 44, 526–531 (2012).
Wheeler, E. et al. Genome-wide SNP and CNV analysis identifies common and low-frequency variants associated with severe early-onset obesity. Nat. Genet. 45, 513–517 (2013).
Meyre, D. et al. Genome-wide association study for early-onset and morbid adult obesity identifies three new risk loci in European populations. Nat. Genet. 41, 157–159 (2009).
Myint, P. K., Kwok, C. S., Luben, R. N., Wareham, N. J. & Khaw, K. T. Body fat percentage, body mass index and waist-to-hip ratio as predictors of mortality and cardiovascular disease. Heart 100, 1613–1619 (2014).
InterAct, C.. et al. Long-term risk of incident type 2 diabetes and measures of overall and regional obesity: the EPIC-InterAct case-cohort study. PLoS Med. 9, e1001230 (2012).
He, M. et al. Meta-analysis of genome-wide association studies of adult height in East Asians identifies 17 novel loci. Hum. Mol. Genet. 24, 1791–1800 (2015).
Pendergrass, S. A. et al. Phenome-wide association study (PheWAS) for detection of pleiotropy within the Population Architecture using Genomics and Epidemiology (PAGE) Network. PLoS Genet. 9, e1003087 (2013).
Denny, J. C. et al. PheWAS: demonstrating the feasibility of a phenome-wide scan to discover gene-disease associations. Bioinformatics 26, 1205–1210 (2010).
Yang, J. et al. Conditional and joint multiple-SNP analysis of GWAS summary statistics identifies additional variants influencing complex traits. Nat. Genet. 44, S1–S3 (2012).
Aschard, H. et al. Maximizing the power of principal-component analysis of correlated phenotypes in genome-wide association studies. Am. J. Hum. Genet. 94, 662–676 (2014).
Polderman, T. J. et al. Meta-analysis of the heritability of human traits based on fifty years of twin studies. Nat. Genet. 47, 702–709 (2015).
Speliotes, E. K. et al. Association analyses of 249,796 individuals reveal 18 new loci associated with body mass index. Nat. Genet. 42, 937–948 (2010).
Heid, I. M. et al. Meta-analysis identifies 13 new loci associated with waist-hip ratio and reveals sexual dimorphism in the genetic basis of fat distribution. Nat. Genet. 42, 949–960 (2010).
Innocenti, F. et al. Identification, replication, and functional fine-mapping of expression quantitative trait loci in primary human liver tissue. PLoS Genet. 7, e1002078 (2011).
Arnold, M., Raffler, J., Pfeufer, A., Suhre, K. & Kastenmuller, G. SNiPA: an interactive, genetic variant-centered annotation browser. Bioinformatics 31, 1334–1336 (2015).
Comuzzie, A. G. et al. Novel genetic loci identified for the pathophysiology of childhood obesity in the Hispanic population. PLoS ONE 7, e51954 (2012).
Welter, D. et al. The NHGRI GWAS Catalog, a curated resource of SNP-trait associations. Nucleic Acids Res. 42, D1001–D1006 (2014).
Beck, T., Hastings, R. K., Gollapudi, S., Free, R. C. & Brookes, A. J. GWAS Central: a comprehensive resource for the comparison and interrogation of genome-wide association studies. Eur. J. Hum. Genet. 22, 949–952 (2014).
Li, M. J. et al. GWASdb: a database for human genetic variants identified by genome-wide association studies. Nucleic Acids Res. 40, D1047–D1054 (2012).
Soto-Pantoja, D. R., Kaur, S. & Roberts, D. D. CD47 signaling pathways controlling cellular differentiation and responses to stress. Crit. Rev. Biochem. Mol. Biol. 1–19 (2015).
Maimaitiyiming, H., Norman, H., Zhou, Q. & Wang, S. CD47 deficiency protects mice from diet-induced obesity and improves whole body glucose tolerance and insulin sensitivity. Sci. Rep. 5, 8846 (2015).
Paternoster, L. et al. Genome-wide population-based association study of extremely overweight young adults--the GOYA study. PLoS ONE 6, e24303 (2011).
Zeller, T. et al. Genetics and beyond--the transcriptome of human monocytes and disease susceptibility. PLoS ONE 5, e10693 (2010).
Wang, H. D. et al. DNA methylation study of fetus genome through a genome-wide analysis. BMC Med. Genomics 7, 18 (2014).
Tegha-Dunghu, J. et al. EML3 is a nuclear microtubule-binding protein required for the correct alignment of chromosomes in metaphase. J. Cell. Sci. 121, 1718–1726 (2008).
Wee, K., Yang, W., Sugii, S. & Han, W. Towards a mechanistic understanding of lipodystrophy and seipin functions. Biosci. Rep. 34, (2014).
Westra, H. J. et al. Systematic identification of trans eQTLs as putative drivers of known disease associations. Nat. Genet. 45, 1238–1243 (2013).
Ramdas, M., Harel, C., Armoni, M. & Karnieli, E. AHNAK KO Mice are protected from diet-induced obesity but are glucose intolerant. Horm. Metab. Res. 47, 265–272 (2015).
Grundberg, E. et al. Mapping cis- and trans-regulatory effects across multiple tissues in twins. Nat. Genet. 44, 1084–1089 (2012).
Chambers, J. C. et al. Genetic loci influencing kidney function and chronic kidney disease. Nat. Genet. 42, 373–375 (2010).
Dastani, Z. et al. Novel loci for adiponectin levels and their influence on type 2 diabetes and metabolic traits: a multi-ethnic meta-analysis of 45,891 individuals. PLoS Genet. 8, e1002607 (2012).
Replication, D. I. G. et al. Genome-wide trans-ancestry meta-analysis provides insight into the genetic architecture of type 2 diabetes susceptibility. Nat. Genet. 46, 234–244 (2014).
Scott, R. A. et al. Large-scale association analyses identify new loci influencing glycemic traits and provide insight into the underlying biological pathways. Nat. Genet. 44, 991–1005 (2012).
Richards, J. B. et al. A genome-wide association study reveals variants in ARL15 that influence adiponectin levels. PLoS Genet. 5, e1000768 (2009).
Teslovich, T. M. et al. Biological, clinical and population relevance of 95 loci for blood lipids. Nature 466, 707–713 (2010).
Global Lipids Genetics, C. et al. Discovery and refinement of loci associated with lipid levels. Nat. Genet. 45, 1274–1283 (2013).
Glessner, J. T. et al. A genome-wide study reveals copy number variants exclusive to childhood obesity cases. Am. J. Hum. Genet. 87, 661–666 (2010).
Coronary Artery Disease Genetics Consortium. A genome-wide association study in Europeans and South Asians identifies five new loci for coronary artery disease. Nat. Genet. 43, 339–344 (2011).
Aulchenko, Y. S., Ripke, S., Isaacs, A. & van Duijn, C. M. GenABEL: an R library for genome-wide association analysis. Bioinformatics 23, 1294–1296 (2007).
Willer, C. J., Li, Y. & Abecasis, G. R. METAL: fast and efficient meta-analysis of genomewide association scans. Bioinformatics 26, 2190–2191 (2010).
Purcell, S. et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81, 559–575 (2007).
Yang, J., Lee, S. H., Goddard, M. E. & Visscher, P. M. GCTA: a tool for genome-wide complex trait analysis. Am. J. Hum. Genet. 88, 76–82 (2011).
A complete list of acknowledgements is described in Supplementary Notes.
Kari Stefansson, Valgerdur Steinthorsdottir, Gudmar Thorleifsson and Unnur Thorsteinsdottir are used by deCODE Genetics/Amgen inc. Gérard Waeber and Peter Vollenweider received an unrestricted grant from GSK to build the CoLaus study. Bruce M. Psaty serves on a DSMB for a clinical trial of a device funded by the manufacturer (Zoll LifeCor).
Supplementary Figures 1-7, Supplementary Tables 1-5 (PDF 2020 kb)
Association results for 189 GWAS-significant loci. (XLS 71 kb)
Transferability of novel loci in other anthropometric GWAS (XLS 166 kb)
Phenotype summary statistics of all studies in 1st and 2nd stage analyses. (XLS 37 kb)
Genotyping description for all studies in 1st and 2nd stage analyses. (XLS 141 kb)
About this article
Cite this article
Ried, J., Jeff M., J., Chu, A. et al. A principal component meta-analysis on multiple anthropometric traits identifies novel loci for body shape. Nat Commun 7, 13357 (2016). https://doi.org/10.1038/ncomms13357
Cardiovascular Diabetology (2022)
Scientific Reports (2022)
Composite trait Mendelian randomization reveals distinct metabolic and lifestyle consequences of differences in body shape
Communications Biology (2021)
Using Composite Phenotypes to Reveal Hidden Physiological Heterogeneity in High-Altitude Acclimatization in a Chinese Han Longitudinal Cohort
Cellular and Molecular Life Sciences (2021)