Genome-wide association study reveals dynamic role of genetic variation in infant and early childhood growth

Helgeland, Øyvind; Vaudel, Marc; Juliusson, Petur B.; Lingaas Holmen, Oddgeir; Juodakis, Julius; Bacelis, Jonas; Jacobsson, Bo; Lindekleiv, Haakon; Hveem, Kristian; Lie, Rolv Terje; Knudsen, Gun Peggy; Stoltenberg, Camilla; Magnus, Per; Sagen, Jørn V.; Molven, Anders; Johansson, Stefan; Njølstad, Pål Rasmus

doi:10.1038/s41467-019-12308-0

Published: 01 October 2019

Nature Communications volume 10, Article number: 4448 (2019)

Abstract

Infant and childhood growth are dynamic processes with large changes in BMI during development. By performing genome-wide association studies of BMI at 12 time points from birth to eight years (9286 children, 74,105 measurements) in the Norwegian Mother, Father, and Child Cohort Study, replicated in 5235 children, we identify a transient effect in the leptin receptor (LEPR) locus: no effect at birth, increasing effect in infancy, peaking at 6–12 months (rs2767486, P_6m = 2.0 × 10⁻²¹, β_6m = 0.16 sd-BMI), and little effect after age five. We identify a similar transient effect near the leptin gene (LEP), peaking at 1.5 years (rs10487505, P_1.5y = 1.3 × 10⁻⁸, β_1.5y = 0.079 sd-BMI). Both signals are protein quantitative trait loci for soluble-LEPR and LEP in plasma in adults independent from adult traits mapped to the respective genes, suggesting key roles of common variation in the leptin signaling pathway for healthy infant growth.

Introduction

BMI patterns in infancy and childhood follow well-characterized trajectories: a rapid increase soon after birth until ~9 months, the adiposity peak, followed by a gradual decline until ~4–6 years of age, and then the adiposity rebound, when BMI starts to increase again until the end of puberty¹. Recently, a study revealed that the most powerful predictor of obesity in adolescence is an increase in BMI between 2 and 6 years of age², but the underlying cause for this remains unknown. While large genome-wide association studies (GWAS) have revealed many loci associated with adult BMI and adiposity traits³, less is known about the genetic influences on infant and childhood BMI development. The most recent meta-analyses of childhood BMI suggest a strong overlap between the genetic architecture of childhood BMI and adult BMI. However, these studies mainly involve BMI measurements after the adiposity rebound^4,5,6. Thus, there is little knowledge regarding the genetic factors influencing growth during the first 5 years of life.

To explore how common genetic variation influences BMI development in infancy and early childhood, we here perform a GWAS of BMI measurements at 12 time points from birth to eight years of age (9286 children, 74,105 measurements) in the Norwegian Mother, Father, and Child Cohort Study^7,8, with replication in 5235 children (41,502 measurements). We identify variants in five loci (LEPR, ADCY3, LEP, LCORL, and FTO) that associate with BMI at distinct developmental stages. Both LEPR and LEP signals are protein quantitative trait loci (pQTLs) for soluble LEPR and LEP in plasma in adults and independent from signals associated with other adult traits mapped to the respective genes. Hence, our longitudinal analysis uncovers a complex and dynamic influence of common variation on BMI during infant and early childhood growth, dominated by the LEP-LEPR axis in infancy.

Results

Genotyping the Norwegian Mother, Father, and Child Cohort Study

A total of 17,474 children in the Norwegian Mother, Father, and Child Cohort Study (Supplementary Table 1) were genotyped in discovery and replication combined. The children’s BMI was measured at birth, 6 weeks, 3, 6, 8 months, and 1, 1.5, 2, 3, 5, 7, and 8 years of age (Fig. 1 and Supplementary Table 2). We performed genotype quality control (QC), imputation using the Haplotype Reference Consortium (HRC), and phenotype QC, leaving 9286 and 5235 samples for the discovery and replication cohorts, respectively, all of Norwegian ancestry.

Five loci associated with BMI at distinct developmental stages

We conducted separate linear regression analyses of standardized BMI for each time point using an additive genetic model (Fig. 2 and Supplementary Fig. 1). The lead SNPs at independent loci reaching P < 1.0 × 10⁻⁷ at one or more time points in the discovery sample were taken forward for replication (Table 1). This revealed a dynamic pattern of association during early growth. SNPs in five independent loci reached genome-wide significance, presenting peak association at different time points: (1) an intronic SNP rs2767486 in the LEPR locus peaking at 6 months; (2) an intronic SNP, rs13035244, near ADCY3 peaking at 1 year; (3) an intronic SNP rs6842303 near LCORL peaking at 1.5 years; (4) an intergenic SNP rs10487505 near LEP peaking at 1.5 years; and (5) an intronic SNP rs9922708 near FTO peaking at seven years (Figs. 2–4, and Supplementary Data 1).

Table 1 Summary statistics for the signals that met criteria for replication

A novel transient effect on BMI by a variant in LEPR

The strongest association with BMI was found for rs2767486 at 6 months (P_6m = 2.0 × 10⁻²¹, β_6m = 0.16) in the LEPR/LEPROT locus. The locus associated with BMI from 3 months of age, with effects peaking at 6–12 months, and waning from age three with little effect at eight years (Figs. 3 and 4). We found no evidence of association at birth for rs2767486 or nearby markers in our data or in recent large publicly available GWASs of birth weight⁹ and adult BMI^3,10. Thus, this locus most likely affects BMI development primarily during infancy. Conditioning on rs2767486 revealed a putative additional signal in the LEPR locus, rs17127815 (P_6m = 7.5 × 10⁻⁵ after conditioning on the top signal rs2767486), that followed the same association pattern over time as the main signal (Fig. 5).

rs2767486 is a pQTL for soluble LEPR in plasma in adults

LEPR encodes the leptin receptor, which functions as a receptor for the adipose cell-specific hormone leptin. High leptin levels suppress hunger by interacting with the long form of the leptin receptor (OB-RL) in the hypothalamus¹¹. The soluble form of leptin receptor (sOB-R), which is produced through ectodomain shedding of OB-RL in peripheral tissues, can bind leptin in circulation, and thereby reduce its effect on the central nervous system¹². The LEPR locus has previously been implicated in monogenic morbid obesity^13,14, severe childhood obesity¹⁵, age of menarche¹⁶, age of voice breaking¹⁷, levels of fibrinogen¹⁸ and C-reactive protein¹⁹, several blood cell count traits^20,21, and plasma sOB-R levels^21,22. To test whether any of the established variants for these traits explain the observed association with BMI in infancy, we repeated the analysis conditioning on the top SNPs reported in these studies. The association with infant BMI remained unaffected by conditioning on these SNPs, except for rs2767485 (Supplementary Fig. 2a), the strongest pQTL for sOB-R plasma levels in adults²². This SNP is located only 12.2 kb upstream of our top SNP rs2767486, with strong LD (r² = 0.9) between the BMI-raising and the sOB-R-increasing alleles. We next surveyed GnomAD for putative coding LEPR SNPs that could explain the association in the region. None of the three known common missense variants in the gene revealed any significant LD with our top SNP (all r² < 0.1). Thus, it is unlikely that the main effect in the region is acting through a coding polymorphism. We could not, however, rule out a role for rs1805094 encoding p.Lys656Asn for the putative second independent signal in the region that is tagged by rs17127815 (pairwise LD: r² = 0.83, Supplementary Fig. 3).

A variant in LEP is a pQTL for circulating leptin levels

The association between variants in the LEPR locus and infant BMI suggests an important role of leptin signaling in early growth. The genome-wide significant association with infant BMI for rs10487505 located 20 kbp upstream of LEP is therefore noteworthy. This SNP is a known pQTL for circulating leptin levels in adults²³. The leptin-increasing allele from Kilpeläinen et al.²³ is associated with lower infant BMI in our data. The effect presents a rise-and-fall pattern, rising during the 3–12 months period when the LEPR signal is at its plateau, reaching its peak at 1.5 years (P_1.5y = 1.3 × 10⁻⁸, β_1.5y = 0.08) before waning (Figs. 3 and 4). Children homozygous for the alleles associating with higher sOB-R and lower leptin levels exhibited higher mean standardized BMI (+0.65) than children homozygous for the opposite alleles (Fig. 6).

Effects on BMI by variants in LCORL and ADCY3

We identified an association with BMI in the LCORL locus for rs6842303, presenting a similar rise-and-fall pattern with peak effect at 1.5 years (P_1.5y = 7.5 × 10⁻⁹, β_1.5y = 0.09) (Figs. 3 and 4). Previously, this marker has been associated with related traits such as birth weight, birth length, infant length, and adult height. Interestingly, rs6842303 has also been associated with peak height velocity in infancy²⁴, but no association was reported in the largest adult BMI GWASs to date^3,10. This supports our finding of a transient effect of LCORL in early growth.

The second strongest signal was found at the ADCY3 locus. Biallelic mutations in ADCY3 have recently been found to cause severe syndromic obesity^25,26. ADCY3 is known to interact with MC4R, and rare mutations in MC4R account for 3–5% of severe obesity²⁷. The lead ADCY3 SNP, rs13035244, showed no association at birth, became genome-wide significant with a peak effect between one and 1.5 years (P_1y = 7.9 × 10⁻¹³, β_1y = 0.10), and then stabilized during the course of childhood (Figs. 3 and 4). This result is in agreement with a previous study of growth trajectories in children from one to 17 years of age⁴.

FTO is robustly associated with BMI only from age seven

In contrast to the rise-and-fall pattern reported here for signals in the LEPR, ADCY3, LEP, and LCORL loci, the FTO risk allele was not associated with BMI at birth or around the adiposity peak, and was robustly associated with BMI only from seven years of age (P_7y = 2.8 × 10⁻¹², β_7y = 0.12). These results are in agreement with previous reports^4,28, establishing the timing of this transition of effect to around five years of age (Figs. 3 and 4).

The biology of BMI shifts around adiposity rebound

Previous studies have suggested a tight genetic overlap between child and adult BMI, but the details of this relationship across the first years of life remain elusive^4,5. We used LD score regression²⁹ in LD Hub³⁰ to quantify the shared genetic contribution between BMI at each of the 12 time points and other traits (Fig. 7a, b and Supplementary Fig. 5). These results indicate that BMI in infancy shows only modest genetic correlation with adult BMI and related traits, with a shift towards higher correlation from three years onwards, suggesting a transition of BMI biology at around the adiposity rebound. Notably, the genetic correlation with a range of non-anthropometric traits varied substantially at infant age (Supplementary Fig. 6). However, it should be noted that the LD score regression estimates have large uncertainties at this sample size, and these results should thus be considered exploratory. Polygenic risk score analyses across all time points for markers associated with birth weight⁹, childhood BMI⁵, and adult BMI^3,10 revealed similar patterns (Fig. 7c). We also used LD score regression to estimate the SNP-based heritability of BMI measurements across infancy and childhood. The LD score regression-based heritability estimates varied with age, with relatively modest levels at birth and during the adiposity rebound, and high levels when adiposity is high, i.e. around adiposity peak and from seven years of age onwards (Fig. 7a). This finding is supported by twin studies that also show high heritability estimates for BMI in infancy, lower levels around four years of age, followed by higher estimates in later childhood³¹. Collectively, these results further indicate that the genetic mechanisms underlying BMI change from infancy to adulthood.

Partitioned LD-score regression also has the potential of identifying tissues, cells, and functional annotations that show heritability enrichment and thus provide better insight into the biology of the trait. Applying the GTEx and Franke Lab annotations^32,33, we did not find any study-wide significantly enriched annotations at any time points, probably due to limited power, as these methods typically require very large sample sizes. It is, however, notable that the lowest p-values clustered in the adipose and musculoskeletal/connective tissue categories at around six to eight months (Supplementary Fig. 7 and Supplementary Data 2).

Discussion

Here we report a GWAS with dense measurements of BMI during the first years of life. The few GWASs published on BMI in infancy and childhood mainly involve children above five years of age, i.e. during adiposity rebound^4,5. These studies point toward a strong genetic correlation for BMI around adiposity rebound and adulthood. Our results confirm a strong overlap of the genetics of BMI from five to eight years and adulthood; however, this association is much less pronounced during infancy. Infant weight and height have considerable heritable components³⁴. Our results suggest that there are distinct molecular mechanisms that dynamically and specifically influence weight gain in infancy, partly acting through leptin signaling. However, recent secular changes in childhood growth patterns³⁵ illustrate that non-genetic factors also play central roles during early infancy and childhood. Future studies in large cohorts such as the Norwegian Mother, Father, and Child Cohort Study might be able to shed light on how diet, parenting, lifestyle, and genetic factors influence the growth pattern in early life and later adulthood.

Leptin has an important role in fetal growth, and is positively correlated with birth weight³⁶. Leptin levels are high at birth and decrease quickly, whereas sOB-R levels are low at birth and increase rapidly during the first postnatal days³⁷. This pattern is hypothesized to be an important mechanism for suppressing leptin-induced energy expenditure during the first neonatal days. The sOB-R level remains very high during the first two years of life and then declines³⁸, mirroring the association of LEPR with infant BMI observed in our study (Fig. 3). An effect of genetic variant(s) on the level of sOB-R in infancy is therefore a possible causal mechanism underlying the association with BMI. An interaction between the LEPR- and LEP-associated variants with increased BMI in individuals who carry both the sOB-R-raising and leptin-lowering alleles would further support a mechanism where sOB-R in circulation sequesters leptin, reducing its membrane receptor activation, hence promoting energy intake during infancy. The SNPs associated with increased BMI during infancy near LEPR and LEP are not known to affect adult BMI. In fact, they are not in LD with any marker associated with adult diseases, and might thus promote healthy weight gain during infancy, a notion further supported at the genome level by LD score regression. This result is further supported by a recent independent study³⁹ suggesting that SNPs in the LEPR/LEPROT locus are associated with BMI at the adiposity peak.

A strength of the study is that all samples are drawn from the same birth cohort with harmonized data collection practices across the study, something that is rarely possible with a more traditional meta-analysis of many different cohorts and study designs. It is likely that this has contributed to our ability to discover and replicate several genome-wide significant loci despite considerably lower sample sizes compared to current mega studies performed on birthweight and adult BMI. By utilizing a replication sample from the same study cohort that was genotyped using a different genotyping array, we were also able to perform very specific replication of the initial time-dependent associations found in the discovery sample. While that provides a very pure and powerful replication design, it should be noted that the absence of an external non-Norwegian replication sample might limit the generalizability of our findings towards other populations.

In summary, in our first GWAS performed in the Norwegian Mother, Father, and Child Cohort Study, capitalizing on a wealth of phenotypes, the longitudinal analysis uncovers a complex and dynamic influence of common genetic variation on BMI during infant and early childhood growth, dominated by the LEP-LEPR axis in infancy. Improved understanding of infant weight biology is important as childhood obesity as well as undernutrition and premature births are worldwide challenges. Our study provides knowledge of time-resolved genetic determinants for infant and early childhood growth, suggesting that weight management interventions should be tailored to the developmental stage and genetic profile of the patients.

Methods

Ethics

Informed consent was obtained from all study participants. The administrative board of the Norwegian Mother, Father, and Child Cohort Study led by the Norwegian Institute of Public Health approved the study protocol. The establishment of MoBa and initial data collection was based on a license from the Norwegian Data Protection Agency and approval from The Regional Committee for Medical Research Ethics. The MoBa cohort is currently regulated by the Norwegian Health Registry Act. The study was approved by The Regional Committee for Medical Research Ethics (#2012/67).

Study population

The Norwegian Mother, Father, and Child Cohort Study is an open-ended cohort study that recruited pregnant women in Norway from 1999 to 2008. Approximately 114,000 children, 95,000 mothers, and 75,000 fathers of predominantly Norwegian ancestry were enrolled in the study from 50 hospitals all across Norway⁷. Anthropometric measurements of the children were carried out at hospitals (at birth) and during routine visits by trained nurses at 6 weeks; 3, 6, and 8 months; and 1, 1.5, 2, 3, 5, 7, and 8 years of age. Parents later transcribed these measurements to questionnaires. In 2012, the project Better Health By Harvesting Biobanks (HARVEST) randomly selected 11,490 umbilical cord blood DNA samples from the Norwegian Mother, Father, and Child Cohort Study’s biobank for genotyping, excluding samples matching any of the following criteria: (1) stillborn, (2) deceased, (3) twins, (4) non-existing Medical Birth Registry data, (5) missing anthropometric measurements at birth in Medical Birth Registry, (6) pregnancies where the mother did not answer the first questionnaire (as a proxy for higher fallout rate), and (7) missing parental DNA samples. In 2016, HARVEST randomly selected a second set of samples, 5984, using the same criteria.

Genotyping

For the discovery sample, genotyping was performed using Illumina’s HumanCoreExome-12 v.1.1 and HumanCoreExome-24 v.1.0 arrays for 6938 and 4552 samples, respectively, at the Genomics Core Facility located at the Norwegian University of Science and Technology, Trondheim, Norway. The replication sample was genotyped using Illumina’s Global Screening Array v.1.0 for all 5984 samples at the Erasmus University Medical Center in Rotterdam, Netherlands. We used the Genome Reference Consortium Human Build 37 (GRCh37) reference genome for all annotations and included autosomal markers only for this study.

Genotypes were called in Illumina Genome Studio (for discovery v.2011.1 and for replication v.2.0.3). Cluster positions were identified from samples with call rate ≥0.98 and GenCall score ≥0.15. We excluded variants with low call rates, signal intensity, quality scores, heterozygote excess, and deviation from Hardy–Weinberg equilibrium (HWE) based on the following QC parameters: call rate <98%, cluster separation <0.4, 10% GC-score <0.3, AA T Dev >0.025, HWE p-value < 10⁻⁶. Samples were excluded based on call rate <98% and heterozygosity excess >4 SD. Study participants with non-Norwegian ancestry were excluded after merging with samples from the HapMap project (ver. 3). Sample pairs with PI_HAT > 0.1 in identical-by-descent (IBD) calculations were resolved by removing a random sample in each pair. After genotype calling and QC, 9286 (80.8%) from the discovery sample set, and 5235 (87.5%) from the replication sample remained eligible for analysis.

Pre-phasing and imputation

Prior to imputation, insertions and deletions were removed to make the dataset congruent with the Haplotype Reference Consortium (HRC) v.1.1 imputation panel, using the HRC Imputation preparation tool by Will Rayner, version 4.2.5 (see URLs). Allele, marker position, and strand orientation were updated to match the reference panel. A total of 384,855 and 568,275 markers remained eligible for phasing and imputation for the discovery and replication set, respectively. Pre-phasing was conducted locally using Shapeit v2.790⁴⁰. Imputation was performed at the Sanger Imputation Server (see URLs) with positional Burrows-Wheeler transform⁴¹ and HRC version 1.1 as reference panel.

Phenotypes

Age, height, and weight values were extracted from hospital records through the Norwegian Medical Birth Registry (NMBR) for measurements at birth, and from the study questionnaires for remaining time points. Pregnancy duration in days was extracted from the Medical Birth Registry and pregnancies with duration <37 weeks 0 days were excluded (515 pregnancies). Height and weight values were inspected at each age and those provided in centimeter or gram instead of meter and kilogram, respectively, were converted. Extreme outliers, typically an error in handwritten text parsing or a consequence of incorrect units, were excluded (47 length and 8 weight measurements). A value x was considered as an extreme outlier if x > m + 2 × (perc₉₉ − m) or x < m − 2 × (m − perc₁), where m represents the median and perc₁ and perc₉₉ the 1st and 99th percentiles, respectively.
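The extreme-outlier rule above can be illustrated with a minimal Python sketch. The function names are hypothetical, and the percentile definition (linear interpolation via `statistics.quantiles` with `method="inclusive"`) is an assumption, since the text does not state which estimator was used.

```python
from statistics import median, quantiles


def extreme_outlier_bounds(values):
    """Return (low, high), where low = m - 2*(m - perc1) and high = m + 2*(perc99 - m)."""
    m = median(values)
    # quantiles(n=100) returns 99 cut points: index 0 is the 1st percentile,
    # index 98 the 99th. The "inclusive" method is an assumption of this sketch.
    cuts = quantiles(values, n=100, method="inclusive")
    perc1, perc99 = cuts[0], cuts[98]
    return m - 2 * (m - perc1), m + 2 * (perc99 - m)


def is_extreme_outlier(x, low, high):
    """A value is flagged when it falls strictly outside the bounds."""
    return x < low or x > high
```

Because the bounds are built from the median and the distance to the 1st and 99th percentiles, the rule tolerates skewed distributions: each tail gets its own width.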

Subsequently, height and weight curves were inspected for extreme outliers by monitoring the variation of height and weight over time as follows: (i) the height and weight ratio between consecutive ages were calculated at each time point but the last: $r_i = x_{i + 1}/x_i$ where r_i is the ratio at time point i and x_i is height or weight at i; (ii) the ratios were scaled after logarithm base 2 transformation, $r_i\prime = f\left( {{\mathrm{log}}_2\left( {r_i} \right)} \right)$, using the function f of Eq. 1:

$$f\left( x_{s,i} \right) = \frac{x_{s,i} - m_{s,i}}{F_{s,i}^{-1}\left( \Phi\left( z \right) \right) - m_{s,i}}, \qquad z = \begin{cases} 1 & \text{if } x_{s,i} \ge m_{s,i} \\ -1 & \text{otherwise} \end{cases}$$

(1)

where x_s,i is the value for an individual of sex s at time point i, m_s,i is the median, $F_{s,i}^{ - 1}$ the empirical quantile function of the values at i of individuals of sex s presenting at least three values before age two (exclusive) and at least two values after age two (inclusive), and Φ the distribution function of the standard normal distribution; (iii) the height or weight of an individual at time point i, presenting surrounding scaled ratios $r\prime _{i - 1}$ and $r\prime _i$ was considered as an outlier and excluded if $r\prime _{i - 1}\,> \,1$ and $r\prime _i\,<\,-1$ or if $r\prime _{i - 1}\,<\,-1$ and $r\prime _i\,> \,1$, corresponding to peaks or gaps in the curve, respectively.
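Steps (i) to (iii) can be sketched in Python as follows. This is an illustration under stated assumptions, not the authors' R code: the function names are hypothetical, the empirical quantile uses linear interpolation, and the denominator of Eq. 1 is taken in absolute value so that values below the median map to negative scores, which the r′ < −1 criterion appears to require.

```python
from math import log2
from statistics import NormalDist, median


def log2_ratios(values):
    """Step (i) plus the log2 transform: log2(x[i+1] / x[i]) for consecutive values."""
    return [log2(b / a) for a, b in zip(values, values[1:])]


def empirical_quantile(sorted_vals, p):
    """Linear-interpolation quantile (an assumed definition of F^-1)."""
    pos = p * (len(sorted_vals) - 1)
    lo = int(pos)
    hi = min(lo + 1, len(sorted_vals) - 1)
    return sorted_vals[lo] + (pos - lo) * (sorted_vals[hi] - sorted_vals[lo])


def scale(x, reference):
    """Eq. 1: distance from the median in units of the one-sided spread.

    Assumption: abs() keeps the sign of (x - m). Raises ZeroDivisionError
    if the reference values have no spread on the relevant side.
    """
    ref = sorted(reference)
    m = median(ref)
    z = 1 if x >= m else -1
    q = empirical_quantile(ref, NormalDist().cdf(z))
    return (x - m) / abs(q - m)


def is_peak_or_gap(r_prev, r_next):
    """Step (iii): opposite-signed scaled ratios beyond +/-1 around a time point."""
    return (r_prev > 1 and r_next < -1) or (r_prev < -1 and r_next > 1)
```

In this sketch `reference` stands for the log2 ratios at one time point among same-sex individuals meeting the measurement-count criteria described above.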

If for an individual of sex s, two consecutive height values, h_i and h_i+1 presented a decrease in height, i.e. h_i+1 < h_i, this was considered an artefact and corrected as follows.

If the individual presented three or more other height measurements, h_j with j ≠ i and j ≠ i + 1, for each j the corresponding height at i and i+1 was estimated by interpolating the height curve using the ratios as in Eq. 2:

$$x_{i,j} = \widehat {r_{i,j}} \times x_j$$

(2)

where x_i,j is the value at i interpolated from j, x_j is the value at j, and $\widehat {r_{i,j}} = \mathop {\prod}\nolimits_j^i {\widehat {r_k}}$ if j < i and $\widehat {r_{i,j}} = \frac{1}{{\mathop {\prod }\nolimits_i^j \widehat {r_k}}}$ if j > i, with $\widehat {r_k}$ the median of the ratios r at time point k for the individuals of sex s presenting at least three values before age two (exclusive) and at least two values after age two (inclusive). If, for all j, h_i > h_i,j, h_i was considered an outlier and excluded. Similarly, if, for all j, h_i+1 < h_i,j, h_i+1 was considered an outlier and excluded.
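A minimal Python sketch of the interpolation in Eq. 2 follows. The names are hypothetical, and the product bounds are an interpretation: `median_ratios[k]` is assumed to hold the cohort median of x[k+1]/x[k], so moving from j to a later i multiplies the ratios for k = j, …, i − 1, and moving to an earlier i divides by the ratios for k = i, …, j − 1.

```python
from math import prod
from statistics import median


def interpolate(x_j, j, i, median_ratios):
    """Estimate the value at time index i from the observed value x_j at index j.

    Assumption: median_ratios[k] is the median of x[k+1] / x[k] in the cohort.
    """
    if j < i:
        return x_j * prod(median_ratios[j:i])
    return x_j / prod(median_ratios[i:j])


def estimate_from_others(observed, i, median_ratios):
    """Median of the estimates at i interpolated from every other observed index.

    `observed` maps time index -> measured value.
    """
    estimates = [
        interpolate(x, j, i, median_ratios)
        for j, x in observed.items()
        if j != i
    ]
    return median(estimates)
```

For example, with `median_ratios = [2.0, 1.5, 1.2]`, a value of 10 at index 0 projects to 30 at index 2, and a value of 30 at index 2 projects back to 10 at index 0.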

Alternatively, if the individual presented two or fewer other height measurements, and h_i > h_high, h_i was considered an outlier and removed, with h_high defined as in Eq. 3:

$$h_{{\mathrm{high}}} = m_{s,i} + {\it{\Phi }}^{ - 1}(0.99) \times \left( {F_{s,i}^{ - 1}({\it{\Phi }}\left( 1 \right)) - m_{s,i}} \right)$$

(3)

where m_s,i is the median and $F_{s,i}^{ - 1}$ the empirical quantile function of the heights at i of individuals of sex s presenting at least three values before age two (exclusive) and at least two values after age two (inclusive), Φ and Φ⁻¹ the distribution and quantile functions of the standard normal distribution, respectively. Similarly, if the individual presented two or fewer other height measurements, and h_i+1 < h_low, h_i+1 was considered an outlier and removed, with h_low defined as in Eq. 4:

$$h_{{\mathrm{low}}} = m_{s,i} - {\it{\Phi }}^{ - 1}(0.99) \times \left( {m_{s,i} - F_{s,i}^{ - 1}({\it{\Phi }}\left( { - 1} \right))} \right)$$

(4)
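The thresholds of Eqs. 3 and 4 can be computed as in the following Python sketch. The function names are hypothetical and the empirical quantile uses linear interpolation, which is an assumption; the text does not specify the estimator.

```python
from statistics import NormalDist, median


def empirical_quantile(sorted_vals, p):
    """Linear-interpolation quantile (an assumed definition of F^-1)."""
    pos = p * (len(sorted_vals) - 1)
    lo = int(pos)
    hi = min(lo + 1, len(sorted_vals) - 1)
    return sorted_vals[lo] + (pos - lo) * (sorted_vals[hi] - sorted_vals[lo])


def height_bounds(reference_heights):
    """Return (h_low, h_high) following Eqs. 3 and 4."""
    ref = sorted(reference_heights)
    m = median(ref)
    nd = NormalDist()
    k = nd.inv_cdf(0.99)  # about 2.33 one-sided "standard deviations"
    upper_spread = empirical_quantile(ref, nd.cdf(1)) - m
    lower_spread = m - empirical_quantile(ref, nd.cdf(-1))
    return m - k * lower_spread, m + k * upper_spread
```

The one-sided spreads play the role of a standard deviation estimated separately above and below the median, so the bounds approximate the 1st and 99th percentiles of a possibly asymmetric distribution.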

If h_i and h_i+1 were not considered as outliers, ${h_{i_0}}$ and $h_{i + 1_0}$ were defined as the median of h_i,j as defined in Eq. 2, for all j ≠ i and j ≠ i+1, respectively. Starting from $h_{i_k} = h_i,h_{i + 1_k} = h_{i + 1}$, h_i and h_i+1 were iteratively decreased or increased, respectively, until h_i+1 ≥ h_i as described in Eqs. 5 and 6.

$$h_{i_{k + 1}} = \begin{cases} h_{i_0} + 0.9 \times \left( h_{i_k} - h_{i_0} \right) & \text{if } \left| h_{i_k} - h_{i_0} \right| > \left| h_{i + 1_k} - h_{i + 1_0} \right| \\ h_{i_k} & \text{otherwise} \end{cases}$$

(5)

$$h_{i + 1_{k + 1}} = \begin{cases} h_{i + 1_k} & \text{if } \left| h_{i_k} - h_{i_0} \right| > \left| h_{i + 1_k} - h_{i + 1_0} \right| \\ h_{i + 1_0} + 0.9 \times \left( h_{i + 1_k} - h_{i + 1_0} \right) & \text{otherwise} \end{cases}$$

(6)
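The iteration of Eqs. 5 and 6 can be sketched in Python as below. The function name is hypothetical, and the `max_iter` guard is an assumption added here so the loop terminates even if the two expected values are themselves in decreasing order; the text does not describe such a safeguard.

```python
def resolve_height_decrease(h_i, h_next, target_i, target_next,
                            shrink=0.9, max_iter=1000):
    """Pull the value further from its expected height towards it until h_next >= h_i.

    target_i and target_next correspond to the median interpolated heights
    (h_i0 and h_i+1,0 in the text). max_iter is a safeguard assumed by this sketch.
    """
    for _ in range(max_iter):
        if h_next >= h_i:
            break
        if abs(h_i - target_i) > abs(h_next - target_next):
            # Eq. 5: h_i moves 10% of the way towards its expected value.
            h_i = target_i + shrink * (h_i - target_i)
        else:
            # Eq. 6: h_next moves 10% of the way towards its expected value.
            h_next = target_next + shrink * (h_next - target_next)
    return h_i, h_next
```

With measured heights of 80 and 78 cm and expected heights of 77 and 79 cm, only the first value is adjusted: it is further from its expectation at every step and shrinks until it drops just below 78 cm.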

Subsequently, height and weight missing values were imputed from the individual height and weight curves at all ages for individuals presenting at least three values before age two (exclusive) and at least two values after age two (inclusive), and until age two (exclusive), for individuals presenting at least three values before age two (exclusive). A missing value at i was imputed to x_i = median(x_i,j), with x_i,j as defined in Eq. 2. Importantly, missing values were imputed only if at least two non-imputed values were present at both earlier and later ages. Upon imputation of missing values, outlier removal and height decrease correction was conducted as described previously, and the new missing values were imputed using the same rules. The number of imputed samples per time point for discovery and replication is available in Supplementary Table 2.

Finally, BMI was computed where both height and weight values were available. At each time point, BMI values were scaled prior to association as described in Eq. 1. These scaled values are referred to as standardized BMI in the text.
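For reference, BMI is the weight in kilograms divided by the squared height in meters. A one-function Python sketch (the name `bmi` and the handling of missing values as `None` are choices made here for illustration):

```python
def bmi(weight_kg, height_m):
    """Body mass index in kg/m^2; returns None when either measurement is missing."""
    if weight_kg is None or height_m is None:
        return None
    return weight_kg / height_m ** 2
```

An infant weighing 8.0 kg with a length of 0.70 m has a BMI of about 16.3 kg/m².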

The quality control of the phenotypes was conducted in R version 3.5.1 (2018-07-02) -- “Feather Spray” (https://www.R-project.org).

Statistical analyses

Genome-wide analyses were performed in SNPTEST v.2.5.2 on dosages of the alternate allele, using an additive linear model with sex, batch, and ten principal components as covariates. LD score regression was performed with LD Hub v.1.9.0 and LDSC v.1.0.0²⁹, using all markers remaining after the pruning recommended by the LD Hub³⁰ authors.
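To make the additive model concrete, the following Python sketch estimates the per-allele effect β for a single marker by ordinary least squares. It is a deliberate simplification and not a substitute for SNPTEST: covariates (sex, batch, principal components) are omitted, no standard error or P value is computed, and the function name is hypothetical.

```python
def additive_effect(dosages, phenotype):
    """Slope and intercept of phenotype regressed on allele dosage (0 to 2).

    Simplified single-predictor OLS; covariate adjustment is omitted here.
    """
    n = len(dosages)
    mean_g = sum(dosages) / n
    mean_y = sum(phenotype) / n
    sxx = sum((g - mean_g) ** 2 for g in dosages)
    sxy = sum((g - mean_g) * (y - mean_y) for g, y in zip(dosages, phenotype))
    beta = sxy / sxx
    return beta, mean_y - beta * mean_g
```

Under this model, β is the expected change in standardized BMI per additional copy of the alternate allele, so β = 0.16 corresponds to 0.16 sd-BMI per allele.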

Cell type specific partitioned LD score regression was performed on a local server using LDSC v.1.0.0. We used baseline LD scores (v.2.2), regression weights, allele frequencies, and segregated LD scores for the respective cell types built from 1000 Genomes Phase 3 obtained from the LDSC repository (see URLs) to run cell type specific analyses on all 12 time points.

Polygenic risk scores (PRS) were derived using effect sizes from genome-wide significant loci in the original studies on birth weight⁹, childhood BMI⁵ and adult BMI³. For each of the three comparison traits, PRS were compared against sd-BMI across all 12 time points. Only directly genotyped and imputed markers with information score >0.7 were included in the analyses, leaving 58, 40, and 94 markers for birth weight, childhood BMI, and adult BMI, respectively. Imputed markers were hard called to their most likely genotype prior to calculating the scores. PRS were calculated for each individual as the sum of the effect-weighted count of birth weight- or BMI-increasing alleles. Thus, each child got three different polygenic scores, one for each of the three traits compared, which were then tested for their ability to predict sd-BMI at each of the 12 time points. Hence the same weights and markers were applied to all time points for each of the compared traits. Furthermore, the Pearson correlation coefficient, r, between birth weight/BMI and PRS was calculated across all samples for each compared trait and at each age separately.
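The score construction described above amounts to a weighted sum followed by a correlation, as in this minimal Python sketch. Function names and the toy inputs in the tests are illustrative only and do not reproduce the study's markers or weights.

```python
from math import sqrt


def polygenic_score(allele_counts, weights):
    """Sum over markers of (count of trait-increasing alleles) x (published effect size)."""
    return sum(c * w for c, w in zip(allele_counts, weights))


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equally long sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)
```

In use, `polygenic_score` would be evaluated once per child and trait with hard-called genotypes (0, 1, or 2), and `pearson_r` once per trait and time point between the scores and sd-BMI.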

All p-values in the manuscript are presented as nominal unless otherwise stated.

Figures

All figures in the manuscript were generated in R version 3.5.1 (2018-07-02) -- “Feather Spray” (https://www.R-project.org). In addition to the system packages, the following packages were used: ggplot2 version 3.0.0, scico version 1.0.0, gtable version 0.2.0, ggrepel version 0.8.0, and ggdendro version 0.1-20.

URLs

For HRC or 1000G Imputation preparation and checking, see http://www.well.ox.ac.uk/~wrayner/tools/; for Sanger Imputation Service, see https://imputation.sanger.ac.uk/; for LD Score repository, see https://data.broadinstitute.org/alkesgroup/LDSCORE/.

Data availability

Summary data from the discovery analysis is available for download at the Norwegian Mother, Father, and Child Cohort Study website. Access to genotypes and phenotypes can be obtained by direct request to the Norwegian Institute of Public Health (https://www.fhi.no/en/studies/moba/for-forskere-artikler/gwas-data-from-moba/).

References

Rolland-Cachera, M. F. et al. Adiposity rebound in children: a simple indicator for predicting obesity. Am. J. Clin. Nutr. 39, 129–135 (1984).
Article CAS PubMed Google Scholar
Geserick, M. et al. Acceleration of BMI in early childhood and risk of sustained obesity. N. Engl. J. Med. 379, 1303–1312 (2018).
Article PubMed Google Scholar
Locke, A. E. et al. Genetic studies of body mass index yield new insights for obesity biology. Nature 518, 197–206 (2015).
Article CAS PubMed PubMed Central Google Scholar
Warrington, N. M. et al. A genome-wide association study of body mass index across early life and childhood. Int. J. Epidemiol. 44, 700–712 (2015).
Felix, J. F. et al. Genome-wide association analysis identifies three new susceptibility loci for childhood body mass index. Hum. Mol. Genet. 25, 389–403 (2016).
Bradfield, J. P. et al. A genome-wide association meta-analysis identifies new childhood obesity loci. Nat. Genet. 44, 526–531 (2012).
Magnus, P. et al. Cohort profile update: the Norwegian mother and child cohort study (MoBa). Int. J. Epidemiol. 45, 382–388 (2016).
Paltiel, L. et al. The biobank of the Norwegian Mother and Child Cohort Study – present status. Nor. Epidemiol. 24, 29–35 (2014).
Horikoshi, M. et al. Genome-wide associations for birth weight and correlations with adult disease. Nature 538, 248–252 (2016).
Yengo, L. et al. Meta-analysis of genome-wide association studies for height and body mass index in ∼700000 individuals of European ancestry. Hum. Mol. Genet. https://doi.org/10.1093/hmg/ddy271 (2018).
Sáinz, N., Barrenetxe, J., Moreno-Aliaga, M. J. & Martínez, J. A. Leptin resistance and diet-induced obesity: central and peripheral actions of leptin. Metabolism 64, 35–46 (2015).
Schaab, M. & Kratzsch, J. The soluble leptin receptor. Best Pract. Res. Clin. Endocrinol. Metab. 29, 661–670 (2015).
Farooqi, I. S. & O’Rahilly, S. Genetic factors in human obesity. Obes. Rev. 8, 37–40 (2007).
Clément, K. et al. A mutation in the human leptin receptor gene causes obesity and pituitary dysfunction. Nature 392, 398–401 (1998).
Wheeler, E. et al. Genome-wide SNP and CNV analysis identifies common and low-frequency variants associated with severe early-onset obesity. Nat. Genet. 45, 513–517 (2013).
Day, F. R. et al. Genomic analyses identify hundreds of variants associated with age at menarche and support a role for puberty timing in cancer risk. Nat. Genet. 49, 834–841 (2017).
Day, F. R. et al. Shared genetic aetiology of puberty timing between sexes and with health-related outcomes. Nat. Commun. 6, 8842 (2015).
Sabater-Lleal, M. et al. Multiethnic meta-analysis of genome-wide association studies in >100 000 subjects identifies 23 fibrinogen-associated Loci but no strong evidence of a causal association between circulating fibrinogen and cardiovascular disease. Circulation 128, 1310–1324 (2013).
Elliott, P. Genetic loci associated with C-reactive protein levels and risk of coronary heart disease. JAMA 302, 37 (2009).
Astle, W. J. et al. The allelic landscape of human blood cell trait variation and links to common complex disease. Cell 167, 1415–1429.e19 (2016).
Suhre, K. et al. Connecting genetic risk to disease end points through the human blood plasma proteome. Nat. Commun. 8, 14357 (2017).
Sun, Q. et al. Genome-wide association study identifies polymorphisms in LEPR as determinants of plasma soluble leptin receptor levels. Hum. Mol. Genet. 19, 1846–1855 (2010).
Kilpeläinen, T. O. et al. Genome-wide meta-analysis uncovers novel loci influencing circulating leptin levels. Nat. Commun. 7, 10494 (2016).
Sovio, U. et al. Genetic determinants of height growth assessed longitudinally from infancy to adulthood in the Northern Finland Birth Cohort 1966. PLoS Genet. 5, e1000409 (2009).
Saeed, S. et al. Loss-of-function mutations in ADCY3 cause monogenic severe obesity. Nat. Genet. https://doi.org/10.1038/s41588-017-0023-6 (2018).
Grarup, N. et al. Loss-of-function variants in ADCY3 increase risk of obesity and type 2 diabetes. Nat. Genet. https://doi.org/10.1038/s41588-017-0022-7 (2018).
Siljee, J. E. et al. Subcellular localization of MC4R with ADCY3 at neuronal primary cilia underlies a common pathway for genetic predisposition to obesity. Nat. Genet. https://doi.org/10.1038/s41588-017-0020-9 (2018).
Sovio, U. et al. Association between common variation at the FTO locus and changes in body mass index from infancy to late childhood: the complex nature of genetic association through growth and development. PLoS Genet. 7, e1001307 (2011).
Bulik-Sullivan, B. K. et al. LD Score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat. Genet. 47, 291–295 (2015).
Zheng, J. et al. LD Hub: a centralized database and web interface to perform LD score regression that maximizes the potential of summary level GWAS data for SNP heritability and genetic correlation analysis. Bioinformatics. https://doi.org/10.1101/051094 (2016).
Silventoinen, K. et al. Genetic and environmental effects on body mass index from infancy to the onset of adulthood: an individual-based pooled analysis of 45 twin cohorts participating in the COllaborative project of Development of Anthropometrical measures in Twins (CODATwins) study. Am. J. Clin. Nutr. 104, 371–379 (2016).
Finucane, H. K. et al. Heritability enrichment of specifically expressed genes identifies disease-relevant tissues and cell types. Nat. Genet. 50, 621–629 (2018).
GTEx Consortium et al. Genetic effects on gene expression across human tissues. Nature 550, 204–213 (2017).
Mook-Kanamori, D. O. et al. Heritability estimates of body size in fetal life and early childhood. PLoS ONE 7, e39901 (2012).
Johnson, W. et al. A changing pattern of childhood BMI growth during the 20th century: 70 y of data from the Fels Longitudinal Study. Am. J. Clin. Nutr. 95, 1136–1143 (2012).
Vatten, L. J., Nilsen, S. T., Odegård, R. A., Romundstad, P. R. & Austgulen, R. Insulin-like growth factor I and leptin in umbilical cord plasma and infant birth size at term. Pediatrics 109, 1131–1135 (2002).
Kratzsch, J. et al. Inverse changes in the serum levels of the soluble leptin receptor and leptin in neonates: relations to anthropometric data. J. Clin. Endocrinol. Metab. 90, 2212–2217 (2005).
Kratzsch, J. et al. Circulating soluble leptin receptor and free leptin index during childhood, puberty, and adolescence. J. Clin. Endocrinol. Metab. 87, 4587–4594 (2002).
De Silva, N. M. G. et al. Genetic architecture of early childhood growth phenotypes gives insights into their link with later obesity. Preprint at https://www.biorxiv.org/content/10.1101/150516v1 (2017).
Delaneau, O., Zagury, J.-F. & Marchini, J. Improved whole-chromosome phasing for disease and population genetic studies. Nat. Methods 10, 5–6 (2013).
Durbin, R. Efficient haplotype matching and storage using the positional Burrows-Wheeler transform (PBWT). Bioinformatics 30, 1266–1272 (2014).


Acknowledgements

This work was supported by grants (to P.R.N.) from the European Research Council (AdG #293574), the Bergen Research Foundation (“Utilizing the Mother and Child Cohort and the Medical Birth Registry for Better Health”), Stiftelsen Kristian Gerhard Jebsen (Translational Medical Center), the University of Bergen, the Research Council of Norway (FRIPRO grant #240413), the Western Norway Regional Health Authority (Strategic Fund “Personalized Medicine for Children and Adults”), the Novo Nordisk Foundation (grant #54741), and the Norwegian Diabetes Association; and (to S.J.) Helse Vest’s Open Research Grant (grant #912250). This work was partly supported by the Research Council of Norway through its Centres of Excellence funding scheme (#262700), Better Health by Harvesting Biobanks (#229624), the Swedish Research Council, Stockholm, Sweden (2015-02559), the Research Council of Norway, Oslo, Norway (FRIMEDBIO #547711), and March of Dimes (#21-FY16-121). The Norwegian Mother, Father, and Child Cohort Study is supported by the Norwegian Ministry of Health and Care Services and the Ministry of Education and Research, NIH/NIEHS (contract no N01-ES-75558), NIH/NINDS (grant no.1 UO1 NS 047537-01 and grant no.2 UO1 NS 047537-06A1). We are grateful to all the families in Norway who are taking part in this ongoing cohort study.

Author information

These authors contributed equally: Stefan Johansson, Pål Rasmus Njølstad.

Authors and Affiliations

KG Jebsen Center for Diabetes Research, Department of Clinical Science, University of Bergen, NO-5020, Bergen, Norway
Øyvind Helgeland, Marc Vaudel, Rolv Terje Lie, Jørn V. Sagen, Anders Molven, Stefan Johansson & Pål Rasmus Njølstad
Department of Genetics and Bioinformatics, Health Data and Digitalisation, Norwegian Institute of Public Health, NO-0473, Oslo, Norway
Øyvind Helgeland, Bo Jacobsson & Gun Peggy Knudsen
Department of Clinical Science, University of Bergen, NO-5020, Bergen, Norway
Petur B. Juliusson & Jørn V. Sagen
Department of Pediatrics and Adolescents, Haukeland University Hospital, NO-5021, Bergen, Norway
Petur B. Juliusson & Pål Rasmus Njølstad
Department of Health Registries, Norwegian Institute of Public Health, NO-5020, Bergen, Norway
Petur B. Juliusson
KG Jebsen Center for Genetic Epidemiology, Department of Public Health and Nursing, Faculty of Medicine and Health Sciences, Norwegian University of Science and Technology, NO-7491, Trondheim, Norway
Oddgeir Lingaas Holmen & Kristian Hveem
HUNT Research Center, Department of Public Health and Nursing, Faculty of Medicine and Health Sciences, Norwegian University of Science and Technology, NO-7491, Trondheim, Norway
Oddgeir Lingaas Holmen
Department of Gynecology and Obstetrics, Sahlgrenska Academy, University of Gothenburg, SE-405 30, Gothenburg, Sweden
Julius Juodakis, Jonas Bacelis & Bo Jacobsson
Department of Gynecology and Obstetrics, Sahlgrenska University Hospital, SE-413 45, Gothenburg, Sweden
Jonas Bacelis
Department of Community Medicine, UiT The Arctic University of Norway, NO-9019, Tromsø, Norway
Haakon Lindekleiv
HUNT Research Center, NO-7600, Levanger, Norway
Kristian Hveem
Department of Global Public Health and Primary Care, University of Bergen, NO-5020, Bergen, Norway
Rolv Terje Lie & Camilla Stoltenberg
Norwegian Institute of Public Health, NO-0473, Oslo, Norway
Camilla Stoltenberg
Centre for Fertility and Health, Norwegian Institute of Public Health, NO-0473, Oslo, Norway
Per Magnus
Institute of Health and Society, Faculty of Medicine, University of Oslo, NO-0315, Oslo, Norway
Per Magnus
Hormone Laboratory, Haukeland University Hospital, NO-5021, Bergen, Norway
Jørn V. Sagen
Department of Clinical Medicine, University of Bergen, NO-5020, Bergen, Norway
Anders Molven
Department of Pathology, Haukeland University Hospital, NO-5021, Bergen, Norway
Anders Molven
Department of Medical Genetics, Haukeland University Hospital, NO-5021, Bergen, Norway
Stefan Johansson

Authors

Øyvind Helgeland
Marc Vaudel
Petur B. Juliusson
Oddgeir Lingaas Holmen
Julius Juodakis
Jonas Bacelis
Bo Jacobsson
Haakon Lindekleiv
Kristian Hveem
Rolv Terje Lie
Gun Peggy Knudsen
Camilla Stoltenberg
Per Magnus
Jørn V. Sagen
Anders Molven
Stefan Johansson
Pål Rasmus Njølstad

Contributions

Ø.H. and M.V. performed the analyses. O.L., J.J., J.B., B.J., H.L., K.H., R.T.L., G.P.K., C.S., and P.M. contributed to sample acquisition and genotyping. J.J. and J.B. assisted with genotype quality control. Ø.H., M.V., S.J., and P.R.N. wrote the manuscript with contributions from all authors. P.B.J., J.V.S., and A.M. critically revised the manuscript for important intellectual content. Ø.H., M.V., S.J., and P.R.N. designed the study. S.J. and P.R.N. directed the study. P.R.N. conceived the project, secured funding, and initiated the study.

Corresponding authors

Correspondence to Stefan Johansson or Pål Rasmus Njølstad.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information Nature Communications thanks the anonymous reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Data 2

Supplementary Information

Peer Review File

Description of Additional Supplementary Files

Supplementary Data 1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.


About this article

Cite this article

Helgeland, Ø., Vaudel, M., Juliusson, P.B. et al. Genome-wide association study reveals dynamic role of genetic variation in infant and early childhood growth. Nat Commun 10, 4448 (2019). https://doi.org/10.1038/s41467-019-12308-0


Received: 08 December 2018
Accepted: 28 August 2019
Published: 01 October 2019
DOI: https://doi.org/10.1038/s41467-019-12308-0

