Introduction

Obesity is a worldwide major health problem. A large body of literature has established a clear link between obesity with adverse health outcomes, including diabetes mellitus, coronary heart disease, stroke, heart failure and increased mortality [1, 2].

Risk factors for obesity have been studied extensively, and in general, they are divided into: demographic and socio-economic (e.g., education, and socioeconomic status) [3, 4]; environmental (e.g., unhealthy diet, smoking, and sedentary lifestyle), [5] and genetic factors [6, 7]. From a genetic perspective, the most important distinction is that between rare (monogenic) and common (multifactorial) obesity. Rare, high-risk genetic variations in roughly two dozen genes are known to cause monogenic obesity [8, 9]. However, for the vast majority of the population, obesity is polygenic, involving many common genetic variants with individually minor effects, collectively contributing to the majority of inherited susceptibility [9]. In the past 15 years, large-scale genome-wide association studies (GWAS) and meta-analyses have identified ~200 loci that are associated with adiposity traits across different populations worldwide [10,11,12]. However, single nucleotide polymorphisms (SNPs) identified in GWAS explain little of obesity variance [7, 13, 14]. Combining these SNPs into a polygenic risk score (PRS) has proven valuable, offering a quantifiable measure of inherited predisposition toward a trait, such as BMI [15]. Developing and validating obesity-specific PRS represents a breakthrough in personalized obesity and obesity-related disease risk prognosis [16]. Obesity-related PRS have been constructed in adults and children, displaying significant associations with numerous phenotypes and health outcomes such as circulatory, metabolic and other diseases, across diverse populations [17,18,19].

Beyond the aforementioned risk factors, current research indicates that fetal and early-life characteristics (like birth weight) play an important role in determining childhood and adulthood obesity. Maternal overnutrition, reflected in part by greater maternal pre-pregnancy body mass index (mppBMI) and gestational weight gain (GWG), has been consistently linked with offspring adiposity across their lifespan [20]. Other maternal attributes (e.g., educational attainment and maternal smoking during pregnancy) have also emerged as risk factors [21, 22].

Early intervention to prevent childhood and young adulthood obesity is key in attenuating the obesity epidemic [23]. While risk prediction based on a single genetic factor, such as PRS, has limited prognostic accuracy, combining multiple factors into a prediction model, enhances accuracy [24]. Additionally, studies indicate a significant impact of early-life social factors and maternal characteristics on the likelihood of adult obesity compared to similar factors measured during adulthood [25]. Thus, in this unique study prediction models of young-adulthood obesity were constructed using BMI-related PRS, early-life socio-demographic characteristics and perinatal traits. These attributes were collected through the Jerusalem Perinatal Study (JPS), alongside follow-up measurements of obesity-related traits at age 32. We employed both the conventional association approach, using measures such as explained variability (e.g., R2 statistic) and measures of performance of a prediction model (e.g., net reclassification improvement (NRI), and integrated discrimination improvement (IDI)) to determine if data collected during early life improve the prediction of adult obesity projected by PRS.

Methods

Study population

The JPS encompasses a subset of 17,003 births in Jerusalem from 1974 to 1976 [26]. Data consist of demographic and sociodemographic information, medical conditions of the mother during current and previous pregnancies, and offspring birth weight from birth certificates or maternity ward logbooks. In addition, mothers on the first or second day postpartum were interviewed regarding lifestyle and maternal health and provided data on gestational age, smoking status, height and pre-pregnancy weight, end of pregnancy weight and gynecological history.

The JPS Follow-up study conducted between 2007 and 2009 includes a sample of 2591 offspring with mean age 32 years (range 30–35 years) who were sampled from the original cohort [27] and 1440 were examined and interviewed (Supplementary Fig. 1). Sampling frame included singletons and term (gestational age ≥36 weeks) births without congenital malformations. We obtained a stratified sample of eligible individuals, where the strata were defined by mppBMI and birth weight (BW). Both low (≤2500 g) and high (≥4000 g) birth weight as well as mothers with overweight and obesity were over-sampled. Detailed information on data collection has been previously described [27]. Height, weight, waist measurements and fasting blood samples were collected following standard protocols.

This study was approved by the Institutional Review Board of the Hadassah-Hebrew University Medical Center and all participants provided informed consent. Data are stored in the European Genome-phenome Archive, https://ega-archive.org/ accession number EGAS00001004075.

Study variables

The primary outcome examined was offspring’s BMI and waist circumference (WC), treated as continuous variables. Additionally, this study also examined two outcomes: (1) obesity, defined as BMI ≥30, and (2) central obesity, defined as WC >8th decile of the gender specific WC distribution [28, 29].

In addition, the following explanatory variables were examined: early life socioeconomic status (SES) based on father’s occupation (grouped into six categories: 1 to 6, for lower to upper class, respectively) which was found to be similar and well correlated with other Israeli social class scales [4, 26], mother’s years of education (continuous), parental age at offspring’s birth (continuous), birth weight (BW) (gr, continuous), gestational age (weeks, continuous), mppBMI (kg/m2, continuous) and GWG (kg, continuous), and maternal smoking during pregnancy (categorical: current smoker (including those who quit during the current pregnancy) vs. never smoked or smoked in the past and quit before the this pregnancy. Father’s occupation was selected as an indicator of early life SES, rather than mother’s occupation, as approximately half of the women in this cohort were not employed outside the home during childbirth [26].

Genotyping and quality control

Genomic DNA was extracted using the salting-out method, then genotyping was carried out using the Affymetrix Biobank array (comprising of 587,021 variants). We used standard quality control criteria for individual samples and genetic variants [30, 31]. Out of 2790 DNA samples of both parents and offspring (950 triads), 68 samples had poor DNA quality and 138 samples did not pass the QC due to high rate of missing information and/or high rate of heterozygosity. Overall, we were left with 2593 samples, 872 from offspring. We started the SNPs QC for 587,021 SNPs with high variant calling rates. The missingness test had filtered out 22,151 SNPs, 166,313 SNPs were defined as monomorphic (frequency <0.00001) and in addition, we filtered out 142 SNPs with more than 100 Mendelian errors. Thus, a total of 398,491 validated SNPs were available for the analysis.

Imputation was done using a combined panel of the 1000 genomes [32] and The Ashkenazi Genome Consortium (n = 128) as ref. [33]. Imputed variants with MAF ≥ 1% and quality score ≥0.9 are retained (hard calls), generating a high-quality genetic dataset with nearly 7M genetic variants per individual available for subsequent analyses.

PRS calculation

PRS was computed for offspring genotype data using parameters derived from BMI-related GWAS. The association study employed BMI summary statistics from various European ancestry populations, encompassing 339,224 individuals from 125 studies, either with GWAS results or with results from Metabochip studies [18, 34], and a LD reference panel of 503 European samples from 1000 Genomes Phase 3, Ver 5 [32]. The results from multiple PRS that we have calculated showed that using only 10 to 100 SNPs which were found significant by the original GWAS was insufficient to create a reliable PRS. In fact, as many as 500k to 700k SNPs were needed to reach 90 and 95% of the maximal achieved PRS-BMI correlation. The majority of the variants (up to 2 million) contributed positively to this correlation and therefore for the current analyses we used a panel which consisted all 2,100,302 common variants (MAF ≥ 1%). The PRS was established by the LDPred computational algorithm, a Bayesian method that calculates a posterior mean effect for all variants using external weights, then shrinks them based on linkage disequilibrium [35].

Principle component analysis (PCA) was applied on genotyped samples of both parents and offspring (n = 2575) along with Jewish reference panel (n = 174) using PC-Air [36]. Rather than assigning offspring to a single ethnicity, the first five PCs were included as covariates in regression models to account for variations in ancestry within the Jewish population in Israel.

Statistical analyses

Linear regression models were used to explore the associations of the genetic group (i.e., PRS and 5 PCs), socio-demographic group (i.e., parental age at offspring birth and SES at birth) and perinatal characteristics group (i.e., Gestational week, mppBMI, GWG, BW and parental smoking during pregnancy) independent of each other, with BMI and WC. Four sets of models were constructed; Model 1 included sex as covariate, Model 2 included PRS adjusted for ethnicity (PCs 1–5) and sex, Model 3 incorporated socio-demographic characteristics, and Model 4 incorporated perinatal characteristics. Since these models were nested it enabled an assessment of each variable group’s contribution to explained BMI and WC variability. For categorical outcomes (obesity and central obesity), we used logistic regression analyses. Since the analyses of our dataset used probability weights, the likelihood-based tests may not be valid. An alternative method is comparing predicted probabilities from each model to assess their performance.

The area under the curve (AUC) was applied in evaluating the effectiveness of various diagnostic-models in separating between subjects with and without obesity [37]. Internal validity of the prediction model was assessed by 10-fold cross-validation.

The net reclassification improvement (NRI) index [38], was used to compare the saturated model (Model 4) and the restricted models (Model 2 and Model 3) for their ability to classify each subject either with or without obesity. The overall NRI which is the sum of improvement in reclassification of each obesity group (predicted risk <15 or ≥15%) was examined by χ2 values. A category-free version of the NRI was also presented [39] to ascertain risk increases to any extent for subjects with obesity under the saturated model (Model 4) compared to the restricted models, and similarly whether risk decreases to any degree for subjects without obesity.

Results

Population characteristics

Table 1 presents maternal and offspring birth characteristics, offspring PRS and obesity-related traits at age 32. Among mothers (around age 27.5), mppBMI was 23.8 kg/m2 and GWG was 11.3 kg. Mean offspring BMI at age 32 was 26.7, with greater WC among males than females. Overweight prevalence (BMI > 25) was higher in males (41.1%) than females (25.9%).

Table 1 Descriptive characteristicsa of study participants by sex groups.

Risk factor association

BMI and obesity

Linear regression models’ outcomes are displayed in Table 2a, examining the genetic, socio-demographic, and perinatal factors associated with offspring BMI. Model 2, showed a significant BMI increase (1.4 kg/m²) per one standard deviation increase in PRS. Notably, this PRS association is independent of sex and the first 5 PCs.

Table 2 Association analysis of obesity-related traits with groups of risk factors.

Socio-demographic factors contributed minimally to the explained variability of offspring BMI (ΔR2 = 0.0058; p = 0.183; Model 3).

Upon inclusion of perinatal variables into the model, the explained variability of offspring BMI notably increased from R2 of 0.133 (in Model 3) to 0.196 (in Model 4), F5, 823 = 12.9; p ≤ 0.001). Each SD change in mppBMI was associated with 0.39 kg/m² increase in offspring BMI (p ≤ 0.001). The association between PRS and offspring BMI was slightly attenuated (though remained significant) with the addition of perinatal variables into the model.

Figure 1A illustrates observed and adjusted offspring BMI means across PRS deciles, showing substantial shifts between the 3 lower and 3 upper deciles. For example, mean observed and predicted BMI levels for offspring located at the upper PRS decile were nearly 5 kg/m² higher compared with mean levels of BMI in the lowest decile of PRS.

Fig. 1: Observed and predicted means partitioned by PRS deciles.
figure 1

A shows observed and predicted BMI adjusted means generated from Model 2 (Genetic), Model 3 (Genetic + Socio-demographic), and Model 4 (Genetic + Socio-demographic + Perinatal). Similarly, B shows observed and predicted waist circumference (WC) adjusted means estimated from Model 2 (Genetic), Model 3 (Genetic + Socio-demographic), and Model 4 (Genetic + Socio-demographic + Perinatal).

Additionally, logistic regression models were used to examine these associations with dichotomous measures of obesity (BMI ≥ 30). Supplementary Table 1a reveals a significant PRS-obesity association (adjusted odds ratio [OR] per SD increment, 2.08; 95% CI, 1.56–2.77). In Model 4, OR changes for genetic variables were minor. Yet, both ppmBMI (adjusted OR per SD increment, 1.22; 95% CI, 1.14–1.30) and GWG (adjusted OR per SD increment, 1.08; 95% CI, 1.01–1.15) were significantly associated with obesity.

Central obesity

Similar to BMI, the association between PRS deciles and WC mean levels is clearly demonstrated across the lowest and the highest deciles of PRS (Fig. 1B).

Table 2b presents results of linear regression models examining the associations with offspring WC. Sex strongly associated with offspring WC and explained a substantial proportion of its variability (R2 = 0.19). Incorporating PRS, the change in explained WC variability was modest, yet significant, (ΔR2 = 0.043), and the independent contribution of socio-demographic variables was negligible (ΔR2 = 0.004; p = 0.282).

Incorporating perinatal variables in Model 4 (Table 2b), significantly increased WC explained variance (ΔR2 = 0.0425, F5, 823 = 9.821; p ≤ 0.001). Maternal smoking and mppBMI, adjusted for other perinatal and socio-demographic characteristics and for genetic factors, were positively associated with offspring WC. The association between PRS and WC was attenuated (though remained significant) when perinatal variables were included in the model.

Finally, logistic regression models applied to categorical measures of central obesity (WC > 8th decile), confirmed that similar to the association with obesity, both PRS and ppmBMI were significantly associated with central obesity (Supplementary Table 1b, Model 4).

Risk prediction—discrimination and reclassification

Obesity

Our data showed that a risk model based on PRS had a sensitivity of 0.80 and specificity of 0.54 to predict young adult obesity (BMI ≥ 30) and the positive predictive value (PPV) and negative predictive value (NPV) of PRS were 0.27 and 0.91, respectively. Overall, the potential discrimination for predicting obesity in offspring increased when the perinatal variables were included in the model (sensitivity = 0.84, specificity = 0.60, PPV = 0.33 and NPV = 0.94). Figure 2A presents AUCs that estimate the discrimination between subjects with and without obesity (BMI ≥ 30) across the combination of risk predictors. The 10-fold average AUC increased modestly from 0.68 (Model 2—genetic) to 0.70 (Model 3—genetic + socio-demographic) and considerably to 0.74 when genetic + socio-demographic + perinatal variables were included in Model 4.

Fig. 2: Discrimination between subjects with and without obesity, estimated by parameters of Model 2 (Genetic), Model 3 (Genetic + Socio-demographic), and Model 4 (Genetic + Socio-demographic + Perinatal).
figure 2

Mean AUC (95% confidence interval) are presented where AUCs are based on 10-fold cross-validation for obesity (BMI ≥ 30 kg/m2) (A) and high central obesity (WC > 8th decile) (B).

Next, we determined calibration by comparing observed and expected frequency of obesity-related outcomes from models without (Model 2) and with (Model 4) socio-demographic and perinatal variables. Model 2 categorized 43.9% of the sample as low risk (predicted risk ≤15%) and 56.1% as high risk (predicted risk >15%) (Table 3). Observed proportion of subjects without and with obesity in these 2 groups were 91.0% (NPV) and 27.2% (PPV), respectively. Adding socio-demographic and perinatal variables to the model (Model 4), adjusted this to 51.4% and 48.6% as low and high-risk subjects, respectively, with improved accuracy (observed proportion of subjects without and with obesity of 94.0% and 33.1%, respectively). When adding to Model 2 the socio-demographic variables (Model 3) the NRI was somewhat improved (NRI, 0.073; 95% CI, 0.0291–0.1162), however when both the socio-demographic and perinatal variables were added to Model 2, the risk categories significantly improved reclassification accuracy in obesity (NRI, 0.147; 95% CI, 0.068–0.225), where net proportion of correct reclassification for subjects with obesity was 7/161 = 0.044 and 70/678 = 0.103 in subjects without obesity (Table 4).

Table 3 Risk of obesity predicted by logistic models without and with perinatal characteristics: the Jerusalem Perinatal Study.
Table 4 Comparison of reclassification and discrimination improvement models for obesity-related variables.

Supplementary Fig. 2a depicts predicted probabilities based on the different models for subjects with and without obesity. These results show that 72.7% (117/161) of subjects with obesity (BMI ≥ 30) had a predicted risk based on model 4 higher than the predicted risk based on model 2, i.e., a more accurate risk estimate (Panel E). Conversely, 27.3% had a higher predicted risk based on model 2 (Panel E). The net proportion of risk estimates with higher accuracy (obese NRI (>0)) hence was 45.4%. In addition, we note that 54.9% of subjects without obesity had a lower prediction based on model 4, while 45.1% had a higher prediction based on Model 4 (non-obese NRI (>0) = 9.5%; Panel F). The sum of the two components of NRI (>0) equals 54.8%. The integrated discrimination improvement (IDI) also showed a significant improvement favoring the model with PRS and perinatal variables as compared to the model without perinatal characteristics (0.062%; 95% CI 0.040–0.083; Table 4). The improvement in IDI was nearly 5-times greater compared to the improvement observed when solely integrating socio-demographic variables into the PRS model (Model 3 vs. Model 2) (0.013%; 95% CI 0.0043–0.0218, Table 4).

Central obesity

Similarly, Fig. 2B presents AUC that estimate the discrimination between subjects with and without high central obesity (WC > 8th decile) across the combination of risk predictors. The 10-fold average AUC increased only when the perinatal variables were included in the model from 0.62–0.63 (Models 2 and 3) to 0.68 (Model 4).

Regarding central obesity, Model 2 categorized 36.9% of the sample as low-risk and 63.1% as high-risk (Table 3). In these groups, 91.0% without and 26.1% with central obesity were observed. Model 4, compared to Model 2, placed more individuals in low-risk (45.3%) and fewer in high-risk (54.7%) for central obesity. The observed proportion of subjects without and with central obesity in these 2 groups were 92.4% and 29.8%, respectively.

Analyses suggested a better net reclassification of central obesity using Model 4 (NRI, 0.097; 95% CI, 0.034–0.159), with a similar proportion of correct reclassification of −0.006 in subjects with central obesity and an improved reclassification of 0.097 in subjects without central obesity (Table 4). There was no significant improvement in NRI when only socio-demographic variables were added to the genetic model (0.015 95% CI −0.0293 to 0.0592).

Similarly, Supplementary Fig. 2b (Panel E and Panel F) indicated that the sum of the two components of the continuous NRI (NRI > 0) was about 40.0%. Yet, the better risk prediction based on Model 4, was limited to subjects with central obesity. IDI was also improved (0.049%; 95% CI 0.0307 to 0.0672), favoring the combination of PRS, socio-demographic and Perinatal model (Table 4). This improvement in IDI was almost 10-fold compared to improvement derived from adding only the socio-demographic variables to the PRS model (Model 3 vs. Model 2) (0.005%; 95% CI −0.0001 to 0.0107, Table 4).

Discussion

In this work our objective was to investigate the impact of PRS for BMI, socio-demographic characteristics, and perinatal exposures on the risk of obesity among young adults. To achieve this, we employed two methodologies. Initially, we modeled the associations of these factors with obesity-related traits and assessed the contribution of each set of factors to the explained variability of obesity traits. Subsequently, we developed prediction models and assessed their effectiveness and performance.

Results of the association analyses revealed significant findings regarding the relationship between genetic factors and obesity-related traits in young adults. We measured the PRS for BMI using multiple loci from SNPs derived from GWAS of BMI in the UK Biobank [18, 34, 40]. PRS displayed a significant association with obesity and central obesity, albeit contributing modestly to the explained variability (4–9%). The correlation observed of BMI-PRS (with sex and 5 PCs included in the model) with BMI (r = 0.3) and WC (r = 0.2) were comparable to those reported in previous studies. For example, a large study involving middle-aged subjects (n = 288,016, mean age = 57) from the UK Biobank, demonstrated a PRS approximating a normal distribution which was significantly associated with BMI values (r = 0.29) [18]. Similar to our findings, in another large study based on young women from the Nurses’ Health Study and young men from the Health Professionals Follow-Up Study, each 10-allele increment in BMI-PRS was associated with an average BMI gain of 0.54 kg/m2 in women and 0.20 kg/m2 in men [41]. In a prospective longitudinal study of participants in the Dunedin birth cohort, children with higher PRS had higher BMIs at every age assessed, from age 3 through age 38 years [17]. At ages 30–38 the PRS correlation with BMI (r = 0.13–0.14) and with WC (r = 0.11–0.12) were statistically significant and similar, yet, these correlations were somewhat lower compared to our results.

Additionally, we show independent associations of PRS and perinatal exposures, such as maternal pregnancy-related body size, with offspring BMI and central obesity. Interestingly, associations between PRS and offspring BMI and WC were marginally attenuated (14.2% and 13.7%, respectively) with the inclusion of perinatal variables. It is plausible that common genetic variation contributed to both maternal and offspring phenotype levels, leading to a slight attenuation in the association between PRS and offspring obesity-related traits when perinatal variables were added to the model. Consistent with our findings, within the Dunedin birth cohort, PRS was associated with offspring BMI independently of family history of obesity in their parents [17]. Thus, PRS may contain additional information about children’s risk for obesity beyond their parental history of obesity. However, while the PRS was associated with offspring obesity at young adulthood independently of birth weight, adiposity rebound tended to mediate a large portion of PRS effect on young adulthood obesity [17].

Similar to other studies in Caucasian and non-Caucasian populations [42] we have demonstrated a distinct variation in WC levels between sexes. This discrepancy emerges in early life and is attributed to differences in body composition, fat distribution and sex hormone profiles. As opposed to this sex difference in WC and the observed effects of the perinatal variables, we show that parental ages and socio-economic status (SES) were not associated with offspring’s BMI and WC at age 32 and their inclusion in the model did not change the regression coefficients for PRS. Our findings contradict those observed in a birth cohort of White Western European subjects, in which both lower SES and high PRS were associated with increased pre-adolescence BMI [43]. The effect of SES was largely independent of the effect of the PRS and the inclusion of both variables in the model did not substantially alter the effect sizes observed when each variable was included alone. A more complex picture was observed in a study spanning three generations of birth cohorts in Finland, exploring associations of early-life social disadvantage with later-life BMI [25]. In the older two cohorts there was either no association between social disadvantage and BMI or an association independent of PRS (albeit slightly attenuated). However, within the youngest cohort, similar to our findings, the association between early social disadvantage and later life BMI was fully attenuated after adjusting for PRS. The authors suggested genetic selection and assortative mating may partly explain the social selection observed in the youngest age group. In our study, we suggest that the full attenuation of the socio-demographic factors when PRS and PCs are included could be also attributed to the confounding effect of ancestry, represented by the PCs. In the Jewish Israeli population ethnicity is strongly associated with socio-demographic characteristics [44] and thus the presence of PCs in the model likely diminishes their effect on BMI.

The second approach we took to address our study aims was to construct prediction models for obesity (BMI ≥ 30) and central obesity (WC > 8th decile). We showed that the capacity of the PRS model to discriminate between individuals with and without obesity was AUC = 0.69, and this was improved when perinatal variables were additionally included (AUC = 0.73). We also demonstrated this improvement using other measures of prediction performance (i.e., NRI and IDI). The results for the risk of central obesity paralleled those of overall obesity, yet the models provided slightly less accurate estimates compared to models based on BMI.

Several studies have examined obesity risk prediction models based on PRS. Interestingly, regardless of the number of loci included in the score, the models’ discrimination capacity for obesity ranged from AUC of 0.57–0.61, in line with our results [7, 45, 46]. Others have examined whether obesity risk prediction is improved with addition of risk factors to the model, above and beyond PRS. For example, in a Danish study, the AUC for a genetic model (PRS only) was 0.58 and after the inclusion of non-genetic obesity risk factors (e.g., education, diet, smoking, and physical activity) the AUC increased to 0.69 [47]. Compared to our findings, the contribution of risk factors to obesity prediction was even more substantial, yet overall performance of the saturated models in both studies are equivalent.

Furthermore, using a 15% threshold, we showed increased discrimination ability for predicting offspring’s obesity when the perinatal variables were also included in the model (sensitivity = 0.84, specificity = 0.60, PPV = 0.33 and NPV = 0.94). A previous study showed that parental obesity status at age 6–9 years had a sensitivity, specificity, PPV and NPV of 0.58, 0.70, 0.27 and 0.90 respectively, predict adult obesity [7, 48]. The resemblance to our results might imply that family history encompasses both genetic and non-genetic components which contribute to obesity prediction in offspring. However, the observed differences in results, particularly the slightly heightened sensitivity and reduced specificity in our study, could be partially explained by the relatively older age in which maternal pre-pregnancy BMI was assessed (i.e., adulthood in our study vs. childhood). It has been shown that the predictive sensitivity of parental obesity for anticipating the risk of adult obesity in offspring may rise with the age of parental obesity assessment, while specificity tends to diminish with parental age [7].

Inspection of the proportion of correct reclassification among subjects who are either with or without obesity highlights the strengths and weaknesses of the combined PRS-perinatal strategy and provides insights on its implications. Overall, 25% of total subjects were reclassified, of which 17% were reclassified correctly, the majority (15%) being without obesity. The improved performance among subjects without obesity is also reflected by the higher NRI for subjects without obesity (0.103) compared to subjects with obesity (0.044), totaling the reported overall NRI of 0.147. This is also supported by the high NPV (0.94). These figures provide confidence that only a limited number of offspring predicted to be at low risk for obesity in adulthood by the combined genetic-perinatal model will eventually become overweight or obese. However, prediction of high obesity risk is less accurate and thus a larger number of those identified will in fact not develop obesity in the future. Nonetheless, the advantage of PRS and early life exposures is that they can be assessed at young ages long before adverse outcomes are manifested. If these markers are indeed used for early screening, identified high-risk individuals could truly benefit from early lifestyle modifications (e.g., healthy diet and increased physical activity). Importantly, such behavioral and/or environmental interventions to tackle obesity are in fact beneficial, even if applied among low-risk individuals.

The major strength of our study is the combination of high-quality detailed records of prenatal and perinatal characteristics with a comprehensive long-term follow-up data.

There are also several limitations to our study. First, it includes only a sample of offspring from the original 1974–1976 JPS cohort who were invited to participate in the follow-up study. However, using a stratified sampling approach and over-sampling the ends of the distribution ensured that offspring with a full range of mppBMI and birth weight were included in our study. Second, the PRS explained only a modest percent of the variation for BMI. This is in agreement with other studies, and expected with the current methodology used [18, 25]. Third, both mppBMI and GWG were reported by mothers in interviews conducted by nurses while hospitalized after delivery, whereas, verification from clinical records was not available. Nevertheless, we have demonstrated associations between reported maternal attributes and BMI more than 30 years later, as well as with long-term clinical outcomes in mothers [49], which lend support to the validity of the data. Additionally, in our study mothers were interviewed within days after delivery, in accordance of the reports on the validity and reproducibility of maternal recollection of pre-pregnancy weight [50, 51]. Even if reporting error was present, it is most likely that this non-differential misclassification resulted in an underestimation in our findings.

Conclusions

This study underscores the strong relationship between BMI and the risk of obesity in early adulthood with PRS and with maternal excess weight before pregnancy. In addition, our prediction models demonstrate that utilizing PRS while combining perinatal exposures enhances the ability to quantify future obesity risk in young adults. These findings offer impetus to progress toward the subsequent evaluation phase, in which PRS and early-life exposures could be utilized to target young individuals at higher obesity risk who could benefit from early preventive interventions.

Summary

What was known before?

  • Studies demonstrate associations between obesity-related phenotypes and subsequent risk of various diseases.

  • Genetic factors explored through polygenic risk score (PRS) provides valid association estimates with obesity throughout adolescence and adulthood.

  • Evidence also suggests that early life exposures, such as maternal pre-pregnancy body mass index (mppBMI) and gestational weight gain (GWG), are associated with adult offspring adiposity.

What this study adds?

  • Our analyses show that both genetic factors and early life exposures were significantly and independently associated with obesity-related outcomes.

  • Our study also demonstrates that inclusion of early life exposures improved the obesity risk-prediction model relative to a model that solely considered genetics factors.

How might these results change the direction of research or the focus of clinical practice?

  • Our prediction models demonstrate that combining PRS with early life exposures results in improved capacity to estimate future obesity risk in young adults.

  • These findings offer impetus to progress toward the subsequent evaluation phase, in which PRS and early-life exposures could be utilized to target young individuals at higher obesity risk who could benefit from early preventive interventions.