Introduction

Assessment of infant body composition has become an increasingly important area of research due to the association between early growth and subsequent risk of obesity and metabolic diseases [1,2,3]. Body composition can be assessed using a range of approaches, with most common methods dividing the body into two compartments (2C): fat mass (FM) and fat-free mass (FFM), and assuming that the densities of FM and FFM remain constant between individuals [4]. However, while the density of FM remains stable across the lifespan, the density of FFM varies by age and other individual factors, such as whole-body hydration status, with the highest values found in infancy [5, 6]. The 3-compartment (3C) model divides body mass into FM, total body water (TBW), and fat-free dry mass (FFDM), and produces more valid body composition results than the 2C model as it accounts for the inter-individual variation of TBW. The 4-compartment (4C) model creates a fourth component by dividing the FFDM component in the 3C model into proteins and minerals [7]. Despite being the “gold standard reference method”, 4C models require measurements using various techniques, which makes the model more expensive and increases the participant burden. Therefore, 2C and 3C models are considered more suitable approaches in clinical or field studies, including with infants.

Despite the availability of various body composition techniques, using them in the paediatric population can be associated with a range of practical challenges due to the distinctive physiological and behavioural characteristics of infants. For example, the use of dual-energy X-ray absorptiometry (DXA) raises concerns regarding radiation exposure (albeit being extremely low), and keeping an infant motionless for a DXA scan can be problematic [8]. Air displacement plethysmography (ADP) using the PEA POD to estimate body density is arguably one of the most “practical” approaches for body composition assessment during early infancy [9]. The technique has several advantages, including ease of use, a very short assessment time, being non-invasive, and not being affected by an infant’s behavioural state (e.g., movement, crying, urination). The technique also has good precision; however, it is limited to assessment of infants up to ~6 months of age (a body mass of 8-10 kg) [10]. The adult ADP system (BOD POD) adapted with a paediatric option has demonstrated good validity with the 4C model in 2–6 years old children [11]; however, to date, there has been a gap in using ADP technology for 6–24 months old infants. The deuterium dilution (DD) technique, which estimates TBW, is another commonly used body composition approach in infancy (and suited to all ages). The DD technique has several advantages, including safety, suitability for field use, and collected samples can be stored for extended periods prior to analysis [8]. The practical challenges involved with the use of the DD technique in infants include dose losses due to spills and extended waiting periods between sample collections [12]. Both PEA POD and DD techniques are used in 2C models to derive FFM and FM and are also integral in 3C and 4C models.

Relatively few studies have compared infant body composition derived from PEA POD or DD using either 2C or multi-compartment models. In full-term infants, a 2C model using PEA POD (2C-PP) has shown good agreement with a 2C model using the DD technique (2C-DD) [13] and a 4C model [14], but not with a 2C model using DXA [15]. In pre-term infants, the 2C-PP has shown good agreeability with a 2C model using H218O isotope dilution [16] and a 3C model [17]. However, to our knowledge, no study has compared measurements derived from PEA POD and DD techniques using both 2C and 3C models in full-term infants. Moreover, the earlier study [13] to compare PEA POD and DD using a 2C model included predominantly Asian infants. As ethnicity is a significant predictor of infant body composition [18, 19], it may be valuable to test the agreeability of both techniques in infants of other ethnicities.

The overall aim of this study was to appraise the agreeability of body composition measures obtained from PEA POD and DD using 2C and 3C models in 6-month-old infants. Specific aims were to compare FM and %FM derived from (i) 2C-PP vs a 3C model that uses both PEA POD and DD techniques, (ii) 2C-DD vs a 3C model that uses both PEA POD and DD techniques, and (iii) 2C-PP vs 2C-DD.

Subjects and methods

Participants

This study was conducted as part of the Baby-bod study [20, 21]—a prospective longitudinal cohort study conducted at the Launceston General Hospital, Tasmania, Australia (September 2017 to October 2019), the Australian arm of the Multicenter Infant Body Composition Reference Study (MIBCRS) [22]. In the Baby-bod study, infants were longitudinally tracked from birth until the age of 6 months. This study utilised data gathered during the 6-month follow-up, incorporating body composition assessments conducted using both the PEA POD and DD techniques. Inclusion criteria of the Baby-bod study were that participating mothers must be 18 years of age or older, able to speak and understand English, have a gestational age at birth between 37+0 weeks and 41+6 weeks, and have a singleton pregnancy. Exclusion criteria were infants born with congenital birth defects, infants admitted to the neonatal intensive care unit, mothers with significant morbidity, and mothers unable to negotiate the informed consent process due to a difficult birthing experience. All eligible mothers who agreed to their infants’ participation in the study provided signed informed consent.

Body composition assessment procedures

  1. a.

    Assessment of body density by PEA POD body composition system

    Infant body density was assessed using the PEA POD (COSMED USA, Inc., Concord, CA, USA; software version 3.5.0). The physical design and the operating procedures of the PEA POD have been described in detail elsewhere [13, 23]. In brief, the unclothed infant was first placed on the scale for measuring body mass (MB) and then in the test chamber for two minutes for volume measurement (VB) to evaluate body density (DB = MB/VB).

  2. b.

    Assessment of TBW using the DD technique

    Body composition assessment in infants using the DD technique has also been described in detail elsewhere [24]. Briefly, a sample of saliva (pre-dose sample) was collected from the infant who had fasted for 20–30 min using a sterile cotton ball held with sterile plastic forceps. One gram of undiluted deuterium oxide (D2O 99.8%) was administered to the infant. Saliva samples were collected at 2.5 hours (post-dose sample I) and 3 hours (post-dose sample II) after the dose administration in the same way described above. D2O concentration in each saliva sample was determined using the Agilent 4500 FTIR portable spectroscopy instrument (Agilent Technologies, Inc., USA) [25]. Using the values of concentration (C1) and volume (V1) of the tracer (D2O) in the dose, and the concentration (C2) of the tracer in the saliva sample is measured, the volume of distribution (V2), also known as dilution space, was calculated using the dilution principle, V2 = C1V1/C2. The dilution space is slightly larger than TBW due to the non-aqueous exchange of the isotope. For D2O, it is 1.041 times that of TBW; thus, TBW was calculated as V2/1.041.

Estimation of FM and %FM using 2C and 3C models

  1. a.

    2C model with body density measured by PEA POD

    Whole-body density (DB) can be defined as a function of the densities of FM (DFM) and FFM (DFFM).

    $$\frac{1}{{{\rm{D}}}_{{\rm{B}}}}=\frac{{P}_{{FM}}}{{{\rm{D}}}_{{\rm{FM}}}}+\frac{{{\rm{P}}}_{{FFM}}}{{{\rm{D}}}_{{\rm{FFM}}}}$$

    The above equation can be rearranged as below to express %FM.

    $$\% {\rm{FM}}=\left[\frac{{{{\rm{D}}}_{{\rm{FM}}}{\rm{D}}}_{{\rm{FFM}}}}{{{\rm{D}}}_{{\rm{B}}}\left({{{\rm{D}}}_{{\rm{FFM}}}-{\rm{D}}}_{{\rm{FM}}}\right)}-\frac{{{\rm{D}}}_{{\rm{FM}}}}{\left({{\rm{D}}}_{{\rm{FFM}}}-{{\rm{D}}}_{{\rm{FM}}}\right)}\right]* 100 \%$$

    The density of FM is equal to 0.9007 kg/L and considered to be constant throughout life. Age- and sex-specific estimates for DFFM were derived from the Fomon model [5]. FM and FFM were calculated as below.

    $${\rm{FM}}=\frac{( \% {\rm{FM}}){{\rm{M}}}_{{\rm{B}}}}{100 \% }$$
    $${\rm{FFM}}={{\rm{M}}}_{{\rm{B}}}-{\rm{FM}}$$
  2. b.

    2C model with TBW assessed by DD technique

    FFM was estimated using the estimated TBW from the DD technique and hydration factor of infants from Fomon et al. [5].

    $${\rm{FFM}}=\frac{{\rm{TBW}}}{{\rm{hydration}}\,{\rm{factor}}}$$

    After estimating FFM, FM and %FM were calculated using the below equations.

    $${\rm{FM}}={{\rm{M}}}_{{\rm{B}}}-{\rm{FFM}}$$
    $$\% \,{\rm{FM}}=\frac{{\rm{FM}}\,(100 \% )}{{{\rm{M}}}_{{\rm{B}}}}$$
  3. c.

    3C model with body density and TBW

The 3C model divides body mass into FM, TBW, and FFDM, i.e., proteins and minerals. The whole-body densitometry equation can be arranged as below to express the % values of FM and TBW.

$$\frac{100}{{{\rm{D}}}_{{\rm{B}}}}=\frac{ \% {\rm{FM}}}{{{\rm{D}}}_{{\rm{FM}}}}+\frac{ \% {\rm{TBW}}}{{{\rm{D}}}_{{\rm{TBW}}}}+\frac{100- \% {\rm{FM}}- \% {\rm{TBW}}}{{{\rm{D}}}_{{\rm{FFDM}}}}$$

DFFDM was estimated to be 1.4898 kg/L for boys and 1.4909 kg/L for girls by applying the densitometric principle on FFDM as a mixture of proteins and minerals, using densities [26] and sex-specific proportions [5] of proteins and minerals in 6 month-old infants. These values and body density (DB) estimated from ADP and percent TBW (%TBW) estimated from DD technique with the assumptions of DFM to be 0.9007 kg/L, DTBW to be 0.9937 kg/L were used to calculate %FM and, thereby, FM.

Statistical analysis

All analyses were performed using R Project for Statistical Computing (version 4.2.0, Vienna, Austria) [27]. Descriptive variables are expressed using mean and standard deviation (SD). Absolute values of FM in grams and %FM derived by different methods were compared using paired t-tests. The degree of agreement between the measures was assessed with Bland-Altman analysis. The expected limits of agreement (LOA) were defined as a priori based on measurements with the 4C reference model [14]. Deming regression analysis [28] was performed to determine systematic bias (constant and proportional bias) between the methods, as it accounts for random errors in both X and Y variables. In an effort to gain a better understanding and clarify the potential factors contributing to the difference in FM and %FM calculated by each method, the differences were plotted against body mass, length, body volume, body density, and %TBW, and a regression line was added to determine the significance of any relationship observed (Supplementary Figs 13). All hypothesis tests were two-sided, and p < 0.05 was considered statistically significant.

Results

Sixty-eight infants (36 boys and 32 girls) were assessed using PEA POD and DD techniques. They were predominantly Caucasian (93%), with a mean age of 5.82 months, mean body mass of 7.25 kg, and mean length of 65.33 cm (Table 1). Within the study cohort, 15% of the infants were exclusively breastfed, while approximately 60% of them relied on breastmilk for over 75% of their dietary intake. Paired t-tests revealed no significant differences (p > 0.6 for all) between the means of body composition estimates (FFM, FM and %FM) obtained from different methods (Table 2). On average, their %FM was ~25% in all 3 methods.

Table 1 Demographic characteristics of the infants.
Table 2 Body composition estimates of the infants by different methods.

Bland-Altman results showed that the mean differences in FM and %FM between methods (mean bias) were not significant and close to zero (Table 3, Fig. 1). The LOA were widest in the comparison of 2C-PP vs 2C-DD (Lower LOA and Upper LOA for FM: -0.643, 0.657; for %FM: -9.217, 9.436), followed by 2C-PP vs 3C (FM: –0.439, 0.439; %FM: –6.352 to 6.352) and it was comparatively narrower in 2C-DD vs 3C comparison (FM: -0.296, 0.282; %FM: –4.278, 4.061). No trends were observed in the scatter of the points on the plots of 2C-PP vs 3C (slopes were not significant). Positive trends were evident along the graphs of 2C-DD vs 3C, with 2C-DD underestimating infant fatness compared to 3C at lower mean values and overestimating at higher mean values. In contrast, negative trends were observable in the plots of 2C-PP vs 2C-DD, with 2C-PP overestimating infants’ fatness compared to 2C-DD at lower mean values and underestimating it at higher mean values.

Table 3 Estimates of Bland-Altman analyses.
Fig. 1: Bland–Altman analyses comparing the two-compartment model with PEA POD to the three-compartment model (2C-PP vs 3C), the two-compartment model with deuterium dilution to the three-compartment model (2C-DD vs 3C), and the two-compartment model with deuterium dilution to the two-compartment model with PEA POD (2C-DD vs 2C-PP).
figure 1

The panels in the top row display Bland–Altman analyses of fat mass (FM) for 2C-PP vs 3C (top left), 2C-DD vs 3C (top middle) and 2C-DD vs 2C-PP (top right). The panels in the bottom row display Bland–Altman analyses of percent fat mass (%FM) for 2C-PP vs 3C (bottom left), 2C-DD vs 3C (bottom middle) and 2C-DD vs 2C-PP (bottom right). In each panel, the Y axes show the difference between the methods, and the X axes show the mean of the methods pertaining to the respective body composition variable. The solid black line represents the mean differences between the methods, and the dashed lines are the limits of agreement (±2 SD from the mean difference). The blue colour diagonal line represents the proportional bias; coloured shaded areas around solid and dashed lines show 95% confidence intervals.

Deming regression analysis revealed no constant or proportional differences and no major departure from the line of identity in the comparison of 2C-PP vs 3C. However, in the comparisons of 2C-DD vs 3C and 2C-PP vs 2C-DD, the regression line deviated from 1, and constant and proportional differences were observed (Table 4, Fig. 2). Additionally, the mean differences in FM and %FM derived from the different methods comparisons (2C-PP vs 3C, 2C-DD vs 3C and 2C-PP vs 2C-DD) were significantly associated with %TBW (p < 0.001) but not with body mass, length, or volume (p > 0.4 for all; Supplementary Figs 13). Body density was significantly associated with the mean differences in 2C-PP vs 3C and 2C-PP vs 2C-DD (p < 0.05), but not in 2C-DD vs 3C (p > 0.4).

Table 4 Estimates of Deming regression analyses.
Fig. 2: Deming regression analyses comparing the two-compartment model with PEA POD to the three-compartment model (2C-PP vs 3C), the two-compartment model with deuterium dilution to the three-compartment model (2C-DD vs 3C), and the two-compartment model with deuterium dilution to the two-compartment model with PEA POD (2CDD vs 2C-PP).
figure 2

The panels in the top row display Deming regression analyses of fat mass (FM) for 2C-PP vs 3C (top left), 2C-DD vs 3C (top middle) and 2C-DD vs 2C-PP (top right). The panels in the bottom row display Deming regression analyses of percent fat mass (%FM) for 2C-PP vs 3C (bottom left), 2C-DD vs 3C (bottom middle) and 2C-DD vs 2C-PP (bottom right). In each panel, the regression line is given in ‘blue’, and the line of identity (Y = X) is given in ‘red’; Shaded areas show 95% confidence interval for the regression line; the Pearson correlation coefficient is shown in each panel.

Discussion

In this study, we investigated the agreement of body composition measures obtained from PEA POD and DD techniques using 2C and 3C models in 6-month-old infants. Mean differences between the methods were not significant. However, significant constant and proportional bias were noted in comparisons of 2C-DD vs 3C and 2C-DD vs 2C-PP, but not in 2C-PP vs 3C comparison. In addition, significant associations were observed between the mean differences and %TBW.

Our findings may be particularly relevant in clinical settings where precise assessments of infant body composition are fundamental for growth monitoring and nutritional management. The absence of significant mean differences among the methods is an encouraging finding for researchers and healthcare professionals engaged in infant health assessments, as they can confidently choose from these methods, knowing that, on average, they provide similar body composition estimates. However, it is essential to acknowledge methodological nuances, including varying LOA, especially notable in the 2C-PP vs 2C-DD comparison, and potential systematic variations, particularly the constant and proportional biases observable in the 2C-DD vs 3C and 2C-DD vs 2C-PP comparisons. These differences suggest potential discrepancies in individual measurements, particularly in infants whose body fat levels deviate significantly from the typical range, either being extremely below or above the average. Further, our study revealed significant associations between mean differences in FM and %FM and %TBW, inferring variations in %TBW may influence the results obtained from different methods. This emphasises the importance of adopting a holistic approach for assessing infant body composition that considers not only methodological choices but also close control of pre-analytical factors, such as the hydration status of the participants.

Studies that determined the agreeability of body composition values derived from PEA POD and DD using 2C models, either with each other or with other techniques that used 2C or multi-compartment models, have been limited. Ma et al. [13] reported an excellent agreement between %FM assessed with 2C-PP and 2C-DD with identical mean values, and similarly, in our study, the mean difference in %FM did not differ significantly from zero. However, Ma et al. did not observe any proportional bias as we observed in our study, and their LOA for %FM (-6.84%, 6.71%) was narrower than ours (-9.22%, 9.44%). There could be several potential reasons for these discrepancies. Firstly, infants in the present study were older (5.82 months vs 1.34 months, respectively) and predominantly Caucasian vs Asian in the earlier study. Asian infants are characterised by higher FM and lower FFM than white Caucasians [29, 30]. We performed the analysis excluding the infants from ethnicities other than Caucasian (n = 5) and found no significant differences between the results. Secondly, as shown in supplementary figures, the significant variation in %TBW of infants in our study may have influenced the results; principles used in PEA POD are based on assumptions of body density, ignoring the inter-individual variation in the constituents of FFM. We also observed that the associations between body density and mean differences were only significant when comparing 2C-PP, which does not consider inter-individual variations in %TBW, with other methods.

Kuriyan et al. [31] evaluated the agreement of body composition estimates from 2C-PP and 2C-DD using pooled data from the larger Multicenter Infant Body Composition Reference Study (MIBCRS), which involved infants from Australia (n = 46), India (n = 86), and South Africa (n = 44), and reported a low mean difference (-0.05 kg; p < 0.05) in pooled FM derived from the two methods. However, the results of the present study showed no significant mean differences in FM in Australian infants (n = 66), suggesting the differences in the former study [31] may have arisen from different numbers and ethnicities of participating infants. Moreover, Ellis et al. [14] compared %FM in full-term infants (age 2–17 weeks) derived by 2C-PP against a 4C and reported no significant mean bias. In their study, LOA was wider (-6.8% to 8.1%) than the study by Ma et al. [13] but narrower than ours. Further, a study by Fields et al. [15] compared FM and %FM using 2C-PP and 2C-DXA in term-born infants at 6 months of age and reported that estimates of FM (2.284 vs 1.921 kg; p < 0.001) and %FM (31.1% vs 26.7%; p < 0.001) by 2C-DXA were significantly greater than those by 2C-PP. They showed that the difference in %FM reduced with increasing mean %FM values, and a significant association existed between body mass and infant fatness; specifically, significant differences in FM and %FM occurred when infant body mass was less than 7 kg. In contrast, the mean differences in our study were not dependent on infants’ body mass, length, or volume.

A 4C model to independently assess FFM constituents is considered the gold standard for body composition assessment. In contrast, 2C models are based on several assumptions [8] and may not provide the most accurate estimations of body composition [32]. Fields et al. [33] compared the accuracy of FM assessed by ADP (BOD POD), DD, DXA, and hydrostatic weighing against a 4C model and concluded that ADP prevailed over the other methods. They also highlighted that DD-2C might be associated with the highest error as it assumes a constant hydration status, but individuals may have significant differences in hydration status. Our results support this finding, as there was no constant or proportional bias between the estimates derived from PEA POD with a 2C model and a 3C model, whereas significant proportional bias existed when the DD technique was used in a 2C model. Other plausible reasons for the lower accuracy of the 2C-DD may include dose spillages [12], although we took all precautions to ensure that infants consumed the doses completely. Insensible water loss during the equilibration period may be another issue [24]; we assumed that the loss of deuterium in urine and sweat was minimal and could be ignored.

PEA POD software assigns values for the density of FFM based on the age and sex of the infant. Additionally, to account for differences in the compressibility of air in the thoracic cavity and near the skin surface, the PEA POD software makes corrections in the measured body volume by predicting thoracic gas volume and surface area artefacts [23]. These predictions may introduce errors at the individual level. Moreover, hair, body moisture and temperature have been shown to significantly affect %FM measurements by ADP, with increases in body temperature and moisture resulting in underestimation of body fatness [34, 35]. Despite these inherent potential sources of errors, the PEA POD is widely considered a reliable and valid tool to track body composition during the postnatal period, as less erroneous multi-compartment models are expensive and especially impractical with infants [8, 13, 14, 36].

Ours is the first study to compare PEA POD and DD measurements using 2C and 3C models in term-born, predominantly Caucasian infants. Our study has a reasonable sample size compared to similar studies [13,14,15], and participating infants were all approximately the same age (6 months), limiting the effect age-related variations in body composition may have on comparisons. Due to the practical challenges of working with infants and for reducing participant burden, we did not perform repeated body composition assessments. Consequently, we could not calculate within-day and between-day coefficients of variations (CVs) for body composition measurements, which would have provided insights into the precision and stability of the body composition measurements. Moreover, a comparison with a “gold standard” 4C model would have provided further information on the accuracy of the results.

In conclusion, while we found no significant mean differences in body composition estimates between the methods, it is noteworthy that we identified significant constant and proportional biases in specific comparisons. Specifically, biases were observed when comparing 2C-DD to both 3C and 2C-PP methods, but no such biases were evident in the 2C-PP vs 3C comparison. The observed proportional bias suggests that applying these methods to infants with body fat levels below or above the average can result in substantial differences in the estimates. In addition, the significant associations between the mean differences and %TBW emphasise that the hydration state of the participants can influence the accuracy and reliability of the results. These results underline the need for careful consideration of the choice of method, the importance of maintaining methodological consistency in longitudinal studies, and the need to consider and control various pre-analytical factors, such as the hydration state of the participants. On the whole, our findings contribute valuable insights to the intricate field of infant body composition assessment, encouraging further investigations to elucidate sources of bias and continued research and refinement of measurement techniques to enhance accuracy and reliability in this challenging population.