Race and geography impact validity of maximum allowable standing height equations for para-athletes

World Athletics use maximum allowable standing height (MASH) equations for para-athletes with bilateral lower extremity amputations to estimate stature and limit prosthesis length since longer prostheses can provide running performance advantages. The equations were developed using a white Spanish population; however, validation for other races and geographical groups is limited. This study aimed to determine the validity of the MASH equations for Black and white Americans and whether bias errors between calculated and measured stature were similar between these populations. Sitting height, thigh length, upper arm length, forearm length, and arm span of 1899 male and 1127 female Black and white Americans from the Anthropometric Survey of US Army Personnel database were input into the 6 sex-specific MASH equations to enable comparisons of calculated and measured statures within and between Black and white groups. Two of 12 MASH equations validly calculated stature for Black Americans and 3 of 12 equations were valid for white Americans. Bias errors indicated greater underestimation or lesser overestimation of calculated statures in 10 equations for Black compared to white Americans and in 2 equations for white compared to Black Americans. This study illustrates that race and geography impact the validity of MASH equations.

The proportionality between stature and body segment lengths is influenced by numerous extrinsic and intrinsic variables. These extrinsic variables include physical factors such as distance from the equator [14,15], temperature [15–20], humidity [16,21], pathogen loads [22], and environmental resource availability [23], and socioeconomic factors such as wealth [24,25], education [26], and occupation [27]. Intrinsic variables such as genetics [27–30], biological sex [31,32], age [31,32], and race [33,34] have likewise been observed to impact proportionality. The potential impact of geography and race on MASH equations is particularly important as World Para Athletics competitions include athletes from around the globe, with 103 countries participating in the 2023 World Para Athletics [35]. Prior studies have suggested the need for race- and population-specific stature estimation methods because racial differences in limb proportions cause varying accuracy for multiple stature estimation equations [36,37].
The currently used MASH equations were developed using a white Spanish population [12] and were compared to various stature estimation equations in white Australian and Asian Japanese populations [38]. While only three countries are represented and relatively small populations were used to compare the equations in Australia (N = 30, 15 females) and Japan (N = 31, 15 females) [38], the currently used MASH equations were determined to be the most accurate for estimating stature. However, to ensure the accuracy of the MASH equations for a specific race or geographical group, the equations must be validated in that population; thus, additional studies are warranted.
Numerous studies support that Black and white people have different body segment proportions, with Black people having relatively shorter trunks and longer limbs [32–34,36,37,39–44]. This emphasizes the importance of validating the MASH equations for a Black population. Additionally, the United States is consistently well represented in World Para Athletics competitions, but how the MASH equations may apply to Americans is unknown. Validating the MASH equations for Black and white American populations will improve our understanding of the generalizability of these equations to people from different races and geography and support global equity and fairness in para-athletic competitions.
The aim of this study is to determine if existing MASH equations validly calculate stature in Black and white populations from the United States. This study has 3 hypotheses: (1) the existing MASH equations will accurately calculate stature in Black Americans with no difference compared to their measured stature, (2) the existing MASH equations will accurately calculate stature in white Americans with no difference compared to their measured stature, and (3) the bias error, or difference, between calculated and measured stature for Black Americans will be similar to white Americans.

Results
A total of 1959 males and 1145 females from the Anthropometric Survey of US Army Personnel (ANSUR II) database [45–47] met the inclusion criteria. Sixty males and 18 females were identified as outliers and were removed from the study population, resulting in 1899 males and 1127 females included for analysis.
Table 1 provides the study population's age, mass, stature, and BMI values. No differences existed between Black and white males or females for these variables (p > 0.05). Body segment lengths and segment length to stature ratios are presented in Table 2. For males, thigh length, upper arm length, forearm length, and arm span lengths and length to stature ratios were greater in the Black group (p < 0.001 for all); however, sitting height and sitting height to stature ratio (p < 0.001) were shorter in Black compared to white males. For females, no difference existed between races for upper arm length (p = 0.079). Thigh length, forearm length, and arm span were longer in Black compared to white females (p < 0.001 for all), while sitting height was shorter for Black females (p < 0.001). Black females had greater body segment length to stature ratios for thigh length, upper arm length, forearm length, and arm span (p < 0.001 for all) and smaller sitting height to stature ratios (p < 0.001) compared to white females.
Stature was calculated using each of the sex-specific MASH equations and compared to measured stature (Table 3). For males, the main effect of stature showed a significant difference in mean stature and calculation method (F(2.81, 5328.31) = 519.97, p < 0.001), and a significant interaction between race and stature existed (F(2.81, 5328.31) = 302.21, p < 0.001). For Black males, no difference existed in measured stature and M10. Measured stature was greater than calculated stature from M8, M9, M11, M15, and MSH. For white males, measured stature did not differ from M15. Measured stature was greater than M8, M9, and M11, and less than M10 and MSH. When comparing races, no differences existed in measured stature or calculated stature using M8. Calculated stature was lower in Black vs white males for M9, M10, M11, M15, and MSH.
For Black females, the main effect of stature significantly differed by equation (χ²(6) = 1366.808, p < 0.001). No differences existed between measured stature and F8. Measured stature was greater than F9, F13, F12, and FSH but less than F10. For white females, the main effect of stature significantly differed by equation (χ²(6) = 968.238, p < 0.001). No differences existed between measured stature and F8 or F10. Measured stature was greater than F9, [...] 4. An expanded range of tolerance intervals can be found in the Supplementary Data. Linear regression indicated bias error and stature were negatively correlated for all equations and correlation coefficients generally increased in magnitude as MASH equations decreased in accuracy (Supplementary Data). For Bland-Altman analyses, all error comparisons had normal distributions (p > 0.05), except for white male equation M11 vs Stature (p = 0.03). In this case, the absolute differences of the errors and the means had a small negative correlation (r = −0.08), and log transformations did not result in a normal distribution (p = 0.04), thus log transforms were not performed prior to calculating limits of agreement [48–50]. Bland-Altman plots (Fig. 1) show the mean bias and limits of agreement (LOA), i.e., the range within which 95% of all differences between the MASH-estimated and measured stature are likely to lie [48,51]. Plots indicated that LOA ranges increased as MASH equations decreased in accuracy.
For males, a significant main effect (F(1, 1897) = 155.44, p < 0.001) indicated bias error was significantly greater in Black compared to white males. A significant interaction existed between race and bias error (F(2.15, 4075.37) = 342.98, p < 0.001). Equations M8, M9, M11, and M15 had negative bias errors, indicating they underestimated stature for both Black and white males. M8 underestimated stature significantly more for white than Black males, whereas stature was underestimated significantly more for Black than white males for equations M9, M11, and M15. M10 overestimated stature for both Black and white males, as indicated by positive bias errors, but to a greater extent in the white group. MSH significantly differed between the races and underestimated stature for Black males while overestimating stature for white males.
For females, F8, F9, F13, and FSH underestimated stature for both Black and white groups. Bias errors indicated stature was significantly more underestimated for Black than white females for F8, F9, F13, and FSH. F10 overestimated stature for both groups but more so for Black females. F12 underestimated stature for Black females but overestimated stature for white females.

Discussion
A comparison of our data with Canda [12] and Connick et al. [38] highlights that different races and geographical groups can affect the accuracy and application of the MASH equations. Our Black and white populations were shorter in stature than Canda's white Spanish population [12] for both males and females, and body segment proportionality differences were observed. The differences in our population's anthropometric values produced poorer stature estimation performance, demonstrated by smaller R² values for all equations. Our Black populations generated greater RMSE values than Canda in 9 out of 10 equations (all but M10); however, two of the equations for each sex (M10, M15, F8, and F10) in our white populations produced lower RMSE values. This reflects that the MASH equations produced weaker correlations and that a majority of the equations produced greater stature prediction errors for Black and white American populations compared to a white Spanish population.
Four of the male MASH equations (M8, M9, M10, and M15) and three of the female equations (F8, F9, and F12) were validated by Connick et al. [38] using white Australian and Asian Japanese populations. Our Black and white American males and females were shorter in stature compared to the white Australian groups but taller than the Asian Japanese groups. Varying body segment length to stature proportions were also observed. The equations generated lower R² values in our Black males and females and white females than those reported by Connick et al. [38]. However, our white male population generated higher R² values for 2 of the 4 equations (M10 and M15). Our Black and white females had RMSE values lower than Connick et al. for all three MASH equations. Our Black male group generated smaller RMSE values for 2 of the 4 equations (M8 and M10) while our white male group produced smaller RMSE values for 3 of the 4 equations (M8, M10, and M15). Connick et al. also reported bias errors that overestimated stature from all equations in both males and females [38], which contrasts with our study's underestimations of stature. Our data showed negative correlations between bias error and stature, whereas Connick et al. [38] observed positive correlations for M8, F8, M9, and F9. Thus, these equations tend to overestimate stature for taller individuals in Australia or Japan but underestimate stature for taller individuals in the United States. Our correlation coefficients were also similar in magnitude or greater than prior studies. These comparisons show that anthropometric variability between Black and white Americans, white Australians, and Asian Japanese populations generates inconsistent R² and error outcomes.
The purpose of this study was to determine if the currently used MASH equations validly calculate stature in Black and white populations from the United States. Statistical significance was used to determine equation validity. Thus, if an equation's predicted stature significantly differed from the measured stature, the equation was labeled invalid. We recognize that a single, rigid definition may oversimplify determining an equation's validity. Consequently, when a predictive equation was identified as statistically invalid, the mean bias error, RMSE, and effect sizes were evaluated to provide a multi-faceted approach to determine if the MASH equations provide "reasonable" stature estimations for practical purposes. LOAs provided further insights into the agreement between the estimated and measured stature values. Tolerance intervals, which present the range of stature prediction errors that encompass a certain percent of the population, were used to examine trends associated with the data spread and skew across the different populations and to supplement the interpretation of MASH equations' predictive ability.
Table 3. Measured and MASH calculated statures (mean ± 1 standard deviation) for Black and white males and females. 95% confidence intervals for statures are presented along with statistical differences and effect sizes of MASH calculated compared to measured stature within races (vs Measured) and between races for each stature measurement method (B vs W). MASH maximum allowable standing height, CI confidence interval of the mean, ES effect size. # Significant difference (p ≤ 0.05) between measured and calculated stature within each group. * A significant difference (p ≤ 0.05) in stature between sex-specific Black (B) and white (W) groups for a predictive equation.
The first hypothesis stated that MASH equations would accurately calculate stature in Black Americans with no statistical difference compared to their measured stature. For males, this hypothesis was accepted for equation M10, indicating that the MASH equations will provide a similar estimate to stature for male Black American athletes with bilateral above-knee amputations and at least 1 intact upper arm and forearm. The hypothesis was rejected for equations M8, M9, M11, M15, and MSH. For females, this hypothesis was accepted for equation F8, indicating that the MASH equations will provide a similar estimate to stature for female Black American athletes with at least one intact thigh, upper arm, and forearm. This hypothesis was rejected for equations F9, F10, F13, F12, and FSH. The rejected equations can be considered invalid for estimating measured stature for male and female Black athletes.
The equations that resulted in significantly different calculations from measured stature had mean bias errors ranging from −0.54 to −4.03 cm for Black males and 0.50 to −5.38 cm for Black females. 95% LOAs ranged from 9.0 to 17.7 cm for Black males and 7.7 to 15.1 cm for Black females. 95% tolerance interval ranges spanned from 9.6 to 19.0 cm for Black males and 8.2 to 16.0 cm for Black females. The greater range of bias errors with smaller LOA and tolerance interval ranges for Black females indicates that the equations do a poorer job at estimating stature but were more consistent in their predictive ability relative to Black males. All five of the rejected equations underestimated stature for Black males. Four out of 5 rejected equations underestimated stature and 1 of 5 overestimated stature for Black females. An underestimation of stature greater than 5 cm likely has a greater impact on performance than an over- or underestimation of one-half cm. Our study included a large population that provided high statistical power. This resulted in bias errors that are sometimes small in magnitude yet statistically significant. It is not clear what amount of over- or underestimation of stature is acceptable since no agreed upon error limits currently exist for MASH equations; however, effect sizes and tolerance intervals provide additional perspective. The effect sizes for M8 were very small (d = 0.07), suggesting that the observed difference from measured stature may not be meaningful; however, the remaining significantly different equations (M9, M11, M15, and MSH) had effect sizes between d = 0.35 and 0.54, suggesting these equations on average meaningfully underestimate stature in Black American males. The confidence and tolerance intervals also indicated that bias errors were generally skewed toward underestimating stature, but overestimation also occurs. For females, small effect sizes were observed for F10 and F12 (r = 0.11 and 0.21, respectively), so these over- and underestimations of stature can be considered minimal. However, effect sizes for F9, F13, and FSH ranged from r = 0.37 to 0.74, suggesting they meaningfully underestimate stature for Black American females.
Rejecting the use of a predictive equation that produces a small bias error may not be pragmatic. A bias error threshold of < 1 cm can be considered relatively small compared to mean measured statures over 175 cm and 163 cm for males and females, respectively. An RMSE threshold of < 3 cm represents the largest RMSE value reported for the MASH equations developed by Canda (2.97 cm) [12]. Using these exemplar thresholds of bias error < 1 cm and RMSE < 3 cm along with a small effect size (d ≤ 0.2; r < 0.3) on the rejected equations, we identified that equations M8 and F10 may be reasonable for use with Black males and Black females, respectively. The remaining statistically rejected equations (M9, M11, M15, and MSH for Black males and F9, F13, F12, and FSH for Black females) did not satisfy all three threshold requirements, indicating poor predictive performance. To improve the accuracy of these rejected equations, it is recommended that their corresponding bias errors be used to offset the MASH equation outcomes. Using an exemplar error tolerance threshold of ± 3 cm, 60% tolerance interval limits meet this requirement for both equations M8 and F10, indicating 40% of the population may fall outside of that error tolerance. However, bias error estimations were skewed toward underestimating stature, resulting in a lower percentage of the population meeting a particular error tolerance. Using a total error tolerance range of 6 cm, the tolerance interval limits account for nearly 80% of the Black male and female populations.
CVs for Black Americans ranged from 0.87 to 2.88%, suggesting the MASH equations performed reliably; however, low CV values alone do not indicate adequate reliability, and LOAs have been shown as the preferred method to assess intra-test variation [52,53]. Furthermore, CV methods should only be depended on if heteroscedasticity is present [50], which was not the case for our data. The best performing MASH equation generated Bland-Altman LOAs where 95% of the errors fell within ± 3.9 cm. Thus, applying the ± 3 cm threshold results in all MASH equations being considered ambiguous in their ability to accurately estimate stature. This suggests that optimizing equations to reduce the predictive stature range will be beneficial, pending better understanding of the relationship between prosthesis lengthening and performance. Raising the threshold so the acceptable range of LOAs will fall within 5% of measured stature, LOA ranges greater than 8.2 cm for females and 8.8 cm for males can be considered ambiguous in their ability to accurately estimate stature. This leads to equation F8 being equivalent to measured stature for Black females but all other equations for Black Americans being ambiguous in their ability to accurately estimate stature.
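The three-part screen applied to the statistically rejected equations (mean bias error < 1 cm, RMSE < 3 cm, and a small effect size of d ≤ 0.2 or r < 0.3) can be expressed as a short check. The sketch below is illustrative only: the class, function, and field names are hypothetical, the two example entries use made-up values rather than values from Tables 3 and 4, and taking the absolute value of the bias error is an assumption about how the < 1 cm threshold was applied.

```python
from dataclasses import dataclass


@dataclass
class EquationStats:
    """Summary statistics for one MASH equation in one group (hypothetical container)."""
    name: str
    mean_bias_cm: float   # calculated minus measured stature; negative = underestimation
    rmse_cm: float
    effect_size: float
    effect_metric: str    # "d" (males in this study) or "r" (females in this study)


def is_reasonable(stats, bias_limit_cm=1.0, rmse_limit_cm=3.0, d_limit=0.2, r_limit=0.3):
    """Return True only if all three exemplar thresholds are satisfied."""
    if stats.effect_metric == "d":
        small_effect = abs(stats.effect_size) <= d_limit
    elif stats.effect_metric == "r":
        small_effect = abs(stats.effect_size) < r_limit
    else:
        raise ValueError("effect_metric must be 'd' or 'r'")
    small_bias = abs(stats.mean_bias_cm) < bias_limit_cm  # assumption: magnitude of bias
    small_rmse = stats.rmse_cm < rmse_limit_cm
    return small_bias and small_rmse and small_effect


# Made-up illustrative inputs, not values reported in the study.
example_pass = EquationStats("example_A", mean_bias_cm=-0.5, rmse_cm=2.5,
                             effect_size=0.07, effect_metric="d")
example_fail = EquationStats("example_B", mean_bias_cm=-4.0, rmse_cm=4.5,
                             effect_size=0.54, effect_metric="d")

for eq in (example_pass, example_fail):
    print(eq.name, "reasonable for use:", is_reasonable(eq))
```

Requiring all three conditions together mirrors the text's statement that the remaining equations "did not satisfy all three threshold requirements"; the tolerance interval and LOA considerations are separate, additional checks.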
The second hypothesis was that MASH equations would accurately calculate stature in white Americans with no difference compared to their measured stature. For males, this hypothesis was accepted for equation M15, indicating that the MASH equations provide a similar estimate to stature for male white American athletes with bilateral above-knee and below-elbow amputations with at least one intact upper arm. The hypothesis was rejected for equations M8, M9, M10, M11, and MSH, which can be considered invalid. For females, this hypothesis was accepted for equations F8 and F10, supporting the use of MASH equations in female white American athletes with at least one intact thigh, upper arm, and forearm as well as those with bilateral above-knee amputations and at least one intact upper arm and forearm. The hypothesis was rejected for equations F9, F13, F12, and FSH, so these equations may also be considered invalid for estimating stature.
The mean bias errors of the rejected equations ranged from 1.71 to −2.83 cm for white males and 0.98 to −1.80 cm for white females. CVs for white Americans ranged from 0.81 to 1.72% while 95% LOAs ranged from 8.6 to 15.1 cm for white males and 7.3 to 13.1 cm for white females. 95% tolerance interval ranges spanned from 8.9 to 15.6 cm for white males and 7.6 to 13.7 cm for white females. The MASH equations do a poorer job of estimating stature for white males than females, as demonstrated by the greater range of average bias errors, LOAs, and tolerance interval ranges for males. Three of 5 rejected equations underestimated stature for males while 3 of 4 underestimated stature for females. Three equations, 2 male and 1 female, overestimated stature. For males, only equation M11 produced a small-medium effect size of d = 0.38 while the remaining equations had small effect sizes of d < 0.25. For females, small effect sizes were observed for F9 and FSH (r = 0.25 and 0.15, respectively) while medium and large effect sizes were observed for F12 (r = 0.35) and F13 (r = 0.52). As with the Black populations, it is not clear what amount of over- or underestimation of stature is acceptable given that the greatest mean overestimation error is less than 2 cm for white Americans and less than 3 cm for underestimation.
The multi-faceted method of using bias error (< 1 cm), RMSE (< 3 cm), and effect size (d ≤ 0.2; r < 0.3) thresholds along with considering tolerance intervals on the rejected equations as described earlier supports the reasonable use of M8 and M10 for white males and F9 for white females. The remaining statistically rejected equations (M9, M11, and MSH for white males and F13, F12, and FSH for white females) did not satisfy all three threshold requirements, and the LOAs and tolerance interval limits indicated greater bias error ranges. As with the Black population outcomes, these rejected equations can be adjusted using the observed bias errors as offsets. The 60% tolerance interval limits met the ± 3 cm error tolerance threshold for equations M8 and M10, and the 70% limits met the threshold for equation F9. Using the total error range of 6 cm, each equation falls in the 80% tolerance interval limits, indicating less than 20% of the population fall outside of this requirement. LOAs indicate that 95% of errors between measured and estimated stature fell within at best ± 3.7 cm, suggesting that improved predictive equations may be needed to narrow this range. Raising the acceptable threshold range of LOAs to 5% of measured stature (8.2 cm females, 8.8 cm males) leads to equations M8, M10, F8, F9, and F10 being equivalent to measured stature for white males and females.
Our third hypothesis proposed that the error between calculated and measured stature for Black Americans would be similar to white Americans. Four out of the 6 equations had larger bias errors and RMSE values for Black males and all six equations had larger bias errors and RMSE values for Black females compared to their white counterparts. Equations M8 and M10 had less bias and lower RMSE values for Black compared to white males. This hypothesis was rejected for both males and females as the errors of each predictive equation significantly differed between races. This indicates that the accuracy of the equations was race-dependent and anthropometric differences between the races influenced the predictive ability of the MASH equations.
Differences in bias error between Black and white males generally had small (M8, M9, M10, M11; d ≤ 0.16) to small-medium (M15; d = 0.35) effect sizes; however, the racial differences in bias error had a medium-large effect for MSH (d = 0.74). The MASH equations produced similar correlations between Black and white Americans with the exception of M15 and MSH, which had noticeably lower R² values in Black males. These two equations were also the only ones where RMSE values between Black and white males differed by more than 1 cm. The tolerance interval ranges were consistently larger for Black males, but M10, M15, and MSH were the only equations where the difference in range exceeded 1 cm. Taken together, these outcomes suggest that equations M15 and MSH did not predict stature similarly for Black and white American males. One can argue that the remaining equations (M8, M9, M10, M11) perform similarly between the races due to smaller effect sizes and RMSE value differences; however, the systematic bias where the equations consistently generate larger errors for Black males remains a concern.
Effect sizes for the differences in bias error between Black and white females were very small for F8 and F10 (r ≤ 0.09) and small for F9 and F13 (r = 0.24–0.25) but medium for F12 (r = 0.39) and large for FSH (r = 0.54). While R² of the MASH equations was similar between female groups for each equation, all RMSE values were greater for Black females, where F13 differed by more than 1 cm and FSH differed by greater than 3 cm. Tolerance interval ranges were greater for Black females for every equation, where F10, F13, and FSH had range differences greater than or equal to 1 cm. The tolerance limits were shifted toward more negative values for F9, F13, F12, and FSH. These data suggest that equations F13, F12, and FSH do not predict stature with the same accuracy for both Black and white females. The female equations F8, F9, and F10 had smaller effect sizes and RMSE value differences, thus it can be debated that these equations predict stature similarly between the races. Similar to the male data, systematic bias was evident where the female equations generated greater errors for Black females.
Overall, two MASH equations for Black Americans and three for white Americans were statistically valid, and two additional equations for Black Americans and three for white Americans were considered reasonable for use. The remaining eight equations for Black Americans and six equations for white Americans were invalid (Table 5). Furthermore, the observed ranges of bias error tolerance limits highlight that MASH equations have a fairly wide predictive ability for all groups. The best performing equations (i.e., those determined as valid or reasonable for use) resulted in 95% of individuals falling within bias error tolerance limit ranges between 7.6 and 10.2 cm (Table 4). As demonstrated by the lower and upper tolerance limits of the best performing equation (F8 for white females), one athlete could have stature underestimated by 4 cm and another athlete could have stature overestimated by 3.6 cm, resulting in an effective height difference of 7.6 cm. Using a notional threshold of 6 cm as an acceptable predictive error range, the best performing MASH equation for each sex-specific Black and white group accounts for ~75% (M10 for Black and M15 for white males) to ~85% (F8 for Black and white females) of the group populations. To reduce the potential for stature estimation differences between athletes, MASH equations should ideally generate more consistent stature predictions with lower tolerance interval ranges.
The findings from our study justify the need to develop new MASH equations for Black and white Americans. Although the development of new equations was beyond the scope of this study, our data provide the groundwork and direction for future studies. Thresholds for acceptable error levels for the MASH equations are needed as guidance for future equation development and validation efforts. Such thresholds should be informed by research investigating performance advantages caused by prosthesis lengthening. Ideally the MASH equations will generalize for all racial and geographical populations. The equations should be validated with additional racial and geographical groups to determine the need to either develop race- and geography-specific equations for the populations whose statures were not validly calculated by the current MASH equations or develop new generalizable MASH equations that will apply to all para-athletes regardless of race or geography.
Several limitations should be considered when interpreting data from this study. Our study used anthropometric measurement procedures that followed US Army and Marine Corps guidelines [46,47], whereas the MASH equations were developed using International Society for the Advancement of Kinanthropometry (ISAK) measurement procedures [54]. Differences between the procedures include the forearm measurement being performed with the palm facing forward instead of the palm facing the thigh, which could cause a minor difference in forearm length. The thigh length measurement was calculated by subtracting tibiale mediale height from the trochanterion height instead of directly measuring the distance between the trochanterion and tibiale laterale, which can result in a slightly shorter thigh length. Differences in forearm and thigh lengths could influence the predictive ability of the MASH equations and their comparison to measured stature; however, these differences will not impact the comparisons of Black vs white populations presented here. The different statures of each group could impact geographical comparisons, but also highlight potential regional differences. Validating anthropometric data using elite athletes will potentially improve the accuracy of MASH equations, and future studies should directly compare MASH predictions using athletes from different geographic locales.
Finally, in practice, pure error values between 1.73 and 2.97 cm are added to each MASH stature estimation based on Canda's [12] observed bias errors. Adding these values to our stature estimates yields overestimated mean statures in most cases, but does not change comparative analyses. Pure error values were not included in our analyses so we could generate unbiased stature estimations from the MASH equations and directly compare to the literature, which also does not use pure error values as inputs. The mean bias errors observed in this study (Table 4, Fig. 1) can be offset from each MASH equation result to provide more accurate stature estimations for Black and white para-athletes from the United States.
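The offset described above amounts to subtracting a group's mean bias error from the MASH result. The following is a minimal sketch under the sign convention used in the Results (bias error = calculated minus measured stature, so a negative value indicates underestimation). The function name and the 170.0 cm input are hypothetical; the −4.03 cm value is the largest mean underestimation reported above for Black males and is used only to illustrate the arithmetic.

```python
def offset_mash_estimate(mash_estimate_cm, mean_bias_error_cm):
    """Remove a group's mean bias error from a MASH stature estimate.

    Assumes bias error = calculated - measured, so subtracting a negative
    bias raises the estimate and subtracting a positive bias lowers it.
    """
    return mash_estimate_cm - mean_bias_error_cm


# Hypothetical MASH estimate of 170.0 cm from an equation with a mean bias
# error of -4.03 cm (underestimation); the adjusted value is about 4 cm higher.
adjusted = offset_mash_estimate(170.0, -4.03)
print(f"adjusted estimate: {adjusted:.2f} cm")
```

This adjusts only the average error for a group; it does not narrow the limits of agreement or tolerance interval ranges for individual athletes.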

Conclusion
While the existing MASH equations are the best available for stature estimation, and some were valid or considered reasonable for use (Table 5), this study suggests that a majority of the equations did not accurately estimate stature in Black and white males and females from the United States. Furthermore, bias errors significantly differed between Black and white males and females for every equation, and a systematic bias was evident where the MASH equations consistently generated greater errors for Black compared to white Americans. This study confirms that race and geography impact the validity of the MASH equations.
Several recommendations related to stature estimation can be made to improve the accuracy, validity, and generalizability of MASH equations and consequently enhance fairness in competitions for para-athletes: (1) use the bias errors from this study as offsets to MASH equation outcomes for Black and white para-athletes; (2) validate the MASH equations with additional racial and geographical populations; (3) identify acceptable MASH error thresholds for average bias and LOA ranges based on the prosthesis lengthening that provides a significant performance advantage; and (4) develop new MASH equations with a large, racially and geographically diverse group of athletes to minimize population biases.

Methods
Study demographic and anthropometric data were obtained from the ANSUR II database [45][46][47] , a publicly available comprehensive measurement study performed by anthropometric experts on US Army service members. Since MASH equations are most commonly applied to adult elite athletes, military service members, who are often considered "tactical athletes", can well represent an athletic population. Furthermore, the database was screened to include male and female participants between 18 and 35 years old with a body mass index (BMI) less than 30 kg/m². Metrics of interest included age, body mass, stature, sitting height, thigh length, upper arm length, forearm length, and arm span. Body segment length to stature ratios were then calculated for sitting height, thigh length, upper arm length, forearm length, and arm span.
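The screening and ratio steps described above can be sketched as follows. This is a minimal illustration only: the record field names (`age`, `mass_kg`, `stature`, and the segment keys) are hypothetical and are not the ANSUR II column names, lengths are assumed to be in cm, and the 18 to 35 year age bounds are assumed to be inclusive.

```python
# Hypothetical field names; lengths in cm, body mass in kg.
SEGMENTS = ("sitting_height", "thigh_length", "upper_arm_length",
            "forearm_length", "arm_span")


def bmi(mass_kg, stature_cm):
    """Body mass index in kg/m^2 from mass (kg) and stature (cm)."""
    return mass_kg / (stature_cm / 100.0) ** 2


def meets_inclusion_criteria(record):
    """Age 18-35 years (assumed inclusive) and BMI below 30 kg/m^2."""
    in_age_range = 18 <= record["age"] <= 35
    return in_age_range and bmi(record["mass_kg"], record["stature"]) < 30


def segment_to_stature_ratios(record):
    """Ratio of each body segment length to measured stature."""
    return {name: record[name] / record["stature"] for name in SEGMENTS}


# Illustrative participant with made-up values (not taken from ANSUR II).
participant = {
    "age": 25, "mass_kg": 70.0, "stature": 180.0,
    "sitting_height": 90.0, "thigh_length": 45.0,
    "upper_arm_length": 36.0, "forearm_length": 27.0, "arm_span": 180.0,
}

if meets_inclusion_criteria(participant):
    ratios = segment_to_stature_ratios(participant)
```

Keeping the inclusion test separate from the ratio calculation makes it straightforward to change the screening thresholds without touching the proportionality step.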

Anthropometric measurements
Stature and body segment measurements in the ANSUR II database were obtained following the US Army and Marine Corps guidelines described in Gordon et al. 47 and Hotzman et al. 46 . Brief setup, anatomical landmark, and measuring descriptions are as follows:

Stature
The vertical distance from a standing surface to the top of the head was measured with an anthropometer. The participant stood erect with the head in the Frankfurt plane, heels together, and weight distributed equally on both feet.

Sitting height
The vertical distance between the sitting surface and the top of the head was measured with an anthropometer. The participant sat erect with the head in the Frankfurt plane, thighs parallel, knees flexed 90°, and feet in line with the thighs.

Thigh length
Thigh length was calculated as the trochanterion height minus tibiale mediale height. Trochanterion height was the vertical distance between the standing surface and right trochanterion (superior point of the greater trochanter) landmark while tibiale mediale height was the vertical distance between the standing surface and right tibiale mediale (superior point on the medial condyle of the tibia) landmark. The participant stood erect with heels together and weight distributed equally on both feet, and the vertical distances were measured with an anthropometer.

Bland-Altman plots were constructed to assess the agreement between measured and estimated stature, where the mean of the measurements was plotted against the difference of the methods (i.e., error) 48 . Errors were assessed for normality using the Anderson-Darling test 49 . If errors were not normally distributed, heteroscedasticity was assessed by correlating the absolute errors and individual means, where a positive correlation indicated the need to log transform data prior to calculating limits of agreement [48][49][50] .

Within-subject coefficients of variation (CV%) were calculated for each MASH equation 55 . Within-subject stature variance, s², was calculated for each subject as (MASH − Measured)²/2. Within-subject variance was divided by the subject's squared mean stature from each method (s²/m²), and the mean was calculated across all subjects for each group. Within-subject CV% was calculated as the square root of the mean of s²/m² multiplied by 100.
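The agreement statistics described above can be illustrated with a short sketch. It assumes that error is defined as MASH estimate minus measured stature, that errors are normally distributed (so no log transformation is applied), and that the limits of agreement follow the conventional Bland-Altman definition of mean bias ± 1.96 standard deviations of the errors. The function name and the example values are hypothetical and are not data from this study.

```python
import math
import statistics


def agreement_summary(mash, measured):
    """Mean bias, 95% limits of agreement, and within-subject CV%.

    `mash` and `measured` are paired statures in cm, one pair per subject.
    """
    # Error for each subject: estimated minus measured stature.
    errors = [est - obs for est, obs in zip(mash, measured)]
    bias = statistics.mean(errors)
    sd = statistics.stdev(errors)  # sample standard deviation of the errors

    # Conventional Bland-Altman 95% limits of agreement.
    loa_lower = bias - 1.96 * sd
    loa_upper = bias + 1.96 * sd

    # Within-subject variance s^2 = (MASH - Measured)^2 / 2, scaled by the
    # squared mean m^2 of the two methods for that subject.
    scaled = []
    for est, obs in zip(mash, measured):
        s2 = (est - obs) ** 2 / 2
        m = (est + obs) / 2
        scaled.append(s2 / m ** 2)
    cv_percent = math.sqrt(statistics.mean(scaled)) * 100

    return {"bias": bias, "loa_lower": loa_lower,
            "loa_upper": loa_upper, "cv_percent": cv_percent}


# Made-up paired statures (cm) for three subjects.
summary = agreement_summary([172.0, 168.0, 181.0], [170.0, 169.0, 178.0])
```

If the errors were heteroscedastic, the same function could be applied to log-transformed statures, as the paragraph above indicates.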
Study population descriptive data, body segment lengths, and length to stature ratios were compared between Black and white groups using Student's independent t-tests for males and Mann-Whitney U tests for females with Bonferroni adjustments for multiple comparisons. For males, a 2-way (2 × 7) Mixed ANOVA determined the effect of Race on Stature. Race (Black, white) was treated as a between-subjects factor. Stature was treated as a within-subjects factor (Measured Stature, M8, M9, M10, M11, M15, and MSH). A 2-way (2 × 6) Mixed ANOVA was performed to compare the effect of Race on Bias Error. Race (Black, white) was treated as a between-subjects factor, and Bias Error was treated as a within-subjects factor (M8, M9, M10, M11, M15, and MSH). When the full factorial models identified significant differences, 1-way ANOVAs and pairwise comparisons with Bonferroni adjustments determined which conditions differed from each other. Greenhouse-Geisser estimates and Games-Howell post hoc tests were used to address the minor violations of sphericity and homogeneity of variance. Effect sizes were calculated using Cohen's d (mean difference divided by mean squared error) where d = 0.2, d = 0.5, and d = 0.8 are considered small, medium, and large effects, respectively 56 .
For females, Mann-Whitney U tests compared the effect of Race (Black, white) on Stature (Measured Stature, F8, F9, F10, F13, F12, and FSH) and on Bias Error (F8, F9, F10, F13, F12, and FSH). Within each race, Friedman's tests determined whether differences existed between stature calculations (Measured Stature, F8, F9, F10, F13, F12, and FSH). All assumptions for use of Mann-Whitney U and Friedman's tests were met, and Bonferroni corrections for multiple comparisons were employed. Effect sizes, r, were calculated as the absolute value of the z-statistic divided by the square root of the number of samples, where r < 0.3, r = 0.3-0.5, and r > 0.5 were considered small, medium, and large effects, respectively 57 . Statistical analyses were performed using SPSS 29.0 (IBM Inc., Armonk, NY, USA), and significance levels were set at α = 0.05.
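The nonparametric effect size described above is simple enough to show directly. The sketch below assumes that the "number of samples" is the total number of observations across both groups being compared; the function names and the example z-statistic are illustrative and are not values reported in this study.

```python
import math


def effect_size_r(z_statistic, n_samples):
    """Effect size r = |z| / sqrt(N) for a rank-based test."""
    return abs(z_statistic) / math.sqrt(n_samples)


def classify_r(r):
    """Label r using the thresholds stated in the text."""
    if r < 0.3:
        return "small"
    if r <= 0.5:
        return "medium"
    return "large"


# Illustrative values only: z = -4.2 from a comparison with N = 400 samples.
r = effect_size_r(-4.2, 400)
label = classify_r(r)
```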

Figure 1. Bland-Altman plots of the mean of measured and estimated stature vs the difference (error), in cm, between the methods for each MASH equation for white (blue circles) and Black (black circles) males (columns 1 and 2) and females (columns 3 and 4). Mean bias errors (solid line) with upper and lower limits of agreement (LOA_U, LOA_L; dashed lines) are presented where values are mean ± 95% confidence interval. https://doi.org/10.1038/s41598-024-56597-y

Table 2. Body segment lengths, in cm, and proportionality calculated as body segment length to stature ratios. Values are mean ± 1 standard deviation. *Significant difference (p ≤ 0.001) between sex-specific Black and white groups.
stature for each MASH equation where Connick et al.

Table 4. Correlation coefficients (R²) and error (RMSE and Bias) values of the MASH equations compared to measured stature for Black and white males and females. 95% confidence intervals and 95% tolerance intervals for bias errors with coefficients of variation are also presented. MASH maximum allowable standing height, RMSE root mean squared error, ES effect size, CI confidence interval of the mean, TI tolerance interval, CV% within-subjects coefficient of variation. *A significant difference in Bias between sex-specific Black (B) and white (W) groups for a predictive equation. Bias values are mean ± 1 standard deviation.

Table 5. Summary of MASH equations and their validity. MASH maximum allowable standing height.

but reasonable for use Invalid, recommend new equation development
This study is limited to investigating Black and white populations from the United States; thus, it is recommended to validate MASH equations with additional racial and geographical populations. The stature and limb proportions of military personnel used in this study may not represent elite athletes. On average, our populations were shorter than Canda's and Connick et al.'s white athletes, but taller than Connick et al.'s Asian Japanese athletes