Introduction

In the United States, major racial disparities in health occur across the lifespan resulting in higher mortality rates among non-Hispanic black versus non-Hispanic white Americans.1,2 Perinatal disparities are among the most striking; black infants are three times more likely than white infants to die in the first year of life.3 The largest contributor to infant mortality differences between black and white infants is the wide racial disparity in preterm birth (PTB) rate, which is 50% higher among black women.4,5,6 As molecular assays generate large-scale biomarker data to profile genomes, epigenomes, metabolomes, immunomes, and microbiomes for individualized precision medicine approaches, interest has grown in whether these “omic” assays can shed light on differences in the pathophysiology of health conditions,7 including PTB, by race. However, before determining that variation in these molecular profiles is on the causal pathway to racial disparities in health, potential pitfalls and strategies to avoid them should be considered.

The basic pitfall is to assume that black race is a genetic construct and thus not modifiable.8 The assumption that race genetically determines disease susceptibility arises from extrapolations beyond two facts: first that continental ancestral genetic patterns are somewhat associated with self-identified race; and second, that Mendelian diseases such as sickle cell disease track (although incompletely) along racial lines. Neither of these facts warrant the assumption that race is an unmodifiable biologic construct. That false assumption ignores the large body of literature demonstrating that race (as opposed to ancestry) is largely a social construct with biologic impacts that affect disease risk. The study of genetics to determine susceptibility or resilience to a causative exposure is important. However, applying genetics to the social construct of race is problematic. Using self-identified race will misclassify the genetics of many people who do not map exactly to ancestral patterns that are most common in their self-identified racial group. Disaggregating self-identified race, which may change susceptibility to disease due to societal context, from ancestral patterns of genetic sequence is critical to truly understanding why disparities exist. This is especially so in complex, multifactorial diseases, such as PTB, that reflect variation in lived experiences between racial groups.9 This is demonstrated by international studies where the consequences of race vary by society. For example, McKinnon et al. showed that black women living in Canada have lower PTB rates than black women living in the United States (8.9% vs 12.7%, respectively).10 Better birth outcomes among foreign-born black women in the US compared to US-born black women11,12 highlight that the genetics that determine skin color do not determine adverse birth outcomes. Hence, concurrent differences in continental ancestral genetic sequences and differences in PTB risk by race do not mean that genetic sequence causes differences in PTB risk.

However, precision medicine approaches extend beyond genetic sequence analysis. Assays that measure multiple biomarkers can now be used to interrogate several key steps in the gene–environment interaction: transcription (transcriptome),13,14 translation (proteome),15 regulation of gene expression (epigenome),16 host–microbial interactions (immunome and microbiome),17 exposures and their metabolites (metabolome),18,19,20 and their combined synergies (multiomics).21 Results of these assays could highlight processes by which exposures lead to increased risk of PTB (mechanistic discovery of causal pathways) or may generate biomarkers of risk of PTB (useful for prediction). These two potential applications (mechanism identification and prediction) could be useful for developing therapeutic targets and, diagnostic tests, as well as identifying populations at particularly high risk in whom interventions might be most important. If omic data are to be used to explain the biology underlying disparities in PTB, they must be involved in the causal pathway leading to PTB, especially among black women. A conceptual model (Fig. 1a) shows how omic assays may shed light on how exposures or ancestral sequences could lead to differential biomarker results by race, with the caveat that the relationship between omic biomarkers and outcomes may be confounded by race (Fig. 1b). We describe below how omic assays common in “precision medicine” or “precision public health” may, or may not, be useful to understand and ultimately ameliorate racial disparities in perinatal health.

Fig. 1
figure 1

Conceptual framework of how race could lead to differential omic biomarkers and be associated with racial disparities in preterm birth (a) and how the relationship between omic biomarkers could be confounded by race-associated factors (b)

Reducing racial disparities in PTB at a population level

Resolving racial disparities in health will require two phases of discovery. The first is to determine why there are population differences in risk by race in order to identify causal factors for groups of women at risk (primary prevention). The second is to identify populations at risk and intervene on these populations in the biomedical setting to prevent PTB (secondary prevention).

We first consider how population differences in harmful exposures could lead to racial disparities in health outcomes, such as PTB. There are two potential ways: (1) differences in prevalence of external and universally potent causal factors or (2) differences in susceptibility to causal factors.

Exposure differences by race: the case of universal potency but differing prevalence

One risk factor with similar impacts on black and white women, but which is more prevalent among black women is a short inter-pregnancy interval. The association of intervals <6 months, compared to 18–23 months, has been shown to elevate PTB risk (odds ratio (OR) = 1.22, 99% confidence interval (CI) = 1.11–1.35) even in a within-family study,22 which avoids confounding by variables that remain constant between pregnancy.23 In the United States, a study of >4.8 million births examined inter-pregnancy interval examined the odds of low birth weight (<2.5 kg) within racial/ethnic strata in the United States from 1989 to 1991. Among births with a preceding inter-pregnancy interval of <6 months compared to intervals >12 months, higher odds were observed for PTB in non-Hispanic black (OR: 1.64, 95% CI 1.60–1.68) and white (OR: 1.67, 95% CI 1.63–1.70) women,24 demonstrating similar rates in each racial/ethnic group. The etiology of any disparity resulting from short inter-pregnancy intervals, therefore, results from the higher prevalence of short inter-pregnancy interval among black women (10.5%) compared with white women (5.7%). In a separate analysis, investigators found that the potential impact of increasing inter-pregnancy intervals to 18–23 months would reduce the excess PTB by up to 8% among black and 4% among white women.25

Population differences by race: the case of differential susceptibility due to social factors

An example where women may have differential susceptibility to causal factors according to self-identified race may be the association of air pollution and PTB. While all people are susceptible to air pollution, it may be that other exposures that track with self-identified race, specifically socioeconomic disadvantage, may increase susceptibility to the adverse effects of air pollution.26,27,28 Heightened susceptibility is not necessarily a genetic predisposition but could occur due to stress-induced immunologic alterations that alter the body’s ability to handle an additional physical stressor—analogous to increased susceptibility to viral infections during periods of stress.29 It is known that black families are more likely than white families to have lower socioeconomic position as well as experience discrimination and stress.30,31,32 Hence, black women may have increased susceptibility to the health effects of air pollution, including PTB. A recent study of 53,843 births in California demonstrated that PTB risk was associated with environmental pollution especially in areas of low socioeconomic status.28 Given that air pollution exposures are higher among urban black communities33,34,35,36 than white communities (different doses) and that black women are more likely to be of low socioeconomic position that might heighten susceptibility to air pollution, reduction in air pollution overall could lead to decreased racial disparities in PTB. In fact, there has been some recent evidence that closures of coal power plants in California reduced PTB rates in surrounding areas, particularly among minorities.37 However, no study has assessed the causal pathway between air pollution reduction and racial disparities in PTB.

Taken together, identifying causal factors either with higher prevalence or heightened susceptibility among black families because of the association between race/ethnicity and additional social factors may help to reduce racial disparities in PTB. However, as physicians and patients, we cannot wait for policy efforts to reduce harmful exposures to go into effect. We need to find ways to help individual women maximize the chance at a healthy, full-term birth. It is here that the promise of “precision medicine” is so exciting.

Precision medicine for the prediction of an individual’s risk of PTB

Currently, providers’ ability to predict which women will have PTB and which women will have a healthy, full-term delivery is poor. An ideal test in the prenatal period would enable obstetricians to identify which women will deliver early, and once identified, providers would then offer an effective therapy to prevent PTB. Imperfect interventions38,39 currently are: women with prior PTB (at high risk of a subsequent PTB) are offered progesterone,40 and if found to have a short cervix, they are additionally offered a cerclage41 and women without a prior spontaneous PTB (sPTB) but with a short cervix are offered progesterone.42,43 For medically indicated PTB due to conditions such as preeclampsia or intrauterine growth restriction, providers may prescribe aspirin, but it is only mildly protective. The bottom line is that we still do understand the pathogenesis of PTB and thus are limited in strategies to improve birth outcomes.

Precision medicine research has the potential to improve the landscape of pregnancy care. The goal of these studies is, first, to find biomarkers that identify individual women who are at particularly high risk of PTB and, second, to identify therapeutic targets to prevent it. Yet, to date, no panel of biomarkers informs an individual woman and her physician of her percentage of chance of PTB. However, such advances may be possible. In a small study, Ngo et al. studied circulating cell-free RNA using RNA sequencing among women at risk for sPTB [training dataset of women with preterm contractions (n = 15) and validation cohort with prior sPTB (n = 23)] and found that the model performed moderately well.44 The model misclassified 1 out of the 5 sPTB deliveries, and 3 of the 18 full-term deliveries (area under the receiver operating characteristic curve 0.81). A recent study analyzed the cervicovaginal microbiome and a protein involved in innate immunity, beta defensin 2, in 549 women (107 sPTB and 432 term) at three time points in pregnancy. This study showed that a microbiome lacking lactobacillus dominance (community state type IV-B) early in pregnancy is associated with sPTB; 32% of women with sPTB compared to 21% of women with term birth had community state type IV-B (p = 0.017).17 This study also showed that beta defensin 2 levels were lower among women, specifically black women, who went on to have sPTB compared to women with term births. This  highlights the complex balance between the microbiome and the immune system to modulate PTB risk. Hence, combining low beta defensin 2 with low relative abundance of lactobacillus appears to increase sPTB risk. While black women were more likely to have a higher-risk microbiome with respect to sPTB, they also had higher levels of beta defensin 2 that mitigated that risk. The authors propose that these findings could lead to “innovative therapeutic opportunities to prevent sPTB including combination of microbiome-based therapeutics and immune modulators.” However, while associations between the microbiome and immune status are interesting, with respect to prediction, we may require more data (larger studies with more combinations of variables to identify the most informative markers) so as to avoid false positives and false negatives that are common in epidemiologic frameworks that assess risk as opposed to prediction. Nonetheless, if the microbiome and immune status were therapeutic targets that would be useful among black women, such approaches could potentially reduce disparities. These two studies highlight the two promises of precision medicine, respectively: (1) to predict and (2) to identify molecular targets that could be used to prevent an individual from having sPTB.

Limitations of precision medicine methods to understand population’s risk of PTB—and how to improve them

Genomic approaches

Even though omic approaches are not yet ready to predict an individual woman’s risk of PTB, omic data may shed light on mechanisms by which risk factors lead to differential risk of PTB among populations and thus racial disparities. Genome-wide association studies (GWAS) have been limited by several factors. The first is under representation of minority women. A striking example was a large study of 43,568 women to examine genetic variants associated with sPTB.45 In this study, six single-nucleotide polymorphisms (SNPs) were associated with sPTB that replicated in a combined dataset of three Nordic cohorts (n = 8643). However, data were obtained from 23andMe®, a company using customers’ DNA to provide individuals with genomic analysis of ancestry and other genetic traits for a fee, and restricted to only participants with >97% European genetic ancestry. Restriction to these women could have been because of under representation of other ancestral groups using 23andMe®, the availability to replicate with a European cohort, or because of a desire to analyze a more homogenous population for the purpose of isolating effects of SNPs, but reasons are not stated. Nonetheless, data restricted to one ancestral group cannot be used to determine the extent to which disparities in PTB are due to genetic variation between ancestral groups (or racial groups that do track, though incompletely, with self-identified race).

In studies with more diversity in ancestral groups, often there remains residual under representation of participants of non-European descent. For example, in a study of 791 family trios with 270 PTBs, there were just 14 (5%) PTBs among women of African Ancestry. This is too few to determine whether ancestral genetic variation contributes to population differences in PTB risk.46 In another large GWAS of sPTB, 1349 cases were ancestry-matched to full-term controls (based on genomic fetal signals).47 The investigators then performed stratified analyses within ancestral groups to identify genes within groups associated with sPTB. They found only two SNPs in intergenic regions, one within the ancestral group from Africa and one within the ancestral group from the Americas that were associated with sPTB. However, this approach cannot determine the extent to which ancestral genetic patterns are responsible for racial differences in sPTB. Notably, these SNPs failed to replicate in several independent cohorts47 and were not reported in prior GWAS analyses of PTB.45,46 In summary, the contribution of genetics to PTB remains an open question and is even less well established as an explanation for racial disparities.

Epigenomic approaches

In various disease states, epigenomic signals may result from exposures that change epigenomic biomarkers. Importantly, it is thought that epigenomic changes are potentially more modifiable than genetic sequences. However, it can be a mistake to assume that these epigenomic changes are on the causal pathway to disease. We believe that one epigenetic mechanism, DNA methylation, provides an illustrative cautionary tale. It is an example of how epigenetic assays may not only help understand physiology but also to mistake DNA methylation as in the causal pathway linking exposure to health outcome. Most human epigenetic studies are observational, but one blinded, randomized cross-over study of air pollution (an exposure associated with PTB48) evaluated blood pressure and DNA methylation in peripheral blood.49 Fifteen healthy participants participated in the trial in which they were exposed to odorless concentrated ambient particles (akin to levels in highly polluted cities) or medical air. Exposure to the concentrated particles resulted in significantly lower DNA methylation of repetitive element DNA (Alu) as well as the TLR4 gene. Although not an epigenome-wide study, short-term exposure to pollution caused measurable changes in DNA methylation and air pollution caused higher systolic blood pressure. The investigators also found an association (not necessarily causal) between lower DNA methylation and higher blood pressure. Hence, this elegant hypothesis-generating study demonstrated causality of air pollution and DNA methylation changes (as well as air pollution and changes in blood pressure). Yet, it does not determine whether the observed DNA methylation changes are causally responsible for differences in blood pressure. In other words, DNA methylation might be the biologic process that connects air pollution to elevated blood pressure; or alternatively, there could be other, unmeasured mechanisms along the causal pathway that confound the relationship between DNA methylation and blood pressure (Fig. 2). In the latter case, DNA methylation would be a marker of air pollution but not a therapeutic target to lower blood pressure. One mechanistic hypothesis is that air pollution could cause differences in leukocyte demargination from the bone marrow and different leukocyte populations in the peripheral blood, which by definition have differential DNA methylation. If the mechanism were activation of adrenergic pathways or hormonal, this could lead to higher blood pressure independent of the changes in DNA methylation. This illustrates that even in well-controlled human studies of exposure, biomarker, and outcome, it can be difficult to demonstrate that biomarkers are on the causal pathway. This is particularly relevant with respect to sPTB where peripherally circulating biomarkers may or may not be relevant to reproductive organs. Nonetheless, even if biomarkers are not on the causal pathway, they can still serve as markers of exposures or risk of outcomes, but they may not be useful targets of therapeutic agents.

Fig. 2
figure 2

Example of how an exposure (air pollution) might both cause a change to a biomarker (DNA methylation) and an outcome (blood pressure) yet not be on the causal pathway. Here an unmeasured biologic process acts as a mediator of the pollution–blood pressure relationship and as a confounder in the methylation–blood pressure relationship. Solid lines established, dashed lines hypothesized

When considering whether DNA methylation differences represent biologic pathways by which racial disparities occur, there are additional challenges to consider. Consider for a moment a theoretical molecular epidemiologic study of race, DNA methylation, and PTB. One might find in this study that DNA methylation differs by race. If similar to national data,4 this theoretical study would likely find black women have extreme PTB (<28 weeks of gestation) rates that are three times as high as white women. Assume lastly, DNA methylation is found to be associated with PTB. One might conclude that differential DNA methylation is causally related to racial disparities in PTB, but such a conclusion would be premature. There are potential alternative explanations. The first is that DNA methylation is not causally related to PTB but is simply a passive marker of SNP frequencies at potential methylation sites (CpG sites in which cytosines are followed by guanines in the DNA sequence) that vary by continental ancestry, which may be somewhat correlated with race (genetic confounding), but that some other mechanism is contributing to differences in PTB (akin to Fig. 2 but substituting black race for air pollution and PTB for blood pressure). The second alternative explanation is that a diffrerent co-exposure, an unmeasured confounder (such as air pollution exposure, which is higher, on average, among black families33,34,50) leads to differential DNA methylation that is causally involved in the pathophysiology of PTB (classical confounding). Neither self-identified race nor the ancestral DNA patterns that are more common among black women would have been the causative etiology of differences in DNA methylation. Race was perhaps confounded by socioeconomic position that led to a complex set of exposures including air pollution, which was the true cause of DNA methylation differences. In other words, doing expensive omic assays does not avoid the traps of traditional epidemiologic studies such as confounding.

Concern about confounding in molecular epidemiologic studies of PTB is not new. The most striking example of a biomarker that is strongly associated with sPTB but is not a useful therapeutic target to reduce sPTB is a Nugent score51 positive for bacterial vaginosis (BV). Trials of antibiotics therapy are only successful at treating BV but do not reduce the risk of sPTB,49 suggesting that BV may not be causally related to sPTB. Making the leap that since BV is more common among black women52 and that BV is associated with sPTB,53 that BV plays a causal role in racial disparities sPTB, may be misguided given the lack of efficacy of treating BV in preventing sPTB. Simply treating BV will not reduce racial disparities in sPTB.

Potential solutions to omic studies of racial disparities in PTB

Whether a single biomarker for a condition like BV or an omic signature, such as the epigenome or microbiome, is measured, to determine the extent to which a biomarker might explain a race–PTB association, causal mediation analyses are needed.54 While rarely employed in birth outcome studies, studies of aging populations have demonstrated that DNA methylation may mediate the smoking–lung function relationship.55 Causal mediation methods can quantify the direct and indirect (through a proposed mediator) effects between an exposure and an outcome. Such approaches will be necessary to determine the extent to which omic assay results that differ by race explain disparities in PTB risk. In order to perform formal causal mediation analyses, sufficient sample sizes with variation in the distribution of the biomarker and in the incidence of the outcome across participants of all racial groups is required. With respect to the recent cervicovaginal microbiome study demonstrating associations of the microbiome with both race and sPTB, the study design included frequency matching of sPTB cases to term controls by race, eliminating the racial disparity in sPTB in the analytic dataset.17 This approach was important for the primary goal of analyzing the main effect of the microbiome and sPTB but precludes a mediation analysis of the microbiome as a mediator between race and sPTB. Race-stratified analyses in which a biomarker is associated with PTB among African Americans allows for the possibility that the mediator is involved in racial disparities. However, both causal mediation and race-stratified analyses do not completely avoid the pitfalls of observational studies; confounding can still occur. Interventional animal studies and eventually human intervention trials would be required to demonstrate that a mediator were causally involved in sPTB before concluding that differences by race are involved in racial disparities in sPTB. Such trials should target modifying the omic signatures, and even more likely combinations of omic signatures that may work in concert, and should have as a primary outcome sPTB rates. Lastly, while there are high rates of sPTB among African Americans, the vast majority of women deliver full-term, healthy infants. Identifying omic signatures that mediate resilience in spite of experiences of discrimination or physical toxic environmental exposures may help to identify therapeutic interventions to bolster biologic resilience.

Conclusion

Omic assays common in “precision medicine” and “precision public health” have the potential to risk-stratify individuals for PTB and to elucidate mechanisms by which exposures cause PTB. If biomarkers do represent pathways by which PTB occurs, they may be appropriate therapeutic targets to prevent PTB. Such biomedical interventions do not solve social and environmental injustices that may be the underlying causes of disparities but are likely simpler. Studies that attempt to explain racial disparities in PTB with omic assays need to have adequate representation and sufficient sampling of women across racial groups. However, observational omic studies are affected by all of the same challenges (confounding, bias) that affect observational epidemiologic studies more generally. In order for biomarkers from omic studies to be useful at the individual level, they must perform well as predictors (as opposed to simply being statistically significantly associated with exposures or outcome) for clinical intervention. In order for omic assays to be helpful in reducing racial disparities in PTB at the population level, they either need to be good biomarkers of exposures that are causal and track by race (thus motivating policies to reduce these exposures) or they need to be specifically modifiable among black women so as to reduce the excess risk of PTB among black women. Lastly, omic assays, when their results differ by race, should not be assumed to be innate biologic, genetic differences between races. While they may be subtly affected by SNPs that differ by ancestry, differences in omic signatures may also be due to differential exposures to toxic exposures by race due to longstanding inequity in the United States. Determining why they differ and whether there are ways to mitigate their effects represents a new frontier in tackling racial disparities in birth outcomes.