Introduction

In contrast with full-term infants, those born before 28 weeks’ gestation frequently fail to close their ductus arteriosus (DA) after birth. Persistent DA patency alters cerebral, mesenteric, and renal blood flow, impairs pulmonary mechanics, increases the risk of pulmonary hemorrhage, and prolongs the need for mechanical ventilation. Prior studies have shown that immature gestation, absence of antenatal glucocorticoid exposure, and mother’s self-identified race are the most consistent independent risk factors for identifying preterm newborn infants who fail to close their patent ductus arteriosus (PDA) either spontaneously or with inhibitors of prostaglandin production like indomethacin and ibuprofen.1,2,3,4,5 Both immature gestation and absence of antenatal betamethasone decrease the expression of a wide range of DA genes involved in oxygen-induced constriction (e.g., calcium channels, potassium channels, and endothelin signaling), contractile protein maturation, prostaglandin- and nitric oxide-mediated relaxation, and tissue inflammation and remodeling.5,6,7

There is growing evidence from monozygotic twin studies that genetic risk factors may act in concert with gestational age to alter the ability of the DA to close in preterm infants.8,9 We previously identified several single-nucleotide polymorphisms (SNPs) in genes encoding transcription factor AP-2 beta (TFAP2B, the gene mutated in Char syndrome) and prostacyclin synthase (PTGIS), which are associated with isolated (non-syndromic) PDAs in preterm infants.10 PTGIS and its vasodilatory lipid product, prostacyclin (PGI2), play an important role in maintaining preterm DA patency.11 Similarly, TFAP2B, a transcription factor that regulates endothelin, hypoxia inducible factor 2-alpha (HIF2 alpha), and calponin, plays an important role in DA smooth muscle development.10,12,13 We previously examined one of the TFAP2B polymorphisms (SNP rs2817399(A)) that has been associated with persistent DA patency, for its effects on human fetal DA gene expression and found that it decreased several of the same calcium- and potassium-channel genes previously shown to be involved with oxygen-induced constriction of the DA.6

In contrast with our findings, two subsequent epidemiologic studies14,15 failed to find an association between the same SNPs we identified in TFAP2B and PTGIS and alterations in DA closure. Although differences in both the definition of PDA and the strategies used to treat the PDA might account for the discordant results between studies, another explanation might be the significant differences in genetic ancestry among the study populations. Ninety percent of mothers in our original Iowa-based, single center study self-identified as White/European ancestry.10 In contrast, 50 and 0% of the populations in the subsequent two studies self-identified as European ancestry.14,15 In addition, the Iowa study utilized a family-based approach, which is less susceptible to the effects of population stratification compared to the case–control design used in the latter studies.

We designed the following study to determine whether the PDA-associated SNPs in TFAP2B and PTGIS that we previously identified are indeed associated with unique alterations in gene expression. Our goal was to test the reproducibility of our prior findings in fetal DA obtained from a population with diverse genetic ancestry and to expand the list of genes that might be affected by the TFAP2B and PTGIS polymorphisms. We hypothesized that an interaction exists between the fetus’s genetic ancestry and the SNPs in TFAP2B and PTGIS such that the effects of the SNPs on gene expression only occur in DA with European genetic ancestry.

Methods

We used de-identified DNA and RNA samples, collected as part of a prior study,7 to determine whether common genetic variants in TFAP2B and PTGIS, which have been associated with a PDA in preterm newborns, are associated with unique patterns of gene expression in the human fetal DA. The study was reviewed by the Institutional Review Board of the University of California San Francisco and given an exempt status.

Tissue

Human tissue was obtained under the oversight of the Institutional Review Board at University of California San Francisco. Mid-gestation (130/7–236/7 weeks) human fetal DA and ascending aorta were collected from elective pregnancy terminations in healthy women with no known fetal abnormalities. Consent for the use of fetal tissue for research purposes was obtained by the clinic staff, who had been trained in human subjects’ protections. The consent for the use of fetal tissue for research purposes is separate from the consent for the clinical procedure. Researchers have no patient contact and only receive de-identified tissues.

Prostaglandins were not used during the terminations. Cervical ripening was performed with laminaria (compressed seaweed). Fetal tissue was immediately submerged in calcium- and magnesium-free phosphate-buffered saline at 4 °C following delivery. The DA and aorta were dissected in the chilled buffer solution and the isolated DA and aorta were snap frozen in liquid nitrogen (between 1.5 and 2 h after delivery). Gestational age was determined by fetal foot length.16 De-identified tissues were individually labeled and stored for later analysis. Individual samples were analyzed in “batches” of 90 samples. There was no “pooling” or combining of tissues during the analyses.

During the period of the study, women who donated tissue self-identified their racial origins to the clinic staff as White/European ancestry = 21%, Non-White/Non-European ancestry = 76%, and unknown = 3%. The data on self-reported racial origins were available solely as a population-level statistic. Individual descriptors were not linked to de-identified tissues samples. No clinical information was available for analysis.

Preparation of total RNA, reverse transcription, and quantitative PCR

We examined the RNA expression of 49 “DA closure genes” in each of the 273 human DA samples (Table 1). The “DA closure genes” were chosen because: (1) their expression in the DA has previously been shown to differ from their expression in the aorta, and (2) their mutations or polymorphisms (or their pharmacologic inhibition) has been shown to affect DA closure (see refs. 7,6 for references for “DA closure genes”).

Table 1 Multivariable regression models examining the independent effects of gestational age and non-European genetic ancestry on the RNA expression of “ductus closure genes” (n = 273).

Total RNA was isolated from each individual DA and cDNA was generated as described elsewhere.6,17 We used the TaqMan Universal PCR master mix of PE Applied Biosystems (Foster City, CA) to quantify gene expression in a 96-well format. TaqMan probes were designed using the Primer Express program and labeled with fluorophores FAM (6-caboxy-fluorescein) and TAMRA (6 carboxy-tetramethyl-rhodamine) as reporter and quencher dyes, respectively. An ABI PRISM 7500 Sequence detection system was used to determine the cycle threshold (CT). Reactions were carried out in triplicate. Data were analyzed using the Sequence Detector version 1.6.3 program. The degree of expression of the gene of interest was determined using the relative gene expression method. Malate dehydrogenase (MDH) was used as an internal control to normalize the data.6,18 ΔCT represents the difference in cycle threshold (CT) between the expression of the housekeeping gene (MDH) and the gene of interest. Each unit of ΔCT represents a twofold change in mRNA levels. The more negative the ΔCT, the fewer the number of starting copies of a gene’s mRNA.

DNA genotyping of fetal ductus arteriosus to determine the presence or absence of several TFAP2B and PTGIS SNPs as well as to infer genetic ancestry

DNA was extracted from the ascending aorta of each of the fetal samples using the QIAamp DNA mini kit (Qiagen Inc., Valencia, CA). DNA was quantified spectrophotometrically. Allelic variation was determined by using the TaqMan genotyping system (Applied Biosystems, Foster City, CA), as previously described.19 Allele scoring was performed using the Sequence Detection Systems 2.2 software (Applied Biosystems).

We examined the DNA for the presence of several SNPs in the TFAP2B and PTGIS genes that have been associated with altered DA closure in preterm infants (Fig. 1)10 (Dagle et al., unpublished results). Specifically, we examined the DNA for four SNPs in TFAP2B (rs2817399: (A allele); rs987237: (G allele); rs760900: (C allele); rs2817416 (C allele)) that have been associated with delayed DA closure (even in the presence of indomethacin). We also determined the presence of two TFAP2B SNPs that are unrelated to the timing of DA closure (rs2817419: G allele; and rs2635727: T allele). In addition, we examined one haplotype combination of two neighboring SNPs in the gene PTGIS (rs493694 (G allele) and PTGIS rs693649 (A allele)) that is negatively associated with PDA, suggesting a preventative effect of the allele combination against PDA.10

Fig. 1: Gene structure of human TFAP2B and PTGIS and location of polymorphisms.
figure 1

The TFAP2B polymorphism rs2635727 is located further to the right of the gene and is not shown on the figure. In contrast to PTGIS, the intronic regions of TFAP2B show many areas of conservation, some with consensus transcription factor-binding sites. Figures were obtained from the UCSC Genome Browser. Note that the two genes are represented in opposite orientations and not to the same scale. The exons of each gene are shown as the thick lines at the top of the figure. TFBS = conserved transcription factor-binding sites; conservation estimated sequence = conservation of genetic loci among 100 vertebrate species (PhyloP).24

To identify the genetic ancestry of the fetal tissues, we examined four genes whose sequence polymorphisms are distributed along genetic lines between European ancestry and Non-European ancestry (African, Chinese, and Japanese) populations.20,21,22 These include SLC24A5 SNP rs1426654 (100% European =A allele; 98% Non-European = G allele)20,21; SLC45A2 SNP rs16891982 (100% European = G allele; 100% Non-European = C allele)20,21; DARC SNP rs2814778 (100% European = A allele; 85–100% Non-European = G allele)22; and HERC2 SNP rs12913832 (79% European = G allele; 100% Non-European = A allele).21 Each fetal sample possessed between 0 and 8 of the Non-European ancestry alleles. Although we recognized that the clinic’s population data for race was an imperfect proxy for genetic ancestry, we used the clinic’s statistics to help create a definition for European and Non-European ancestry. Since 76% of the women who donated tissue self-identified as Non-White/Non-European ancestry and 83% of the samples had two or more Non-European ancestry alleles (see “Results”), we defined tissues with two or more Non-European ancestry alleles as “Non-European origin” and those with zero or one Non-European ancestry allele as “European origin” (The bold lettering is to highlight the only change in the SNPs).

Statistical analyses

Stata software (Release 16.1; StataCorp LP, College Station, TX) was used for all statistical analyses. We used multivariable linear regression to build statistical models that adjusted for possible confounding effects of gestational age and genetic ancestry on the relationship between a SNP, or 2-SNP haplotype, and the change in RNA expression of each of the 49 “DA closure genes” (represented by their ΔCT). The multivariable models were analyzed using generalized estimating equations techniques to account for clustering within each of the 90 sample “batch” assays. Coefficients derived from these models were interpreted as the difference (positive or negative) between the RNA expression in the presence of the SNP in the study population and that in the absence of the SNP while holding gestational age and genetic ancestry constant. Models were run individually for each of the 49 “DA closure genes”.

To determine if an SNP’s effect on RNA expression differed depending on whether it occurred in a European ancestry or non-European ancestry DA, we added an interaction term to the model (between the SNP in question and genetic ancestry) and reran the regression.

Our study was an exploratory study designed to identify “DA closure genes” that might be altered by the presence of common genetic variants. Because of its exploratory nature, we considered any association between an SNP and a change in gene expression as possible evidence of association if the regression coefficient for RNA expression had a p value < 0.1. Our purpose was to decrease the likelihood of missing true positive signals, knowing that false-positive signals will inevitably be present.

Results

We analyzed 273 fetal DA samples in the current study (gestational age = 19.8 ± 2.9 weeks (m ± s.d.)). Seventeen percent of the samples had zero or one non-European ancestry allele and were assigned as European ancestry. The allele frequencies of the TFAP2B polymorphisms associated with an increased incidence of PDA were as follows: rs2817399 (A allele) = 81%; rs987237 (G allele) = 37%; rs760900 (C allele) = 89%; and rs2817416 (C allele) = 25%). The frequencies of the TFAP2B polymorphisms that are unrelated to the timing of DA closure were: (rs2817419 (G allele) = 42% and rs2635727 (T allele) = 38%). The frequency of the PTGIS haplotype of two neighboring SNPs that is negatively associated with PDA was: (rs493694 (G allele)/rs693649 (A allele) = 21%).

We created multivariable linear regression models to determine the independent effects of gestational age, genetic ancestry, and the SNP alleles on RNA expression of the 49 “DA closure genes”. As previously reported, advancing gestational age was independently associated with changes in RNA expression for the majority (92%) of the “DA closure genes” (Table 1). In contrast, genetic ancestry was only consistently and independently associated with RNA expression in 2 genes: PTGS2/COX2 (cyclooxygenase 2) and SLOCA2A1 (the prostaglandin transporter which regulates prostaglandin reuptake) (Table 1).

Our main objective was to identify “DA closure genes” that are modified by the TFAP2B and PTGIS SNPs that have previously been shown to alter DA behavior: rs2817399 (A allele), rs987237 (G allele), rs760900 (C allele), and rs2817416 (C allele). In our initial examination of the general population of 273 samples, we found no consistent independent association between the TFAP2B SNPs associated with delayed DA closure and alterations in RNA expression for any of the “DA closure genes” (Table 2—General population).

Table 2 Multivariable regression models examining the independent effects of TFAP2B SNPs (associated with persistent PDA) on the RNA expression of “ductus closure genes” in second trimester human ductus (n = 273).

However, when we tested whether an interaction occurred between the fetus’s genetic ancestry and the same PDA-associated TFAP2B SNPs, we found that several of the “DA closure genes” had consistent, independent changes in gene expression when the SNPs occurred in samples with European ancestry. At least three of the four TFAP2B SNPs were associated with changes in expression in each of the following genes: EPAS1 (HIF2 alpha), CACNB2 (Cavbeta2 calcium channel subunit), ECE1 (endothelin converting enzyme), KCNA2 (potassium channel Kv1.2), ATP2A3 (SERCA, sarcoplasmic reticulum Calcium-ATPase), EDNRA (endothelin A-receptor), EDNRB (endothelin B-receptor), BMP9 (bone morphogenetic protein-9), and BMP10 (bone morphogenetic protein-10) (Table 2—European ancestry). None of these changes were seen when the same SNPs were examined in the samples with Non-European ancestry (Table 2—Non-European ancestry). Nor were the same changes observed when the two TFAP2B polymorphisms that are unrelated to the timing of DA closure (rs2817419 (G allele) and rs2635727 (T allele)) were examined in samples with European ancestry (Table 2—European ancestry/TFAP2B (Non-PDA-associated polymorphisms)).

A similar phenomenon occurred when we tested whether an interaction occurred between the fetus’s genetic ancestry and the 2-SNP haplotype of PTGIS that is negatively associated with PDA (rs493694 (G allele)/rs693649 (A allele)). When the PTGIS haplotype was present in samples with European ancestry, the haplotype was associated with changes in RNA expression in several “DA closure genes” (the most significant change occurring in PTGIS itself) (Table 3).

Table 3 Multivariable regression models examining the independent effects of the PTGIS SNP haplotype rs493694(G)/rs693649(A) (associated with early ductus closure) on the RNA expression of “ductus closure genes” (n = 273).

Discussion

Premature infants born to mothers who self-identify as White/European ancestry are less likely to close their PDA following prostaglandin inhibition than infants born to mothers who self-identify as Non-White/Non-European ancestry.1,2,3,4 This difference does not appear to be due to different rates of indomethacin/ibuprofen metabolism or different serum prostaglandin E2 concentrations.1,2,3,4 Our current study demonstrates that genetic ancestry is associated with changes in the expression of several “DA closure genes”. This occurs through a direct association between genetic ancestry and a limited number of “DA closure genes” (SLCO2A1 (the prostaglandin transporter) and PTGS2 (cyclooxygenase 2)) (Table 1), as well as through a broader, indirect, interactive effect, where genetic ancestry modifies the associations between common genetic polymorphisms and DA gene expression.

We previously identified several polymorphisms in the genes PTGIS and TFAP2B that were associated with different rates of PDA closure in a population composed primarily of preterm infants with European genetic ancestry.10 These associations were not replicated by other investigators using populations with different or more diverse genetic origins.14,15 In line with these discordant observations, our current study found consistent associations between PTGIS and TFAP2B polymorphisms and the expression of “DA closure genes” in DA with European genetic ancestry. On the other hand, no consistent positive or negative associations could be found in our genetically diverse DA population unless an interaction between the polymorphisms and genetic ancestry was taken into account (Tables 2 and 3).

In DA with European genetic ancestry, the PTGIS haplotype (rs493694 (G allele)/rs693649 (A allele)), which is associated with early DA closure, was associated with decreased expression of PTGIS itself as well as NOS3 (endothelial nitric oxide synthase, which regulates nitric oxide production) and several other calcium and potassium regulatory genes (Table 3).

Consistent alterations in gene expression were also found when each of the four TFAP2B SNPs (that are associated with persistent PDA) were present in DA with European genetic ancestry. These changes include decreased expression of calcium and potassium signaling genes, as well as decreased expression of genes regulating endothelin and HIF2 alpha (Table 2). It is interesting to note that similar changes in endothelin and HIF2 alpha were previously found in newborn mice with targeted deletions of Tfap2b (the mouse equivalent of TFAP2B).12

To determine whether the changes in DA gene expression were specific for the TFAP2B SNPs associated with persistent PDA, we examined two other TFAP2B polymorphisms, rs2817419(G) and rs2635727(T), which are unrelated to the incidence of preterm PDA (Table 2). Neither polymorphism was associated with the changes in gene expression described above (Table 2).

Our study has several limitations. The tissues were from pregnancy terminations, which may have altered the gene expression in the DA before tissue processing. We explored a limited number of candidate genes and may have missed others that might have been detected by genome-wide association studies or pathway-based analyses. There was also a relatively small number of tissue samples and a low proportion of European genetic ancestry in our study population which may have limited our ability to identify smaller effects in the “DA closure genes” we studied.

Since our investigation was an exploratory study, we chose to consider results with a p value < 0.1 as possible evidence of association. Although applying a more stringent p value would have reduced the chance of finding false-positive signals, it might have eliminated our ability to detect true positive signals, especially when the genetic effects are small. Our finding that at least three of the four TFAP2B SNPs, that were associated with persistent PDA, also were associated with the same changes in expression of several of the “DA closure genes” (EPAS1, CACNB2, ECE1, KCNA2, ATP2A3, EDNRA, EDNRB, BMP9, and BMP10) increases the confidence that these may actually represent true positive results. None of these changes were seen when the two TFAP2B polymorphisms that were unrelated to the timing of DA closure were examined in samples with European genetic ancestry (Table 2).

As an observational study, we cannot distinguish between causation and association. Nor do we know if the changes in gene expression have a direct effect on DA closure, or if they are merely an indirect effect of other events that are responsible for its closure. However, our findings do provide biologic plausibility to the concept that the PTGIS and TFAP2B SNPs are either functional polymorphisms or in tight association with functional polymorphisms that play an active role in regulating DA closure. Since the SNPs we studied are present in haplotype blocks, the actual genetic variations responsible for the associated changes in gene expression could lie anywhere within that block. We speculate that the increased rate of DA closure associated with the PTGIS 2-SNP haplotype rs493694(G)/rs693649(A) may be due to the associated decrease in prostaglandin I2 synthase expression (and a subsequent decrease in the potent vasodilator, PGI2). On the other hand, we have no similar explanation for the changes associated with the TFAP2B SNPs since none of the SNPs appear to alter TFAP2B mRNA levels (Table 2). It is worth noting that the TFAP2B SNPs we examined are situated in unique, highly conserved regions, that are located between exons, and in proximity to a number of putative transcription factor-binding sites (Fig. 1). SNPs in or near a gene can affect both the amount and function of the mRNA or protein produced. We speculate that alterations in these unique, highly conserved, noncoding regions might alter TFAP2B splicing such that transcript levels are normal but the transcripts themselves are abnormal; or, they may have distant effects (possibly through altered transcription factor binding or microRNA production) on gene expression even beyond the TFAP2B gene in which they are located. These findings are consistent with our current understanding that many disease-associated common variants are noncoding and are enriched in DNA regulatory elements.23 Future studies will be needed to determine how these polymorphisms affect the expression of downstream genes.

In conclusion, we found no consistent associations between the presence of polymorphisms in PTGIS and TFAP2B and the expression of “DA closure genes” unless an interaction between the polymorphisms and genetic ancestry was taken into account. When an interaction between the polymorphisms and ancestry was accounted for, the PTGIS and TFAP2B polymorphisms were associated with consistent changes in DA gene expression in DA from fetuses with European genetic ancestry.