FLT1 and transcriptome-wide polyadenylation site (PAS) analysis in preeclampsia

Abstract

Maternal symptoms of preeclampsia (PE) are primarily driven by excess anti-angiogenic factors originating from the placenta. Chief among these are soluble Flt1 proteins (sFlt1s) produced from alternatively polyadenylated mRNA isoforms. Here we used polyadenylation site sequencing (PAS-Seq) of RNA from normal and PE human placentae to interrogate transcriptome-wide gene expression and alternative polyadenylation signatures associated with early-onset PE (EO-PE; symptom onset < 34 weeks) and late-onset PE (LO-PE; symptom onset > 34 weeks) cohorts. While we observed no general shift in alternative polyadenylation associated with PE, the EO-PE and LO-PE cohorts do exhibit gene expression profiles distinct from both each other and from normal placentae. The only two genes upregulated across all transcriptome-wide PE analyses to date (microarray, RNA-Seq and PAS-Seq) are NRIP1 (RIP140), a transcriptional co-regulator linked to metabolic syndromes associated with obesity, and Flt1. Consistent with sFlt1 overproduction being a significant driver of clinical symptoms, placental Flt1 mRNA levels strongly correlate with maternal blood pressure. For Flt1, just three mRNA isoforms account for > 94% of all transcripts, with increased transcription of the entire locus driving Flt1 upregulation in both EO-PE and LO-PE. These three isoforms thus represent potential targets for therapeutic RNA interference (RNAi) in both early and late presentations.

Introduction

Preeclampsia (PE) is one of the three leading causes of premature birth1. Characterized by hypertension and proteinuria in the third trimester, PE complicates up to 10% of pregnancies, and is estimated to result in more than 76,000 maternal and 500,000 infant deaths each year worldwide (www.preeclampsia.org). In the developed world, significant resources are dedicated to the identification and management of preeclampsia, with US health care costs estimated to be 40–100 times higher for early-onset preeclamptic deliveries than for uncomplicated term deliveries2. PE is a highly heterogeneous disease that initiates in the placenta and is likely driven by multiple mechanisms3,4. In particular, early onset PE (EO-PE; defined by symptom onset < 34 weeks) is thought to be clinically distinct from late onset PE (LO-PE; defined by symptom onset > 34 weeks)5,6,7,8,9. EO-PE is thought to be driven by poor placentation during early development, whereas LO-PE is considered to be a more maternal syndrome10.

Although the root causes of PE are not fully understood, it is now well established that the clinical manifestations are due to excess anti-angiogenic proteins, primarily “soluble fms-like tyrosine kinase 1” proteins (sFlt1s) in the mother’s bloodstream11. sFlt1s are truncated forms of the membrane-bound vascular endothelial growth factor (VEGF) receptor FLT1 (aka, VEGFR1) that normally function to buffer placental growth factor (PlGF) and VEGF signaling. sFlt1s contain the extracellular binding domain present in full length or membrane-bound FLT1 (mFlt1), but are secreted because they lack the transmembrane and intracellular tyrosine kinase domains. When sFlt1s are abnormally high in the mother’s circulatory system, they interfere with her body’s ability to respond to VEGF. Among other functions, VEGF is required for maintenance of the hepatic sinusoidal vasculature and other fenestrated vascular beds in the body12. Breakdown of these structures impairs maternal kidney function, leading to hypertension, proteinuria and cerebral edema13,14,15 – classic features of PE and eclampsia.

In mammals, FLT1 is predominantly expressed in the placenta, with human placental Flt1 mRNA levels being 40–50 times higher than those observed in any other adult tissue16 (Fig. 1a). Whereas the full-length isoform predominates in most adult tissues, placental and liver expression are dominated by truncated isoforms due to alternative polyadenylation (Fig. 1b). These alternative polyadenylation sites occur within introns downstream of the exons encoding the extracellular binding domain but upstream of those encoding the transmembrane and intracellular signaling domains17,18,19. Five different sFlt1 mRNA isoforms have been reported to date20. Two (sFlt1-i13-short and sFlt1-i13-long) result from alternative polyadenylation at different sites in intron 13 to yield mRNAs encoding the same 687 amino acid sFlt1 protein isoform, but with either a 17 or 4146 nt 3′-untranslated region (3′-UTR), respectively. Three others (sFlt1-i14, sFlt1-e15a and sFlt1-e15b) are due to alternative polyadenylation within intron 14. sFlt1-e15a and sFlt1-e15b result from activation of two different cryptic splice acceptors (cryptic 3′ splice sites) upon use of alternative polyadenylation sites in the latter half of intron 14. In all, these five sFlt1 mRNAs encode four different sFlt1 proteins with different C-termini.

Figure 1

FLT1 expression and patient selection. (a) Mean RNA-Seq FLT1 gene expression in various human tissues. Expression values are given as Fragments Per Kilobase of transcript per Million reads (FPKM) (n = 2–7; whisker: standard deviation). (b) RNA-Seq data for FLT1 gene illustrating isoform differences across various tissues. All data in (a and b) are from http://v14.proteinatlas.org./ENSG00000102755-FLT1/tissue. (c) Patient segmentation into three groups by maternal systolic blood pressure and serum sFlt1:PlGF ratio: Normal (CTRL, green, n = 6), Early-Onset PE (EO-PE, red, n = 8), and Late-Onset PE (LO-PE, blue, n = 3).

In women with PE, circulating sFlt1 protein levels are 5 to 7 times higher than those observed in normal pregnancies20,21,22. One potential therapeutic approach would be to selectively knock down sFlt1 expression by RNA silencing (RNA interference, RNAi) using short interfering RNAs (siRNAs) targeting sequences unique to the sFlt1 mRNA isoforms and not contained in mFlt1 mRNA (manuscript in preparation). Success of this strategy requires knowing exactly which sFlt1 mRNA isoforms to target. Further, while circulating sFlt1 levels increase in both EO-PE and LO-PE, this increase starts earlier in EO-PE and often with a more aggressive trajectory23. This raises the question of whether EO-PE placentae express a different pattern of sFlt1 mRNA isoforms than LO-PE placentae. That is, is it possible to define a common set of sFlt1 mRNA isoforms suitable for therapeutic intervention in both EO-PE and LO-PE patients?

Here we applied the polyadenylation site sequencing (PAS-Seq) method to RNA isolated from normal (CTRL) and PE human placentas to identify the most abundant sFlt1 mRNA isoforms. Because PAS-Seq specifically interrogates polyadenylation sites, it provides a much more accurate assessment of transcript 3′ ends than RNA-Seq24,25,26, which is generally biased toward transcript 5′ ends and is better suited to investigating alternative splicing patterns27,28. Like RNA-Seq, PAS-Seq can be used for differential gene expression analysis by combining all isoforms detected at an individual gene locus into a single gene expression number24. Therefore, in addition to quantifying alternative polyadenylation both at the Flt1 locus and transcriptome-wide, our PAS-Seq data also allowed us to investigate overall gene expression changes between our patient groups.

Results

Placental PAS-Seq datasets

Several risk factors are associated with PE29,30,31. To minimize confounding by other associated conditions, we used highly selective criteria for patient inclusion. First, we limited our sample set to singleton pregnancies, and excluded any patients with preexisting hypertension or diabetes mellitus. Second, all of our control set (CTRL; n = 6) delivered at term with no other pregnancy complications (such as preterm labor or premature rupture of membranes). Finally, we limited our PE set (n = 11) to patients with systolic blood pressures ≥ 140 mm Hg and circulating sFlt1:PlGF ratios ≥ 85, an accepted diagnostic measure of PE associated with adverse PE-related maternal/fetal outcomes11,32,33 (Fig. 1c and Table 1). Gestational delivery week ranged from 37 to 40 weeks in the CTRL set and from 28 to 39 weeks in the PE set. Based on the gestational delivery week, we further divided the PE patients into Early-Onset (EO-PE; gestational delivery week < 34; n = 8) and Late-Onset (LO-PE; gestational delivery week > 34; n = 3) subgroups. In all cases, placental tissue was collected within 30 minutes of delivery and flash frozen in liquid nitrogen. Using these frozen samples, we isolated polyA-tailed RNA from the villous tissue, which is the spongy layer in between the decidua (maternal side) and the chorion (fetal side), and the main source of sFlt134.

Table 1 PAS-Seq patient characteristics.

To analyze alternative polyadenylation transcriptome-wide, we developed a simple, kit-free method by combining elements of a published PAS-Seq protocol25 with a circularization strategy used in our lab for making short RNA fragment libraries35 (Fig. 2a, see Methods). Single-end sequencing on the Illumina HiSeq 2000 platform yielded 30–70 million (on average) high quality reads per library, of which 70–85% mapped uniquely to the human genome (hg19) (Supplementary Table S1). Using a custom pipeline (Fig. 2b and Methods), peaks were identified as clusters of ≥ 15 reads (all libraries combined) whose 3′ ends mapped within a 40 nt window (to allow for microheterogeneity in polyadenylation site use)36. Peaks representing true polyadenylation sites (i.e., those at which a polyA tail was added post-transcriptionally) were distinguished from peaks due to oligoT internal priming of transcripts at genomically-encoded adenosine-rich regions using a naïve Bayes classifier-based probabilistic method25. True polyadenylation sites were then assigned to genes using GENCODE (v19) annotations and quantified as counts per million (CPM). Custom UCSC genome browser tracks enabled visualization of read clusters and true polyadenylation sites (Fig. 2c). In all, we detected 31,615 polyadenylation sites on or within 5 kb downstream of annotated transcripts representing 13,328 genes and 3,482 polyadenylation sites in intergenic regions. The majority of the annotated set was dominated by protein-coding genes (n = 27,395 PAS; 86.6%), but also contained hundreds of small nuclear RNAs, long non-coding RNAs, microRNAs and anti-sense transcripts (Fig. 2d).
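The peak definition above (≥ 15 reads whose 3′ ends fall within a 40 nt window) can be illustrated with a minimal Python sketch. This is not the custom pipeline used in the study: the greedy left-to-right grouping, the function names `call_peaks` and `cpm`, and the toy coordinates are all assumptions made for illustration, and the sketch handles a single chromosome strand without any internal-priming filter.

```python
from typing import List, Tuple


def call_peaks(ends: List[int], window: int = 40,
               min_reads: int = 15) -> List[Tuple[int, int, int]]:
    """Group read 3'-end coordinates (one chromosome, one strand) into peaks.

    Assumed strategy: walk the sorted positions and keep extending a cluster
    while the new position lies within `window` nt of the cluster's first
    position. Clusters with fewer than `min_reads` reads are discarded.
    Returns a list of (first_position, last_position, read_count).
    """
    peaks: List[Tuple[int, int, int]] = []
    cluster: List[int] = []
    for pos in sorted(ends):
        if cluster and pos - cluster[0] >= window:
            # Current cluster is closed; keep it only if it has enough reads.
            if len(cluster) >= min_reads:
                peaks.append((cluster[0], cluster[-1], len(cluster)))
            cluster = []
        cluster.append(pos)
    if len(cluster) >= min_reads:
        peaks.append((cluster[0], cluster[-1], len(cluster)))
    return peaks


def cpm(read_count: int, total_mapped_reads: int) -> float:
    """Counts per million mapped reads."""
    return read_count * 1e6 / total_mapped_reads


# Toy 3'-end coordinates (invented): two dense clusters and one sparse one.
toy_ends = [100] * 10 + [110] * 8 + [500] * 3 + [1000] * 20
toy_peaks = call_peaks(toy_ends)
```

With these invented coordinates, the three reads at position 500 fall below the 15-read threshold and are dropped, while the reads at 100 and 110 merge into a single peak.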

Figure 2

PAS-Seq library and bioinformatics workflow. (a) Schematic overview of PAS-Seq library preparation workflow. (b) PAS-Seq bioinformatics pipeline schematic. (c) Representative view of read alignment locations (top track) and called PAS peaks (bottom track) on the ADAM12 3′-UTR. Our algorithm is able to remove background noise, filter out internal priming sites and accurately identify true polyadenylation sites. (d) Distribution of PAS peaks among named gene classes and intergenic regions.

By summing all true polyadenylation site reads mapping across individual loci, PAS-Seq data can be used to assess overall gene expression. That is, the data can be collapsed into a single gene expression number – counts per million mapped reads (CPM) that encompasses all polyadenylated isoforms derived from each gene24. The top 500 genes in our samples were highly enriched for those expressed in placenta (p ≤ 8.4 e-43) and syncytiotrophoblasts (p ≤ 6.5 e-42) (see Methods). Consistent with a previous RNA-Seq report37, nearly all of the 20 most highly expressed genes in the CTRL, EO-PE and LO-PE samples (Fig. 3a,b,c) were placental or pregnancy-related (Supplementary Table S2). This highest abundance set was dominated by hormones (e.g., CSH1, CSH2 and CGA), positive and negative regulators of cell growth (e.g., GDF15 and TFPI2) and hemoglobin genes (HBA1, HBA2, HBB and HBG2); also included were two high abundance long non-coding RNAs, H19 and NEAT1.
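As a rough illustration of how per-site read counts collapse into one expression value per gene, the following Python sketch sums the reads of every polyadenylation site assigned to a gene and scales by library size. The function name `gene_level_cpm`, the site identifiers and the read counts are hypothetical, and the choice of library-size denominator is an assumption rather than a detail taken from the Methods.

```python
from collections import defaultdict
from typing import Dict


def gene_level_cpm(pas_counts: Dict[str, int],
                   pas_to_gene: Dict[str, str],
                   library_size: int) -> Dict[str, float]:
    """Collapse per-PAS read counts into one CPM value per gene.

    pas_counts   : reads assigned to each true polyadenylation site
    pas_to_gene  : gene assignment for each site (unassigned sites are skipped)
    library_size : total mapped reads used as the CPM denominator (assumed)
    """
    per_gene: Dict[str, int] = defaultdict(int)
    for pas_id, reads in pas_counts.items():
        gene = pas_to_gene.get(pas_id)
        if gene is not None:
            per_gene[gene] += reads
    return {gene: reads * 1e6 / library_size for gene, reads in per_gene.items()}


# Hypothetical counts for one library of one million mapped reads.
example_counts = {"FLT1_pas1": 200, "FLT1_pas2": 500, "FLT1_pas3": 50,
                  "CGA_pas1": 250, "intergenic_1": 7}
example_map = {"FLT1_pas1": "FLT1", "FLT1_pas2": "FLT1", "FLT1_pas3": "FLT1",
               "CGA_pas1": "CGA"}
example_cpm = gene_level_cpm(example_counts, example_map, library_size=1_000_000)
```

Here the three hypothetical FLT1 sites contribute a single gene-level value, and the intergenic site is ignored because it has no gene assignment.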

Figure 3

Differential gene expression analysis. Box-plots of top twenty most abundant transcripts in (a) CTRL, (b) EO-PE, and (c) LO-PE patients. Gene expression measured as Counts Per Million (CPM) mapped reads. Dark bar: median; box: 25–75% interquartile; whiskers and dots: range. (d) Principal component analysis (PCA) of highly variable genes. Axes are principal components 1 and 2 (PC1 and PC2). Each point represents a single patient, with colors indicating the patient type as in Fig. 1c. Volcano plots showing differential gene expression for (e) CTRL vs. EO-PE, (f) CTRL vs. LO-PE, and (g) EO-PE vs. LO-PE. X-axis: log2 Fold Change (FC) between samples; Y-axis: -log10 false discovery rate after false positive removal (padj). Each dot is a gene: purple, padj ≤ 0.01; black, padj ≤ 0.05; n, number of genes in each group. (h) Venn diagram illustrating overlap of differentially expressed genes (padj cutoffs as indicated) between our EO-PE and LO-PE PAS-Seq datasets, two published RNA-Seq datasets37,46 and a large meta-analysis of available microarray data47. (i) Scatter plot of FLT1 RNA expression (CPM) vs. Systolic Blood Pressure (mmHg). Each point represents a single patient, with colors indicating the patient type as in Fig. 1c.

EO-PE and LO-PE placentas exhibit distinct gene expression signatures

To determine whether the different patient sets had distinct gene expression patterns, we next performed Principal Component Analysis (PCA) on all detected genes with sufficient read coverage (total read count across all 17 libraries ≥ 10; n = 15,765). By assessing variability across all samples independent of any externally imposed subsets, PCA allows one to determine how closely related individual samples are to one another. For the top 2000 most variable genes across all libraries, principal components (PCs) 1, 2 and 3 accounted for 21.8, 15.4 and 10.4 percent of the variance, respectively. In a plot of PC1 vs. PC2, the CTRL samples formed a coherent cluster readily distinguishable from the PE samples (Fig. 3d). Further, the EO-PE and LO-PE samples formed distinct but overlapping clusters primarily defined by PC1, with the EO-PE cluster being more distant from CTRL than the LO-PE cluster. This suggested that EO-PE and LO-PE had overlapping but distinct gene expression signatures.
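The two steps described above, selecting the most variable genes and then asking how much variance the leading component explains, can be sketched in plain Python. This is an illustrative toy, not the analysis software used for Fig. 3d: the helper names, the power-iteration approach (which recovers only PC1) and the three-gene expression table are assumptions, and a real analysis would normally rely on an established numerical library and on suitably transformed counts.

```python
import math
from statistics import pvariance
from typing import Dict, List, Tuple


def top_variable_genes(expr: Dict[str, List[float]], n: int) -> List[str]:
    """Return the n genes with the highest variance across samples."""
    return sorted(expr, key=lambda g: pvariance(expr[g]), reverse=True)[:n]


def first_principal_component(expr: Dict[str, List[float]], genes: List[str],
                              iters: int = 200) -> Tuple[List[float], float]:
    """Sample scores on PC1 and the fraction of variance PC1 explains.

    Each gene is centered across samples, then power iteration is run on the
    sample-by-sample Gram matrix, whose non-zero eigenvalues equal those of
    the gene-by-gene scatter matrix.
    """
    rows = []
    for g in genes:
        mean = sum(expr[g]) / len(expr[g])
        rows.append([v - mean for v in expr[g]])
    k = len(rows[0])
    gram = [[sum(r[i] * r[j] for r in rows) for j in range(k)] for i in range(k)]
    total_variance = sum(gram[i][i] for i in range(k))
    v = [float(i + 1) for i in range(k)]  # arbitrary non-constant start vector
    for _ in range(iters):
        w = [sum(gram[i][j] * v[j] for j in range(k)) for i in range(k)]
        norm = math.sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]
    eigenvalue = sum(v[i] * sum(gram[i][j] * v[j] for j in range(k))
                     for i in range(k))
    scores = [math.sqrt(eigenvalue) * x for x in v]
    return scores, eigenvalue / total_variance


# Invented expression table: 3 "control" samples followed by 3 "PE" samples.
toy_expr = {"geneA": [1, 1, 1, 5, 5, 5],
            "geneB": [2, 2, 2, 8, 8, 8],
            "geneC": [3.0, 3.1, 2.9, 3.0, 3.1, 2.9]}
toy_genes = top_variable_genes(toy_expr, 2)
toy_scores, toy_fraction = first_principal_component(toy_expr, toy_genes)
```

In this toy table the two selected genes differ only between the two sample groups, so PC1 captures essentially all of the variance and places the groups on opposite sides of zero.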

To define the sets of differentially expressed genes between our three groups, we performed pairwise differential gene expression analyses between CTRL, EO-PE and LO-PE using DESeq2 (Fig. 3e,f,g). DESeq2 is a software package routinely used to analyze high-throughput PAS-Seq and RNA-Seq datasets38. In agreement with the Principal Component Analysis, the pair exhibiting the greatest differential expression was CTRL vs. EO-PE (n = 701, padj ≤ 0.05). The most highly upregulated gene in EO-PE compared to CTRL was FLT1 (4.3-fold up), followed by Myosin 7B (MYO7B, 3.4-fold up) and Endoglin (ENG, 3-fold up). Other upregulated genes previously linked to PE included Inhibin Alpha (INHA, 2.8-fold up), Inhibin Beta A (INHBA, 2.5-fold up), Vascular Endothelial Growth Factor A (VEGFA, 2.2-fold up), MicroRNA 210 Host Gene (MIR210HG, 2.4-fold up) and Pappalysin 2 (PAPPA2, 2.6-fold up). Genes exhibiting the greatest downregulation were Somatostatin Receptor Type 1 (SSTR1, 4.3-fold down), Signaling Lymphocytic Activation Molecule Family Member 1 (SLAMF1, 4.1-fold down), Hyaluronan And Proteoglycan Link Protein 1 (HAPLN1, 3.2-fold down) and Cell Adhesion Molecule 3 (CADM3; 2.8-fold down).
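The padj thresholds used throughout refer to p-values adjusted for multiple testing. DESeq2 reports Benjamini-Hochberg adjusted p-values by default; the short Python sketch below reproduces only that adjustment step, not the negative binomial modelling that DESeq2 performs to obtain the raw p-values. The function name `bh_adjust` and the four example p-values are invented for illustration.

```python
from typing import List


def bh_adjust(pvalues: List[float]) -> List[float]:
    """Benjamini-Hochberg adjusted p-values, returned in the input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value to the smallest so that the adjusted
    # values are monotone and never exceed 1.
    for rank in range(m, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, pvalues[idx] * m / rank)
        adjusted[idx] = running_min
    return adjusted


# Invented raw p-values for four genes.
raw_p = [0.01, 0.04, 0.03, 0.005]
padj = bh_adjust(raw_p)
n_significant = sum(p <= 0.05 for p in padj)
```

Counting genes at padj ≤ 0.05 or padj ≤ 0.01 is then a simple threshold on the adjusted values.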

Overall, 349 genes were differentially expressed at a padj ≤ 0.01 between CTRL and EO-PE (Fig. 3e). Using a variety of online tools such as GeneCoDis39,40,41 and Genomatix, we performed extensive gene ontology analyses of this set, using all 13,299 other expressed genes as background (Supplementary Table S3). Recurring themes (Supplementary Tables S4 and S5) were Biological/Cellular Adhesion and the HIF1 alpha, HIF2 alpha, Interleukin 6, Angiogenesis, Cadherin, and Wnt signaling pathways, all of which have been previously linked to PE42,43,44. Also enriched were Pol II transcription factors and genes with LEF1 and TATA transcription factor binding sites (padj ≤ 0.01).

Consistent with the principal component analysis, fewer genes were differentially expressed between CTRL and LO-PE, with only 40 reaching the padj ≤ 0.01 threshold (Fig. 3f). Those exhibiting the greatest fold change in LO-PE were Perilipin 2 (PLIN2; 2.1-fold up), Zinc Finger Protein 175 (ZNF175, 2.1-fold up), Acyl-CoA Synthetase Long-Chain Family Member 1 (ACSL1, 2.1-fold up), Dual Specificity Phosphatase 15 (DUSP15, 2.3-fold down) and Forkhead Box S1 (FOXS1, 2.3-fold down). The 97 differentially expressed genes (padj ≤ 0.05; 13,265 background gene set) were highly enriched for Pol II transcription factors and components of the HIF2 alpha signaling pathway (Supplementary Tables S3, S6 and S7).

Comparison of the differentially expressed genes in EO-PE and LO-PE revealed that they have overlapping but clearly distinct gene expression profiles (Supplementary Fig. S2). Gene ontology terms associated with the 50 differentially expressed genes common to both EO-PE and LO-PE (padj ≤ 0.05) include the HIF1 alpha, HIF2 alpha and VEGFR1 (FLT1) signaling pathways (Supplementary Table S8). Differential expression analysis between EO-PE and LO-PE sets revealed 19 genes reaching the padj ≤ 0.05 threshold (Fig. 3g), with the genes exhibiting the greatest differences being Family With Sequence Similarity 19 Member A2 (FAM19A2, 2.5-fold higher in EO-PE), Pleiomorphic Adenoma Gene-Like 2 (PLAGL2, 2.3-fold higher in EO-PE), TNF Receptor Superfamily Member 10a (TNFRSF10A, 1.9-fold higher in LO-PE) and HAPLN1 (1.9-fold higher in LO-PE). Thus, while EO-PE and LO-PE have similar clinical presentations, their distinct gene expression signatures suggest different mechanistic drivers9,10,45.

Comparison to previous differential gene expression studies

Numerous other studies have employed microarrays or RNA-Seq to investigate differential gene expression in PE. We therefore compared our differentially-expressed EO-PE and LO-PE gene sets (padj ≤ 0.05) to similar gene sets from two recent RNA-Seq studies37,46 and a large scale meta analysis of microarray datasets47 (Fig. 3h). As expected from the heterogeneous and multifactorial nature of PE, the gene sets reported in the three previous studies were largely non-overlapping (Fig. 3h). Our gene lists were most similar to the microarray list47, with 289 of 701 (41%) differentially-expressed EO-PE genes and 32 of 97 (33%) differentially-expressed LO-PE genes being among the 1295 genes (padj ≤ 0.01) in the microarray set. Notably, only one gene, NRIP1 (nuclear receptor-interacting protein 1; aka RIP140), intersected all five lists (Fig. 3h). Upregulated in all datasets, NRIP1 is a ubiquitously-expressed nuclear protein that acts as a co-activator and/or co-repressor for numerous nuclear receptor transcription factors48.
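The overlap figures quoted above are simple set operations, which the following Python sketch makes explicit. The helper names are hypothetical and the gene identifiers are synthetic placeholders chosen only so that the list sizes match the reported counts (701 EO-PE genes, 1295 microarray genes, 289 shared); they are not the actual gene lists.

```python
from typing import Iterable, Set, Tuple


def overlap(query: Iterable[str], reference: Iterable[str]) -> Tuple[int, float]:
    """Return (number of shared genes, percent of the query list that is shared)."""
    query_set, reference_set = set(query), set(reference)
    shared = query_set & reference_set
    return len(shared), 100.0 * len(shared) / len(query_set)


def common_to_all(*gene_lists: Iterable[str]) -> Set[str]:
    """Genes present in every list, i.e. the centre of a Venn diagram."""
    return set.intersection(*(set(genes) for genes in gene_lists))


# Synthetic identifiers sized to mirror the reported list lengths.
eo_pe_genes = {f"gene{i}" for i in range(701)}
microarray_genes = {f"gene{i}" for i in range(412, 412 + 1295)}
shared_count, shared_percent = overlap(eo_pe_genes, microarray_genes)
```

With these placeholder sets, 289 of 701 query genes are shared, which rounds to the 41% reported in the text.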

The only other differentially expressed gene reported in all previous studies and ours was FLT1. Although FLT1 did not appear in the list of differentially expressed genes in the Kaartokallio et al. study46, it was stated in the text as being differentially expressed. The magnitude of FLT1 overexpression in PE varies greatly among published reports – whereas the microarray and RNA-Seq values ranged from 1.5 to 2.0-fold37,47, qPCR analyses specifically interrogating the FLT1 locus range from 3.86-fold to 4.5-fold above normal16,37. In our PAS-Seq data, FLT1 was the most highly upregulated gene in the EO-PE patient set, exhibiting a 4.3-fold increase relative to the CTRL set (padj = 2.21E-13; Fig. 3e). FLT1 was also upregulated in the LO-PE set, but only by 1.9-fold (padj ≤ 0.01; Fig. 3f). Further, consistent with Flt1 overexpression being a significant driver of clinical symptoms, total Flt1 gene expression strongly correlated with systolic blood pressure over all samples (r = 0.84) (Fig. 3i). Flt1 also correlated with body mass index (BMI), but to a much lesser degree (r = 0.51, Supplementary Fig. S2). Thus our PAS-Seq data validate previous findings that Flt1 overexpression is central to PE.
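For readers who want to reproduce this kind of correlation from a table of per-patient values, a minimal Python sketch follows. It assumes that r denotes the Pearson correlation coefficient, which the text does not state explicitly, and the paired values shown are invented placeholders rather than patient data.

```python
import math
from typing import Sequence


def pearson_r(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


# Invented FLT1 expression (CPM) and systolic blood pressure (mmHg) pairs.
flt1_cpm = [100.0, 200.0, 300.0, 400.0]
systolic_bp = [110.0, 130.0, 150.0, 170.0]
r_example = pearson_r(flt1_cpm, systolic_bp)
```

Because the placeholder values lie exactly on a straight line, the coefficient here is 1; real data scatter around the trend and give a value below that.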

Non-coding RNAs and intergenic regions

In addition to protein-coding genes, our PAS-Seq data also provided expression values for numerous non-coding RNAs (Supplementary Table S3). For example, many miRNAs (miRs) derive from long non-coding polyA+ primary transcripts (pri-miRNAs)49. Pri-miRNAs exhibiting particularly high placental expression include the chromosome 14 and 19 miR clusters (C14MC and C19MC, respectively), the miR-371–3 cluster and the mir210 host gene (mir210HG)50,51,52. C19MC is transcribed by Pol III53, so was not represented in our PAS-Seq data. Of the remaining three, only the mir210HG exhibited differential expression among our patient sets, being 2.4 and 1.8-fold increased in EO-PE and LO-PE, respectively, relative to CTRL. Previously shown to be upregulated in PE54, miR210 is known to downregulate the THSD7A gene55. Consistent with this, THSD7A was 1.5-fold downregulated in EO-PE (padj ≤ 0.043), and 1.3-fold downregulated in LO-PE (although not statistically significant; padj ≤ 0.58).

Other differentially expressed noncoding transcripts with embedded miRNAs were DNM3OS (miR214; 1.7-fold down in both EO-PE and LO-PE) and RP6–99M1.2 (miR221/miR222; 1.6-fold down in EO-PE). Finally, some miRNAs derive from the introns or 3′-UTRs of protein coding genes, including SREBF1 (miR33b and miR6777; 1.6-fold up in EO-PE, 1.5-fold up in LO-PE), PDCD4 (miR4680; 1.5-fold up in EO-PE), and FAM172A (miR2277; 1.4-fold down in EO-PE). Among this set, only miR214 has been previously linked to PE, with microarray data indicating its downregulation56. miR214 is thought to regulate both PLGF57 and components of the β-catenin pathway58, with its own expression being modulated by HIF1 alpha57.

Our datasets also included polyadenylation sites for 170 long non-coding RNAs, 112 anti-sense transcripts and 133 “processed transcripts” (defined in GENCODE as a transcript with no open reading frame). Although nine of these were differentially expressed in EO-PE, LO-PE or both (Supplementary Table S3), careful examination of the associated polyadenylation site peaks on the UCSC genome browser revealed that nearly all could be explained by differential expression of an overlapping protein coding gene. Thus we conclude that differential long non-coding RNA expression is not a general hallmark of PE. The sole exception was TP73-AS1 (downstream of and antisense to TP73, expression of which was undetectable in our dataset), a lowly expressed transcript (base mean = 62.5 CPM) that was 1.4-fold downregulated (padj = 0.02) in EO-PE. TP73-AS1 is differentially expressed in multiple cancers59, and recent evidence suggests that it functions to positively regulate expression of BDH2, a cytosolic type 2 hydroxybutyrate dehydrogenase involved in mitochondrial function59. While BDH2 expression was 1.4-fold reduced in our EO-PE set, this change was not statistically significant (padj = 0.15).

Finally, we examined the 3,482 intergenic polyadenylation sites. Compared to the mean CPM distribution for genic PAS, the vast majority of intergenic PAS were lowly expressed (Supplementary Fig. S1), and DESeq2 analysis yielded only 13 and 8 differentially expressed intergenic polyadenylation sites in EO-PE and LO-PE, respectively (Supplementary Table S9). Many of these were associated with Long Interspersed Elements (LINEs) and Long Terminal Repeat elements (LTRs), suggesting that activation of select endogenous retroviral elements might occur in PE. For example, three intergenic PAS upregulated in EO-PE lie between the LTF and CCRL2 genes in a region containing numerous retroviral elements, and an RNA-Seq track of multimapping reads confirmed the existence of an independent multiple LTR-containing transcript (Supplementary Fig. S1).

All Flt1 isoforms are upregulated in PE

A major goal of this study was to determine which sFlt1 mRNA isoform(s) to target with therapeutic siRNAs60 (Turanov et al., in preparation). Our PAS-Seq data revealed a total of 16 PAS in the Flt1 gene (Fig. 4b and c; Supplementary Table S10), including all 6 previously reported Flt1 mRNA isoforms. Thirteen are low abundance, each representing < 0.6% of true PAS reads mapping to the Flt1 locus. Two previously reported isoforms, sFlt1-i14 (previously sFlt1-v4) and sFlt1-e15b (previously sFlt1-v3), were among this low abundance set, so contribute little to sFlt1 expression. The four high abundance species are sFlt1-i13S (previously sFlt1_v1 short 3′UTR), sFlt1-i13L (previously sFlt1_v1 long 3′UTR), sFlt1-e15a (previously sFlt1_v2) and m-Flt1 (membrane-bound or full-length Flt1 mRNA) (Fig. 4a). Among these, sFlt1-e15a is the most abundant, accounting for ~50% of Flt1 PAS reads in all samples (Fig. 4b). The sFlt1-i13 short and long isoforms each accounted for 20 to 25% of PAS reads, whereas full-length m-Flt1 mRNA represented only ~5% of PAS reads. Thus, at time of delivery, sFlt1-i13 short, sFlt1-i13 long and sFlt1-e15a comprise the vast majority (88 to 97%) of placental Flt1 mRNAs.

Figure 4

FLT1 alternative PAS isoform analysis. (a) Schematic of FLT1 gene and six previously identified mRNA isoforms arising from alternative polyadenylation. Thick boxes: protein coding exons; thin boxes: 3′-UTRs; lines: introns; arrows: sFlt1 and mFlt1 polyadenylation sites. Table shows all previous isoform names and new systematic names. (b) UCSC genome browser view showing PAS-Seq peak distribution on the FLT1 gene for a representative CTRL (top track), EO-PE (middle-track) or LO-PE (bottom track) patient. Arrows: sFlt1 and mFlt1 polyadenylation sites for isoforms in (a). (c) Bar graph showing mean expression (counts per million; CPM) across all libraries for all 16 detected mRNA isoforms at the FLT1 locus.

Comparison among datasets revealed that all four high-abundance Flt1 isoforms are upregulated in EO-PE vs. CTRL (p ≤ 0.05; two-way t-test) (Fig. 5a). A similar trend is also observed comparing LO-PE to CTRL, but only the sFlt1-i13 long isoform reached statistical significance (p ≤ 0.05; two-way t-test), likely due to the small sample size. To assess whether this general upregulation of the Flt1 locus was accompanied by any shift in PAS usage, we examined how the fraction of total reads mapping to each of the four major PAS differed among the EO-PE, LO-PE and CTRL datasets (Fig. 5b). The only statistically significant (two-way t-test) differences were a 1.5-fold increase in sFlt1-i13 short and a 2.0-fold reduction in mFlt1 in EO-PE compared to CTRL. Thus in EO-PE, use of the sFlt1-i13 short PAS increases at the expense of the mFlt1 PAS. Nonetheless, the predominant feature associated with both EO-PE and LO-PE is transcriptional upregulation of the entire Flt1 locus.
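The distinction drawn above, between upregulation of every isoform and a shift in which polyadenylation site is used, comes down to comparing absolute expression with each isoform's share of locus reads. The Python sketch below computes those shares; the function names are hypothetical and the CPM values are invented, chosen only so that the resulting ratios echo the reported 1.5-fold and 2.0-fold changes.

```python
from typing import Dict, List


def isoform_fractions(cpm_by_isoform: Dict[str, float]) -> Dict[str, float]:
    """Fraction of locus reads contributed by each isoform in one sample."""
    total = sum(cpm_by_isoform.values())
    return {iso: value / total for iso, value in cpm_by_isoform.items()}


def mean_fraction(samples: List[Dict[str, float]], isoform: str) -> float:
    """Mean fractional usage of one isoform across a group of samples."""
    fractions = [isoform_fractions(sample)[isoform] for sample in samples]
    return sum(fractions) / len(fractions)


# Invented per-sample CPM values for the four major isoforms.
ctrl_samples = [{"sFlt1-e15a": 50.0, "sFlt1-i13S": 20.0,
                 "sFlt1-i13L": 25.0, "mFlt1": 5.0}]
eo_pe_samples = [{"sFlt1-e15a": 200.0, "sFlt1-i13S": 120.0,
                  "sFlt1-i13L": 70.0, "mFlt1": 10.0}]

i13s_shift = (mean_fraction(eo_pe_samples, "sFlt1-i13S")
              / mean_fraction(ctrl_samples, "sFlt1-i13S"))
mflt1_shift = (mean_fraction(ctrl_samples, "mFlt1")
               / mean_fraction(eo_pe_samples, "mFlt1"))
```

In this made-up example every isoform has higher absolute CPM in the EO-PE sample, yet the share of sFlt1-i13S rises and that of mFlt1 falls, which is the pattern the fractional analysis is designed to reveal.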

Figure 5

Differential PAS isoform analysis on FLT1, SDC1 and ADAM12 genes. (a) Superimposed scatter plots showing expression (counts per million mapped reads; CPM) of the four most abundant FLT1 mRNA isoforms. (b) Superimposed scatter plot comparing fractions of total PAS reads mapping to the FLT1 locus represented by each indicated mRNA isoform. For a. and b., each point represents an individual CTRL, EO-PE or LO-PE patient with the color indicating patient type as described in Fig. 1c. A two-way t-test was used for statistical comparisons between indicated groups. Horizontal line: mean; n.s.: not statistically significant. (c) UCSC genome browser view of SDC1 gene illustrating differential PAS isoform abundance (Distal-to-Proximal Switch) for a representative CTRL (top track), EO-PE (middle-track) or LO-PE (bottom track) patient. (d) Same as c, but ADAM12 gene (Proximal-to-Distal Switch).

Other genes with altered isoform abundances

The altered Flt1 mRNA isoform abundance associated with EO-PE prompted us to ask whether this phenomenon was detectable at any other gene loci. Using a one-vs.-others model (see Methods), we observed PAS switches (padj ≤ 0.05) in only eight genes in EO-PE compared to CTRL (ADAM12, FLT1, NEAT1, TIMP2, CRH, PAPPA, PSAP and SDC1) and only four genes in LO-PE compared to CTRL (EIF2A, PSG1, SDC1 and PHLDB2) (Supplementary Table S11), with SDC1 being common to both. The switches were almost equally divided between proximal-to-distal (n = 6) and distal-to-proximal (n = 5) isoform shifts. Most of the altered isoform abundances were limited to shifts within untranslated regions and so were not expected to alter protein isoform ratios. The exceptions were FLT1 (discussed above), SDC1 and ADAM12. In all three cases, the affected mRNA isoforms change the ratio of secreted and membrane-bound isoforms. The SDC1 gene encodes short (secreted or “shed”) and long (membrane-bound) forms of the major transmembrane heparan sulfate proteoglycan syndecan 1 from mRNA isoforms having different PAS (Fig. 5c). In both EO-PE and LO-PE, the isoform encoding membrane-bound SDC1 was preferentially upregulated. Likewise, ADAM12 expresses two alternate PAS mRNA isoforms61,62, ADAM12-S and ADAM12-L, encoding short (S, secreted) and long (L, membrane-bound) forms of metalloproteinase-disintegrin 12 (Fig. 5d). Whereas the entire gene is 1.6-fold upregulated in EO-PE, the ADAM12-L isoform increases more than the ADAM12-S isoform (1.8-fold vs. 1.3-fold, respectively).

Discussion

Here we present comprehensive differential gene expression and PAS isoform abundance analysis of placental RNA from subjects having an abnormal plasma angiogenic profile accompanying either EO-PE or LO-PE. We find that EO-PE and LO-PE are distinct conditions with different but related gene expression signatures. Common to both is overexpression of the anti-angiogenic factor sFlt1, expression of which strongly correlates with systolic blood pressure (Fig. 3i). Our PAS-Seq analysis confirms previous reports identifying sFlt1-e15a as the predominant sFlt1 mRNA isoform in both normal and PE placentae20,22,63,64,65, with the next most abundant isoforms being sFlt1-i13 short and long66. Together sFlt1-i13-short, sFlt1-i13-long and sFlt1-e15a account for > 94% of all sFlt1 expression in both normal and PE placentae. This information enabled us to identify siRNAs that decrease sFlt1 expression as a potential therapeutic for PE66. Our observation that all Flt1 isoforms increase in PE indicates that Flt1 upregulation is primarily a transcriptional response. Intriguingly, the only other gene differentially expressed in PE across all transcriptome-wide analyses to date is the transcriptional coregulator NRIP1 (RIP140)48,67.

Because sFlt1 is a key negative regulator of angiogenesis in both normal68 and cancerous tissues69, how its production is modulated has been the topic of multiple studies outside the placenta. One condition known to alter sFlt1 expression is hypoxia70,71. In human microvascular endothelial cells (HMVECs), the predominant sFlt1 mRNA isoform is sFlt1-i13-short. Under hypoxic conditions, sFlt1-i13-short levels decrease but mFlt1 levels are unaffected, suggestive of a post-transcriptional mechanism72. More recent work73 identified heterogeneous nuclear ribonucleoprotein D (hnRNP D) as a negative regulator of sFlt1-i13-short production, possibly acting through binding to an AU-rich element near the sFlt1-i13-short PAS and thereby blocking access of the polyadenylation machinery to this location. That study further showed that this inhibitory activity was a function of both overall hnRNP D levels and the methylation status of a specific hnRNP D arginine residue. In EO-PE placental samples, however, we observed the opposite trend – a small increase in sFlt1-i13 short at the expense of mFlt1 (Fig. 5b). We also observed no differential gene expression for arginine methyltransferase (PRMT1), arginine demethylase (JMJD6), hnRNP D or any other hnRNP protein (data not shown). Thus sFlt1 upregulation in PE appears to be mechanistically unrelated to the hypoxia-dependent modulation of sFlt1-i13-short production in HMVECs.

Our data clearly show that all FLT1 mRNA isoforms increase in abundance in PE (Fig. 5a), with no general shift toward the more promoter proximal PAS (Fig. 5b). When we considered the entire transcriptome, we detected no general 3′-UTR shortening (data not shown) and only eleven statistically significant PAS isoform abundance changes overall (Supplementary Table S11 and Fig. 5c,d). Among these, there was no consistent proximal-to-distal or distal-to-proximal switch pattern. Thus there is no general polyadenylation pattern shift associated with PE, and alternative polyadenylation is not a major contributor to sFlt1 upregulation. Rather our data indicate that the increased sFlt1 expression in PE is primarily due to increased transcription of the entire FLT1 locus.

What is driving this increased FLT1 transcription? Because of its central role in regulating angiogenesis, FLT1 transcriptional regulation has been intensely investigated in other tissues and several upstream transcription factors including EPAS1, PAX3, p53 and ETS have been characterized74,75,76,77,78,79. Notably, the differentially expressed gene sets (padj ≤ 0.05) in both EO-PE and LO-PE included numerous Pol II transcription factors. Common to both sets were CEBPA, SP3, DLX5, BHLHE40 and SREBF1, all of which had increased expression, and ZEB2, which had decreased expression. The CEBPA (CCAAT/enhancer binding protein alpha) pathway is particularly interesting, as CEBPA is expressed in the labyrinthine trophoblasts (the same cells that express Flt1) and has been shown to regulate placental development80.

Intriguingly, the only other gene identified by every transcriptome-wide gene expression analysis method to date (microarray, RNA-Seq and PAS-Seq) as being differentially expressed in PE is NRIP1 (nuclear receptor-interacting protein 1; aka RIP140) (Fig. 2h). In all cases, NRIP1 was upregulated. A ubiquitously expressed nuclear protein, NRIP1 modulates the activities of numerous nuclear receptor transcription factors48. Its best understood role is as a regulator of energy expenditure in adipose and muscle tissues, but it has also been linked to ovarian fertility and maintenance of a pregnancy state81. Whether NRIP1 contributes to the metabolic syndrome often associated with PE or directly regulates sFlt1 expression remains to be explored. We note that NRIP1 has been implicated in the pathogenesis of obesity48,67, a major risk factor for PE. It is possible that upregulated NRIP1 may synergize with anti-angiogenic factors to induce the endothelial dysfunction ultimately leading to PE.

Previous microarray studies comparing samples from patients with HELLP syndrome (a particularly severe form of PE associated with hemolysis, elevated liver enzymes and low platelet count), EO-PE and/or LO-PE with samples from CTRL patients found that these are discrete conditions with distinct gene expression signatures and different mechanistic drivers9,10,45. Our PAS-Seq data are consistent with these previous findings. We found many more differentially expressed genes in the EO-PE cohort than in the LO-PE cohort, although the small size of our LO-PE dataset (n = 3) may have precluded all but the most consistently differentially expressed genes from reaching statistical significance. An additional caveat is that some gene expression differences could be due to gestational age differences, particularly between the EO-PE (gestational delivery week < 34) and CTRL (gestational delivery week >37–40) groups. Nonetheless, the HIF1 alpha, HIF2 alpha and VEGFR1 (FLT1) signaling pathways were differentially expressed in both groups, consistent with hypoxia and altered angiogenesis being generally associated with PE82,83,84,85. Unique to the EO-PE set, however, were genes involved in Biological/Cellular Adhesion and the Interleukin 6, Cadherin, and Wnt signaling pathways. This suggests that EO-PE placentae may have defects in epithelial-to-mesenchymal transition, which may lead to the defective trophoblast invasion noted in this syndrome86.

Due to its highly heterogeneous presentation, definitive diagnosis of PE based solely on maternal signs and symptoms can be erroneous87,88. A variety of underlying conditions can serve as mechanistic drivers, confounding gene expression analysis of diverse patient populations. Differences in sample collection methodologies across institutions can also introduce variability. These factors likely explain the relatively poor overlap among prior transcriptome-wide analyses of placental gene expression in PE (Supplementary Fig. S2). For the current study, we aimed to eliminate as many variables as possible. In addition to using highly selective criteria for PE diagnosis based solely on quantitative measures (Fig. 1c) and excluding known confounding drivers (e.g., multi-parity, preexisting hypertension and gestational diabetes mellitus), all samples were collected by the same personnel in a single hospital, all libraries were prepared by a single researcher, and all sequencing was performed on a single instrument. Our libraries should thus represent highly coherent sets. Nonetheless, our study does have its own limitations. Because all but one PE subject delivered at ≤ 36 weeks gestational age, whereas all CTRL patients delivered closer to term (≥ 37 weeks), PE and CTRL placentas were not gestationally matched. Therefore, additional studies using placentas delivered for other medical reasons (e.g., preterm labor) will be needed to evaluate which gene expression changes, if any, are simply attributable to earlier gestational age in the PE subjects. As stated above, our study was also limited by small sample size particularly for the LO-PE group (n = 3). Future studies with adequate sample numbers are needed to fully characterize the transcriptional signature specific to LO-PE.

In addition to assessing protein-coding genes, we interrogated our PAS-Seq data with regard to non-coding and intergenic transcripts. While we found little evidence of differential long non-coding RNA expression in PE, our data do suggest a potential association between EO-PE and endogenous retroviral element activation. Thus it may be of interest to interrogate repeat-associated transcripts in existing PE placental RNA-Seq datasets37,46. Our PAS-Seq data also confirm upregulation of miR210 and downregulation of miR214 in PE54. Therefore, some gene expression differences associated with PE are likely driven by post-transcriptional mechanisms.

Methods

Ethics Statement

The study was approved by the Beth Israel Deaconess Medical Center Institutional Review Board (IRB), #2008P-000061. All subjects provided written informed consent for use of placental and blood samples for research.

Human subjects

Placentas were collected from normal (CTRL) and PE subjects (Table 1) in accordance with all institutional policies and with approval of the institutional review board at the Beth Israel Deaconess Medical Center (Boston, MA, USA). The diagnosis of PE was based on the updated criteria of the American College of Obstetricians and Gynecologists Task Force on Hypertension in Pregnancy89. Patients with a history of diabetes, chronic hypertension, renal disease or multiple gestations were excluded. Pre-delivery maternal plasma samples were used to measure circulating sFlt1 and PlGF using commercial ELISA kits (R&D Systems, MN) as described elsewhere90.

Tissue collection, storage and RNA isolation

We excised placental biopsies (2 × 2 cm) without basal and chorionic plates and wiped them with cotton gauze to remove blood and debris. The villous tissue, the spongy layer between the decidua (maternal side) and the chorion (fetal side), was collected because it is the main source of sFlt134. These tissue samples were flash-frozen in liquid nitrogen within 30 min of placental delivery.

PAS-Seq library preparation and sequencing

For preparation of PAS-Seq libraries, we adapted a previously published method for making libraries from RNA fragments91 (Fig. 2a). Total RNA integrity was first confirmed by agarose electrophoresis (data not shown), and polyA+ RNA was then enriched by oligoT hybridization. PolyA+ RNA samples were fragmented to 60–80 nt via chemical hydrolysis and reverse transcribed with one of twelve anchored oligoT oligonucleotides containing forward and reverse Illumina sequencing primer sites separated by a hexa-ethyleneglycol spacer (Sp18) linker. At the 5′ end, each oligonucleotide began with 5′p-GG to promote ligation91, followed by 5 random nucleotides (unique molecular index, UMI) to enable PCR duplicate removal. Each primer also harbored a unique 5 nt Hamming barcode (BC), allowing for sample multiplexing. Following cDNA circularization with CircLigase I, libraries were PCR amplified (12–14 cycles) and subjected to single-end 100 nt sequencing on the Illumina HiSeq platform in the UMass Medical School Deep Sequencing Core.

PAS-Seq Bioinformatics analysis

After confirming overall library quality with FASTQC (http://www.bioinformatics.babraham.ac.uk/projects/fastqc/, S. Andrews, 2011), we removed duplicate reads with the same UMI. We then used Cutadapt92 to trim residual adaptor/spacer sequences and poly-A stretches from read 3′ ends and mapped the reads to the hg19 reference genome using Bowtie2 (with parameters –m –best –p4). Only uniquely mapping reads with high quality scores (>20) were used for further analysis. Such reads represented 70–85% of total reads for all libraries.
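
As an illustration of the UMI-based duplicate removal step, the following minimal Python sketch collapses reads that share both the same UMI and the same insert sequence. The `Read` record and the choice to key on UMI plus sequence are assumptions made for this example; the text specifies only that duplicate reads with the same UMI were removed, and this is not the pipeline code used in the study.

```python
from typing import Iterable, Iterator, NamedTuple


class Read(NamedTuple):
    """Hypothetical read record used only for this sketch."""
    name: str
    umi: str        # 5-nt random index (UMI) taken from the read
    sequence: str   # insert sequence with UMI and barcode already removed


def remove_umi_duplicates(reads: Iterable[Read]) -> Iterator[Read]:
    """Yield the first read seen for each (UMI, sequence) pair.

    Assumption: two reads are PCR duplicates when both the UMI and the
    insert sequence are identical.
    """
    seen = set()
    for read in reads:
        key = (read.umi, read.sequence)
        if key not in seen:
            seen.add(key)
            yield read
```

Keying on the UMI alone would discard most reads, because five random nucleotides give only 4^5 = 1,024 distinct UMIs; pairing the UMI with the sequence is therefore the more plausible reading, but it remains an assumption.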

We defined the polyadenylation cleavage site as the base closest to the poly-A stretch captured within the read sequence, i.e., the very last base at the 3′ end of the read after trimming. Based on this, we calculated read coverage at every genomic location considering only the 3′ ends of the aligned reads. This produces a sharp peak of read coverage around each cleavage site with an average width of 40 nt (range 1–120 nt). We then clustered sites lying within a 40 nt window of one another and summed the counts of the clustered sites. To determine all possible polyadenylation sites, which might differ from sample to sample, we pooled all candidate locations from all samples and repeated the clustering using the same window length. Because sequences containing genomically encoded poly-A stretches can hybridize to and be pulled down with the oligo-T primers (internal priming), we used software based on a Naïve Bayes classifier93 to determine the likelihood of each site being a false priming site. In addition to filtering internal priming locations on this basis, we also removed background noise caused by widespread base-level read alignments by fitting the count distribution of each gene to a Poisson distribution. After determining precise PA locations, we calculated expression/read counts of every PAS for each sample and annotated them using Gencode v19. We also marked known polyA sites with PolyA DB (http://exon.umdnj.edu/polya_db/). For differential gene expression tests, we used the sum of all PAS counts of each gene and calculated library sizes based on these sums. The raw read counts for genes were used as input for differential gene expression analyses using DESeq238 in R (https://www.R-project.org). The default normalization using the ‘estimateSizeFactors’ function was used. Adjusted p-values were calculated using the Benjamini and Hochberg method94.
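
The clustering of 3′-end positions described above can be sketched as follows. This is a simplified illustration for a single chromosome and strand, not the custom pipeline used in the study: the function names, the rule that two neighbouring positions are merged when the gap between them is smaller than the window, and the choice of the most covered position as the cluster representative are all assumptions.

```python
from collections import Counter
from typing import Dict, List, Tuple

WINDOW = 40  # nt, the clustering window stated in the text

# (first position, last position, most covered position, summed count)
Cluster = Tuple[int, int, int, int]


def _summarize(positions: List[int], counts: Dict[int, int]) -> Cluster:
    total = sum(counts[p] for p in positions)
    # positions are sorted, so ties resolve to the smallest coordinate
    peak = max(positions, key=lambda p: counts[p])
    return (positions[0], positions[-1], peak, total)


def cluster_sites(counts: Dict[int, int], window: int = WINDOW) -> List[Cluster]:
    """Merge 3'-end positions whose gap to the previous position is < window."""
    clusters: List[Cluster] = []
    current: List[int] = []
    for pos in sorted(counts):
        if current and pos - current[-1] >= window:
            clusters.append(_summarize(current, counts))
            current = []
        current.append(pos)
    if current:
        clusters.append(_summarize(current, counts))
    return clusters


def pool_samples(samples: List[Dict[int, int]]) -> Counter:
    """Sum per-position 3'-end counts over all samples before re-clustering."""
    pooled: Counter = Counter()
    for sample in samples:
        pooled.update(sample)
    return pooled
```

Calling `cluster_sites(pool_samples(samples))` would correspond to the pooled second pass, after which per-sample counts can be re-tallied within each pooled cluster.
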
To determine the significance of alternative polyA site switches, we constructed a 2 × 2 contingency table composed of the mean normalized read counts of one site and the mean of the rest of the sites (if there were multiple) in each condition. We tested the significance for the given site with a Chi-square test and iterated through all sites of the gene. We then corrected p-values for multiple hypothesis testing using the Benjamini and Hochberg method94.
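
A minimal sketch of this per-site test, written in plain Python for illustration, is given below. Several details are assumptions not stated in the text: the "rest" cell is taken as the summed mean normalized counts of the gene's other sites, the Pearson Chi-square statistic is computed without a continuity correction, and all function names are hypothetical. For one degree of freedom, the Chi-square survival function equals erfc(sqrt(x/2)), which avoids any external dependency.

```python
import math
from typing import List, Sequence


def chi_square_2x2(a: float, b: float, c: float, d: float) -> float:
    """P-value of a Pearson Chi-square test (1 df) on the table [[a, b], [c, d]]."""
    n = a + b + c + d
    row1, row2, col1, col2 = a + b, c + d, a + c, b + d
    if min(row1, row2, col1, col2) == 0:
        return 1.0  # degenerate table: no evidence of a switch
    stat = n * (a * d - b * c) ** 2 / (row1 * row2 * col1 * col2)
    return math.erfc(math.sqrt(stat / 2.0))


def site_switch_pvalues(ctrl: Sequence[float], pe: Sequence[float]) -> List[float]:
    """Test each PAS of one gene against the pooled remainder of its sites.

    ctrl and pe hold mean normalized counts per site, in the same site order.
    """
    total_ctrl, total_pe = sum(ctrl), sum(pe)
    return [
        chi_square_2x2(c, total_ctrl - c, p, total_pe - p)
        for c, p in zip(ctrl, pe)
    ]


def benjamini_hochberg(pvalues: Sequence[float]) -> List[float]:
    """Benjamini-Hochberg adjusted p-values, returned in the input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    for rank in range(m, 0, -1):  # walk from the largest p-value down
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted
```

In practice the p-values from all sites of all genes would be collected first and passed to `benjamini_hochberg` together, so that the correction reflects the full number of tests.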

References

  1.

    Ananth, C. V., Keyes, K. M. & Wapner, R. J. Pre-eclampsia rates in the United States, 1980-2010: age-period-cohort analysis. BMJ 347, f6564–f6564 (2013).

  2.

    Shih, T. et al. The Rising Burden of Preeclampsia in the United States Impacts Both Maternal and Child Health. Amer J Perinatol 33, 329–338 (2016).

  3.

    Kang, J. H. et al. Preeclampsia leads to dysregulation of various signaling pathways in placenta. Journal of Hypertension 29, 928–936 (2011).

  4.

    Chaiworapongsa, T. et al. Evidence supporting a role for blockade of the vascular endothelial growth factor system in the pathophysiology of preeclampsia. Young Investigator Award. YMOB 190, 1541–7– discussion 1547–50 (2004).

  5.

    Dadelszen von, P., Magee, L. A. & Roberts, J. M. Subclassification of preeclampsia. Hypertens Pregnancy 22, 143–148 (2003).

  6.

    Romero, R. et al. A longitudinal study of angiogenic (placental growth factor) and anti-angiogenic (soluble endoglin and soluble vascular endothelial growth factor receptor-1) factors in normal pregnancy and patients destined to develop preeclampsia and deliver a small for gestational age neonate. J Matern Fetal & Neonatal Med 21, 9–23 (2009).

  7.

    Kusanovic, J. P. et al. A prospective cohort study of the value of maternal plasma concentrations of angiogenic and anti-angiogenic factors in early pregnancy and midtrimester in the identification of patients destined to develop preeclampsia. The Journal of Maternal-Fetal & Neonatal Medicine, 22(11), 1021–1038 (2009). 

  8.

    Soto, E. et al. Late-onset preeclampsia is associated with an imbalance of angiogenic and anti-angiogenic factors in patients with and without placental lesions consistent with maternal underperfusion. The Journal of Maternal-Fetal & Neonatal Medicine, 25(5), 498–507 (2011). 

  9.

    Várkonyi, T. et al. Microarray profiling reveals that placental transcriptomes of early-onset HELLP syndrome and preeclampsia are similar. Placenta 32(Suppl), S21–9 (2011).

  10.

    Junus, K. et al. Gene expression profiling of placentae from women with early- and late-onset pre-eclampsia: down-regulation of the angiogenesis-related genes ACVRL1 and EGFL7 in early-onset disease. Molecular Human Reproduction 18, 146–155 (2012).

  11.

    Rana, S., Powe, C. E., Salahuddin, S. & Verlohren, S. Angiogenic Factors and the Risk of Adverse Outcomes in Women With Suspected Preeclampsia. Circulation (2012).

  12.

    Kamba, T. et al. VEGF-dependent plasticity of fenestrated capillaries in the normal adult microvasculature. Am. J. Physiol. Heart Circ. Physiol. 290, H560–76 (2006).

  13.

    Eremina, V. et al. Glomerular-specific alterations of VEGF-A expression lead to distinct congenital and acquired renal diseases. J. Clin. Invest. 111, 707–716 (2003).

  14.

    Eremina, V. et al. VEGF inhibition and renal thrombotic microangiopathy. The New England Journal of Medicine, 358(11), 1129–1136 (2008). 

  15.

    Young, B. C., Levine, R. J. & Karumanchi, S. A. Pathogenesis of preeclampsia. Annual Review of Pathology: Mechanisms of Disease 5, 173–192 (2010).

  16.

    Jebbink, J. et al. Expression of placental FLT1 transcript variants relates to both gestational hypertensive disease and fetal growth. Hypertension 58, 70–76 (2011).

  17.

    Thomas, C. P., Andrews, J. I. & Liu, K. Z. Intronic polyadenylation signal sequences and alternate splicing generate human soluble Flt1 variants and regulate the abundance of soluble Flt1 in the placenta. FASEB 21, 3885–3895 (2007).

  18.

    Thomas, C. P. C., Raikwar, N. S. N., Kelley, E. A. E. & Liu, K. Z. K. Alternate processing of Flt1 transcripts is directed by conserved cis-elements within an intronic region of FLT1 that reciprocally regulates splicing and polyadenylation. Nucleic Acids Research 38, 5130–5140 (2010).

  19.

    Vorlová, S. et al. Induction of antagonistic soluble decoy receptor tyrosine kinases by intronic polyA activation. Molecular Cell 43, 927–939 (2011).

  20.

    Heydarian, M. et al. Novel splice variants of sFlt1 are upregulated in preeclampsia. Placenta 30, 250–255 (2009).

  21.

    Maynard, S. E. et al. Excess placental soluble fms-like tyrosine kinase 1 (sFlt1) may contribute to endothelial dysfunction, hypertension, and proteinuria in preeclampsia. J. Clin. Invest. 111, 649–658 (2003).

  22.

    Palmer, K. R. et al. Placental-Specific sFLT-1 e15a Protein Is Increased in Preeclampsia, Antagonizes Vascular Endothelial Growth Factor Signaling, and Has Antiangiogenic Activity. Hypertension 66, 1251–1259 (2015).

  23.

    Noori, M., Donald, A. E., Angelakopoulou, A., Hingorani, A. D. & Williams, D. J. Prospective Study of Placental Angiogenic Factors and Maternal Vascular Function Before and After Preeclampsia and Gestational Hypertension. Circulation 122, 478–487 (2010).

  24.

    Derti, A. et al. A quantitative atlas of polyadenylation in five mammals. Genome Research 22, 1173–1183 (2012).

  25.

    Shepard, P. J. et al. Complex and dynamic landscape of RNA polyadenylation revealed by PAS-Seq. RNA 17, 761–772 (2011).

  26.

    Wilkening, S. et al. An efficient method for genome-wide polyadenylation site mapping and RNA quantification. Nucleic Acids Research (2013).

  27.

    Zheng, W., Chung, L. M. & Zhao, H. Bias detection and correction in RNA-Sequencing data. BMC Bioinformatics 12, 290 (2011).

  28.

    Mortazavi, A., Williams, B. A., Mccue, K., Schaeffer, L. & Wold, B. Mapping and quantifying mammalian transcriptomes by RNA-Seq. Nature Methods 5, 1–8 (2008).

  29.

    Sibai, B. M. et al. Risk factors for preeclampsia in healthy nulliparous women: a prospective multicenter study. The National Institute of Child Health and Human Development Network of Maternal-Fetal Medicine Units. YMOB 172, 642–648 (1995).

  30.

    Duckitt, K. Risk factors for pre-eclampsia at antenatal booking: systematic review of controlled studies. BMJ 330, 565–0 (2005).

  31.

    Kenny, L., English, F. & McCarthy, F. Risk factors and effective management of preeclampsia. IBPC 8, 7–6 (2015).

  32.

    Verlohren, S. et al. The sFlt-1/PlGF ratio in different types of hypertensive pregnancy disorders and its prognostic potential in preeclamptic patients. American Journal of Obstetrics and Gynecology, 206(1) 58.e1–8 (2012).

  33.

    Verlohren, S. et al. An automated method for the determination of the sFlt-1/PIGF ratio in the assessment of preeclampsia. American Journal of Obstetrics and Gynecology, 202(2), 161.e1–161.e11 (2010).

  34.

    Rajakumar, A. et al. Transcriptionally active syncytial aggregates in the maternal circulation may contribute to circulating soluble fms-like tyrosine kinase 1 in preeclampsia. Hypertension 59, 256–264 (2012).

  35.

    Heyer, E. E., Ozadam, H., Ricci, E. P., Cenik, C. & Moore, M. J. An optimized kit-free method for making strand-specific deep sequencing libraries from RNA fragments. Nucleic Acids Research 43, e2 (2015).

  36.

    Tian, B., Hu, J., Zhang, H. & Lutz, C. S. A large-scale analysis of mRNA polyadenylation of human and mouse genes. Nucleic Acids Research 33, 201–212 (2005).

  37.

    Sõber, S. et al. Extensive shift in placental transcriptome profile in preeclampsia and placental origin of adverse pregnancy outcomes. Sci Rep 5, 13336 (2015).

  38.

    Love, M. I., Huber, W. & Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biology 15, 31 (2014).

  39.

    Nogales-Cadenas, R. et al. GeneCodis: interpreting gene lists through enrichment analysis and integration of diverse biological information. Nucleic Acids Research 37, W317–W322 (2009).

  40.

    Tabas-Madrid, D., Nogales-Cadenas, R. & Pascual-Montano, A. GeneCodis3: a non-redundant and modular enrichment analysis tool for functional genomics. Nucleic Acids Research 40, W478–W483 (2012).

  41.

    Carmona-Saez, P., Chagoyen, M., Tirado, F., Carazo, J. M. & Pascual-Montano, A. GENECODIS: a web-based tool for finding significant concurrent annotations in gene lists. Genome Biology 8, R3 (2007).

  42.

    Knöfler, M. & Pollheimer, J. Human placental trophoblast invasion and differentiation: a particular focus on Wnt signaling. Front Genet 4, 190 (2013).

  43.

    Sitras, V. et al. Differential placental gene expression in severe preeclampsia. Placenta 30, 424–433 (2009).

  44.

    Maynard, S. E. & Karumanchi, S. A. Angiogenic factors and preeclampsia. YSNEP 31, 33–46 (2011).

  45.

    Chaiworapongsa, T. et al. Differences and similarities in the transcriptional profile of peripheral whole blood in early and late-onset preeclampsia: insights into the molecular basis of the phenotype of preeclampsia. J Perinat Med 41, 485–504 (2013).

  46.

    Kaartokallio, T., Cervera, A., Kyllönen, A., Laivuori, K. & FINNPEC Core Investigator Group. Gene expression profiling of pre-eclamptic placentae by RNA sequencing. Sci Rep 5, 14107 (2015).

  47.

    Leavey, K., Bainbridge, S. A. & Cox, B. J. Large scale aggregate microarray analysis reveals three distinct molecular subclasses of human preeclampsia. PLoS ONE 10, e0116508 (2015).

  48.

    Nautiyal, J., Christian, M. & Parker, M. G. Distinct functions for RIP140 in development, inflammation, and metabolism. Trends Endocrinol. Metab. 24, 451–459 (2013).

  49.

    Lee, Y. et al. MicroRNA genes are transcribed by RNA polymerase II. EMBO J 23, 4051–4060 (2004).

  50.

    Mouillet, J.-F., Ouyang, Y., Coyne, C. B. & Sadovsky, Y. MicroRNAs in placental health and disease. Amer J Obst and Gyn 213, S163–72 (2015).

  51.

    Lee, D. C. et al. miR-210 Targets Iron-Sulfur Cluster Scaffold Homologue in Human Trophoblast Cell Lines. Am. J. Pathol. 179, 590–602 (2011).

  52.

    Pineles, B. L. et al. Distinct subsets of microRNAs are expressed differentially in the human placentas of patients with preeclampsia. Amer J Obst and Gyn 196, 261.e1–261.e6 (2007).

  53.

    Borchert, G. M., Lanier, W. & Davidson, B. L. RNA polymerase III transcribes human microRNAs. Nat Struct Mol Biol 13, 1097–1101 (2006).

  54.

    Enquobahrie, D. A. et al. Differential placental gene expression in preeclampsia. Amer J Obst and Gyn 199, 566.e1–566.e11 (2008).

  55.

    Luo, R. et al. Hypoxia-inducible miR-210 contributes to preeclampsia via targeting thrombospondin type I domain containing 7A. Sci Rep 1–11 (2016).

  56.

    Xu, P. et al. Variations of microRNAs in human placentas and plasma from preeclamptic pregnancy. Hypertension 63, 1276–1284 (2014).

  57.

    Gonsalves, C. S. et al. Erythropoietin-mediated expression of placenta growth factor is regulated via activation of hypoxia-inducible factor-1α and post-transcriptionally by miR-214 in sickle cell disease. Biochem. J. 468, 409–423 (2015).

  58.

    Xia, H., Ooi, L. L. P. J. & Hui, K. M. MiR-214 Targets β-Catenin Pathway to Suppress Invasion, Stem-Like Traits and Recurrence of Human Hepatocellular Carcinoma. PLoS ONE 7, e44206–13 (2012).

  59.

    Zang, W. et al. Knockdown of long non-coding RNA TP73-AS1 inhibits cell proliferation and induces apoptosis in esophageal squamous cell carcinoma. Oncotarget 7, 19960–19974 (2016).

  60.

    Turanov, A. A. et al. Development of Therapeutic anti-sFLT1 siRNA for the Treatment of Preeclampsia. Manuscript in preparation.

  61.

    Biadasiewicz, K. et al. Extravillous trophoblast-associated ADAM12 exerts pro-invasive properties, including induction of integrin beta 1-mediated cellular spreading. Biology of Reproduction 90, 101 (2014).

  62.

    Christians, J. K. & Beristain, A. G. ADAM12 and PAPP-A: Candidate regulators of trophoblast invasion and first trimester markers of healthy trophoblasts. Cell Adh Migr 10, 147–153 (2016).

  63.

    Thomas, C. P. et al. A recently evolved novel trophoblast-enriched secreted form of fms-like tyrosine kinase-1 variant is up-regulated in hypoxia and preeclampsia. J. Clin. Endocrinol. Metab. 94, 2524–2530 (2009).

  64.

    Sela, S. et al. A novel human-specific soluble vascular endothelial growth factor receptor 1: cell-type-specific splicing and implications to vascular endothelial growth factor homeostasis and preeclampsia. Circulation Research 102, 1566–1574 (2008).

  65.

    Szalai, G. et al. Full-Length Human Placental sFlt-1-e15a Isoform Induces Distinct Maternal Phenotypes of Preeclampsia in Mice. PLoS ONE 10, e0119547 (2015).

  66.

    Kendall, R. L. & Thomas, K. A. Inhibition of vascular endothelial cell growth factor activity by an endogenously encoded soluble receptor. PNAS USA 90, 10705–10709 (1993).

  67.

    Nautiyal, J. Transcriptional coregulator RIP140: an essential regulator of physiology. J. Mol. Endocrinol. 58, R147–R158 (2017).

  68.

    Ambati, B. K. et al. Corneal avascularity is due to soluble VEGF receptor-1. Nature 443, 993–997 (2006).

  69.

    Shibuya, M. Involvement of Flt-1 (VEGF receptor-1) in cancer and preeclampsia. Proc. Jpn. Acad, Ser. B 87, 167–178 (2011).

  70.

    Nagamatsu, T. et al. Cytotrophoblasts up-regulate soluble fms-like tyrosine kinase-1 expression under reduced oxygen: an implication for the placental vascular development and the pathophysiology of preeclampsia. Endocrinology 145, 4838–4845 (2004).

  71.

    Munaut, C. et al. Hypoxia is responsible for soluble vascular endothelial growth factor receptor-1 (VEGFR-1) but not for soluble endoglin induction in villous trophoblast. Human Reproduction 23, 1407–1415 (2008).

  72.

    Ikeda, T. et al. Hypoxia down-regulates sFlt-1 (sVEGFR-1) expression in human microvascular endothelial cells by a mechanism involving mRNA alternative processing. Biochem. J. 436, 399–407 (2011).

  73.

    Ikeda, T. et al. Regulation of soluble Flt-1 (VEGFR-1) production by hnRNP D and protein arginine methylation. Molecular and Cellular Biochemistry 1–10 (2016).

  74.

    Ciribilli, Y. et al. The Coordinated P53 and Estrogen Receptor Cis-Regulation at an FLT1 Promoter SNP Is Specific to Genotoxic Stress and Estrogenic Compound. PLoS ONE 5, e10236–15 (2010).

  75.

    Jinnin, M. et al. Suppressed NFAT-dependent VEGFR1 expression and constitutive VEGFR2 signaling in infantile hemangioma. Nat. Med. 14, 1236–1246 (2008).

  76.

    Barber, T. D. et al. Identification of Target Genes Regulated by PAX3 and PAX3–FKHR in Embryogenesis and Alveolar Rhabdomyosarcoma. Genomics 79, 278–284 (2002).

  77.

    Menendez, D. et al. A Single-Nucleotide Polymorphism in a Half-Binding Site Creates p53 and Estrogen Receptor Control of Vascular Endothelial Growth Factor Receptor 1. Molecular and Cellular Biology 27, 2590–2600 (2007).

  78.

    Das, B. et al. A Hypoxia-Driven Vascular Endothelial Growth Factor/Flt1 Autocrine Loop Interacts with Hypoxia-Inducible Factor-1 through Mitogen-Activated Protein Kinase/Extracellular Signal-Regulated Kinase 1/2 Pathway in Neuroblastoma. Cancer Research 65, 7267–7275 (2005).

  79.

    Koyano-Nakagawa, N. et al. Feedback Mechanisms Regulate Ets Variant 2(Etv2) Gene Expression and Hematoendothelial Lineages. J. Biol. Chem. 290, 28107–28119 (2015).

  80.

    Begay, V., Smink, J. & Leutz, A. Essential Requirement of CCAAT/Enhancer Binding Proteins in Embryogenesis. Molecular and Cellular Biology 24, 9744–9751 (2004).

  81.

    White, R. et al. Role of RIP140 in metabolic tissues: Connections to disease. FEBS Letters 582, 39–45 (2007).

  82.

    Pratt, A. et al. Placenta-derived angiogenic proteins and their contribution to the pathogenesis of preeclampsia. Angiogenesis 18, 115–123 (2014).

  83.

    Soleymanlou, N. et al. Molecular Evidence of Placental Hypoxia in Preeclampsia. J. Clin. Endocrinol. Metab. 90, 4299–4308 (2005).

  84.

    Nevo, O. Increased expression of sFlt-1 in in vivo and in vitro models of human placental hypoxia is mediated by HIF-1. AJP: Regulatory, Integrative and Comparative Physiology 291, R1085–R1093 (2006).

  85.

    Rajakumar, A., Whitelock, K. A. & Weissfeld, L. A. Selective overexpression of the hypoxia-inducible transcription factor, HIF-2α, in placentas from women with preeclampsia. Biology of Reproduction 64, 499–506 (2001).

  86.

    Davies, E. J. et al. Epithelial-mesenchymal transition during extravillous trophoblast differentiation. Cell Adh Migr 10, 310–321 (2016).

  87.

    Luef, B. M. et al. Validation of hospital discharge diagnoses for hypertensive disorders of pregnancy. Acta Obstet Gynecol Scand 95, 1288–1294 (2016).

  88.

    Malshe, A. K. & Sibai, B. M. Angiogenic and Antiangiogenic Markers for Prediction and Risk Classification of Preeclampsia. Clin Obstet Gynecol 60, 134–140 (2017).

  89.

    American College of Obstetricians and Gynecologists Task Force on Hypertension in Pregnancy. Hypertension in pregnancy. Report of the American College of Obstetricians and Gynecologists’ Task Force on Hypertension in Pregnancy. in 122, 1122–1131 (2013).

  90.

    Goel, A. et al. Epidemiology and Mechanisms of De Novo and Persistent Hypertension in the Postpartum Period. Circulation 132, 1726–1733 (2015).

  91.

    Heyer, E. E., Ozadam, H., Ricci, E. P., Cenik, C. & Moore, M. J. An optimized kit-free method for making strand-specific deep sequencing libraries from RNA fragments. Nucleic Acids Research 43, e2–e2 (2015).

  92.

    Martin, M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet. Journal 17, 10 (2011).

  93.

    Sheppard, S., Lawson, N. D. & Zhu, L. J. Accurate identification of polyadenylation sites from 3′ end deep sequencing using a naive Bayes classifier. Bioinformatics 29, 2564–2571 (2013).

  94.

    Benjamini, Y. & Hochberg, Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. Journal of the Royal Statistical Society Series B … (1995).

Acknowledgements

We would like to acknowledge Hakan Ozadam, Alan Derr, and Manuel Garber for technical help with bioinformatics analyses and Nathan Lawson for insightful discussions. This study was supported by funding from the Bill and Melinda Gates Foundation Grand Challenges Exploration program and the Howard Hughes Medical Institute; M.J.M was a HHMI Investigator at the time the study was conducted.

Author information

Contributions

M.J.M. conceived of the project, A.R. collected the samples and prepared total RNA, and A.A.P. made PAS-Seq libraries and performed data analysis. Y.K. wrote custom pipelines for analyzing PAS-Seq data. M.J.M. and A.A.P. wrote the main manuscript text and A.A.P. prepared figures. J.A.B. and S.A.K. provided material support, expert advice and participated in manuscript editing. All authors reviewed the manuscript.

Corresponding author

Correspondence to Melissa J. Moore.

Ethics declarations

Competing Interests

S.A.K. is a co-inventor on multiple patents for preeclampsia diagnostics and therapies that are held by Beth Israel Deaconess Medical Center and reports serving as a consultant to Thermofisher Scientific and has financial interest in Aggamin LLC. The remaining authors report no conflicts.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Cite this article

Ashar-Patel, A., Kaymaz, Y., Rajakumar, A. et al. FLT1 and transcriptome-wide polyadenylation site (PAS) analysis in preeclampsia. Sci Rep 7, 12139 (2017). https://doi.org/10.1038/s41598-017-11639-6
