Abstract
The placenta mediates adverse pregnancy outcomes, including preeclampsia, which is characterized by gestational hypertension and proteinuria. Placental cell type heterogeneity in preeclampsia is not well-understood and limits mechanistic interpretation of bulk gene expression measures. We generated single-cell RNA-sequencing samples for integration with existing data to create the largest deconvolution reference of 19 fetal and 8 maternal cell types from placental villous tissue (n = 9 biological replicates) at term (n = 40,494 cells). We deconvoluted eight published microarray case–control studies of preeclampsia (n = 173 controls, 157 cases). Preeclampsia was associated with excess extravillous trophoblasts and fewer mesenchymal and Hofbauer cells. Adjustment for cellular composition reduced preeclampsia-associated differentially expressed genes (log2 fold-change cutoff = 0.1, FDR < 0.05) from 1154 to 0, whereas downregulation of mitochondrial biogenesis, aerobic respiration, and ribosome biogenesis were robust to cell type adjustment, suggesting direct changes to these pathways. Cellular composition mediated a substantial proportion of the association between preeclampsia and FLT1 (37.8%, 95% CI [27.5%, 48.8%]), LEP (34.5%, 95% CI [26.0%, 44.9%]), and ENG (34.5%, 95% CI [25.0%, 45.3%]) overexpression. Our findings indicate substantial placental cellular heterogeneity in preeclampsia contributes to previously observed bulk gene expression differences. This deconvolution reference lays the groundwork for cellular heterogeneity-aware investigation into placental dysfunction and adverse birth outcomes.
Introduction
The public health burden of adverse pregnancy outcomes is substantial. An important example is preeclampsia, which affected 6.5% of all live births in the United States in 2017 and is characterized by high maternal blood pressure, proteinuria, and damage to other organ systems1. Adverse pregnancy outcomes may lead to myriad health complications including an elevated risk of chronic diseases throughout the life course2. The placenta, a temporary organ that develops early in pregnancy, promotes maternal uterine artery remodeling; mediates transport of oxygen, nutrients, and waste3; secretes hormones to regulate pregnancy; metabolizes various macromolecules and xenobiotics; and can serve as a selective barrier to some, but not all, pathogens and xenobiotics4. The executive summary of the Placental Origins of Adverse Pregnancy Outcomes: Potential Molecular Targets workshop recently concluded that most adverse pregnancy outcomes are rooted in placental dysfunction5. Despite this, the molecular underpinnings of placental dysfunction are poorly understood.
Placenta-specific cell types including cytotrophoblasts, syncytiotrophoblasts, extravillous trophoblasts, and placental resident macrophage Hofbauer cells are all essential for placental development, structure, and function6. Dysfunction of these specific cell types likely plays a role in placental pathogenesis. For example, extravillous trophoblasts are responsible for invading into the maternal decidua early in pregnancy to remodel uterine arteries and increase blood flow to the placenta3. Inadequate or inappropriate invasion of extravillous trophoblasts has previously been implicated in preeclampsia etiology7,8,9. Despite some knowledge of the roles of specific placental cell types in the development of preeclampsia, relatively little is known about how individual cell types contribute to placental dysfunction.
Existing research models used to investigate the function and dysfunction of individual cell types are limited. Protocols to isolate primary placental cells for experimental research are restricted to one or few cell types10,11,12,13,14,15. Cell type-specific assays are costly and require special techniques or training resulting in small sample sizes and have not yet been scalable to large epidemiological studies16,17,18. Furthermore, placental cell lines such as BeWo, derived from choriocarcinoma19, and HTR-8/SVneo, immortalized by SV4020, are typically derived by processes that alter the DNA of the cells, limiting their in vivo translatability. Consequently, the characteristics of even healthy placental cell type function and especially their connections to adverse outcomes such as preeclampsia are incompletely understood.
Measures of gene expression in bulk placental tissue are used to better understand the biological mechanisms underlying adverse pregnancy outcomes21,22,23 and are common in epidemiological studies24. Gene expression profiles differ systematically by cell type25,26. Thus, bulk placental tissue-level gene expression measurements represent a convolution of gene expression signals from individual cells and cell types27,28. Deconvolution refers to the bioinformatic process of estimating the distribution of cell types that constitute the tissue29,30. Deconvoluting tissue-level gene expression profiles is essential to account for effects introduced by unmodeled cell type proportions31 by disentangling shifts in cell type proportions from direct changes to cellular gene expression32. Reference-based deconvolution boasts biologically interpretable cell type proportion estimates with few modeling assumptions but relies on independently collected cell type-specific gene expression profiles as inputs32. Prior placental cell type-specific gene expression measures from term villous tissue16,17 had a limited number of biological replicates and included neither technical replicates nor benchmarking against physically isolated placental cell types. A robust, accessible, and publicly available gene expression deconvolution reference is currently unavailable for healthy placental villous tissue.
To advance the field of perinatal molecular epidemiology, our goal was to develop an accessible and robust gene expression deconvolution reference for healthy placental villous tissue at term. We generated single-cell RNA-sequencing data with technical replicates for integration with existing cell type-specific placental gene expression data16,17. In addition, we benchmarked these single-cell cell type-specific gene expression profiles against placental cell types isolated with more conventional fluorescence-activated cell sorting (FACS) followed by bulk RNA-sequencing. Finally, to assess links between preeclampsia and placental cell types and their proportions, we applied our placenta cell type gene expression reference to deconvolute bulk placental tissues in a secondary data analysis of a case–control study33 of preeclampsia, including a mediation analysis of the preeclampsia-associated genes FLT1, LEP, and ENG that quantifies the role cellular composition plays in explaining bulk gene expression measures.
Results
Single-cell gene expression map of healthy placental villous tissue
A conceptual layout of the laboratory methods and analyses contained within this manuscript is provided in Supplementary Fig. 1. From healthy term placental villous tissue, 9244 cells across a total of two biological replicates and two technical replicates were sequenced and analyzed (Michigan sample). These data were combined with single-cell RNA-sequencing data of 5911 cells from three healthy term villous tissue samples in a previously published study (Pique-Regi sample)17 and 25,339 cells from four healthy term villous tissue samples in another previously published study, two of which were subsampled with an additional peripheral placental villous tissue sample (Tsang sample)16 (Supplementary Table 1). Cells were excluded if they had low RNA content (<500 unique RNA molecules), few genes detected (<200), or were doublets or outliers in mitochondrial gene expression (Supplementary Figs. 2 and 3). Fetal or maternal origin of cells was determined by genetic variation in sequencing data. Fetal sex was determined by XIST expression (Supplementary Fig. 4). The final analytic sample included 40,494 cells and 36,601 genes across nine biological replicates, two of which had a technical replicate and another two included peripheral subsampling.
Uniform Manifold Approximation and Projection (UMAP)34 was used to visualize sequencing results in two dimensions with mutual nearest neighbor batch correction35 (Fig. 1a). Cells clustered into 19 fetal and 8 maternal cell types with 84.4% of all cells being of fetal origin (Table 1). Cell type clustering decisions balanced cluster stability, resolution, and biologic plausibility with prior knowledge. If desired, downstream analyses could collapse cell subtypes into a single, more general cell type cluster. We observed placenta-specific trophoblast cell types including cytotrophoblasts (KRT7), proliferative cytotrophoblasts (KRT7, STMN1 and other proliferation-related genes)36, extravillous trophoblasts (HLA-G)37, and syncytiotrophoblasts (PSG4 and other pregnancy-specific hormone genes) (Supplementary Fig. 5a)38. Proliferative cytotrophoblasts were distinguished from other cytotrophoblasts by overexpression of genes related to cytoplasmic translation (padj = 8.1 × 10−15) and mitotic sister chromatin segregation (padj = 1.5 × 10−12), indicative of their proliferative phenotype (Supplementary Fig. 6). Other fetal-specific cell types included mesenchymal stem cells (COL1A1lo, TAGLNlo, LUMhi), fibroblasts (COL1A1hi, TAGLNhi, LUMlo)39, endothelial cells (PECAM1)40, and Hofbauer cells (CD163)11 (Fig. 1b).
a Uniform Manifold Approximation and Projection (UMAP) plot of all cells (n = 40,494), with each cell colored by cell type cluster. b UMAP plot of fetal cells only (n = 34,165), with each cell colored by cell type cluster. c UMAP plot of maternal cells only (n = 6329), with each cell colored by cell type cluster.
Fetal and maternal lymphocytes, B cells, and monocytes were also captured (Fig. 1b, c). We observed fetal and maternal B cells (CD79A)41 and maternal plasma cells (XBP1, IGHA and other immunoglobulins)42. We also observed fetal and maternal CD14+ monocytes (CD14+/FCGR3A−), maternal FCGR3A+ monocytes (CD14+/FCGR3A+)43, and a small population of fetal plasmacytoid dendritic-like cells (FLT3+/ITM2C+)44,45. We further observed fetal and maternal natural killer cells (NKG7), fetal GZMB+ or GZMK+ natural killer cell subtypes, and fetal natural killer T cells (NKG7+/CD3E+/CD8A-)46,47. Finally, we observed a variety of T cell subtypes: naïve CD4+ (CCR7, CD3E, CD4), naïve CD8+ (CCR7, CD3E, CD8A), memory CD4+ (S100A4, CD3E, CD4, IL2, CCR7lo), and activated CD8+ T cells (NKG7, CD3E, CD8A) (Supplementary Fig. 5b)48.
To identify upregulated genes in each cell type, we compared the expression of a gene in one cell type against that gene’s average expression in all other cell types (Supplementary Data 2). Consequently, the same genes could be upregulated across several cell types of a similar lineage. FLT1 expression was highly upregulated in extravillous trophoblasts (log2 fold-change (FC) = 3.89, padj < 0.001). Trophoblast cell types had the largest and most diverse transcriptomes, characterized by the largest number of unique RNA transcripts and detected genes per cell (Supplementary Fig. 7). Functional analysis of upregulated genes revealed cell type-specific biological processes (Supplementary Data 3). For example, fetal extravillous trophoblasts were enriched for genes relevant to placental structure and function such as cell migration (padj < 0.001) and response to oxygen levels (padj < 0.001) and syncytiotrophoblasts were enriched for genes involved in steroid hormone biosynthetic process (padj < 0.001). Technical replication in Michigan samples 1 and 2 appeared high in UMAP space (Supplementary Fig. 8a, b). Indeed, the average intra-cluster gene expression between technical replicates had an average Spearman correlation (mean ± standard deviation) of 0.94 ± 0.14 for sample 1 and 0.88 ± 0.20 for sample 2 (p values < 0.001).
Single-cell RNA-sequencing deconvolution reference exhibits excellent in silico performance
Based on the single-cell data, we created a placental signature gene matrix that incorporated expression information across an algorithmically selected 5229 signature genes to estimate the cellular composition of 27 fetal and maternal cell types from whole tissue gene expression data (Supplementary Fig. 9). To test the performance and robustness of this placental single-cell RNA-sequencing deconvolution reference, we randomly split our analytic single-cell RNA-sequencing dataset into 50% training and 50% testing subsets with balanced cell type proportions49. The same training dataset was used for each comparison; test mixtures were generated from the testing half of the dataset. Using a signature gene expression matrix generated from the training data, we estimated cell type composition in in silico pseudo-bulk testing data mixtures of known cell type composition with varying contributions of fetal vs. maternal origin cells and male vs. female fetal cells (Fig. 2). In all mixtures, the 27 predicted and actual cell type proportions were correlated (p value < 0.001 for each test). In the primary deconvolution analysis of all cell types at their natural rates (n = 20,242), estimated and actual cell type proportions had a Pearson correlation coefficient of 0.956 (95% CI [0.904, 0.980]). The worst performance was under the unrealistic scenario that the mixture was composed entirely of maternal cell types (n = 3162) with a Pearson correlation of 0.734 (95% CI [0.491, 0.871]) between actual estimated cell type proportions. Our deconvolution reference was also robust to fetal sex when only male fetal cells (Pearson correlation = 0.893, 95% CI [0.776, 0.950]) were included (n = 8394), or only female fetal cells (Pearson correlation = 0.983, 95% CI [0.964, 0.993]) (n = 8394). Together, these results show that our reference panel can successfully deconvolute placental tissues, though some maternal cell types common to both mother and fetus may be erroneously labeled fetal in the absence of fetal cells of those cell types.
Scatter plots summarizing the performance of our single-cell deconvolution reference using CIBERSORTx with in silico mixtures of single-cell libraries from a 50/50 training/test split of the integrated single-cell RNA-seq dataset (n = 40,494). The same training dataset was used for each comparison; test mixtures were generated from the testing half of the dataset. Predicted deconvoluted cell type proportions for each of the 27 cell types are encoded on the x-axis. Actual cell type proportions from the test dataset are encoded on the y-axis. Correlation coefficients and root mean square error measures are presented for each comparison. A linear line of best fit overlays the results. The gray shaded area represents the 95% confidence intervals around the simple linear regression estimates. a The test mixture is the test half of the single-cell dataset (n = 20,242). b The test mixture sampled only fetal cells (n = 17,080). c The test mixture sampled only maternal cells (n = 3162). d The test mixture sampled only female fetal cells (n = 8394). e The test mixture sampled only male fetal cells (n = 8394).
Fluorescence-activated cell sorting of major placental cell types yielded mixed cell type isolation results
We isolated whole bulk placental villous tissue, enriched syncytiotrophoblasts, and sorted five cell types (Hofbauer cells, endothelial cells, fibroblasts, leukocytes, extravillous trophoblasts, and cytotrophoblasts) via FACS from four healthy term, uncomplicated Cesarean sections for bulk RNA-sequencing, labeled Sorted 1 (same sample source as single-cell RNA-sequencing sample 1), Sorted 2, Sorted 3, and Sorted 4 (Supplementary Fig. 10 and Supplementary Table 2). For analysis, as recommended50, we excluded 19,048 genes that were not present in at least 3 samples and an additional 865 genes that did not have a cumulative library size-normalized count of at least 10. Principal component (PC) analysis of whole-transcriptome sorted-cell bulk RNA-sequencing normalized counts is provided in Supplementary Fig. 11.
To identify upregulated genes in each cell type, we compared the expression of a gene in one cell type against that gene’s average expression in all other cell types (Supplementary Fig. 12). Consequently, the same genes could be upregulated across several cell types of a similar lineage. All 38,468 uniquely mapping genes were tested. A total of 746 genes were algorithmically dropped from the syncytiotrophoblast contrast due to excessively low counts, low variability, or extreme outlier status. Large-scale gene expression differences were observed for each cell type (Supplementary Data 4). Functional analysis of upregulated genes revealed cell type-specific biological processes (Supplementary Data 5). For example, syncytiotrophoblasts were enriched for genes relevant to placental structure and function such as angiogenesis, cell-substrate adhesion, and regulation of epithelial cell proliferation (padj < 0.001). To compare sorted and single-cell differential expression and enrichment results, we tabulated the number of unique genes and pathways overlapping between the two analyses after collapsing the single-cell cell type cluster labels to the seven cell type fractions that we had targeted to isolate for downstream analyses (Supplementary Table 3). On average, 15.0% of single-cell upregulated genes and 5.9% of enriched pathways were also identified among the sorted-cell data. On average, 17.5% of sorted cell type upregulated genes and 39.2% of pathways were also identified among the single-cell data. Sorted endothelial cell results were limited due to the limited number of biological replicates.
We applied the single-cell deconvolution reference to estimate cell proportions in the 4 whole tissue (with 1 additional technical replicate) and 19 sorted or enriched cell type fractions. We collapsed the single-cell cell type cluster labels to the seven cell type fractions we targeted for isolation for downstream analyses (Supplementary Data 6, Sheet 1). All deconvoluted samples exhibited high goodness-of-fit between original bulk mixtures and the estimated cell type proportion mixtures (p values < 0.001). Among the signature genes, original bulk and estimated cell type fractions had a Pearson correlation (mean ± standard deviation) of 0.73 ± 0.11 and root mean square error of 0.88 ± 0.04 (Supplementary Data 6, Sheet 2). Deconvolution results (mean ± standard deviation) suggest we successfully isolated fibroblast- (n = 3, 74.7% ± 0.6%) and leukocyte-enriched (n = 4, 82.3% ± 24.8%) cell type fractions. Other cell type targets were less successful (range 0–26% estimated purity). The Hofbauer cell fraction was predicted to be mostly leukocytes (n = 4, 91.5% ± 0.5%).
Cell proportion deconvolution of bulk placental tissue preeclampsia dataset
We applied the single-cell deconvolution reference to estimate cell proportions from bulk placental tissue in 157 preeclampsia cases and 173 controls33 compiled from eight previously published studies33,51,52,53,54,55,56,57. Mean gestational age was 2.2 weeks younger in cases than controls (p value < 0.001, Table 2). All deconvoluted samples exhibited high goodness-of-fit between original bulk mixtures and the estimated cell type proportion mixtures (p values < 0.001). Among the signature genes, original bulk and estimated mixtures had a Pearson correlation (mean ± standard deviation) of 0.70 ± 0.04 and root mean square error of 0.73 ± 0.03 (Supplementary Data 7). Fetal naïve CD4+ T cells and fetal GZMB+ natural killer cells were estimated to be at 0% abundance in all samples and were dropped from downstream analyses. Cytotrophoblasts were the most abundant (mean ± standard deviation) estimated fetal cell type (27.9% ± 4.3%) followed by syncytiotrophoblasts (23.4% ± 5.0%) and mesenchymal stem cells (10.3% ± 3.3%). The most common maternal cell types were naïve CD8+ T cells (2.8% ± 1.5%), plasma cells (1.4% ± 0.7%), and B cells (1.3% ± 0.8%). A comparison of deconvoluted whole tissue cell type proportions among healthy individuals (Supplementary Fig. 13) between the microarray dataset GSE75010 (n = 173 controls), our whole tissue bulk RNA-sequencing samples (sorted samples 1–4), and the single-cell dataset compiled here (single-cell samples 1–9) suggests that syncytiotrophoblasts and endothelial cells are underrepresented in the single-cell data. This is likely due to dissociation bias, which has been commonly observed in single-cell assays of other tissues58. Overall, the Pearson correlation of the average deconvoluted cell type proportion across the 27 cell types between healthy bulk RNA-sequencing and microarray controls was 0.80 (95% CI: [0.60, 0.91]).
Differentially abundant cell type proportions in preeclampsia cases versus controls
To test for differences in cell proportions between preeclampsia cases and controls (Supplementary Fig. 14), we fit beta regression models for each cell type proportion adjusted for study source, fetal sex, and gestational age to estimate the prevalence odds ratio for each cell type (Supplementary Data 8). Among fetal cell types, extravillous trophoblasts (p < 0.001), memory CD4+ T cells (p = 0.007), CD8+ activated T cells (p = 0.005), and natural killer T cells (p = 0.006) were more abundant (Fig. 3) in preeclampsia cases relative to controls. The unadjusted median extravillous trophoblast abundance was 6.4% among cases compared to 2.1% among controls. Mesenchymal stem cells (median percent composition in cases vs. controls, 8.8% vs. 11.0%), Hofbauer cells (2.7% vs. 4.4%), and fetal naive CD8+ T Cells (4.2% vs. 4.5%) were all less abundant among preeclampsia cases compared to controls (p < 0.001). Among maternal cell types, maternal plasma cells (1.6% vs. 1.2%) were more abundant among preeclampsia cases compared to controls (p < 0.001).
Forest plot of multivariate beta regression models’ prevalence odds ratio estimates adjusted for study source, gestational age, and fetal sex tested for a difference in each cell type’s proportions in cases versus controls (n = 157 cases, 173 controls). Horizontal lines indicate the range of the 95% confidence interval.
Differential expression between preeclampsia cases and controls attenuated by cell type proportion adjustment
To test whether microarray gene expression differences between preeclampsia cases and controls are partly driven by differences in cell type abundances, we fit linear differential gene expression models adjusted for covariates study source, fetal sex, and gestational age with and without adjustment for deconvoluted cell type proportions. To reduce the number of model covariates and account for dependence between deconvoluted cell type proportions, we applied PC analysis to deconvoluted cell type proportions. The first five PCs accounted for 87.2% of the variance in deconvoluted cell type proportions and were added as additional covariates to form the cell type-adjusted model. Variation in PCs 1 and 2 was largely driven by syncytiotrophoblasts (33.8%), extravillous trophoblasts (33.5%), and cytotrophoblasts (15.3%) proportions and provided some separation between cases from controls (Supplementary Fig. 15a, c). Variation in PC3 was largely driven by cytotrophoblasts (50.1%) and to a lesser extent syncytiotrophoblasts (16.6%), mesenchymal stems cells (14.5%), and extravillous trophoblasts (13.7%) (Supplementary Fig. 15b, d).
In the cell type-naïve base models (n = 14,651 genes, 173 controls, and 157 cases) adjusted for study source, gestational age, and fetal sex, 550 genes were differentially upregulated and 604 were downregulated in preeclampsia cases versus controls (Fig. 4a and Supplementary Data 9). Gene set enrichment analysis of biological processes identified 41 overrepresented pathways in the base model (Fig. 5a and Supplementary Data 10). Biological process pathways such as aerobic respiration (false discovery adjusted q < 0.001), mitochondrial respiratory chain complex assembly (q < 0.001), glutathione metabolism (q = 0.003), and ribosome biogenesis (q = 0.001) were overrepresented among downregulated genes. No pathways were overrepresented among upregulated genes, though intermediate filament organization (q = 0.26), keratinocyte differentiation (q = 0.77), and endothelial cell development (q = 0.42) had comparable enrichment scores. Remarkably, when the base model was additionally adjusted for the first five PCs of imputed cell type proportions, there were zero differentially expressed genes between preeclampsia cases and controls (Fig. 4b and Supplementary Data 9). Of the cell type-adjusted results, 19 pathways were overrepresented (Fig. 5b and Supplementary Data 10). Downregulation of mitochondrial respiratory chain complex assembly (q < 0.001), aerobic respiration (q = 0.001), ribosome biogenesis (q = 0.001), and glutathione metabolism (q = 0.02) were overrepresented among downregulated genes. Detection of chemical stimulus involved in sensory perception of smell (q = 0.04) and non-coding RNA processing (q = 0.04) were also overrepresented pathways among downregulated genes. Neuroepithelial cell differentiation (q = 0.04) was overrepresented among upregulated genes. Vascular endothelial growth factor receptor signaling pathway (q = 0.15), of which FLT1 is a member, had an enrichment score of 1.77 (up from 1.34, q = 0.43 in the base model) but did not meet the q value cutoff. Overall, downregulation of mitochondrial biogenesis, aerobic respiration, and ribosome biogenesis and related pathways were robust to cell type proportion adjustment.
Volcano plots comparing differentially expressed genes in samples from 153 preeclampsia cases versus 173 healthy controls across two models: a the base model adjusted for covariates fetal sex, study source, and gestational age and b the model adjusted for fetal sex, study source, and gestational age and additionally adjusted for the first five principal components of estimated cell type proportions. Dotted line represents a false discovery rate-adjusted q value of 0.05. FLT1, LEP, and ENG are labeled as genes of interest in preeclampsia.
Top Gene Set Enrichment Analysis pathways from the Gene Ontology: Biological Processes database results for the differential expression analysis by preeclampsia case–control status. Results arranged by descending magnitude of the absolute value of the normalized enrichment score. Pathways colored red are significant at a false discovery rate-adjusted (FDR) q value of 0.05 whereas pathways in blue are statistically insignificant. a Top pathways from the cell type-unadjusted analysis. b Top pathways from the cell type-adjusted analysis.
Differential expression of preeclampsia-associated genes mediated by placental cell type proportions
Overexpression of FLT1 in placental tissue59,60,61,62, detection of a soluble isoform of FLT1 in maternal circulation63,64, and fetal genetic variants near FLT165 have implicated FLT1 in preeclampsia etiology. Because we observed cell type-specific expression patterns of FLT1 in trophoblasts, particularly in extravillous trophoblasts, we hypothesized that the observed attenuation of FLT1 differential expression may be due in part to the differences in cell type proportions observed between preeclampsia cases and controls. To test this hypothesis, we applied a unified mediation and interaction analysis to quantify the proportion of FLT1 expression differences mediated by deconvoluted cell type proportions. We did not observe an interaction between preeclampsia status and cellular composition (overall proportion attributable to interaction = −5.8%, 95% CI [−17.1%, 5.0%]). We therefore dropped interaction parameters from the model for the final analysis. In the model without interaction, 37.8% (95% CI [27.5%, 48.8%]) of the 1.05 (95% CI [0.89, 1.21]) log2 signal intensity increase in the association between preeclampsia and FLT1 expression was attributable to differences in placental cell composition between preeclampsia cases and controls (Fig. 6). Overexpression of LEP and ENG have also been associated with preeclampsia59,60,61,62. Mediation results were similar for LEP (total effect = 2.62 (95% CI [2.26, 2.97] log2 signal intensity increase; proportion mediated = 34.5% (95% CI [26.0%, 44.9%]) and ENG (total effect = 0.93 (95% CI [0.79, 1.07] log2 signal intensity increase; proportion mediated = 34.5% (95% CI [25.0%, 45.3%]).
Mediation of FLT1 gene expression by placental cell type composition (n = 157 cases, 173 controls). Placental cell composition was operationalized as first five principal components of estimated cell type proportions. 95% confidence intervals are provided after effect estimates for each model parameter. The same framework was also applied with LEP or ENG expression as the outcome.
Discussion
To create the largest publicly available placental RNA deconvolution reference of 19 fetal and 8 maternal cell type-specific gene expression profiles, we newly sequenced placental villous cells, integrated those results with data from previously published studies, and built a signature gene matrix for deconvolution of bulk villous tissue gene expression data. In silico testing of our deconvolution reference demonstrated successful and robust deconvolution. To compare single-cell placental cell type expression profiles to more conventional sorting methods, we created a FACS scheme to enrich and sequence RNA from five important placental cell types as well as syncytiotrophoblasts. Deconvolution of sorted cell type fractions with the single-cell deconvolution reference suggested most conventionally sorted cell types are far less pure than what can be accomplished with clustering and aggregation of single-cell results, and at much lower cell type resolution. We applied the single-cell deconvolution reference to estimate cell type proportions in a previously published epidemiologic microarray study of the pregnancy complication preeclampsia, revealing placental cell type proportion differences between preeclampsia cases and controls at term. We then showed that large gene expression differences between preeclampsia cases and controls were markedly attenuated after adjustment for cell type proportions. Preeclampsia-associated pathways, including downregulation of mitochondrial biogenesis, aerobic respiration, and ribosome biogenesis were robust to cell type adjustment, suggesting direct changes to these pathways. Finally, to quantify the attenuation of differential expression of the preeclampsia biomarkers FLT1, LEP, and ENG, we applied mediation analysis to show cellular composition mediated a substantial proportion of the association between preeclampsia and FLT1, LEP, and ENG overexpression. Cell type proportions may be an important and often overlooked factor in gene expression differences in placental tissue studies.
By integrating our new single-cell RNA-sequencing results with those from a previously published study, our integrated dataset, to our knowledge, is the largest and possibly only reference available for healthy, term placental villous tissue to date. We document term cell type-specific gene expression patterns for well-characterized placental cell types, including syncytiotrophoblasts10, cytotrophoblasts13, and extravillous trophoblasts14. In addition, we provide gene expression markers for relatively understudied placental cell types such as endothelial cells, mesenchymal stem cells, and Hofbauer cells as well as maternal peripheral mononuclear cells recovered from the maternal-fetal interface. Compared to the previous analysis of the published samples17 which relied on predominately sex-specific gene expression markers to differentiate proliferative from non-proliferative cytotrophoblasts, we show that functional enrichment analysis revealed broad upregulation of proliferation pathways in proliferative cytotrophoblasts. The low representation of some cell types such as trophoblasts in our single-cell RNA-sequencing results from the Michigan study suggests that these cell types may be especially sensitive to dissociation and disintegrate before transcript capture, commonly referred to as dissociation bias58. Michigan samples 1 and 2 also included a cryopreservation step like those employed in large-scale epidemiological studies that may have exacerbated dissociation bias66; this applies to both single-cell and sorted cell type experiments. Future studies may propose alternative approaches to perform unbiased single-cell RNA-sequencing in placental tissues; indeed, single-nucleus RNA-sequencing has been used to characterize an in vitro syncytiotrophoblast model and may be more appropriate to assay such cell types sensitive to dissociation procedures67. We verified that our deconvolution reference exhibited strong performance even with extremely imbalanced and unlikely real-world test mixture distributions by fetal sex and maternal cell type representation.
Our preeclampsia findings are consistent with a prior pathophysiological understanding of the disorder, linking cell type proportion estimates and gene expression data in bulk tissue. Among preeclampsia cases, we observed an elevated proportion of extravillous trophoblasts and underrepresentation of stromal cell types, which may reflect an arrest in conventional placental cell type differentiation and maturation following insufficient uterine spiral artery remodeling implicated in preeclampsia68,69,70. A recent study of bulk placental gene expression across trimesters suggests that Hofbauer cells may more abundant in the second trimester compared to the third, possibly to support vasculogenesis, though this study involved a small deconvolution reference that contained a limited variety of cell types71. A better understanding of the evolution of temporal placental composition changes may yield greater insight into placental pathologies. In the cell type-naïve differential expression model, consistent with previous findings, placentas from pregnancies with preeclampsia overexpressed FLT1, LEP, and ENG59,60,61,62. In our cell type-adjusted model, FLT1 and LEP remained only nominally significant whereas ENG did not meet the nominal significance threshold. Mediation analysis confirmed that a significant proportion of FLT1, LEP, and ENG overexpression was attributable to differences in the cellular composition of the placenta. These results suggest that placental cell type proportion differences may be an overlooked factor in explaining the well-documented association between preeclampsia and FLT1, LEP, and ENG expression59,60,61,62. Downregulation of mitochondrial biogenesis, aerobic respiration, and ribosome biogenesis was robust to cell type adjustment, indicating direct changes to these pathways beyond shifts in cell type abundance. Indeed, disruption of the mitochondrial fission-fusion cycle72, malperfusion73,74, and inhibited protein synthesis secondary to endoplasmic reticulum stress75,76 have all previously been associated with preeclampsia. Interestingly, cell type adjustment increased the enrichment score results of vascular endothelial growth factor receptor signaling pathway, a mechanistic hypothesis in preeclampsia etiology63,73,77,78, from 1.36 to 1.77 (q = 0.43 to q = 0.15). This approach may reveal the biological mechanisms of other diseases beyond cellular composition differences. Because oxygen tension is a critical factor in trophoblast differentiation, inappropriate oxygenation may partially explain the elevated proportion of extravillous trophoblasts, though regulators of this process such as HIF1A and TGFB379 were not differentially expressed at the tissue level in our analysis. A recent single-cell RNA-sequencing case–control study of preeclampsia, however, identified upregulation of TGFB1 in extravillous trophoblasts, potentially indicative of altered trophoblast differentiation or invasion80,81. A similar study revealed decreased activity of gene network modules regulated by transcription factors ATF3, CEBPB, and GTF2B and decreased expression of CEBPB and GTF2B in preeclamptic extravillous trophoblasts compared to controls; follow-up in vitro experiments suggested CEBPB and GTF2B knockdown reduced extravillous trophoblast viability and invasion82. Consistent with our other findings, this study also observed a similar trend in cell type proportion differences and upregulation of FLT1 in extravillous trophoblasts and ENG in syncytiotrophoblasts between preeclampsia cases and controls80. Future work should consider and account for cell type proportions and the cell type-specific expression patterns of genes that regulate placental development or are associated with preeclampsia to better understand preeclampsia etiology.
This study has several strengths. We profile the parenchymal healthy term villous tissue in the placenta and integrate our dataset with samples from previously published studies to generate the largest, to the best of our knowledge, cell type-specific placental villous tissue gene expression reference to date. Single-cell RNA-sequencing allowed us to agnostically capture diverse placental cell types without a priori knowledge of cell types and their characteristics and tabulate gene expression patterns at high resolution and specificity. Our in silico deconvolution tests demonstrated robust performance to even extreme distributions of maternal or sex of fetal cells. We demonstrate technical replication of single-cell RNA-sequencing in placental villous tissue. We were able to apply our findings to a large target deconvolution dataset of preeclampsia that contained placental measures from hundreds of participants across eight different studies. Most importantly, we evaluate cell type proportion differences in an epidemiological study of placental parenchymal tissue and preeclampsia, and genome-wide gene expression differences accounting for cell type heterogeneity, a critical limitation in bulk tissue assays.
This study also has several limitations. Although our cellular sample size comprised of 40,494 cells is relatively large compared to previous single-cell RNA-sequencing studies of term placental villous tissue, this dataset still represents a limited biologic replicate sample size compared to epidemiologic scale studies. Our newly sequenced samples came from a convenience sample without available demographic information beyond uncomplicated and healthy Cesarean-section status. Similarly, the sample size of FACS-sorted tissues was limited, and some cell type fractions were excluded due to low RNA quality or exhibited poor estimated purity, likely complicated by degradation of cell surface markers from apoptosis characteristic of development and parturition83,84 and sample processing. This study did not include placental tissues for single-cell analysis from preeclamptic patients to confirm intra-cell type gene expression changes. Despite excellent in silico performance, we had no external gold standard to verify deconvolution performance. This deconvolution reference may not be sensitive enough to discriminate between cell subtypes such as proliferative vs. non-proliferative cytotrophoblasts that are clearly delineated in the single-cell analysis; in such cases, investigators may collapse cell type proportions counts into a single major cell type group, such as cytotrophoblasts. Future studies may verify whether cell type proportions estimated in diseased or vaginally delivered tissues are robust to a deconvolution reference generated from healthy villous tissue delivered via Cesarean-section. Residual confounding may remain in our statistical models due to the limited number of common covariates across all eight preeclampsia case–control studies. Due to the nature of villous tissue sampling, our study design is cross-sectional, limiting our ability to establish temporality between exposure and outcome to rule out reverse causation. As with any study conditioned on live birth, selection bias may affect our results. However, the effects of harmful exposures that lead to selection tend to be underestimated in these scenarios85,86. Therefore, our results likely represent a conservative underestimate of the effects of preeclampsia on inappropriate cell composition and preeclampsia status on gene expression.
In summary, we provide a cell type-specific deconvolution reference via single-cell RNA-sequencing in the parenchymal placental term villous tissue. We demonstrated this reference was robust to different distributions of maternal and fetal sex through in silico validation testing. In addition, we benchmarked these single-cell cell type-specific gene expression profiles against placental cell types isolated with more conventional FACS followed by bulk RNA-sequencing. We applied this deconvolution reference to an epidemiologic preeclampsia dataset to reveal biologically relevant shifts in placental cell type proportions between preeclampsia cases and controls. Once cell type proportion differences were accounted for, differential gene expression differences were markedly attenuated between preeclampsia cases and controls. Enrichment analysis revealed downregulation of mitochondrial biogenesis, aerobic respiration, and ribosome biogenesis were robust to cell type adjustment, suggesting direct changes to these pathways. A substantial proportion of the overexpression of the FLT1, LEP, and ENG in preeclampsia was mediated by placental cell composition. These results add to the growing body of literature that emphasizes the centrality of cell type heterogeneity in molecular measures of bulk tissues. We provide a publicly available placental cell type-specific gene expression reference for term placental villous tissue to overcome this critical limitation.
Methods
Placental tissue collection and dissociation
Placentas were collected shortly after delivery from healthy, full-term, singleton uncomplicated Cesarean sections at the University of Michigan Von Voigtlander Women’s Hospital. Pregnant women provided written informed consent for research use of discarded tissues. Study protocols for discarded tissue collection and research use were approved by the University of Michigan Institutional Review Board (HUM00017941, HUM00102038). Villous placental tissue biopsies were collected and minced for dissociation after cutting away the basal and chorionic plates and scraping villous tissue from blood vessels13. We subjected approximately 1 g minced dissected villous tissue to the Miltenyi Tumor Dissociation Kit on the GentleMACS Octo Dissociator with Heaters (Miltenyi Biotec) to yield single-cell suspensions of viable placental cells in 5 μM StemMACS™ Y27632 (Miltenyi Biotec) in RPMI 1640 (Gibco) according to manufacturer’s instructions for soft tumor type. Red blood cells were depleted using RBC lysis buffer (BioLegend) according to manufacturer’s protocol A. Single-cell suspensions were size-filtered at 100 μm to remove undigested tissue and subsequently at 40 μm12,13. To collect a syncytiotrophoblast-enriched fraction, the fraction between 40 and 100 μm was washed from the 40-μm strainers, adapting a previous protocol that collected syncytiotrophoblasts throughout this size range10. Single-cell suspensions <40 μm were cryogenically stored in 5 μM StemMACS™ Y27632 90% heat-inactivated fetal bovine serum (Gibco)/10% dimethyl sulfoxide (Invitrogen). For each placenta, additional whole villous tissue samples were stored in RNALater (Qiagen).
Previously published single-cell RNA-sequencing raw data of healthy, term placental villous tissue samples came from the Database of Genotypes and Phenotypes (Pique-Regi et al., accession number phs001886.v1.p187) SRR10166478 (Sample 3), SRR10166481 (Sample 4), and SRR10166484 (Sample 5)17. The collection and use of human materials for the study were approved by the Institutional Review Boards of the Wayne State University School of Medicine. All participating women provided written informed consent prior to sample collection17. Additional previously published samples came the European Genome-Phenome Archive (Tsang et al., accession number EGAS0000100244988) (Samples 6–9)16. The study was approved by the Joint Chinese University of Hong Kong-New Territories East Cluster Clinical Research Ethics Committee, and informed consent was obtained after the nature and possible consequences of the studies were explained. Pregnant women were recruited from the Department of Obstetrics and Gynecology, Prince of Wales Hospital, Hong Kong with informed consent; the subjects studied had consented to sequencing data archiving16.
Placental single-cell RNA-sequencing
Villous tissue single-cell suspensions were thawed and sorted via FACS with LIVE/DEAD Near-IR stain (Invitrogen) for viability and forward-scatter and side-scatter profiles to eliminate cellular debris and cell doublets. Viability- and size-sorted single-cell suspensions were submitted to the University of Michigan Advanced Genomics Core for single-cell RNA-sequencing. Single cells were barcoded, and cDNA libraries constructed on the Chromium platform (10X Genomics, Single Cell 3’ v2 chemistry). Paired-end 110 base pair reads were sequenced on NovaSeq 6000 (Illumina).
Single-cell RNA-sequencing preprocessing
Raw reads were processed, deconvoluted, droplet filtered, and aligned at the gene level with the Cell Ranger pipeline using default settings (v4.0.0, 10X Genomics) based on the GRCh38 GENCODEv32/Ensembl 98 reference transcriptome with STAR v2.5.1b89. Previously published single-cell RNA-sequencing raw data of healthy, term placental villous tissue samples from the Database of Genotypes and Phenotypes (Pique-Regi et al., accession number phs001886.v1.p1) SRR10166478 (Sample 3), SRR10166481 (Sample 4), and SRR10166484 (Sample 5)17 and from the European Genome-Phenome Archive (Tsang et al., accession number EGAS00001002449) (Samples 6–9)16 were processed identically. The freemuxlet program in the latest version (accessed December 5, 2021) of the “popscle” package was used to assign fetal or maternal origin and identify 736 mosaic doublets for removal based on single nucleotide polymorphisms with minor allele frequency greater than 10% from the 1000 Genomes Phase 3 reference panel (released May 2, 2013)90. Per cell quality control criteria were calculated using the quickQCPerCell() function (scater R package, version 1.18.6) with default settings91 (Supplementary Figs. 2 and 3) and included total unique RNA transcripts (also called unique molecular identifiers), unique genes, and percentage of reads mapping to mitochondrial genes92. According to the current recommended best practice, each batch was quality-controlled separately93. We excluded 6497 low-quality outlier cells defined as cells with less than 500 unique RNA molecules, less than 200 unique genes, or that were outliers in mitochondrial gene mapping rate. Mitochondrial mapping outliers exceeded four median absolute deviations in samples 1 and 2 (mitochondrial reads >9.2%) or three median absolute deviations in samples 3, 4, and 5 (mitochondrial reads >8.9%) and samples 6, 7, 8C, 8P, 9C, and 9P (mitochondrial reads >9.1%). To generate normalized gene expression data for visualizations and analyses that required normalization, single-cell gene counts were library size normalized by dividing the number of counts by the total number of counts expressed in that cell, multiplied by a scale factor of 10,000, and log-transformed with the NormalizeData() function (Seurat R package, version 4.1.1).
Single-cell RNA-sequencing clustering and cluster annotation
Maternal and fetal cells were split into separate datasets for clustering. To integrate data from cells across study sources and visualize clustering results with uniform manifold projection34, we used the mutual nearest neighbor batch correction approach via FastMNN from “SeuratWrapper” with default settings (R package, version 0.3.0)35. Supervised iterative clustering and sub-clustering with “Seurat” (R package, version 4.0.1) function FindClusters at different resolution parameters were evaluated using cluster stability via clustering trees in “clustree”94,95. A priori canonical cell type marker gene expression patterns and cluster marker genes were used to assign cell types to cell clusters (see results). Cells that fell outside cell type clusters or outlying in doublet density calculated with computeDoubletDensity were removed as putative doublets and doublet clusters were identified with findDoubletClusters for removal in “scDblFinder” (R package, version 1.4.0)96. 723 maternal-maternal or fetal-fetal putative doublets were excluded after integration and clustering. Using the manually annotated Michigan (this study) and Pique-Regi (phs001886.v1.p1) cell cluster labels as the reference data, Tsang sample (EGAS00001002449) cells were algorithmically annotated with “SingleR” (R package, version 1.6.1)97 with default settings, followed by manual review. Cells with low prediction certainty (assignment score lower than three median absolute deviations of all cells assigned) were excluded as putative maternal-maternal or fetal-fetal doublets. Fetal sex in Michigan (this study) samples was determined with average normalized XIST expression; fetal sex in Pique-Regi and Tsang samples was determined by annotation and confirmed with average normalized XIST expression (Supplementary Fig. 4). The final analytic sample included 40,494 cells and 36,601 genes across nine biological replicates, two of which had a technical replicate (Samples 1 and 2) and another two included peripheral subsampling (Samples 8 and 9).
Single-cell RNA-sequencing differential expression and biological pathway enrichment statistical analysis
Technical correlation was assessed by Spearman correlation after averaging the normalized expression for each gene by cluster and by technical replicate. Cluster marker genes were identified in “Seurat” with the FindAllMarkers function with default settings on single-cell gene expression counts92,95. Specifically, including both maternal and fetal cell types, the expression level in each cell type cluster was compared against the average expression of that gene across all other cell types using the two-tailed Wilcoxon rank sum test with significance defined at a false discovery rate-adjusted p value less than 0.05 and a log2 FC cutoff of 0.25. Pairwise cluster markers were identified in “Seurat” with the FindMarkers function with an identical testing regime. Overexpressed genes were ranked by decreasing log2 FC for functional enrichment analysis with “gprofiler2” (R package, version 0.2.0, database version e102_eg49_p15_7a9b4d6) using annotated genes as the universe, excluding electronically generated annotations, and with the g:SCS multiple testing correction method applying a significance threshold 0.0598.
In silico testing of deconvolution performance
To test the performance and robustness of our placental single-cell RNA-sequencing deconvolution reference, we randomly split our analytic single-cell RNA-sequencing dataset into 50% training and 50% testing subsets with balanced cell type proportions49. We applied the test subset with the CIBERSORTx Docker container (accessed December 7, 2021) to create a signature gene expression matrix to test deconvolution performance with default settings99. To evaluate the reference’s robustness to fetal sex and ability to discriminate immune cell types of fetal versus maternal origin, we generated in silico pseudo-bulk test mixtures with known distributions of fetal and maternal cells, as well as male and female placental cells. Test mixtures included all of the 50% testing data, only fetal cells from the test data, only maternal cells from the test data, only female fetal cells from the test data, or only male cells from the test data. For the female and male fetal cell test mixtures, the baseline distribution of maternal cells was maintained by randomly down-sampling the maternal cells and randomly down-sampling the male fetal cells to the number of female fetal cells. We used the signature matrix generated from the training data to estimate constituent cell type proportions in these test mixtures using CIBERSORTx with cross-platform S-mode batch correction and 50 permutations to evaluate imputation goodness-of-fit. Pearson correlations and root mean square error between the test set predicted and actual cell type proportions in the test mixtures were used to assess deconvolution performance.
Fluorescence-activated cell sorting of major placental cell types from villous tissue
Villous tissue single-cell suspensions were quickly thawed and stained according to manufacturer’s instructions with five fluorescently labeled antibodies (CD9-FITC, CD45-APC, HLA-A,B,C-PE/Cy7, CD31-BV421, and HLA-G-PE) as well as LIVE/DEAD Near-IR stain (Invitrogen) to isolate six viable populations of placental cells by FACS at the University of Michigan Flow Cytometry Core Facility. Initial flow cytometry experiments included fluorescence minus one, single-color compensation, and isotype controls. Isotype controls were found to be the most conservative and were consequently included in all sorting experiments, as well as single-color compensation controls due to the large number of colors used in sorting. The six populations of cells were Hofbauer cells, endothelial cells, fibroblasts, leukocytes, extravillous trophoblasts, and cytotrophoblasts. We developed a five-marker cell surface FACS scheme to sort cytotrophoblasts (HLA-A,B,C-), endothelial cells (CD31+), extravillous trophoblasts (HLA-G+), fibroblasts (CD9+), Hofbauer cells (CD9-), and leukocytes (CD45+/CD9+) from villous tissue (Supplementary Fig. 10)11,12,14,37,100,101,102,103,104,105,106. Syncytiotrophoblast fragments were enriched from villous tissue digests. We isolated cell type fractions and whole villous tissue from four healthy term, uncomplicated Cesarean sections, labeled Sorted 1 (same sample source as single-cell RNA-sequencing sample 1), Sorted 2, Sorted 3, and Sorted 4. We subjected 24 cell type fractions with sufficient RNA content to RNA-sequencing, including two cytotrophoblast, one endothelial, three extravillous trophoblast, three fibroblast, four Hofbauer cell, four leukocyte, and two syncytiotrophoblast fractions, and five whole tissue samples (Supplementary Table 2).
Detailed antibody information: FITC, marker CD9: Mouse IgG1-kappa, clone HI9a (2.5 μg/mL), BioLegend #312103, lot B188319, BioLegend #312104, lot B232916; isotype control: clone MOPC-21 BioLegend #400107, Lot B199152 (2.5 μg/mL). APC, marker CD45: Mouse IgG1-kappa, clone 2D1, BioLegend #368511, Lot B215062 (0.125 μg/mL); isotype control: clone MOPC-21, BioLegend #400121, lot B216780 (0.125 μg/mL). PE/CY-7, marker HLA-ABC: Mouse IgG2a-kappa, clone W6/32, BioLegend #311429, lot B188649, BioLegend #3111430, lot B238602 (0.44 μg/mL); isotype control: clone MOPC-173, BioLegend #400231, lot B209000 (0.44 μg/mL);. BV421, marker CD31: Mouse IgG1-kappa, clone WM59, BioLegend #303123, lot B204347, BioLegend #303124, lot B232010 (0.625 μg/mL); isotype control: clone MOPC-21, BioLegend #400157, lot B225357 (0.625 μg/mL). PE, marker HLA-G: Mouse IgG2a-kappa, clone 87G, BioLegend #335905, lot B222326, BioLegend #335906, lot B199294 (5 μg/mL); isotype control clone MOPC-173, BioLegend #400211, lot B227641 (5 μg/mL). Mouse IgG1-kappa, clone MEM-G/9, Abcam #24384 Lot GR3176304-1 (2.5 μg/mL); isotype control: monoclonal, Abcam #ab81200, lot GR267131-1 (2.5 μg/mL). Validation information available on the manufacturer’s website under the catalog ID for each antibody.
A cutoff of 0.1% events was used to set a series of gates. Cells were first gated on size and granularity (FSC-HxSSC-H) to eliminate debris, followed by doublet discrimination (FSC-HxFSC-W and SSC-HxSSC-W). Ax750 was used to sort on viability. Extravillous trophoblasts were isolated based on Human Leukocyte Antigen-G (HLA-G) expression (Supplementary Fig. 10a). Cytotrophoblasts are HLA-ABC negative (Supplementary Fig. 10b). HLA-ABC-positive cells were then subjected to a CD45/CD9 gate to isolate Hofbauer cells and a heterogeneous population of leukocytes (Supplementary Fig. 10c). Finally, CD45-/CD9- population is sorted into the endothelial or fibroblast bins based on CD31 expression (Supplementary Fig. 10d).
Bulk placental tissue and sorted placental cell type RNA extraction and sequencing
Approximately 2 mg of bulk RNALater-stabilized (Qiagen) bulk villous tissue was added to 350 μL 1% β-mercaptoethanol (Sigma-Aldrich) RLT Buffer Plus (Qiagen) to Lysing Matrix D vials (MP Biomedicals). Samples were disrupted and homogenized on the MP-24 FastPrep homogenizer (MP Biomedicals) at 6 m/s, setting MP24x2 for 35 s. For the homogenized bulk villous tissue, syncytiotrophoblast-enriched fraction, and sorted cell types, RNA extraction was completed according to the manufacturer’s instructions using the AllPrep DNA/RNA Mini Kit (Qiagen) and stored at −80 °C. RNA samples were submitted to the University of Michigan Advanced Genomics Core for RNA-sequencing. Ribosomal RNAs were depleted with RiboGone (Takara) and libraries were prepared with the SMARTer Stranded RNA-Seq v2 kit (Takara). Paired- or single-end 50 base pair reads were sequenced on the HiSeq platform (Illumina). Raw RNA reads were assessed for sequencing quality using “FastQC” v0.11.5107 and “MultiQC” v1.7108. Reads were aligned to the GRCh38.p12/ GENCODEv28 reference transcriptome using “STAR” v2.6.0c with default settings89. featureCounts from “subread” v1.6.1 was used to quantify and summarize gene expression with default settings109.
Sorted placental cell type differential expression analysis and comparison to single-cell results
For visualizations or analyses that required normalized gene counts, sorted cell type gene counts were library size normalized with the median ratio method using the counts() function (DESeq2 R package, version 1.32.0). As recommended50, we excluded genes that were not present in at least three samples and did not have an expression of 10 library size-normalized counts. To visualize broad cell type-specific gene expression patterns, we used “DESeq2’s” (R package, version 1.32.0) plotPCA() function with the regularized logarithm transformation, blinded to experimental design. Upregulated genes in each cell type were identified using the negative binomial linear model two-tailed Wald test in “DESeq2” (R package, version 1.32.0) adjusted for biological replicate using default settings with contrasts comparing the expression of a gene in one cell type against the average expression across all other cell types at a false discovery rate-adjusted p value less than 0.05 and a log2 FC cutoff of 1.250. Overexpressed genes were ranked by decreasing log-FC for functional enrichment analysis with “gprofiler2” (R package, version 0.2.0, database version e102_eg49_p15_7a9b4d6) using annotated genes as the universe, excluding electronically generated annotations, and with the default g:SCS multiple testing correction method applying significance threshold adjusted p value of 0.0598. To compare sorted and single-cell results, we tabulated unique overlapping differentially expressed genes and overrepresented pathways by cell type (Supplementary Table 3) Peripheral fetal and maternal immune cell types from the single-cell RNA-sequencing data were collapsed to one leukocyte category, cytotrophoblast subtypes to one cytotrophoblast category, and mesenchymal stem cells and fibroblasts to one fibroblast category for this comparison.
We used the CIBERSORTx Docker container (accessed December 7, 2021) to create a signature gene expression matrix for deconvolution from the counts of the single-cell RNA-sequencing data with the following default parameters: differential expression q value < 0.01, no minimum gene expression cutoff, and a 300 gene feature selection floor and a 500 gene feature selection ceiling99. We used the signature matrix to estimate constituent cell type proportions in the 4 whole tissue (with 1 additional technical replicate) and 19 sorted or enriched cell type fractions using CIBERSORTx with cross-platform S-mode batch correction and 50 permutations to evaluate imputation goodness-of-fit. We collapsed the high-resolution single-cell cell type cluster labels to the seven cell type fractions we targeted for comparison with sorted cell type results.
Application: bulk placenta gene expression dataset and CIBERSORTx deconvolution
Bulk placental tissue microarray gene expression (previously batch-corrected and normalized) from eight preeclampsia case–control studies was downloaded from the NCBI Gene Expression Omnibus (accession number GSE75010) for deconvolution33. We used the CIBERSORTx Docker container (accessed December 7, 2021) to create a signature gene expression matrix for deconvolution from the counts of the single-cell RNA-sequencing data with the following default parameters: differential expression q value < 0.01, no minimum gene expression cutoff, and a 300 gene feature selection floor and a 500 gene feature selection ceiling99. We used the signature matrix to estimate constituent cell type proportions in GSE75010 using CIBERSORTx with cross-platform S-mode batch correction and 50 permutations to evaluate imputation goodness-of-fit.
Application: preeclampsia case–control differential cell type abundance, differential gene expression statistical analysis, and mediation analysis
To test for differences in estimated cell type proportions between preeclampsia cases and controls, estimated cell type proportions for GSE75010 were regressed on preeclampsia case–control status using beta regression models adjusted for gestational age, sex, and study source110 (Supplementary Data 8). Statistical significance was assessed using the two-tailed Wald test applying a nominal significance threshold of 0.05. Cell types imputed at zero percent abundance across all samples were excluded. For modelling purposes, zero percent abundance estimates were transformed to \(\frac{1}{2}/n\) where n is the number of observations (n = 330).
Differential expression analysis was conducted in limma111 with default linear models adjusted for gestational age, fetal sex, and study source with empirical Bayes standard error moderated t-test statistics. A cell type-adjusted model was built on the base model adjusted for gestational age, fetal sex, and study source and additionally adjusted for the first five PCs of deconvoluted cell type proportions (Supplementary Data 9). Statistical significance was assessed at false discovery rate-adjusted q value < 0.05 and a log2 FC cutoff of 0.1. Differentially expressed genes were descending-ranked by value of the moderated test statistic for gene set enrichment analysis in desktop version GSEA 4.1.0 with the GSEAPreranked tool with default settings against the c5.go.bp.v7.5.1.symbols.gmt gene set database112,113 (Supplementary Data 10). PC analysis was performed with prcomp() from “stats-package” (R, version 4.0.5) without scaling and with default settings.
A unified mediation and interaction analysis114 was conducted in “CMAverse” (R package, version 0.1.0)115 via the g-formula approach116 to estimate causal randomized-intervention analogs of natural direct and indirect effects117 through direct counterfactual imputation. The model was operationalized with preeclampsia status as the binary exposure, log2 transformed gene expression intensity as the continuous outcome, and the first five PCs of deconvoluted cell type proportions as continuous mediators. Baseline covariates included fetal sex and study source. Continuous gestational age was included as a confounder of the mediator-outcome relationship affected by the exposure. Confidence intervals were bootstrapped with 1000 boots with otherwise default settings. Statistical tests were two-tailed and interpreted at a p value significance threshold of 0.05.
Statistics and reproducibility
Technical replication measured by average intra-cluster gene expression between technical replicates was tested via the two-tailed Spearman correlation test within Samples 1 and 2 assessed across all 32,738 common genes. The number of cells contributing expression data for each cell type is available in Table 1. Single-cell cluster marker genes were identified in “Seurat” with the FindAllMarkers function with default settings on single-cell gene expression counts92,95. Specifically, including cells from both maternal and fetal cell types, the expression level in each cell type cluster was compared against the average expression of that gene across all other cell types using the two-tailed Wilcoxon rank sum test with significance defined at a false discovery rate-adjusted p value less than 0.05 and a log2 FC cutoff of 0.25 (n = 40,494 cells). The final analytic sample included 40,494 cells and 36,601 genes across nine biological replicates, two of which had a technical replicate (Samples 1B and 2B) and another two included peripheral subsampling (Samples 8P and 9P). Pairwise cluster markers were identified in “Seurat” with the FindMarkers function with an identical testing regime (n = 6132 cells for proliferative vs. non-proliferative cytotrophoblasts). Overexpressed genes were ranked by decreasing log-FC for functional enrichment analysis with “gprofiler2” (R package, version 0.2.0, database version e102_eg49_p15_7a9b4d6) using annotated genes as the universe, excluding electronically generated annotations, and with the default g:SCS multiple testing correction method applying significance threshold adjusted p value of 0.0598. Overexpressed genes per cell type cluster are available in Supplementary Data 2 and ontology results in Supplementary Data 3. Overexpressed genes and related enrichment results comparing proliferative to non-proliferative cytotrophoblasts are available in Supplementary Data 1.
Upregulated genes in each cell type were identified using the negative binomial linear model two-tailed Wald test in “DESeq2” (R package, version 1.32.0) adjusted for biological replicate using default settings with contrasts comparing the expression of a gene in one cell type against the average expression across all other cell types at a false discovery rate-adjusted p value less than 0.05 and a log2 FC cutoff of 1.250 (n = 19 cell type fraction samples with breakdown by cell type available in Supplementary Table 2). Overexpressed genes were ranked by decreasing log-FC for functional enrichment analysis with “gprofiler2” (R package, version 0.2.0, database version e102_eg49_p15_7a9b4d6) using annotated genes as the universe, excluding electronically generated annotations, and with the default g:SCS multiple testing correction method applying significance threshold adjusted p value of 0.0598. Differentially expressed genes per cell type available in Supplementary Data 4 and number of differentially expressed genes are summarized in Supplementary Fig. 12. Ontology results are available in Supplementary Data 5.
Bulk placental tissue microarray gene expression (previously batch-corrected and normalized) from eight preeclampsia case–control studies was downloaded from the NCBI Gene Expression Omnibus (GSE75010) for deconvolution (n = 330)33. We used the CIBERSORTx Docker container (accessed December 7, 2021) to create a signature gene expression matrix for deconvolution from the counts of the single-cell RNA-sequencing data with the following default parameters: differential expression q value < 0.01, no minimum gene expression cutoff, and a 300 gene feature selection floor and a 500 gene feature selection ceiling99. We used the signature matrix to estimate constituent cell type proportions in GSE75010 using CIBERSORTx with cross-platform S-mode batch correction and 50 permutations to evaluate imputation goodness-of-fit.
To test for differences in estimated cell type proportions between preeclampsia cases and controls (n = 330), estimated cell type proportions for GSE75010 were regressed on preeclampsia case–control status using beta regression models (n = 25 cell type proportion outcomes) adjusted for gestational age, sex, and study source110. Cell types imputed at zero percent abundance across all samples were excluded (n = 2 excluded: fetal naïve CD4+ T cells and fetal GZMB+ natural killer cells). Statistical significance was assessed using the two-tailed Wald test applying a nominal significance threshold of 0.05.
Differential expression analysis was conducted in limma111 with default settings using linear models (n = 14,651 genes) adjusted for gestational age, fetal sex, and study source (n = 330). A cell type-adjusted model was built on the base model additionally adjusted for the first five PCs of deconvoluted cell type proportions. PC analysis was performed with prcomp from “stats-package” (R, version 4.0.5) without scaling and default settings. Statistical significance was assessed at false discovery rate-adjusted q value < 0.05 and a log2 FC cutoff of 0.1. Differentially expressed genes were descending-ranked by the value of the moderated test statistic for gene set enrichment analysis in desktop version GSEA 4.1.0 with the GSEAPreranked tool with default settings against the c5.go.bp.v7.5.1.symbols.gmt gene set database112,113.
A unified mediation and interaction analysis114 was conducted in “CMAverse” (R package, version 0.1.0)115 via the g-formula approach116 to estimate causal randomized-intervention analogs of natural direct and indirect effects117 through direct counterfactual imputation. The model (n = 330) was operationalized with preeclampsia status as the binary exposure, normalized log2 gene expression signal intensity as the outcome, and the first five PCs of deconvoluted cell type proportions as continuous mediators. Baseline covariates included fetal sex and categorical study source. Continuous gestational age was included as a confounder of the mediator-outcome relationship affected by the exposure. Confidence intervals were bootstrapped with 1000 boots with otherwise default settings. Statistical tests were two-tailed and interpreted at a p value significance threshold of 0.05.
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Data availability
Raw placental single-cell RNA-sequencing and raw placental bulk RNA-sequencing generated by this study are freely available in the Gene Expression Omnibus repository (accession number GSE182381). The cell type signature matrix and related files to deconvolute bulk gene expression measures are available through the Gene Expression Omnibus (accession number GSE182381) as supplementary material for download. This study uses data generated by The Chinese University of Hong Kong (CUHK) Circulating Nucleic Acids Research Group, as reported by Tsang et al. in Proc. Natl Acad. Sci. USA (doi: 10.1073/pnas.1710470114, accession number EGAS00001002449)16,88. The placental single-cell RNA-sequencing data that support the findings of this study can be accessed through the Database of Genotypes and Phenotypes (accession number phs001886.v1.p1)17,87 and the European Genome-Phenome Archive (accession number EGAS00001002449)16,88. The preeclampsia case–control microarray data that support the findings of this study are available in Gene Expression Omnibus repository (accession number GSE75010)33,118. Source data underlying Figs. 3–5 are presented in Supplementary Data 8–10, respectively.
Code availability
All scripts to perform preprocessing, analyses, and deconvolution are available119.
References
Butwick A. J., Druzin M. L., Shaw G. M. & Guo N. Evaluation of US state–level variation in hypertensive disorders of pregnancy. JAMA Netw. Open 3, e2018741. https://doi.org/10.1001/jamanetworkopen.2020.18741 (2020).
Barker, D. J. P. & Thornburg, K. L. Placental programming of chronic diseases, cancer and lifespan: a review. Placenta 34, 841–845 (2013).
Maltepe, E. & Fisher, S. J. Placenta: the forgotten organ. Annu. Rev. Cell Dev. Biol. 31, 523–552 (2015).
Gude, N. M., Roberts, C. T., Kalionis, B. & King, R. G. Growth and function of the normal human placenta. Thromb. Res. 114, 397–407 (2004).
Ilekis, J. V. et al. Placental origins of adverse pregnancy outcomes: potential molecular targets: an Executive Workshop Summary of the Eunice Kennedy Shriver National Institute of Child Health and Human Development. Am. J. Obstet. Gynecol. 215, S1–S46 (2016).
Castellucci, M. & Kaufmann P. Basic structure of the villous irees. In Pathology of the Human Placenta (eds Benirschke, K., Kaufmann, P. & Baergen, R.) 50–120 (Springer New York, New York, NY, 2006).
Lyall, F., Robson, S. C. & Bulmer, J. N. Spiral artery remodeling and trophoblast invasion in preeclampsia and fetal growth restriction. Hypertension 62, 1046–1054 (2013).
Naicker, T., Khedun, S. M., Moodley, J. & Pijnenborg, R. Quantitative analysis of trophoblast invasion in preeclampsia. Acta Obstet. Gynecol. Scand. 82, 722–729 (2003).
Kaufmann, P., Black, S. & Huppertz, B. Endovascular trophoblast invasion: implications for the pathogenesis of intrauterine growth retardation and preeclampsia. Biol. Reprod. 69, 1–7 (2003).
Yabe, S. et al. Comparison of syncytiotrophoblast generated from human embryonic stem cells and from term placentas. Proc. Natl Acad. Sci. USA 113, E2598–E2607 (2016).
Tang, Z. et al. Isolation of Hofbauer cells from human term placentas with high yield and purity. Am. J. Reprod. Immunol. 66, 336–348 (2011).
Li, L. & Schust, D. J. Isolation, purification and in vitro differentiation of cytotrophoblast cells from human term placenta. Reprod. Biol. Endocrinol. 13, 71 (2015).
Petroff, M. G. et al. Isolation and culture of term human trophoblast cells. Methods Mol. Med. 121, 203–217 (2006).
Hirano, T. et al. CD9 is expressed in extravillous trophoblasts in association with integrin α3 and integrin α5. Mol. Hum. Reprod. 5, 162–167 (1999).
Robin, C. et al. Human placenta is a potent hematopoietic niche containing hematopoietic stem and progenitor cells throughout development. Cell Stem Cell 5, 385–395 (2009).
Tsang, J. C. H. et al. Integrative single-cell and cell-free plasma RNA transcriptomics elucidates placental cellular dynamics. Proc. Natl Acad. Sci. USA. 114, E7786–E7795 (2017).
Pique-Regi, R. et al. Single cell transcriptional signatures of the human placenta in term and preterm parturition. eLife 8, e52004 (2019).
Ma, Y. et al. Cell type-specific expression and function of toll-like receptors 2 and 4 in human placenta: implications in fetal infection. Placenta 28, 1024–1031 (2007).
Pattillo, R. A. & Gey, G. O. The establishment of a cell line of human hormone-synthesizing trophoblastic cells in vitro. Cancer Res. 28, 1231–1236 (1968).
Graham, C. H. et al. Establishment and characterization of first trimester human trophoblast cells with extended lifespan. Exp. Cell Res. 206, 204–211 (1993).
Brew, O., Sullivan, M. H. F. & Woodman, A. Comparison of normal and pre-eclamptic placental gene expression: a systematic review with meta-analysis. PLoS One 11, e0161504 (2016).
Lekva, T. et al. Gene expression in term placentas is regulated more by spinal or epidural anesthesia than by late-onset preeclampsia or gestational diabetes mellitus. Sci. Rep. 6, 1–12 (2016).
Delahaye, F. et al. Genetic variants influence on the placenta regulatory landscape. PLoS Genet. 14, e1007785 (2018).
McHale, C. M., Zhang, L., Thomas, R. & Smith, M. T. Analysis of the transcriptome in molecular epidemiology studies. Environ. Mol. Mutagen. 54, 500–517 (2013).
Meissner, A. et al. Genome-scale DNA methylation maps of pluripotent and differentiated cells. Nature 454, 766–770 (2008).
Reinius, L. E. et al. Differential DNA methylation in purified human blood cells: implications for cell lineage and studies on disease susceptibility. PLoS One 7, e41361 (2012).
Holbrook, J. D. et al. Is cellular heterogeneity merely a confounder to be removed from epigenome-wide association studies. Epigenomics 9, 1143–1150 (2017).
Jaffe, A. E. & Irizarry, R. A. Accounting for cellular heterogeneity is critical in epigenome-wide association studies. Genome Biol. 15, R31 (2014).
Shen-Orr, S. S. & Gaujoux, R. Computational deconvolution: extracting cell type-specific information from heterogeneous samples. Curr. Opin. Immunol. 25, 571–578 (2013).
Teschendorff, A. E. & Zheng, S. C. Cell-type deconvolution in epigenome-wide association studies: a review and recommendations. Epigenomics 9, 757–768 (2017).
Hunt, G. J., Freytag, S., Bahlo, M. & Gagnon-Bartsch, J. A. dtangle: accurate and robust cell type deconvolution. Bioinformatics 35, 2093–2099 (2019).
Campbell, K. A., Colacino, J. A., Park, S. K. & Bakulski, K. M. Cell types in environmental epigenetic studies: biological and epidemiological frameworks. Curr. Environ. Health Rep. 7, 185–197 (2020).
Leavey, K. et al. Unsupervised placental gene expression profiling identifies clinically relevant subclasses of human preeclampsia. Hypertension 68, 137–147 (2016).
McInnes, L., Healy, J. & Melville, J. UMAP: Uniform Manifold Approximation and Projection for dimension reduction. Preprint at arXiv:180203426 [cs, stat] (2018).
Haghverdi, L., Lun, A. T. L., Morgan, M. D. & Marioni, J. C. Batch effects in single-cell RNA-sequencing data are corrected by matching mutual nearest neighbors. Nat. Biotechnol. 36, 421–427 (2018).
Bulmer, J. N., Morrison, L. & Johnson, P. M. Expression of the proliferation markers Ki67 and transferrin receptor by human trophoblast populations. J. Reprod. Immunol. 14, 291–302 (1988).
Gonen-Gross, T. et al. Inhibitory NK receptor recognition of HLA-G: regulation by contact residues and by cell specific expression at the fetal-maternal interface. PLoS One 5, e8941 (2010).
Zhou, G. Q. et al. Highly specific monoclonal antibody demonstrates that pregnancy-specific glycoprotein (PSG) is limited to syncytiotrophoblast in human early and term placenta. Placenta 18, 491–501 (1997).
Baboolal, T. G. et al. Intrinsic multipotential mesenchymal stromal cell activity in gelatinous Heberden’s nodes in osteoarthritis at clinical presentation. Arthritis Res. Ther. 16, R119 (2014).
Dejana, E. Endothelial cell–cell junctions: happy together. Nat. Rev. Mol. Cell Biol. 5, 261–270 (2004).
van Noesel, C. J. et al. The membrane IgM-associated heterodimer on human B cells is a newly defined B cell antigen that contains the protein product of the mb-1 gene. J. Immunol. 146, 3881–3888 (1991).
Shapiro-Shelef, M. & Calame, K. Regulation of plasma-cell development. Nat. Rev. Immunol. 5, 230–242 (2005).
Geissmann, F., Jung, S. & Littman, D. R. Blood monocytes consist of two principal subsets with distinct migratory properties. Immunity 19, 71–82 (2003).
Villani, A.-C. et al. Single-cell RNA-seq reveals new types of human blood dendritic cells, monocytes, and progenitors. Science 356, eaah4573 (2017).
Musumeci, A., Lutz, K., Winheim, E. & Krug, A. B. What makes a pDC: recent advances in understanding plasmacytoid DC development and heterogeneity. Front. Immunol. 10, 1222 (2019).
Zhao, Y. et al. Single-cell transcriptomic landscape of nucleated cells in umbilical cord blood. Gigascience 8, giz047 (2019).
Yang, C. et al. Heterogeneity of human bone marrow and blood natural killer cells defined by single-cell transcriptome. Nat. Commun. 10, 3931 (2019).
Santana, M. A. & Esquivel‐Guadarrama, F. Cell biology of T cell activation and differentiation. Int. Rev. Cytol. 250, 217–74 (2006).
Avila Cobos, F. et al. Benchmarking of cell type deconvolution pipelines for transcriptomics data. Nat. Commun. 11, 5650 (2020).
Love, M. I., Huber, W. & Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 15, 550 (2014).
Sitras, V. et al. Differential placental gene expression in severe preeclampsia. Placenta 30, 424–433 (2009).
Nishizawa, H. et al. Comparative gene expression profiling of placentas from patients with severe pre-eclampsia and unexplained fetal growth restriction. Reprod. Biol. Endocrinol. 9, 107 (2011).
Tsai, S. et al. Transcriptional profiling of human placentas from pregnancies complicated by preeclampsia reveals disregulation of sialic acid acetylesterase and immune signalling pathways. Placenta 32, 175–182 (2011).
Meng, T. et al. Identification of differential gene expression profiles in placentas from preeclamptic pregnancies versus normal pregnancies by DNA microarrays. OMICS 16, 301–311 (2012).
Xiang, Y. et al. Up-regulated expression and aberrant DNA methylation of LEP and SH3PXD2A in pre-eclampsia. PLoS One 8, e59753 (2013).
Blair, J. D. et al. Widespread DNA hypomethylation at gene enhancer regions in placentas associated with early-onset pre-eclampsia. Mol. Hum. Reprod. 19, 697–708 (2013).
Nishizawa, H. et al. Microarray analysis of differentially expressed fetal genes in placental tissue derived from early and late onset severe pre-eclampsia. Placenta 28, 487–497 (2007).
Hedlund, E. & Deng, Q. Single-cell RNA sequencing: technical advancements and biological applications. Mol. Asp. Med. 59, 36–46 (2018).
Enquobahrie, D. A. et al. Differential placental gene expression in preeclampsia. Am. J. Obstet. Gynecol. 199, 566.e1–566.11 (2008).
Várkonyi, T. et al. Microarray profiling reveals that placental transcriptomes of early-onset HELLP syndrome and preeclampsia are similar. Placenta 32, S21–S29 (2011).
Vennou, K. E., Kontou, P. I., Braliou, G. G. & Bagos, P. G. Meta-analysis of gene expression profiles in preeclampsia. Pregnancy Hypertens. 19, 52–60 (2020).
Sitras, V., Fenton, C. & Acharya, G. Gene expression profile in cardiovascular disease and preeclampsia: a meta-analysis of the transcriptome based on raw data from human studies deposited in Gene Expression Omnibus. Placenta 36, 170–178 (2015).
Luttun, A. & Carmeliet, P. Soluble VEGF receptor Flt1: the elusive preeclampsia factor discovered? J. Clin. Invest. 111, 600–602 (2003).
Maynard, S. E. et al. Excess placental soluble fms-like tyrosine kinase 1 (sFlt1) may contribute to endothelial dysfunction, hypertension, and proteinuria in preeclampsia. J. Clin. Invest. 111, 649–658 (2003).
McGinnis, R. et al. Variants in the fetal genome near FLT1 are associated with risk of preeclampsia. Nat. Genet. 49, 1255–1260 (2017).
Denisenko, E. et al. Systematic assessment of tissue dissociation and storage biases in single-cell and single-nucleus RNA-seq workflows. Genome Biol. 21, 130 (2020).
Khan, T. et al. Single nucleus RNA sequence (snRNAseq) analysis of the spectrum of trophoblast lineages generated from human pluripotent stem cells in vitro. Front Cell Dev. Biol. 9, 695248 (2021).
Raymond, D. & Peterson, E. A critical review of early-onset and late-onset preeclampsia. Obstet. Gynecol. Surv. 66, 497–506 (2011).
Brosens, I. A., Robertson, W. B. & Dixon, H. G. The role of the spiral arteries in the pathogenesis of preeclampsia. Obstet. Gynecol. Annu 1, 177–191 (1972).
Meekins, J. W. et al. A study of placental bed spiral arteries and trophoblast invasion in normal and severe pre-eclamptic pregnancies. Br. J. Obstet. Gynaecol. 101, 669–674 (1994).
Suryawanshi, H. et al. Dynamic genome-wide gene expression and immune cell composition in the developing human placenta. J. Reprod. Immunol. 151, 103624 (2022).
Smith, A. N. et al. The role of mitochondrial dysfunction in preeclampsia: causative factor or collateral damage? Am. J. Hypertens. 34, 442–452 (2021).
Staff, A. C. The two-stage placental model of preeclampsia: an update. J. Reprod. Immunol. 134–135, 1–10 (2019).
Soleymanlou, N. et al. Molecular evidence of placental hypoxia in preeclampsia. J. Clin. Endocrinol. Metab. 90, 4299–4308 (2005).
Burton, G. J. & Yung, H.-W. Endoplasmic reticulum stress in the pathogenesis of early-onset pre-eclampsia. Pregnancy Hypertens. 1, 72–78 (2011).
Burton, G. J., Yung, H.-W., Cindrova-Davies, T. & Charnock-Jones, D. S. Placental endoplasmic reticulum stress and oxidative stress in the pathophysiology of unexplained intrauterine growth restriction and early onset preeclampsia. Placenta 30(Suppl A), S43–S48 (2009).
Shibuya, M. Vascular endothelial growth factor receptor-1 (VEGFR-1/Flt-1): a dual regulator for angiogenesis. Angiogenesis 9, 225–230 (2006).
Luttun, A., Tjwa, M. & Carmeliet, P. Placental growth factor (PlGF) and its receptor Flt-1 (VEGFR-1): novel therapeutic targets for angiogenic disorders. Ann. N. Y. Acad. Sci. 979, 80–93 (2002).
Caniggia, I. et al. Hypoxia-inducible factor-1 mediates the biological effects of oxygen on human trophoblast differentiation through TGFbeta(3). J. Clin. Invest. 105, 577–587 (2000).
Zhang, T. et al. Dissecting human trophoblast cell transcriptional heterogeneity in preeclampsia using single-cell RNA sequencing. Mol. Genet Genomic Med. 9, e1730. https://doi.org/10.1002/mgg3.1730 (2021).
Cheng, J.-C., Chang, H.-M. & Leung, P. C. K. Transforming growth factor-β1 inhibits trophoblast cell invasion by inducing snail-mediated down-regulation of vascular endothelial-cadherin protein. J. Biol. Chem. 288, 33181–33192 (2013).
Zhou, W. et al. Trophoblast cell subtypes and dysfunction in the placenta of individuals with preeclampsia revealed by single-cell RNA sequencing. Mol. Cells 45, 317–328 (2022).
Phillippe, M. Cell-free fetal DNA, telomeres, and the spontaneous onset of parturition. Reprod. Sci. 22, 1186–1201 (2015).
Sharp, A. N., Heazell, A. E. P., Crocker, I. P. & Mor, G. Placental apoptosis in health and disease. Am. J. Reprod. Immunol. 64, 159–169 (2010).
Bruckner, T. A. & Catalano, R. Selection in utero and population health: theory and typology of research. SSM Popul. Health 5, 101–113 (2018).
Whitcomb, B. W., Schisterman, E. F., Perkins, N. J. & Platt, R. W. Quantification of collider-stratification bias and the birthweight paradox. Paediatr. Perinat. Epidemiol. 23, 394–402 (2009).
Pique-Regi, R. et al. Single cell transcriptional signatures of the human placenta in term and preterm parturition. Database of Genotypes and Phenotypes phs001886.v1.p1. https://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs001886.v1.p1 (2019).
Tsang, J. C. H. et al. Integrative single-cell and cell-free plasma RNA transcriptomics elucidates placental cellular dynamics. European Genome-Phenome Archive EGAS00001002449. https://ega-archive.org/studies/EGAS00001002449 (2017).
Dobin, A. et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics 29, 15–21 (2013).
Zhang, F., Yan, Y. & Kang, H. M. popscle. https://github.com/statgen/popscle (2021).
Scater: pre-processing, quality control, normalization and visualization of single-cell RNA-seq data in R | Bioinformatics | Oxford Academic (accessed 28 Jan 2021); https://academic.oup.com/bioinformatics/article/33/8/1179/2907823.
Luecken, M. D. & Theis, F. J. Current best practices in single-cell RNA-seq analysis: a tutorial. Mol. Syst. Biol. 15, e8746 (2019).
Amezquita, R., Lun, A., Hicks, S. & Gottardo, R. Correcting batch effects | Multi-Sample Single-Cell Analyses with Bioconductor. In Orchestrating Single-Cell Analysis. Ch. 1 (2021).
Zappia, L. & Oshlack, A. Clustering trees: a visualization for evaluating clusterings at multiple resolutions. Gigascience 7, giy083 (2018).
Stuart, T. et al. Comprehensive integration of single-cell data. Cell 177, 1888–1902.e21 (2019).
Germain, P. et al. Doublet identification in single-cell sequencing data using scDblFinder [version 2; peer review: 2 approved]. F1000Research 10, 979 (2022).
Aran, D. et al. Reference-based analysis of lung single-cell sequencing reveals a transitional profibrotic macrophage. Nat. Immunol. 20, 163–172 (2019).
Raudvere, U. et al. g:Profiler: a web server for functional enrichment analysis and conversions of gene lists (2019 update). Nucleic Acids Res. 47, W191–W198 (2019).
Newman, A. M. et al. Determining cell type abundance and expression from bulk tissues with digital cytometry. Nat. Biotechnol. 37, 773–782 (2019).
Schmon, B., Hartmann, M., Jones, C. J. & Desoye, G. Insulin and glucose do not affect the glycogen content in isolated and cultured trophoblast cells of human term placenta. J. Clin. Endocrinol. Metab. 73, 888–893 (1991).
Boyd, A. W. Human leukocyte antigens: an update on structure, function and nomenclature. Pathology 19, 329–337 (1987).
Blaschitz, A., Weiss, U., Dohr, G. & Desoye, G. Antibody reaction patterns in first trimester placenta: implications for trophoblast isolation and purity screening. Placenta 21, 733–741 (2000).
Kaplan, A. et al. Group B streptococcus induces trophoblast death. Microb. Pathog. 45, 231–235 (2008).
Zozzaro-Smith, P. E. et al. Whole mount immunofluorescence analysis of placentas from normotensive versus preeclamptic pregnancies. Placenta 36, 1310–1317 (2015).
Coukos, G. et al. Platelet-endothelial cell adhesion molecule-1 is expressed by a subpopulation of human trophoblasts: a possible mechanism for trophoblast-endothelial interaction during haemochorial placentation. Mol. Hum. Reprod. 4, 357–367 (1998).
Cervar‐Zivkovic, M. & Stern C. Trophoblast isolation and culture. In The Placenta. (eds Kay, H. H., Nelson, D. M. & Wang, Y.) 153–162 (John Wiley & Sons, Ltd, 2011).
Andrews, S. FastQC: a quality control tool for high throughput sequence data (2010).
Ewels, P., Magnusson, M., Lundin, S. & Käller, M. MultiQC: summarize analysis results for multiple tools and samples in a single report. Bioinformatics 32, 3047–3048 (2016).
Liao, Y., Smyth, G. K. & Shi, W. featureCounts: an efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics 30, 923–930 (2014).
Grün, B., Kosmidis, I. & Zeileis, A. Extended beta regression in R: shaken, stirred, mixed, and partitioned. J. Stat. Softw. 48, 1–25 (2012).
Smyth, G. K. limma: linear models for microarray data. In Bioinformatics and Computational Biology Solutions Using R and Bioconductor (eds Gentleman, R. et al.) 397–420 (Springer New York, New York, NY, 2005).
Subramanian, A. et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc. Nalt Acad. Sci. USA. 102, 15545–15550 (2005).
Mootha, V. K. et al. PGC-1α-responsive genes involved in oxidative phosphorylation are coordinately downregulated in human diabetes. Nat. Genet. 34, 267–273 (2003).
VanderWeele, T. J. A unification of mediation and interaction: a four-way decomposition. Epidemiology 25, 749–761 (2014).
Shi, B. et al. CMAverse: a suite of functions for reproducible causal mediation analyses. Epidemiology 32, e20–e22 (2021).
Robins, J. A new approach to causal inference in mortality studies with a sustained exposure period—application to control of the healthy worker survivor effect. Math. Model. 7, 1393–1512 (1986).
VanderWeele, T. J., Vansteelandt, S. & Robins, J. M. Effect decomposition in the presence of an exposure-induced mediator-outcome confounder. Epidemiology 25, 300–306 (2014).
Leavey, K. et al. Unsupervised placental gene expression profiling identifies clinically relevant subclasses of human preeclampsia. Gene Expression Omnibus GSE75010 https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE75010 (2016).
Campbell, K. A. et al. bakulskilab/Placental-cell-type-deconvolution-reveals-that-cell-proportions-drive-preeclampsia-gene-expression: Placenta RNA deconvolution. https://zenodo.org/badge/latestdoi/599836831.
Acknowledgements
This research was supported by the Michigan Lifestage Environmental Exposures and Disease center (P30 ES017885), Michigan State University’s Environmental Influences on Child Health Outcomes program (UG3 OD023285, UH3 OD023285), the University of Michigan M-cubed pilot grant program, the Puerto Rico Testsite for Exploring Contamination Threats (P42 ES017198), and the NIH R35 RIVER award (ES031686). K.A.C. was supported by the National Institutes of Health (T32 HG00040). J.A.C. was supported by the Ravitz Family Foundation, the Forbes Institute for Cancer Discovery at the University of Michigan Rogel Cancer Center, and the National Institutes of Health (R01 ES028802, U01 ES026697, and P30 CA046592). M.P. was supported by the National Institutes of Health (T32 ES007062). E.R.E was supported by the National Institutes of Health (T32 DK071212). K.M.B. was supported by research grants from the National Institute of Environmental Health Sciences (R01 ES025531; R01 ES025574). We acknowledge the submitting investigators of previously published placental single-cell RNA-sequencing data used in this study. This study uses data generated by The Chinese University of Hong Kong (CUHK) Circulating Nucleic Acids Research Group, as reported by Tsang et al. in Proc. Natl Acad. Sci. USA (doi: 10.1073/pnas.1710470114, accession number EGAS00001002449). We also acknowledge the submitting investigators of Database of Genotypes and Phenotypes (accession number phs001886.v1.p1), whose work was sponsored in part by the Perinatology Research Branch, Division of Obstetrics and Maternal-Fetal Medicine, Division of Intramural Research, Eunice Kennedy Shriver National Institute of Child Health and Human Development, National Institutes of Health, U.S. Department of Health and Human Services.
Author information
Authors and Affiliations
Contributions
K.A.C.: conceptualization, methodology, software, validation, formal analysis, investigation, data curation, writing—original draft, visualization. J.A.C.: conceptualization, methodology, software, validation, investigation, resources, writing—review and editing, visualization, supervision, project administration, funding acquisition. M.P.: investigation, writing—review and editing. J.F.D.: software, validation, data curation. E.R.E.: writing—review and editing. S.S.H.: methodology, writing—review and editing, funding acquisition. S.E.D.: methodology, writing—review and editing. D.C.D.: writing—review and editing, funding acquisition. J.M.G.: writing—review and editing, funding acquisition. R.L.-C.: conceptualization, methodology, resources, writing—review and editing, supervision, project administration, funding acquisition. V.P.: conceptualization, resources, writing—review and editing, project administration, resources, funding acquisition. K.M.B.: conceptualization, methodology, software, validation, formal analysis, investigation, resources, data curation, writing—original draft, visualization, supervision, project administration, funding acquisition.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Communications Biology thanks the anonymous reviewers for their contribution to the peer review of this work. Primary Handling Editors: Debarka Sengupta and George Inglis. Peer reviewer reports are available.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Campbell, K.A., Colacino, J.A., Puttabyatappa, M. et al. Placental cell type deconvolution reveals that cell proportions drive preeclampsia gene expression differences. Commun Biol 6, 264 (2023). https://doi.org/10.1038/s42003-023-04623-6
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s42003-023-04623-6
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.