Exome sequencing in pooled DNA samples to identify maternal pre-eclampsia risk variants

Kaartokallio, Tea; Wang, Jingwen; Heinonen, Seppo; Kajantie, Eero; Kivinen, Katja; Pouta, Anneli; Gerdhem, Paul; Jiao, Hong; Kere, Juha; Laivuori, Hannele

doi:10.1038/srep29085

Download PDF

Article
Open access
Published: 07 July 2016

Exome sequencing in pooled DNA samples to identify maternal pre-eclampsia risk variants

Tea Kaartokallio¹^na1,
Jingwen Wang²^na1,
Seppo Heinonen³,
Eero Kajantie^4,5,6,
Katja Kivinen⁷,
Anneli Pouta^6,8,
Paul Gerdhem^9,10,
Hong Jiao²,
Juha Kere^2,11,12 &
…
Hannele Laivuori^1,3,13

Scientific Reports volume 6, Article number: 29085 (2016) Cite this article

2286 Accesses
14 Citations
2 Altmetric
Metrics details

Subjects

Abstract

Pre-eclampsia is a common pregnancy disorder that is a major cause for maternal and perinatal mortality and morbidity. Variants predisposing to pre-eclampsia might be under negative evolutionary selection that is likely to keep their population frequencies low. We exome sequenced samples from a hundred Finnish pre-eclamptic women in pools of ten to screen for low-frequency, large-effect risk variants for pre-eclampsia. After filtering and additional genotyping steps, we selected 28 low-frequency missense, nonsense and splice site variants that were enriched in the pre-eclampsia pools compared to reference data, and genotyped the variants in 1353 pre-eclamptic and 699 non-pre-eclamptic women to test the association of them with pre-eclampsia and quantitative traits relevant for the disease. Genotypes from the SISu project (n = 6118 exome sequenced Finnish samples) were included in the binary trait association analysis as a population reference to increase statistical power. In these analyses, none of the variants tested reached genome-wide significance. In conclusion, the genetic risk for pre-eclampsia is likely complex even in a population isolate like Finland, and larger sample sizes will be necessary to detect risk variants.

Analysis of HLA-G long-read genomic sequences in mother–offspring pairs with preeclampsia

Article Open access 18 November 2020

Genetic profiling of Vietnamese population from large-scale genomic analysis of non-invasive prenatal testing data

Article Open access 05 November 2020

Polygenic prediction of preeclampsia and gestational hypertension

Article 29 May 2023

Introduction

Pre-eclampsia is a common and complex vascular pregnancy disorder. It is characterized by hypertension and proteinuria, and often involves impaired placental development^1,2. The disease is a major cause for both maternal and perinatal mortality and morbidity³, and predicts increased risk of chronic cardiometabolic diseases later in life^4,5.

Genetic factors contribute to pre-eclampsia susceptibility. Heritability estimates for pre-eclampsia range between 0.54 and 0.68, consisting of both maternal and fetal contribution^6,7. Risk for pre-eclampsia is elevated after first pre-eclamptic pregnancy⁸, and in women with an affected first-degree relative⁹. Genome-wide linkage studies in pre-eclampsia families have revealed several susceptibility loci for the disease^{10,11,12,13,14}. Also, numerous candidate gene studies (summarized in meta-analyses^15,16,17), mostly assessing the effect of maternal genotype, and two relatively modestly sized genome-wide association studies on maternal pre-eclampsia risk have been published^18,19. Despite these attempts to discover genetic risk variants for pre-eclampsia, no robustly replicated candidate genes have yet been identified.

Because pre-eclampsia is a major cause for both maternal and fetal mortality, preterm birth and fetal growth restriction^3,20,21, it can be assumed to reduce reproductive success. Therefore, negative evolutionary selection likely maintains population frequencies of pre-eclampsia risk variants low. Following this reasoning, we focus on screening for low-frequency (minor allele frequency (MAF) 1–5%) maternal variants with large or moderate effect on the risk of pre-eclampsia.

The population history of Finland is characterized by strong founder effect, periods of rapid population growth, and internal migrations and establishment of population isolates as late as in the 16^th century (summarized in refs 22,23). The Finnish population has therefore gone through multiple relatively recent bottlenecks, and evolutionary selection has not eliminated deleterious variants as effectively as in older and more outbred populations. This has led to enrichment of low-frequency loss-of-function variants²⁴. Number of samples required to reach adequate statistical power to detect associations with some pathogenic variants might therefore be reduced in the Finnish population²⁵, making it an ideal choice for studies concentrating on low-frequency variants.

Although the cost of next-generation sequencing has reduced considerably over the past years, costs for studies utilizing these methods still remain substantially high. Pooling DNA samples together can significantly decrease the cost of genome-wide sequencing, especially in the studies targeting rare or low-frequency variants²⁶. Here we exploited this idea by pooling DNA from a hundred pre-eclamptic women in pools of ten, and by exome sequencing the pools to screen for pre-eclampsia risk variants.

Samples and Methods

Study participants

The sample set utilised in the exome sequencing included a hundred Finnish pre-eclamptic women, pooled in pools of ten. Ninety of the study participants were selected from the Finnish Genetics of Pre-eclampsia Consortium (FINNPEC) case-control cohort that was collected from five Finnish university hospitals during 2008 to 2011²⁷. Ten participants belong to separate families chosen from the pre-eclampsia family cohort recruited from the Kainuu and Helsinki regions¹⁰. All the FINNPEC cases included in the exome sequencing had severe pre-eclampsia. Two of the pools contained women with early-onset disease, three pools women with high proteinuria, one pool women with previous miscarriages, and one pool women with recurrent pre-eclampsia (Supplementary Table S1). Women with severe pre-eclampsia were prioritised in the sample selection as they are more likely to possess genetic risk factors for the disease. Clinical definitions as well as the sample collection and DNA extraction methods are described in Supplementary File 1.

The first individually Sequenom genotyped sample consisted of 180 pre-eclamptic and 180 healthy pregnant women, including the 100 pre-eclamptic cases originally exome sequenced, and additional study subjects from the FINNPEC cohort. The exclusion criteria for the controls were any pregnancy complication in the current pregnancy, pre-eclampsia in any pregnancy, chronic hypertension and pregestational diabetes mellitus.

The second individually genotyped sample set included 1353 pre-eclamptic and 699 non-pre-eclamptic women. These also include all the samples from the first Sequenom genotyping, all of which except the family samples were re-genotyped in the second run. All the women, except for the ten family cohort members, were selected from the FINNPEC cohort. The inclusion criterion for the cases was pre-eclampsia, and for the controls non-pre-eclamptic pregnancy. The exclusion criteria for the controls were pre-eclampsia in any pregnancy, small-for-gestational age (SGA) infant or placental insufficiency, gestational hypertension, chronic hypertension and placental ablation.

All study participants have provided a written informed consent. The study protocols have been approved by the Coordinating Ethics Committee of the Hospital District of Helsinki and Uusimaa, and the methods were carried out in accordance with the approved guidelines.

Variant data from a hundred Swedish scoliosis cases²⁸ that were exome sequenced with a pooled strategy identical to ours were utilised in the exome sequencing variant filtering to omit false positive and common variants.

The Sequencing Initiative Suomi (SISu 3.0) data set was utilised in the Sequenom phase 2 association analysis as population specific reference data to increase the statistical power. The SISu database is available online ( http://www.sisuproject.fi/) and at the time of the analysis contained exome sequence data from over 6100 Finns. The data set was utilized as a reference unselected for the pre-eclampsia phenotype as this information was not available for the sequenced individuals.

A summary of the study participants is shown in Table 1.

Table 1 Summary of the study participants utilised in the exome sequencing and in the association analysis.

Full size table

Exome sequencing

Details of the exome sequencing are described in Supplementary File 1. The initial quality control of the sequencing data was performed by the sequencing service provider (Science for Life Laboratory, Stockholm, Sweden). The paired-end reads were aligned to the human reference genome hg19 with the Burrows-Wheeler Aligner (BWA) software package, version 0.6.1²⁹. SAMtools version 0.1.18³⁰ was used to remove PCR duplicates in each pool, to filter out multiply mapped reads (mapping quality score <20) and to call variants. The uniquely mapped reads were used as input for variant (SNV) calling for each pool with the SAMtools default setting, and the variants identified were merged together. Annotation was completed using the ANNOVAR software³¹ with dbSNP version 137 and the 1000 Genomes 2012 APR³². BEDtools, version 2.16.2³³ was applied for evaluating read depth and coverage.

The following filtering criteria were applied to the variants called. The missense, nonsense and splice site variants that were present in at least two pre-eclampsia pools were considered further. Insertions and deletions were excluded from the analysis, because they could not be called reliably. Exome sequencing data from Swedish scoliosis patients were produced with a pooling strategy identical to ours and were utilised in the filtering as a technical control to exclude false positive and common variants. The variants present in over five scoliosis pools were excluded. The variants present in less than two scoliosis pools were filtered for European allele frequency in the 1000 Genomes data, and only the variants with MAF ≤0.05 were included. MAF of each variant in the exome-sequenced samples was estimated by calculating the proportion of reads carrying minor allele in the total amount of reads covering the position. The MAF estimation is based on an assumption that each sample is equally represented within a pool. The MAF estimates in the pre-eclampsia pools were compared to the MAFs in the reference data sets (SISu 2014, the 1000 Genomes European data APR2012 and the scoliosis exome sequencing data). By using three reference data sets we were able to obtain comprehensive picture of the variant frequencies in the Finnish and European populations, and also to estimate relevance of those variants that were absent in the population-specific reference data SISu. The variants with pre-eclampsia_MAF/reference_MAF ratio ≥1.5 in all the available comparisons were selected. In addition, five variants that were close to the cut-off or had high ratio in one or two of the comparisons were included. The following procedures were also utilised. Variation located in the genes listed in the papers by Fuentes Fajardo et al.³⁴ and Ju et al.³⁵ were excluded. These papers have listed genes that contain excess amount of variation, likely due to assembly errors in the reference sequence or alignment errors e.g. in highly polymorphic genomic regions. The variants flagged “suspected” in the dbSNP database were excluded. For the variants with no MAF information available in our default reference data sets, we utilised data from other sources (the 1000 Genomes ALL, CSAgilent), and excluded the variants with MAF higher than or similar to the MAF in our data. In addition to the main filtering strategy, we also focused on the linkage peak regions identified in the previous studies by our group^10,36, and selected the linkage peak region variants that were present in the pool containing pre-eclampsia family samples, had the 1000 Genomes EUR MAF ≤0.05, and pre-eclampsia_MAF/reference_MAF≥1.5. We also looked for candidate variants among the SNPs whose reference allele was the minor allele. For this filtering strategy we selected the missense, nonsense and splice site variants with the 1000 Genomes European reference allele frequency ≤0.05, and pre-eclampsia_refAllele/SISu and pre-eclampsia_refAllele/scoliosis_refAllele ≥1.5, and selected the variants whose reference allele was present in ≤2 scoliosis pools and in over two pre-eclampsia pools.

Sequenom genotyping

Two rounds of Sequenom genotyping were performed. The purpose of the first round was to verify the presence and enrichment of the selected variants individually in the original exome-sequenced samples, whereas the second round was conducted to test the association of the variants with pre-eclampsia.

Of the 59 variants selected based on the exome sequencing, 46 were directly fitted to three Sequenom iplexes. In addition, two of the variants were captured through tagging SNPs. Rs117741116 tagged rs139702277 with r² = 1 and D′ = 1 in the 1000 Genomes FIN and CEU populations, and rs12462506 tagged rs3745601 with r² = 0.96 and D′ = 1 in the 1000 Genomes FIN population. The rest of the SNPs were left out, because of issues in assay design. In total, 48 variants were genotyped. The first, the second and the third iplexes contained 24, 17 and 7 variants, respectively. For the second genotyping run, 28 variants were selected for genotyping in two iplexes. In addition, rs6025 in F5 was included to the panel to investigate if the previously found association between the variant and pre-eclampsia^15,16,17 would be replicated in our data. The assay design and the genotyping were performed with Sequenom MassArray system at the Institute for Molecular Medicine Finland FIMM Technology Centre, University of Helsinki. The Technology Centre performed routine quality control steps to ensure high quality of the genotyping.

Association analysis

Association of the variants with pre-eclampsia was evaluated by chi-square test in the PLINK software v1.07³⁷. Genotypes from the Finnish SISu (3.0) data set containing 6118 individuals were combined with our control data to increase the statistical power. Association test was conducted both in the FINNPEC case-control data set and in the merged FINNPEC – SISu data. Two X chromosomal variants were excluded from the latter analysis, because the SISu data contain also males. The allele frequencies of autosomal variants are not assumed to differ between females and males. Any Sequenom genotyped sample with failed genotyping for >2 variants that had been otherwise successfully genotyped was removed from the analysis. Hardy-Weinberg equilibrium (HWE) was calculated independently for the Sequenom genotyped case-control samples, for the SISu data and for the merged FINNPEC-SISu data. Differential genotype missingness between the cases and controls was tested in the FINNPEC cohort.

A standard linear regression model in the PLINK software was applied for testing association of the genetic variants with quantitative traits relevant for pre-eclampsia in the FINNPEC cohort. The quantitative traits utilized in the association testing included the highest systolic blood pressure, the highest diastolic blood pressure, proteinuria and relative birth weight of the baby. An additive model was applied in the analyses.

Statistical power was calculated with Genetic Power Calculator using the “case-control for discrete traits” module³⁸ ( http://pngu.mgh.harvard.edu/~purcell/gpc/cc2.html). With a pre-eclampsia prevalence of 0.05 and a risk allele frequency of 0.05, our cohort of 1353 pre-eclamptics and ~6800 controls (the non-pre-eclamptic FINNPEC controls and the SISu (3.0) data unselected for the pre-eclampsia phenotype combined), was estimated to be sufficient to detect an effect size of 1.65 for risk heterozygote and 3 for risk homozygote with power of 0.80 when α < 5 × 10⁻⁸. Under the aforementioned parameters, for the variants with a risk allele frequency of 0.01, we could detect effect sizes of 2.52 and 5 for the risk heterozygote and homozygote, respectively.

Results

Clinical characteristics

Clinical characteristics of the study participants are presented in Table 2. The pre-eclamptic women in both Sequenom sample sets delivered on average earlier and had babies with lower absolute and relative birth weight than the control women. In the second Sequenom genotyping sample set the number of primiparous women was larger and BMI higher among the pre-eclamptic women, and larger percentage of them was affected by pregestational or gestational diabetes.

Table 2 Clinical characteristics of the study participants.

Full size table

Exome sequencing data

The sequencing provider delivered a total number of 299 to 482 million reads in each sample pool. Over 98% of the reads could be mapped to the reference genome hg19. Per pool on average, 90% and 80% of the SureSelect target regions were covered by a depth of at least 30x and 60x, respectively, and the average depth of the target regions per pool was between 95x and 277x, the average of all pools being 220x. Eight of the ten pools had average coverage over 200x in the enrichment regions. The total number of variants called was 2,308,376. The main filtering strategy applied to these variants is illustrated in Fig. 1 and explained in detail in the Samples and Methods section. After excluding the variants with mapping quality <20 or depth <10x, there were 259,919 variants left, of which 31,579 were located in protein-coding regions. By applying the filtering steps, we identified 59 candidate variants that seemed to be enriched in the exome sequenced pre-eclamptic women (Supplementary Table S2).

**Figure 1: The main filtering strategy for the variants identified in the exome sequencing.**

The first round of Sequenom genotyping in the FINNPEC samples

The first round of Sequenom genotyping was carried out in a sample set of 180 cases and 180 controls including the 100 exome sequenced samples. The purpose of this step was to verify the presence of the variants in the original samples and the MAF difference between pre-eclamptic women and general population. All assays had a success rate >95%. Of the 48 variants that were genotyped, three were monomorphic. After excluding these variants, correlation coefficient between the MAF estimates from the exome sequencing and the MAFs obtained from the Sequenom genotyping for the exome sequenced samples was 0.94, showing that we were able to estimate MAFs in the exome sequencing fairly accurately. Four of the variants were not in HWE in the controls. Twenty-eight variants that had OR ≤0.65 or ≥1.3 were selected for the second round of Sequenom genotyping in a larger case-control cohort. The results of the first Sequenom round are shown in Supplementary Table S3.

The second round of Sequenom genotyping in the FINNPEC samples

In the second round of genotyping all the variants except failed rs6681 were successfully genotyped. After excluding 17 cases and 5 controls with more than 2 failed genotypes, the genotyping rate for the variants in the remaining individuals was over 96%. Concordance rate of the genotypes in the 350 individuals included in both Sequenom runs was 0.9992. Three individuals with at least one discordant genotype between the runs were excluded. Association of the variants with pre-eclampsia was first tested in the Sequenom genotyped FINNPEC case-control sample set. Three variants deviated from HWE in the FINNPEC controls and were omitted. In the association analysis, one of the genotyped variants was nominally associated with pre-eclampsia (rs79744308 in NRTN; p-value = 0.0314, OR (95% CI) = 0.69 (0.48–0.97)), but the variant did not pass differential missingness test between the cases and controls. None of the association tests reached genome-wide significance. Full results are shown in Supplementary Table S4.

Association analysis in the combined FINNPEC-SISu data set

In order to increase statistical power, we merged the FINNPEC data with the Finnish SISu (3.0) data, and tested association of the genotyped variants with pre-eclampsia in this combined data set. Three of the variants deviated from HWE in the controls in these data, and were excluded from the analysis. Furthermore, the X chromosomal variants were excluded as the SISu data contain both males and females. In the association analysis in the merged data, we detected nominal association of four variants (Table 3). None of them, however, reached genome-wide significance. Full results from this analysis are presented in Supplementary Table S4. Comparison of MAFs between different sample subsets for the 28 variants genotyped in the second Sequenom genotyping is shown in Supplementary Table S5.

Table 3 The SNPs nominally associated with pre-eclampsia in the analysis in the combined FINNPEC and SISu data set.

Full size table

The variant rs6025, which is located in the F5 gene and has previously been connected to pre-eclampsia^15,16,17, was not associated with pre-eclampsia in our study in either of the analyses.

Quantitative traits association analysis

Assuming an additive model of genetic inheritance, we investigated the association between the genotyped SNPs and quantitative clinical characteristics of pre-eclampsia (Supplementary Table S6). Among the SNPs nominally associated with pre-eclampsia (Table 3), rs3803339/G allele in TP53BP1 showed nominal association (p-value = 0.026) with the highest systolic blood pressure, and rs61747120/T in ZFR2 with proteinuria (p-value = 0.039), both in the pre-eclamptic women. In addition to those two SNPs, there were several other variants associated with the highest systolic or diastolic blood pressure, relative birth weight of the baby or proteinuria in the pre-eclamptic patients, the non-pre-eclamptic controls or the whole case-control sample set (Supplementary Table S6). Rs2291516 in RGL3 was associated with both proteinuria (p-value = 0.001) and relative birth weight of the baby (p-value = 7.3 × 10⁻⁴). However, none of the SNPs were associated with the quantitative traits at the genome-wide significance level.

Discussion

In this first attempt to screen exome-wide for low-frequency, moderate or large-effect risk variants for pre-eclampsia in a Finnish founder population, we did not find any genome-wide significantly associated risk variants. Whereas many complex diseases such as type 2 diabetes or cardiovascular diseases often have onset at midlife or later, pre-eclampsia affects women at their reproductive years, or if the phenotype of offspring is considered, already during the fetal period. Consequently, the disease decreases reproductive success, and there is a reason to hypothesise that pre-eclampsia risk variants are under negative evolutionary selection, which would keep their population frequencies low. A couple of GWA studies have assessed the role of common variation in pre-eclampsia susceptibility^18,19, but low-frequency variation has not previously been studied exome- or genome-wide. A founder population such as Finland, which has gone through relatively recent bottlenecks and is enriched for low-frequency loss-of-function mutations²⁴, should be an ideal study population for this screening.

The DNA samples were exome sequenced in pools of ten in order to maximise the number of sequenced samples while maintaining the sequencing costs low. With this strategy we were able to perform cost-effective exome-wide screen for pre-eclampsia risk variants, but the approach also has several limitations, such as missing information on individual genotypes and on exact MAF of each variant in the study sample. We were able to estimate MAFs from the pooled exome-sequenced samples fairly accurately: correlation coefficient between our estimates and the 1000 Genomes European data was 0.972 when the sequencing depth was at least 30x per pool. We however acknowledge that variants with biased MAF estimates were prone to be selected for the Sequenom genotyping, whereas some truly enriched variants could have been missed. In the association analysis we utilized the large SISu data as a population specific reference to increase statistical power of the analysis. Both the FINNPEC data and the data sets contributing to the SISu project have been collected to represent the Finnish population. However, population substructure within Finland^39,40 could cause bias especially when studying low-frequency variants. For some of the variants allele frequencies differ between the FINNPEC and SISu controls indicating potential for subtle population stratification. We were unable to test the existence of population stratification due to the limited number of SNPs genotyped in our data. Another potential source of bias is that the data sets have been genotyped with different methods.

There are several possible explanations for not identifying pre-eclampsia risk variants in this study. Our screening did not cover non-coding regions, and also structural variation was omitted from the analysis. Furthermore, variants in low-coverage regions could have been missed. At the time of the exome sequencing data analysis robust variant callers with ploidy setting were not available. Utilizing a caller assuming a diploid genome might have caused us to miss some candidate variants. Some risk variants may have been lost due to choices made in the filtering. As shown previously, variant annotations are much dependent on a transcript set and annotation software utilized, and often there are several plausible annotations for a single variant⁴¹. Variants with incorrect or alternative annotations may have been unintentionally filtered out. Genetic risk for pre-eclampsia might be heterogeneous even in an isolated population, and a larger sample size might have been needed to find predisposing variants. As the sample size in the exome sequencing was modest, false negative results can occur. Furthermore, our study was underpowered to detect rare variants or variants with small effect sizes. Especially in the case of rare pre-eclampsia risk variants, gene-based testing methods might be needed to reveal disease association. The approach taken in this study was nevertheless justified. In another study we and co-workers used identical design to reveal genetic variants associated with morbid obesity, and identified a low-frequency variant that showed strong association with BMI⁴², a complex trait that is similarly to pre-eclampsia affected by multiple genetic and environmental factors.

Although we could not detect any robust risk variant for pre-eclampsia in this study, we mention here five missense variants nominally associated either with the disease phenotype (rs3803339 in TP53BP1, rs61747120 in ZFR2, rs113926353 in ANO9, and rs142394560 in TMTC1), or with proteinuria and relative birth weight of the baby in the pre-eclamptic cases (rs2291516 in RGL3). These variants or genes have not previously been linked to pre-eclampsia or any other pregnancy disorders. TP53BP1 encodes a protein involved in DNA damage response and cell cycle regulation⁴³. Of interest, variants in TP53BP1 have shown a suggestive association with blood pressure⁴⁴. ZFR2 is a zinc finger RNA binding protein with an unknown function. ANO9 belongs to a family of calcium-dependent chloride channels, and might suppress baseline chloride conductance^45,46, and TMTC1 is an endoplasmic reticulum protein involved in calcium homeostasis⁴⁷. RGL3, which interacts with Rap-family G-proteins⁴⁸, has been shown to affect cell growth and morphology^48,49. TP53BP1, TMTC1 and RGL3⁴⁸ are expressed in a wide range of tissues, ANO9 most abundantly in skin and digestive tract and ZFR2 in adrenal gland, cerebral cortex and testis ( http://www.proteinatlas.org/). A larger sample size would have been needed to state anything conclusive about the role of the variants in these genes in the risk of pre-eclampsia.

Rs6025 located in the F5 gene has been associated with pre-eclampsia in tens of candidate gene studies (summarized in meta-analyses^15,16,17), and was therefore included as an additional variant in the Sequenom panel. F5 encodes the Factor V protein, a central component in the coagulation pathway. The amino acid change produced by rs6025 prevents inactivation of Factor V, which increases tendency to thrombosis⁵⁰. To the best of our knowledge this is one of the largest original studies to investigate connection between rs6025 and pre-eclampsia. In contrast to many previous studies, our study does not provide support for the association of this variant with pre-eclampsia.

Along with two published GWA studies, this study is one of the firsts to screen for pre-eclampsia risk variants exome- or genome-wide. In future studies, a hypothesis-free design with larger sample sizes is required, as shown for many other complex phenotypes. One of the features of pre-eclampsia is the involvement of two individuals: a mother and a child. Therefore, the genetic information of children should be included in studies on pre-eclampsia, and interaction between fetal and maternal genotypes should be studied more comprehensively. The multinational InterPregGen consortium has addressed the need for larger sample sizes and for studying maternal and fetal genotypic interaction by GWAS genotyping 7600 pre-eclamptic mothers, 4000 pre-eclamptic infants and 46000 control women with the aim of identifying genetic risk factors for pre-eclampsia⁵¹. The FINNPEC cohort is involved as a replication cohort in this biggest effort in genetics of pre-eclampsia to date. In parallel with case-control studies on sporadic pre-eclampsia, studies in pre-eclampsia families may help to identify genes and pathways involved in the disease susceptibility. We have taken this approach and are currently screening for risk variants in the Finnish pre-eclampsia families with next-generation sequencing methods.

In this first exome-wide screening for pre-eclampsia risk variants we did not find any variant that would have been robustly associated with the disease phenotype. We conclude that even in a population isolate like Finland, the genetic risk for pre-eclampsia is likely complex and heterogeneous, and genome-wide approaches with larger sample sizes will be necessary to detect risk factors.

Additional Information

How to cite this article: Kaartokallio, T. et al. Exome sequencing in pooled DNA samples to identify maternal pre-eclampsia risk variants. Sci. Rep. 6, 29085; doi: 10.1038/srep29085 (2016).

References

Brosens, I. A., Robertson, W. B. & Dixon, H. G. The role of the spiral arteries in the pathogenesis of preeclampsia. Obstet. Gynecol. Annu. 1, 177–191 (1972).
CAS PubMed Google Scholar
Salafia, C. M., Pezzullo, J. C., Ghidini, A., Lopez-Zeno, J. A. & Whittington, S. S. Clinical correlations of patterns of placental pathology in preterm pre-eclampsia. Placenta 19, 67–72 (1998).
Article CAS Google Scholar
World Health Organization. World Health Report 2005: Make Every Mother and Child Count. World Health Organization, Geneva, Switzerland (2005).
Bellamy, L., Casas, J. P., Hingorani, A. D. & Williams, D. J. Pre-eclampsia and risk of cardiovascular disease and cancer in later life: systematic review and meta-analysis. BMJ 335, 974 (2007).
Article Google Scholar
Lykke, J. A. et al. Hypertensive pregnancy disorders and subsequent cardiovascular morbidity and type 2 diabetes mellitus in the mother. Hypertension 53, 944–951 (2009).
Article CAS Google Scholar
Cnattingius, S., Reilly, M., Pawitan, Y. & Lichtenstein, P. Maternal and fetal genetic factors account for most of familial aggregation of preeclampsia: a population-based Swedish cohort study. Am. J. Med. Genet. A. 130A, 365–371 (2004).
Article Google Scholar
Salonen Ros, H., Lichtenstein, P., Lipworth, L. & Cnattingius, S. Genetic effects on the liability of developing pre-eclampsia and gestational hypertension. Am. J. Med. Genet. 91, 256–260 (2000).
Article CAS Google Scholar
Lie, R. T. et al. Fetal and maternal contributions to risk of pre-eclampsia: population based study. BMJ 316, 1343–1347 (1998).
Article CAS Google Scholar
Skjaerven, R. et al. Recurrence of pre-eclampsia across generations: exploring fetal and maternal genetic components in a population based cohort. BMJ 331, 877 (2005).
Article Google Scholar
Laivuori, H. et al. Susceptibility loci for preeclampsia on chromosomes 2p25 and 9p13 in Finnish families. Am. J. Hum. Genet. 72, 168–177 (2003).
Article CAS Google Scholar
Moses, E. K. et al. A genome scan in families from Australia and New Zealand confirms the presence of a maternal susceptibility locus for pre-eclampsia, on chromosome 2. Am. J. Hum. Genet. 67, 1581–1585 (2000).
Article CAS Google Scholar
Arngrimsson, R. et al. A genome-wide scan reveals a maternal susceptibility locus for pre-eclampsia on chromosome 2p13. Hum. Mol. Genet. 8, 1799–1805 (1999).
Article CAS Google Scholar
Lachmeijer, A. M. et al. A genome-wide scan for preeclampsia in the Netherlands. Eur. J. Hum. Genet. 9, 758–764 (2001).
Article CAS Google Scholar
Harrison, G. A. et al. A genomewide linkage study of preeclampsia/eclampsia reveals evidence for a candidate region on 4q. Am. J. Hum. Genet. 60, 1158–1167 (1997).
CAS PubMed PubMed Central Google Scholar
Buurma, A. J. et al. Genetic variants in pre-eclampsia: a meta-analysis. Hum. Reprod. Update 19, 289–303 (2013).
Article CAS Google Scholar
Staines-Urias, E. et al. Genetic association studies in pre-eclampsia: systematic meta-analyses and field synopsis. Int. J. Epidemiol. 41, 1764–1775 (2012).
Article Google Scholar
Fong, F. M. et al. Maternal genotype and severe preeclampsia: a HuGE review. Am. J. Epidemiol. 180, 335–345 (2014).
Article Google Scholar
Johnson, M. P. et al. Genome-wide association scan identifies a risk locus for preeclampsia on 2q14, near the inhibin, beta B gene. PLoS One 7, e33666 (2012).
Article CAS ADS Google Scholar
Zhao, L., Bracken, M. B. & DeWan, A. T. Genome-wide association study of pre-eclampsia detects novel maternal single nucleotide polymorphisms and copy-number variants in subsets of the Hyperglycemia and Adverse Pregnancy Outcome (HAPO) study cohort. Ann. Hum. Genet. 77, 277–287 (2013).
Article CAS Google Scholar
Roberts, C. L., Algert, C. S., Morris, J. M., Ford, J. B. & Henderson-Smart, D. J. Hypertensive disorders in pregnancy: a population-based study. Med. J. Aust. 182, 332–335 (2005).
PubMed Google Scholar
Ananth, C. V., Savitz, D. A., Luther, E. R. & Bowes, W. A. Jr. Preeclampsia and preterm birth subtypes in Nova Scotia, 1986 to 1992. Am. J. Perinatol. 14, 17–23 (1997).
Article CAS Google Scholar
Kere, J. Human population genetics: lessons from Finland. Annu. Rev. Genomics Hum. Genet. 2, 103–128 (2001).
Article CAS Google Scholar
Peltonen, L., Jalanko, A. & Varilo, T. Molecular genetics of the Finnish disease heritage. Hum. Mol. Genet. 8, 1913–1923 (1999).
Article CAS Google Scholar
Lim, E. T. et al. Distribution and medical impact of loss-of-function variants in the Finnish founder population. PLoS Genet. 10, e1004494 (2014).
Article Google Scholar
Palotie, A., Widen, E. & Ripatti, S. From genetic discovery to future personalized health research. N. Biotechnol. 30, 291–295 (2013).
Article CAS Google Scholar
Ramos, E. et al. Population-based rare variant detection via pooled exome or custom hybridization capture with or without individual indexing. BMC Genomics 13, 683-2164-13-683 (2012).
Kaartokallio, T. et al. Microsatellite polymorphism in the heme oxygenase-1 promoter is associated with nonsevere and late-onset preeclampsia. Hypertension 64, 172–177 (2014).
Article CAS Google Scholar
Grauers, A. et al. Candidate gene analysis and exome sequencing confirm LBX1 as a susceptibility gene for idiopathic scoliosis. Spine J. 15, 2239–2246 (2015).
Article Google Scholar
Li, H. & Durbin, R. Fast and accurate long-read alignment with Burrows-Wheeler transform. Bioinformatics 26, 589–595 (2010).
Article Google Scholar
Li, H. et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
Article Google Scholar
Wang, K., Li, M. & Hakonarson, H. ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 38, e164 (2010).
Article Google Scholar
1000 Genomes Project Consortium et al. An integrated map of genetic variation from 1,092 human genomes. Nature 491, 56–65 (2012).
Quinlan, A. R. & Hall, I. M. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 26, 841–842 (2010).
Article CAS Google Scholar
Fuentes Fajardo, K. V. et al. Detecting false-positive signals in exome sequencing. Hum. Mutat. 33, 609–613 (2012).
Article CAS Google Scholar
Ju, Y. S. et al. Extensive genomic and transcriptional diversity identified through massively parallel DNA and RNA sequencing of eighteen Korean individuals. Nat. Genet. 43, 745–752 (2011).
Article CAS Google Scholar
Majander, K. K., Villa, P. M., Kivinen, K., Kere, J. & Laivuori, H. A follow-up linkage study of Finnish pre-eclampsia families identifies a new fetal susceptibility locus on chromosome 18. Eur. J. Hum. Genet. 21, 1024–1026 (2013).
Article CAS Google Scholar
Purcell, S. et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81, 559–575 (2007).
Article CAS Google Scholar
Purcell, S., Cherny, S. S. & Sham, P. C. Genetic Power Calculator: design of linkage and association genetic mapping studies of complex traits. Bioinformatics 19, 149–150 (2003).
Article CAS Google Scholar
Hannelius, U. et al. Population substructure in Finland and Sweden revealed by the use of spatial coordinates and a small number of unlinked autosomal SNPs. BMC Genet. 9, 54 (2008).
Article Google Scholar
Jakkula, E. et al. The genome-wide patterns of variation expose significant substructure in a founder population. Am. J. Hum. Genet. 83, 787–794 (2008).
Article CAS Google Scholar
McCarthy, D. J. et al. Choice of transcripts and software has a large effect on variant annotation. Genome Med. 6, 26 (2014).
Article Google Scholar
Jiao, H. et al. Exome sequencing followed by genotyping suggests SYPL2 as a susceptibility gene for morbid obesity. Eur. J. Hum. Genet. 23, 1216–1222 (2015).
Article CAS Google Scholar
Wang, B., Matsuoka, S., Carpenter, P. B. & Elledge, S. J. 53BP1, a mediator of the DNA damage checkpoint. Science 298, 1435–1438 (2002).
Article CAS ADS Google Scholar
Wang, L. et al. Common genetic variations in the vitamin D pathway in relation to blood pressure. Am. J. Hypertens. 27, 1387–1395 (2014).
Article CAS Google Scholar
Schreiber, R. et al. Expression and function of epithelial anoctamins. J. Biol. Chem. 285, 7838–7845 (2010).
Article CAS Google Scholar
Kunzelmann, K. et al. Expression and function of epithelial anoctamins. Exp. Physiol. 97, 184–192 (2012).
Article CAS Google Scholar
Sunryd, J. C. et al. TMTC1 and TMTC2 are novel endoplasmic reticulum tetratricopeptide repeat-containing adapter proteins involved in calcium homeostasis. J. Biol. Chem. 289, 16085–16099 (2014).
Article CAS Google Scholar
Xu, J., Shi, S., Matsumoto, N., Noda, M. & Kitayama, H. Identification of Rgl3 as a potential binding partner for Rap-family small G-proteins and profilin II. Cell. Signal. 19, 1575–1582 (2007).
Article CAS Google Scholar
Ehrhardt, G. R., Korherr, C., Wieler, J. S., Knaus, M. & Schrader, J. W. A novel potential effector of M-Ras and p21 Ras negatively regulates p21 Ras-mediated gene induction and cell growth. Oncogene 20, 188–197 (2001).
Article CAS Google Scholar
Bertina, R. M. et al. Mutation in blood coagulation factor V associated with resistance to activated protein C. Nature 369, 64–67 (1994).
Article CAS ADS Google Scholar
Morgan, L. et al. InterPregGen: genetic studies of pre-eclampsia in three continents. Nor. Epidemiol. 24, 141–146 (2014).
Article Google Scholar

Download references

Acknowledgements

We express our deep gratitude to all the study participants. We appreciate the contribution of the following members of the FINNPEC Study Group: Eeva Ekholm (Turku University Central Hospital, Turku, Finland), Kaarin Mäkikallio-Anttila, (Oulu University Hospital, Oulu, Finland), Reija Hietala, Susanna Sainio and Terhi Saisto (Helsinki University Hospital, Helsinki, Finland), Tia Aalto-Viljakainen, Sanna Heino and Anna Inkeri Lokki (University of Helsinki, Helsinki, Finland), and Leena Georgiadis (Kuopio University Hospital, Kuopio, Finland). Anna Grauers is thankfully acknowledged for her contribution with the scoliosis cohort. The expert technical assistance of Katariina Hirvonen, Elina Huovari, Eija Kortelainen, Satu Leminen, Aija Lähdesmäki, Susanna Mehtälä, and Christina Salmen is gratefully acknowledged. We would also like to acknowledge support from Science for Life Laboratory, the Swedish national infrastructure SNISS, Uppmax and IT Center for Science (CSC) for providing assistance in massively parallel sequencing and computational infrastructure, and from the Institute for Molecular Medicine Finland FIMM Technology Centre, University of Helsinki for performing the genotyping. The Sequencing Initiative Suomi (SISu) project is an international collaboration between research groups aiming to build tools for genomic medicine. These groups are generating whole genome and whole exome sequence data from Finnish samples and provide data resources for the research community. Key groups of the project are from Universities of Eastern Finland, Oulu and Helsinki and The Institute for Health and Welfare, Finland, Lund University, The Wellcome Trust Sanger Institute, University of Oxford, The Broad Institute of Harvard and MIT, University of Michigan, Washington University in St. Louis, and University of California, Los Angeles (UCLA). The project is coordinated in the Institute for Molecular Medicine Finland at the University of Helsinki. This work was supported by Academy of Finland; Jane and Aatos Erkko Foundation; Päivikki and Sakari Sohlberg Foundation; Research Funds of the University of Helsinki; Government Special state subsidy for Health Sciences (EVO funding) at Helsinki and Uusimaa Hospital District; Novo Nordisk Foundation; Finnish Foundation for Pediatric Research; Emil Aaltonen Foundation; Sigrid Jusélius Foundation; Biocentrum Helsinki; Doctoral Programme in Biomedicine (DPBM) to TK; Doctoral Programme in Clinical Research (KLTO) to T.K., Research Foundation of the University of Helsinki to T.K. and Biomedicum Helsinki Foundation to T.K. The collection and analyses of scoliosis samples was supported by the Swedish Research Council (number K-2013-52X-22198-01-3).

Author information

Tea Kaartokallio and Jingwen Wang: These authors contributed equally to this work.

Authors and Affiliations

Medical and Clinical Genetics, University of Helsinki and Helsinki University Hospital, Helsinki, Finland
Tea Kaartokallio & Hannele Laivuori
Department of Biosciences and Nutrition, and Science for Life Laboratory, Karolinska Institutet, SE-141 83 Stockholm, Sweden
Jingwen Wang, Hong Jiao & Juha Kere
Obstetrics and Gynaecology, University of Helsinki and Helsinki University Hospital, Helsinki, Finland
Seppo Heinonen & Hannele Laivuori
Chronic Disease Prevention Unit, National Institute for Health and Welfare, Helsinki, Finland
Eero Kajantie
Children’s Hospital, Helsinki University Hospital and University of Helsinki, Helsinki, Finland
Eero Kajantie
PEDEGO Research Unit, MRC Oulu, Oulu University Hospital and University of Oulu, Oulu, Finland
Eero Kajantie & Anneli Pouta
Division of Cardiovascular Medicine, University of Cambridge, Cambridge, UK
Katja Kivinen
Department of Government services, National Institute for Health and Welfare, Helsinki, Finland
Anneli Pouta
Department of Orthopedics, Karolinska University Hospital, Stockholm, Sweden
Paul Gerdhem
Department of Clinical Sciences, Intervention and Technology (CLINTEC), Karolinska Institutet, SE-141 86 Stockholm, Sweden
Paul Gerdhem
Molecular Neurology Research Program, University of Helsinki, Helsinki, Finland
Juha Kere
Folkhälsan Institute of Genetics, Helsinki, Finland
Juha Kere
Institute for Molecular Medicine Finland, University of Helsinki, Helsinki, Finland
Hannele Laivuori

Authors

Tea Kaartokallio
View author publications
You can also search for this author in PubMed Google Scholar
Jingwen Wang
View author publications
You can also search for this author in PubMed Google Scholar
Seppo Heinonen
View author publications
You can also search for this author in PubMed Google Scholar
Eero Kajantie
View author publications
You can also search for this author in PubMed Google Scholar
Katja Kivinen
View author publications
You can also search for this author in PubMed Google Scholar
Anneli Pouta
View author publications
You can also search for this author in PubMed Google Scholar
Paul Gerdhem
View author publications
You can also search for this author in PubMed Google Scholar
Hong Jiao
View author publications
You can also search for this author in PubMed Google Scholar
Juha Kere
View author publications
You can also search for this author in PubMed Google Scholar
Hannele Laivuori
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

H.L., J.K., H.J., S.H., E.K., K.K., J.W. and T.K. were involved in study design. H.L., J.K., S.H., E.K., K.K., A.P. and P.G. designed collection of the biological samples and clinical information. J.W. and T.K. performed the analyses under supervision of H.J., J.K. and H.L. T.K. and J.W. drafted the manuscript and all the authors revised and edited it. All the authors contributed critical discussion and approved the final version of the manuscript.

Corresponding author

Correspondence to Tea Kaartokallio.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Supplementary information

Supplementary Information (PDF 258 kb)

Supplementary Table S1 (XLS 50 kb)

Supplementary Table S2 (XLS 38 kb)

Supplementary Table S3 (XLS 43 kb)

Supplementary Table S4 (XLS 39 kb)

Supplementary Table S5 (XLS 61 kb)

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Kaartokallio, T., Wang, J., Heinonen, S. et al. Exome sequencing in pooled DNA samples to identify maternal pre-eclampsia risk variants. Sci Rep 6, 29085 (2016). https://doi.org/10.1038/srep29085

Download citation

Received: 17 February 2016
Accepted: 14 June 2016
Published: 07 July 2016
DOI: https://doi.org/10.1038/srep29085

This article is cited by

Identification of genetic polymorphisms modulating nausea and vomiting in two series of opioid-treated cancer patients
- Francesca Colombo
- Giulia Pintarelli
- Augusto Tommaso Caraceni
Scientific Reports (2020)
Investigation of rare and low-frequency variants using high-throughput sequencing with pooled DNA samples
- Jingwen Wang
- Tiina Skoog
- Hong Jiao
Scientific Reports (2016)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Samples and Methods

Study participants

Exome sequencing

Sequenom genotyping

Association analysis

Results

Clinical characteristics

Exome sequencing data

The first round of Sequenom genotyping in the FINNPEC samples

The second round of Sequenom genotyping in the FINNPEC samples

Association analysis in the combined FINNPEC-SISu data set

Quantitative traits association analysis

Discussion

Additional Information

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links