An endometrial gene expression signature accurately predicts recurrent implantation failure after IVF

Koot, Yvonne E. M.; van Hooff, Sander R.; Boomsma, Carolien M.; van Leenen, Dik; Groot Koerkamp, Marian J. A.; Goddijn, Mariëtte; Eijkemans, Marinus J. C.; Fauser, Bart C. J. M.; Holstege, Frank C. P.; Macklon, Nick S.

doi:10.1038/srep19411

Download PDF

Article
Open access
Published: 22 January 2016

An endometrial gene expression signature accurately predicts recurrent implantation failure after IVF

Yvonne E. M. Koot¹^na1,
Sander R. van Hooff²^na1,
Carolien M. Boomsma¹^na1,
Dik van Leenen²^na1,
Marian J. A. Groot Koerkamp²^na1,
Mariëtte Goddijn³^na1,
Marinus J. C. Eijkemans^1,4^na1,
Bart C. J. M. Fauser¹^na1,
Frank C. P. Holstege²^na1 &
…
Nick S. Macklon^1,5^na1

Scientific Reports volume 6, Article number: 19411 (2016) Cite this article

15k Accesses
119 Citations
218 Altmetric
Metrics details

Subjects

Abstract

The primary limiting factor for effective IVF treatment is successful embryo implantation. Recurrent implantation failure (RIF) is a condition whereby couples fail to achieve pregnancy despite consecutive embryo transfers. Here we describe the collection of gene expression profiles from mid-luteal phase endometrial biopsies (n = 115) from women experiencing RIF and healthy controls. Using a signature discovery set (n = 81) we identify a signature containing 303 genes predictive of RIF. Independent validation in 34 samples shows that the gene signature predicts RIF with 100% positive predictive value (PPV). The strength of the RIF associated expression signature also stratifies RIF patients into distinct groups with different subsequent implantation success rates. Exploration of the expression changes suggests that RIF is primarily associated with reduced cellular proliferation. The gene signature will be of value in counselling and guiding further treatment of women who fail to conceive upon IVF and suggests new avenues for developing intervention.

Targeted gene expression profiling for accurate endometrial receptivity testing

Article Open access 26 August 2023

A molecular staging model for accurately dating the endometrial biopsy

Article Open access 06 October 2023

Differential expression of ion channel coding genes in the endometrium of women experiencing recurrent implantation failures

Article Open access 27 August 2024

Introduction

Despite advances in assisted reproductive techniques (ART), the majority of IVF attempts still do not result in a successful pregnancy. Failure of implantation of apparently morphologically sound embryos now represents the major limiting step in IVF success. A significant proportion of couples undergoing IVF experience recurrent implantation failure (RIF), a devastating occurrence for patients in which serial transfers of high quality embryos fail to result in a pregnancy¹. The perceived need among clinicians and patients to intervene to address RIF successfully has led to the introduction of numerous empirical and thus far ineffective adjuvant interventions. Successful management of this frustrating and costly complication of IVF requires a greater understanding both of the underlying mechanisms and of the individual prognosis for ultimate success when RIF occurs.

RIF has been defined as the absence of implantation after three or more transfers of high quality embryos or after placement of 10 or more embryos in multiple transfers^2,3. Studies of the probability of a systemic underlying cause for implantation failure in these patients, rather than simply a chance effect, indicate that an underlying aetiology is likely to exist in most RIF patients^3,4,5,6,7,8. Multiple aetiologies for implantation failure have been proposed but until recently the primary focus has been on the embryo and in particular the impact of aneuploidy^5,9,10. Maternal factors may also contribute and the clinical approach to investigating RIF now involves the exclusion of thrombophilic gene mutations, autoimmune conditions and uterine anomalies. However, in the majority of cases no clear cause can be identified^3,7,11,12. In recent years it has become apparent that constitutive endometrial dysfunction could represent an important contributor to this condition^11,13,14.

Recent studies have advanced our understanding of the mechanisms which renders the human endometrium receptive for a limited period in the mid-luteal phase of the menstrual cycle and of the biological significance of this putative ‘window of receptivity’^11,15,16,17. Although no single, clinically relevant morphological, molecular or histological marker capable of indicating endometrial receptivity has been identified, global transcriptomic and secretomic analyses of human endometria are now providing us with novel insights into patterns of gene and protein expression which characterise the receptive endometrium.

Endometrial gene expression has been shown to be sensitive to cyclical hormonal regulation^{16,18,19,20,21,22}, ovarian stimulation for IVF^23,24,25 and to be disrupted in the presence of gynaecological pathologies such as endometriosis^26,27 or during the use of an intrauterine device²⁸. Genome-wide analyses have yielded insight into mRNA expression changes during the natural endometrium cycle²⁹ and growing evidence supports the concept of a receptive gene expression profile^22,30,31, which may be disrupted in patients experiencing RIF^13,14,22. The potential value of identifying a gene expression profile predictive of RIF is considerable as this would not only guide prognosis, but inform appropriate and effective therapeutic intervention^14,32.

Thus far studies that directly compare endometrial mRNA expression in RIF patients with controls, have been limited¹³ and not subject to validation on an independent cohort, a requirement to draw any firm conclusions. This is in part due to the challenge of recruiting sufficient participants willing to undergo an endometrium biopsy. In a different approach, a selected subset of genes with altered expression in the endometrium during the natural cycle of healthy women³³ has been studied in RIF patients, but a significant association with RIF was not found for this particular subset of genes³². In the study presented here, we sought to rigorously determine whether endometrial gene expression differs between women with RIF and controls, with the goal of identifying and validating the endometrial gene expression signature associated with RIF.

Results

Patients and samples

Mid-luteal phase endometrial biopsies were obtained from 43 women with RIF and 72 controls, i.e. women who gave live birth after IVF/ICSI. Clinical characteristics of the RIF patients and controls are described in Table 1. No significant differences in age, BMI, smoking, number of women with primary infertility and cause of infertility were identified between controls and RIF patients. As a corollary of selection criteria, the mean number of embryo transfers carried out before endometrial sampling was greater in RIF patients than controls (5.3 vs. 2.4), together with the mean number of transferred embryos (7.8 vs. 3.1). No more than two embryos were inserted in the uterine cavity per transfer and for patients under the age of 36, only single embryo transfers were performed during the first two IVF or ICSI attempts. The mean implantation rate per transferred embryo for RIF patients was 3.2%, compared with 62.7% in the control group. Six patients in the RIF group reported having a live birth before embarking on further unsuccessful IVF/ICSI treatment. Two derived from spontaneous conception and four from multiple ART treatments. All six subjects met the criteria for inclusion in the RIF group after failing multiple IVF/ICSI treatments trying to conceive a second child. Nineteen patients (26%) from the control population had delivered two live births and four patients (6%) delivered three live births after ART treatment prior to inclusion in the study.

Table 1 Patient characteristics at time of biopsy.

Full size table

Expression profiles

mRNA expression profiles were successfully obtained from all samples. Principle component analysis (PCA) indicated batch effects and an effect related to slight variations in the biopsy timing (Methods, Supplementary Fig. 1 and Supplementary Table 1). The batch effects were consistent with the batch-wise processing of samples and were successfully removed by statistical modeling (Methods, Supplementary Fig. 1 and Supplementary Table 1). A minor batch effect related to the medical center where the biopsy was performed was not removed because the number of samples was too small to accurately model the effect (n = 9 for the Academic Medical Center (AMC) versus n = 106 for the University Medical Center Utrecht (UMCU)). None of the other variables (age, BMI, smoking, nature of infertility and nature of treatment) showed a significant effect on the gene expression profiles (Supplementary Fig. 2 & Supplementary Table 2).

Gene signature based prediction

As described in detail in the Methods section, samples were randomly assigned into a signature discovery set (n = 81) and an independent validation set (n = 34), keeping the ratio of RIF patients to controls similar. Iterative rounds of cross-validation (Supplementary Fig. 3) were applied within the signature discovery set to find genes capable of distinguishing RIF patients from controls. The use of cross-validation reduces the risk of over-fitting on the signature discovery set. Each iteration results in a separate gene set. All genes were then ranked according to how frequently they were present in the separate gene sets. Selecting all genes with a frequency of 5% or higher, results in a 303 gene signature (Supplementary Table 3). The 303 genes selected by the cross-validation approach are those most suitable for distinguishing between the two groups and were therefore subsequently employed in a support vector machine (SVM) classifier built using the entire discovery set (Supplementary Fig. 3 and Methods).

Figure 1a shows the classification of samples in the discovery set using the 303-gene classifier. The accuracy of the RIF prediction (PPV) was 90% with a sensitivity of 90% (Table 2). Most importantly, application to the independent validation set confirms the signature’s ability to distinguish RIF patients from controls (Fig. 1b). All samples classified as RIF were indeed RIF patients (PPV = 100%) with a sensitivity of 58%. The non-RIF classification was accurate in 81% of cases (Table 2). The areas under the ROC curves (Fig. 1c and Supplementary Fig. 4) confirm that gene expression based classification in general (Supplementary Fig. 4) and the 303-gene signature in particular (Fig. 1c), can robustly classify RIF patients.

Table 2 Classification metrics.

Full size table

An obvious difference between RIF patients and controls is the frequency of successful implantations. Differences in gene expression between RIF patients and controls could therefore be due to the effects of previous pregnancy rather than reflecting a more direct link to implantation failure. To investigate this, the 303-gene expression signatures of patients with and without a previous pregnancy were compared. No difference was found (Supplementary Fig. 5), ruling out that the RIF status prediction reported here is confounded by (the absence of) previous pregnancies.

Functional analysis of RIF endometrial gene expression

Besides classification of patients, an additional benefit of gene expression analyses is the potential to shed light on factors underlying a particular condition. A striking characteristic of the RIF signature genes is that there are many more genes with decreased expression (81%, Fig. 2d). Gene set enrichment analysis (GSEA) of the entire expression-profiles using Gene Ontology (GO) slim categories indicates more specifically the various cellular processes and structures differentially affected in RIF patients versus controls (Table 3). Most striking is the down-regulation in RIF patients of genes involved in cell cycle regulation and cell division (Table 3, Fig. 2a), indicative of a reduced rate of cellular proliferation. Besides reduced expression of many genes involved in general proliferative processes (Table 3), the RIF endometrium transcriptome also shows reduced expression of genes involved in cytoskeleton and cilia formation (Table 3, Fig. 2b). The latter is of interest given the presence of ciliated cells during the implantation window of healthy women³⁴. A previous study indicated that a considerable fraction of genes down-regulated in RIF patients are estrogen dependent¹³. Here, no strong indication of estrogen dependent down-regulation of gene expression is found in RIF patients (Supplementary Fig. 6). A possible explanation lies in the definition of ‘estrogen dependent’. The previous study¹³ referred to the effect of estrogen depletion on gene expression in general³⁵, whereas our analysis (Supplementary Fig. 6) applies the specific Gene Ontology term ‘response to estrogen’. Notably, Bourdeau et al. demonstrated the association between estrogen depletion and cell cycle progression³⁵. The differentially regulated genes are therefore likely to be cell cycle related and not necessarily directly regulated by estrogen. This is consistent with our observation of a down-regulation of cell proliferation in RIF patients. Similar to the number of up-regulated genes in the RIF signature, the number of enriched GO categories with increased expression is more limited, but do exhibit common features: processes involved in extracellular organization and cell motility (Table 3).

Table 3 GSEA results.

Full size table

The gene set enrichment analysis was performed on the entire transcriptomes of RIF patients and controls. For individual genes from the RIF signature, Fig. 2e shows their representation within the various functional classes (see Supplementary Fig. 7 for the complete overview of all signature genes). The individual signature genes are also listed in Supplementary Table 3. Another feature of the RIF signature is the high proportion of transcription factors, indicated by enrichment of the GO term DNA binding (Table 3). Besides many uncharacterized DNA binding proteins, the signature gene set also contains established transcription factors such as the forkhead transcription factor FOXK2, family members of which have previously been implicated in ovary development and function^36,37.

Further expression-based patient stratification

The gene expression classifier yields a binary result, classifying subjects as either RIF or control. Some RIF patients are incorrectly classified (Fig. 1a,b). To investigate whether this was a random occurrence influenced by the composition of the final gene set, the iterative signature discovery procedure (Supplementary Fig. 3) was repeated using all patients and controls. Each round of the signature discovery procedure consists of predicting patients and controls using a different gene set (Methods). By assessing the predictions for each round, a robust classification error rate was determined for each patient (Fig. 3a). Strikingly, one group of RIF patients is almost always misclassified (Fig. 3a, right, > 90% error rate). This indicates that misclassification is not a random event and suggests that this subpopulation actually represents a distinct class of RIF patients for which the underlying cause of RIF is different and not represented in the endometrial gene expression pattern.

Analysis of the classification error rate (Fig. 3a) also indicates two other groups: RIF patients with a low classification error rate (Fig. 3a, left, < 10% error rate) and an intermediate group, more difficult to classify by gene expression (error rate between 10% and 90%). To determine whether this stratification has any clinical relevance, the implantation success rate per IVF cycle was determined for the three groups (Fig. 3b). By definition all three RIF groups have a low implantation success rate. Interestingly, RIF patients with an intermediate classification error rate have a significantly higher implantation success rate compared to the RIF patients with a low classification error rate (Fig. 3b). The higher implantation success rate for the intermediate group fits with the idea that this group presents an intermediate expression-based RIF phenotype as judged by the classification error rate. This further stratification strengthens the conclusion that expression-based classification is clinically relevant. The RIF patients with the highest classification error rate (>90%) seem to have a low implantation success rate (Fig. 3b). Although this would agree with their status as a distinct group of patients, suffering from RIF but without presenting an aberrant expression signature, the difference in implantation success is not statistically significant compared to either of the two other groups.

Discussion

Studies of the impact of the endogenous and exogenously manipulated hormonal milieu on endometrial gene expression^25,29 have indicated the susceptibility of the endometrium to disruption by such factors. An early genome-wide study, directly comparing RIF patients with controls has strengthened this¹³. However, there has been a lack of large human studies allowing for the robust determination of a RIF-associated gene expression signature while also incorporating an independent validation. In this study, the number of samples has been sufficiently high to determine a gene signature and show on an independent set that the signature is robustly capable of distinguishing between women experiencing RIF and IVF controls.

A gene signature that predicts RIF with a high PPV offers the ability to identify patients whose chances of a successful pregnancy are very small, which is of clear value in counselling patients as to the wisdom of investing further time and effort and money in further treatment. Moreover, the strong negative predictive value (NPV) of 81% indicates that, given a non-RIF expression profile, the chance of an endometrial factor impeding treatment success is small. This would suggest a positive outlook for continued treatment, especially if embryo quality can be improved, by using donated oocytes for example. Discerning whether RIF in a particular patient reflects impaired endometrial or embryo quality, a perturbed dialog between the embryo and the endometrium, or simply a chance event, remains challenging. However, the different predictive accuracies observed for RIF patients (Fig. 3a), whereby some are almost always correctly classified while others are not, enables those with a significant endometrial factor to be identified. Those who are ‘misclassified’ and therefore more closely resemble an expression profile of a healthy, receptive endometrium, are likely to have a significant embryo quality issue. However, a sizeable number of patients fall in-between the two groups, implying a non-binary distinction between refractory and receptive endometrium. It is interesting to observe that this group of patients had a better IVF implantation rate compared to the patients with a clear RIF expression signature. This suggests that the strength of the RIF-associated expression profile is correlated with the severity of the RIF phenotype.

Gene function analysis shows a strong skew towards down-regulation of processes in RIF patients. This is consistent with a previous study of RIF¹³ that also reported enrichment for cell cycle associated gene function within the set of down-regulated genes. Interestingly, an endometrium lagging behind the normal phase of proliferative and secretory events has recently been put forward as a cause of implantation failure³² and a gene expression test designed to identify this has recently been commercialized (Endometrial Receptivity Array (ERA), iGenomix, Valencia, Spain)³². The genes present on the ERA were determined from studies of the endometrial cycle in healthy individuals^25,22 and the test does not distinguish between patients with RIF and controls³². In contrast, the classifier presented here was determined directly from differential expression between RIF patients and controls and thus performs well in distinguishing between these two groups (Fig. 1).

The invasive nature of endometrial biopsy required to obtain tissue for testing precludes its use in a treatment cycle. However, clinical application of the 303-gene signature could be practical in a non-treatment cycle after multiple failed IVF treatments, when a RIF diagnosis has already been established. A signature-based diagnostic test to investigate the contribution of an endometrial factor in implantation failure would be a meaningful enhancement to the current limited clinical management strategies. However, perhaps the greatest clinical benefit is to be found in diagnosing receptive or refractory endometrium prior to the start of IVF/ICSI treatment. Up to one third of couples presenting with infertility have no cause identified by routine investigations⁹. However, at present there is no validated test of endometrial receptivity. It can be proposed that couples presenting with infertility would benefit from a test of endometrial receptivity as part of their initial investigations, in order to identify whether there is a significant endometrial factor which may affect their chances of conceiving either spontaneously or by IVF treatment. Clearly, validation of the 303-gene predictor in prospective cohorts of IVF/ICSI patients would be required.

Methods

Study design and tissue collection

The study consists of two serial cohorts. In each cohort women experiencing RIF and fertile IVF/ICSI controls participated. The study was approved by the Medical Review Ethics Committee of the University Medical Center Utrecht and the Medical Review Ethics Committee of the Academic Medical Center and performed in accordance with the approved guidelines. Written informed consent was obtained from all participating subjects.

All participants had undergone IVF/ICSI treatment in two tertiary referral academic hospitals. They were aged ≤38 years at time of biopsy (one patients was 39 and 5 days at biopsy, 38 at diagnosis RIF), had regular menstrual cycles of 25–35 days and did not use oral contraceptives or an intra-uterine device.

IVF and ICSI treatments were performed according to local protocols. Treatments were started after at least one year of trying to conceive spontaneously. All included women had undergone routine fertility investigations and all had an indication for IVF/ ICSI treatments according to the Dutch Society of Obstetrics and Gynaecology guidelines³⁸. Women underwent controlled ovarian hyperstimulation with recombinant FSH (Puregon, MSD, The Netherlands/ Gonal-F, Merck Serono, Germany) or HMG (Menopur, Ferring, The Netherlands). To prevent a premature LH surge, pituitary suppression was achieved by co-treatment with the GnRH agonist triptorelin (Decapeptyl, Ferring, The Netherlands) or a GnRH antagonist (Orgalutran, MSD, The Netherlands/ Cetrotide, Merck Serono, Germany). Follicular maturation was induced by 10,000 IU urinary hCG (Pregnyl, MSD, The Netherlands). Cumulus oocyte complexes were retrieved by transvaginal ultrasound-guided follicle aspiration 36 hours after hCG injection. Oocytes were inseminated with 10,000–15,000 progressively motile spermatozoa (IVF) or injected with a single spermatozoon 2–4 hours after follicle aspiration (ICSI). Embryo transfer was performed 3–4 days after oocyte retrieval using a Wallace catheter (Smiths Medical, USA). Luteal phase support was performed by intravaginal progesterone 400–600 mg/day (Utrogestan, Besins, Belgium).

Supernumerary good quality embryos were frozen on day of the embryo transfer using a slow freeze protocol. Frozen-thawed embryos were transferred in ultrasound monitored natural cycles after ovulation induction with 5,000 IU hCG (Pregnyl), or in artificial cycles using estradiol (Progynova, Bayer Health Care Schering, Germany) and intravaginal progesterone (Utrogestan).

RIF was defined as ≥3 failed IVF or ICSI treatments or transfer of ≥10 embryos without the occurrence of a pregnancy (including frozen-thawed cycles). A pregnancy was defined as a positive HCG serum test or a positive at home urine test 14 days after embryo transfer.

Women meeting the definition for RIF were excluded if implantation failure could be explained by other factors; i.e. poor embryo quality (defined as less than 8 cell embryonal stage at day 3 after oocyte retrieval, or less than 12 cell stage at day 4), poor ovarian response (defined as <4 oocytes retrieved on adequate ovarian stimulation), or known disturbances in the uterine cavity or endometrial pathology, such as uterine anomalies, hydrosalpinx, or evidence of endometriosis (including endometriomas detected by ultrasound or disease diagnosed by laparoscopy). All RIF patients were screened for relevant inherited and acquired thrombophilias and abnormalities in glycosylated haemoglobin (Hb_A1c) and thyroid-stimulating hormone levels and excluded when results were aberrant.

The control group consisted of healthy women who had conceived within the first three cycles of IVF or ICSI treatment.

Between June 2006 and November 2007, the first cohort of 22 RIF patients and 23 controls (all ICSI treatment) participated in the University Medical Center Utrecht (UMCU) in Utrecht. Between October 2011 and May 2013, a second cohort was recruited to obtain a sufficient sample size for of signature discovery and validation. This cohort was recruited in the UMCU and the Academic Medical Center (AMC), in Amsterdam and consisted of 21 RIF patients and 49 controls (26 after ICSI and 23 after IVF treatment).

An endometrial biopsy was scheduled 6 (n = 27) or 7 (n = 71) days after the putative luteinizing hormone (LH) surge in a natural (non-treatment) cycle in patients and controls, which is considered to be a representative day of the window of implantation. The biopsy was never performed in the first natural cycle after an unsuccessful IVF or ICSI cycle, to eliminate the effect of hormonal influence from the ovarian stimulation treatment. Due to scheduling wishes of the participants, several biopsies were performed late on day 5 (n = 8) or early on day 8 (n = 9) after the LH surge. Urinary LH was monitored by a home LH ovulation predictor kit (Ovulady, Clindia Benelux, The Netherlands). Biopsies were collected using an endometrium sampling device (Endobiops Standard CH9, Gynotec, The Netherlands) under sterile conditions. The sample was snap frozen in liquid nitrogen and stored at −80 °C until use.

Gene expression profiling

RNA isolation

Total RNA was isolated from individual tissue samples using Trizol reagent (Invitrogen) following the manufacturer’s protocol (including optional centrifugation), followed by a purification using the RNeasy Mini Kit (Qiagen) and a DNAse treatment using the Qiagen DNA-free kit. The yield and quality of total RNA was checked by spectrophotometry and by the Agilent Bioanalyser (Agilent, Belgium). All 115 samples showed good RNA quality, with clear 18 S and 28S ribosomal bands and RNA integrity numbers (RIN) between 5 and 10.

Microarray hybridization

For each total RNA sample, two expression profiles were generated in dye-swap. The samples of both patients and controls were compared against a commercial reference (Universal Human Reference RNA catalog #740000, Stratagene). The microarrays were human whole genome gene expression microarrays V2 (Agilent, Belgium) representing 34,127 H.sapiens 60-mer probes in a 4 × 44K layout. Probe sequences from this array were re-annotated by BLAST-searching against database version 71.37 at ENSEMBL. cDNA synthesis, cRNA amplification, labeling, quantification, quality control and fragmentation were performed with an automated system (Caliper Life Sciences NV/SA, Belgium), starting with 3 μg total RNA from each sample, all as previously described in detail³⁹. Microarray hybridization and washing was with a HS4800PRO system with QuadChambers (Tecan, Benelux) using 1000 ng, 1–2% Cy5 or Cy3 labeled cRNA per channel as described³⁹. Slides were scanned on an Agilent G2565BA scanner at 100% laser power, 30% PMT.

Data normalization

After automatic data extraction using Imagene 8.0 (BioDiscovery), the mean spot intensities were normalized using quantile normalization⁴⁰ and the two dye-swapped profiles per sample were merged by averaging the fold changes. Quality control checks using principal component analysis (PCA, sva R package⁴¹) indicated batch effects as well as a correlation between gene expression variation and the time of biopsy relative to the LH surge (Supplementary Fig. 1A & Supplementary Table 1). Both effects were modelled for each gene individually using linear models (limma R package⁴²) and the effect estimates were used to transform the data. Because one of the groups consisted of only six samples (“cohort 2, batch 2”), rather than fitting a single model including biopsy timing, batch and sample class, we opted to estimate the biopsy timing effect using all control samples from the first batch of the second cohort (n = 45). This avoided confounding the biopsy timing effect with the batch effect (by only including a single batch) or with the sample effect (by including only control samples). The effects estimates were used to transform the data using LH + 7 as a reference thereby removing the biopsy timing effect (Supplementary Fig. 1B, right panel and Supplementary Table 1). Batch effects were similarly removed. One batch effect was between the two cohorts. The second cohort also separated into two batches which had been processed for expression profiling in separate runs (Supplementary Fig. 1A, B left panels, Supplementary Table 1). The batch effects were estimated using only control samples (n = 72). This avoids confounding the batch effect with the sample effect (by including only control samples). After estimating the batch effect the data was transformed using the first batch of the second cohort as the reference. Supplementary Fig. 1C shows the transformed data used for all subsequent analyses, devoid of major biopsy timing or batch effects (Supplementary Table 1). All microarray gene expression data have been deposited in the public data repositories ArrayExpress (E-MTAB-2591) and GEO (GSE58144).

Data analysis

Signature discovery

For RIF signature discovery, a randomly selected subset of samples (signature discovery set, n = 81, 38% RIF) was created whereby the ratio of RIF patients to controls was kept similar to the full complement of samples. The remaining samples were assigned to the validation set (n = 34, 35% RIF).

For signature discovery, array probes with a low median intensity in the samples (log2 ≤ 6) were filtered out, as were probes with a standard deviation of expression fold-change versus the reference material that was below the median standard deviation of all probes. Both median intensity and standard deviation were based on the signature discovery set measurements. The intensity filtering was performed to eliminate lowly expressed genes, which are inherently more prone to measurement noise. Filtering against low standard deviation eliminates stably expressed genes with little variation among samples. After filtering expression measurements for intensity and standard deviation, 15,502 of the 34,217 initial array probes remained (12,198 of 22,395 unique genes).

Signature discovery consisted of 100 rounds of randomly selecting a training subset (4/5 of all samples) from the signature discovery set (resampling without replacement) which was subsequently used to rank genes based on their potential to differentiate RIF patients from controls (Supplementary Fig. 3). The signal to noise ratio was used as the ranking metric⁴³. The 100 top ranked genes were used to build a linear support vector machine (SVM) classifier (e1071 R package⁴⁴) using the training subset as input. The SVM classifier was built with the option to compute class probabilities. We termed the resultant probability estimate the signature score and used a cutoff of 0.5 (equal probabilities for either class) for classification ( < 0.5 control classification, ≥ 0.5 RIF patient classification). The trained SVM classifier was employed to predict the class of the samples not part of the training subset (test subset). This procedure was repeated 100 times recording the genes selected as well as the prediction results used for generating the ROC curve. After 100 rounds, genes were ranked based on the number of times they appeared in the list of 100 top genes and all genes that appeared > 5 times or more were selected into the final gene signature (Supplementary Fig. 3). This final gene signature contained 320 array probes which represented 303 unique genes. For the sake of consistency, this is referred to as the 303-gene signature. Leave-one-out cross-validation was used to test the effectiveness of this gene signature on the signature discovery set (using an SVM classifier).

Finally, as an independent validation of the gene signature, an SVM classifier was built using the full signature discovery set as input and this classifier was employed to predict the class of the samples in the validation set, which had not been used in any of the previous steps to ensure an independent validation. See Supplementary Fig. 3 for a graphical representation of the signature discovery and validation procedures.

ROC curves for classifier performance were calculated and plotted using the pROC R package⁴⁵.

Gene function analysis

To explore the processes differentially regulated between RIF patient and controls gene set enrichment analysis (GSEA)⁴⁶ was performed using the generic GO slim subset of Gene Ontology (GO) terms⁴⁷ as gene sets (database version 2013-03-02). First we calculated the t-statistic for every gene by contrasting log2 fold changes of RIF patients against controls. Secondly, for every gene set/GO term, we calculated a gene set Z-score, based on the gene specific t-statistics, which enabled calculation of a two-sided P for every GO term from the standard normal distribution⁴⁸. Reported P values were corrected for multiple testing using Bonferroni correction.

Patient Stratification

During the iterative rounds of signature discovery we noticed that RIF patients in the signature discovery set were not all predicted equally well, with some consistently misclassified as controls, while others were correctly classified in most cases. To investigate this, the signature discovery procedure was repeated, only now including all patients and controls (n = 115) and using 1,000 resamplings instead of 100. In each iteration the outcomes of the predictions were recorded and an overall error rate was calculated (false predictions/number of predictions) for every patient. The patients were divided into three groups based on two, arbitrarily chosen, error rate thresholds (0.1 and 0.9).

Additional Information

Accession codes: All microarray gene expression data have been deposited in the public data repositories ArrayExpress (http://www.ebi.ac.uk/arrayexpress/, accession number E-MTAB-2591) and GEO (http://www.ncbi.nlm.nih.gov/geo, accession number GSE58144).

How to cite this article: Koot, Y. E. M. et al. An endometrial gene expression signature accurately predicts recurrent implantation failure after IVF. Sci. Rep. 6, 19411; doi: 10.1038/srep19411 (2016).

References

Polanski, L. T. et al. What exactly do we mean by ‘recurrent implantation failure’? A systematic review and opinion. Reprod. Biomed. Online 28, 409–423 (2014).
PubMed Google Scholar
Thornhill, A. R. et al. ESHRE PGD Consortium ‘Best practice guidelines for clinical preimplantation genetic diagnosis (PGD) and preimplantation genetic screening (PGS)’. Hum. Reprod. Oxf. Engl. 20, 35–48 (2005).
CAS Google Scholar
Simon, A. & Laufer, N. Repeated implantation failure: clinical approach. Fertil. Steril. 97, 1039–1043 (2012).
PubMed Google Scholar
Koot, Y. E. M., Teklenburg, G., Salker, M. S., Brosens, J. J. & Macklon, N. S. Molecular aspects of implantation failure. Biochim. Biophys. Acta 1822, 1943–1950 (2012).
CAS PubMed Google Scholar
Das, M. & Holzer, H. E. G. Recurrent implantation failure: gamete and embryo factors. Fertil. Steril. 97, 1021–1027 (2012).
PubMed Google Scholar
Simon, A. & Laufer, N. Assessment and treatment of repeated implantation failure (RIF). J. Assist. Reprod. Genet. 29, 1227–1239 (2012).
PubMed PubMed Central Google Scholar
Penzias, A. S. Recurrent IVF failure: other factors. Fertil. Steril. 97, 1033–1038 (2012).
PubMed Google Scholar
Koot, Y. E. M. & Macklon, N. S. Embryo implantation: biology, evaluation and enhancement. Curr. Opin. Obstet. Gynecol. 25, 274–279 (2013).
PubMed Google Scholar
Urman, B., Yakin, K. & Balaban, B. Recurrent implantation failure in assisted reproduction: how to counsel and manage. B. Treatment options that have not been proven to benefit the couple. Reprod. Biomed. Online 11, 382–391 (2005).
PubMed Google Scholar
Vanneste, E. et al. Chromosome instability is common in human cleavage-stage embryos. Nat. Med. 15, 577–583 (2009).
CAS Google Scholar
Revel, A. Defective endometrial receptivity. Fertil. Steril. 97, 1028–1032 (2012).
PubMed Google Scholar
Macklon, N. S. & Boomsma, C. M. in Early Pregnancy (Cambridge University Press, 2010) (Date of access:23/10/2015). at < http://dx.doi.org/10.1017/CBO9780511777851.021>
Koler, M. et al. Disrupted gene pattern in patients with repeated in vitro fertilization (IVF) failure. Hum. Reprod. 24, 2541–2548 (2009).
CAS PubMed Google Scholar
Ruiz-Alonso, M., Galindo, N., Pellicer, A. & Simón, C. What a difference two days make: ‘personalized’ embryo transfer (pET) paradigm: a case report and pilot study. Hum. Reprod. Oxf. Engl. 29, 1244–1247 (2014).
CAS Google Scholar
Brosens, J. J. et al. Uterine selection of human embryos at implantation. Sci. Rep. 4, 3894 (2014).
PubMed PubMed Central Google Scholar
Ruiz-Alonso, M., Blesa, D. & Simón, C. The genomics of the human endometrium. Biochim. Biophys. Acta 1822, 1931–1942 (2012).
CAS PubMed Google Scholar
von Grothusen, C., Lalitkumar, S., Boggavarapu, N. R., Gemzell-Danielsson, K. & Lalitkumar, P. G. Recent advances in understanding endometrial receptivity: molecular basis and clinical applications. Am. J. Reprod. Immunol. N. Y. N 1989 72, 148–157 (2014).
Google Scholar
Haouzi, D., Dechaud, H., Assou, S., De Vos, J. & Hamamah, S. Insights into human endometrial receptivity from transcriptomic and proteomic data. Reprod. Biomed. Online 24, 23–34 (2012).
CAS PubMed Google Scholar
Kao, L. C. et al. Global gene profiling in human endometrium during the window of implantation. Endocrinology 143, 2119–2138 (2002).
CAS PubMed Google Scholar
Mirkin, S. et al. In search of candidate genes critically expressed in the human endometrium during the window of implantation. Hum. Reprod. Oxf. Engl. 20, 2104–2117 (2005).
CAS Google Scholar
Riesewijk, A. et al. Gene expression profiling of human endometrial receptivity on days LH+2 versus LH+7 by microarray technology. Mol. Hum. Reprod. 9, 253–264 (2003).
CAS PubMed Google Scholar
Díaz-Gimeno, P. et al. A genomic diagnostic tool for human endometrial receptivity based on the transcriptomic signature. Fertil. Steril. 95, 50–60, 60.e1–15 (2011).
PubMed Google Scholar
Haouzi, D. et al. Gene expression profile of human endometrial receptivity: comparison between natural and stimulated cycles for the same patients. Hum. Reprod. Oxf. Engl. 24, 1436–1445 (2009).
CAS Google Scholar
Macklon, N. S., van der Gaast, M. H., Hamilton, A., Fauser, B. C. J. M. & Giudice, L. C. The impact of ovarian stimulation with recombinant FSH in combination with GnRH antagonist on the endometrial transcriptome in the window of implantation. Reprod. Sci. Thousand Oaks Calif 15, 357–365 (2008).
CAS Google Scholar
Horcajadas, J. A. et al. Controlled ovarian stimulation induces a functional genomic delay of the endometrium with potential clinical implications. J. Clin. Endocrinol. Metab. 93, 4500–4510 (2008).
CAS PubMed Google Scholar
Hauzman, E. E., Garcia-Velasco, J. A. & Pellicer, A. Oocyte donation and endometriosis: What are the lessons? Semin. Reprod. Med. 31, 173–177 (2013).
PubMed Google Scholar
Kao, L. C. et al. Expression profiling of endometrium from women with endometriosis reveals candidate genes for disease-based implantation failure and infertility. Endocrinology 144, 2870–2881 (2003).
CAS PubMed Google Scholar
Horcajadas, J. A. et al. Effect of an intrauterine device on the gene expression profile of the endometrium. J. Clin. Endocrinol. Metab. 91, 3199–3207 (2006).
CAS PubMed Google Scholar
Talbi, S. et al. Molecular phenotyping of human endometrium distinguishes menstrual cycle phases and underlying biological processes in normo-ovulatory women. Endocrinology 147, 1097–1121 (2006).
CAS PubMed Google Scholar
Garrido-Gómez, T. et al. Profiling the gene signature of endometrial receptivity: clinical results. Fertil. Steril. 99, 1078–1085 (2013).
PubMed Google Scholar
Bhagwat, S. R. et al. Endometrial receptivity: a revisit to functional genomics studies on human endometrium and creation of HGEx-ERdb. PloS One 8, e58419 (2013).
CAS ADS PubMed PubMed Central Google Scholar
Ruiz-Alonso, M. et al. The endometrial receptivity array for diagnosis and personalized embryo transfer as a treatment for patients with repeated implantation failure. Fertil. Steril. 100, 818–824 (2013).
PubMed Google Scholar
Díaz-Gimeno, P. et al. The accuracy and reproducibility of the endometrial receptivity array is superior to histology as a diagnostic method for endometrial receptivity. Fertil. Steril. 99, 508–517 (2013).
PubMed Google Scholar
Bartosch, C., Lopes, J. M., Beires, J. & Sousa, M. Human endometrium ultrastructure during the implantation window: a new perspective of the epithelium cell types. Reprod. Sci. Thousand Oaks Calif 18, 525–539 (2011).
Google Scholar
Bourdeau, V. et al. Mechanisms of primary and secondary estrogen target gene regulation in breast cancer cells. Nucleic Acids Res. 36, 76–93 (2008).
CAS PubMed Google Scholar
Georges, A. et al. FOXL2: a central transcription factor of the ovary. J. Mol. Endocrinol. 52, R17–33 (2014).
CAS PubMed Google Scholar
Richards, J. S. & Pangas, S. A. The ovary: basic biology and clinical implications. J. Clin. Invest. 120, 963–972 (2010).
CAS PubMed PubMed Central Google Scholar
Landelijke netwerkrichtlijn subfertiliteit. (2010) (Date of access:23/10/2015). at < http://nvog-documenten.nl/uploaded/docs/LandelijkenetwerkrichtlijnSubfertiliteitdef.pdf>
van Wageningen, S. et al. Functional overlap and regulatory links shape genetic interactions between signaling pathways. Cell 143, 991–1004 (2010).
CAS PubMed PubMed Central Google Scholar
Bolstad, B. M., Irizarry, R. A., Astrand, M. & Speed, T. P. A comparison of normalization methods for high density oligonucleotide array data based on variance and bias. Bioinforma. Oxf. Engl. 19, 185–193 (2003).
CAS Google Scholar
Leek, J. T., Johnson, W. E., Parker, H. S., Jaffe, A. E. & Storey, J. D. The sva package for removing batch effects and other unwanted variation in high-throughput experiments. Bioinforma. Oxf. Engl. 28, 882–883 (2012).
CAS Google Scholar
Smyth, G. K. in Bioinformatics and Computational Biology Solutions using R and Bioconductor 397–420 (Springer, 2005).
Golub, T. R. et al. Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. Science 286, 531–537 (1999).
CAS PubMed Google Scholar
Meyer, D., Dimitriadou, E., Hornik, K., Weingessel, A. & Leisch, F. e1071: Misc Functions of the Department of Statistics (e1071), TU Wien. (2014) (Date of access:23/10/2015). at < http://CRAN.R-project.org/package=e1071>
Robin, X. et al. pROC: an open-source package for R and S+to analyze and compare ROC curves. BMC Bioinformatics 12, 77 (2011).
PubMed PubMed Central Google Scholar
Subramanian, A. et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc. Natl. Acad. Sci. USA 102, 15545–15550 (2005).
CAS ADS Google Scholar
Ashburner, M. et al. Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat. Genet. 25, 25–29 (2000).
CAS PubMed PubMed Central Google Scholar
Irizarry, R. A., Wang, C., Zhou, Y. & Speed, T. P. Gene set enrichment analysis made simple. Stat. Methods Med. Res. 18, 565–575 (2009).
MathSciNet PubMed PubMed Central Google Scholar

Download references

Acknowledgements

The authors would first like to thank the patients who were willing to participate in this study, without whom this study wouldn’t have been possible. Furthermore we thank our colleagues from Meander Medical Center Amersfoort and Diakonessenhuis Utrecht for their help in including patients in the first cohort of the study and our colleagues, especially Tessa de Vries in the AMC for their help in including patients in the second cohort. The research was funded by the University Medical Center Utrecht (first cohort) and a GFI grant from Merck Serono (second cohort).

Author information

Koot Yvonne E. M., van Hooff Sander R., Holstege Frank C. P. and Macklon Nick S. contributed equally to this work.

Authors and Affiliations

Department of Reproductive Medicine and Gynaecology, University Medical Center Utrecht, Utrecht, The Netherlands
Yvonne E. M. Koot, Carolien M. Boomsma, Marinus J. C. Eijkemans, Bart C. J. M. Fauser & Nick S. Macklon
Molecular Cancer Research, University Medical Center Utrecht, Utrecht, The Netherlands
Sander R. van Hooff, Dik van Leenen, Marian J. A. Groot Koerkamp & Frank C. P. Holstege
Center for Reproductive Medicine, Academic Medical Center, University of Amsterdam, Amsterdam, The Netherlands
Mariëtte Goddijn
Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht, The Netherlands
Marinus J. C. Eijkemans
Human Development and Health, Faculty of Medicine, University of Southampton, Southampton, United Kingdom
Nick S. Macklon

Authors

Yvonne E. M. Koot
View author publications
You can also search for this author in PubMed Google Scholar
Sander R. van Hooff
View author publications
You can also search for this author in PubMed Google Scholar
Carolien M. Boomsma
View author publications
You can also search for this author in PubMed Google Scholar
Dik van Leenen
View author publications
You can also search for this author in PubMed Google Scholar
Marian J. A. Groot Koerkamp
View author publications
You can also search for this author in PubMed Google Scholar
Mariëtte Goddijn
View author publications
You can also search for this author in PubMed Google Scholar
Marinus J. C. Eijkemans
View author publications
You can also search for this author in PubMed Google Scholar
Bart C. J. M. Fauser
View author publications
You can also search for this author in PubMed Google Scholar
Frank C. P. Holstege
View author publications
You can also search for this author in PubMed Google Scholar
Nick S. Macklon
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Y.E.M.K. contributed to the design of the study, the acquisition, analysis and interpretation of data and drafted the manuscript. S.R.v.H. contributed to the design of the study, performed the analysis and interpretation of the data, made figures and drafted the manuscript. C.M.B contributed to the design of the study, acquisition and interpretation of the data and revised the manuscript. D.v.L. contributed to the microarray analysis. M.J.A.G.K. contributed to the microarray analysis. M.G. contributed to the acquisition of the samples and revised the manuscript. M.J.C.E. contributed to the design of the study, interpretation of the data and revised the manuscript. B.J.C.M.F. contributed to the design of the study, interpretation of the data and revised the manuscript. F.C.P.H. developed the concept for the study, contributed to the design of the study, interpretation of the data and revised the manuscript. N.S.M. initiated and developed the concept for the study, contributed to its design, interpretation of the data and revised the manuscript.

Ethics declarations

Competing interests

BCJM Fauser has received fees and grant support from the following companies: Organon, Schering Plough, Merck Serono, Ferring, Wyeth, Ardana, Andromed, Pantharei Bioscience and PregLem. NS Macklon has received fees and grant support from the following companies: Organon, Schering Plough, MSD, Anecova, IBSA, Merck Serono and Ferring. The other authors declare no competing financial interests.

Electronic supplementary material

Supplementary Information

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Koot, Y., van Hooff, S., Boomsma, C. et al. An endometrial gene expression signature accurately predicts recurrent implantation failure after IVF. Sci Rep 6, 19411 (2016). https://doi.org/10.1038/srep19411

Download citation

Received: 04 August 2015
Accepted: 11 December 2015
Published: 22 January 2016
DOI: https://doi.org/10.1038/srep19411

This article is cited by

The reproductive potential of vitrified-warmed euploid embryos declines following repeated uterine transfers
- A. Almohammadi
- F. Choucair
- Johnny T. Awwad
Reproductive Biology and Endocrinology (2024)
Embryo Transfer Strategies for Women with Recurrent Implantation Failure During the Frozen-thawed Embryo Transfer Cycles: Sequential Embryo Transfer or Double-blastocyst Transfer?
- Qiao-hang Zhao
- Yu-wei Song
- Min Jin
Current Medical Science (2024)
Uterine WNTS modulates fibronectin binding activity required for blastocyst attachment through the WNT/CA2+ signaling pathway in mice
- Yuefei Lou
- Laurie Pinel
- Daniel Dufort
Reproductive Biology and Endocrinology (2023)
Klinische Aspekte des Implantationsversagens
- Gregor Weiss
- Michael Schenk
Journal für Gynäkologische Endokrinologie/Österreich (2022)
Das intrauterine Mikrobiom – Schrödingers Katze der Reproduktionsmedizin
- T. K. Eggersmann
- N. Hamala
- G. Griesinger
Gynäkologische Endokrinologie (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Patients and samples

Expression profiles

Gene signature based prediction

Functional analysis of RIF endometrial gene expression

Further expression-based patient stratification

Discussion

Methods

Study design and tissue collection

Gene expression profiling

RNA isolation

Microarray hybridization

Data normalization

Data analysis

Signature discovery

Gene function analysis

Patient Stratification

Additional Information

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Ethics declarations

Competing interests

Electronic supplementary material

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links