Introduction

There are now multiple randomized controlled trials (RCTs), which have demonstrated meaningful improvements in the clinical outcomes of IVF with the incorporation of comprehensive chromosome screening (CCS) of aneuploidy1, 2, 3, 4 even if additional studies are necessary to determine which patients may or may not benefit from CCS.5 However, the RCT alone is incapable of determining the clinical negative predictive value of a CCS aneuploidy diagnosis, as the embryos diagnosed as abnormal are not transferred. Without such an evaluation, there remains a risk that reproductively competent embryos are erroneously discarded owing to technical errors (false positives) in the screening method itself.

For example, a previous study demonstrated that nearly 60% of blastocysts were chromosomally normal in four sections and for all 24 chromosomes despite a cleavage stage fluorescent in situ hybridization (FISH) aneuploidy diagnosis.6 This indicates that FISH is poorly predictive of aneuploidy as a result of a high rate of false positives when used on single cells. Several other methods of chromosome screening have demonstrated poor predictive value of an aneuploidy diagnosis based on finding normalcy later in development.7, 8 However, a more relevant assessment of the negative predictive value of a screening method is the ultimate clinical outcome after transfer of an embryo that would have been predicted as aneuploidy. Indeed, such a study has been conducted and demonstrated that an aneuploidy diagnosis by a specific CCS method provided 96% predictive value of a negative clinical outcome.9 Given that this single-nucleotide polymorphism (SNP) array-based CCS method has also been extensively evaluated for preclinical accuracy10 and clinical efficacy,4, 11 it represents a legitimate benchmark to evaluate new methods of CCS.

For example, quantitative real-time PCR (qPCR)-based CCS was recently developed to provide a more rapid and inexpensive method than array-based methods such as SNP arrays and array comparative genomic hybridization (aCGH). qPCR was evaluated for preclinical accuracy using cell lines with known abnormalities and embryos with prior array-based diagnoses.12 The results indicated 98–99% consistency with expected karyotypes of cell lines and embryos. In addition to having published preclinical accuracy, this method has also demonstrated clinical efficacy in two RCTs.2, 3

In contrast, aCGH CCS has not published preclinical accuracy using single cells isolated from cell lines with previously established karyotypes or blastomeres from embryos with prior validated CCS diagnoses. Instead, studies claiming validation of aCGH have used concordance with FISH,13, 14 despite the fact that FISH-based technology has failed to demonstrate clinical validity in multiple RCTs.15 A recent RCT of aCGH indicated improved clinical efficacy in good-prognosis patients.1 However, the reported aneuploidy rates were unusually high (45%) for patients with a mean maternal age of 32 years. The detriment of overdiagnosis of aneuploidy in younger good-prognosis patients may not be manifested within the first attempts at embryo transfer. However, in patients of advanced maternal age or patients with single-gene disorders performing also an aneuploidy screening cycle, the impact of false-positive aneuploidy may be more severe. These issues only reiterate the importance of determining the clinical negative predictive value and preclinical accuracy of new CCS methodologies before routine implementation. Such information is currently not available for aCGH.

As an alternative to performing a nonselection study of aCGH, the present study will provide a comparison of aCGH and qPCR, by using preponderance of evidence from three contemporary methods of CCS performed on the same embryos. Importantly, the use of three methods of analysis provides the added benefit of controlling for mosaicism within the embryo as both qPCR and aCGH are given the same chance of being influenced by biological variation within the embryo. Furthermore, in this study different contemporary CCS methods have been blindly used to perform 24-chromosome copy number analysis on multiple trophectoderm (TE) biopsies from the same embryos, providing the first unbiased assessment of the reliability of the blastocyst-stage aneuploidy screening approach.

Materials and methods

Study design and sample specification

This is a prospective double-blinded observational study performed between October 2012 and December 2013. The analysis was focused on TE biopsies (Figure 1) to compare qPCR-based 24-chromosome screening with the most widely used aCGH method in the preimplantation genetic screening (PGS) field. Blastocysts from 45 patients underwent TE biopsy at GENERA and aneuploidy screening by 24sure and BlueGnome (Cambridge, UK) aCGH at a commercial reference laboratory. Blastocysts (Supplementary Table 1) showing copy number aneuploidies by aCGH were warmed, re-biopsied, and blinded for evaluation by 4-h qPCR. Following the second biopsy, blastocysts were cryopreserved according to Italian law. Embryos showing discordant qPCR results were subsequently warmed again, rebiopsied, blinded, and evaluated by a third method of CCS based on SNP arrays. When SNP arrays confirmed one of the initial methods, the remaining method was considered to have given a discordant result. The impact of biological variation due to mosaicism in the embryo is specifically minimized given the known random distribution in the different TE samples from the same embryo.16 That is, mosaicism will similarly affect the two CCS methods under investigation. As this is an observational study where only aneuploid blastocysts were selected for reanalysis, the assessment of aCGH and qPCR diagnostic performances was performed on a per chromosome basis.

Figure 1
figure 1

Flow chart of phase 1 preclinical validation of aCGH and qPCR-based aneuploidy screening of trophectoderm biopsies.

To corroborate initial findings and to potentially include in the analysis also euploid embryos, 39 blastocysts were simultaneously analyzed using aCGH and qPCR. Following diagnosis, discordant cases were blinded and reanalyzed using SNP array on a second TE biopsy and results were evaluated as previously described.

The Ethics Committee of Clinica Valle Giulia approved the study and signed informed consent for the chromosome reanalysis of aneuploid blastocysts was obtained from all patients who underwent a PGS cycle at GENERA, Reproductive Medicine Center. IRB approval for analysis of discarded material from the second phase was obtained from western IRB.

IVF procedures

During ICSI/PGS cycles fertilized oocytes were sequentially cultured in separate 35-μl microdrops (Sage In-Vitro Fertilization, Inc., Trumbull, CT, USA) up to blastocyst stage (Day 5/6) in a humidified atmosphere containing 5% O2 and 6% CO2. Expanding and expanded blastocysts underwent biopsy of TE cells and cryopreservation on day 5. Cavitating morulas were transferred to a fresh individual 35-ml drop of blastocyst medium (Quinn's Advantage Blastocyst Medium, Cooper Surgical, Trumbull, CT, USA) with 15% serum protein substitute (Quinns Advantage Serum Protein Substitute, Cooper Surgical, Trumbull, CT, USA) and biopsy was attempted 24 h later on day 6 or 48 h later on day 7. Blastocyst quality was assessed immediately before TE biopsy, defined according to the criteria presented by Gardner and Schoolcraft.17 Biopsied blastocysts were vitrified according to the protocol described by Nagy et al.18 Aneuploid blastocysts were selected for reanalysis and thawed for a second TE biopsy. All the TE biopsy procedures were performed on the heated stage of a Nikon IX-70 microscope (Nikon instruments, Campi Bisenzio FI, Italy), equipped with micromanipulation tools, in dishes prepared with three droplets of 10 ml of HEPES-buffered medium (Sage In-Vitro Fertilization, Inc.) overlaid with pre-equilibrated mineral oil. A diode laser (Research Instruments, Cornwall, UK) was used to assist the opening of a 10–20 μm hole in the zona pellucida. The biopsy procedure was performed as previously described.19 5–10 TE cells were aspirated into the TE biopsy pipette (Research Instruments) followed by laser-assisted removal of the target cells from the body of the embryo. TE rebiopsies on aneuploid embryos were carried out following the same protocols used for TE biopsies during clinical PGS cycles. All TE biopsies were washed in sterile phosphate-buffered saline (PBS) or in hypotonic solution (depending on the CCS method) in a laminar flow cabinet to avoid any contamination of the sample, placed in microcentrifuge tubes to be further processed for CCS.

Genetic testing methods

Each method of CCS was performed using protocols that are in clinical use by each of the respective laboratories. The initial diagnosis obtained by aCGH was performed by a commercial reference laboratory according to the manufacturer’s protocol (BlueGnome) and as previously described.20 The second analysis was performed using qPCR-based CCS as previously described.12 SNP array-based CCS was performed as previously described.10 qPCR-based CCS has undergone multiple phases of validation including the evaluation of qPCR results of cells from cell lines, which had already had conventional G-banding. The same approach was initially used for the SNP array method used in the present study.10 In addition, qPCR results were compared with those obtained by SNP array-based CCS on the same embryos. While the results were highly concordant, they did not impact the way in which qPCR or SNP array is performed and therefore there is no reason to expect that qPCR would be any more similar to SNP array than to aCGH, as they are all independent methods.

Statistical analysis

Numerical and categorical data are reported as mean and proportion with 95% confidence interval (CI). Consistency between the three CCS methods used on the same embryos was evaluated at the level of individual chromosome copy numbers for the entire 24 chromosomes of each sample tested and for the overall diagnosis of aneuploidy or euploidy. Discordance was reported when a CCS method did not match with the other two. In particular, discordant aneuploidy call rate was defined as aneuploidy detected for a specific chromosome from one CCS method but not confirmed from the other two CCS methods. Discordant euploidy call was reported when a normal chromosomal segregation was detected from one CCS method but the other two CCS methods reported the same aneuploidy for that chromosome. The diagnostic sensitivity and specificity of qPCR and aCGH based on concordance between the two CCS methods were measured on a per chromosome basis as the proportion of actual positives, which are consistently identified as such, and the proportion of negatives, which are consistently identified. Continuous data and categorical variables are presented as mean and percentage frequency with 95% CI, respectively. A Fisher’s exact test was used to assess differences between categorical variables. Alpha was set at 0.05.

Results

One hundred and twenty-four aneuploid blastocysts were selected from 45 patients who underwent a single PGS cycle with aCGH-based CCS (Figure 1). Detailed information about patients basal characteristics and PGS cycle outcomes are reported in Table 1. One hundred and twenty two (98.4%; 122/124; 95% CI 94.3–99.8) blastocysts survived the first warming procedure and after re-expansion were biopsied for a TE sample and blinded for qPCR analysis. Two samples gave no result following qPCR and were excluded from subsequent analysis. Interpretable qPCR results were obtained in 98.4% (120/122; 95% CI 94.2–99.8) of the TE samples. Although 81.7% (98/120; 95% CI 73.6–88.1) of embryos showed the same diagnosis when comparing aCGH and qPCR-based full karyotype analysis, 18.3% (22/120; 95% CI 11.9–26.4) of embryos were discordant for at least one chromosome. Blastocysts showing a discordant result were warmed again and a third TE biopsy was performed on the 21 embryos surviving the second warming (95.4%; 21/22; 95 %CI 77.2–99.9) for blinded SNP array reanalysis. SNP array reanalysis of the 21 embryos yielded a conclusive diagnosis on 18 cases (85.7%; 95% CI 66.7–96.9). In four of these embryos (3.3%; 4/120; 95%CI 0.9–8.3) all three CCS methods did not match when considering the full karyotype (Table 2) and were interpreted as mosaic aneuploid embryos. In 10 embryos (8.9%; 10/112; 95% CI 4.4–15.8) an inconsistent diagnosis was reported for aCGH, mostly due to discordant aneuploidy calls for aCGH. SNP array reanalysis revealed 10 discordant aneuploidy calls and 2 discordant euploidy calls at the single chromosome level by aCGH in these 10 embryos (embryo number 5, 16, 30, 81, 102, 110, 3, 11, 22, and 45; Table 2). These errors resulted in a discordant diagnosis of aneuploidy in the original aCGH diagnosis for six embryos (5.4%; 6/112; 95% CI 2.0–11.3), where euploidy was diagnosed by both qPCR and SNP array. Five of these six discordant embryos were originally diagnosed by aCGH as single aneuploid, whereas one was diagnosed as double aneuploid (the profile plots of three of them are shown in Figure 2). An inconsistent diagnosis was reported for qPCR in the remaining four cases (3.6%; 4/112; 95% CI 0.1–8.9). Three discordant euploidy and one discordant aneuploidy were observed at the individual chromosome level. However, none of the qPCR discordances at the individual chromosome level translates in a diagnosis conversion from euploid to aneuploid or vice versa (0%; 0/112; 95% CI 0.0–3.2).

Table 1 Baseline characteristics of patients involved in the study and PGS cycles outcome
Table 2 Discordant cases between aCGH and qPCR with reassessment of diagnosis performed by a third trophectoderm biopsy and SNP array based CCS
Figure 2
figure 2

Profile plot comparison of some discordant cases between aCGH and qPCR with confirmation of diagnosis performed on a third biopsy and SNP array-based CCS. (a) False-positive and -negative chromosome error rates according to CCS methods under investigation. (b) Profile plots of aCGH showing a monosomy for the chromosome 18 for that embryo, shown to be normal by qPCR and SNP array-based CCS follow-up analysis. (c) aCGH profile plot consistent with a trisomy for chromosome 22 for that embryo, shown to be normal by qPCR and SNP array-based CCS follow-up analysis. (d) Profile plots of aCGH showing a double trisomy for chromosome 15 and 17 for that embryo, shown to be trisomy 15 by qPCR and SNP array-based CCS follow-up analysis.

Looking at the single-chromosome level, 174 and 164 aneuploidies were reported by aCGH and qPCR, respectively. A consistent chromosome copy number diagnosis was observed in 99.4% (2561/2576; 95% CI 99.0–99.7) of the 2576 chromosomes analyzed. In this per-chromosome analysis, the discordance aneuploidy call rate was significantly higher for aCGH (5.7%; 10/174; 95% CI 2.8–10.3) compared with qPCR (0.6%; 1/164, 95% CI 0.0–3.3; P<0.01). The two CCS methods showed similar discordant euploidy call rates (0.1%; 2/2402, 95% CI 0.0–0.3 for aCGH and 0.1% 3/2412, 95% CI 0.0–0.4 for qPCR, NS). qPCR and aCGH performed similarly in terms of sensitivity (98.2%, 163/166, 95% CI 94.8–99.6 vs 98.8%, 164/166, 95% CI 95.7–99.8, respectively, NS), whereas qPCR displayed a significantly higher specificity compared with aCGH (99.9%, 2409/2410, 95% CI 99.8–100 vs 99.6%, 2412/2422, 95% CI 99.2–99.8, respectively, P=0.01).

To corroborate these findings, 39 embryos were simultaneously analyzed by aCGH and qPCR during blastocyst-stage CCS cycles (Supplementary Table 2). Thirty-five matched including all 21 euploid embryos. The four that did not match had two aneuploid chromosomes by aCGH, one by qPCR. Blinded SNP analysis on rebiopsies of the four embryos matched qPCR (Table 3).

Table 3 Four embryos simultaneously biopsied for aCGH and qPCR during blastocyst-stage CCS cycles and giving discordant diagnosis, then reassessed by aSNP-based CCS performed on a third trophectoderm biopsy in the second phase confirmation analysis

Considering all embryos analyzed in the two phases of the study, a consistent chromosome copy number diagnosis from aCGH and qPCR on two different TE biopsies was reported for 99.86% (3468/3473; 95% CI 0.99–1) of the chromosomes analyzed. The discordant aneuploidy call rate was significantly higher for aCGH (7.0%; 14/201; 95% CI 3.9–11.4) compared with qPCR (0.5%; 1/188, 95% CI 0–2.9). The two CCS methods showed similar discordant euploidy rates (0.1%; 3/3272, 95% CI 0–0.3 for aCGH and 0.1%, 3/3285, 95% CI 0–0.3 for qPCR, NS).

Discussion

This is the first prospective double-blinded observational study performed to evaluate the diagnostic performances of two contemporary technologies for CCS on TE biopsies in blastocyst-stage PGS cycles.

Because the presence of mosaicism in blastocysts as a result of post-zygotic mitotic aneuploidy development represents a well-documented phenomenon that could contribute to biologic variation in blastocysts,5, 16 the preponderance of evidence from three methods was used. Because mosaicism is randomly distributed to all three methods, one can justifiably attribute the observed overall differences in consistency between the methods to inherent inaccuracies in the technologies rather than to biological variation in the embryo. Furthermore, because each method was applied in its actual clinical setting, one can consider the results to be representative of actual clinical performance.

Overall, the analysis on a per chromosome basis of these blastocysts demonstrated a high concordance level between aCGH- and qPCR-based CCS, where 99.86% of chromosome predictions were consistent between the two methods applied on different TE biopsies from the same embryo. Even if the analysis of the whole TE sample cannot identify chromosomal non-disjunction events where a mix of trisomic and monosomic cells for the same chromosome is present, results from this study corroborate previous findings showing no major impact of chromosomal mosaicism and high diagnostic reliability when testing at the blastocyst stage.6, 16 Only in four embryos all three methods did not match for all chromosomes, suggesting the presence of a true mosaicism in those samples and making it difficult to estimate the actual embryonic karyotype.

However, the SNP-based reanalysis of discordant cases highlighted a significantly higher discordant aneuploidy call rate from aCGH compared with qPCR-based CCS. Even if this is an end-point analysis and this study was not designed to investigate the reason behind a discordant chromosome aneuploidy prediction, some hypotheses can be formulated to explain the higher discordant aneuploidy detection rate from aCGH compared with qPCR.

One possibility is the lack of chromosome-specific cutoffs for predicting aneuploidy from aCGH data. Instead, a universal cutoff is used to assign aneuploidy status to all chromosomes, possibly leading to false-positive diagnoses. Given that WGA (in this case Sureplex PCR-based WGA) may introduce chromosome-specific amplification bias owing to the unique GC content of each chromosome, there may be a need to incorporate chromosome-specific thresholds for diagnosis of aneuploidy by aCGH. In contrast, qPCR-based CCS uses chromosome-specific cutoffs established from actual data obtained from embryos with previously identified aneuploidies, thus providing the potential for more precise chromosome-specific diagnoses.

Another possible contribution to the higher aneuploidy rate observed for aCGH is related to the inherent limitations associated with competitive DNA hybridization. aCGH was originally developed to investigate the genetics of cancer. In this more conventional setting, the DNA is allowed to hybridize to the arrays for ~72 h, and is applied to relatively large quantities of DNA. In contrast, the protocol established for aCGH in the PGS setting is significantly shortened to 4 h and starts with far less than the recommended quantity of DNA. This represents a dramatic change from the conventional application of aCGH technology. In contrast, qPCR was originally established to allow for rapid completion (~1–2 h) and when using limited starting material, making its adaptation to the PGS setting far less dramatic.

Avoiding false-positive results in the aneuploidy screening is of particular importance for patients of advanced maternal age or with single-gene disorders performing also PGS, where fewer reproductively competent or otherwise genetically normal embryos may be available per cycle. Furthermore, minimizing false-positive diagnoses is critical to obtaining the same pregnancy rate per cycle when comparing cumulative outcomes of PGS cycles to IVF cycles without PGS. Even if at a lower rate, there remains a risk that reproductively competent embryos are erroneously discarded owing to technical errors (false aneuploidy calls) in the screening method itself when poorly validated chromosomal screening methods are extensively applied in the clinical practice of PGS.

These results reiterate the importance of establishing the ability of an aneuploid diagnosis of new CCS methods to predict a negative clinical outcome. Until such time as aCGH methods are appropriately evaluated for their clinical negative predictive value, the present study represents the only available comparative evidence and indicates that the application of aCGH in the clinical setting can be expected to result in a higher aneuploidy rate than other contemporary and better validated methods of CCS.

Furthermore, owing to the high overall consistency of chromosome diagnosis reported in this study using different CCS methods on multiple TE biopsies from the same embryos, blastocyst-stage aneuploidy screening proved to be a highly reliable and effective approach for PGS.