Main

Microarray-based comparative genomic hybridization (array CGH) testing has been shown to be efficacious in the diagnostic setting and is being rapidly incorporated as an adjunct to traditional cytogenetic techniques in clinical diagnostic laboratories.1,2 Although array CGH testing has diagnostic value, this technique is highly complex, involving hundreds to thousands of individual probes targeted to various regions of the genome. Appropriate validation for platforms of this complexity is difficult at best and has not been standardized across laboratories. Few, if any, validation studies have been published based on commercially available array CGH platforms. To address the lack of these studies, we have undertaken a large, blinded comparison of a commercially available array CGH platforms with fluorescence in situ hybridization (FISH) for the detection of subtelomeric abnormalities.

Cryptic deletions and duplications in the subtelomeric regions of chromosomes have been implicated in more than 2.5% of cases of idiopathic mental retardation, dysmorphism, and developmental delay.3 Diagnostic testing for these abnormalities has become a widely used adjunct to routine cytogenetic and molecular genetic testing in this group of patients. Currently, the detection of such subtelomeric abnormalities is most frequently performed using FISH with chromosome-specific telomere probes.4 However, subtelomere FISH testing is time-consuming and labor-intensive and provides only limited information regarding the size of any detected abnormality. Array CGH has the potential to decrease the time and labor involved in this testing while providing more complete information regarding the nature and extent of these anomalies.5,6 The ability of array CGH to detect copy number changes in subtelomeric rearrangements has been described.7,8 However, no comprehensive assessment of the utility of array CGH as a replacement for subtelomere FISH studies in a clinical setting has been reported.

Herein, we report the blinded comparison of a commercially available targeted bacterial artificial chromosome (BAC) array CGH platform (Spectral Genomics Constitutional Chip 3.0; Spectral Genomics, Inc., Houston, TX) to standard subtelomere analysis by FISH in a clinical setting. Sixty-seven patients with previously identified subtelomeric deletions and duplications representing the vast majority of the 41 clinically relevant sites were studied, and the sensitivity and specificity of this array CGH platform is reported. To our knowledge, this is the most comprehensive analytic validation of any commercially available array CGH platform reported to date.

MATERIALS AND METHODS

Patients and sample preparation

A database of patient samples with previously identified subtelomeric abnormalities identified by subtelomere FISH analysis (ToTelVysion, Abbott Molecular, Des Plaines, IL) was queried to identify abnormal patient samples representing both a deletion and a duplication at all 41 clinically relevant subtelomeric loci. Where multiple samples were available for a particular abnormality, samples that were normal by chromosome analysis or those with the most subtle chromosomal rearrangements were preferentially chosen to minimize detection bias due to large rearrangements. Controls were selected from specimens that were reported as normal by subtelomere FISH analysis. Genomic DNA from fixed cell pellets was extracted using a QIAamp DNA Mini kit (Qiagen, Valencia, CA). Genomic DNA from products of conception (POC) specimens was extracted using standard phenol-chloroform extraction procedures. DNA was quantified using a ND-1000 spectrophotometer (NanoDrop Technologies, Wilmington, DE). Informed consent and approval from the Mayo Clinic Institutional Review Board was obtained before initiating this study.

Array CGH

Following the manufacturer's protocol, 1 μg each of test and opposite-gender control DNA was labeled by random priming using Cy3 and Cy5 dyes and buffers (Spectral Genomics, Inc.). A reciprocal labeling reaction in opposite fluorochromes was concurrently performed. The labeled DNA was hybridized to a Spectral Genomics Constitutional Chip 3.0 Human BAC array slide at 37°C for 16 hours. Slides were washed in 0.5% sodium dodecyl sulfate, 50% formamide, 0.1% Igepal, 0.2xSSC, and 80% ethyl alcohol and dried with a nitrogen gas stream. Slides were scanned and TIFF images acquired using a GenePix 4000B scanner (Molecular Devices, Union City, CA). Raw array data were generated with GenePix Pro V6.0.1.26 software (Molecular Devices, Union City, CA). Array clones with an SD of triplicates >0.2 were investigated closely for background and other array artifacts and edited to remove outliers. Clones with only one remaining replicate were failed. Data were normalized using global linear regression, and fluorescence intensity ratios of Cy5/Cy3 were plotted for all array clones in two parallel dye-swap experiments with SpectralWare V2.2.40 (Spectral Genomics, Inc., Houston, TX). A priori thresholds of copy number change were set at 0.8 and 1.25 (log2 ≈ ±0.3) for the detection of deletions and duplications, respectively. Clones exceeding thresholds for both labeling reactions were denoted as gain or loss.

FISH verification

In regions where gain or loss was expected, BAC clones that did not exceed the thresholds in one or both labeling reactions were studied by FISH. BAC DNA provided by Spectral Genomics was labeled by nick translation (Abbott Molecular). Performance of the FISH probes was evaluated on five normal male controls to document appropriate chromosomal location and to detect any cross-hybridization. Metaphase and/or interphase FISH was then performed on the abnormal specimen to evaluate the copy number status. In addition, poorly performing clones were checked for low copy repeat (LCR) content using the University of California Santa Cruz genome browser (http://genome.ucsc.edu),9 and the scanned array image was reviewed to rule out poor clone spotting or other technical artifacts that could explain suboptimal detection of copy number change.

RESULTS

Sixty-seven patients with subtelomere anomalies previously confirmed by FISH studies and 10 normal controls were selected for a blinded analysis using the Spectral Genomics Constitutional Chip 3.0 Human BAC array. The results of previous cytogenetic studies used to characterize the abnormalities in these patients are listed in Table 1. The 67 patients harbored a total of 61 subtelomeric deletions and 45 subtelomeric duplications (Fig. 1). At least one sample was tested for each of the 41 clinically relevant subtelomeric sites. Of the 82 possible subtelomeric abnormalities (both deletion and duplication at 41 loci), 74 were tested. Patient samples were not available to test the following subtelomeric abnormalities: 1pter+, 2pter−, 8qter−, 16qter−, 17pter+, 19pter−, 19qter−, and 21qter+.

Table 1 Patient samples included in the blinded study
Fig 1
figure 1

Schematic representation of the type and location of the subtelomeric abnormalities in the samples tested. Red bars indicate deletion. Blue bars indicate duplication.

Overall, the arrays performed quite well. The DNA spots on the chips were consistent and had very few failed spots due to printing irregularities, chip scratches, or artifactual material on the slides. Technically, the DNA labeling, hybridization, and washing steps were straightforward and robust. Only one experiment failed to produce acceptable data due to poor DNA quality. This sample with a 16qter deletion was removed from the blinded study and replaced with a similar sample (Table 1, sample 77) in a nonblinded fashion. The quality of the scanned data were highly interpretable with high signal/noise and signal/background ratios. Few false-positive or uninterpretable data points were encountered. Software analysis with SpectralWare V2.2.40 was fast and the data generated by the software was acceptable and without major flaws.

The samples were unblinded after completion of the array CGH experiments. All 10 normal controls were correctly interpreted as normal. In the 67 patient samples, abnormalities at 72 of the 74 (97%) clinically relevant loci tested demonstrated at least one abnormal clone on the array and were concordant with previous cytogenetic studies (Table 1). An additional subtelomeric abnormality not identified by FISH in previous studies was identified in sample 27, which demonstrated a 9pter deletion in addition to the expected 4qter duplication. Two patients with 20q deletion or duplication previously detected by FISH (Table 1, samples 47 and 59) did not demonstrate a 20q abnormality on the array (Fig. 2a). In addition, the most distal 20q subtelomere clone (GS-81-F12) was normal in another patient with a larger 20q duplication (Table 1, sample 9; array data not shown). To investigate the apparent failure of clone GS-81-F12, DNA from a POC specimen with trisomy 20 confirmed that GS-81-F12 does not detect copy number changes on chromosome 20q (Fig. 2, B). In addition, two patients with 12p subtelomere abnormalities (Table 1, Patients 18 and 74) showed GS-81-F12 deviation simultaneously with 12p deletion or duplication (data not shown). Array CGH data from a POC specimen with trisomy 12 confirmed that the DNA spotted on the array for GS-81-F12 is actually derived from a clone on chromosome 12p (Fig. 2c).

Fig 2
figure 2

Determination that the incorrect DNA was spotted on the array for the terminal subtelomeric clone on chromosome 20q. (A) Failure to detect the 20q subtelomere abnormality in sample 59. (B) GS-81-F12 does not deviate in a POC specimen with trisomy 20. (C) POC specimen with trisomy 12 demonstrating duplication of GS-81-F12 on chromosome 20q. POC, products of conception.

Although no other subtelomeric abnormalities in the patient samples included in this study were completely undetected by the array, several additional clones on the array did not perform as expected and could have led to ambiguous or misleading clinical interpretations (Tables 2 and 3). All clones listed in Tables 2 and 3 were investigated with additional FISH studies and/or array experiments with POC specimens harboring appropriate trisomies to clarify the anomalous array results. Table 2 lists additional subtelomeric region clones where the DNA spotted on the array is actually derived from a clone on a different chromosome. In total, DNA from at least seven clones spotted on the array did not represent the actual clone or chromosomal location that was designated for this chip.

Table 2 Mismapped clones
Table 3 Poor performing clones

Table 3 lists clones that did not perform appropriately in some or all array experiments. In each case, the clones in question harbor significant LCR sequences that likely dampen the ability of these clones to detect copy number differences and contribute to their poor performance in the array experiments. FISH experiments performed for this study and reported in previous studies indicate that six of these eight clones cross-hybridize to other regions of the genome.4 For example, the chromosome 16 array plot for sample 49 (Table 1) demonstrated a large 16p duplication (Fig. 3, A). However, the distal clone on this chromosome (RP11-568F1) did not show the duplication. Follow-up FISH studies with DNA from this clone demonstrated that this clone hybridized as expected to the appropriate location on both normal chromosomes 16 and to the derivative chromosome 8. However, this clone also hybridized to numerous other subtelomeric loci (Fig. 3, B). The significant proportion of LCR sequence contained in this clone likely explains the observed cross-hybridization signals (Fig. 3, C).

Fig 3
figure 3

Example of a poorly performing clone and the effect of low copy repeat (LCR) structure on the ability of array comparative genomic hybridization (CGH) to detect copy number changes. (A) Array CGH plot of sample 49 showing 16pter duplication. The terminal subtelomere clone RP11-568F1 does not demonstrate duplication. (B) Fluorescence in situ hybridization using clones RP11-568F1 (labeled in red) and GS-121-I4 (labeled in green) on 16pter to produce yellow fusion signals in cells from Patient 49. Arrows indicate fusion signals on the two normal chromosomes 16 and the derivative chromosome 8. Significant cross-hybridization of the RP11-568F1 (red) probe to multiple additional chromosomal locations in both metaphase and interphase was present. (C) UCSC Genome Browser representation of the genomic region covered by clone RP11-568F1. Brackets indicate LCR sequences present within this clone and on chromosomes 1, 2, 9, 15, 19, X, and Y.

Copy number variants (CNVs) are very common in the human genome and can present interpretive difficulties if included on an array CGH chip.10,11 Several CNVs were detected in the 77 individuals tested in this study (Table 4) and their frequencies ranged from 2.6% to 35.1%. All but one of these CNVs have been reported as variants.8,12–15

Table 4 Observed CNVs in 77 patient specimens

DISCUSSION

Publications on the validation of commercially available array CGH products for use in clinical laboratories have been essentially nonexistent in the literature. In this study, we attempted to validate the Spectral Genomics Constitutional Chip 3.0 Human BAC array for use as a replacement for subtelomere FISH analysis in clinical testing. Sixty-seven patients with abnormalities defined by chromosome analysis and a subtelomere FISH panel and 10 normal controls were selected for a blinded analysis to determine the efficacy of this array platform. In total, 72 of the 74 subtelomeric abnormalities tested on the chip were detected by the array yielding an overall analytical sensitivity of 97%. No false-positive results were obtained. The array also detected a 9pter deletion that was not observed by clinical FISH testing, indicating that neither platform is 100% sensitive for the detection of subtelomeric rearrangements.

The results of this blinded study demonstrated that the Spectral Genomics Constitutional Chip 3.0 Human BAC array system was technically capable of producing robust, reliable, and consistent data for the detection of subtelomeric abnormalities. However, several of the clones representing the subtelomeric regions on the array were identified as problematic and could have led to false-negative results or misleading clinical interpretations.

In the process of completing this study, seven clones were identified in which the DNA spotted on the array did not represent the correct clone (Table 2). These spots represent DNA from unknown clones in completely different chromosomal locations. In addition to the mismapped clones, eight clones performed poorly in samples containing abnormalities in the regions represented by the clones (Table 3). All of these clones contain significant amounts (>30%) of LCR sequences. The LCR sequences within these BAC clones are highly homologous to other regions of the genome, which dampens the ability of these clones to detect copy number differences and contributes to their suboptimal performance. The LCR sequences also result in significant cross-hybridization signals in FISH experiments, complicating FISH verification of abnormal results. These results demonstrate that clones used for array CGH should be carefully selected to avoid significant LCR sequences that mask bona fide abnormalities and contribute to interpretive difficulties.

Mismapped or poorly performing clones on the array could lead to false-negative results or misinterpretation of the abnormalities they are intended to detect. For the terminal clones on the chromosome, the results may be misinterpreted as an interstitial deletion rather than a terminal deletion or sufficiently small abnormalities could be missed altogether. The mismapped clone at 20qter (GS-81-F12) led to the two false-negative results detected in this study. Based on the relative positions of the 20q subtelomere clones on the array, 20q subtelomere abnormalities less than approximately 1 Mb in size will not be detected by this array. In a clinical setting, the use of this array without appropriate 20q subtelomere coverage could lead to a completely negative result (as in sample 47) or to the interpretation of an unbalanced translocation as a deletion (as in sample 59). The second situation is particularly concerning in a clinical setting. The parents of a child with an unbalanced translocation would likely be screened for a balanced form of the translocation, the results of which would have implications for recurrence risks in future pregnancies, whereas the parents of a child with a deletion may not have follow-up studies due to relatively low recurrence risks.

A second example highlights another clinically significant subtelomere abnormality that would be missed by this array. The two most distal clones at 6qter on the chip are mismapped (GS-57-H24, Table 2) or LCR rich and perform poorly (RP1-191N21, Table 3). Based on the positions of these clones, deletions or duplications smaller than approximately 1.7 Mb would appear normal on the array. This presents a significant diagnostic problem because patients with 6qter deletions smaller than 1.7 Mb have a clinical phenotype.16

Finally, abnormalities involving mismapped or poorly performing clones that are proximal to the most terminal clones could be misinterpreted as noncontiguous deletions or duplications. For example, the third and fourth clones from the 7q subtelomere (RP11-324E12 and RP11-11B21, Table 2) are both mismapped, leading to confusing and potentially misleading results.

The results of these studies indicate that although this array CGH platform is capable of producing very robust data, additional work needs to be done to implement more effective clone selection, quality control, and validation processes. The selection of clones for inclusion on an array CGH platform should include a process to exclude clones containing LCR sequences using available online resources (http://genome.ucsc.edu/; http://humanparalogy.gs.washington.edu/structuralvariation/; http://projects.tcag.ca/humandup/). Our data demonstrate that clones with as little as 30% LCR can have significant effects on clone performance. Therefore, clones with even minimal LCR content have the potential to be affected, and we suggest using only clones with <10% LCR content for inclusion on an array CGH platform. Quality control processes to ensure that the appropriate DNA is spotted on the slides are also critical for optimal array performance and appropriate clinical interpretation. At a minimum, DNA from each clone should be end-sequenced, PCR verified, or FISH verified before spotting on the chip to eliminate mismapped clones. Finally, initial array CGH experiments to verify that clones are sensitive to dose changes should be performed. The use of specimens with monosomy or trisomy of various chromosomes was very helpful in this regard (POCs with autosomal trisomies and patients with Turner, Klinefelter, or XYY syndrome). In summary, the appropriate experiments should be performed to identify and remove clones that do not perform appropriately or do not hybridize to the appropriate genomic locations.

Few, if any, studies have been published validating the performance of commercially available array CGH platforms. This presents particular difficulties in the context of the Standards and Guidelines for Clinical Genetic Laboratories document regarding microarray analysis for constitutional cytogenetic abnormalities (Shaffer et al., see page 654 in this issue). These recommendations by the Laboratory Quality Assurance Committee of the American College of Medical Genetics provide specific recommendations for clinical laboratories beginning to use array CGH technology. The guidelines suggest that for an investigational use only product intended for use as an adjunct to traditional cytogenetic analysis (such as the platform evaluated in this study), each laboratory should validate the array platform with a series of normal and abnormal specimens to address the regions of the genome represented on the array. A minimum of 30 abnormal specimens is suggested for this validation. However, these guidelines do not address initial and, in our view, critical comprehensive validation studies that the platform and the probes included on the platform perform appropriately, particularly those products that are commercially available and labeled as investigational use only. These guidelines also do not discuss the roles and responsibilities of the manufacturer providing these platforms before the validation studies suggested by the guidelines.

We propose two levels of validation before clinical implementation of any array CGH platform is considered. First, the company providing the array performs initial validation that includes (1) verification that each probe on the chip represents the intended genomic region (using end-sequence, polymerase chain reaction, or FISH techniques) and provides these data to the end users and (2) comprehensive verification of probe performance through the analysis of normal and abnormal specimens. It is recognized that companies marketing array CGH chips may not have formal affiliations with CLIA (Clinical Laboratory Improvement Amendments) approved clinical laboratories and may not have access to appropriate clinical samples for validation of probe performance. In this case, a consortium of clinical laboratories may be necessary to appropriately validate array platforms since a single laboratory may not have access to all the necessary specimens. These specimens may include samples with known aneuploidy, syndrome-specific, or subtelomeric abnormalities and peer-reviewed publication of the initial validation of probe performance so that all laboratories can access and rely on these data.

The second level of preclinical validation involves the laboratory intending to use a particular array CGH platform for clinical testing. This laboratory should perform additional validation, as outlined in the ACMG guidelines, primarily to demonstrate performance characteristics and proficiency with the particular platform being implemented. These guidelines include running a series of normal and abnormal controls, blinded sample exchange with another laboratory, and various other aspects of analytic standards and quality control.

In summary, the utility of array CGH has been demonstrated in multiple publications and is rapidly being implemented for routine use by clinical laboratories. However, guidelines for the appropriate validation of array CGH platforms have lagged behind this technology. The results presented in this study demonstrate that although the platform tested produces reproducible and robust data, there are clone selection and platform validation problems that should be addressed to avoid false-negative or misleading results in the clinical setting. These data also highlight potential deficiencies in the American College of Medical Genetics guidelines for array CGH platform validation. Appropriate manufacturer validation of these highly complex assays is necessary to ensure the highest quality of clinical testing, a problem that will only become more complex as whole-genome platforms based on high-density oligonucleotides begin to be used as a replacement for established cytogenetic and FISH testing.