Main

The use of microarrays for the assessment of copy number changes (CNCs) in individuals with constitutional disorders is becoming widespread in clinical cytogenetic laboratories. Although this technology has significantly increased the detection of chromosome imbalances that are known to be associated with abnormal phenotypes, the finding of CNCs in healthy individuals (also known as copy number variants or CNVs) has created new challenges for laboratorians and clinicians who must interpret the clinical significance of previously undescribed CNCs. In clinical laboratories using high-density oligonucleotide or single nucleotide polymorphism arrays, multiple CNCs can be detected in every patient. Although some of these CNCs are clearly not responsible for a patient's abnormal phenotype based on the frequency with which they are detected in normal control populations, determination of the clinical significance of individually rare CNCs can be much more difficult.

Recently, approaches for evaluating the pathogenicity of CNCs in the diagnostic setting have been proposed.13 One of the most useful criteria for distinguishing a benign from a pathogenic CNC is whether it is de novo or inherited. In general, with the exception of CNCs that demonstrate variable expressivity and decreased penetrance, de novo CNCs are more likely to be pathogenic compared with CNCs inherited from a phenotypically normal parent. However, in a clinical setting, an initial decision regarding the likely pathogenicity of a CNC must be rendered to determine if parental testing should even be pursued. If a CNC is likely benign based on other criteria, parental testing may be deemed unnecessary. Also, it may be impractical to delay reporting until parental specimens are received, and one or both parents may never be available for testing. Therefore, it is often necessary to make an initial categorization of a CNC based on other criteria. These other criteria that have been previously discussed include cross-referencing the change to internal or publicly available databases of known benign CNCs, the size and gene content of the CNC, and whether the CNC is a duplication or deletion.16 Even when these criteria are used, there is likely to be variability between laboratories not only in terms of which CNCs they decide to report but also in how these CNCs are reported. Internal databases of benign CNCs will vary depending on the volume of testing that a laboratory performs and the platform that the laboratory is using. There will also be subjectivity in evaluating whether specific genes may contribute to a phenotype, in determining the importance of the size of a CNC, and in how much emphasis to place on duplications versus deletions.

Variability in the interpretation of sequence variants became apparent after the introduction of full gene sequencing in the clinical laboratory. This variability has led to the development of recommendations for standards for interpretation and reporting of sequence variations by the American College of Medical Genetics and other working groups.7,8 Array-based copy number assessment is still relatively new to clinical cytogenetics, and a recent proficiency survey implemented by the College of American Pathologists (CAP) indicates that interpretation of CNCs detected by array-based technology may be just as problematic as sequence variants.9 This proficiency survey involved copy number assessment of an individual with autism spectrum disorder. While participants were asked to list only clinically significant abnormalities in their responses, 76% reported only a gain on 15q and 21% reported both a gain on 15q and a loss on 22q. The duplication in 15q consisted of the 15q11.2q13 region that has been reported in approximately 3% of patients with autism spectrum disorder. The deletion in 22q11.2, which is distal to the DiGeorge/velocardiofacial syndrome critical region, was designated a benign CNV by the majority (75%) of participants in a supplemental question. Therefore, although the 22q11.2 CNC was considered to be benign by the majority of participating laboratories, a fairly high percentage of participants reported it as a clinically significant CNC. The purpose of this study was to better document the variability between laboratories in the interpretation of CNCs detected using array-based technology and to ascertain the specific criteria that different laboratories are using for determining the clinical significance of CNCs.

MATERIALS AND METHODS

Information from bacterial artificial chromosome (BAC) and oligonucleotide array cases was provided to clinical laboratories that perform array-based CNC assessment for diagnostic purposes. Information on a single CNC from each case was given to participants who were asked to assess the CNC for pathogenicity as they normally would in their daily practice. All of the cases were originally received and processed in a single laboratory for diagnostic testing. The CNCs that were chosen for this study did not represent a random sampling of results from this laboratory, but rather, they were chosen because they potentially posed an interpretive challenge in terms of their clinical significance. The CNCs consisted of both duplications and deletions and represented a range of sizes. Some of the CNCs were not visualized by fluorescence in situ hybridization (FISH), either because there was not a probe available within the region in question or because the CNC was most likely below the level of resolution of FISH. Cases that were not FISH confirmed were thought to represent true positive array findings, as opposed to false-positive results due to technical reasons, based on criteria for detection of a CNC in the laboratory performing the arrays.

Participating laboratories

Survey materials were sent to 11 different laboratories, not including the laboratory that originally generated the results. Ten laboratories returned responses. Participants were from both academic and commercial laboratories that handle a range of array volume. For some laboratories, responses came from only one director, whereas at least one laboratory returned consensus responses that were reached between more than one director.

Array CGH and FISH

All specimens were received for routine clinical testing in a single laboratory. DNA was obtained from peripheral blood for testing on either a BAC (6 cases) or an oligonucleotide (7 cases) array at the request of the ordering provider. The BAC arrays contained 4685 clones covering 1543 loci (SignatureSelectTM v. 2.0). The oligonucleotide arrays consisted of approximately 105 K features (88,953 autosomal probes, 6369 X chromosome, and 1677 Y chromosome probes) printed by Agilent Technologies (SignatureChipOSTM).10 Some of the patients had other concurrent or previously normal conventional cytogenetic or molecular test results. Criteria for detection of a CNC identified on a BAC array consisted of a single clone, or multiple adjacent clones, demonstrating a log2 ratio of at least 0.3 and −0.3 in two dye-reversed hybridizations. A CNC on an oligonucleotide array consisted of at least five adjacent probes that demonstrated a gain or loss in a single hybridization experiment. FISH was performed for all of the CNCs detected by BAC array. FISH was not attempted in four oligonucleotide array cases, either because of lack of an available FISH probe in the region in question or because the original laboratory that performed the testing decided that the change was most likely clinically benign. Examples of representative array and FISH results from two of the cases used in the survey are shown in Figure 1. These images were not provided to participating laboratories at the time of the survey. Parental follow-up information was available for some of the cases at the time the survey was performed, but it was not provided to participants to simulate a real clinical testing situation in which an initial assessment of CNCs must often be made before obtaining parental specimens.

Fig. 1
figure 1

Array CGH and FISH results from two representative cases used in this survey. A, Partial Chromosome 3 array plot (left panel), interphase FISH image (middle panel), and metaphase FISH image (right panel) from Case 1. The array plot shows a two clone duplication in the short arm of Chromosome 3, at band p26.3, detected on a BAC array (cen = centromere). The interphase FISH image is representative of the signal pattern observed in 62% of nuclei from the patient when hybridized with one of the duplicated clones (RP11-624H2) detected by array CGH, compared with this signal pattern observed in 7% of nuclei from a mixture of normal control individuals hybridized with the same probe (arrowheads = signals from homologue with tandem duplication, arrow = signal from homologue without duplication). The metaphase FISH image is consistent with a tandem duplication in this patient (RP11-624H2 probe in orange indicated by arrows; D3Z1 centromere probe in green) and excludes insertion of the duplicated material into a different region of the genome. B, Partial Chromosome 16 array plot (left) and metaphase FISH image (right) from Case 9. The array plot shows a loss of nine adjacent probes detected on an oligonucleotide array. The metaphase FISH image shows absence of signal from the short arm of one homologue of Chromosome 16 (arrow) using a probe from within the deleted region detected by array CGH (WI2–2909M10 probe in orange indicated by arrowhead; D16Z2 centromere probe in green).

Information provided to participating laboratories

Table 1 lists information that was provided to participating laboratories. Participants were given the clinical indication for array testing, as well as results of previous or concurrent cytogenetic or molecular testing from each case. For each case, nucleotide coordinates (UCSC 2006, hg18 assembly) for one CNC detected in the case, including any adjacent gaps in array coverage, were provided. A gap was defined as the distance between a probe showing an imbalance and the nearest adjacent probe with a normal copy number. The CNCs consisted of both duplications (7 cases) and deletions (6 cases) and were from a variety of autosomes as well as the X chromosome. All of the CNCs were 1 Mb or less in size, with the exception of two duplications that could possibly be bigger than 1.0 Mb, depending on the involvement of adjacent gaps in coverage on the arrays. The single CNC given for each case was not necessarily the only CNC detected in the case, although participants were told that none of the cases were found to have other CNCs in critical regions of the genome known to be associated with well-documented syndromes or abnormal phenotypes. In addition, information on FISH confirmation of the CNC was provided.

Table 1 Clinical indication, other test results, copy number change, gaps in probe coverage, and FISH confirmation for cases

Case questionnaire

For each case, participants were asked to fill out a questionnaire (Fig. 2).

Fig. 2
figure 2

Questionnaire for each case and ancillary questions.

RESULTS

Survey responses

All questionnaire responses for each case from each participant are provided in Table, Supplemental Digital Content 1, http://links.lww.com/GIM/A89. For none of the thirteen cases was there 100% concordance in how a result was classified (Table 2). Responses for classification of a CNC ranged from normal to abnormal in some cases. Some participants appeared to use a binary classification system (normal versus abnormal), whereas other participants used additional categories (normal, likely benign, uncertain clinical significance [UCS], likely pathogenic/abnormal). Some participants had a lower threshold for designating CNCs in the UCS or abnormal categories than others (for example, in Table 2, participants 4 and 11 were compared with 3 and 8,). All but one of the BAC array cases was designated abnormal by at least one participant, whereas none of the oligonucleotide cases were classified as abnormal. This observation may be due to the presence of larger gaps in coverage adjacent to the CNC in some of the BAC array cases, potentially causing ambiguity in the sizing and determination of the gene content of the region in question.

Table 2 Reporting responses from participants for copy number changes

Duplications appeared just as likely to be categorized as UCS when compared with deletions (Table 3). There also did not appear to be good correlation between the size of a CNC and the frequency with which it was categorized as normal, as small CNCs containing multiple genes were just as likely to be categorized as UCS compared with larger CNCs (see Cases 8 and 9; Table 3). A CNC was just as likely to be categorized as abnormal or UCS regardless of whether it was confirmed by FISH; however, FISH visualization of array findings can still be useful, especially when duplicated material is inserted into another chromosome. In the case of insertion duplications, the possibility exists that even if the duplicated material itself is benign, there could be disruption or alteration of a gene or genes at the site of insertion.

Table 3 Characteristics and reporting of copy number changes

The duplication on the X chromosome in Case 10 was categorized as UCS by four participants, although one participant noted that the female control DNA used in their laboratory has a duplication in this region, and another participant also noted that this duplication is frequently seen in their laboratory (Table, Supplemental Digital Content 1, http://links.lww.com/GIM/A89). The presence of known benign CNCs in either public or personal databases was also useful to participants for categorizing CNCs in other cases (see Cases 11 and 12; Table 3), although the presence of benign CNCs did not always correlate with a participant categorizing a CNC as normal (Table 3 and Table, Supplemental Digital Content 1, http://links.lww.com/GIM/A89). The latter phenomenon may be due to variables pertaining to public databases, such as the frequency with which a benign CNC appears in public databases, and ambiguities in sizing of CNCs in public databases due to different array platforms.

This survey did not address the specific databases that were used by individual participants to decide whether a CNC is benign. That information has already been collected from a larger number of clinical laboratories through the CAP CYCGH proficiency surveys. In a recent CAP survey9 that included 34 participants, 97% reported using the Database of Genomic Variants (http://projects.tcag.ca/variation/),11 56% used an internal database, 47% used the Ensembl Genome Browser (http://www.ensembl.org), and 18% used the Human Structural Variation Database (http://humanparalogy.gs.washington.edu/structuralvariation/). It is clear from this information that many laboratories use multiple databases to assist in the interpretation of CNCs. In response to another question in the CAP CYCGH survey, 88% of participants stated that they do not list all CNCs considered to be benign in their clinical reports. Although not included as a formal question in either this survey or the CAP survey, some participants in this survey noted that they use other databases, such as Online Mendelian Inheritance in Man (OMIM), DECIPHER, and PubMed searches, in addition to the databases of benign variants cited earlier, to evaluate CNCs for pathogenic effects.

Responses to ancillary questions

Some laboratories have absolute size cutoffs below which CNCs are not reported. These size cutoffs range anywhere from 50 Kb to 500 Kb for deletions and 150 Kb to 500 Kb for duplications. Some laboratories will potentially report any size CNC as a finding other than normal, depending on variables such as gene content. Five participating laboratories perform array testing for CNCs on prenatal specimens. Their criteria for classifying CNCs are the same for prenatal and postnatal cases, but some laboratories use different arrays for prenatal cases than for postnatal cases (targeted arrays for prenatal and whole-genome arrays for postnatal).

DISCUSSION

We report the results of a survey to examine the variability among laboratories for interpretation of gains and losses detected by microarray analysis. We prefer the use of CNC over CNV because of confusion over the meaning of “variant” itself. The term variant has often been used to imply a normal population heteromorphism,12 which would be consistent with benign CNCs, but not with pathogenic CNCs. However, variant has also been applied to pathogenic CNCs, or has been used to refer to both benign and pathogenic CNCs.13

The results of this survey demonstrate that while laboratories may have similar approaches for attempting to determine the clinical significance of a CNC that has not been previously associated with an abnormal phenotype, the final interpretation between different laboratories can vary significantly. Thus, different laboratories could detect the same CNC, but it may be considered benign and not reported by some laboratories, whereas others may report it as a finding of clinical significance or possible clinical significance. There were some potential limitations of this survey that may have exaggerated the differences in interpretation that were observed for certain cases. Participants were asked to evaluate the CNCs as they normally would in their daily practice, although the information provided to the participants may not have exactly duplicated all of the information that a laboratory would have available to them if they processed the case in their own laboratory. Also, some of the participants may not have experience with the specific platforms that were used to generate the array findings for this survey. Some participants may have limited diagnostic reporting categories but may indicate in the text of the report the level of uncertainty regarding the clinical significance of a CNC. The latter information may not have been captured in the responses, even though an option for “Other-specify” was given in the questionnaire. Despite these potential limitations, the results still indicate that there is a lack of consensus between laboratories in the interpretation and reporting of CNCs.

Variability in the interpretation of CNCs could be due to differences in the content of internal databases containing benign CNC information between laboratories, differing opinions as to the importance of deletion or duplication of specific genes within or overlapping a CNC, and the comfort level of a director in reporting or not reporting a CNC. The content of internal benign CNC databases may vary not only because of the volume of array cases that a laboratory experiences but also by the criteria that a laboratory uses to determine that a given CNC is benign. For example, how many times should a CNC be found to be inherited from a phenotypically normal parent before it can be considered benign? Given the finding of variable expressivity and incomplete penetrance associated with other CNCs in the genome,1316 the argument could be made that inheritance from a phenotypically normal parent needs to be documented more than once before a CNC is deemed benign; however, the specific number will vary at the discretion of the laboratory. For example, one laboratory requires that a CNC is seen five or more times as inherited from a phenotypically normal mother and five or more times inherited from a phenotypically normal father before their reporting criteria are adjusted to reflect this change as a benign CNC. Also, before designating a CNC as benign, the laboratory should have a high degree of confidence that the parents have been properly evaluated for subtle abnormal phenotypic features.

The observation in this survey of at least one CNC (Case 10; Table 3) that could easily be categorized as benign by at least two laboratories based on internal database information, but that was classified as a finding of UCS by four other participants, emphasizes the necessity for reliable, well-annotated, publicly available databases that contain CNC information. Toward this end, recent studies that have used high-density platforms to screen large cohorts of apparently phenotypically normal individuals provide invaluable information to assist in the interpretation of CNCs in the clinical laboratory.6,17 Interestingly, the study performed by Shaikh et al.6 specifically addresses difficulties in interpreting one of the CNCs included in the present survey. Case 2 is from a patient with developmental delay who has a deletion of 800 Kb that is entirely contained within the CSMD1 gene (OMIM #608397), but involves multiple exons. This gene is a putative tumor suppressor and is expressed in fetal and adult brain, although little additional information is available on its function. This deletion was considered normal, of UCS, or abnormal by different participants (Table 3). Shaikh et al. detected hundreds of CNCs within CSMD1 in phenotypically normal individuals; however, these CNCs were much smaller than those previously reported in this gene, and only 0.8% of them disrupted exonic sequence, whereas 24.5% of previously reported CNCs were predicted to disrupt one or more exons. These results suggest that the size of previously reported CNCs could be overstated, or that the larger imbalances represent rare variants that may or may not confer disease risk. Additional studies of this type are needed to resolve these ambiguities, so that decisions regarding the likely pathogenicity of CNCs are based on reliable information.

Although the ISCN (2009) provides a standardized system for reporting CNCs, it does not aid in their interpretation.18 The development of interpretation recommendations analogous to those proposed for sequence variants identified by sequence-based testing would be extremely useful for providing guidance to clinical laboratories performing array-based copy number assessment. A categorization system that provides enough granularity to convey the degree of uncertainty in the clinical significance of a CNC would give useful information for genetic counseling, including the need for parental carrier testing. In addition to normal and abnormal categories, such a classification system could further subdivide UCS cases into “likely benign,” “uncertain,” and “likely pathogenic.” The abnormal category should be reserved for those CNCs for which there is well-documented evidence of an association with an abnormal phenotype, but that do not display significant variability between different individuals or incomplete penetrance. The “likely pathogenic” category could be used for those CNCs for which there is well-documented evidence of association with an abnormal phenotype, but with known variability in phenotypic expression. This category could also be used for large CNCs that contain genes for which there is evidence of copy-dependent pathogenicity and rare case reports, but no well-documented syndrome. The “likely benign” category could be reserved for CNCs that are relatively small and that lack known genes, or for which there is fairly convincing evidence of lack of pathogenicity based on independent database entries of benign CNCs that completely overlap the CNC in question. CNCs in this category may not warrant parental follow-up testing. The “UCS” category could include those CNCs that fall between the “likely pathogenic” and the “likely benign” categories.

A standardized classification system would not completely eliminate reporting discrepancies between laboratories, but it would alleviate markedly discrepant results such as one laboratory not reporting a CNC at all and another reporting it as abnormal. A standardized classification system would also give the ordering provider a better idea of the level of uncertainty of the clinical significance of a CNC. A weighted scoring system that attempts to quantitate different variables that are used to assess pathogenicity could even be considered to assist laboratories in placing CNCs into different categories. For example deletions, which typically are more deleterious, could be given a higher point value than duplications. Weighted scores could also be based on the size of a CNC. In a recent study that examined the frequency of CNCs based on size, 65% to 80% of apparently phenotypically normal individuals harbor a CNC of at least 100 kb in size, 5% to 10% of normal individuals carry a CNC of at least 500 kb in length, and 1% of normal individuals carry a variant of at least 1 Mb.17 Although these findings confirm that size alone cannot be used as a predictor of pathogenicity, the difference in the frequency of small versus large CNCs could justify assigning increasing point scores to CNCs of increasing size.

In addition to the findings presented here, other variability in clinical reports undoubtedly exists. Consensus recommendations from working groups could also provide laboratories with guidance regarding content of reports. For example, should clinical reports include a list of all genes within a CNC? Should they include information on gene expression patterns and information on the potential clinical relevance of all genes within the region in question, even though this information can be accessed by anyone from publicly available databases? Should reports state the size of gaps in array coverage adjacent to CNCs? How much clinical interpretation of a CNC should be provided by the laboratory, in light of the limited clinical information on patients that is usually given to the laboratory?

The findings presented in this study demonstrate that our ability to interpret the significance of some CNCs is lagging behind our ability to detect them. As the number of clinical laboratories that perform array-based copy number assessment increases, both laboratorians and providers may be increasingly asked to interpret reports that are generated outside their own institution. Clinical laboratories, and the providers who order these tests and communicate the results to patients, could greatly benefit from more uniform classification of CNCs and report content between laboratories.