Main

Learning disability (LD) or mental retardation, is a significant impairment of the cognitive and adaptive functions, with onset before the age of 18 years1,2; other common terms are learning difficulty, intellectual disability, developmental delay, impaired cognition, and mental handicap. LD is a common condition affecting 1–3% of individuals worldwide,3 with genetic factors estimated to be the main cause in around half of all patients with severe LD and around 15% of patients with mild LD.4 The past decade has seen advances in genetics research highlighting the importance of genetic factors and in particular genomic imbalance in the etiology of LD. This has led to improved diagnostic capability and, by providing a diagnosis, improving the welfare of patients and their families.5

Clinical assessment of children with LD typically involves examination by a pediatrician followed by appropriate biochemical and hematological tests as well as chromosomal tests and other molecular genetic tests. Existing cytogenetic tests involve a karyotype analysis followed by fluorescent in situ hybridization or multiplex ligation-dependent probe amplification, which can both identify submicroscopic chromosomal deletions and even single gene deletions located on specific chromosomes.6 A new method of analysis, array-based comparative genomic hybridization (array CGH), is increasingly being used routinely in patients with LD, in addition to existing cytogenetic techniques when such tests prove negative.7,8

Array CGH identifies copy-number variations (either amplifications or deletions) across the entire genome at high resolution.7 The technology combines fluorescence techniques with the microarray platform and allows the comparison of DNA content in two differentially labeled genomes, a test genome (patient), and a reference genome (control). The microarray platform also allows the use of thousands of individual DNA sequences from throughout the genome, providing precise information in a single experiment about the locations of any identified aberrations. Array CGH has many advantages over conventional cytogenetic techniques in that it can provide rapid genome-wide assessments at a high resolution (≤1 Mb) and precise location information linked to physical and genetic maps of the human genome. It can also detect single-copy gains and losses across whole chromosomes (including telomeric regions and prespecified chromosomal regions). However, balanced translocations cannot be detected. Another drawback is the potential for identifying novel copy number variants that may not be responsible for the patient's LD.9,10

The clinical significance of a copy number variant is usually determined either by observing whether the associated phenotype segregates along with the variant within a family, the rearrangement being already associated with a recognizable phenotype, or by whether the measured size of the variant is so large that it is unlikely to be without a phenotypic consequence. Causality is concluded when any of these occurs. Variants are considered benign (noncausal) if a phenotypically normal parent carries the same variant, although care must be taken to ensure that the breakpoints of the variant do not differ in the patient or parent because such a difference could be clinically relevant. Small de novo variants of unclear clinical significance are also identified, where neither parent carries the variant when the variant has not previously been associated with the phenotype in question.

In 2007, we reported that there was insufficient evidence to recommend the introduction of this test as a routine addition to current cytogenetic analysis although it could be helpful in certain circumstances. The subsequent increase in clinical use and the publication of 12 new studies evaluating array CGH has enabled us to update our previous findings.

METHODS

Systematic review inclusion criteria

Studies were included that used array CGH to identify genetic abnormalities in patients with LD and congenital anomalies, in whom conventional cytogenetic analysis proved negative. Both case series and cohort studies were eligible for inclusion.

Search strategy and data extraction

MEDLINE, EMBASE, and Web of Science databases were searched during March 2008 with both free text and MeSH terms using the strategy previously outlined,11 appropriately modified for the specific database. No language or other search restrictions were imposed and reference lists of primary studies were checked for additional references. Two reviewers (G.S.S. and A.S.B.) independently extracted data using the forms used in our first review.11 Reviewers compared results and resolved any differences through discussion. Where there were multiple publications of the same study, we extracted data from each publication and identified the most complete and up-to-date information.

Assessment of study quality

The following quality indicators were assessed: (1) clear description of the setting and study population; (2) whether criteria used for patient selection were clearly described; (3) evidence of appropriate pretesting with karyotyping, fluorescent in situ hybridization or telomere tests; (4) whether control samples were included, and if so, described clearly; (5) description of the array CGH platform, software, and assay process; (6) description of steps to identify and exclude known copy number polymorphisms using genome databases; (7) appropriate follow-up testing; and (8) clear description of the process of interpretation of array CGH results.

Statistical analysis

Diagnostic yield was defined and calculated for each study as the number of patients who had variants detected by array CGH that were judged to be causal, divided by the total number of patients tested. False-positive yield was defined and calculated as the number of patients who had variants detected by array CGH that were judged to be noncausal or of unknown significance, divided by the total number of patients tested. Meta-analyses were conducted on the scale of the logit of these proportions. The number needed to test to obtain one patient with a causal variant was estimated as the inverse of the diagnostic yield.

Before meta-analysis, inconsistency of findings (heterogeneity) was tested using standard χ2 methods and by using the I2 statistic, which describes the proportion of total variation in estimates that is due to heterogeneity rather than random error.12 Funnel plots and associated statistical tests13,14 were used to assess assumptions involved in meta-analysis and to explore the relationship between precision and magnitude of estimates. The meta-analysis was conducted using a random-effects model, assuming that heterogeneity can be represented by a normal distribution for the underlying effects. Meta-regression was used to investigate variation across studies using the following study- level covariates: study sample size (<100, 100–499, ≥500), array resolution (<1 Mb, 1 Mb, targeted array), patient source (genetic laboratory, clinical setting), patient source (Europe, North America, mixed, Japan), and use of de Vries clinical score (yes, no).

RESULTS

Study characteristics

Nineteen primary studies,6,1532 incorporating a total of 13,926 subjects, were identified that met the inclusion criteria (Table 1). Twelve studies were identified since the publication of our previous meta-analysis.2132 Seven studies were conducted on patients based in the United States,2123,25,28,31,32 seven in Europe,6,15,17,18,24,29,30 four using patients from multiple sources based in North America, South America, or Europe19,20,26,27 and one in Japan.16 All studies included sampling of control DNA as part of their protocol. Seven studies used a 1 Mb array for investigating the whole genome,6,15,1820,22,29 four used a targeted array,16,26,27,31 three used an oligonucleotide array with 30–35 k resolution,21,25,28 two studies used a 100-k array,23,30 one used a tiling BAC array,17 one study used both a targeted array and a 1-Mb array,32 and another used a 1-Mb array supplemented with an additional 3000 gene and region-specific BAC clones increasing the resolution to 0.5 Mb.24 Control samples varied from 2 to 316 normal people, whereas Menten et al.15 used samples from other patients in the cohort as controls. There was some variation in the clinical criteria for patient selection and testing, with some investigators using the de Vries clinical severity score.17

Table 1 Identified studies and their characteristics

Test performance

The combined diagnostic yield of causal genetic abnormalities in the 19 studies was 10% (95% confidence interval [CI]: 8%, 12%) (Table 2, Fig. 1). There was evidence of heterogeneity (χ2 = 63.75, P < 0.001; I2 = 72%, 95% CI: 55%, 82%). In meta-regression analysis, the study level characteristics of sample size (explaining 72% of the between-study variance of logit proportions, P = 0.003) and array resolution (50% of the variance, P = 0.007) individually explain much of this heterogeneity. Studies in which array resolution was less than 1 Mb had a combined diagnostic yield of 14% (95% CI: 9%, 20%). Funnel plot asymmetry was observed with the possibility of an excess of smaller studies with striking results (Egger's test P = 0.002). The combined diagnostic yield of the three largest studies (>1000 participants each) was 7% (95% CI: 7%, 8%). The number needed to test to identify one new causal variant was 10 (95% CI: 8%, 13%).

Table 2 Genetic abnormalities identified by array CGH in idiopathic learning disability and congenital anomalies
Fig. 1
figure 1

Random-effects meta-analysis of diagnostic yield from array-based CGH in patients with learning disability and congenital anomalies. CI, confidence interval. The diamond represents the overall estimate of diagnostic yield.

The proportion of noncausal variants detected by array CGH ranged from 1 to 67%. A meta-analysis of the 18 studies with available data (excluding Schoumans et al.18) gives a combined false-positive yield of 7% (95% CI: 5%, 10%) (Table 2, Fig. 2). There was strong evidence of heterogeneity (χ2 = 187.29, P < 0.001; I2 = 91%, 95% CI: 87%, 94%). Sample size accounted for 24% of this heterogeneity although this was not statistically significant (P = 0.136). We did not observe funnel plot asymmetry from either visual inspection or test statistics (Egger's test P = 0.796). In our previous review, we excluded the study by Miyake et al.16 because it distorted the false-positive range with its large false-positive value (the other studies ranged 1–19%) and relatively large contribution (because of the small number of studies included). With the subsequent increase in study number and sample size, the study by Miyake et al.16 contributes less to the overall false-positive yield (a weight of 6% under random effects), and a meta-analysis of the remaining seventeen studies gives a combined false-positive yield of 6% (95% CI: 4%, 8%) (Fig. 2).

Fig. 2
figure 2

Random-effects meta-analysis of false-positive yield from array-based CGH in patients with learning disability and congenital anomalies. CI, confidence interval. The upper diamond represents the overall estimate of false-positive yield based on 18 studies. The lower diamond represents the overall estimate of false-positive yield based on 17 studies having excluded the study by Miyake et al.16.

DISCUSSION

This updated review was based on more than twice the number of studies in our previous review,11 with a 30-fold increase in the number of subjects from 462 to 13,926 (Table 3). Because of the increase in the number of studies and participants, our estimates of average diagnostic yield are much more precise (current review 10%, 95% CI: 8%, 12% versus previous review 13%, 95% CI: 10%, 17%; Table 3). The increased number of studies also enabled us to investigate study heterogeneity.

Table 3 Comparison of findings of our present updated meta-analyses with our previous meta-analyses11

Much of the heterogeneity observed in our analysis of diagnostic yield could be attributed to sample size and array resolution. Our analysis of array resolution suggests arrays with a resolution of less than 1 Mb have a greater diagnostic yield (14%, 95% CI: 9%, 20%) although additional studies would be required to further improve precision. The relationship with sample size (larger yields in smaller studies) is compatible with several explanations. A key threat to literature-based reviews and meta-analyses is the possibility of reporting bias, such as a publication bias, which often manifests itself as an excess of smaller studies with more striking results. An alternative explanation could be that the larger studies involved patients with less severe LD, and hence with fewer genetic causes. However, the spectrum of patients tested appears to be similar, all studies having selected patients with LD and congenital anomalies. All selected patients also had negative results for conventional cytogenetic tests. The consistent use of a clinical severity score such as that employed by de Vries et al.,17 along with stratification of data by such a clinical score, would allow the selection of patients who are most likely to benefit from evaluation of array CGH by clinical severity. Also, although 12 new studies were identified in this updated review, important differences in ethnicity of patients could not be investigated because of the identification of only a single study of Asian origin. This is an area in which further research is also warranted. This meta-analysis therefore provides direct evidence only for highly selected, mainly Western (European and North American) patients, using array CGH to identify patients previously undiagnosed by other tests.

Array CGH is able to identify causal genetic abnormalities in patients with LD and congenital anomalies, in whom previous conventional cytogenetic analysis has proven negative. However, it also identifies genetic abnormalities deemed to be noncausal or of unknown relevance (false positives) at a similar frequency. It remains to be seen how these noncausal variants should be viewed in practice and how such results should be communicated to patients. Nevertheless, our updated analysis in 19 studies shows that the false-positive yield is low (7%, 95% CI: 5%, 10%). Future studies should concentrate on trying to minimize the number of false positives being identified.

Databases such as DECIPHER (http://decipher.sanger.ac.uk) and ECARUCA (http://www.ecaruca.net) facilitate and expedite the identification of new disorders in clinical cytogenetics. Access to these databases allows the opportunity to identify overlapping phenotypic and genotypic features against the aggregated patient case reports.33 In addition to these databases, the Toronto Database of Genomic Variants (http://projects.tcag.ca/variation/project.html) provides comprehensive information regarding copy number variants identified as benign and unlikely to be pathogenic if identified in a patient with LD and congenital anomalies. We would strongly encourage clinicians to use these online resources and to contribute identified copy number variants and validate existing genotypic and phenotypic data. The most commonly identified cytogenetic regions in the published literature include 1p36.3, 15q11-q12, and 22q11.2.26,31,32,34 These regions could be targeted for greater coverage in future array design to allow a more accurate estimation of the size and characteristics of any rearrangements identified.

The use of array CGH as part of the investigation of the child with LD is gaining considerable momentum as the costs of the technology decrease. Services in which there is limited cytogenetics capacity, such as many middle income countries, may also begin to use this technology as a first line diagnostic investigation. However, a continued evidence-based approach to evaluate array CGH within the clinical pathway will be important in considering how, when, and for which patients it should be incorporated into routine services. The results of this updated meta-analysis provide new evidence to support the routine use of array CGH in investigating patients with LD and congenital anomalies, in whom previous conventional cytogenetic analysis have proven negative. However, given that the technology also identifies false positives at a similar rate to causal variants, some caution in clinical practice is also required. In particular, at present a decision on the use of array CGH is the responsibility of a clinical geneticist who will be able to assess likely clinical utility and interpret results for the parents. Further, although this systematic review has provided additional information about the use of array CGH in a clinically selected group of patients where conventional cytogenetics is negative, findings cannot be extrapolated to an unselected group, where LD may be less severe and the likelihood of a genetic cause is less. The use of array CGH as a first line investigation in all patients with LD should therefore be further evaluated through large prospective studies.