Introduction

In Europe, mental retardation with or without dysmorphic signs and malformations is recognised in around 1% of children.1, 2, 3 The true incidence may be higher due to non-ascertainment. An unbalanced karyotype can be found in 10%4 to 16%5 of these individuals by routine chromosome analysis (microscopy of G-banded metaphases from a T-lymphocyte blood culture). Other patients have a cryptic (ie invisible) chromosomal imbalance detectable by chromosome-based high-resolution comparative genomic hybridisation (HR-CGH).6, 7 We and others have found that deletions above 3 Mb can be detected by HR-CGH.8 Duplications are usually more difficult to detect. In recent years, CGH profiles have been determined using genomic microarrays instead of metaphase chromosomes to obtain higher sensitivity.9, 10, 11, 12, 13, 14 The resolution is no longer limited by the quality of the normal metaphase spreads, but by the density of genomic clones. Many laboratories have employed bacterial artificial chromosome (BAC)-based arrays with an average clone distance of 1 Mb for screening patients with developmental delay and/or patients suspected for having a cryptic chromosomal imbalance, hereafter called 1 Mb array-CGH. The 1 Mb array-CGH pick-up rate for clinically relevant chromosomal imbalances in mentally retarded patients with normal karyotypes was 8.4% when data from five different studies with a total of 332 patients were taken into account.10 Using a more limited 831-clone BAC array targeted to regions of known clinical relevance, a genomic alteration was found in 5.6% of 1500 consecutive cases with a variety of developmental problems.15

In our laboratory, we have used HR-CGH as a screening method for cryptic chromosomal imbalances for several years. In an initial study, we found a cryptic chromosomal imbalance in 5 out of 50 patients with abnormal development and normal karyotypes,7 in line with other observations.6 As an extension of this study and to map minor imbalances more accurately, we have recently used 1 Mb array-CGH to investigate all HR-CGH findings obtained during the past 6 years. Thus, this work gives detailed information on the position of the genomic imbalances in 80 patients with abnormal phenotypes, adding data on phenotype–genotype correlations.

Patients and methods

Patients

Blood samples from 590 patients (age range 1–58 years), all except 3 being mentally retarded, and most also having dysmorphic features and/or malformations, were received for analysis at the Center for Medical Genetics and Molecular Medicine, Haukeland University Hospital, during a 6-year period (2001–2006). Blood samples were referred to us from the whole of Norway by medical geneticists or experienced paediatricians. After conventional G-banded karyotyping, the samples were examined by HR-CGH analysis. This was performed either because the observed chromosomal abnormality was small and doubtful or difficult to classify, or because the patient phenotype was suggestive of a chromosomal abnormality despite the karyotype being normal. When array-CGH became available, all HR-CGH-positive samples were investigated by 1 Mb array-CGH for fine mapping. In addition, 1 Mb array-CGH analysis was carried out on a subgroup of 20 patients with strong clinical impression of having a genomic imbalance despite normal findings on G-banding and HR-CGH. The layout of the study is summarised in Figure 1. Parental analyses were carried out to investigate whether the finding was de novo or the result of a balanced rearrangement in one of the parents. Parental DNA was not available for case 32 (Table 1). Many patients were later referred to us for clinical evaluation and genetic counselling. In other patients, more detailed clinical information was asked for. The patients' phenotypes are summarised in Tables 1, 2 and 3.

Figure 1
figure 1

Study layout.

Table 1 Array-CGH fine mapping of chromosomal imbalances detected by HR-CGH, but not by routine G-banding
Table 2 Array-CGH fine-mapping of chromosomal aberrations detected by routine G-banding
Table 3 Chromosomal imbalances detected by array-CGH only

HR-CGH

Chromosome-based HR-CGH was performed as described previously.7 Briefly, patient DNA and normal reference DNA (800 ng each) were labelled with fluorescein isothiocyanate-12-dUTP and Texas Red-5-dUTP (NEN Life Science Products Inc., USA), respectively. Commercial reference DNA was used, consisting of a pool from either 10 normal female donors or 10 normal male donors (Promega GmbH, Germany). The labelled DNAs together with 20 μg of human Cot-1 DNA (Invitrogen Inc., USA) were hybridised onto normal metaphase chromosomes (from the same individual). Sex-matched hybridisations were performed. Analysis was performed using the high-resolution CGH analysis software that is part of the CytoVision System (Applied Imaging, UK). Imbalances were scored as a deviation from the standard reference interval CGH ratio profiles at 99.5% confidence intervals.16 The CGH results were verified either by re-examination of G-banded high-quality chromosomes (band level above 600) or by BAC-based fluorescence in situ hybridisation (FISH) analysis.7

Array-CGH

For array-CGH, a BAC/PAC clone set consisting of approximately 3500 clones with an average interclonal distance of 1 Mb was obtained from the Wellcome Trust Sanger Institute (UK).17 Production of genomic 1 Mb arrays based on the clone set was done by the Norwegian Microarray Consortium (NMC; www.microarray.no) at the RR-HF/UiO Microarray Core Facility in Oslo, Norway. Arrays were produced as described previously,18 but tile path resolution on chromosome 1q was not included in our arrays. Each clone is represented by four DNA spots on the array. Array-CGH was performed as described previously.18 Briefly, patient DNA and normal reference DNA (500 ng each) were labelled with Cy3-dCTP and Cy5-dCTP (NEN Life Science Products), respectively. Commercial reference DNA described above was used (Promega). The labelled DNAs together with 135 μg of human Cot-1 DNA (Invitrogen) were hybridised onto the array. Sex-matched hybridisations were carried out. Arrays were scanned in an Agilent G2565B scanner (Agilent Technologies, USA), and images were analysed using GenePix Pro 5.0 software (Molecular Devices Corp., USA). Further data processing, including normalisation and filtering, was performed using the M-CGH software.19 In brief, spots with intensities lower than the background, or signal intensities below local background plus twice the standard deviation (SD) of the background, or replicates with SD>0.2, were excluded from the analysis. Classification (gains, normals, and losses) and calculation of the log 2 ratio thresholds for the amplicons were computed by the M-CGH software.19 BAC clone information was based on the Ensembl database v44 (www.ensembl.org), NCBI build 36.2.

Determination of the size of a deletion or duplication

The 1 Mb distance between the BAC clones is an average. The real distance varies depending on chromosomal location, and a breakpoint may also be in a region partly covered by an abnormal clone. Therefore, the sizes of the deletions and duplications (Tables 1, 2 and 3) are given both as the minimum and maximum sizes. The terminal clones of the imbalance define the minimum size and the position of the imbalance, and the normal flanking clones define the maximum size. The end coordinates of the clones were used for size determination. Aberration calls were based on two or more consecutive clones.

Results

In our sample of 590 patients with abnormal development, which has been screened for minor or cryptic chromosomal imbalances by G-banding and HR-CGH, we applied array-CGH on a subset of 96 patients, using 1 Mb resolution BAC clone arrays. The patients' phenotypes are summarised in Tables 1, 2 and 3. The specificity of the 1 Mb BAC arrays was tested in advance by running a total of 10 normal-versus-normal hybridisations with DNA from normal male or female blood donors (data not shown). A low level of background noise, that is, clones variably showing a signal intensity ratio outside the limits defined for normal DNA copy numbers, was detected. On average, 15 non-consecutive single clones (0.4%), behaving differently from experiment to experiment in the normal-versus-normal hybridisations, were detected. To test the sensitivity and reproducibility of the array slides, DNA samples from two patients with deletions of known size after FISH mapping (case 6 in Table 1 and case 69 in Table 2) were run for analysis four times each. The genomic ratio profiles of the four repeated experiments in both cases were consistent (data not shown).

Detection of cryptic chromosomal imbalances and occult trisomy mosaicisms

Among the 590 patients described above, 36 had a small visible chromosome aberration detected by G-banding and 554 patients had a normal karyotype. When genomic BAC arrays became available from the NMC, 1 Mb array-CGH analysis was used for fine mapping of all HR-CGH findings and for examination of selected patients with normal findings on both G-banding and HR-CGH. In the group of 554 patients having a normal karyotype, HR-CGH analysis detected a genomic imbalance in 40 of them (7.2%; Table 1). Of note, four patients had occult mosaicism: three for trisomy 9 and one for trisomy 14. The percentage of mosaicism in these patients (cases 25, 26, 27 and 33; Table 1) was determined by interphase FISH to be 21, 23, 15 and 36%, respectively.

Among the 40 patients, all imbalances were known to be de novo aberrations except cases 5, 11 and 20 (parent translocation carrier) and case 32 (unknown). Of the 40 patients, 72% (29/40) had deletions, 10% (4/40) had occult trisomy mosaicisms, 10% (4/40) had cryptic translocations and 8% (3/40) had duplications (Table 1). The mean minimum size of an HR-CGH-detected isolated deletion was 6.0 Mb (Table 1), whereas the mean maximum size was 8.5 Mb. For the three isolated duplications found (Table 1), the mean minimum size was 5.6 Mb and the mean maximum size was 8.9 Mb. The theoretical size maximum of the smallest duplication detected (case 12 in Table 1) was 5.3 Mb.

In the group of 36 patients with small chromosome aberrations detected by G-banding, HR-CGH and array-CGH were applied for better characterisation (Table 2). Among these patients, all imbalances were known to be de novo aberrations except cases 62 (parent translocation carrier) and 69 (familial inversion). Of the 36 patients, 13 had deletions, 12 had duplications and 11 had translocations/complex single-chromosome rearrangements. Thus, in the group of 40 patients with normal G-banded karyotypes (Table 1), a deletion (29/40) was a 10 times more common finding than duplication (3/40), whereas in the group of 36 patients with aberrations detectable by G-banding (Table 2), small deletions and duplications were equally common findings. In Table 1, all six non-mosaic patients known to have severe mental retardation had deletions (mean minimum size 7.3 Mb). In contrast, in Table 2, all six patients known to have severe mental retardation had duplications (mean minimum size=15.7 Mb).

In Figure 2, the location and size of all deletions and duplications are shown apart from the four cases with occult trisomy mosaicism. A total of 91% of the imbalances were de novo. Except for two patients (case 69; Table 2 and case 80; Table 3), all single-chromosome imbalances were de novo (Figure 2). Imbalances were found on all chromosomes. The size and number of imbalances varied greatly between chromosomes, and the majority of the imbalances were found on chromosomes 2–10, 13 and 14. When only the interstitial chromosome rearrangements were counted, the imbalances were more equally spread on all chromosomes.

Figure 2
figure 2

Overview of chromosomal imbalances mapped by 1 Mb array-CGH: 400-band chromosome ideograms are shown with duplications drawn as green lines and deletions as red lines. Imbalances caused by translocations are shown in a lighter colour.

As expected, the array-CGH results showed that the plots made by the HR-CGH software could be misleading. In 3 of the 40 patients with cryptic imbalances (Table 1), HR-CGH indicated the position of a deletion or duplication that missed its true position: The 5q13 duplication in case 12, the subterminal 9p deletion in case 28, and the 10q26 deletion in case 30. In case 58 (Table 2), the HR-CGH finding was both incorrect and incomplete. Here, a terminal deletion combined with a subterminal duplication on 7q was only shown as a subterminal 7q duplication by the HR-CGH software. The finding of a terminal deletion combined with a subterminal duplication suggests that the mechanism might have been a breakage–fusion–bridge cycle after 7q arm fusion.20 In two patients with small but visible imbalances (Table 2), the HR-CGH software excluded the subtelomeric regions from the imbalance,16 for example, case 72, where a terminal 19q duplication that appeared to be interstitial (Figure 3a) and, case 74, where the HR-CGH software suppressed the terminal part of a 22q13 deletion. This demonstrates that HR-CGH software may overlook even visible subtelomeric imbalances.

Figure 3
figure 3

Inaccurate mapping of a chromosomal imbalance by HR-CGH (a) and array-CGH detection of a chromosomal imbalance in a case with normal findings on G-banding and HR-CGH (b): the 99.5% confidence interval HR-CGH ratio profiles together with the array-CGH profiles of case 72 (Table 2) and case 79 (Table 3) are shown. The imbalances mapped by HR-CGH and array-CGH analyses are shown as green (gains) or red (losses) bars below an 850-band ideogram. (a) HR-CGH of case 72 indicated a non-terminal 19q duplication from 19q13.2 to 19q13.3, whereas array-CGH showed a terminal duplication (dup(19)(q13.41qter)). (b) Normal findings by HR-CGH of case 79, whereas array-CGH detected a 14q deletion (del(14)(q32.2q32.32)).

Array-CGH on a subset of patients with normal karyotype and HR-CGH

Twenty patients with a phenotype strongly suggesting a chromosomal abnormality despite normal findings on both G-banding and HR-CGH were selected for 1 Mb array-CGH. In four of them (20%), a chromosomal imbalance was found: three deletions (mean minimum size 3.2 Mb, range 3.1–3.3 Mb) and one duplication (minimum size 8.7 Mb) (Table 3). All findings were verified by array-CGH analysis on CytoChip-arrays (BlueGnome, UK), a commercial BAC array with 1 Mb resolution (data not shown). Among the four patients, three imbalances were known to be de novo, whereas the aberration in case 80 was caused by a parental between-arm insertion. Even though the deletion on chromosome 14 in case 79 is close to the telomere, it would not have been detected by subtelomere-FISH screening (Figure 3b). In case 77, the finding of a deletion involving the ZFHX1B gene was expected on clinical grounds because the patient had Mowat–Wilson syndrome, known to be caused by haploinsufficiency of this gene.21 This finding was also verified by a BAC-FISH probe against the ZFHX1B locus. None of the deletions were picked up by HR-CGH analysis, even at lower confidence intervals, but the duplication in case 80 was seen when the confidence intervals were set at 95%. However, the amount of noise on the HR-CGH profiles at such low confidence intervals makes analysis at this level of sensitivity unfeasible.

Discussion

The main purpose of this study was fine mapping for improved genotype–phenotype information on a well-characterised group of patients with minor and cryptic chromosomal imbalances diagnosed during a 6-year period. In addition, the performance of the two CGH techniques could be compared.

In patients with normal karyotypes and a combination of mental retardation and dysmorphic signs and/or malformations, the detection rate of HR-CGH was 7.2%. Our results are comparable to a group of 332 patients from five different European studies, where the 1 Mb array-CGH diagnostic detection rate, all data combined, was 8.4%.10 This does not indicate that array-CGH is only slightly better in detecting clinically relevant genomic imbalances than HR-CGH. The yield of different diagnostic approaches is dependent on patient ascertainment, although the influence of patient selection on the detection rate might be smaller than previously thought.22 In the five European studies summarised above, the 1 Mb array-CGH diagnostic detection rates varied from 8%10 to 16%.23 Of note, some of the patients in these studies had been pre-screened for subtelomeric imbalances, and elimination of these ten patients from our cohort would have reduced our diagnostic detection rate to 5.4%. In addition, our results show that HR-CGH may overlook even visible subtelomeric imbalances. Therefore, HR-CGH should be combined with conventional G-banding whenever feasible. In a subgroup of 20 patients with normal findings on G-banding and HR-CGH but with a strong clinical suspicion of a genomic imbalance, four patients (20%) had a genomic imbalance. The sample is too small to conclude if this high detection rate is a random finding or due to skilled patient selection.

Among the 80 patients with genomic imbalances, findings were on all chromosomes, in agreement with other 1 Mb array-CGH studies.10 The largest numbers of aberrations, with deletions being as common as duplications, were seen on chromosomes 7 and 8. On chromosome 19, our most gene-dense chromosome, only duplications were found. Another interesting observation was that minor but visible duplications were as common as deletions, whereas cryptic deletions were 10 times more common than cryptic duplications. One explanation may be that deletions are easier to detect than duplications because the actual change in DNA amount is 100% for deletions and 50% for duplications. It is also possible that small duplications are less likely to cause a phenotype that makes the patient look ‘chromosomal’, that is, reduces the chance of ascertainment.

Although useful, it is difficult to evaluate the clinical significance of deletions and duplications based only on size, especially until we have a more extensive and in-depth knowledge about copy number variations (CNVs) in the human genome. The number of microdeletion syndromes that are solely or largely caused by the absence of one gene among many is increasing (eg EHMT1 in 9q34 deletions,24 RAI1 in Smith–Magenis syndrome,25 and LIS1 in Miller–Dieker syndrome). Therefore, parental DNAs should be available to verify that a finding is de novo. Using different types of arrays, others have found that CNVs occur probably in all healthy individuals.26, 27, 28 It is not yet clear how much of the human genome that is quite tolerant to CNV, but it appears to be at least 12%.28 Eventually, we will know the identity of most genes that cannot be duplicated or deleted without affecting normal development or brain function.

Looking at individual patients with interesting phenotypes, it is noted that in case 3, an 9.3 Mb deletion of 2q32.3–2q33.1 was associated with Duane anomaly, autistic traits and ataxia. This deletion on chromosome 2 is 5 Mb distal to the published locus for Duane retraction syndrome type 2 (DURS2), mapped by linkage analysis to 2q31.29 Maybe the deletion is associated with an inversion disrupting the DURS2 gene. In case 4 with severe autism and self-destructive behaviour, a terminal deletion on 2q was found, a region previously found to be deleted in autistic patients.30 A patient with a similar-sized terminal 2q deletion who had severe behavioural disturbance was recently described.31 The deletion in case 17, where bifid thumbs and bifid first toes were found in addition to epilepsy and a small penis, includes the GLI3 gene. Deletions or mutations in this gene are associated with Greig cephalopolysyndactyly (OMIM 175700) and Pallister–Hall syndromes (OMIM 146510), and epilepsy and a small penis are also features of the latter. The small deletion of 7q22.1–7q31.1 in case 19 includes reelin (RELN), a gene that may be mutated in Norman–Roberts-type lissencephaly (OMIM 257320) and thought to be associated with autism and schizophrenia. This patient is growth retarded (length just below the 2.5 centile) but has a head circumference in the upper normal range (75–90 centile). Anterior synechiae and microcoria are peculiar features. In case 48, an 13.6 Mb deletion of 5q11.2–5q13.1 was associated with choanal atresia, macrocephaly and pronounced dysmorphism. In case 49, a similar-sized (11.3 Mb) but 6 Mb more distal deletion (5q12–5q13.2) was not associated with dysmorphic features but congenital deafness. The deletions overlap with 8 Mb. No dominant deafness genes or genes associated with choanal atresia or macrocephaly are known to be located in the interval. One gene with high expression in the central nervous system (CNS) that is located in the 3–4 Mb region deleted in case 49 but not in case 48 is MAP1B (microtubule-associated protein 1B). Heterozygosity for an insertional knockout in this gene leads to severe CNS affection in mice.32 Finally, case 33 (Table 1), with occult mosaicism for trisomy 14, deserves to be mentioned, as this is a rare type of trisomy mosaicism.33 It illustrates that such mosaicism, similar to our three patients with mosaic trisomy 9 (cases 25–27; Table 1), can be detected by CGH using blood sample DNA even though metaphase studies of blood lymphocyte cultures are completely normal.10, 34 The lowest HR-CGH-detected mosaicism grade was 15%, and this was found in a patient with trisomy 9 (case 27). Previously, mosaicism grade as low as 8% has been detected by 1 Mb array-CGH analysis.10

This work adds to our knowledge on minor and cryptic chromosome imbalances that cause varying degrees of mental impairment, malformations and/or dysmorphism. Mapping of genomic imbalances having distinct phenotypes may be useful not only for evaluation of the clinical importance of a de novo imbalance, but also for narrowing down regions of particular interest when searching for genes whose dosage is critical for normal development.