Introduction

The estimated frequency of balanced chromosome rearrangements in a population of unselected newborns is 0.52%.1 The majority of these translocations, insertions and inversions is transmitted from one of the parents and not associated with abnormal phenotypes.2 In 1991, Warburton3 reported data on the frequency and outcome of cases with apparently balanced, de novo, rearrangements detected at amniocentesis in over 350 000 pregnancies. She found that a microscopically balanced, de novo, reciprocal translocation was detected in 1 out of every 2000 pregnancies. The frequency of congenital abnormalities in fetuses and newborns with de novo, reciprocal translocations or inversions has been estimated at 6.1 and 9.4%, respectively.3 This is more than twice as high as the risk of 2–3% in the general population. The increased number of abnormal phenotypes can be caused by: (1) a microdeletion or microduplication at the translocation or inversion breakpoint(s) which is only detectable by high-resolution techniques, (2) disruption or modulation of the expression of gene(s) located at the breakpoint(s) and (3) otherwise inactivation (position effect) of gene(s) at the breakpoint region(s). Thus, an apparently balanced, de novo, chromosome rearrangement can underlie an abnormal phenotype, but it may also be coincidental. The actual confirmation or rejection of causality by detecting a cryptic deletion or duplication at the assumed breakpoints or elsewhere in the genome is often lacking. The unbalanced nature of small rearrangements will most often escape detection, as the resolution of standard cytogenetic banding techniques is only 5–10 Mb. It has already been shown that the yield of chromosome abnormalities in patients with developmental delay (DD) and/or multiple congenital anomalies (MCA) increases considerably with the resolution of the technique used. A microscopically visible chromosome abnormality can be detected by routine karyotyping in 3–5% of all DD/MCA patients, excluding Down's syndrome,4, 5, 6 whereas genome-wide array-based techniques are able to detect a chromosome imbalance in up to 15–20% of such cases.6, 7, 8

Recent studies have reported on genome-wide array analysis used to identify cryptic imbalances in cohorts of DD/MCA patients with an apparently balanced, de novo, chromosome rearrangement (Table 1).9, 10, 11, 12, 13, 14, 15 A cryptic imbalance was detected by genome-wide array analysis in 33–100% of DD/MCA patients with a de novo chromosome rearrangement. In the majority of patients, the imbalance was detected at one or more breakpoints, although a large percentage of imbalances (15–40%) was found elsewhere in the genome. The frequency of detected imbalances is significantly higher in patients with a more complex chromosome rearrangement (CCR), involving more than two chromosomes and/or more than two breakpoints.9, 10, 14 In all studies, the reported imbalances were assumed to cause the abnormal phenotype.

Table 1 Overview of genome-wide array studies in patients with apparently balanced, de novo chromosome rearrangements and abnormal phenotypes

In contrast to the studies performed in DD/MCA patients, Baptista et al12, 16 compared a cohort of 31 phenotypically normal individuals carrying a balanced chromosome rearrangement with a cohort of 14 DD/MCA patients. No genomic imbalances at the breakpoints, or elsewhere in the genome, were detected in the 31 normal carriers, whereas a disease-causing imbalance was detected in 4 out of 14 DD/MCA patients. The authors concluded that translocations in patients with a clinically abnormal phenotype are molecularly distinct from those in normal individuals. An unexpected finding was that the frequency of gene disruption due to a chromosome rearrangement did not differ between phenotypically abnormal patients and the normal study population.12 However, the percentage of disrupted genes that have a role in the nervous system was higher in the phenotypically abnormal patients.

Since there is limited data on patients with apparently balanced chromosome rearrangements, we decided to evaluate the results obtained from genome-wide array analysis in a cohort of 54 DD/MCA patients and a cytogenetically balanced, de novo, chromosome rearrangement. Since this is the largest postnatal cohort of DD/MCA patients with de novo balanced rearrangements reported thus far, we were able to improve the estimated percentage of submicroscopic imbalances detected by genome-wide array analysis in de novo chromosome rearrangements. The size of the cohort also enabled us to relate the probability of finding an imbalance to the clinical phenotype of the patient by using the De Vries scoring system.17

Patients and methods

Patient selection

Clinical data and samples were collected from 54 patients with an apparently balanced, de novo, structural chromosome rearrangement. All patients had been referred for karyotyping because of DD and/or MCA and were enrolled in the study for diagnostic purposes. All chromosome rearrangements were detected by routine cytogenetic analysis at a minimum band level of 500: 46 patients carried a two-breakpoint rearrangement; 40 patients had a reciprocal translocation, while 6 patients carried an inversion. Eight patients had a CCR with at least three breakpoints.

All patients, parents or legal representatives gave informed consent for this study, according to local guidelines.

Collection of clinical data

Clinical data were derived from medical records using a standardized form. Additional information was requested from the referring clinicians whenever necessary. All patients were scored according to adapted De Vries criteria, which provided a checklist for patients with submicroscopic subtelomeric rearrangements (Table 2).17 Family history was replaced by DD in this scoring system, because a positive family history, either compatible or non-compatible with Mendelian inheritance, does not enhance the chance of finding imbalances in the breakpoint regions in patients with a de novo chromosome rearrangement. In contrast, the De Vries criteria were developed for patients with an intellectual disability, while not all the patients in our study had a DD. Therefore, one and two points were given for mild-to-moderate and severe DD, respectively. Severe DD was defined as a Developmental Quotient <30, while mild-to-moderate DD was a Developmental Quotient between 30 and 70. In this way, the maximum number of points that could be scored remained 10 (Table 2).

Table 2 De Vries score and adjusted De Vries score for assessing clinical phenotypes

Genome-wide array analysis

Array analysis with an average genome-wide resolution of 200 kb was performed using either an Agilent 105k or 244k oligo array, a 32k BAC array as previously described,18 or the Affymetrix 250k SNP array platform,19 following the protocols provided by the manufacturers (Agilent Technologies and Affymetrix Inc., Santa Clara, CA, USA).

For the Agilent array reference DNA, a mixture of 40 male or female DNA samples of the same gender was used as control. The data were processed using Feature Extraction V.9.1 and CGH analytics V.3.4.27 provided by the manufacturer (Agilent Technologies). For the SNP array experiments, copy number estimates were determined using the updated version 2.0 of the CNAG (Copy Number Analyzer for Affymetrix GeneChip mapping) software package.20 The normalized ratios were then analyzed for genomic imbalances by a standard Hidden Markov Model, essentially as described before.18 The SNP array data obtained from patient DNA were compared with SNP array data from 10 healthy, sex-matched individuals.

Regardless of the array platform employed, genome-wide data analysis was performed using previously determined criteria which provide 95% confidence of representing a true copy number variation (CNV).21 A CNV was considered significant if five or more consecutive probes showed a single copy number loss (N=1), or at least seven consecutive SNPs showed a single copy number gain (N=3) for the Affymetrix array, or four or more consecutive probes showed gains or losses for the Agilent array. For interpretation purposes, various public web sources were consulted, including the Online Mendelian Inheritance of Man (http://www.ncbi.nlm.nih.gov/Omim), the DECIPHER database (http://decipher.sanger.ac.uk) and ECARUCA (http://www.ecaruca.net). A CNV was considered a normal genomic variant if it had been detected in at least three control individuals as reported in the Database of Genomic Variants (http://projects.tcag.ca/variants), and/or been encountered in at least three in-house control samples. Data analyses were based on the NCBI36/hg18 build of the human genome.

Fluorescent in situ hybridization analysis

To validate the gains or losses identified by genome-wide array analysis, region-specific fluorescent in situ hybridization (FISH) was performed following routine protocols. Bacterial Artificial Chromosome (BAC) clones were selected from the human library RPCI-11 according to the UCSC Human Genome Assembly (freeze March 2006) and kindly provided by the Wellcome Trust Sanger Institute (http://www.sanger.ac.uk) or obtained from the 32k set of BAC DNAs in the Nijmegen laboratory. BAC DNA was indirectly labeled with biotin- or digoxigenin-11-dUTP using Nick translation. Slides were hybridized overnight at 37 °C and fluorescently labeled with FITC or Texas Red.

Multiplex ligation-dependent probe amplification

To validate the gains identified by array analysis, region-specific multiplex ligation-dependent probe amplification (MLPA) was performed. For each region, two uniquely sized probes were developed in accordance with a protocol provided by MRC Holland (Amsterdam, the Netherlands). Ten probes were combined in one MLPA assay together with a DNA quantity and a DNA denaturation control mix (EK-1 kit, MRC Holland). The procedure was further carried out as described by De Vries et al.18

Results

In this study, 54 patients with an apparently balanced, de novo chromosome rearrangement and an abnormal clinical phenotype were analyzed for submicroscopic chromosome imbalances by genome-wide array analysis. Forty-six patients had a two-breakpoint rearrangement upon routine karyotyping. In eight patients, a more complex aberration was found. All patients had facial dysmorphisms and/or congenital malformations and 46 out of 52 patients (88%) showed DD, varying from mild psychomotor retardation and speech delay to severe DD. Development could not be assessed in two patients because they died at the age of 1 day and 2 months, respectively (patients 12 and 43). A detailed description of all the phenotypes is presented in Table 3.

Table 3 Overview of all patients giving karyotype, phenotype and De Vries score

The total number of CNVs, including well-known benign CNVs, detected by the platforms used ranged from 2 to 12 with an average of 5.6 per patient (Table 3). All the potentially causative, copy number alterations detected by array could be confirmed by FISH (losses), MLPA (gains) or an independent array platform.

CNVs at or near the breakpoint regions

In 11 out of 54 patients (20%), the apparently balanced rearrangement was found to be unbalanced at the breakpoint region(s). We found no gains but 16 losses in these 11 patients in total (Table 4A). The size of the losses varied from 0.1 to 15.3 Mb. Seven patients had a single loss, two patients (6 and 53) had losses at multiple breakpoints and two patients (49 and 51) had multiple, non-overlapping losses in one breakpoint region. Patient 51 also had a loss elsewhere in the genome (Table 4B). In patient 6, with a loss at both breakpoints (1p22.1 and 6q15), the 1.1-Mb loss of chromosome 1 appeared to contain no known genes. The 4.25-Mb deletion in patient 42 contained the FBN1 gene, explaining the observed Marfan phenotype.22

Table 4 Summary of patients with an imbalance at the breakpoint regions (A) or elsewhere in the genome (B) detected by genome-wide array analysis

Five out of forty-six (11%) patients with a two-breakpoint chromosome rearrangement had a cryptic imbalance related to their reciprocal translocation. No imbalances were found related to inversions (n=6). Six out of eight (75%) patients with a CCR (more than two breakpoints) appeared to have an abnormal genome profile upon array analysis. All of these six patients had losses at the breakpoint regions.

Imbalances elsewhere in the genome

Copy number changes elsewhere in the genome were present in seven patients: six patients had a reciprocal translocation and one patient had a CCR (Table 4B). Six losses and three gains not related to the breakpoints were detected in total in these seven patients. Only the patient with a CCR (51) had additional copy number alterations at one of the breakpoint regions (Table 4A). In this and two other patients (30 and 32), the respective CNV was inherited from a healthy parent. Patient 29 had three imbalances: two losses were de novo (on the paternal allele) and one loss was also observed in his healthy father.

In an adult patient (10) with a translocation (1;17)(p36.1;q11), a 650-kb gain in 1p34.1 was found. Unfortunately, this patient's parents were not available for further investigation. A partially overlapping, de novo 650 kb gain was found in a clinically more severely affected boy (18). Both gains overlap a 450-kb region in 1p34.1.

The last imbalance detected elsewhere was a 270-kb deletion at 5p13.1 in a patient with a translocation (2;10) (patient 16). Unfortunately, this patient's parents were not available for further investigation.

Clinical criteria

All but three patients (n=51) could be scored according to the adapted clinical De Vries criteria (Tables 2 and 3).17 Patient 43 died 1 day postpartum, patient 12 died at the age of 2 months, and patient 52 had Sotos syndrome due to an NSD1 mutation interfering with the phenotype. The distribution of the scores is shown in Supplementary Figure 1. All patients with a chromosome imbalance in the breakpoint region (n=11) had a score of at least 3 with a median score of 5 (range 3–9). The highest score was found in the patient with imbalances both at a breakpoint and elsewhere (score 9 in patient 51). Patients with a chromosome imbalance restricted to elsewhere in the genome (n=6) tended to have a lower score (median 4.5, range 1–6). One of the imbalances in this group was considered not clinically relevant (see Discussion and Table 4B). Correction for this patient 32 led to a median score of 5 (range 1–6). The median score in the total group with a possibly clinically relevant CNV (n=16) was 5 (range 1–9), while the median score in the group without a significant CNV (n=35) was 3 (range 0–7). The difference between these two groups is significant (P=0.002, Mann–Whitney U-test).

Discussion

In this study, 54 patients with an apparently balanced, de novo chromosome rearrangement were examined by high-resolution genome-wide array analysis. The mean number of CNVs, including well-known recurrent copy number polymorphisms, that was detected was 5.6 per patient (range 2–12). In general, the number of CNVs detected per patient depends on the platform and detection thresholds used, but the number found in our study does not differ substantially from patients without apparently balanced rearrangements.18, 23

Out of 46 patients with a two-breakpoint chromosome rearrangement, 11 (25%) appeared to have an abnormal genome profile encompassing six losses, each at one of the breakpoints in five patients, and five losses and three gains elsewhere in the genome in six patients. From analysis of their parents, two of the latter category could be specified as rare, inherited CNVs. Six out of eight patients with a CCR were found to have one or more clinically significant losses at one of the breakpoints. In addition, one of these six patients had a paternally inherited imbalance elsewhere in the genome. Although the overall percentage of patients with a cryptic or submicroscopic, clinically significant imbalance in this cohort is 31%, there is a remarkable difference between patients with a two-breakpoint chromosome rearrangement (24%) and those with a more complex rearrangement (75%).

The number of imbalances seen in our patient cohort is similar to the studies of Sismani et al11 and Baptista et al,12 but lower than the studies of others (Table 1).9, 10, 13, 14, 15 This might be due to differences in patient selection (reflected in the high number of aberrations found elsewhere in the genome in the studies of Gribble, Higgins and Gijsbers) and to the higher number of complex rearrangements studied by De Gregori. Compiling the data of the previous and present studies, we conclude that in almost half of the patients with a de novo chromosome rearrangement, a genomic imbalance can be detected by genome-wide array analysis. We confirmed that, in complex rearrangements, the chance of finding copy number alterations at the breakpoints is very high: 75 and 72%, in our study and the combined studies, respectively.

Imbalances are not always located at breakpoints

In most patients (20%) with clinically relevant copy number alterations, the imbalance is detected in or near the breakpoints of the chromosomes involved (Table 4A). However, in 13% an imbalance is found elsewhere in the genome (Table 4B). As shown here and in previous studies, this was especially true for two-breakpoint de novo aberrations. In 19% of all patients with a two-breakpoint rearrangement, imbalances are found elsewhere. Especially in these cases, the clinical significance of the detected CNVs should be determined by parental analysis, among other investigations. The observed percentage of 19% is in agreement with the general figure of 17% of imbalances that is found in the DD/MCA population.7, 8 These results underline the importance of a genome-wide approach in patients with an apparently balanced, de novo chromosome rearrangement. If imbalances are found independent of the rearrangement breakpoints, this may have implications for the recurrence risk and warrants studies in the parents to exclude cryptic balanced translocations and insertions. Furthermore, it is crucial to critically examine an apparently balanced rearrangement after initial detection, because they are often more complex than they appear at first.

Losses are more frequent than gains at breakpoints

The clinically significant imbalances at the breakpoint regions found in this study were all deletions. Breakpoint deletions are more frequent in patients with a CCR than in patients with a two-breakpoint rearrangement. In the present study, we detected deletions in six out of eight CCR patients (75%). This is comparable to the results of De Gregori et al10 and Schluth-Bolard et al,14 who detected deletions in 69 and 80% of patients with a de novo CCR, respectively. Thus, deletions may be the main cause of phenotypic abnormalities in patients with a CCR.

The preponderance of deletions is similar to the results of others (Table 1).9, 10, 12, 13, 14 Recently, Howarth et al24 showed that in breast cancer cell lines reciprocal translocations arising during mitosis may result in both deletions (up to 31 Mb) and duplications (up to 200 kb) at the breakpoint regions. They demonstrated that the underlying mechanism most likely is stalled replication bubbles during the interchromosomal exchange. De novo constitutional translocations have their origin during meiosis. Nonetheless, the same mechanism may cause imbalances during meiotic interchromosomal exchanges. That we and others did not find breakpoint duplications in DD/MCA patients might be explained by their size (often under the detection threshold) and the fact that small duplications rarely result in a phenotype.

Clinical significance of the detected imbalances

The size of the deletions and gains in our patients ranged from 100 kb to 15.3 Mb and from 240 to 650 kb, respectively. In patient 6 with deletions at both breakpoints, the abnormal phenotype was considered to be a consequence of the 9.2-Mb deletion at chromosome 6, because the small deletion at chromosome 1 did not contain any known genes. All other breakpoint deletions were considered pathogenic based on the criteria mentioned in Methods.

In four of the seven patients with an imbalance elsewhere in the genome, the imbalance was found to be inherited from a clinically unaffected parent. The deletion 16p13.11 (patient 30) and deletion 1q21.1 (patient 51) are known microdeletion syndromes with variable phenotypes.25, 26, 27 Patient 51 also carries two significant losses at a breakpoint region, but we cannot exclude that the 1q21.1 deletion also contributes to the phenotype. The maternally inherited gain in 1q23.3 (patient 32) was considered unlikely to be clinically relevant because a larger gain has been detected in two control individuals from one study in the Database of Genomic Variants.23 The paternally inherited loss in patient 29 is in a gene-less region of 12p11.22 and therefore likely to be benign. Of the two de novo losses in the same patient (29), the 4.9-Mb loss in 2q33.3q34 is most likely to be clinically relevant. The 9q21.12q21.1 loss has not been detected before; and thus, its clinical significance remains uncertain, although a contribution to the clinical phenotype of patient 29 cannot be excluded.

The 650-kb gain in 1p34 in patient 10 is not a known polymorphism according to the Database of Genomic Variants, and is only partially overlapping gains that have been found in normal individuals (Nijmegen and Groningen in-house control data). Patient 18 had a similarly sized duplication, of which 450 kb overlapped with the gain of patient 10. The distal 200 kb, non-overlapping region, contains several genes, including POMGNT1. The phenotype of patient 18 is similar to previously published patients with larger overlapping duplications that included this gene.28

The 270-kb loss in 5p13.1 (patient 16) is not a known polymorphism but only contains the LIFR gene involved in autosomal recessive Stuve-Wiedemann syndrome, although the patient's clinical features do not resemble this syndrome. Unfortunately, the parents were unavailable for further studies and the clinical significance of the deletion remains unclear, as no similar microdeletion has been found in controls or other patients so far.

Thus, in at least four of the seven patients with imbalances elsewhere, the detected imbalance was considered to contribute to the abnormal phenotype.

Clinical features pointing to an imbalance

All 16 patients with a potentially clinically relevant CNV showed DD, ranging from mild psychomotor or speech delay (in five patients) to severe DD (in seven patients). As discussed above, the gain in patient 32 with severe DD was, in retrospect, considered very unlikely to be causative for the phenotype. If we had only analyzed patients with an adapted De Vries score >3, we would not have missed any clinically relevant chromosome imbalances at the breakpoint regions (Supplementary Figure 1). This is in line with the results of the original study using De Vries criteria: all patients with a subtelomeric aberration had a De Vries score of at least 3.17

Two out of six patients with an aberration elsewhere in the genome had a score <3. This concerned the maternally inherited 1q23 gain in patient 32 (score 2) that was considered unlikely to be clinically relevant, and one 1p34 gain in patient 10 of uncertain clinical relevance (score 1). The median De Vries score of all 14 patients with a certainly clinically relevant CNV (Table 4) was 5 (range 3–9), while in the 35 patients without a relevant CNV the median score was 3 (range 0–7). Three patients could not be scored (see Results), and two patients had an imbalance of uncertain clinical relevance.

Other mechanisms causing DD/MCA in balanced rearrangements

A truly balanced, de novo chromosome rearrangement may still contribute to an abnormal clinical phenotype due to disruption of a gene or due to a position effect. An example of the former was seen in patient 45 who appeared to have a disruption of the TCF4 gene at 18q21.1, as described in a previous study.29 Conventional methods for mapping chromosome breakpoints, such as FISH, are laborious, and often fail to identify the disrupted gene. Combining DNA array hybridization with chromosome sorting improves the efficiency of breakpoint mapping, but can only be applied when the physical properties of the derivative chromosomes allow them to be flow sorted. Nowadays more efficient and accurate breakpoint identification can be performed by next-generation paired-end sequencing.30

A position effect was most likely responsible for the split-hand-feet syndrome (SHFM) in patient 37 with an inversion breakpoint in 7q near the SHFM1 locus and the candidate genes DSS1, DLX5 and DLX6.31

Conclusion

The combined results of our study and previous reports show that in 79/171 (46%) of DD/MCA patients with a de novo chromosome rearrangement, a genomic imbalance could be detected by genome-wide array analysis. In patients with a rearrangement involving more than two breakpoints, there is a high chance of detecting an imbalance at one of the breakpoints (21/29; 72%). In two-breakpoint rearrangements, an imbalance located at a breakpoint was detected in 26/142 (18%) patients. However, a substantial number of imbalances were also detected outside the breakpoint regions: in 33/171 (19%) patients, an imbalance was found elsewhere in the genome, which is comparable to the general DD/MCA population. In conclusion, diagnostic studies should not only focus on the rearrangement breakpoints, but a genome-wide approach should be used to investigate patients with apparently balanced, de novo chromosome rearrangements.