Introduction

Somatic chromosomal mosaicism is defined as the presence of two or more chromosomally distinct cell lines and represents a challenging task not only to detect but also to interpret. Owing to various factors including subtle phenotypic abnormalities, technical limitations, and inherent tissue-specificity, mosaicism is often undetected and underdiagnosed.

Advances in chromosomal microarray analysis (CMA) have transformed the microarray into a powerful genome-wide analysis tool able to detect a wide range of aberrations of the human genome in a clinical setting. Detection of mosaicism by CMA for aneuploidy as well as structural abnormalities has been previously reported.1, 2, 3, 4 In 2006 and 2007, Ballif et al. and Cheung et al. each concluded in their respective retrospective studies that the use of CMA via array comparative genomic hybridization (CGH) is more sensitive than traditional cytogenetics in the likelihood of detecting mosaicism and provides a more accurate first tier test.1, 2 However, Bi et al. recently presented evidence for the value of chromosome studies in conjunction with CMA to maximize the detection of mosaicism, citing the shortcomings of CMA in detecting low levels of mosaicism (<10%) for either abnormal or normal cell lines, and the presence of two or more abnormal cell lines involving the same region.5 In their study, six (0.16%) cases of mosaic abnormalities identified by chromosome analysis were undetected by CMA, four of which had a level of mosaicism <10% supporting the concern that array technologies are not reliable for routinely detecting very low-level mosaicism. However, mosaicism for whole chromosome aneuploidy, the most common type of mosaicism,6 has been detected as low as 5% by both CMA2 and single nucleotide polymorphism (SNP) arrays.3 In 2007, Lu et al. reported detection of a case of low level mosaicism for trisomy 14 by a bacterial artificial chromosome (BAC) array that subsequently was validated by fluorescence in situ hybridization (FISH) on cultured cells and chromosome analysis finding a mosaicism level of 2%.7

Until recently, most clinical array platforms have used either oligonucleotides or SNPs as probes, each having an advantage over the other with regard to detecting specific types of abnormalities that can be present in the mosaic state. Specifically, SNP arrays provide data not attainable by oligonucleotide-based CMA arrays such as loss of heterozygosity and copy-number-neutral changes which allows for detection of uniparental disomy. CMA has the flexibility of routinely detecting very small mosaic copy number changes such as intragenic, exonic copy number variants (CNVs), which can be limited in SNP arrays due to the random distribution of SNPs across the genome.5, 8 Therefore, to maximize detection, oligonucleotide platforms that include SNPs are becoming the prevalent array used for clinical diagnostics.3, 4, 9, 10

In 2009, our clinical diagnostic molecular cytogenetics laboratory added exonic coverage for over 1700 known disease and candidate genes to our custom-designed oligonucleotide array, making it possible to detect mosaicism for a CNV within a single gene. Four cases of mosaicism involving a CNV within a single gene were identified using this array and included in this study, two of which have previously been reported.8 Recently, there have been several reports of exonic mosaicism detected by array CGH11, 12, 13 underscoring the importance of detecting this type of aberration. For example, Castronovo et al. described a patient who presented with an atypical phenotype and, following array CGH, a low-level mosaic intragenic NSD1 deletion was discovered ultimately resulting in a diagnosis of Sotos syndrome.11 Saitsu et al. described a patient with a microdeletion involving a portion of the CASK gene that was inherited from his apparently healthy mother who had somatic mosaicism (20%) for the deletion.12 Bartnik et al. report the first cases (three cases) of mosaic exonic deletions of the gene CDKL5 whose mutations have been associated with X-linked dominant early infantile epileptic encephalopathy-2.13

Here we present the cases of mosaicism detected in a clinical cohort of 10 362 consecutive patients referred for clinical testing using our custom-designed, exon-targeted whole-genome oligonucleotide array. Specifically, our study highlights those cases with complex genomic rearrangements, multiple abnormal cells lines, exonic CNVs and where conventional cytogenetic and CMA methodologies are discordant.

Materials and methods

Subjects

In total, 10 362 consecutive patient samples were submitted to the Medical Genetics Laboratories at Baylor College of Medicine for CMA analysis during the period from June 2009 to February 2011. In all, 1063 of these cases that had concurrent CMA and chromosome analysis have been previously reported by Bi et al.5 The indication for ordering CMA was provided for 8898 cases. In total, 5138 patients studied had a clinical indication of cognitive impairment (primarily developmental delay, intellectual disability, and autism). This study focuses on the 57 cases in which mosaicism was detected by CMA using the version 8 oligoarray (V8 OLIGO).

Cytogenetic and FISH analyses

Routine cytogenetics was performed according to standard GTG-banding protocol, either concurrently with CMA or for confirmation of mosaicism. T cells were stimulated with PHA from peripheral blood collected in sodium heparin tubes and cultured for 72 h. The number of metaphases analyzed ranged from confirmation using only 5 cells, to standard 20 cells, up to 100 cells.

Confirmatory FISH analyses on cultured cells were performed with BACs or fosmid clones using standard procedures.

Chromosomal Microarray Analysis

BCM V8 OLIGO is a custom-designed array with approximately 180 000 interrogating oligonucleotides, manufactured by Agilent Technologies, Inc. (Santa Clara, CA, USA). This array contains the ‘best-performing’ oligonucleotides (oligos) selected from Agilent’s online library (eArray; https://earray.chem.agilent.com/earray/) and has been further empirically optimized. Genomic features of the V8 OLIGO design include interrogation of all known microdeletion and microduplication syndrome regions as well as pericentromeric and subtelomeric regions and computationally predicted NAHR-mediated genomic instability regions flanked by low-copy repeats (LCRs) as previously described.14 Also included is exonic coverage for 1700 selected known or candidate disease genes as well as introns greater than 10 kb. The entire nuclear genome is covered with an average resolution of 30 kb, excluding LCRs and other repetitive sequences.8

Calculation of mosaicism

All data analysis following feature extraction of the TIFF images were performed using in-house software written in the R open source statistical software. 15 Briefly, the normalized red (test) and green (control) intensities extracted from TIFF images are used for the calculation of binary logarithmic ratios (log2-ratio) at each probe. DNAcopy was used to identify regions of copy number loss or gain.16 Default settings in DNAcopy were used for segmentation, but with post processing to ensure at least three consecutive same sign probes with strong mean log2-ratios.

Mosaicism for CNV was modeled as a mixture of two cell populations; one cell population with a normal chromosome complement (either 46,XX or 46,XY), and the other with a single loss or gain at any particular locus (either segmental or whole chromosome). It is not possible to determine mosaic percentages for cases with multiple cell lines affecting the same overlapping region (see Table 1). We calculated the percentage of mosaicism using the formula

Table 1 Mosaic abnormalities detected by CMA

where δ =2logR and logR=mean[log2(Test/Control)].

Results

Of the 10 362 patients evaluated by the V8 OLIGO array, a total of 57 cases involving somatic mosaicism were identified (Table 1, Supplementary Table 1) and classified into the following categories: (1) aneuploidy, including sex chromosome abnormalities and isochromosomes (22 cases), (2) ring or marker chromosomes (12 cases), (3) single deletion/duplication CNVs (11 cases), (4) multiple deletion/duplication CNVs (5 cases), (5) exonic CNVs(4 cases), and (6) unbalanced translocations (3 cases). Each case was confirmed by either FISH, chromosome analysis, or both. The range of the level of mosaicism varied depending on the method of analysis: we calculated a range of 5–80% using CMA data; we observed a range from 10 to 93% by FISH analysis on cultured cells; and a range from 3 to 96% by G-banded chromosome analysis. In general, the percentage of mosaicism estimated from CMA is highly correlated when compared to that of FISH analysis (Spearman correlation 0.7255, P=8.4 × 10−6 or Pearson correlation 0.726, P=8.15 × 10−6, n=29) and chromosome analysis (Spearman correlation 0.791, P=.006 or Pearson correlation 0.832, P=.0015, n=11). Overall, 37% (21/57) of cases had two or more abnormal cell lines (ranging from two to five abnormal cell lines). Of the eight cases where parental studies were performed, all were found to be de novo.

There were 14 mosaicism cases studied concurrently by CMA and chromosome analysis. Ten cases were concordant (71%) and four cases had normal chromosome results (29%). Two of these four discordant cases (cases 4 and 8) were mosaic for aneuploidy while the remaining two cases (cases 47 and 54) involved CNVs beyond the resolution of standard chromosome analysis. Overall, mosaic CNVs not detectable by chromosome analysis (<3 Mb) were present in 18% (10/57) of all mosaic cases.

Aneuploidy

Aneuploidy including sex chromosomes and isochromosomes represents the most common type of mosaicism observed in 22 cases (39%) as shown in Table 1/Supplementary Table 1. The most frequent aneuploidy detected involved a sex chromosome (12 cases). Trisomy 8 was the second most common aneuploidy detected in four cases (cases 1–4, Supplementary Figure 1), followed by trisomy 9 in three cases (cases 5–7). The remaining mosaic aneuploidy cases include two cases of trisomy 14 (cases 8 and 9) and one case of trisomy 21 (case 10).

CMA and chromosome results were discordant for cases 4 and 8. For case 4, the chromosome results from a 20-cell analysis were normal while the CMA plot shows a very slight increase in the intensity ratio for the chromosome 8 region (Supplementary Figure 1, bottom plot), suggesting low-level mosaicism, which was calculated to be 5%. FISH confirmed the mosaicism for trisomy 8 to be present in 16% of cells (Supplementary Figures 1a,b,c). Case 8 was mosaic for trisomy 14 (estimated to be 16% by CMA with FISH confirmation showing 13% mosaicism) yet was normal by chromosome analysis of 30 cells.

In case 7, an additional cell line was detected from confirmatory FISH analysis that was not evident from the CMA results. The CMA results suggested mosaic trisomy 9 in 23% of cells. FISH confirmation performed on cultured cells detected trisomy 9 in 10% of cells which was consistent with the CMA estimation, but also revealed a second abnormal cell line with monosomy 9 in 9% of the cells. Overall, for the aneuploidy cases, the percent of mosaicism estimated by CMA and observed by FISH and/or chromosomes were concordant except for cases 6 and 12, which differed by 26 and 22 percentage points, respectively (Table 1).

Ring and marker chromosomes

Twelve of the mosaicism cases (21%) involved either a ring or a marker chromosome. Cases 23–28 involved cells with 45,X and 46,X,r(X). The ring X chromosome in these six cases varied in size from a small marker of 1 Mb in size to a large ring of 29 Mb (Table 1, Supplementary Table 1). CMA analysis of case 30 suggested a mosaic gain in copy number surrounding the centromeric region in chromosome 6 (15 Mb), suggestive of a mosaic ring for chromosome 6 (Figure 1a1). FISH studies not only confirmed this finding, but also showed three cell lines (Figure 1a2): 57% had one ring, 10% had 2 rings, and 33% contained no ring chromosomes. Each of the four cases with CMA percent mosaicism estimates was concordant with FISH results.

Figure 1
figure 1

Two illustrative cases with multiple abnormal cell lines. (a) Case 30 is a 20-year-old female with indications of moderate developmental delay, dysmorphic features and a known ring chromosome. (a1) The array plot for the pericentromeric region of chromosome 6 suggestive of two mosaic rings of different sizes as indicated by the green bars. (a2) Metaphase FISH analysis using probes for centromere 6 (green), 6q12 (RP11-586B7 in red), and 6p12.1 (RP11-1055O15 in red) confirmed the presence of two rings, each of different sizes. Interphase FISH shows the presence of multiple cell lines containing none, one or two ring 6 chromosomes. (b) Case 48 is a 10-year-old male with epilepsy. (b1) CMA shows two mosaic deletions of different sizes in the short arm of chromosome 17 (circle and square). (b2) Partial karyotype of chromosome 17 from case 48 showing a deletion in the short arm of chromosome 17 indicated by the arrow. Chromosome analysis indicated this deletion is present in 5 out of 30 metaphases (18%) analyzed. (b3 and b4) FISH analysis using multiple probes – subtelomere 17p probes cohybridized together (overlapping red and green), LIS1 (red), 17 centromere (aqua), and 17q21.1 control (green) – confirmed cell lines with two different deletions. The circled area on the plot shows a smaller deletion (17p13.3p13.3) present in all cells corresponding to b3. The deleted 17 on the left of the figure shows the 2 control probes as well as a red signal indicating that the LIS1 gene is still intact. The boxed area in the array plot shows a larger deletion (17p13.3p13.1) corresponding to b4. The chromosome 17 at the bottom of Figure 1b4 shows a deletion of the LIS1 gene as well as the subtelomere, with signals from only the 17q21 control and centromere probes.

Single deletions and duplications

There were 11 cases (cases 35–45) involving a simple segmental abnormality, that is, a single genomic CNV (Table 1, Supplementary Table 1). The size of the CNV ranged from 318 kb to whole-arm duplications. Eight of the eleven detected mosaic cases were deletions while the remaining three were duplications. Case 35 involves a 318 kb deletion encompassing the TBR1 gene in 61% (125/204) of cells. Further details of this case were reported by Burrage et al.17 For all cases in which a CMA estimate could be determined, results were concordant.

Multiple deletions and duplications

Five cases (cases 46–50, Table 1, Supplementary Table 1) involved multiple CNVs which required a combination of methodologies to fully characterize the abnormality. For example, Figure 1b shows case 48, with a complex de novo deletion of the p arm of chromosome 17. CMA results showed a complex loss in the 17p region, suggestive of mosaicism for two different cell lines (Figure 1b1). The concurrent chromosome analysis showed a deletion of 17p13.1 present in 18% (5/30) of cells (Figure 1b2). However, FISH studies revealed two abnormal cell lines. The first was present in all cells and consisted of a smaller deletion (1.832 Mb) in the p13.3 region (Figure 1b3). The second was present in 22% (9/40) of cells and consisted of a larger deletion (6.784 Mb) on the same homologue in the p13.3p13.1 region (Figure 1b4). Case 49 (Supplementary Figure 2) involves both duplication in the 22q13.31q13.32 region (from 46.09 to 48.12 Mb map position hg 18) as well as a deletion in the adjacent 22q13.32q13.33 region (from 48.16 to 49.52 Mb). CMA results suggested a complex gain in the proximal q-arm region with a loss in the distal q-arm region (Supplementary Figures 2a,b). Interphase FISH showed five different cell lines for the 22q13.31 region (Supplementary Figure 2c) with one (31%), two (10%), three (17%), four (36%), and five (6%) red target signals, respectively, as compared with the control (22q11.21) with two green signals. Upon further analysis, a ring chromosome structure was observed by metaphase FISH, confirming the duplication of the 22q13.31q13.32 segment (Supplementary Figure 2d).

Single genes

Four cases (cases 51–54, Table 1, Supplementary Table 1) in which copy number losses encompassed or disrupted a single gene were identified. The average number of probes covering the small regions in these four cases was 38 probes, with an average size CNV of 118 kb. Case 52 involves a 185 kb deletion of part of the TCF4 gene (Figure 2a–c). Mutations in this gene are associated with Pitt–Hopkins syndrome.18 Figure 2a shows the results of CMA, which suggest a mosaic loss in the long arm of chromosome 18 estimated at 61%. FISH analysis on cultured cells detected the mosaic 18q21.2 deletion in 16% of the cells (Figure 2b). The remaining three cases had percent mosaicism estimates by CMA concordant with FISH analysis.

Figure 2
figure 2

Case 52 is a 17-year-old male with dysmorphic features, attention deficit hyperactivity disorder, anxiety, self-mutilation and stiff joints. (a) An enlarged image of the array plot of chromosome 18q21 showing the region of copy number loss detected by 35 interrogating probes within the TCF4 gene which is associated with Pitt-Hopkins syndrome. (b) FISH on interphase cells confirming the mosaic deletion (probe RP11-111C17 in red at 18q21.2 and centromere 18 in green) present in 16% of cells (32/200) and a second normal interphase FISH cell. A partial FISH metaphase cell shows the deletion at 18q21.2. (c) Visualization of the deleted exons shown on the UCSC genome browser. (d) Case 55 is a 3-year-old female with developmental delay. Partial karyotype of chromosome 17 showing the additional material on 17p indicated by the arrow. Concurrent chromosome analysis revealed additional material of unknown origin on the short arm of chromosome 17, later revealed to be material from the short arm of chromosome 5. (e) The array plot showing a complete loss of material approximately 1.9 Mb in size in the 17p13.3 region, circled in red, in addition to a mosaic gain in the short arm of chromosome 5 of 28.6 Mb in size, circled in green. (f) A FISH probe hybridizing to the 5p15.2 region (D5S23) is shown in green, while the 17p13.3 region (RP11-411G7) is shown in red. Metaphase FISH shows a cell line with a derivative 17 chromosome [der(17)t(5;17)(p14.1;p13.3)], indicated by the green signal on the 17 and absence of a red signal. (g) A FISH probe hybridizing to the 5p14.1 region (RP11-15B5) is shown in green and 5p14.3 (RP11-44L10) in red. Metaphase FISH confirms trisomy for the 5p13.33p14.1 region seen in 40% of cells analyzed. The inverted DAPI insert shows a close-up of the derivative 17 chromosome with the material from the 5p. (h) FISH image showing the second cell line. FISH probes for 17p13.3 (RP11-411G7) in red and 5p15.2 (D5S23) in green showing a deletion in the short arm of chromosome 17 and absence of a derivative 17. Monosomy for the 17p13.3 region was observed in all cells.

Unbalanced translocations

Three cases (cases 55–57, Table 1, Supplementary Table 1) involving unbalanced translocations were detected by CMA, including two derivative X chromosomes, and a derivative 17. Concurrent chromosome analysis for case 55 (Figure 2d) revealed a chromosome 17 with additional material of unknown origin in 25% (5/20) of cells. CMA results for this case (Figure 2e) suggested an unbalanced translocation between chromosomes 5 and 17. Interestingly, the FISH studies not only confirmed the derivative 17 (Figures 2f and g) but further revealed a second abnormal cell line in which only one copy of 17p13.3 without the derivative 17 was present (Figure 2h), resulting in monosomy of 17p13.3 in all cells examined and trisomy of 5p13.33p14.1 in 25% of cells. Percent mosaicism estimates by CMA were available only for cases 55 and 56 and are concordant with FISH and chromosome results.

Discussion

Mosaicism is inherently challenging to detect, because by definition the abnormality will not be present in all cells. Since mosaicism can be confined to specific tissues and not all tissues can be feasibly sampled, it is currently impossible to truly determine the level of mosaicism for a given abnormality in an individual. Despite this seemingly insurmountable difficulty for genomic analyses, technological improvements have enabled the discovery of clinically significant mosaicism not detectable by conventional cytogenetic studies. In this study, we present cases of genomic mosaicism ranging in size from aneuploidy to exonic deletions, the smallest being 35 kb in size, identified by an exon-targeted high-resolution oligonucleotide array and highlight those cases with discordant results for the different methodologies utilized.

Although the true prevalence of mosaicism in the general population and its incidence are difficult to determine, the detection rate appears to be increasing as more sensitive technologies allow for greater detection. In 1982, Hansteen et al. detected three cases of true mosaicism out of 1830 (0.16%) consecutively born infants by chromosome analysis.19 In our study, somatic mosaicism was detected in a total of 57 cases (0.55%). Extrapolating from our detection rate, we see a slightly higher incidence of approximately 1 in 180 cases referred for CMA testing. Excluding those cases submitted with an indication of Turner or Down syndrome, which are more likely to have mosaicism, the detection rate was 0.49%. Our detection rate is comparable to previous oligonucleotide and BAC array CGH studies (Supplementary Table 2). In contrast, SNP arrays have been reported to have a higher mosaic detection rate (1%) although the rate is more comparable to this study if only comparing blood samples.3 While SNP array platforms are considered more sensitive due to the additional data available from the B allele frequency3, 4, 20, a limitation may still exist for detecting single exon CNVs since SNPs predominate in non-coding regions.13

The various cytogenetic techniques utilized for detection of mosaicism each have their own strengths and weaknesses. While CMA is able to analyze total DNA from all cells in a given sample (mostly from peripheral blood), conventional cytogenetic analysis relies on PHA-stimulated T cells. The stimulation and culturing of cells introduces a bias and can lead to a missed diagnosis of mosaicism.1, 2 There are known syndromes with mosaic chromosome abnormalities, such as the isochromosome 12p seen in Pallister–Killian syndrome (PKS), that are typically not detected by chromosome studies on peripheral blood.21 A recent study by Bi et al. examined the value of conventional cytogenetics in the context of an ever-increasing use of CMA technology and concluded that, in relation to detection of mosaicism, chromosome studies remain valuable in detecting low-level mosaicism for either abnormal or normal cell lines.5 However, detection of this low level of mosaicism by chromosome studies requires a large number of metaphases to be analyzed, which is labor intensive, and may still not detect mosaicism as exemplified by cases 4 and 8 in this study. Although extremely low levels (<10%) of mosaicism may not reliably be detected by CMA, it should be noted that case 4, which showed an estimated 5% mosaicism for trisomy 8, was detected by CMA only. Without dispute, chromosome studies are especially valuable for mosaic cases with multiple cell lines affecting the same overlapping region, rendering the calculation of mosaicism using log2 intensity values ineffective. FISH provides a clear advantage over chromosome studies in the ability to quickly and easily score hundreds of interphase cells to determine the level of mosaicism, but only when the abnormality has been detected by another assay such as CMA or chromosome analysis, making FISH unsatisfactory as an initial test for detecting unsuspected mosaicism. Both FISH and chromosome analysis aid CMA by providing a visual confirmation, which helps elucidate the complete genome picture and the spectrum of mechanistically observed end products, such as case 48 involving a complex deletion. Most interestingly, over a third (21/57) of the cases unexpectedly had multiple abnormal cell lines revealed by FISH and/or chromosome analysis that was not always apparent by CMA. More specifically, over half (13/21) of these cases involved chromosome X. This highlights the importance of performing additional cytogenetic studies for detection of multiple cell lines especially for chromosome X.

Somatic mosaicism detected by CMA but missed by chromosome analysis has been previously reported.1, 2, 7, 22 Our array detected two cases of mosaicism, both whole chromosome aneuploidies, that should have been detected by conventional cytogenetic analyses (>3 Mb) yet were normal. Case 4, mosaic for trisomy 8, was normal by chromosome analysis of 20 cells and case 8 was mosaic for trisomy 14 yet was normal by chromosome analysis of 30 cells.23 Cheung et al. reported 9 out of 12 mosaic cases first detected by CMA with previously normal routine chromosome analysis and 8 of the 9 cases involved whole chromosome aneuploidy.1 Ballif et al. reported 6 out of 18 mosaic cases with a normal karyotype. Three of the six were aneuploidy, while two involved deletions >5 Mb in size, and one involved a mosaic derivative chromosome.2 Hoang et al. reported four cases out of twelve for mosaicism with previously normal karyotypes, where three of the four were aneuploidy.22 Overall, there appears to be susceptibility for mosaic aneuploidies to be missed by conventional cytogenetics, suggesting a selection process occurring in cultured cells in favor of normal cells. Similar to a mosaic isochromosome 12p, the cytogenetic hallmark of PKS, CMA has been shown to be able to detect low levels of mosaicism of the i(12p) in peripheral blood of PKS individuals while the abnormality is not always detected by FISH or chromosome analysis.24

One of the main advantages of CMA over chromosome analysis is its ability to detect small CNVs, enabling discovery of mosaicism for abnormalities not previously detectable. Ten cases (18%) involved CNVs (<3 Mb) that would not be expected to be detectable by conventional cytogenetic studies. Due to the extensive exonic coverage of the custom array used in this study, we were able to detect CNVs involving single genes in four cases, three of which are known to be disease-causing. Case 52 involves a 185 kb (38 probes) deletion encompassing the TCF4 gene in which haploinsufficiency is known to cause Pitt-Hopkins syndrome.18 The finding is consistent with the patient’s phenotype, which includes dysmorphic features, ADHD, anxiety, self-mutilation, and stiff joints.8 Only two other cases have ever been reported to involve a somatic mosaic deletion of TCF4.25, 26 Case 53 involves a 134 kb (32 probes) de novo deletion encompassing exons 2–5 of the CASK gene. Deletions involving this gene have been reported in patients with intellectual disability and microcephaly with pontine and cerebellar hypoplasia (OMIM# 300749) and Ohtahara syndrome.19 The phenotype of this five-year-old male patient (developmental delay, microcephaly, craniofacial dysmorphic features, and speech delay) appears consistent with this diagnosis. Case 54 involved a 35 kb (18 probes) deletion in the IL1RAPL1 gene which is associated with X-linked intellectual disability (OMIM# 300206), and also consistent with the patient’s phenotype.8 Each of these cases involved deletions far beyond the resolution of conventional cytogenetics and would have been missed without CMA testing. Exonic coverage allows for detection of mosaicism at an unprecedented level and highlights the potential for mosaicism involving exons of genes, as a group of abnormalities, to be an under-recognized cause of phenotypic abnormality.

Importantly, there were 35% (20 of 57) of cases where we were unable to estimate the mosaicism levels. These cases typically involved multiple abnormal cell lines and presented the most challenge for interpretation by CMA alone. One abnormality category especially affected is the ring/marker group. Because of the unstable nature of ring/marker chromosomes, they are most frequently seen in the mosaic state and it is not unusual to observe multiple cell lines, such as in case 30 which had cell lines containing none, one or two ring 6 chromosomes. Most interestingly, case 49 showed an astonishing five different cell lines. While FISH analysis showed a ring chromosome structure for this case, this type of chromosome abnormality could also be the result of an inverted duplication and terminal deletion as reported by others.27, 28, 29 The most common formation of ring chromosomes involves breakage in both arms of a chromosome, resulting in a loss of the distal fragments, followed by fusion of the proximal broken ends as exemplified in cases 23–31.30 In contrast, some ring chromosome formations are complex rearrangements leading to concurrent deletion and contiguous duplication such as those seen in case 49. While the ring itself appears to have remained stable as indicated by the consistent normal copy number for the control probe, the distal duplicated region at band 22q13 is unstable as evidenced by multiple cell lines ranging in CNVs from one to five suggesting a post-zygotic event involving a multiple-step process. The initial event was likely a terminal deletion [del(22)(q13.32)] that was rescued by a classical ring formation in 31% of cells (Supplementary Figure 2C). In the other 59% of cells, telomere healing via a breakage-fusion-bridge cycle may have occurred, leading to an unstable dicentric chromosome. Its subsequent uneven break likely resulted in a formation of an inverted dup(22)(q13.31q13.32). The head-to-head structure is further supported by the inverted duplication pattern of FISH signals (Supplementary Figure 2E). In 17% of cells, this chromosome was rescued by a ring formation similar to the ring formation of other acrocentric chromosomes.28 In the remaining cells, the unstable inv dup/del chromosome may have been rescued independently two times by aberrant DNA replication such as ‘fork stalling and template switching’ (FoSTeS)31 or a microhomology-mediated break-induced replication,32 resulting in the q13.31q13.32 fragment being triplicated (36%) and quadruplicated (6%) (Supplementary Figure 2D) and forming different ring chromosomes. Replication errors have been recently proposed to explain similar terminal triplication and complex dup-trp/inv-dup structures.33 However, the limited resolution of CMA did not reveal such complex structures in our case. Interestingly, the fact that we did not observe any double rings in chromosome analysis and the unchanged log ratio of the 22pter-q13.32 segment in the CMA data are consistent with the mechanism for formation of inv dup del type of rearrangments.

In addition, when the ring or marker chromosome is present in low levels or exists in more than one cell line, its presence can be masked by the major cell line. For example, in case 24, the whole-genome array plot depicts a copy number loss of the entire X chromosome plus a suspicion of a ring X chromosome (Supplementary Figure 3A). However, the array plot for the X chromosome only did not exhibit a pattern suggesting a ring X chromosome (Supplementary Figure 3B). The presence of the ring chromosome was clearly evident from chromosome analysis which showed it to be in 15% of the cells (Supplementary Figure 3C) and also provided an estimate of the size of the ring which was not discernible by CMA. Similarly, in case 20, the array profile depicted a copy number loss of what appears to be the entire X chromosome (Supplementary Figure 4A,B) while FISH analysis revealed the presence of an isodicentric X chromosome from Xp11.2 to Xqter in 8% of the cells with the remaining cells showing 45,X (Supplementary Figure 4C,D).

Not unsurprisingly, the category with the fewest number of detected mosaic cases were those involving unbalanced translocations, as illustrated by case 53 (Figure 2d–h). Mosaicism for an unbalanced translocation is rare. Hsu et al. reported 555 (0.3%) cases of chromosomal mosaicism in 179 665 prenatal cases and found only 1 unbalanced translocation out of the 57 cases that were mosaic for autosomal structural abnormalities.34 Recently, Gijsbers et al. reported three cases with mosaic unbalanced translocations using SNP microarray and reviewed the literature and found a total of 10 cases.35 In addition, Robberecht et al. studied nine cases of mosaic structural imbalances, three of which were unbalanced translocations.36 Our cases together with those of Gijsbers and Robberecht suggest that array technologies may assist with the detection of more of these types of cases.

The ability to analyze total genomic DNA content in a given sample may provide for a more accurate determination of the level of mosaicism as compared to methods involving selected cell populations such as G-banded chromosome analysis. Overall, our calculation of percent mosaicism by CMA has shown close concordance compared to FISH and/or chromosome results (Table 1). However, three cases showed a variation of greater than 20 percentage points between CMA and FISH: case 6 involving a whole chromosome 9 (51% CMA vs 25% FISH), case 12 involving a whole X chromosome (33% CMA vs 55% FISH), and case 52, a 185 kb deletion encompassing the TCF4 gene (61% CMA vs 16% FISH). In two of the three cases the estimate of mosaicism was greater for CMA. We speculate this is because CMA analyzes total DNA while FISH was performed on cultured, PHA-stimulated T-cells, perhaps leading to selection against abnormal cells. Recently, Thompson and Compton showed that the majority of lagging chromosomes during cell division do not missegregate but instead segregate accurately to end up in micronuclei in daughter cells.37 It is therefore possible that these lagging chromosomes in micronuclei can be recovered during DNA extraction for CMA but lost during the process of preparing interphase nuclei for FISH analysis, such as in case 6. In contrast, it is also possible that certain abnormal cells are selected for, or normal cell lines are selected against, in cultured samples as seen in case 12.

In summary, our study reinforces the effectiveness of CMA in the detection of somatic mosaicism for a wide variety of genomic abnormalities and demonstrates that the estimated level of mosaicism determined by CMA is comparable to the levels determined by FISH and/or chromosome analysis. Furthermore, our mosaicism detection rate of approximately 0.5% is in close concordance with that published by others. Importantly, with the addition of exon coverage on the array used for clinical testing, mosaicism can be detected at an unprecedented level of resolution involving exons of single genes opening new opportunities to determine the cause of a subset of phenotypic abnormalities that have been previously under-ascertained. While the methodologies utilized in this study have various strengths and weaknesses as presented, used collectively they each contribute a piece of the complex picture aptly known as mosaicism.