Main

Dilated cardiomyopathy (DCM) is a disease of the cardiac muscle, characterized by left ventricular enlargement and systolic dysfunction. It is responsible for significant morbidity and mortality and is a leading indication for heart failure and transplant.1 The prevalence of idiopathic DCM is estimated to be 1 in 2500. Approximately 30–50% of these patients have a family history, indicating an underlying genetic etiology in an important proportion (reviewed in Ref.2). To date, mutations in >30 genes have been implicated in causing DCM, typically showing autosomal dominant inheritance. Unlike hypertrophic cardiomyopathy (HCM), which is considered to be mainly a disease of the cardiac sarcomere, the genetic etiology of DCM is more diverse. Affected structures include the sarcomere, the Z-disk, the nuclear lamina, the intermediate filaments, and the dystrophin-associated glycoprotein complex (reviewed in Ref.3). Because of the high cost of dideoxy sequencing, most previous multigene studies have screened only a subset of genes or tested a small number of patients.412 Therefore, accurate mutation detection rates for any of these genes are difficult to define. Clinical genetic testing laboratories must attempt a balance between maximizing clinical sensitivity (i.e., the number of genes tested) and minimizing test price and turn around time (TAT). For disorders that are characterized by a high degree of locus and allelic heterogeneity, genes with low variant detection rates are, therefore, often not included in a clinical testing panel.

Because of the availability of novel sequencing platforms, it is now possible to screen a much larger number of genes simultaneously and, thereby, improve clinical sensitivity. One of these novel sequencing platforms is the Affymetrix GeneChip® Resequencing Array, a customizable high-density oligonucleotide microarray. This platform has been used successfully to sequence mitochondrial, bacterial, or small eukaryotic genomes and selected genes for specific diseases, such as retinitis pigmentosa, congenital myasthenic syndromes, and HCM.1318 Although initial studies showed high false negative and positive rates, improvements including redundant tiling and custom bioinformatics algorithms have enabled adaptation into clinical services at a reasonable analytical sensitivity.19 Parallel processing of a large number of genes can result in a gain in labor, time, and cost efficiency as outlined for an HCM resequencing array.14 However, it is important to point out that this effect needs to be confirmed for every new test because individual parameters can vary greatly depending on the selected genes and/or the laboratory performing the test.

We used this platform to develop a novel resequencing test for DCM that screens the entire coding sequence and splice sites of 19 genes implicated in DCM. Ten of these genes were already part of existing clinical DCM tests in our laboratory and elsewhere, and included the sarcomere genes, MYBPC3,4,9,12 MYH7,46,9,11,12 TNNT2,47,9,11,12 TNNI3,6 TPM1,4,6,12 and ACTC,12 the Z-disk gene, LDB3,5,20,21 the nuclear lamina gene, LMNA,12,2227 the cardiac Ca++ATPase inhibitor gene, PLN,11 and the Barth syndrome gene, TAZ.28 Nine new genes were added based on published evidence, with a focus on mutation detection rate and additional supporting evidence, such as absence in healthy controls and/or familial segregation analyses (Tables 1 and 6 and references within). Emphasis was given to genes whose products are part of the continuum of structures already known to be affected in DCM patients (such as the Z-disk and the nuclear lamina). Specifically, we included emerin (EMD), a nuclear lamina protein known for its role in Emery-Dreifuss muscular dystrophy, which can present with DCM2931 (reviewed in Ref.32), and the intermediate filament gene desmin (DES), known for its role not only in myofibrillar myopathy but also in isolated DCM.3336 Furthermore, we added several genes encoding the Z-disk proteins alpha actinin (ACTN28), cardiac muscle LIM protein (CSRP38,3740), titin cap/telethonin (TCAP37,39,41), and metavinculin (VCL4244). Dominant mutations in these genes have been found in DCM patients, and for TCAP, recessive inheritance has been described for patients with of limb girdle muscular dystrophy 2G.45 Another gene, delta sarcoglycan (SGCD) is part of the dystrophin-associated protein complex, and recessive mutations have been implicated in limb girdle muscular dystrophy 2F. More recently, dominant SGCD mutations have been described in isolated DCM.46,47 Finally, two genes whose products are not part of the above-mentioned structures were the cytokine cardiotrophin (CTF148) and the ATP sensitive potassium channel ABCC9.49,50 Mutations in several additional genes of interest have been described, but these genes were not included because of various reasons. For example, titin (TTN, reviewed in Ref.51) could not be included because of the large size (80 kb) of its coding sequence. Several other genes were not considered at this point because of promising but as of yet insufficient evidence that is awaiting further confirmation and characterization by the scientific community (e.g., PSEN1,52 PSEN2,52 LAMA4,53 and DMN12).

Table 1 Contents of the DCM CardioChip
Table 6 Clinical sensitivity

Here, we describe the technical validation of the DCM CardioChip, its analytical performance parameters, and its significant impact on test cost and TAT. We also discuss the results obtained from testing 73 DCM probands and provide a projected clinical sensitivity.

MATERIALS AND METHODS

Samples

Patient samples

This study was approved by the Partners HealthCare and Brigham and Women's Hospital Institutional Review Boards. Samples were used as follows: DCM CardioChip validation: (a) Positive controls: samples with known sequence alterations were anonymized and used to create a set of 7 “pooled positive controls” including 199 known sequence alterations. The DNA source for individual amplicons came from multiple patients with unique mutations allowing for each control to contain between 20 and 56 mutations per pool. (b) Additional technical validation: complete dideoxy sequencing data and chip-based sequencing data were generated for 25 anonymized samples that were previously tested and did not contain any disease causing variants in a subset of the genes present on the chip (MYH7, MYBPC3, TNNT2, TNNI3, TPM1, ACTC, LDB3, PLN, LMNA, and TAZ). Clinical sensitivity: 73 patient samples were tested to estimate the clinical sensitivity. Cohort 1 included the 25 samples used for technical validation, which represent patients with DCM from a broad referral population. Cohort 2 comprised 48 samples from well-characterized patients seen at the Brigham and Women's Hospital in Boston. Patients were not included if there was significant coronary artery disease, uncontrolled hypertension, significant valvular disease, or other identifiable cause of DCM (e.g., peripartum and anthracycline exposure). Age at diagnosis was 40.5 ± 14.2 years, 76% were male gender; 92% were provider or self-identified Caucasian, the remaining were African American (6%) or multiracial (2%). A definite or potential family history of DCM was present at 58% and 23% of probands, respectively. Left ventricular ejection fraction was reduced in all subjects (mean 29 ± 12%), and none had features suggestive of HCM, arrhythmogenic right ventricular cardiomyopathy, or left ventricular noncompaction. Eleven patients (23%) died or required cardiac transplantation or placement of a left ventricular assist device.

Healthy control samples

Novel variants were genotyped in ethnically matched control samples.

Caucasian (n = 200).

Subjects were disease free and older than the age of 18 years. “Disease” was defined as current or past diagnoses made by a medical care provider, which required medication or other forms of treatment or therapy. Surgery was considered an exclusionary criterion, as was the use of prescription medications or regularly used over the counter medications. The subjects completed a structured questionnaire that detailed current and past history of illness and treatment. All responses were confirmed by clinical interview by physicians. Physical examinations were completed at the study visit.

African American (n = 200).

These were anonymized samples collected through the Crimson sample collection mechanism at Brigham and Women's Hospital (the Crimson collection is a large cohort of subjects, without a chronic disease diagnosis, who where seen at the Brigham and Women's Hospital). All subjects were between the ages of 15 and 45 years and were originally chosen to be controls for an asthma genetics study.

Genotyping (healthy controls)

Sequenom I-Plex technology was used. Multiplex polymerase chain reaction (PCR) assays were designed using Sequenom SpectroDESIGNER software (version 3.0.0.3; Sequenom, San Diego, CA) by inputting sequence containing the single nucleotide polymorphism (SNP) site and the 100-bp flanking sequences on either side of the variant. The variants were grouped into multiplexes, and reactions were performed using standard conditions.

PCR and Sanger sequencing

Exons were amplified by PCR using standard conditions (15 μL reactions containing 5 ng genomic DNA, 2 mM MgCl2, 0.75 μL dimethyl sulfoxide, 1 M betaine, 0.2 mM dNTPs, 20 pmol primers, and 0.2 μL AmpliTaq Gold® (Applied Biosystems) using the following cycling conditions: 95°C for 10 minutes; 95°C for 30 seconds, 60°C for 30 seconds, and 72°C for 1 minute for 30 cycles; and 72°C for 10 minutes. PCR products are purified with Ampure® Magnetic Beads (Agencourt) and then directly sequenced bidirectionally by dye-terminator sequencing using a universal primer. Sequencing products are purified using Cleanseq™ Magnetic Beads (Agencourt) and separated by capillary electrophoresis on an ABI 3730 DNA Analyzer (Applied Biosystems).

DCM CardioChip custom design

Gene chip specifications

The DCM CardioChip is a “49-format” array (accommodating up to 300 kb of DNA sequencing capacity) with 8-μm features and was designed using a shift mask design requiring 17 masks.

Content

Coding exons and ±10 flanking intronic bases for 19 genes (39,533 bases) involved in DCM (Table 1).

Redundant probe sets

To increase call rates, a triplicate set of probes were tiled for each base. This amounted to a minimum of 24 probes per base (4 probes per strand × 2 strands per base × triplicate tiling). In addition, to improve variant detection and call rates, each of the three probe sets contained a different position for interrogation (positions 9, 13, and 17 within the 25-nucleotide probe).

Minor allele tiling

To avoid allele dropout of mutations through lack of hybridization because of the presence of a nearby SNP, an additional triplicate set of sequencing probes surrounding common SNPs was tiled. These additional sequences contained the minor allele and allowed correct hybridization whenever the minor allele of a common variant was present in a patient (Fig. 1).

Fig. 1
figure 1

Design and data output. A, The basic design of a resequencing chip: each base is interrogated by four probes, containing the four possible bases, tiled in a quadrant (four additional probes query the reverse strand and are not shown). Typical hybridization patterns obtained from samples that are homozygous wild type or heterozygous for a variant are shown to the right. B, Triple tiling and minor allele tiling strategy: each probe is tiled in triplicate, with the interrogating position located at the 9th, 13th, and 17th positions on the 25-bp probe (arrows). C, When a known SNP (e.g., C>T) is located near the interrogating position, an additional set of triplicate probes were tiled containing the minor allele. D, Representative data output after the integrated GCOS, GSEQ, and BRLMM analysis. All positions requiring confirmatory analysis are listed in the “follow-up” section (top). For each position, the gene, exon, and wild type base is indicated. In the “call” column, triplicate calls are listed except when the presence of a nearby SNP triggered the use of the three additional minor allele tiling data sets.

Genotyping

The DCM CardioChip also contains genotyping probes for the variant allele of known disease-causing DCM variants and all common SNPs (see Tables, Supplemental Digital Content 1 and 2, http://links.lww.com/GIM/A98 and http://links.lww.com/GIM/A99). Together with the probe signals from wild type sequencing data, genotyping calls were made using the Bayesian Robust Linear Modeling using Mahalanobis distance (BRLMM) algorithm (http://www.affymetrix.com/support/technical/whitepapers/brlmm_whitepaper.pdf). Default parameters recommended by Affymetrix were used (quant-norm.sketch = 50,000, PM-only, brlmm.transform = ccs, K = 4, MS = 0.5).

Bioinformatics pipeline

The files generated by the Affymetrix Gene Chip Operating System (GCOS) software were processed with GSEQ version 4.1 batch analysis using default settings with the following exceptions: no signal threshold = 1.0, genome model = 0, quality score threshold = 1.0 (Affymetrix recommends 3), and base reliability threshold across samples = 0. BRLMM was used to derive genotypes for common SNPs and pathogenic variants and the use of the “minor allele” resequencing data sets was triggered whenever a heterozygous or homozygous SNP was detected. BRLMM was also used to derive genotyping calls for known disease-causing variants. The output was exported, triplicate tilings and genotyping calls were aligned, and a final list of variant calls and no-call bases was generated. All variant calls made by the chip and bases without call were confirmed by dideoxy sequencing. (See Figure, Supplemental Digital Content 3, http://links.lww.com/GIM/A100 for a diagram of the bioinformatic pipeline.)

Chip assay

DNA was amplified using 101 multiplexes (2–3 primer pairs each) and 37 singleplex reactions (15 μL reaction volume containing 2.5 mM MgCl2, 1 M betaine monohydrate, 5.3% dimethyl sulfoxide, 133 μM dNTPs, and 1 U AmpliTaq Gold [ABI]). Primer sequences are available on request. Cycling conditions were as follows: 1 × (95°C for 10 minutes), 30 × (95°C for 30 seconds, 60°C for 30 seconds, and 72°C for 30 seconds), and 1 × (72°C for 10 minutes). Products were pooled using empirically determined amounts (generally 6–8 μL). PCR products were cleaned using the QIAquick PCR Purification Kit (Qiagen). Samples were eluted in 45 μL and then fragmented and labeled using the GeneChip Resequencing Assay Kit (Affymetrix) according to the manufacturer's instructions. Hybridization was performed using a GeneChip Hybridization Oven 640 (Affymetrix). A GeneChip Fluidics Station 450 (Affymetrix) was used for the washing and staining protocols, and the GCOS version 1.4.0.036 software was used for fluidics and scanning.

RESULTS

Validation and test development approach

The DCM CardioChip was designed to resequence the coding region and splice junctions of 19 DCM genes (40 kb) in triplicate. An additional set of triplicate sequences containing the minor alleles of common variants and a set of genotyping probes querying 281 known disease-causing variants were included to increase sensitivity. The chip's technical performance was validated by comparing array-based sequencing data to the current “gold standard” (dideoxy sequencing), each generated for 25 samples and for a set of samples enriched for a large number of known sequence alterations. A bioinformatics pipeline integrating triplicate resequencing data with available genotyping calls was developed. The final clinical test consists of the array-based sequencing assay, followed by dideoxy sequencing-based confirmatory follow-up of all variant calls.

The DCM CardioChip added 9 new genes to the 10 for which clinical resequencing was already offered. To project the chip's overall clinical sensitivity and to increase the likelihood of identifying novel variants in the new genes, we selected 73 samples from patients in whom no clinically significant variants were identified in the 10 previously tested genes (MYH7, MYBPC3, TNNT2, TNNI3, TPM1, ACTC1, LMNA, LDB3, PLN, and TAZ).

Analytical performance parameters

Call rate

The raw data were processed using the Affymetrix GCOS software, followed by the GSEQ version 4.1 batch analysis. The average call rate was 98.47% (n = 42, 95.47–99.00%), which is comparable with other published resequencing arrays.1318

Confirmatory follow-up

Each final base call was derived from a set of three independent calls (generated by the three redundant probe sets, including three additional sets of minor allele probe sets when common heterozygous or homozygous SNPs were present) and the genotyping calls for known DCM mutations whenever available. All ambiguous calls, variant calls, and bases with no call were sequenced by dideoxy sequencing leading to a final assay call rate of 100%.

Analytical sensitivity

A series of 7 “pooled positive controls” containing 199 unique sequence alterations (166 substitutions and 33 insertions, deletions, or insertion/deletions) across the 268-coding exons covered by the DCM CardioChip were generated to determine the analytical sensitivity of the test. Because of the presence of several variants in more than 1 sample, the total number of variants analyzed was 1109 (989 substitutions and 120 insertions and deletions).

Substitutions.

Of the unique substitutions, 5 of 166 (3.0%) were missed by the chip using only the resequencing portion (Table 2). Consistent with previously published results, the GC content in all three probe sets was high at these positions (56–84%; Table 3).13,54 When all substitutions present were included, the false negative rate was 18 of 989 (1.82%). This corresponds to the false negative rate for novel substitutions variants (for which no additional genotyping probes are present on the chip). When the genotyping portion of the chip was added to the analysis and only variants were analyzed for which genotyping probes were present (n = 124), the false negative substitution rate dropped to 0% (Table 3). This constitutes the false negative rate for known substitution variants. In summary, the analytical sensitivity of the DCM CardioChip is 100% for known substitutions and 98.0% for novel substitutions.

Table 2 Analytical sensitivity
Table 3 False negative substitutions
Insertions and deletions (insertion/deletions).

It is well established that novel insertions and deletions are difficult to detect on hybridization-based resequencing arrays.13,19 However, because they often lead to aberrant calls in neighboring bases, they are sometimes detected, especially, in a clinical setting, where every variant call or no-call is followed up on by an independent test.19 Our validation set consisted of 33 unique (120 total) insertions, deletions, or insertion/deletions (23 deletions, 8 insertions, and 2 insertion/deletions), ranging from 1 to 20 bases, with most being <5 bases (n = 25; Table 4). When all 120 insertion/deletions across all experiments were analyzed, the false negative rate was 58.33% (Table 2). However, we noticed that there was a high degree of variation with regard to how often variants that were present in more than one sample were missed. Hence, we performed a “best case/worst case” analysis of the detection rates. Under the worst case scenario, a variant was scored as “not detected” when it was missed once (but detected several other times, e.g., “No-Yes-Yes-Yes”). Conversely, under the best case scenario, a variant was scored as “detected” if it was detected at least once, regardless of how many times it was missed (e.g., “Yes, No, No, No”). Using this approach, the false negative rate for insertions and deletions ranged from 30.30% to 84.85% (Tables 2 and 4), which is consistent with previously described detection rates.55,56 Inclusion of genotyping data for known insertion/deletions reduced the false negative rate to 6.33–18.18%. Of note, the detectability did not correlate with the size of the insertion or deletion (Table 4). Whether a given insertion or deletion will be detected by the array is highly dependent on the sequence content of the variant and the surrounding sequence. The false negative rates described here can, therefore, only serve as an estimate, although the true false negative rate for insertions and deletions is expected to lie in between the two extremes.

Table 4 Detection of insertions and deletions

Specificity

The resequencing portion of the chip had an average false positive rate of 0.09%, (36 of 39,533 bases queried per experiment) and, therefore, an analytical specificity of 99.91%. Because in a clinical setting, each positive call is followed up by an independent assay (dideoxy sequencing in our laboratory), the final false positive rate of the test is close to 0% (equal to the false positive rate of dideoxy sequencing).

Cost and TAT analysis

Because of the complexity of many genetic tests, their cost and TAT can be extremely high compared with other clinical tests. For genetically heterogeneous diseases such as DCM, diagnostic laboratories must strive to increase clinical sensitivity by including more genes. However, in the past, the limiting factor has been the high labor and reagent cost of Sanger sequencing. Fokstuen et al.14 reported substantial cost savings comparing a similar array covering 12 HCM genes with conventional dideoxy-based sequencing. Because laboratory practices and workflows, array designs, and analytical performance parameters, all of which impact test cost, can differ greatly, we conducted a similar analysis and compared the test cost and TAT of the DCM CardioChip with two Sanger-based tests: the previously offered DCM panels (10 genes, 2 tiers of tests) and a hypothetical test covering all of the 19 genes present on the DCM chip. Our laboratory considers the three following factors the main drivers of test cost: labor, reagents/laboratory ware, and test development expenses. Compared with Sanger sequencing of all 19 genes, the greatest savings using the DCM Cardiochip occurred with labor (55% reduction) and reagents/laboratory ware (51% reduction). Besides the savings that occur through parallel processing of a large number of genes, the main factor leading to a reduction in labor is the reduced amount of manual data review, which is analyzed automatically by an informatics pipeline. In our laboratory, the only manual review left is that associated with the confirmatory analysis of calls made by the chip, which is still done by Sanger sequencing. Overall, the DCM CardioChip is still almost 50% cheaper compared with a hypothetical Sanger sequencing test covering the same genes and almost 30% cheaper than the previously offered two DCM panels, which covered only 10 of the 19 genes present on the chip. The TAT of the DCM CardioChip since test launch has averaged at about 7 weeks. This is a 50% reduction compared with the hypothetical Sanger sequencing test and a 30% reduction compared with the previously offered DCM panels.

Detected variants and clinical sensitivity

Sixteen missense, splice, and frameshift variants (4 of them previously reported) were identified in 17 patients (Table 5). All other patients remained negative for clinically significant variants. All novel variants were genotyped in a minimum of 300 ethnically matched control chromosomes (black and/or white, see Materials and Methods). Variants identified in individuals of Hispanic origin or of unknown race were genotyped in both cohorts, although, here, absence from controls has to be interpreted with more caution as it remains possible that the individuals were of neither racial origin. Silent variants were assumed to be benign and were not genotyped in controls (Table 5).

Table 5 Variants identified

Variants of likely clinical significance

A variant was considered likely to be clinically significant if it was absent from healthy controls and showed a high degree of evolutionary conservation or was predicted to impact splicing or truncate/degrade the protein through a frameshift or nonsense mutation mechanism. Seven variants (c.897 + 4_897 + 5del in DES, p.Ala50Thr in CSRP3, p.Val115Met, p.Asp841fs, and p.His775Tyr in ACTN2, and p.Arg105Gln and p.Arg1125Cys in VCL) met these criteria (Table 5 and Table, Supplemental Digital Content 2, http://links.lww.com/GIM/A99). It is important to point out that current genetic tools may be insufficient to adequately identify rare benign variants. On the one hand, recent genome-wide analyses suggest that the majority of missense variants have a deleterious effect of some magnitude (although it may be insufficient to cause overt disease) and that a variant's population frequency correlates with its potential clinical significance.57 However, another recent study estimated that the number of control chromosomes used by us and the majority of similar studies (200–400) will likely only identify 80% of variants with a frequency of 0.1% or higher and that at least 3000 control individuals are necessary to rule out that a variant is not a rare benign change.58 Therefore, additional genetic evidence such as functional studies or segregation with disease would be needed to confirm the significance of variants for which pathogenicity is implicated based on evolutionary conservation and absence from a small control cohort alone.

Variants of unknown clinical significance

An additional three variants were also absent from controls but were considered to be of unknown clinical significance (Table 5 and Table, Supplemental Digital Content 2, http://links.lww.com/GIM/A99), because the affected amino acid was less conserved (p.Thr179Ala in CSRP3; Figure, Supplemental Digital Content 4, http://links.lww.com/GIM/A101), the variant (p.Ala732Thr in ACTN2) was found in the presence of another variant of likely clinical significance (p.Asp841fs in ACTN2), or the variant was present in dbSNP (p.His636Arg in VCL), although no clinical information or population frequency was available.

Variants of unlikely clinical significance

Finally, five missense variants and one potential splice variant were present in controls at varying frequencies and are most likely benign when present in isolation. Four of them (p.Val734Ile in ABCC9, p.Ala213Val, and p.Val459Ile in DES and p.Trp4Arg in CSRP3) have been previously published as potentially disease causing and warrant a more detailed discussion here. The p.Val734Ile variant (ABCC9) has been detected in 1.7–3.2% of Caucasians (rs61688134). Although it was associated with an increased risk of myocardial infarction,50 its high population frequency excludes it from being a highly penetrant mutation for DCM (the population prevalence of DCM is much lower than the population frequency of this variant). The p.Ala213Val variant (DES) has been reported by several studies and was initially thought to be disease causing.33,34,59 However, two subsequent studies identified this variant in a total of 6 of 570 control chromosomes (1% population frequency).36,60 In addition, it did not segregate with disease in one family.36 This variant is also present in dbSNP (rs41272699) although no frequency information is available. The p.Val459Ile variant (DES) is one of two previously reported desmin variants located in the tail domain of the protein (the majority of all disease-causing desmin variants are located in the central rod domain). It has been reported in dbSNP (rs73991549, no frequency information available). In addition, it has been reported in 2 of 69 black patients with DCM (2 of 138 chromosomes) and in 2 of 200 black Coriell control chromosomes (but was absent from 300 Caucasian control chromosomes). The authors concluded that it is a low penetrance variant restricted to the black population based on a subtle effect on desmin filament assembly in vitro.36 We identified this variant in one black DCM proband as part of our validation study. In our opinion, its presence in 1% of black control chromosomes, presence in black patients with DCM with a similar frequency (1.4%), and location outside the domain harboring the majority of reported desmin variants suggest that it is insufficient to cause disease when present in isolation although a modifying effect cannot be ruled out. The p.Trp4Arg variant (CSRP3) has been reported, but the available data surrounding it is conflicting. One study detected this variant in 10 of 536 DCM probands, not in 640 control chromosomes, and provided functional data supporting a pathogenic role.39 However, additional studies detected the p.Trp4Arg variant in healthy control chromosomes (0.02% Caucasian,8,37,38,40 4 of 334 Caucasian and 2 of 376 black chromosomes; our own data). In addition, the variant was detected in patients with HCM and DCM,8,37,38,40 which further argues against a primary disease-causing role, because the molecular mechanisms underlying the two cardiomyopathies are fundamentally different.

Clinical sensitivity

Because no single published study analyzed all 19 genes present on the DCM CardioChip, it is difficult to derive an accurate clinical sensitivity for this test. However, a meta-analysis of all published studies (each having tested overlapping subsets of these genes) combined with the data presented here allows a projection.

The DCM CardioChip added 9 new genes to 10 previously available clinical tests, for which combined published data suggest a detection rate of 22% (Table 6). Therefore, our cohort of 73 patients, who did not carry clinically significant variants in these 10 previously tested genes, corresponds to 94 untested patients. Ten of these patients carried variants of likely (n = 7) or unknown clinical significance (n = 3, one present with another variant). Taking both variants of likely and unknown significance into account, the projected maximum detection rate is 10.7% (10 of 94). However, it is likely that some of these variants are rare benign variants, particularly in light of the fact that the sizes of our control cohorts were moderate. Therefore, a more conservative detection rate for the nine new genes on the DCM CardioChip, including only the variants of likely significance, is 7.5% (7 of 94). It is important to point out that, because of the small size of our cohort and most previously published cohorts, this number remains an estimate and that the true clinical sensitivity for these nine genes will emerge over time. In summary, these data combined with all previously published data project the overall clinical sensitivity of the DCM CardioChip to be 26–29%.

DISCUSSION

The combination of marked locus and allelic heterogeneity of DCM poses a substantial challenge to sequencing-based clinical genetic testing to identify disease-causing mutations. Although traditional dideoxy-based sequencing technology is still considered the gold standard for molecular diagnostics, it is too costly to allow comprehensive screening of the myriad of DCM genes. To reduce test cost and TAT, many laboratories have offered reflexive testing strategies, where genes are grouped into panels allowing a sequential analysis starting with the panel offering the highest detection rate and proceeding with additional panels only if this initial test is negative. Although cost effective for those patients carrying a disease-causing variant in the commonly mutated genes, this strategy is extremely costly and time consuming for all other patients. In addition, rare cases of compound or double heterozygotes may be missed by traditional analysis if the first panel identifies a clinically significant variant and the subsequent/second panel is not performed. Our analyses show that the DCM CardioChip is significantly cheaper (almost 50%) and quicker (7 weeks TAT vs. an estimated 14 weeks) compared with a Sanger-based sequencing test covering the same 19 genes. Parallel processing of all genes also eliminates the danger of missing compound and double heterozygotes.

Our array-based resequencing test contains 19 DCM genes, 10 of which were previously offered in 2 separate test panels. Based on available published data, variants in the nine newly added genes (ACTN2, VCL, CSRP3, TCAP, ABCC9, CTF1, SGCD, DES, and EMD) are generally rare and less well studied, which previously prevented inclusion into a clinical test. Our results suggest that the overall added sensitivity for these genes may be at least 7.5%, which is in the same range as what has been reported across the few available studies so far (Table 6 and references within). However, because our cohort and most other cohorts analyzed to date are small, the true clinical sensitivity for this set of genes is going to emerge over time.

The DCM CardioChip has a high analytical sensitivity for substitution variants but a reduced ability to detect novel insertions and deletions, which is a well-established limitation of array-based resequencing tests. Although it is a general limitation of the technology, it may not be significantly limiting for clinical testing of DCM because insertion/deletion mutations are generally rare among the main, well-characterized DCM genes (MYBPC3, MYH7, TNNT2, TNNI3, TPM1, ACTC, LMNA, LDB3, and PLN). Only MYBPC3 is known to have a significant prevalence of insertions and deletions, but this seems to be true for only HCM (internal unpublished observations). The variant spectrum of all other genes (ACTN2, VCL, CSRP3, TCAP, ABCC9, CTF1, SGCD, DES, and EMD) has not been well characterized, and mutations in these genes are generally rare. As such, the DCM CardioChip is a worthwhile compromise considering that cost-effective screening of these genes could otherwise not be offered at this point. The genetic basis for DCM is only partially understood today, and new genes are being discovered at a rapid pace. For example, promising candidates that have been discovered since the DCM CardioChip was designed include the cardiac ankyrin repeat protein, ANKRD1,61 the RNA-binding protein RBM20,62 and the Z-disk protein nexilin, NXN.63 Together with other genes that were not included into this design of the DCM CardioChip for various reasons outlined earlier (see Introduction), there is already a large and growing number of genes under consideration for an expanded DCM test. However, given the large number of additional genes, the size limitations of array-based sequencing chips, and the relative difficulty of adding new content, it is likely that further genes will be added when new sequencing platforms such as Next Generation technologies are established in clinical testing laboratories.