Main

Huntington disease (HD), which affects approximately 1 in 10,000 individuals, is an adult-onset, autosomal dominant neurodegenerative disease. People affected by this fatal disorder exhibit progressive chorea, rigidity, and dementia as well as various psychotic and behavioral symptoms. A juvenile form of HD that results in more severe symptoms and rapid progression has also been observed.1

The HD gene (IT15) was mapped to 4p16.3,2 is composed of 67 exons spanning 180 kb3 and encodes a 348 kDa protein of unknown function.4 HD is caused by an expansion of an unstable polymorphic trinucleotide (CAG)n repeat in the first exon of the HD gene.4 Normal alleles have 26 or fewer repeats.5 Mutable normal alleles have 27–35 repeats and can be meiotically unstable.6 Alleles with 36–39 repeats have reduced disease penetrance, whereas alleles with 40 or more repeats are fully penetrant6.

The discovery of the HD gene and the association of the (CAG)n repeat length with the risk for developing HD made predictive genetic testing possible. The molecular detection of an allele associated with HD or confirmation of its absence, has an important, and potentially life-altering effect on patients and their families. Because the difference between a normal allele and one that has the potential to cause HD can be as little as one CAG repeat, it is especially important for clinical assays to be very accurate.

Accurate determination of the CAG repeat length in the HD gene is technically challenging and difficult to interpret. There are no commercially available FDA-approved test kits and, until recently, no reference materials (RMs)7 for HD genetic testing were available. Thus, each clinical laboratory offering this test develops its own in-house polymerase chain reaction (PCR)-based assay and RMs. Although testing for HD has been available for more than 10 years, only one Huntington cell line has been developed for use as a quality control (QC) material.8 QC and RMs are urgently needed by the genetic testing community to facilitate accurate size detection of HD alleles.

RMs are essential for many aspects of genetic testing. Regulatory requirements and recommendations of professional societies stipulate that positive controls should be run at least once each day in which samples are run and in the same manner as patient specimens to detect errors due to test system failure or operator error.6,918 In addition, QC and RMs are needed for test development and validation, lot-testing of new reagent batches, and for performance evaluation (proficiency testing and external quality assurance [PT/EQA]) programs.

The lack of available QC and RMs for genetic testing has been recognized as a critical need of the genetic testing community.19,20 To address this need, the Centers for Disease Control and Prevention (CDC), the National Institutes for Health, and the National Institute of Standards and Technology (NIST) held a series of three meetings to develop a sustainable process to collect, store, validate, and distribute RMs (mainly focused on genomic DNA) at a reasonable cost.10

Based on recommendations from the meeting participants, a new CDC-based program, the Genetic Testing Reference Material Coordination Program21 (GeT-RM, formerly called the Genetic Testing Quality Control Materials Program [GTQC]), was established in partnership with the genetics community. The goal of this program is to coordinate a self-sustaining community process to improve the availability of appropriate and characterized RMs for QC, proficiency testing, test development, and research. The GeT-RM Program is coordinated by the CDC, but all of the actual work, including decisions about RM priorities, mutation confirmation schemes, specimen collection, material development, and characterization, occurs through voluntary cooperation of the laboratories in the genetics community.

This article describes the collaborative development of characterized genomic DNA RMs derived from 14 Huntington cell lines by the GeT-RM program and the genetics community.

METHODS

Cell lines and DNA preparation

Fourteen Epstein-Barr virus-transformed lymphoblast cell lines containing HD alleles with repeat sizes ranging from 15 to 100 were selected from the National Institute of General Medical Sciences Human Genetic Cell Repository at Coriell Cell Repositories. The cell lines were cultured using previously described methods.22 Approximately 2 mg of DNA was prepared from each of the selected cell lines by Coriell Cell Repositories.23

Laboratory selection

A total of 10 clinical genetic laboratories that offer HD testing volunteered to participate in this mutation confirmation study. Nine of the laboratories are in the United States; one is in Vancouver, British Columbia, Canada.

Assay methods

Each of the 10 clinical laboratories used their own in-house developed PCR assay to determine allele sizes in all of the DNA samples. PCR primers were designed by each laboratory to hybridize just 5′ and 3′ of the CAG repeat region in exon 1. The location of primers for each laboratory is summarized in Figure 1.

Fig 1
figure 1

Location of PCR primers flanking the CAG repeat region of the HD gene. The CAG trinucleotide repeats are indicated by shaded boxes. Calculation of the number of CAG repeats is specific to the primers used. The forward and reverse primers used by each laboratory are indicated. The numbers next to each primer correspond to individual laboratories. Primer set 11 (NIST) was used for amplification and DNA sequence analysis. The arrows indicate the direction of DNA synthesis from the primers. Primers used to measure the CCG repeats are not shown but are described in “Methods.”

In addition, three of the laboratories utilized a second 3′ primer that allowed the amplification of the polymorphic CCG repeat region (green text, Fig. 1) that is just 3′ to the CAG repeats. Use of these downstream primers allowed discrimination between alleles when the CAG repeat sizes seemed to be homozygous. In one laboratory (Laboratory 5, Fig. 1), the size of the CCG repeat region was only determined in the case of apparent homozygosity. Laboratory 5 used downstream primer 5′GGCTGAGGAAGCTGAGGAG for this indication. Two other laboratories routinely measured the CCG repeat region in each sample. One laboratory (Laboratory 1, Fig. 1) used a multiple dye multiplex PCR assay with separation on an ABI3100 capillary electrophoresis instrument. This laboratory utilized primers pairs: (1) flanking only the CAG repeat (5′CCTTCGAGTCCCTCAAGTCCTTC and 5′HEX-GGCGGCGGTGGCGGCTGTTG), (2) flanking only the CCG repeat (5′NED- AGCAGCAGCAGCAACAGCC and 5′GGCTGAGGAAGCTGAGGAG), and (3) flanking both repeats (5′CCTTCGAGTCCCTCAAGTCCTTC and 5′6FAM-GGCTGAGGAAGCTGAGGAG). The other laboratory (Laboratory 2, Fig. 1) utilized a second downstream primer 5′GCGGCTGAGGAAGCTGA.

Four of the laboratories used fluorescently labeled PCR primers, and the PCR products were analyzed using capillary electrophoresis on an automated DNA sequencer. A fifth laboratory also used fluorescently labeled PCR primers, but the PCR products were analyzed using an automated gel based sequencer. CAG repeat length of each product was determined by automated analysis. The remaining five laboratories either labeled PCR products with radioisotopes and separated the products using polyacrylamide gel electrophoresis or generated unlabeled PCR products and later hybridized with a radioisotope labeled probe specific to the CAG repeat region. CAG repeat length was determined by manual comparison to a size standard or digitizing against a standard and extrapolating repeat size.

Each laboratory received an aliquot of DNA from a previously characterized Huntington cell line, CD0002224 with CAG repeat sizes of 18/31 for use as a size control. Laboratories were also permitted to use their own in-house validated controls. None of the laboratories reported making adjustments for abnormal migration of HD amplicons during electrophoresis.

The laboratories reported using different methods to distinguish the band of interest from other “stutter bands” depending on the method of detection. Laboratories using separation by gel electrophoresis based their selection on both the intensity (brightest) and the position of the bands. Often, but not always, the band of interest is the highest band. For laboratories using capillary electrophoresis, the prominent peak that satisfies the threshold was selected.

Protocol

Each of the 10 testing laboratories received 50-μg aliquots of DNA from each of the 14 Huntington cell lines being tested as well as control sample CD00022. The expected CAG repeat size in each of the 14 samples was not revealed to the laboratories. The laboratories measured the CAG repeat lengths in each DNA sample in three separate assays using their in-house methods. These results were sent to the study coordinator (Sue Richards), who examined the data for quality, checked for discrepancies and compiled the data for statistical analysis.

Statistical analysis

Mean repeat values and 95% confidence intervals were calculated using SAS/STAT® PROC GLM. Modes were determined by counting the most common CAG repeat length reported from each allele.

DNA sequence analysis

DNA sequence analysis was performed to determine the CAG repeat length in each of the 14 DNA samples. PCR products were gel-purified using a filter column (QIAquick® gel extraction kit, Qiagen, Valencia, CA). Purified DNA samples were cycle sequenced (unidirectional) using BigDye® Terminator sequencing kit, version 1.1 (Applied Biosystems, Foster City, CA), using the forward primer. Cycle sequencing reactions were performed on a PE Applied Biosystems GeneAmp® PCR System 9700 thermal cycler. DNA sequencing was performed in triplicate using an ABI PRISM® Model 310 Genetic Analyzer with POP-6™ polymer system and 47 cm × 50 μm capillary. Sequencing data were analyzed with Sequencing Analysis Software, version 3.3.

The ABI sequencing data were analyzed by DNASTAR Inc (Madison, WI) Lasergene6.1 SeqManII software to determine the quality scores.25

RESULTS

RM needs for HD genetic testing were identified through discussions with clinical laboratory directors and other experts. Fourteen cell lines were selected for study. These cell lines contained a large range of allele sizes and combinations including normal alleles and alleles at important diagnostic cutoffs, e.g., 35–36 and 39–40 repeats. Additionally, some of the cell lines were included because they contained HD alleles useful for technical reasons, such as homozygous alleles, two alleles that were close to each other in repeat size or alleles with large CAG repeat sizes.

All 10 testing laboratories were able to report data from three independent size determinations for 27 of the 28 Huntington CAG repeat regions measured in this study. The 30 size measurements for each of the 27 alleles were used to determine the mean and mode CAG repeat number. The size of the remaining allele, approximately 100 CAG repeats, proved more problematic to measure. Eight laboratories were able to report CAG repeat length from three independent assays for this allele, one laboratory reported results from only one of three assays and one laboratory was unable to amplify this allele. The available data (25 measurements) were used to determine the mean and mode repeat length of this allele.

Using the composite data from the 10 clinical laboratories, the mean CAG repeat length, 95% confidence intervals, and the modal values for each cell line were determined (Table 1). There was good agreement among laboratories, as indicated by narrow 95% confidence interval ranges. In addition, except for the largest allele, the modal value was the same as the mean. This indicates that the values had a normal distribution and were not skewed. We did not see differences between different detection methods used, suggesting that the methods are comparable.

Table 1 Mean CAG repeat length in DNA from 14 HD cell lines

The CAG repeat length for each DNA sample was also determined by NIST using DNA sequence analysis (Table 1). The repeat lengths determined by DNA sequencing and laboratory analysis were identical for 25 of the 28 alleles. For the remaining three alleles, the value determined by DNA sequence analysis fell within the 95% confidence limits of the mean values determined by the 10 laboratories. Quality scores were generated from the data for each of the alleles sequenced. The scores were in the acceptable range (>40) for all alleles, except for an allele in sample NA20207 with 21 repeats. It is likely that the score for this allele (37.8) was low because of difficulty physically separating the PCR products for the two alleles (19/21) before sequence analysis. It is also possible that there is some minor contamination or mosaicism in sample NA20207, which could account for both the lower modal value and quality score for this allele.

CONCLUSION

This study showed that there were no significant differences in the analytic values obtained for the 14 samples tested using different HD assays or methodologies among the 10 laboratories. There was also very good agreement between the CAG repeat sizes obtained by the laboratories and the NIST DNA sequence analysis. All three of the alleles in which a discrepancy was found between the results of DNA sequencing and clinical assays were larger than 45 repeats, which are more difficult to size or sequence accurately. Because these three alleles were in the fully penetrant allele range, the interpretation of alleles in this size range would be unaffected.

The CAG repeat length of 22 of the 28 HD alleles measured in this study using the laboratory developed PCR methods agreed with the allele size designated by the original submitter of the cell lines to Coriell. However, the CAG repeat size of six alleles (21%) differed. This result suggests the need for users to confirm the mutations or alleles in genomic DNA materials from any source using sequencing or other less equivocal methods, and also underscores the need for characterization studies of potential RMs.

For most genetic tests and mutations, there is no publicly available source of DNA or cell lines that can be used as RMs for QC, PT/EQA, genetic test development/validation or research. In lieu of publicly available materials, clinical laboratories and test developers use residual patient specimens when available, or synthesized oligonucleotides. The lack of available materials also affects the ability of proficiency test providers to produce sufficient and varied challenges and also limits the development and validation of new tests.

There is evidence that the availability of RMs improves the accuracy of HD repeat sizing. Data collected by the National European External Quality Assessment Service indicated a marked decrease in the variance of HD repeat sizing after a 35 repeat HD RM was provided to the participating laboratories.26 Results of recent CAP proficiency surveys for HD testing indicate that laboratories generally are able to accurately measure the CAG repeat length. In each challenge, however, there are a variety of repeat lengths reported for a given allele. At times, the variation is 10–20% or more around the consensus value, suggesting that the availability of HD RMs may help to improve analytical performance.

The genomic DNA materials characterized in this project will be useful for QC, proficiency testing, test development and research, and should help to ensure the accuracy of HD genetic testing. This is especially important for alleles that are close to the diagnostic cutoffs because inaccuracy in sizing by even one repeat could impact the clinical interpretation. In addition, these studies validate our voluntary community-based approach to RM development, and may serve as a model for similar projects in the future.

DNA samples purified from these cell lines, as well as other materials developed by GeT-RM, are publicly available from the Coriell Cell Repositories.27 More information about the program and available QC and RMs can be found on the GeT-RM website.21