Main

Molecular genetic analysis in humans raises complex ethics issues which often impact significantly on the pace of genetic research programs. Research ethics boards that consider genetics proposals typically worry about privacy and discrimination, informed consent, who has the right to speak for the family with regard to participation in genetic research, the sharing or nonsharing of results, and the subsequent use of DNA for other studies. Yet analyses of genetic information are carried out routinely in the clinical diagnostic setting without direct oversight from an ethics board. Our goal is to review scenarios commonly observed in molecular diagnostic and genetic research settings, and to consider the two paradigms from both practical and ethics perspectives.

Research is defined as “a systematic investigation to establish facts, principles or generalizable knowledge.”1 It is universally accepted that research involving human subjects must be regulated and overseen by independent ethics review boards (REBs or IRBs).2 However, information about patients is also routinely gathered by the medical system and is often publicly communicated in the form of case reports that are not typically considered research. Thus, such reporting is generally exempt from previous ethics review.

The ambiguous distinction between research and practice was recognized already in the Belmont Report, the earliest codification in North America of ethical principles and guidelines for research involving human subjects.3 The authors of the report wanted to ensure, in part, that not every departure from standard or accepted practice with an untested or different intervention was ipso facto declared to be research and thus subject to research ethics review. At the same time it was important that major innovations were incorporated into formal research early on to determine their safety and efficacy. However, the authors of the report were thinking primarily of innovative procedures and devices, and did not anticipate fully the advent of the genetics era in which the “patient” can include other family members who may not currently be attending the clinic. At the same time the line between genetic research and practice is often indistinct as genetic research can produce results that are immediately relevant to the clinical management of a particular patient or family. In some cases the clinical management of a particular disease produces new information ascertained through patients who had never officially enrolled in a research study.

Bevan and Hardy4 state that “In general, we do not think of typical case reports as research requiring REB review, as case reports/case series are retrospective descriptions of a course of events that document some new aspect of care, the data are not collected systematically, there is no data analysis or testing of a hypothesis, and the work has already been done.” Shevell5 suggests that “Case reports be conceptualized as a formulized description of a particular individual's history with a disease presentation and progress … to warrant publication, such observations must be in some way novel and serve to advance our understanding of the disease reported.” Close reading of these quotations illustrates that the standard definitions of case reporting are somewhat uncertain regarding the boundary between clinical diagnosis and true research. Indeed it has been noted that the distinction between “research” and “diagnostic” genetics is often ambiguous.6 This may lead to confusion with regard to appropriate practice. Even the obligation to seek informed consent from patients before publishing case reports varies between institutions and medical journals.4

THE PURVIEW OF DIAGNOSTIC GENETICS

Scenarios of current genetic diagnostic testing include (1) screening for specific previously observed mutations in known genes, in new cases ascertained by nonmolecular (i.e., clinical) presentation; (2) screening for new mutations in known genes already proven to be causal for a clinical condition, in newly diagnosed cases; (3) screening for mutations in individuals genealogically related to cases already testing positive for a mutation in a specific gene, where such individuals might themselves be at risk; (4) screening by cytogenetic examination for individuals clinically ascertained with phenotypes matching those of known chromosomal abnormalities, or in some cases novel developmental disorders. Typically such testing is performed within a clinically certified hospital laboratory, or under the supervision of a certified molecular diagnostician. Results are communicated to the patient either by a clinical specialist or by a medical genetics team that may include genetic counseling. Informed consent may be sought from patients before molecular testing, depending on the exact nature of the tests to be performed. REBs play no role in this process.

Despite the tremendous increase in medical genetic knowledge over the past decades, molecular diagnostic laboratories regularly detect genetic variants whose interpretation is at least initially uncertain. These can include rare single nucleotide coding variants in specific genes such as BRCA1,7,8 or novel cytogenetic abnormalities in complex pleiotropic developmental conditions,911 which are sometimes resolved by ascertainment of additional examples.12 These outcomes may be withheld or provided to the patient with appropriate counseling, with the understanding that the test outcomes are, at least for the time being, ambiguous. Diagnostic analysis may also be extended to family members in an attempt to determine whether the identified genetic variant segregates with the disease in additional individuals. In some cases family studies are critical to provide evidence for or against a genetic etiology.13 This defines one important distinction between molecular genetic and other forms of medical diagnosis, in that family studies (of multiple individuals) may be essential for proper diagnosis of individuals. This blurs the boundary between case study and research more particularly for genetic studies than for other types of clinical analysis. Published genetic case reports, with or without molecular results, often report family-based data rather than just those of individual patients (see selected examples1418).

Recognizing this level of complexity, Parker et al.19 have noted that “an unambiguous distinction between clinical practice and research is impossible” for rare genetic disorders. They propose to define research primarily based on outreach beyond a “single family to the solicitation of affected but unrelated individuals.” Although this criterion is nominally precise, scenarios arise in which it is difficult to carry out in practice. In populations known for strong founder effects, formally unrelated individuals are often genealogically closer than they themselves realize. Many examples exist of specific mutations occurring at a much higher frequency in some regions than others through founder sampling effects.2022 It may be difficult to define the degree of relatedness of multiple affected individuals within a population, and hence the concept of “single family” becomes fuzzy. Extension of molecular diagnostics beyond nuclear families of probands may play a vital role in medical genetics practice depending on the local population structure.23

Even in cases of clearly unrelated families, the criterion of “solicitation of affected but unrelated individuals” seems excessively strict. Imagine two diagnostic laboratories working with families from very different populations, and independently identifying the same novel mutation in a gene under diagnostic analysis. If either laboratory alone is performing a clinical diagnosis, and if the laboratories communicate results to each other in an anonymized format, are they suddenly performing research? Such situations do arise in the research literature,2426 and it is not clear that there is universal agreement on the distinction between genetic case reporting and genetic research in them. As with other case studies, clinical samples for genetic analysis have been collected and tested through a diagnostic laboratory, and results have been reported to the patient as part of the medical diagnosis. By extension, results may also have been reported to family members (defined in some fashion which may depend on local population structure). If patients voluntarily consent to allow publication of these results in medical research journals in the form of case reports, this would seem to obviate the need for REB review. It is difficult to ascertain from the wide literature on human genetics to what extent prior or post hoc REB approval is sought for such genetic case reporting, but it is certainly far from a universal practice.

NOVEL GENETIC ANALYSIS

Patients lacking any molecular evidence of mutation of a known gene have reached the end of the current clinical diagnostic process. Further molecular analysis has historically fallen squarely under the definition of research. In attempting to identify a novel causal genetic variant, the next step is typically a complete genome scan of patients plus associated family members, possibly with inclusion of multiple families exhibiting a similar clinical phenotype. For the past decade such genome scans have involved panels of polymorphic microsatellite repeat markers, commercially available and distributed across the entire genome at defined intervals. Recently, dense biallelic single nucleotide polymorphism (SNP) panels encompassing hundreds of thousands of individual markers have become commercially available and are beginning to be used for family-based linkage analysis.2730 Following statistical linkage analysis, if a small chromosomal region is identified which shows evidence that it may encompass a causal variant, positional cloning ensues. In the past this has involved physical mapping of the relevant chromosomal region, determination of the protein-coding gene content in the region, and eventually mutation detection. With the advent of the Human Genome Project and steadily improving genomic annotations, this process has simplified significantly. Statistical analysis of mapping data are now usually followed directly by mutation detection. A number of methodologies exist for identifying the presence of variations in patient DNA samples indirectly, but the gold standard ultimately is always DNA sequencing for identification of single nucleotide or small insertion/deletion variants, or other molecular techniques for detection of larger rearrangements.

Novel gene discovery using current positional cloning methodology may cost on the order of $25–50,000 per gene, depending on stochastic factors such as size of the family, nature of disease transmission (dominant versus recessive), occurrence of individual recombination events in families defining the size of a linked chromosomal interval containing the causal gene, reliability of phenotypic diagnosis (i.e., penetrance), gene content of linked chromosomal intervals, and timing of detection of actual causal variants (i.e., first gene sequenced versus twentieth gene sequenced). Such a level of expense and uncertainty is effectively intolerable to purely clinical budgets. Realistically, positionally cloned genes (approximately 2000 molecularly characterized monogenic disorders are currently curated in the Online Mendelian Inheritance in Man database http://www.ncbi.nlm.nih.gov/sites/entrez?db=OMIM) have been predominantly supported through public/government or private research funding agencies at the local, national or international level. Senior scientists managing these discovery projects may be MDs, PhDs or MD/PhDs, generally in academic or corporate research institutions with molecular biology research laboratories and formalized publication milestones. In this context, taking account of the mandated goals of the individuals overseeing the work, the sources of funding, and the institutional homes, it seems clear that such activities as now performed represent research, not clinical diagnosis, and must be overseen by institutional REBs.

One area where there is already clear overlap between clinical diagnosis and research is in molecular cytogenetics. As noted above, cytogenetic examination of stained chromosomes by light microscopy represents a form of low resolution whole genome analysis already routinely practiced for clinical diagnosis, particularly in cases including mental retardation, developmental delay or syndromic developmental defects.13 It also presents an historical progression of steadily increasing resolution, using techniques with greater and greater sophistication. Beginning with whole chromosome aneuploidy (such as trisomy-21), karyotyping by chromosomal banding has identified numerous microscopically detectable chromosomal rearrangements (>3 million base pairs) in the appropriately ascertained clinical patients. Karyotyping with resolution at the level of individual genes can be obtained using fluorescence in situ hybridization, however this method requires postulating a specific candidate gene or small chromosomal regions (as in Cri-du-chat, Williams, Prader-Willi, and Angelmann Syndromes and subtelomeric rearrangements, among others). Fluorescence in situ hybridization and lower resolution chromosomal banding fail to detect smaller chromosomal variations or variations outside defined candidate loci. Recently microarray technology has been brought to bear on this problem, especially with regard to changes in gene dosage or copy number (i.e., copy number variants [CNVs]). The resolution of detection depends on technical details of the arrays used, but ranges between 1000 and 3 million base pairs.3133 Some efforts have been made to develop explicit guidelines for clinical uptake of array-based molecular cytogenetics.34,35 They currently suggest that whole genome analysis not be employed as a first-pass technology in cytogenetic analysis. However, they also note that the main issue is interpretation of findings, given that array methods routinely detect multiple CNVs even in patients without a clear medical ascertainment. To the extent that rearrangements and CNVs can be detected in individual patients with high explanatory value, the future will likely include whole genome ultra-high resolution karyotypic analysis as a diagnostic procedure in clinical genetics.

NOVEL GENETIC ANALYSIS AS DIAGNOSIS

The case may nevertheless be made that novel discovery work is intellectually more closely modeled as diagnosis rather than research, particularly where explicit intent exists to provide information to patients and families. For example, whole genome SNP mapping data, obtained while studying copy number variation, can also be analyzed statistically for linkage or homozygosity in related individuals or for linkage disequilibrium in founder populations. It is a straightforward extension that family based genome scans, and follow-up DNA sequencing, could potentially be performed completely in a diagnostic context, especially if laboratory costs continue to fall. The sharing of molecular data from multiple families, obtained in the same or different diagnostic laboratories, might in principle be no different than sharing of other diagnostic information. However, inasmuch as genetic information is often “familial” rather than “individual,” and insofar as clinical genetic practice does not generally require prior REB approval, the standard of appropriate ethics oversight for clinical genetics activities requires ongoing consideration.

MONOGENIC VERSUS “COMPLEX” GENETICS

Distinctions must be made among different types of genetic analysis. High penetrance monogenic disorders whose causal genes are already known are routinely diagnosed (with or without molecular analysis) and reported to individual patients and their family members in a clinical context. The responsible clinician and the diagnostic laboratory anticipate from the beginning that molecular results will usually be interpretable, and are likely to be reported to the patient who presumably is known to desire such results (or would otherwise not have agreed to testing). Extension of such analyses to novel monogenic phenotypes seems straightforward. It is worth noting that 90% of the estimated 22,000 protein-coding genes in the human genome36 remain genetically uncharacterized, thus a large body of discovery work remains to be performed even for monogenic disorders.37 Nevertheless, emphasis in genetic research has shifted toward the study of complex, multifactorial diseases with oligogenic or polygenic inheritance and strong environmental modifiers.38 In studying the genetic component of such disorders, including for example obesity, diabetes, asthma, migraine, multiple sclerosis, depression, parkinsonism, etc., it is often envisioned that results will remain highly statistical in nature, and not routinely interpretable in terms of the genotypes of individuals. Although in theory participants could be provided with results in the form of probabilistic increases (or decreases) in risk for each illness, the medical value of this seems to be questionable for now.39 There is no clear path from the research results to a clinical molecular diagnosis in individual patients, as cautioned in a recent discussion of ethics of CNV studies in autism.40 For the moment at least, complex genetic analysis remains clearly in the realm of research (notwithstanding direct public access to commercial genotyping services providing dubious interpretations), although this too is likely to evolve over the coming decades as our understanding of the functional consequences of subtle low-penetrance genetic variants improves.41 Even for well-understood monogenic disorders such as phenylketonuria or cystic fibrosis, considerable variability in penetrance and phenotypic severity are documented.4244 Despite the difficulties posed for diagnosis and clinical management by such observations, this has not led to questioning the value of clinical genetic testing for these diseases. In fact, there have been pioneering efforts toward population-based screening for large numbers of genetic metabolic disorders,45 many or all of which presumably have some degree of diagnostic complexity at the level of the individual patient. Such efforts have even been accompanied by cost-benefit analyses, pointing in favor of pre-emptive molecular diagnosis rather than symptom-based ascertainment for both metabolic and cancer predisposition diseases.4649

WHOLE GENOME RESEQUENCING

A full discussion of whole genome resequencing of individual patients is beyond our current scope. It is highly significant though that one such study has already been published.50 This study—clearly identified as research and not case reporting—nonetheless reports sequence information for specific genes with known monogenic phenotypes (for example Huntington Disease, for which the patient is reported as normal). SNPs with previously reported statistical associations to traits such as risk-taking or cardiovascular disease susceptibility are also reported. Resequencing of a second full human genome has likewise just been reported,51 as has resequencing a flow-sorted purified X-chromosome.52 For the moment, the costs of whole genome resequencing remain excessive even for genomics research programs. However, costs have dropped precipitously over the past year and several companies are committed to bringing such costs within the range of even modest laboratories, and hence potentially within the range of clinical units.53,54 Implementation of such technologies is certain to impact medical diagnostics in the near future.55

PATIENT RIGHTS

As elements of genetic analysis mature from research toward diagnosis, a crucial question is that of guaranteeing patient rights. Who is the guarantor in scenarios outside the domain of research ethics board oversight? Genetics does not present insoluble problems in this area. Ethical behavior in medicine generally is assumed to be both possible and administrable, and is expected to be the standard of practice in all medical institutions, including those engaged in both clinical medicine and research. Obtaining biological materials for genetic analysis, use of those samples for defined clinical tests (that may ultimately include routine whole genome sequence analysis), reporting of results to patients and family, and public communication of results as case reports in legitimate medical journals with appropriate confidentiality, are already realized in current molecular diagnostic clinical practice. Extensions to novel gene discovery, at least for high penetrance monogenic disorders, seem incremental rather than substantive. McGuire et al.56 provide an interesting and cutting edge discussion of these issues with regard to genome sequencing in a predominantly research context.

The requirements for clinical use of patient materials differ from those of research use. This is equally a concern under current standards. The lack of consensus among research ethics boards for many issues in genetics57,58 is to some extent contrasted by more normalized guidelines in the molecular diagnostic setting. In the latter setting, quality control, sample tracking and data confidentiality are formally assessed and validated independently. The challenge is to provide appropriate ethics oversight of such clinical activities without succumbing to the bureaucratic burden of ethics review all too common in the research context.

In our experience there do not seem to be significant differences between the United States and Canada regarding the implementation and administration of clinical molecular diagnostics. However, we do note a potential difference in the context of genetics research, in that Canadian researchers seem to have greater freedom to transmit research results to patients, given that research results in general are usually not obtained in a Clinical Laboratory Improvement Amendment (CLIA)-approved laboratory. The unavoidable limitations to quality control in research laboratories are (or should be) made clear to research participants in the process of obtaining informed consent.

Indirectly related to the issue of patient rights is the question of reimbursement. Research costs are typically absorbed by the grants administered by research investigators, whereas clinical costs are managed through the health care system(s). Acknowledging that this is a significant area of concern, especially as expensive technologies such as whole genome arrays and even sequencing are brought into the clinical realm, it is beyond the scope of the current discussion to delve into the economics of molecular diagnostics in greater detail. The issue is similar to that for new medical devices and drugs, as discussed by Garber.59

Our personal experience is that patients with inherited disorders are profoundly impacted by the outcome of a definitive molecular diagnosis, generally positively. There is a wide range of patient interest and acceptance of the potential for molecular diagnosis.60 However, we believe that for high penetrance genetic disorders patients generally prefer to know such results, rather than to remain in a limbo of uncertain diagnosis. Patients may express relief simply if they perceive interest by the medical community in even establishing their diagnosis, particularly in the case of pleiotropic syndromes where even specialists without genetic training may find diagnosis difficult. The implementation of molecular analysis in a diagnostic setting has the potential to expedite such outcomes significantly, and an informal network of clinical molecular testing laboratories has been established covering many, though far from all, molecularly characterized monogenic disorders.61 The emotional and health care burdens to patients who lack definitive diagnosis while experimental protocols pass through funding searches and then extensive research ethics board review should be considered as major factors weighing in favor of extending the definition of diagnostics in molecular genetic analysis.

CONCLUSIONS

The problem we have attempted to address is whether the distinction between genetics research and genetics clinical diagnosis is philosophical and fixed, or operational and fluid. We believe the latter description is more accurate. We hope to encourage the health care and research systems to be more consistent and less severe with genetic research protocols than they often are, as they may not fully recognize the fundamental similarities between such research protocols and clinical tools already in routine use. Moreover, genetic research in some instances leads with high probability to a potential clinical molecular diagnosis. Although diagnosis is not itself the stated aim of research, in many genetics study paradigms the two are inseparable. In some cases the research may only have been possible with prior understanding that the patients and families would explicitly benefit from sharing of research findings, either via the research laboratory itself or through an associated clinical diagnostic unit. In these cases the incompletely defined route from research findings to the clinic is in fact a drawback, which might be best solved if the primary findings themselves could be made in a fully clinical context. We believe that some genetic discovery work, especially for disorders or syndromes most consistent with single gene etiology, might better serve the needs of patients if brought into the established realm of diagnostics. At a minimum, many aspects of molecular genetic work that are currently considered “research” clearly involve only modest and logical extensions of protocols which are currently considered “diagnostic.” Some amount of intrinsic uncertainty in molecular diagnosis is already accepted, and accounted for in dealing with patients and their families within the context of trained genetic counseling units. Moreover the tremendous variability in responses of research ethics boards to genetics research proposals does not accurately reflect the enormous experience and consensus the clinical genetics community has accumulated in dealing with the outcomes of molecular genetic analysis in human patients.

All genetics attempts to relate phenotype to genotype, and the relationship between the research and the issue of clinical care rests in part upon the severity of the disease, the possibility of amelioration through treatment or screening, the penetrance of the gene and the recurrence risk to family members. Our key point is that the boundary between genetics research and clinical diagnosis is not only fundamentally blurry, but also moving. Despite this, in an attempt to place some clarity on the issue at least in the current context, we define a continuum of scenarios as follows:

  1. 1

    Gene finding that is definitely related to clinical care includes diseases with high recurrence risk, high penetrance, significant morbidity and mortality and amelioration.

  2. 2

    Gene finding that is possibly research/possibly clinical care:

    1. a

      The determination of genetic variants that may alter (for better or worse) a disease of the type outlined above in which a major gene has already been found.

    2. b

      Genetic studies that deal with diseases with high morbidity and mortality, but for which the penetrance may be unknown and where the underlying genetic inheritance may be unknown, particularly those conditions with the possibility of amelioration.

    3. c

      Outcomes of whole genome resequencing protocols depending on the exact findings and the strength of phenotype/genotype correlation for a clinical diagnosis linked to a particular gene.

  3. 3

    Gene finding that is definitely research:

    1. a

      Most whole genome association and CNV studies.

    2. b

      Anonymous population studies.

    3. c

      The search for genes that are quirky but have no known significant clinical ramifications, e.g., male pattern baldness.

  4. 4

    For monogenic conditions, or diseases with significant burden and a genetic component, the ideal structure within which genetic research studies should occur is perhaps that of a clinical genetic register system.62