MutaDATABASE: a centralized and standardized DNA variation database

Bale, Sherri; Devisscher, Martijn; Criekinge, Wim Van; Rehm, Heidi L; Decouttere, Frederik; Nussbaum, Robert; Dunnen, Johan T Den; Willems, Patrick

doi:10.1038/nbt.1772

Correspondence
Published: 07 February 2011

MutaDATABASE: a centralized and standardized DNA variation database

Sherri Bale¹,
Martijn Devisscher²,
Wim Van Criekinge³,
Heidi L Rehm⁴,
Frederik Decouttere²,
Robert Nussbaum⁵,
Johan T Den Dunnen⁶ &
…
Patrick Willems⁷

Nature Biotechnology volume 29, pages 117–118 (2011)Cite this article

2002 Accesses
27 Citations
23 Altmetric
Metrics details

Subjects

You have full access to this article via your institution.

Download PDF

To the Editor:

The field of molecular diagnostics for genetic disease is entering a new era of whole-genome sequencing. The unprecedented speed and scale with which human sequence variants are being identified is presenting the diagnostics community with practical logistical problems in retrieving, collecting, sharing and depositing clinical and molecular information on genetic disease. In response to this problem, a large consortium of diagnostic testing laboratories in Europe, the United States, Australia and Asia have recently joined efforts to create a central database that provides a repository of DNA variations and allows open access to the whole community. Here, we outline the main features of this universal database, referred to as MutaDATABASE (Fig. 1; http://www.MutaDATABASE.org).

**Figure 1: Organization and structure of MutaDATABASE.**

There are several reasons why the finding of a pathogenic DNA variation is important for an individual affected with a genetic disease and for his or her family. A molecular diagnosis may offer confirmation of a clinically or biochemically suspected disease, carrier detection, disease risk assessment, prenatal diagnosis, management guidance and/or long-term prognosis and may also facilitate genetic counseling of family members.

Of crucial importance is obtaining certainty that the DNA variation identified in an affected individual is indeed a pathogenic variant responsible for the person's symptoms, and not simply one of the multitude of benign variations with limited or no clinical significance. Even before the advent of whole-genome sequencing, the clinical significance of many DNA variations was unclear, leaving both affected patients and their treating physicians in the dark.

We need a central DNA variation database as a repository that allows open access to the whole community.

One reason why this has been the case is that many DNA variations are not made public through reporting in scientific journals or submission into public databases. Even when variants are published, there is usually only limited accompanying clinical information to allow a molecular correlation. Relatively few novel variations are currently being reported in the literature, as it is difficult to get variation reports accepted in journals. Furthermore, only a minority of variations get entered into public databases because such data entry is tedious and laborious with the current platforms; submission is not demanded by journals, granting organizations or accreditation bodies (such as Clinical Laboratory Improvement Amendments (CLIA) in the United States or the International Organization for Standardization (ISO) in Europe and Australia); some testing labs may not see the added value of doing so; and a few may consider the data on variants they are paid to find as intellectual property belonging to them and therefore decline to share the information.

The problem is far greater when it comes to submission of the clinical information necessary to make genotype-phenotype correlations. First, few diagnostic labs even receive detailed clinical information upon submission of a test sample from the physician or referral laboratory. Second, it is not commonplace for physicians to access DNA variation databases and deposit information. There are also serious issues concerning standardization of clinical information, as well as privacy concerns with depositing an individual's clinical information in a public database, even when de-identified.

Thus, even if DNA variations are entered into a database, detailed clinical information on the individual showing a given variation is usually lacking. Consequently, only a small fraction of DNA variations is currently listed in any publicly accessible database, and even for those, correlation with clinical data is very often unavailable or inadequate. Furthermore, many DNA variation classifications in the current literature are incorrect as a result of the lack of standards and nomenclature for classifying variants, the limited data sets available and the statistically nonsignificant associations that have been made to date. These incorrect classifications have the potential to cause harm to affected individuals, if they have not already done so. Last but by no means least, although many DNA variation databases exist, no freely available common platform is available wherein all disease genes are represented.

This problem is expected to increase significantly in the upcoming years, for two main reasons. First, the amount of DNA variation identified will grow exponentially in the era of whole-exome or whole-genome sequencing, which is around the corner. Currently, one or a few candidate disease genes are analyzed in each individual, revealing, in the best case, a few DNA variations. However, in upcoming years many affected patients will have their complete exome or genome analyzed through massive parallel sequencing on the currently available 'next-generation' and upcoming 'third-generation' sequencing platforms. This will reveal thousands of variants in each individual tested, and the clinical significance of each of these variants will have to be determined. Second, current molecular testing is limited to a well-selected group of patients usually with a suspected monogenic disorder. As monogenic disease is rare (with an incidence of about 1–3 per 300 births), only a limited number of individuals have been tested in the past decade. But molecular testing is now shifting to the evaluation of risk factors for common multifactorial diseases that affect the majority of the population. It is therefore anticipated that in the near future the whole genome (or exome) of many more people will be sequenced.

It is therefore clear that we need a central DNA variation database as a repository that allows open access to the whole community. Recently, several diagnostic testing laboratories from around the world have joined efforts to create the MutaDATABASE. This is a new, freely available, online database developed to contain standardized information on each human disease gene that not only will list all DNA variations identified in that gene but also aims to combine that data with clinical information for the individuals carrying the DNA variation. As shown in Figure 1, each single gene in the database will be curated by experts in the field ('MutaCURATORS') who collect and review all information from the literature and other gene databases as well as information entered by labs and clinicians. Molecular and clinical information can be submitted into MutaDATABASE directly or through specifically designed software ('MutaREPORTER'). The software additionally allows its users to circulate all information and questions in community groups centering around a particular disease gene ('MutaCIRCLES'), while keeping track both of the initial submitter of a novel variant and of every observer. The whole system functions as a closed-loop, fully automated information system whereby molecular and clinical info can be both extracted and submitted, with gene-specific curators as gatekeepers reviewing all information. In addition, gene-specific reviews will be published online and become freely available. These 'MutaREVIEWS' will contain detailed information on each gene, DNA variations in that gene and diseases caused by those DNA variations.

Many obstacles remain before we attain the goal of a freely accessible database of variants with the clinical correlation necessary to assess their pathogenicity. The effort to create the MutaDATABASE is a first step toward resolving some of the logistical issues and is therefore an important and necessary step towards fulfilling the promise of gene testing in improving the health and counseling of our patients and their families.

Author information

Authors and Affiliations

GeneDx, Gaithersburg, Maryland, USA
Sherri Bale
GENOHM, Ghent, Belgium
Martijn Devisscher & Frederik Decouttere
BIOBIX, University of Ghent, Ghent, Belgium
Wim Van Criekinge
Harvard Medical School, Cambridge, Massachusetts, USA
Heidi L Rehm
Institute of Human Genetics, University of California, San Francisco, San Francisco, California, USA
Robert Nussbaum
Leiden University Medical Center, Leiden, The Netherlands
Johan T Den Dunnen
GENDIA, Antwerp, Belgium
Patrick Willems

Authors

Sherri Bale
View author publications
You can also search for this author in PubMed Google Scholar
Martijn Devisscher
View author publications
You can also search for this author in PubMed Google Scholar
Wim Van Criekinge
View author publications
You can also search for this author in PubMed Google Scholar
Heidi L Rehm
View author publications
You can also search for this author in PubMed Google Scholar
Frederik Decouttere
View author publications
You can also search for this author in PubMed Google Scholar
Robert Nussbaum
View author publications
You can also search for this author in PubMed Google Scholar
Johan T Den Dunnen
View author publications
You can also search for this author in PubMed Google Scholar
Patrick Willems
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Patrick Willems.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Bale, S., Devisscher, M., Criekinge, W. et al. MutaDATABASE: a centralized and standardized DNA variation database. Nat Biotechnol 29, 117–118 (2011). https://doi.org/10.1038/nbt.1772

Download citation

Published: 07 February 2011
Issue Date: February 2011
DOI: https://doi.org/10.1038/nbt.1772

This article is cited by

DNA variant databases improve test accuracy and phenotype prediction in Alport syndrome
- Judy Savige
- Elisabet Ars
- Helen Storey
Pediatric Nephrology (2014)
Molecular genetic testing and the future of clinical genomics
- Sara Huston Katsanis
- Nicholas Katsanis
Nature Reviews Genetics (2013)
The Laboratory‐Clinician Team: A Professional Call to Action to Improve Communication and Collaboration for Optimal Patient Care in Chromosomal Microarray Testing
- Karen E. Wain
- Erin Riggs
- W. Andrew Faucett
Journal of Genetic Counseling (2012)
The case for locus-specific databases
- Mark E. Samuels
- Guy A. Rouleau
Nature Reviews Genetics (2011)
Clarity and claims in variation/mutation databasing
- Raymond Dalgleish
- William S Oetting
- Anthony J Brookes
Nature Biotechnology (2011)

MutaDATABASE: a centralized and standardized DNA variation database

Subjects

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Competing interests

Rights and permissions

About this article

Cite this article

This article is cited by

DNA variant databases improve test accuracy and phenotype prediction in Alport syndrome

Molecular genetic testing and the future of clinical genomics

The Laboratory‐Clinician Team: A Professional Call to Action to Improve Communication and Collaboration for Optimal Patient Care in Chromosomal Microarray Testing

The case for locus-specific databases

Clarity and claims in variation/mutation databasing

Search

Quick links

Subjects

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Competing interests

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

DNA variant databases improve test accuracy and phenotype prediction in Alport syndrome

Molecular genetic testing and the future of clinical genomics

The Laboratory‐Clinician Team: A Professional Call to Action to Improve Communication and Collaboration for Optimal Patient Care in Chromosomal Microarray Testing

The case for locus-specific databases

Clarity and claims in variation/mutation databasing

Search

Quick links