DoCM: a database of curated mutations in cancer

Ainscough, Benjamin J; Griffith, Malachi; Coffman, Adam C; Wagner, Alex H; Kunisaki, Jason; Choudhary, Mayank NK; McMichael, Joshua F; Fulton, Robert S; Wilson, Richard K; Griffith, Obi L; Mardis, Elaine R

doi:10.1038/nmeth.4000

Correspondence
Published: 29 September 2016


Benjamin J Ainscough^1,2,
Malachi Griffith^1,2,3 (ORCID: orcid.org/0000-0002-6388-446X),
Adam C Coffman^1,
Alex H Wagner^1,2 (ORCID: orcid.org/0000-0002-2502-8961),
Jason Kunisaki^1,
Mayank NK Choudhary^3 (ORCID: orcid.org/0000-0002-9824-7217),
Joshua F McMichael^1,
Robert S Fulton^1,2,3,
Richard K Wilson^1,2,3,4,
Obi L Griffith^1,2,3,4 &
Elaine R Mardis^1,2,3,4

Nature Methods volume 13, pages 806–807 (2016)


To the Editor:

Large-scale cancer genomics discovery projects such as The Cancer Genome Atlas (TCGA) and the International Cancer Genome Consortium (ICGC) have systematically characterized the molecular lesions in human cancer genomes, thereby laying the foundation for precision cancer medicine. However, a curated set of somatic variants with established relevance to cancer biology is essential for clinical annotation and for use in computational data analysis. We have created a database of curated mutations in cancer (DoCM, http://docm.info), an open-source, openly licensed resource to enable the cancer research community to aggregate, store, and track biologically important cancer variants with provenance supported by the literature.



**Figure 1: Putting DoCM in the context of other resources.**


Acknowledgements

The authors gratefully acknowledge L. Trani, J. Hodges, and A. Wollam for efforts in manual review. T. Ley, R. Bose, R. Govindan, and S. Devarakonda provided valuable curation input. D. Larson provided valuable analysis input. M.G. was supported by the National Human Genome Research Institute (NIH NHGRI K99HG007940). O.L.G. was supported by the National Cancer Institute (NIH NCI K22CA188163). This work was supported by a grant to R.K.W. from the National Human Genome Research Institute (NIH NHGRI U54HG003079).

Author information

Authors and Affiliations

McDonnell Genome Institute, Washington University School of Medicine, St. Louis, Missouri, USA
Benjamin J Ainscough, Malachi Griffith, Adam C Coffman, Alex H Wagner, Jason Kunisaki, Joshua F McMichael, Robert S Fulton, Richard K Wilson, Obi L Griffith & Elaine R Mardis
Siteman Cancer Center, Washington University School of Medicine, St. Louis, Missouri, USA
Benjamin J Ainscough, Malachi Griffith, Alex H Wagner, Robert S Fulton, Richard K Wilson, Obi L Griffith & Elaine R Mardis
Department of Genetics, Washington University School of Medicine, St. Louis, Missouri, USA
Malachi Griffith, Mayank NK Choudhary, Robert S Fulton, Richard K Wilson, Obi L Griffith & Elaine R Mardis
Department of Medicine, Washington University School of Medicine, St. Louis, Missouri, USA
Richard K Wilson, Obi L Griffith & Elaine R Mardis


Corresponding authors

Correspondence to Malachi Griffith or Obi L Griffith.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Integrated supplementary information

Supplementary Figure 1 Overview of DoCM resource

(a) Outline of criteria to curate a variant. Variants are evaluated for inclusion and then curated elements are identified. (b) Summary of current DoCM contents. DoCM contains SNSs and indels across many cancer subtypes, with easy identification of the journal article that outlines each variant's relevance. (c) Screenshot of the DoCM web application available at http://docm.info. (d) Illustration of the API. An HTTP GET request accepts a variety of parameters, including gene, chromosome, and position, and returns a JSON response with the PubMed IDs, diseases, and other useful information. The API is thoroughly documented at http://docm.genome.wustl.edu/api.
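As a rough illustration of the request and response pattern described in panel (d), the following Python sketch builds a GET URL from filter parameters and extracts PubMed IDs from a JSON reply. The endpoint path, the parameter names, and the JSON field names (`hgvs`, `pubmed_ids`) are assumptions made for this example and are not taken from the API documentation; the sketch performs no network access.

```python
import json
from typing import Dict, List
from urllib.parse import urlencode

# Assumption: this path is illustrative only. Consult the API documentation
# for the real routes and parameter names.
HYPOTHETICAL_ENDPOINT = "http://docm.info/api/v1/variants.json"


def build_query_url(endpoint: str, **filters: str) -> str:
    """Return a GET URL with the given filters encoded as a query string."""
    # Sort the filters so the same inputs always produce the same URL.
    query = urlencode(sorted(filters.items()))
    return f"{endpoint}?{query}" if query else endpoint


def pubmed_ids_by_variant(response_text: str) -> Dict[str, List[str]]:
    """Map each variant label to its PubMed IDs in a JSON list response."""
    records = json.loads(response_text)
    # Hypothetical field names: "hgvs" and "pubmed_ids".
    return {rec["hgvs"]: list(rec.get("pubmed_ids", [])) for rec in records}


if __name__ == "__main__":
    print(build_query_url(HYPOTHETICAL_ENDPOINT, gene="BRAF", chromosome="7"))
```

Keeping URL construction separate from response parsing lets each half be checked without contacting the server.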

Supplementary Figure 2 Screenshot of DoCM batch submission form.

In the batch submission form, users can enter all the parameters necessary for inclusion into DoCM: the name of the batch, the rationale statement outlining the reason for including the variants and curation details, any relevant URLs, tags to be applied to the whole batch, the TSV file with variants, and submitter information. Following submission, the user will be given a link to review the batch and any messages from moderators.

Supplementary Figure 3 Screenshot of moderators view of the submitted batches queue.

Once a batch has been submitted, it can be reviewed in the password-protected moderator queue. A listing of current DoCM moderators can be viewed at http://docm.genome.wustl.edu/about. Moderators can select a batch, such as the Drug Gene Knowledge Database highlighted in purple above, to review it. Once multiple batches have been accepted, a moderator can create a new DoCM version using the blue button at the bottom-right of the screen. The “Drug Gene Knowledge Database” link is highlighted in purple because it is the subject of Supplementary Figure 4.

Supplementary Figure 4 Screenshot of moderator review page.

A moderator can review all information submitted with a batch and evaluate whether it fits the scope and quality requirements of DoCM. Individual variants can be accepted or rejected and the moderator can leave a message to the submitter.

Supplementary Figure 5 Number of papers in PubMed indexed by “Cancer” per year.

Searching PubMed with the search term “Cancer” yields the number of papers relating to cancer per year. This serves as an upper bound on the number of papers that need to be curated to accurately summarize important cancer variants. There is a need for public resources that reduce the duplication of curation effort.

Supplementary Figure 6 Overview of variant curation for entry into DoCM

An anecdotal example of the curation involved for the variant BRAF V600E is shown. Typically the literature only lists the gene and amino acid change (purple in the figure), requiring extensive curation to uniquely identify the variant. Correct genomic coordinates on a consistent genome build need to be identified, with accompanying nucleotide and strand information. Occasionally there are multiple nucleotide changes that are synonymous with a particular amino acid change. A representative transcript that correctly models the variant described in the literature also needs to be specified. Cancer subtypes are specified using the disease ontology nomenclature. Green boxes note the class of information that needs to be captured in DoCM, black boxes show the subtype of each class, and white boxes denote the value.
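The classes of information listed above can be pictured as a single record per variant. The Python sketch below is one possible layout, not DoCM's actual schema: the class name, field names, and validation rules are hypothetical. The BRAF V600E values (GRCh37 coordinates on the forward genomic strand, an Ensembl transcript, a Disease Ontology identifier) are given for illustration and should be verified against an authoritative source; the PubMed ID is a placeholder.

```python
from dataclasses import dataclass, replace
from typing import List, Tuple


@dataclass(frozen=True)
class CuratedVariant:
    """One uniquely identified variant; all field names are hypothetical."""
    gene: str
    amino_acid_change: str
    genome_build: str
    chromosome: str
    start: int
    stop: int
    reference: str          # reference allele on the forward genomic strand
    variant: str            # variant allele on the forward genomic strand
    strand: str             # strand of the gene: "+" or "-"
    transcript: str
    disease_ontology_id: str
    pubmed_ids: Tuple[str, ...]

    def genomic_key(self) -> Tuple[str, str, int, int, str, str]:
        """Fields that identify the nucleotide change unambiguously."""
        return (self.genome_build, self.chromosome, self.start,
                self.stop, self.reference, self.variant)

    def problems(self) -> List[str]:
        """Return human-readable reasons the record looks incomplete."""
        found = []
        if self.start > self.stop:
            found.append("start is after stop")
        if self.strand not in ("+", "-"):
            found.append("strand must be '+' or '-'")
        for allele in (self.reference, self.variant):
            if not allele or set(allele) - set("ACGT-"):
                found.append(f"unexpected allele {allele!r}")
        if self.reference == self.variant:
            found.append("reference and variant alleles are identical")
        if not self.disease_ontology_id.startswith("DOID:"):
            found.append("disease is not a Disease Ontology identifier")
        if not self.pubmed_ids:
            found.append("no supporting publication")
        return found


# Illustrative values only; the PubMed ID is a placeholder, not a citation.
BRAF_V600E = CuratedVariant(
    gene="BRAF", amino_acid_change="V600E", genome_build="GRCh37",
    chromosome="7", start=140453136, stop=140453136,
    reference="A", variant="T", strand="-",
    transcript="ENST00000288602", disease_ontology_id="DOID:1909",
    pubmed_ids=("PMID_PLACEHOLDER",),
)

# A dinucleotide change that yields the same amino acid change.
BRAF_V600E_DINUCLEOTIDE = replace(
    BRAF_V600E, start=140453135, reference="CA", variant="TT")
```

The two records share the label "BRAF V600E" but have different genomic keys, which is why the gene and amino acid change alone cannot serve as a unique identifier.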

Supplementary Figure 7 Overview of analysis and validation sequencing of four TCGA projects

(a) Outline of the manual review strategy. DoCM sites with two or more reads of support are evaluated for obvious errors. (b) Summary of the variants that passed manual review and were not identified in the original TCGA analyses. (c) Summary of the variants that were validated in the 93 validation samples. (d) Comparison of DoCM-MSRV to ClinSek and the Bayesian classifier.
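The first step of the strategy in panel (a), selecting DoCM sites with two or more reads of support, can be sketched as a simple filter. The record layout (`SiteCount`) and the function name below are hypothetical; only the threshold of two reads comes from the caption.

```python
from typing import Iterable, List, NamedTuple


class SiteCount(NamedTuple):
    """Variant-supporting read count at one site in one sample (hypothetical layout)."""
    sample: str
    site: str
    variant_reads: int


def sites_for_manual_review(counts: Iterable[SiteCount],
                            min_reads: int = 2) -> List[SiteCount]:
    """Keep sites with at least `min_reads` variant-supporting reads."""
    return [c for c in counts if c.variant_reads >= min_reads]
```

Sites that pass this filter would still need the manual evaluation for obvious errors that the caption describes; the filter only narrows the candidate list.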

Supplementary Figure 8 Coverage of the custom capture validation sequencing

Heatmap illustrating the coverage obtained at all target sites in validation sequencing. Bar graphs on the x and y-axes illustrate the mean coverage at each case/position.

Supplementary Figure 9 Overview of validation sequencing results.

Variant allele fraction (VAF) plot illustrating the types of variants identified through manual review that validated. Variants called in the original TCGA study are highlighted in blue and those missed are in green. Note that TCGA was unable to call variants below ∼10% VAF, while the MSRV approach was able to recover many such variants. Density plots on the x- and y-axes show the distribution of tumor VAF and coverage depth, respectively, for validated variants.
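For readers unfamiliar with the quantity plotted here, the variant allele fraction is the number of reads supporting the variant divided by the total number of reads covering the site. The minimal Python sketch below computes it and flags values under the approximate 10% level mentioned above; the function names and the exact cutoff of 0.10 are assumptions for illustration.

```python
def variant_allele_fraction(variant_reads: int, total_reads: int) -> float:
    """Fraction of reads at a site that support the variant allele."""
    if total_reads <= 0:
        raise ValueError("total_reads must be positive")
    if not 0 <= variant_reads <= total_reads:
        raise ValueError("variant_reads must lie between 0 and total_reads")
    return variant_reads / total_reads


def is_low_vaf(variant_reads: int, total_reads: int,
               threshold: float = 0.10) -> bool:
    """True when the VAF falls below the (assumed) 10% cutoff."""
    return variant_allele_fraction(variant_reads, total_reads) < threshold
```

For example, 5 supporting reads out of 100 give a VAF of 0.05, which falls in the range the caption describes as missed by the original calls.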


About this article

Cite this article

Ainscough, B., Griffith, M., Coffman, A. et al. DoCM: a database of curated mutations in cancer. Nat Methods 13, 806–807 (2016). https://doi.org/10.1038/nmeth.4000


Published: 29 September 2016
Issue Date: October 2016
DOI: https://doi.org/10.1038/nmeth.4000

This article is cited by

Integrating chromatin accessibility states in the design of targeted sequencing panels for liquid biopsy
- Pegah Taklifi
- Fahimeh Palizban
- Mahya Mehrmohamadi
Scientific Reports (2022)
A platform for oncogenomic reporting and interpretation
- Caralyn Reisle
- Laura M. Williamson
- Steven J. M. Jones
Nature Communications (2022)
Current cancer driver variant predictors learn to recognize driver genes instead of functional variants
- Daniele Raimondi
- Antoine Passemiers
- Yves Moreau
BMC Biology (2021)
OncoGEMINI: software for investigating tumor variants from multiple biopsies with integrated cancer annotations
- Thomas J. Nicholas
- Michael J. Cormier
- Aaron R. Quinlan
Genome Medicine (2021)
JCGA: the Japanese version of the Cancer Genome Atlas and its contribution to the interpretation of gene alterations detected in clinical cancer genome sequencing
- Masakuni Serizawa
- Maki Mizuguchi
- Ken Yamaguchi
Human Genome Variation (2021)


Supplementary information

Supplementary Text and Figures

Supplementary Data 1

Supplementary Data 2
