The NCI Genomic Data Commons

An Author Correction to this article was published on 18 May 2021

This article has been updated

The National Cancer Institute (NCI) Genomic Data Commons (GDC) contains more than 2.9 petabytes of genomic and associated clinical data from more than 60 NCI-funded and other contributed cancer genomics research projects. The GDC consists of five applications over a common data model and a common application programming interface.

Fig. 1: Screenshot of the GDC DAVE tools.
Fig. 2
Fig. 3: Various daily GDC statistics from 1 October 2017 to 30 October 2020.

This project was funded in part with Federal funds from the National Cancer Institute, National Institutes of Health, agreement 14X050 and task order T02 under agreement 17X147 under contract HHSN261200800001E. The content of this publication does not necessarily reflect the views or policies of the Department of Health and Human Services, nor does mention of trade names, commercial products or organizations imply endorsement by the US Government. The project is grateful for the contributions of S. Marechek and E. Miller, both of whom have passed away.

Author information

The GDC software was developed and tested by A. Khurana, A. Kadam, A.W., A.H., A.C., A.Z., B.F.C., B.L.W., B.R., B.B., C.F.B., C.W., C.D., C.K.Y., C.Y., C.P.R., F. Gomez, F. Gerthoffert, F.C., G.L.G., I.M., J.C.A., J.J.P., J.B., J.A.M., J.P., J. Spring, J. Sislow, J.T.Y., J.S.M., J.Z., J.H.B.B., K.W., K.H., K.R., K.C., K.M., K.M.J.K., K.A.S., L.L., L.X., M.A., M.W.M., M.Y.P., M.S.F., M. Ford, M. Fukuma, P.L.P., P.-M.D., P.M., R.P., R.A., R.L.G., R.B., R.J., R.O.O., S.R., S.S., S.W., S.J., S.A., T.N., T.I.G., V.E.K., V.F., W.P.W., Y.T. and Y.Z. Bioinformatics, data curation and data modeling were performed by A.P.K., D.P.M., F.M.O., J.H.S., J.Z., K.M.H., L.S., M.A.J., M.L.F., R.L.G., R.B., S.L., T.M.L., T.I.G., V.F., W.P.W., Z.Z. and Z.W. The project managers were A.P.H., B.I., C.K.Y., D.S.G., E.M., F.G., H.D.T., H.S., J.L., J.C.Z., L.Y., L. Stein, L. Staudt, M.A.J., M.L.F., M.T., R.L.G., S.S.G., S.G., T.D., T.J.S., V.F. and Z.W. The manuscript was written and revised by A.P.H., D.S.G., J.C.Z., L. Staudt, R.L.G. and Z.Z.

Corresponding author

Correspondence to Robert L. Grossman.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information: Nature Genetics thanks the anonymous reviewers for their contribution to the peer review of this work.

About this article

Cite this article

Heath, A.P., Ferretti, V., Agrawal, S. et al. The NCI Genomic Data Commons. Nat Genet 53, 257–262 (2021).
