
The NCI Genomic Data Commons

The National Cancer Institute (NCI) Genomic Data Commons (GDC) contains more than 2.9 petabytes of genomic and associated clinical data from more than 60 NCI-funded and other contributed cancer genomics research projects. The GDC consists of five applications over a common data model and a common application programming interface.


Fig. 1: Screenshot of the GDC DAVE tools.
Fig. 2
Fig. 3: Various daily GDC statistics from 1 October 2017 to 30 October 2020.




This project was funded in part with Federal funds from the National Cancer Institute, National Institutes of Health, agreement 14X050 and task order T02 under agreement 17X147 under contract HHSN261200800001E. The content of this publication does not necessarily reflect the views or policies of the Department of Health and Human Services, nor does mention of trade names, commercial products or organizations imply endorsement by the US Government. The project is grateful for the contributions of S. Marechek and E. Miller, both of whom have passed away.

Author information




The GDC software was developed and tested by A. Khurana, A. Kadam, A.W., A.H., A.C., A.Z., B.F.C., B.L.W., B.R., B.B., C.F.B., C.W., C.D., C.K.Y., C.Y., C.P.R., F. Gomez, F. Gerthoffert, F.C., G.L.G., I.M., J.C.A., J.J.P., J.B., J.A.M., J.P., J. Spring, J. Sislow, J.T.Y., J.S.M., J.Z., J.H.B.B., K.W., K.H., K.R., K.C., K.M., K.M.J.K., K.A.S., L.L., L.X., M.A., M.W.M., M.Y.P., M.S.F., M. Ford, M. Fukuma, P.L.P., P.-M.D., P.M., R.P., R.A., R.L.G., R.B., R.J., R.O.O., S.R., S.S., S.W., S.J., S.A., T.N., T.I.G., V.E.K., V.F., W.P.W., Y.T. and Y.Z. Bioinformatics, data curation and data modeling were performed by A.P.K., D.P.M., F.M.O., J.H.S., K.M.H., L.S., M.A.J., M.L.F., R.L.G., R.B., S.L., T.M.L., T.I.G., V.F., W.P.W., Z.Z. and Z.W. The project managers were A.P.H., B.I., C.K.Y., D.S.G., E.M., F.G., H.D.T., H.S., J.L., J.C.Z., L.Y., L. Stein, L. Staudt, M.A.J., M.L.F., M.T., R.L.G., S.S.G., S.G., T.D., T.J.S., V.F. and Z.W. The manuscript was written and revised by A.P.H., D.S.G., J.C.Z., L. Staudt, R.L.G. and Z.Z.

Corresponding author

Correspondence to Robert L. Grossman.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information Nature Genetics thanks the anonymous reviewers for their contribution to the peer review of this work.


About this article


Cite this article

Heath, A.P., Ferretti, V., Agrawal, S. et al. The NCI Genomic Data Commons. Nat Genet 53, 257–262 (2021).


