A draft map of the human proteome

Kim, Min-Sik; Pinto, Sneha M.; Getnet, Derese; Nirujogi, Raja Sekhar; Manda, Srikanth S.; Chaerkady, Raghothama; Madugundu, Anil K.; Kelkar, Dhanashree S.; Isserlin, Ruth; Jain, Shobhit; Thomas, Joji K.; Muthusamy, Babylakshmi; Leal-Rojas, Pamela; Kumar, Praveen; Sahasrabuddhe, Nandini A.; Balakrishnan, Lavanya; Advani, Jayshree; George, Bijesh; Renuse, Santosh; Selvan, Lakshmi Dhevi N.; Patil, Arun H.; Nanjappa, Vishalakshi; Radhakrishnan, Aneesha; Prasad, Samarjeet; Subbannayya, Tejaswini; Raju, Rajesh; Kumar, Manish; Sreenivasamurthy, Sreelakshmi K.; Marimuthu, Arivusudar; Sathe, Gajanan J.; Chavan, Sandip; Datta, Keshava K.; Subbannayya, Yashwanth; Sahu, Apeksha; Yelamanchi, Soujanya D.; Jayaram, Savita; Rajagopalan, Pavithra; Sharma, Jyoti; Murthy, Krishna R.; Syed, Nazia; Goel, Renu; Khan, Aafaque A.; Ahmad, Sartaj; Dey, Gourav; Mudgal, Keshav; Chatterjee, Aditi; Huang, Tai-Chung; Zhong, Jun; Wu, Xinyan; Shaw, Patrick G.; Freed, Donald; Zahari, Muhammad S.; Mukherjee, Kanchan K.; Shankar, Subramanian; Mahadevan, Anita; Lam, Henry; Mitchell, Christopher J.; Shankar, Susarla Krishna; Satishchandra, Parthasarathy; Schroeder, John T.; Sirdeshmukh, Ravi; Maitra, Anirban; Leach, Steven D.; Drake, Charles G.; Halushka, Marc K.; Prasad, T. S. Keshava; Hruban, Ralph H.; Kerr, Candace L.; Bader, Gary D.; Iacobuzio-Donahue, Christine A.; Gowda, Harsha; Pandey, Akhilesh

doi:10.1038/nature13302

Article
Published: 28 May 2014

A draft map of the human proteome

Min-Sik Kim^1,2,
Sneha M. Pinto³,
Derese Getnet^1,4,
Raja Sekhar Nirujogi³,
Srikanth S. Manda³,
Raghothama Chaerkady^1,2,
Anil K. Madugundu³,
Dhanashree S. Kelkar³,
Ruth Isserlin⁵,
Shobhit Jain⁵,
Joji K. Thomas³,
Babylakshmi Muthusamy³,
Pamela Leal-Rojas^1,6,
Praveen Kumar³,
Nandini A. Sahasrabuddhe³,
Lavanya Balakrishnan³,
Jayshree Advani³,
Bijesh George³,
Santosh Renuse³,
Lakshmi Dhevi N. Selvan³,
Arun H. Patil³,
Vishalakshi Nanjappa³,
Aneesha Radhakrishnan³,
Samarjeet Prasad¹,
Tejaswini Subbannayya³,
Rajesh Raju³,
Manish Kumar³,
Sreelakshmi K. Sreenivasamurthy³,
Arivusudar Marimuthu³,
Gajanan J. Sathe³,
Sandip Chavan³,
Keshava K. Datta³,
Yashwanth Subbannayya³,
Apeksha Sahu³,
Soujanya D. Yelamanchi³,
Savita Jayaram³,
Pavithra Rajagopalan³,
Jyoti Sharma³,
Krishna R. Murthy³,
Nazia Syed³,
Renu Goel³,
Aafaque A. Khan³,
Sartaj Ahmad³,
Gourav Dey³,
Keshav Mudgal⁷,
Aditi Chatterjee³,
Tai-Chung Huang¹,
Jun Zhong¹,
Xinyan Wu^1,2,
Patrick G. Shaw¹,
Donald Freed¹,
Muhammad S. Zahari²,
Kanchan K. Mukherjee⁸,
Subramanian Shankar⁹,
Anita Mahadevan^10,11,
Henry Lam¹²,
Christopher J. Mitchell¹,
Susarla Krishna Shankar^10,11,
Parthasarathy Satishchandra¹³,
John T. Schroeder¹⁴,
Ravi Sirdeshmukh³,
Anirban Maitra^15,16,
Steven D. Leach^1,17,
Charles G. Drake^16,18,
Marc K. Halushka¹⁵,
T. S. Keshava Prasad³,
Ralph H. Hruban^15,16,
Candace L. Kerr¹⁹^nAff21,
Gary D. Bader⁵,
Christine A. Iacobuzio-Donahue^15,16,17,
Harsha Gowda³ &
…
Akhilesh Pandey^{1,2,3,4,15,16,20}

Nature volume 509, pages 575–581 (2014)Cite this article

116k Accesses
1561 Citations
571 Altmetric
Metrics details

Subjects

Abstract

The availability of human genome sequence has transformed biomedical research over the past decade. However, an equivalent map for the human proteome with direct measurements of proteins and peptides does not exist yet. Here we present a draft map of the human proteome using high-resolution Fourier-transform mass spectrometry. In-depth proteomic profiling of 30 histologically normal human samples, including 17 adult tissues, 7 fetal tissues and 6 purified primary haematopoietic cells, resulted in identification of proteins encoded by 17,294 genes accounting for approximately 84% of the total annotated protein-coding genes in humans. A unique and comprehensive strategy for proteogenomic analysis enabled us to discover a number of novel protein-coding regions, which includes translated pseudogenes, non-coding RNAs and upstream open reading frames. This large human proteome catalogue (available as an interactive web-based resource at http://www.humanproteomemap.org) will complement available human genome and transcriptome data to accelerate biomedical research in health and disease.

Access through your institution

Buy or subscribe

This is a preview of subscription content, access via your institution

Access options

Access through your institution

Buy this article

Purchase on Springer Link
Instant access to full article PDF

Buy now

Prices may be subject to local taxes which are calculated during checkout

**Figure 1: Overview of the workflow and comparison of data with public repositories.**

**Figure 2: Landscape of the normal human proteome.**

**Figure 3: Isoform-specific expression.**

**Figure 5: Translation of pseudogenes and identification of novel N termini.**

A high-stringency blueprint of the human proteome

Article Open access 16 October 2020

Mass spectrometry-based draft of the mouse proteome

Article 16 June 2022

The proteome landscape of the kingdoms of life

Article 17 June 2020

References

The ENCODE Project Consortium. An integrated encyclopedia of DNA elements in the human genome. Nature 489, 57–74 (2012)
Aebersold, R. & Mann, M. Mass spectrometry-based proteomics. Nature 422, 198–207 (2003)
Article ADS CAS PubMed Google Scholar
Bensimon, A., Heck, A. J. & Aebersold, R. Mass spectrometry-based proteomics and network biology. Annu. Rev. Biochem. 81, 379–405 (2012)
Article CAS PubMed Google Scholar
Cravatt, B. F., Simon, G. M. & Yates, J. R., III The biological impact of mass-spectrometry-based proteomics. Nature 450, 991–1000 (2007)
Article ADS CAS PubMed Google Scholar
Nagaraj, N. et al. System-wide perturbation analysis with nearly complete coverage of the yeast proteome by single-shot ultra HPLC runs on a bench top Orbitrap. Mol. Cell. Proteomics 11, M111.013722 (2012)
Article PubMed CAS Google Scholar
Picotti, P. et al. A complete mass-spectrometric map of the yeast proteome applied to quantitative trait analysis. Nature 494, 266–270 (2013)
Article ADS CAS PubMed PubMed Central Google Scholar
Kelkar, D. S. et al. Proteogenomic analysis of Mycobacterium tuberculosis by high resolution mass spectrometry. Mol. Cell. Proteomics 10, M111.011627 (2011)
Article PubMed PubMed Central Google Scholar
Huttlin, E. L. et al. A tissue-specific atlas of mouse protein phosphorylation and expression. Cell 143, 1174–1189 (2010)
Article CAS PubMed PubMed Central Google Scholar
Gholami, A. M. et al. Global proteome analysis of the NCI-60 cell line panel. Cell Rep. 4, 609–620 (2013)
Article CAS PubMed Google Scholar
Branca, R. M. et al. HiRIEF LC-MS enables deep proteome coverage and unbiased proteogenomics. Nature Methods 11, 59–62 (2014)
Article CAS PubMed Google Scholar
Farrah, T. et al. The state of the human proteome in 2012 as viewed through PeptideAtlas. J. Proteome Res. 12, 162–171 (2013)
Article CAS PubMed Google Scholar
Craig, R., Cortens, J. P. & Beavis, R. C. Open source system for analyzing, validating, and storing protein identification data. J. Proteome Res. 3, 1234–1242 (2004)
Article CAS PubMed Google Scholar
Gaudet, P. et al. neXtProt: organizing protein knowledge in the context of human proteome projects. J. Proteome Res. 12, 293–298 (2013)
Article CAS PubMed Google Scholar
Uhlen, M. et al. Towards a knowledge-based Human Protein Atlas. Nature Biotechnol. 28, 1248–1250 (2010)
Article CAS Google Scholar
Pruitt, K. D. et al. RefSeq: an update on mammalian reference sequences. Nucleic Acids Res. 42, D756–D763 (2014)
Article CAS PubMed Google Scholar
Perkins, D. N., Pappin, D. J., Creasy, D. M. & Cottrell, J. S. Probability-based protein identification by searching sequence databases using mass spectrometry data. Electrophoresis 20, 3551–3567 (1999)
Article CAS PubMed Google Scholar
Eng, J. K., McCormack, A. L. & Yates, J. R. An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database. J. Am. Soc. Mass Spectrom. 5, 976–989 (1994)
Article CAS PubMed Google Scholar
Käll, L., Canterbury, J. D., Weston, J., Noble, W. S. & MacCoss, M. J. Semi-supervised learning for peptide identification from shotgun proteomics datasets. Nature Methods 4, 923–925 (2007)
Article PubMed CAS Google Scholar
Lane, L. et al. Metrics for the human proteome project 2013–2014 and strategies for finding missing proteins. J. Proteome Res. 13, 15–20 (2014)
Article CAS PubMed Google Scholar
Mosley, A. L. et al. Highly reproducible label free quantitative proteomic analysis of RNA polymerase complexes. Mol. Cell. Proteomics 10, M110.000687 (2011)
Article PubMed CAS Google Scholar
Fountoulakis, M., Juranville, J. F., Dierssen, M. & Lubec, G. Proteomic analysis of the fetal brain. Proteomics 2, 1547–1576 (2002)
Article CAS PubMed Google Scholar
Ying, W. et al. A dataset of human fetal liver proteome identified by subcellular fractionation and multiple protein separation and identification technology. Mol. Cell. Proteomics 5, 1703–1707 (2006)
Article CAS PubMed Google Scholar
Jansen, R., Greenbaum, D. & Gerstein, M. Relating whole-genome expression data with protein-protein interactions. Genome Res. 12, 37–46 (2002)
Article CAS PubMed PubMed Central Google Scholar
Ge, H., Liu, Z., Church, G. M. & Vidal, M. Correlation between transcriptome and interactome mapping data from Saccharomyces cerevisiae. Nature Genet. 29, 482–486 (2001)
Article CAS PubMed Google Scholar
Ruepp, A. et al. CORUM: the comprehensive resource of mammalian protein complexes–2009. Nucleic Acids Res. 38, D497–D501 (2010)
Article CAS PubMed Google Scholar
Ferrington, D. A. & Gregerson, D. S. Immunoproteasomes: structure, function, and antigen presentation. Prog. Mol. Biol. Transl. Sci. 109, 75–112 (2012)
Article CAS PubMed PubMed Central Google Scholar
Steen, H. & Mann, M. The abc’s (and xyz’s) of peptide sequencing. Nature Rev. Mol. Cell Biol. 5, 699–711 (2004)
Article CAS Google Scholar
Sugimoto, J., Sugimoto, M., Bernstein, H., Jinno, Y. & Schust, D. A novel human endogenous retroviral protein inhibits cell-cell fusion. Sci. Rep. 3, 1462 (2013)
Article ADS PubMed PubMed Central CAS Google Scholar
Guttman, M., Russell, P., Ingolia, N. T., Weissman, J. S. & Lander, E. S. Ribosome profiling provides evidence that large noncoding RNAs do not encode proteins. Cell 154, 240–251 (2013)
Article CAS PubMed PubMed Central Google Scholar
Kalyana-Sundaram, S. et al. Expressed pseudogenes in the transcriptional landscape of human cancers. Cell 149, 1622–1634 (2012)
Article CAS PubMed PubMed Central Google Scholar
Pei, B. et al. The GENCODE pseudogene resource. Genome Biol. 13, R51 (2012)
Article CAS PubMed PubMed Central Google Scholar
Abecasis, G. R. et al. An integrated map of genetic variation from 1,092 human genomes. Nature 491, 56–65 (2012)
Article ADS PubMed CAS Google Scholar
Peri, S. & Pandey, A. A reassessment of the translation initiation codon in vertebrates. Trends Genet. 17, 685–687 (2001)
Article CAS PubMed Google Scholar
Legrain, P. et al. The human proteome project: current state and future direction. Mol. Cell. Proteomics 10, M111.009993 (2011)
Article PubMed PubMed Central CAS Google Scholar
Paik, Y. K. et al. The Chromosome-Centric Human Proteome Project for cataloging proteins encoded in the genome. Nature Biotechnol. 30, 221–223 (2012)
Article CAS Google Scholar
Marko-Varga, G., Omenn, G. S., Paik, Y. K. & Hancock, W. S. A first step toward completion of a genome-wide characterization of the human proteome. J. Proteome Res. 12, 1–5 (2013)
Article CAS PubMed Google Scholar
Shevchenko, A., Tomas, H., Havlis, J., Olsen, J. V. & Mann, M. In-gel digestion for mass spectrometric characterization of proteins and proteomes. Nature Protocols 1, 2856–2860 (2007)
Article CAS Google Scholar
Wang, Y. et al. Reversed-phase chromatography with multiple fraction concatenation strategy for proteome profiling of human MCF10A cells. Proteomics 11, 2019–2026 (2011)
Article CAS PubMed PubMed Central Google Scholar
Olsen, J. V. et al. Parts per million mass accuracy on an Orbitrap mass spectrometer via lock mass injection into a C-trap. Mol. Cell. Proteomics 4, 2010–2021 (2005)
Article CAS PubMed Google Scholar
Vizcaíno, J. A. et al. The PRoteomics IDEntifications (PRIDE) database and associated tools: status in 2013. Nucleic Acids Res. 41, D1063–D1069 (2013)
Article PubMed CAS Google Scholar
Craig, R. & Beavis, R. C. TANDEM: matching proteins with tandem mass spectra. Bioinformatics 20, 1466–1467 (2004)
Article CAS PubMed Google Scholar
Meyer, L. R. et al. The UCSC Genome Browser database: extensions and updates 2013. Nucleic Acids Res. 41, D64–D69 (2013)
Article CAS PubMed Google Scholar
Razick, S., Magklaras, G. & Donaldson, I. M. iRefIndex: a consolidated protein interaction database with provenance. BMC Bioinformatics 9, 405 (2008)
Article PubMed PubMed Central CAS Google Scholar
Zuberi, K. et al. GeneMANIA prediction server 2013 update. Nucleic Acids Res. 41, W115–W122 (2013)
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We would like to acknowledge the National Development and Research Institutes for some of the tissues. We acknowledge the assistance of V. Sandhya, V. Puttamallesh, U. Guha and B. Cole for help with analysis of some of the samples. We thank L. Lane and B. Amos for their assistance with the list of missing genes. This work was supported by an NIH roadmap grant for Technology Centers of Networks and Pathways (U54GM103520), NCI’s Clinical Proteomic Tumor Analysis Consortium initiative (U24CA160036), a contract (HHSN268201000032C) from the National Heart, Lung and Blood Institute and the Sol Goldman Pancreatic Cancer Research Center. The authors acknowledge the joint participation by the Adrienne Helis Malvin Medical Research Foundation and the Diana Helis Henry Medical Research Foundation through its direct engagement in the continuous active conduct of medical research in conjunction with The Johns Hopkins Hospital and the Johns Hopkins University School of Medicine and the Foundation’s Parkinson’s Disease Programs. The analysis work was partially supported by the National Resource for Network Biology (P41GM103504). A.Mah., S.K.Sh., P.S. and T.S.K.P. are supported by DBT Program Support on Neuroproteomics (BT/01/COE/08/05) to IOB and NIMHANS. H.G. is a Wellcome Trust-DBT India Alliance Early Career Fellow. We thank Council of Scientific and Industrial Research, University Grants Commission and Department of Science and Technology, Government of India for research fellowships for S.M.P., R.S.N., A.R., M.K., G.J.S., S.C., P.R., J.S., S.S.M., D.S.K., S.R., S.K.Sr., K.K.D., Y.S., A.S., S.D.Y., N.S., S.A. and G.D.

Author information

Candace L. Kerr
Present address: Present address: Department of Biochemistry and Molecular Biology, University of Maryland School of Medicine, Baltimore, Maryland 21201, USA.,

Authors and Affiliations

McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, 21205, Maryland, USA
Min-Sik Kim, Derese Getnet, Raghothama Chaerkady, Pamela Leal-Rojas, Samarjeet Prasad, Tai-Chung Huang, Jun Zhong, Xinyan Wu, Patrick G. Shaw, Donald Freed, Christopher J. Mitchell, Steven D. Leach & Akhilesh Pandey
Department of Biological Chemistry, Johns Hopkins University School of Medicine, Baltimore, 21205, Maryland, USA
Min-Sik Kim, Raghothama Chaerkady, Xinyan Wu, Muhammad S. Zahari & Akhilesh Pandey
Institute of Bioinformatics, International Tech Park, Bangalore 560066, India,
Sneha M. Pinto, Raja Sekhar Nirujogi, Srikanth S. Manda, Anil K. Madugundu, Dhanashree S. Kelkar, Joji K. Thomas, Babylakshmi Muthusamy, Praveen Kumar, Nandini A. Sahasrabuddhe, Lavanya Balakrishnan, Jayshree Advani, Bijesh George, Santosh Renuse, Lakshmi Dhevi N. Selvan, Arun H. Patil, Vishalakshi Nanjappa, Aneesha Radhakrishnan, Tejaswini Subbannayya, Rajesh Raju, Manish Kumar, Sreelakshmi K. Sreenivasamurthy, Arivusudar Marimuthu, Gajanan J. Sathe, Sandip Chavan, Keshava K. Datta, Yashwanth Subbannayya, Apeksha Sahu, Soujanya D. Yelamanchi, Savita Jayaram, Pavithra Rajagopalan, Jyoti Sharma, Krishna R. Murthy, Nazia Syed, Renu Goel, Aafaque A. Khan, Sartaj Ahmad, Gourav Dey, Aditi Chatterjee, Ravi Sirdeshmukh, T. S. Keshava Prasad, Harsha Gowda & Akhilesh Pandey
Adrienne Helis Malvin Medical Research Foundation, New Orleans, 70130, Louisiana, USA
Derese Getnet & Akhilesh Pandey
The Donnelly Centre, University of Toronto, Toronto, Ontario M5S 3E1, Canada,
Ruth Isserlin, Shobhit Jain & Gary D. Bader
Department of Pathology, Universidad de La Frontera, Center of Genetic and Immunological Studies-Scientific and Technological Bioresource Nucleus, Temuco 4811230, Chile,
Pamela Leal-Rojas
School of Medicine, Imperial College London, South Kensington Campus, London SW7 2AZ, UK,
Keshav Mudgal
Department of Neurosurgery, Postgraduate Institute of Medical Education & Research, Chandigarh 160012, India,
Kanchan K. Mukherjee
Department of Internal Medicine Armed Forces Medical College, Pune 411040, India,
Subramanian Shankar
Department of Neuropathology, National Institute of Mental Health and Neurosciences, Bangalore 560029, India,
Anita Mahadevan & Susarla Krishna Shankar
Human Brain Tissue Repository, Neurobiology Research Centre, National Institute of Mental Health and Neurosciences, Bangalore 560029, India,
Anita Mahadevan & Susarla Krishna Shankar
Department of Chemical and Biomolecular Engineering and Division of Biomedical Engineering, The Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong,
Henry Lam
Department of Neurology, National Institute of Mental Health and Neurosciences, Bangalore 560029, India,
Parthasarathy Satishchandra
Department of Medicine, Johns Hopkins University School of Medicine, Baltimore, 21224, Maryland, USA
John T. Schroeder
Department of Pathology, The Sol Goldman Pancreatic Cancer Research Center, Johns Hopkins University School of Medicine, Baltimore, 21231, Maryland, USA
Anirban Maitra, Marc K. Halushka, Ralph H. Hruban, Christine A. Iacobuzio-Donahue & Akhilesh Pandey
Department of Oncology, Johns Hopkins University School of Medicine, Baltimore, 21231, Maryland, USA
Anirban Maitra, Charles G. Drake, Ralph H. Hruban, Christine A. Iacobuzio-Donahue & Akhilesh Pandey
Department of Surgery, Johns Hopkins University School of Medicine, Baltimore, 21231, Maryland, USA
Steven D. Leach & Christine A. Iacobuzio-Donahue
Departments of Immunology and Urology, Sidney Kimmel Comprehensive Cancer Center, Johns Hopkins University School of Medicine, Baltimore, 21231, Maryland, USA
Charles G. Drake
Department of Obstetrics and Gynecology, Johns Hopkins University School of Medicine Baltimore, 21205, Maryland, USA
Candace L. Kerr
Diana Helis Henry Medical Research Foundation, New Orleans, 70130, Louisiana, USA
Akhilesh Pandey

Authors

Min-Sik Kim
View author publications
You can also search for this author in PubMed Google Scholar
Sneha M. Pinto
View author publications
You can also search for this author in PubMed Google Scholar
Derese Getnet
View author publications
You can also search for this author in PubMed Google Scholar
Raja Sekhar Nirujogi
View author publications
You can also search for this author in PubMed Google Scholar
Srikanth S. Manda
View author publications
You can also search for this author in PubMed Google Scholar
Raghothama Chaerkady
View author publications
You can also search for this author in PubMed Google Scholar
Anil K. Madugundu
View author publications
You can also search for this author in PubMed Google Scholar
Dhanashree S. Kelkar
View author publications
You can also search for this author in PubMed Google Scholar
Ruth Isserlin
View author publications
You can also search for this author in PubMed Google Scholar
Shobhit Jain
View author publications
You can also search for this author in PubMed Google Scholar
Joji K. Thomas
View author publications
You can also search for this author in PubMed Google Scholar
Babylakshmi Muthusamy
View author publications
You can also search for this author in PubMed Google Scholar
Pamela Leal-Rojas
View author publications
You can also search for this author in PubMed Google Scholar
Praveen Kumar
View author publications
You can also search for this author in PubMed Google Scholar
Nandini A. Sahasrabuddhe
View author publications
You can also search for this author in PubMed Google Scholar
Lavanya Balakrishnan
View author publications
You can also search for this author in PubMed Google Scholar
Jayshree Advani
View author publications
You can also search for this author in PubMed Google Scholar
Bijesh George
View author publications
You can also search for this author in PubMed Google Scholar
Santosh Renuse
View author publications
You can also search for this author in PubMed Google Scholar
Lakshmi Dhevi N. Selvan
View author publications
You can also search for this author in PubMed Google Scholar
Arun H. Patil
View author publications
You can also search for this author in PubMed Google Scholar
Vishalakshi Nanjappa
View author publications
You can also search for this author in PubMed Google Scholar
Aneesha Radhakrishnan
View author publications
You can also search for this author in PubMed Google Scholar
Samarjeet Prasad
View author publications
You can also search for this author in PubMed Google Scholar
Tejaswini Subbannayya
View author publications
You can also search for this author in PubMed Google Scholar
Rajesh Raju
View author publications
You can also search for this author in PubMed Google Scholar
Manish Kumar
View author publications
You can also search for this author in PubMed Google Scholar
Sreelakshmi K. Sreenivasamurthy
View author publications
You can also search for this author in PubMed Google Scholar
Arivusudar Marimuthu
View author publications
You can also search for this author in PubMed Google Scholar
Gajanan J. Sathe
View author publications
You can also search for this author in PubMed Google Scholar
Sandip Chavan
View author publications
You can also search for this author in PubMed Google Scholar
Keshava K. Datta
View author publications
You can also search for this author in PubMed Google Scholar
Yashwanth Subbannayya
View author publications
You can also search for this author in PubMed Google Scholar
Apeksha Sahu
View author publications
You can also search for this author in PubMed Google Scholar
Soujanya D. Yelamanchi
View author publications
You can also search for this author in PubMed Google Scholar
Savita Jayaram
View author publications
You can also search for this author in PubMed Google Scholar
Pavithra Rajagopalan
View author publications
You can also search for this author in PubMed Google Scholar
Jyoti Sharma
View author publications
You can also search for this author in PubMed Google Scholar
Krishna R. Murthy
View author publications
You can also search for this author in PubMed Google Scholar
Nazia Syed
View author publications
You can also search for this author in PubMed Google Scholar
Renu Goel
View author publications
You can also search for this author in PubMed Google Scholar
Aafaque A. Khan
View author publications
You can also search for this author in PubMed Google Scholar
Sartaj Ahmad
View author publications
You can also search for this author in PubMed Google Scholar
Gourav Dey
View author publications
You can also search for this author in PubMed Google Scholar
Keshav Mudgal
View author publications
You can also search for this author in PubMed Google Scholar
Aditi Chatterjee
View author publications
You can also search for this author in PubMed Google Scholar
Tai-Chung Huang
View author publications
You can also search for this author in PubMed Google Scholar
Jun Zhong
View author publications
You can also search for this author in PubMed Google Scholar
Xinyan Wu
View author publications
You can also search for this author in PubMed Google Scholar
Patrick G. Shaw
View author publications
You can also search for this author in PubMed Google Scholar
Donald Freed
View author publications
You can also search for this author in PubMed Google Scholar
Muhammad S. Zahari
View author publications
You can also search for this author in PubMed Google Scholar
Kanchan K. Mukherjee
View author publications
You can also search for this author in PubMed Google Scholar
Subramanian Shankar
View author publications
You can also search for this author in PubMed Google Scholar
Anita Mahadevan
View author publications
You can also search for this author in PubMed Google Scholar
Henry Lam
View author publications
You can also search for this author in PubMed Google Scholar
Christopher J. Mitchell
View author publications
You can also search for this author in PubMed Google Scholar
Susarla Krishna Shankar
View author publications
You can also search for this author in PubMed Google Scholar
Parthasarathy Satishchandra
View author publications
You can also search for this author in PubMed Google Scholar
John T. Schroeder
View author publications
You can also search for this author in PubMed Google Scholar
Ravi Sirdeshmukh
View author publications
You can also search for this author in PubMed Google Scholar
Anirban Maitra
View author publications
You can also search for this author in PubMed Google Scholar
Steven D. Leach
View author publications
You can also search for this author in PubMed Google Scholar
Charles G. Drake
View author publications
You can also search for this author in PubMed Google Scholar
Marc K. Halushka
View author publications
You can also search for this author in PubMed Google Scholar
T. S. Keshava Prasad
View author publications
You can also search for this author in PubMed Google Scholar
Ralph H. Hruban
View author publications
You can also search for this author in PubMed Google Scholar
Candace L. Kerr
View author publications
You can also search for this author in PubMed Google Scholar
Gary D. Bader
View author publications
You can also search for this author in PubMed Google Scholar
Christine A. Iacobuzio-Donahue
View author publications
You can also search for this author in PubMed Google Scholar
Harsha Gowda
View author publications
You can also search for this author in PubMed Google Scholar
Akhilesh Pandey
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.P., H.G., R.C., M.-S.K. designed the study; A.P., H.G., M.-S.K. managed the study; D.G., C.L.K., C.A.I.-D., K.R.M. collected human cells/tissues; M.-S.K., R.C., D.G. developed the pipeline of experiment and analysis; D.G., M.-S.K., S.M.P., K.M., R.C., S.R., J.Z., X.W., P.G.S., M.S.Z., T.-C.H. prepared peptide samples for LC-MS/MS; M.-S.K., R.S.N., S.M.P., R.C., D.S.K., S.R., G.J.S. performed LC-MS/MS; M.-S.K., S.M.P., S.P., S.S.M., C.J.M., J.A. and A.K.M. processed MS data and managed data; A.K.M., S.S.M., B.G., A.H.P., Y.S., M.-S.K. performed comparison analysis with PeptideAtlas, neXtProt and GPMDB; R.I., S.Jai., G.D.B. performed interaction and complex analysis; M.-S.K., S.M.P., S.S.M., P.K., A.K.M., N.A.S., R.S.N., L.B., L.D.N.S., D.S.K., V.N., A.R., T.S., M.K., S.K.Sr., G.D., A.Mar., R.R., S.C., K.K.D., A.S., S.D.Y., S.Jay., P.R., A.H.P., B.G., J.S., N.S., R.G., G.J.S., A.A.K., S.A., D.F., T.S.K.P., H.G., A.P. performed proteogenomic analysis; A.C., H.L., R.S., J.T.S., K.K.M., S.S., A.Mah., S.K.Sh., P.S., S.D.L., C.G.D., A.Mai., M.K.H., R.H.H., C.L.K., C.A.I.-D. assisted with analysis of the data; M.-S.K., S.M.P., T.-C.H., P.L.-R. performed western blot experiments; M.-S.K., J.K.T., A.K.M., B.M., S.P., S.M.P. designed the Human Proteome Map web portal; M.-S.K., A.K.M., J.K.T. generated selected reaction monitoring (SRM) database; M.-S.K., K.M., G.D., S.M.P., S.S.M. illustrated figures with help of other authors; A.P., M.-S.K., H.G. wrote the manuscript with inputs from other authors.

Corresponding authors

Correspondence to Harsha Gowda or Akhilesh Pandey.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Additional information

The mass spectrometry proteomics data have been deposited to the ProteomeXchange Consortium (http://proteomecentral.proteomexchange.org) via the PRIDE partner repository with the dataset identifier PXD000561.

Extended data figures and tables

Extended Data Figure 1 Summary of proteome analysis.

a, Mass error in parts per million for precursor ions of all identified peptides. b, Number of peptides detected per gene binned as shown. c, Distribution of sequence coverage of identified proteins. d–f, %FDR with a q value of <0.01 plotted against peptide length in number of amino acids, charge state of peptide ion and number of cleavage sites missed by enzyme. P values computed from two-tailed t-test are shown. Error bars indicate s.d. calculated from FDRs of multiple fetal samples. g, h, A comparison of peptides identified in this study with PeptideAtlas and GPMDB. i, Mass error in parts per million for precursor ions identified from proteogenomics analysis.

Extended Data Figure 2 Tissue-wise gene expression and housekeeping proteins.

a, A heat map shows a partial list of not well-characterized, hypothetical genes. b, The bulk of protein mass is contributed by only a small number of genes. Only 2,350 ‘housekeeping genes’ account for ∼75% of proteome mass. c, The number of cell/tissue types where a gene was observed was counted. Some genes were found to be specifically restricted in a few samples while others were observed in the majority of samples analysed. For example, 1,537 genes were detected only in one sample, and 2,350 genes were found in all samples. These latter genes can be defined as highly abundant ‘housekeeping proteins’. d, Distribution of genes in the RefSeq database based on the number of protein isoforms resulting from their annotated transcripts (left). Distribution of the transcripts with two or more protein isoforms annotated based on the number of isoform-specific or shared peptides (right). e, A representative example of sequence coverage of PSMB8 protein along with tissue distribution of all of its identified peptides and the MS/MS spectrum of one of the peptides is shown along with seven selected reaction monitoring (SRM) transitions.

Extended Data Figure 3 Western blot analysis of select tissue-restricted proteins.

a, Eight proteins showing tissue-restricted expression were tested using western blot analysis in 17 adult tissues. GAPDH was used as a loading control. b, Four proteins found to be expressed in a broad range of tissues, although bands that do not correspond to the expected molecular weight are also observed. CST, Cell Signalling Technology; SCB, Santa Cruz Biotechnology.

Extended Data Figure 4 Identification of novel genes/ORFs and translated non-coding RNAs.

a, An example of a novel ORF in an alternate reading frame located in the 3′ UTR of CHTF8 gene. The relative abundance of peptides from the CHTF8 protein and the protein encoded by the novel ORF is shown (bottom). b, An example of translated non-coding RNA (NR_027693.1) identified by searching 3-frame-translated transcript database. The MS/MS spectrum of one of the five identified peptides (LEVASSPPVSEAVPR) is shown along with a similar fragmentation pattern observed from the corresponding synthetic peptide.

Extended Data Figure 5 Human genome annotation through proteogenomic analysis using GeneSpring.

a, Four genome search specific peptides (GSSPs; red boxes) map to an upstream ORF (denoted as black hashes) located in 5′ UTR of the SLC35A4 gene (ORF shown as blue rectangle). b, GSSP mapping in the intergenic region between two RefSeq annotated genes NDUFv3 and PKNOX1. The ORF region is depicted in dotted lines of human endogenous retroviral element (HERV). c, GSSPs mapping to an annotated pseudogene MAGEB6P1, the alignments of parent gene and pseudogene are shown below the peptides.

Extended Data Figure 6 Frequency of nucleotides surrounding translational start sites.

a, Frequency of nucleotides at positions ranging from −5 to +1 surrounding the AUG codon for confirmed translational start sites. b, Frequency of nucleotides at positions ranging from −5 to +1 surrounding the AUG codon for novel translational start sites identified in this study.

Supplementary information

Supplementary Information

This file contains a Supplementary Discussion and additional references. (PDF 106 kb)

Supplementary Data

This file contains Supplementary Data. (PDF 3594 kb)

Supplementary Table 1

This file contains a summary of results from proteogenomics analysis; a list of peptides indicating novel signal peptide cleavage sites; and a draft map of the human proteome. (XLSX 1178 kb)

PowerPoint slides

PowerPoint slide for Fig. 1

PowerPoint slide for Fig. 2

PowerPoint slide for Fig. 3

PowerPoint slide for Fig. 4

PowerPoint slide for Fig. 5

Rights and permissions

Reprints and permissions

About this article

Cite this article

Kim, MS., Pinto, S., Getnet, D. et al. A draft map of the human proteome. Nature 509, 575–581 (2014). https://doi.org/10.1038/nature13302

Download citation

Received: 09 August 2013
Accepted: 31 March 2014
Published: 28 May 2014
Issue Date: 29 May 2014
DOI: https://doi.org/10.1038/nature13302

This article is cited by

Harnessing the power of proteomics in precision diabetes medicine
- Nigel Kurgan
- Jeppe Kjærgaard Larsen
- Atul S. Deshmukh
Diabetologia (2024)
Biological big-data sources, problems of storage, computational issues, and applications: a comprehensive review
- Jyoti Kant Chaudhari
- Shubham Pant
- Dev Bukhsh Singh
Knowledge and Information Systems (2024)
Epigenetics of the far northern Yakutian population
- Alena Kalyakulina
- Igor Yusipov
- Mikhail Ivanchenko
Clinical Epigenetics (2023)
Splicing complexity as a pivotal feature of alternative exons in mammalian species
- Feiyang Zhao
- Yubin Yan
- Ruolin Yang
BMC Genomics (2023)
Epigenetic marks associated with gestational diabetes mellitus across two time points during pregnancy
- Teresa Linares-Pineda
- Nerea Peña-Montero
- Sonsoles Morcillo
Clinical Epigenetics (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.