Abstract
Analysis of genomes, transcriptomes and proteomes reveals the existence of hundreds to thousands of translated, yet non-annotated, short open reading frames (small ORFs or smORFs). The discovery of smORFs and their protein products, smORF-encoded polypeptides (SEPs), points to a fundamental gap in our knowledge of protein-coding genes. Various studies have identified central roles for smORFs in metabolism, apoptosis and development. The discovery of these bioactive SEPs emphasizes the functional potential of this unexplored class of biomolecules. Here, we provide an overview of this emerging field and highlight the opportunities for chemical biology to answer fundamental questions about these novel genes. Such studies will provide new insights into the protein-coding potential of genomes and identify functional genes with roles in biology and disease.
This is a preview of subscription content, access via your institution
Access options
Subscribe to this journal
Receive 12 print issues and online access
$259.00 per year
only $21.58 per issue
Buy this article
- Purchase on Springer Link
- Instant access to full article PDF
Prices may be subject to local taxes which are calculated during checkout
Similar content being viewed by others
Change history
17 December 2015
In the version of this article originally published in print, the citation for ref. 48 was omitted. It should have been cited as follows on p. 910 near the top right: "The translation of the uORFs regulates downstream ORF translation48 (Fig. 1c)." The error has been corrected in the online PDF and HTML versions of the article.
References
Gardner, D. & Shoback, D. Greenspan's Basic and Clinical Endocrinology edn. 9 (McGraw-Hill Education, 2011).
Kastin, A. Handbook of Biologically Active Peptides (Elsevier Science, 2013).
Wilkinson, M. & Brown, R.E. An Introduction to Neuroendocrinology (Cambridge University Press, 2015).
Bliss, M. The Discovery of Insulin (University of Chicago Press, 2013).
Bliss, M. & Purkis, R. The Discovery of Insulin (University of Chicago Press Chicago, 1982).
de Lecea, L. et al. The hypocretins: hypothalamus-specific peptides with neuroexcitatory activity. Proc. Natl. Acad. Sci. USA 95, 322–327 (1998).
Sakurai, T. et al. Orexins and orexin receptors: a family of hypothalamic neuropeptides and G protein-coupled receptors that regulate feeding behavior. Cell 92, 573–585 (1998).
Vale, W., Spiess, J., Rivier, C. & Rivier, J. Characterization of a 41-residue ovine hypothalamic peptide that stimulates secretion of corticotropin and beta-endorphin. Science 213, 1394–1397 (1981).
Zhang, Y. et al. Positional cloning of the mouse obese gene and its human homologue. Nature 372, 425–432 (1994).
Eng, J., Kleinman, W., Singh, L., Singh, G. & Raufman, J. Isolation and characterization of exendin-4, an exendin-3 analogue, from Heloderma suspectum venom. Further evidence for an exendin receptor on dispersed acini from guinea pig pancreas. J. Biol. Chem. 267, 7402–7405 (1992).
Finan, B. et al. A rationally designed monomeric peptide triagonist corrects obesity and diabetes in rodents. Nat. Med. 21, 27–36 (2015).
Hruby, V.J. Designing peptide receptor agonists and antagonists. Nat. Rev. Drug Discov. 1, 847–858 (2002).
Sammons, M.F. & Lee, E.C. Recent progress in the development of small-molecule glucagon receptor antagonists. Bioorg. Med. Chem. Lett. 25, 4057–4064 (2015).
Brown, N.J. & Vaughan, D.E. Angiotensin-converting enzyme inhibitors. Circulation 97, 1411–1420 (1998).
Thornberry, N.A. & Weber, A.E. Discovery of JANUVIA™ (Sitagliptin), a selective dipeptidyl peptidase IV inhibitor for the treatment of type 2 diabetes. Curr. Top. Med. Chem. 7, 557–568 (2007).
Basrai, M.A., Hieter, P. & Boeke, J.D. Small open reading frames: beautiful needles in the haystack. Genome Res. 7, 768–771 (1997).
Aspden, J.L. et al. Extensive translation of small open reading frames revealed by Poly-Ribo-Seq. Elife 3, e03528 (2014).
Frith, M.C. et al. The abundance of short proteins in the mammalian proteome. PLoS Genet. 2, e52 (2006).
Hanada, K. et al. Small open reading frames associated with morphogenesis are hidden in plant genomes. Proc. Natl. Acad. Sci. USA 110, 2395–2400 (2013).
Hemm, M.R., Paul, B.J., Schneider, T.D., Storz, G. & Rudd, K.E. Small membrane proteins found by comparative genomics and ribosome binding site models. Mol. Microbiol. 70, 1487–1501 (2008).
Kastenmayer, J.P. et al. Functional genomics of genes with small open reading frames (sORFs) in S. cerevisiae. Genome Res. 16, 365–373 (2006).
Ladoukakis, E., Pereira, V., Magny, E.G., Eyre-Walker, A. & Couso, J.P. Hundreds of putatively functional small open reading frames in Drosophila. Genome Biol. 12, R118 (2011).
Oyama, M. et al. Diversity of translation start sites may define increased complexity of the human short ORFeome. Mol. Cell. Proteomics 6, 1000–1006 (2007).
Slavoff, S.A. et al. Peptidomic discovery of short open reading frame–encoded peptides in human cells. Nat. Chem. Biol. 9, 59–64 (2013).
Vanderperre, B. et al. Direct detection of alternative open reading frames translation products in human significantly expands the proteome. PLoS ONE 8, e70698 (2013).
Vanderperre, B., Lucier, J.F. & Roucou, X. HAltORF: a database of predicted out-of-frame alternative open reading frames in human. Database (Oxford) 2012, bas025 (2012).
Yang, X. et al. Discovery and annotation of small proteins using genomics, proteomics, and computational approaches. Genome Res. 21, 634–641 (2011).
Anderson, D.M. et al. A micropeptide encoded by a putative long noncoding RNA regulates muscle performance. Cell 160, 595–606 (2015).
Galindo, M.I., Pueyo, J.I., Fouix, S., Bishop, S.A. & Couso, J.P. Peptides encoded by short ORFs control development and define a new eukaryotic gene family. PLoS Biol. 5, e106 (2007).
Kondo, T. et al. Small peptide regulators of actin-based cell morphogenesis encoded by a polycistronic mRNA. Nat. Cell Biol. 9, 660–665 (2007).
Lee, C. et al. The mitochondrial-derived peptide MOTS-c promotes metabolic homeostasis and reduces obesity and insulin resistance. Cell Metab. 21, 443–454 (2015).
Magny, E.G. et al. Conserved regulation of cardiac calcium uptake by peptides encoded in small open reading frames. Science 341, 1116–1120 (2013).
White, J.W. & Saunders, G.F. Structure of the human glucagon gene. Nucleic Acids Res. 14, 4719–4730 (1986).
Richard, J.P. et al. Cell-penetrating peptides. A reevaluation of the mechanism of cellular uptake. J. Biol. Chem. 278, 585–590 (2003).
Lodish, H. Molecular Cell Biology (W.H. Freeman, 2008).
Andrews, S.J. & Rothnagel, J.A. Emerging evidence for functional peptides encoded by short open reading frames. Nat. Rev. Genet. 15, 193–204 (2014).
Cheng, H. et al. Small open reading frames: current prediction techniques and future prospect. Curr. Protein Pept. Sci. 12, 503–507 (2011).
Crappé, J. et al. Combining in silico prediction and ribosome profiling in a genome-wide search for novel putatively coding sORFs. BMC Genomics 14, 648 (2013).
Bazzini, A.A. et al. Identification of small ORFs in vertebrates using ribosome footprinting and evolutionary conservation. EMBO J. 33, 981–993 (2014).
Mackowiak, S.D. et al. Extensive identification and analysis of conserved small ORFs in animals. Genome Biol. 16, 179 (2015).
Fritsch, C. et al. Genome-wide search for novel human uORFs and N-terminal protein extensions using ribosomal footprinting. Genome Res. 22, 2208–2218 (2012).
Ingolia, N.T., Ghaemmaghami, S., Newman, J.R. & Weissman, J.S. Genome-wide analysis in vivo of translation with nucleotide resolution using ribosome profiling. Science 324, 218–223 (2009).
Jackson, R.J., Hellen, C.U. & Pestova, T.V. The mechanism of eukaryotic translation initiation and principles of its regulation. Nat. Rev. Mol. Cell Biol. 11, 113–127 (2010).
Jousse, C. et al. Inhibition of CHOP translation by a peptide encoded by an open reading frame localized in the chop 5′ UTR. Nucleic Acids Res. 29, 4341–4351 (2001).
Iacono, M., Mignone, F. & Pesole, G. uAUG and uORFs in human and rodent 5′ untranslated mRNAs. Gene 349, 97–105 (2005).
Calvo, S.E., Pagliarini, D.J. & Mootha, V.K. Upstream open reading frames cause widespread reduction of protein expression and are polymorphic among humans. Proc. Natl. Acad. Sci. USA 106, 7507–7512 (2009).
Ingolia, N.T., Lareau, L.F. & Weissman, J.S. Ribosome profiling of mouse embryonic stem cells reveals the complexity and dynamics of mammalian proteomes. Cell 147, 789–802 (2011).
Morris, D.R. & Geballe, A.P. Upstream open reading frames as regulators of mRNA translation. Mol. Cell. Biol. 20, 8635–8642 (2000).
Szamecz, B. et al. eIF3a cooperates with sequences 5′ of uORF1 to promote resumption of scanning by post-termination ribosomes for reinitiation on GCN4 mRNA. Genes Dev. 22, 2414–2425 (2008).
Parola, A.L. & Kobilka, B.K. The peptide product of a 5′ leader cistron in the beta 2 adrenergic receptor mRNA inhibits receptor synthesis. J. Biol. Chem. 269, 4497–4505 (1994).
Nekrutenko, A., Makova, K.D. & Li, W.H. The KA/KS ratio test for assessing the protein-coding potential of genomic regions: an empirical and simulation study. Genome Res. 12, 198–202 (2002).
Washburn, M.P., Wolters, D. & Yates, J.R. Large-scale analysis of the yeast proteome by multidimensional protein identification technology. Nat. Biotechnol. 19, 242–247 (2001).
Castellana, N.E. et al. Discovery and revision of Arabidopsis genes by proteogenomics. Proc. Natl. Acad. Sci. USA 105, 21034–21038 (2008).
Branca, R.M. et al. HiRIEF LC-MS enables deep proteome coverage and unbiased proteogenomics. Nat. Methods 11, 59–62 (2014).
Stern-Ginossar, N. et al. Decoding human cytomegalovirus. Science 338, 1088–1093 (2012).
Hemm, M.R. et al. Small stress response proteins in Escherichia coli: proteins missed by classical proteomic studies. J. Bacteriol. 192, 46–58 (2010).
Wadler, C.S. & Vanderpool, C.K. A dual function for a bacterial small RNA: SgrS performs base pairing-dependent regulation and encodes a functional polypeptide. Proc. Natl. Acad. Sci. USA 104, 20454–20459 (2007).
Guo, B. et al. Humanin peptide suppresses apoptosis by interfering with Bax activation. Nature 423, 456–461 (2003).
Hashimoto, Y. et al. A rescue factor abolishing neuronal cell death by a wide spectrum of familial Alzheimer's disease genes and Aßa. Proc. Natl. Acad. Sci. USA 98, 6336–6341 (2001).
Lluch-Senar, M. et al. Defining a minimal cell: essentiality of small ORFs and ncRNAs in a genome-reduced bacterium. Mol. Syst. Biol. 11, 780 (2015).
Kondo, T. et al. Small peptides switch the transcriptional activity of Shavenbaby during Drosophila embryogenesis. Science 329, 336–339 (2010).
Zanet, J. et al. Pri sORF peptides induce selective proteasome-mediated protein processing. Science 349, 1356–1358 (2015).
Bal, N.C. et al. Sarcolipin is a newly identified regulator of muscle-based thermogenesis in mammals. Nat. Med. 18, 1575–1579 (2012).
MacLennan, D.H. & Kranias, E.G. Phospholamban: a crucial regulator of cardiac contractility. Nat. Rev. Mol. Cell Biol. 4, 566–577 (2003).
Schmitt, J.P. et al. Dilated cardiomyopathy and heart failure caused by a mutation in phospholamban. Science 299, 1410–1413 (2003).
Odermatt, A. et al. Characterization of the gene encoding human sarcolipin (SLN), a proteolipid associated with SERCA1: absence of structural mutations in five patients with Brody disease. Genomics 45, 541–553 (1997).
Shackelford, D.B. & Shaw, R.J. The LKB1–AMPK pathway: metabolism and growth control in tumour suppression. Nat. Rev. Cancer 9, 563–575 (2009).
Ji, B., Kim, M., Higa, K.K. & Zhou, X. Boymaw, overexpressed in brains with major psychiatric disorders, may encode a small protein to inhibit mitochondrial function and protein translation. Am. J. Med. Genet. B. Neuropsychiatr. Genet. 168B, 284–295 (2015).
Lo, W.S. et al. Human tRNA synthetase catalytic nulls with diverse functions. Science 345, 328–332 (2014).
Lauressergues, D. et al. Primary transcripts of microRNAs encode regulatory peptides. Nature 520, 90–93 (2015).
Agarwal, S. et al. Isolation, characterization, and genetic complementation of a cellular mutant resistant to retroviral infection. Proc. Natl. Acad. Sci. USA 103, 15933–15938 (2006).
Slavoff, S.A., Heo, J., Budnik, B.A., Hanakahi, L.A. & Saghatelian, A. A human short open reading frame (sORF)-encoded polypeptide that stimulates DNA end joining. J. Biol. Chem. 289, 10950–10957 (2014).
Walker, J.R., Corpina, R.A. & Goldberg, J. Structure of the Ku heterodimer bound to DNA and its implications for double-strand break repair. Nature 412, 607–614 (2001).
Pierce, A.J., Hu, P., Han, M., Ellis, N. & Jasin, M. Ku DNA end-binding protein modulates homologous repair of double-strand breaks in mammalian cells. Genes Dev. 15, 3237–3242 (2001).
Lam, S.S. et al. Directed evolution of APEX2 for electron microscopy and proximity labeling. Nat. Methods 12, 51–54 (2015).
Rhee, H.-W. et al. Proteomic mapping of mitochondria in living cells via spatially restricted enzymatic tagging. Science 339, 1328–1331 (2013).
Li, Y.-C. et al. A versatile platform to analyze low-affinity and transient protein-protein interactions in living cells in real time. Cell Rep. 9, 1946–1958 (2014).
Hollander, P.A. et al. Pramlintide as an adjunct to insulin therapy improves long-term glycemic and weight control in patients with type 2 diabetes: a 1-year randomized controlled trial. Diabetes Care 26, 784–790 (2003).
Johnson, L.M. et al. A potent α/β-peptide analogue of GLP-1 with prolonged action in vivo. J. Am. Chem. Soc. 136, 12848–12851 (2014).
Denton, E.V. et al. A β-peptide agonist of the GLP-1 receptor, a class B GPCR. Org. Lett. 15, 5318–5321 (2013).
Bird, G.H. et al. Hydrocarbon double-stapling remedies the proteolytic instability of a lengthy peptide therapeutic. Proc. Natl. Acad. Sci. USA 107, 14093–14098 (2010).
Buse, J.B. et al. Liraglutide once a day versus exenatide twice a day for type 2 diabetes: a 26-week randomised, parallel-group, multinational, open-label trial (LEAD-6). Lancet 374, 39–47 (2009).
Lindgren, M. & Langel, Ü. in Cell-Penetrating Peptides 3–19 (Springer, 2011).
Cronican, J.J. et al. Potent delivery of functional proteins into mammalian cells in vitro and in vivo using a supercharged protein. ACS Chem. Biol. 5, 747–752 (2010).
Cronican, J.J. et al. A class of human proteins that deliver functional proteins into mammalian cells in vitro and in vivo. Chem. Biol. 18, 833–838 (2011).
Joliot, A. & Prochiantz, A. Transduction peptides: from technology to physiology. Nat. Cell Biol. 6, 189–196 (2004).
Pueyo, J.I. & Couso, J.P. The 11-aminoacid long Tarsal-less peptides trigger a cell signal in Drosophila leg development. Dev. Biol. 324, 192–201 (2008).
Arkin, M.R., Tang, Y. & Wells, J.A. Small-molecule inhibitors of protein-protein interactions: progressing toward the reality. Chem. Biol. 21, 1102–1114 (2014).
Acknowledgements
The authors acknowledge support from the US National Institutes of Health (GM102491 to A.S.), The Leona M. and Harry B. Helmsley Charitable Trust (grant no. 2012-PG-MED002 to A.S.) and a Wellcome Trust Senior Fellowship (08756 to J.P.C.).
Author information
Authors and Affiliations
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing financial interests.
Rights and permissions
About this article
Cite this article
Saghatelian, A., Couso, J. Discovery and characterization of smORF-encoded bioactive polypeptides. Nat Chem Biol 11, 909–916 (2015). https://doi.org/10.1038/nchembio.1964
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1038/nchembio.1964
This article is cited by
-
Mitochondria-derived peptide MOTS-c: effects and mechanisms related to stress, metabolism and aging
Journal of Translational Medicine (2023)
-
Regulation of acetate tolerance by small ORF-encoded polypeptides modulating efflux pump specificity in Methylomonas sp. DH-1
Biotechnology for Biofuels and Bioproducts (2023)
-
LINC01013 Is a Determinant of Fibroblast Activation and Encodes a Novel Fibroblast-Activating Micropeptide
Journal of Cardiovascular Translational Research (2023)
-
A Review on Mitochondrial Derived Peptide Humanin and Small Humanin-Like Peptides and Their Therapeutic Strategies
International Journal of Peptide Research and Therapeutics (2023)
-
Mitochondrial DNA variation in Alzheimer’s disease reveals a unique microprotein called SHMOOSE
Molecular Psychiatry (2023)