Abstract
To verify the genome annotation and to create a resource to functionally characterize the proteome, we attempted to Gateway-clone all predicted protein-encoding open reading frames (ORFs), or the 'ORFeome,' of Caenorhabditis elegans. We successfully cloned approximately 12,000 ORFs (ORFeome 1.1), of which roughly 4,000 correspond to genes that are untouched by any cDNA or expressed-sequence tag (EST). More than 50% of predicted genes needed corrections in their intron-exon structures. Notably, approximately 11,000 C. elegans proteins can now be expressed under many conditions and characterized using various high-throughput strategies, including large-scale interactome mapping. We suggest that similar ORFeome projects will be valuable for other organisms, including humans.
Access options
Subscribe to Journal
Get full journal access for 1 year
70,80 €
only 5,90 € per issue
All prices include VAT for France.
Rent or Buy article
Get time limited or full article access on ReadCube.
from$8.99
All prices are NET prices.
References
- 1.
Mardis, E., McPherson, J., Martienssen, R., Wilson, R.K. & McCombie, W.R. What is finished, and why does it matter. Genome Res. 12, 669–671 (2002).
- 2.
Vidal, M. A biological atlas of functional maps. Cell 104, 333–339 (2001).
- 3.
Ideker, T., Galitski, T. & Hood, L. A new approach to decoding life: systems biology. Annu. Rev. Genomics Hum. Genet. 2, 343–372 (2001).
- 4.
Kapranov, P. et al. Large-scale transcriptional activity in human chromosomes 21 and 22. Science 296, 916–919 (2002).
- 5.
The International Human Genome Sequencing Consortium. Initial sequencing and analysis of the human genome. Nature 409, 860–921 (2001).
- 6.
Reboul, J. et al. Open-reading-frame sequence tags (OSTs) support the existence of at least 17,300 genes in C. elegans. Nat. Genet. 27, 332–336 (2001).
- 7.
Blandin, G. et al. Genomic exploration of the hemiascomycetous yeasts: 4. The genome of Saccharomyces cerevisiae revisited. FEBS Lett. 487, 31–36 (2000).
- 8.
Oshiro, G. et al. Parallel identification of new genes in Saccharomyces cerevisiae. Genome Res. 12, 1210–1220 (2002).
- 9.
Zhu, H. et al. Global analysis of protein activities using proteome chips. Science 293, 2101–2105 (2001).
- 10.
MacBeath, G. & Schreiber, S.L. Printing proteins as microarrays for high-throughput function determination. Science 289, 1760–1763 (2000).
- 11.
Ziauddin, J. & Sabatini, D.M. Microarrays of cells expressing defined cDNAs. Nature 411, 107–110 (2001).
- 12.
Gera, J.F., Hazbun, T.R. & Fields, S. Array-based methods for identifying protein–protein and protein–nucleic acid interactions. Methods Enzymol. 350, 499–512 (2002).
- 13.
Mammalian Gene Collection (MGC) Program Team. Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences. Proc. Natl. Acad. Sci. USA 99, 16899–16903 (2002).
- 14.
The FANTOM Consortium and the RIKEN Genome Exploration Research Group Phase I & II Team. Analysis of the mouse transcriptome based on functional annotation of 60,770 full-length cDNAs. Nature 420, 563–573 (2002).
- 15.
Seki, M. et al. Functional annotation of a full-length Arabidopsis cDNA collection. Science 296, 141–145 (2002).
- 16.
Stein, L., Sternberg, P., Durbin, R., Thierry-Mieg, J. & Spieth, J. WormBase: network access to the genome and biology of Caenorhabditis elegans. Nucleic Acids Res. 29, 82–86 (2001).
- 17.
Walhout, A.J. et al. Gateway recombinational cloning: application to the cloning of large numbers of open reading frames or ORFeomes. Methods Enzymol. 328, 575–592 (2000).
- 18.
Walhout, A.J. et al. Protein interaction mapping in C. elegans using proteins involved in vulval development. Science 287, 116–122 (2000).
- 19.
Hartley, J.L., Temple, G.F. & Brasch, M.A. DNA cloning using in vitro site-specific recombination. Genome Res. 10, 1788–1795 (2000).
- 20.
The C. elegans Sequencing Consortium. Genome sequence of the nematode C. elegans: a platform for investigating biology. Science 282, 2012–2018 (1998).
- 21.
Morin, X., Daneman, R., Zavortink, M. & Chia, W. A protein trap strategy to detect GFP-tagged proteins expressed from their endogenous loci in Drosophila. Proc. Natl. Acad. Sci. USA 98, 15050–15055 (2001).
- 22.
Harrison, P.M., Echols, N. & Gerstein, M.B. Digging for dead genes: an analysis of the characteristics of the pseudogene population in the Caenorhabditis elegans genome. Nucleic Acids Res. 29, 818–830 (2001).
- 23.
Vaglio, P. et al. WorfDB: the C. elegans ORFeome Database. Nucleic Acids Res. 31, 237–240 (2003).
- 24.
Hazbun, T.R. & Fields, S. Networking proteins in yeast. Proc. Natl. Acad. Sci. USA 98, 4277–4278 (2001).
- 25.
Davy, A. et al. A protein–protein interaction map of the Caenorhabditis elegans 26S proteasome. EMBO Rep. 2, 821–828 (2001).
- 26.
Boulton, S.J. et al. Combined functional genomic maps of the C. elegans DNA damage response. Science 295, 127–131 (2002).
- 27.
Kinoshita, N., Minshull, J. & Kirschner, M.W. The identification of two novel ligands of the FGF receptor by a yeast screening method and their activity in Xenopus development. Cell 83, 621–630 (1995).
- 28.
Braun, P. et al. Proteome-scale purification of human proteins from bacteria. Proc. Natl. Acad. Sci. USA 99, 2654–2659 (2002).
- 29.
Hammarstrom, M., Hellgren, N., van Den Berg, S., Berglund, H. & Hard, T. Rapid screening for improved solubility of small human proteins produced as fusion proteins in Escherichia coli. Protein Sci. 11, 313–321 (2002).
- 30.
Hillier, L. & Green, P. OSP: a computer program for choosing PCR and DNA sequencing primers. PCR Methods Appl. 1, 124–128 (1991).
Acknowledgements
We thank the C. elegans Sequencing Consortium for the genome sequence; the participants of the annual ORFeome meeting for their input and numerous suggestions; the members of M.V.'s laboratory for their input and help; C. McCowan for administrative assistance; B. Sobhian, A.-S. Nicot, N. Tzellas and the GenomeVision Service sequencing staff at Genome Therapeutics for technical assistance; and P. Braun for the protein expression plasmids. This work was supported by grants from the National Cancer Institute, the National Human Genome Research Institute, the National Institute of General Medical Sciences and the Merck Genome Research Institute awarded to M.V.
Author information
Author notes
- Jérôme Reboul
- , Troy Moore
- , James R. Hudson Jr.
- , James L. Hartley
- , Michael A. Brasch
- , Simon Boulton
- & Lynn Doucette-Stamm
Present addresses: INSERM Unité 119, Institut Paoli Calmette, 13009 Marseille, France (J.R.); Open Biosystems, Huntsville, Alabama 35806, USA (T.M.); Cityscapes, Huntsville, Alabama 35801, USA (J.R.H.); SAIC/National Cancer Institute, Frederick, Maryland 21702, USA (J.L.H.); Atto Bioscience, Rockville, Maryland 20850, USA (M.A.B.); Cancer Research UK, Clare Hall, Herts EN6 3LD, UK (S.B.); Agencourt Biosciences Corporation, Beverly, Massachusetts 01915, USA (L.D.).
- Jérôme Reboul
- , Philippe Vaglio
- , Jean-François Rual
- & Philippe Lamesch
These authors contributed equally to this work.
Affiliations
Dana-Farber Cancer Institute and Department of Genetics, Harvard Medical School, Boston, Massachusetts 02115, USA.
- Jérôme Reboul
- , Philippe Vaglio
- , Jean-François Rual
- , Philippe Lamesch
- , Monica Martinez
- , Christopher M. Armstrong
- , Siming Li
- , Laurent Jacotot
- , Nicolas Bertin
- , Rekin's Janky
- , Simon Boulton
- , David E. Hill
- & Marc Vidal
Unité de Recherche en Biologie Moléculaire, Facultés Universitaires Notre-Dame de la Paix, Namur, 5000, Belgium.
- Jean-François Rual
- , Philippe Lamesch
- & Jean Vandenhaute
Research Genetics /Invitrogen, Huntsville, Alabama, USA.
- Troy Moore
- & James R. Hudson Jr.
Life Technologies /Invitrogen, Rockville, Maryland, USA.
- James L. Hartley
- & Michael A. Brasch
Protedyne Corporation, Windsor, Connecticut 06095, USA.
- Gregory A. Endress
Department of Surgery, McGill University, Montreal, Canada.
- Sarah Jenna
- & Eric Chevet
Center for Applied Genomics, Public Health Research Institute, Newark, New Jersey 07103, USA.
- Vasilis Papasotiropoulos
- & Peter P. Tolias
Yale University, New Haven, Connecticut 06520, USA.
- Jason Ptacek
- & Mike Snyder
Center for Synchrotron Biosciences and Department of Physiology & Biophysics, Albert Einstein College of Medicine, Bronx, New York 10461, USA.
- Raymond Huang
- & Mark R. Chance
Genome Therapeutics, Waltham, Massachusetts 02453, USA.
- Hongmei Lee
- & Lynn Doucette-Stamm
Authors
Search for Jérôme Reboul in:
Search for Philippe Vaglio in:
Search for Jean-François Rual in:
Search for Philippe Lamesch in:
Search for Monica Martinez in:
Search for Christopher M. Armstrong in:
Search for Siming Li in:
Search for Laurent Jacotot in:
Search for Nicolas Bertin in:
Search for Rekin's Janky in:
Search for Troy Moore in:
Search for James R. Hudson in:
Search for James L. Hartley in:
Search for Michael A. Brasch in:
Search for Jean Vandenhaute in:
Search for Simon Boulton in:
Search for Gregory A. Endress in:
Search for Sarah Jenna in:
Search for Eric Chevet in:
Search for Vasilis Papasotiropoulos in:
Search for Peter P. Tolias in:
Search for Jason Ptacek in:
Search for Mike Snyder in:
Search for Raymond Huang in:
Search for Mark R. Chance in:
Search for Hongmei Lee in:
Search for Lynn Doucette-Stamm in:
Search for David E. Hill in:
Search for Marc Vidal in:
Competing interests
T.M. has financial interests in Open Biosystems, one of the companies responsible for the distribution of the C. elegans ORFeome version 1.1.
Corresponding author
Correspondence to Marc Vidal.
Supplementary information
PDF files
Rights and permissions
To obtain permission to re-use content from this article visit RightsLink.
About this article
Further reading
-
1.
Applied Biochemistry and Biotechnology (2016)
-
2.
BMC Evolutionary Biology (2016)
-
3.
BMC Genomics (2015)
-
4.
BMC Genomics (2015)
-
5.
SH3 interactome conserves general function over specific form
Molecular Systems Biology (2014)