the mouse genome

Nature 420, 563-573 (5 December 2002) | doi:10.1038/nature01266; Received 19 September 2002; Accepted 28 October 2002

Analysis of the mouse transcriptome based on functional annotation of 60,770 full-length cDNAs

The FANTOM Consortium and the RIKEN Genome Exploration Research Group Phase I & II Team*Y. Okazaki1,2, M. Furuno1, T. Kasukawa1,3, J. Adachi1, H. Bono1, S. Kondo1, I. Nikaido1,4, N. Osato1, R. Saito1,5, H. Suzuki1, I. Yamanaka1, H. Kiyosawa1,4, K. Yagi1, Y. Tomaru1,6, Y. Hasegawa1,4, A. Nogami1,4, C. Schönbach7, T. Gojobori8, R. Baldarelli9, D. P. Hill9, C. Bult9, D. A. Hume10, J. Quackenbush11, L. M. Schriml12, A. Kanapin13, H. Matsuda14, S. Batalov15, K. W. Beisel16, J. A. Blake9, D. Bradt9, V. Brusic17, C. Chothia18, L. E. Corbani9, S. Cousins9, E. Dalla19, T. A. Dragani20, C. F. Fletcher15,21, A. Forrest10, K. S. Frazer9,22, T. Gaasterland23, M. Gariboldi20, C. Gissi24, A. Godzik25, J. Gough18, S. Grimmond10, S. Gustincich26, N. Hirokawa27, I. J. Jackson28, E. D. Jarvis29, A. Kanai5, H. Kawaji3,14, Y. Kawasawa30, R. M. Kedzierski30, B. L. King9, A. Konagaya7, I. V. Kurochkin7, Y. Lee11, B. Lenhard31, P. A. Lyons32, D. R. Maglott12, L. Maltais9, L. Marchionni19, L. McKenzie9, H. Miki27, T. Nagashima7, K. Numata5, T. Okido8, W. J. Pavan33, G. Pertea11, G. Pesole24, N. Petrovsky34, R. Pillai17, J. U. Pontius12, D. Qi9, S. Ramachandran9, T. Ravasi10, J. C. Reed25, D. J. Reed9, J. Reid19, B. Z. Ring35, M. Ringwald9, A. Sandelin31, C. Schneider19, C. A. M. Semple28, M. Setou27, K. Shimada36,37, R. Sultana11, Y. Takenaka14, M. S. Taylor28, R. D. Teasdale10, M. Tomita5, R. Verardo19, L. Wagner12, C. Wahlestedt31, Y. Wang11, Y. Watanabe36,37, C. Wells10, L. G. Wilming38, A. Wynshaw-Boris39, M. Yanagisawa30, I. Yang11, L. Yang9, Z. Yuan10, M. Zavolan23, Y. Zhu9 & A. Zimmer40 for FANTOM Consortium:, P. Carninci2, N. Hayatsu1, T. Hirozane-Kishikawa1, H. Konno1, M. Nakamura1, N. Sakazume1, K. Sato6, T. Shiraki1 & K. Waki1 for RIKEN Genome Exploration Research Group Phase I Team:, J. Kawai1,2, K. Aizawa1, T. Arakawa1, S. Fukuda1, A. Hara1, W. Hashizume1, K. Imotani1, Y. Ishii1, M. Itoh2, I. Kagawa1, A. Miyazaki1, K. Sakai1, D. Sasaki1, K. Shibata2, A. Shinagawa1, A. Yasunishi1 & M. Yoshino1 for RIKEN Genome Exploration Research Group Phase II Team:, R. Waterston41, E. S. Lander42, J. Rogers38 & E. Birney13 for Mouse Genome Sequencing Consortium: & Y. Hayashizaki1,2,4,6 for Scientific management:

Top

Only a small proportion of the mouse genome is transcribed into mature messenger RNA transcripts. There is an international collaborative effort to identify all full-length mRNA transcripts from the mouse, and to ensure that each is represented in a physical collection of clones. Here we report the manual annotation of 60,770 full-length mouse complementary DNA sequences. These are clustered into 33,409 'transcriptional units', contributing 90.1% of a newly established mouse transcriptome database. Of these transcriptional units, 4,258 are new protein-coding and 11,665 are new non-coding messages, indicating that non-coding RNA is a major component of the transcriptome. 41% of all transcriptional units showed evidence of alternative splicing. In protein-coding transcripts, 79% of splice variations altered the protein product. Whole-transcriptome analyses resulted in the identification of 2,431 sense–antisense pairs. The present work, completely supported by physical clones, provides the most comprehensive survey of a mammalian transcriptome so far, and is a valuable resource for functional genomics.

  1. Laboratory for Genome Exploration Research Group, RIKEN Genomic Sciences Center (GSC), RIKEN Yokohama Institute 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa, 230-0045, Japan;
  2. Genome Science Laboratory, Discovery and Research Institute, RIKEN Wako Main Campus, 2-1 Hirosawa, Wako, Saitama, 351-0198,Japan;
  3. NTT Software Corporation, 223-1 Yamashita-cho, Naka-ku, Yokohama, Kanagawa, 231-8554, Japan;
  4. Division of Genomic Information Resources, Science of Biological Supramolecular Systems, Graduate School of Integrated Science, Yokohama City University, 1-7-29 Suehiro-Cho, Tsurumi-Ku, Yokohama, 230-0045, Japan;
  5. Institute for Advanced Biosciences, Keio Univ, 403-1 Tsuruoka-city, Yamagata, 997-0017, Japan;
  6. Institute of Basic Medical Sciences, University of Tsukuba, 1-1-1 Tennoudai, Tsukuba, Ibaraki, 305-8577, Japan;
  7. Biomedical Knowledge Discovery Team, Bioinformatics Group, RIKEN Genomic Sciences Center (GSC), RIKEN Yokohama Institute, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa, 230-0045, Japan;
  8. Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, 1111 Yata, Mishima, Shizuoka 411-8540, Japan;
  9. Mouse Genome Informatics Group, The Jackson Laboratory, 600 Main Street, Bar Harbor, Maine 04609, USA;
  10. Institute for Molecule Bioscience and ARC Special Research Centre for Functional and Applied Genomics, University of Queensland, Queensland 4072, Australia;
  11. The Institute for Genomic Research (TIGR), 9712 Medical Center Drive, Rockville, Maryland 20850, USA;
  12. National Center for Biotechnology Information, NIH, Bldg 38A, 8600 Rockville Pike, Bethesda, Maryland 20894, USA;
  13. European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK;
  14. Graduate School of Information Science and Technology, Osaka University, 1-3 Machikaneyama, Toyonaka, Osaka, 560-8531, Japan;
  15. Genomics Institute of the Novartis Research Foundation (GNF), 10675 John Jay Hopkins Drive, San Diego, California 92121, USA;
  16. Boys Town National Research Hospital, 555 North 30th Street, Omaha, Nebraska 68131, USA;
  17. Laboratories for Information Technology, 21, Heng Mui Keng Terrace, Singapore, 119613, Singapore;
  18. Structural Studies, MRC Laboratory of Molecular Biology, Hills Road, Cambridge CB2 2QH, UK;
  19. LNCIB, Functional Genomics, AREA Science Park, Padriciano, 99 Trieste, 34012, Italy;
  20. Istituto Tumori Milano, Milano, 20133, Italy;
  21. The Scripps Research Institute, 10550N. Torrey Pines Road, La Jolla, California 92037, USA;
  22. The Zebrafish International Resource Center, University of Oregon, Eugene, Oregon 97403-5274, USA;
  23. Laboratory of Computational Genomics, The Rockefeller University, 1230 York Avenue, New York, New York 10021-6399, USA;
  24. Universita' di Milano, Milano, 20133, Italy;
  25. The Burnham Institute, 10901N. Torrey Pines Road, La Jolla, California 92037, USA;
  26. Department of Neurobiology, Harvard Medical School, 220 Longwood Avenue, Boston, Massachusetts 02115, USA;
  27. Graduate School of Medicine, University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo, 113-0033, Japan;
  28. MRC Human Genetics Unit, Crewe Road, Edinburgh, UK;
  29. Duke University Medical Center, Department of Neurobiology, Box 3209 Durham, North Carolina, 27710, USA;
  30. Howard Hughes Medical Institute, Department of Molecular Genetics, University of Texas Southwestern Medical Center at Dallas, 5323 Harry Hines Boulevard, Dallas, Texas 75390-9050, USA;
  31. Center for Genomics and Bioinformatics, Karolinska Institutet, Berzelius vag 35, 17177 Stockholm, Sweden;
  32. JDRF/WT Diabetes and Inflammation Laboratory, Cambridge Institute for Medical Research, Addenbrookes Hospital Hills Road, Cambridge CB2 2XY, UK;
  33. National Human Genome Research Institute, National Institutes of Health, 49/4A82 49 Convent Drive, MSC4472, Bethesda, Maryland 20892-4472, USA;
  34. Autoimmunity Research Unit, The Canberra Hospital, Yamba Drive, Woden, ACT 2606, Australia;
  35. Applied Genomics, Inc. 525 Del Rey Avenue, Sunnyvale, California 94085, USA;
  36. Hirakata Ryoikuen, 2-1-1 Tsudahigashi, Hirakata, Osaka, 565-0874, Japan;
  37. Institute for Advanced Medical Sciences, Hyogo College of Medicine, Mukogawa 1-1, Nishinomiya, Hyogo, 663-8501, Japan;
  38. Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridgeshire CB10 1SA, UK;
  39. University of California, San Diego School of Medicine, Pediatrics/Medicine 9500 Gilman Drive, La Jolla, California 92093-0627 USA;
  40. Department of Psychiatry, University of Bonn, Sigmund-Freud-Strasse 25, Bonn, 53105, Germany;
  41. Genome Sequencing Center, Washington University School of Medicine, Campus Box 8501, 4444 Forest Park Avenue, St Louis, Missouri 63108, USA;
  42. Whitehead Institute/MIT Center for Genome Research, 320 Charles Street, Cambridge, Massachusetts 02141, USA
  43. Correspondence and requests for materials should be addressed to Y.H. (e-mail: Email: yosihide@gsc.riken.go.jp). Details of methods, and accession numbers for all 60,770 cDNA clones, are available as Supplementary Information. Clone availability information regarding access to the FANTOM2 clones is available at http://www.riken.go.jp/.
    Authors' contributions: Y. Okazaki, M. Furuno, T. Kasukawa, C. Schönbach, R. Baldarelli, D. P. Hill, C. Bult, D. A. Hume, J. Quackenbush, L. M. Schriml, A. Kanapin and Y. Hayashizaki are core authorship members; P. Carninci and J. Kawai are team organizers.

Extra navigation

.

Open Innovation Challenges

naturejobs

natureproducts


ADVERTISEMENT