the mouse genome
Nature 420, 520-562 (5 December 2002) | doi:10.1038/nature01262; Received 18 September 2002; Accepted 31 October 2002
Initial sequencing and comparative analysis of the mouse genome
Mouse Genome Sequencing ConsortiumAsif T. Chinwalla1,47, Lisa L. Cook1, Kimberly D. Delehaunty1, Ginger A. Fewell1, Lucinda A. Fulton1,47, Robert S. Fulton1, Tina A. Graves1, LaDeana W. Hillier1,47, Elaine R. Mardis1, John D. McPherson1, Tracie L. Miner1, William E. Nash1, Joanne O. Nelson1, Michael N. Nhan1, Kymberlie H. Pepin1, Craig S. Pohl1, Tracy C. Ponce1, Brian Schultz1, Johanna Thompson1, Evanne Trevaskis1, Robert H. Waterston1,47, Michael C. Wendl1, Richard K. Wilson1 & Shiaw-Pyng Yang1,47 for Genome Sequencing Center:, Peter An2, Eric Berry2,47, Bruce Birren2, Toby Bloom2, Daniel G. Brown2,15,47, Jonathan Butler2,47, Mark Daly2,47, Robert David2, Justin Deri2, Sheila Dodge2, Karen Foley2, Diane Gage2, Sante Gnerre2,47, Timothy Holzer2, David B. Jaffe2,47, Michael Kamal2,47, Elinor K. Karlsson2,47, Cristyn Kells2, Andrew Kirby2,47, Edward J. Kulbokas, III2,47, Eric S. Lander2,46,47, Tom Landers2, J. P. Leger2, Rosie Levine2, Kerstin Lindblad-Toh2,47, Evan Mauceli2,47, John H. Mayer2, Megan McCarthy2, Jim Meldrim2, Jim Meldrim2, Jill P. Mesirov2,47, Robert Nicol2, Chad Nusbaum2, Steven Seaman2, Ted Sharpe2, Andrew Sheridan2, Jonathan B. Singer2,47, Ralph Santos2, Brian Spencer2, Nicole Stange-Thomann2, Jade P. Vinson2,47, Claire M. Wade2,47, Jamey Wierzbowski2, Dudley Wyman2 & Michael C. Zody2,47 for Whitehead Institute/MIT Center for Genome Research:, Ewan Birney3,47, Nick Goldman3,47, Arkadiusz Kasprzyk3,47, Emmanuel Mongin3, Alistair G. Rust3, Guy Slater3,47, Arne Stabenau3,47, Abel Ureta-Vidal3 & Simon Whelan3,47 for European Bioinformatics Institute:, Rachel Ainscough4, John Attwood4, Jonathon Bailey4, Karen Barlow4, Stephan Beck4, John Burton4, Michele Clamp4,47, Christopher Clee4, Alan Coulson4, James Cuff4,47, Val Curwen4,47, Tim Cutts4,47, Joy Davies4, Eduardo Eyras4,47, Darren Grafham4, Simon Gregory4,47, Tim Hubbard4,47, Adrienne Hunt4, Matthew Jones4, Ann Joy4, Steven Leonard4, Christine Lloyd4, Lucy Matthews4, Stuart McLaren4, Kirsten McLay4, Beverley Meredith4, James C. Mullikin4,47, Zemin Ning4,47, Karen Oliver4, Emma Overton-Larty4, Robert Plumb4, Simon Potter4,47, Michael Quail4, Jane Rogers4, Carol Scott4, Steve Searle4,47, Ratna Shownkeen4, Sarah Sims4, Melanie Wall4, Anthony P. West4, David Willey4 & Sophie Williams4 for Wellcome Trust Sanger Institute, Josep F. Abril5,47, Roderic Guigó5,47 & Genís Parra5,47 for Research Group in Biomedical Informatics, Pankaj Agarwal6,47 for Bioinformatics, Richa Agarwala7, Deanna M. Church7,47, Wratko Hlavina7,47, Donna R. Maglott7,47 & Victor Sapojnikov7,47 for National Center for Biotechnology Information, Marina Alexandersson8,47 & Lior Pachter8,47 for Department of Mathematics, Stylianos E. Antonarakis9,47, Emmanouil T. Dermitzakis9,47, Alexandre Reymond9,47 & Catherine Ucla9,47 for Division of Medical Genetics, Robert Baertsch10,47, Mark Diekhans10,47, Terrence S. Furey10,47, Angela Hinrichs10,47, Fan Hsu10,47, Donna Karolchik10,47, W. James Kent10,47, Krishna M. Roskin10,47, Matthias S. Schwartz10,47, Charles Sugnet10,47 & Ryan J. Weber10,47 for Center for Biomolecular Science and Engineering, Peer Bork11,47, Ivica Letunic11,47, Mikita Suyama11,47, David Torrents11,47 & Evgeny M. Zdobnov11,47 for EMBL, Marc Botcherby12, Stephen D. Brown12, Robert D. Campbell12 & Ian Jackson12 for UK MRC Mouse Sequencing Consortium, Nicolas Bray13,47, Olivier Couronne13,47, Inna Dubchak13,47, Alex Poliakov13,47 & Edward M. Rubin13 for Lawrence Berkeley National Laboratory, Michael R. Brent14,47, Paul Flicek14,47, Evan Keibler14,47 & Ian Korf14,47 for Department of Computer Science, S. Batalov15 for School of Computer Science, Carol Bult16,47 & Wayne N. Frankel16,47 for The Jackson Laboratory, Piero Carninci17, Yoshihide Hayashizaki17, Jun Kawai17 & Yasushi Okazaki17 for Laboratory for Genome Exploration, Simon Cawley18,47, David Kulp18,47 & Raymond Wheeler18,47 for Affymetrix Inc., Francesca Chiaromonte19,47 for Departments of Statistics and Health Evaluation Sciences, Francis S. Collins20,47, Adam Felsenfeld20,47, Mark Guyer20, Jane Peterson20 & Kris Wetterstrand20 for National Human Genome Research Institute, Richard R. Copley21,47 & Richard Mott21,47 for Wellcome Trust Centre for Human Genetics, Colin Dewey22,47 for Department of Electrical Engineering, Nicholas J. Dickens23,47, Richard D. Emes23,47, Leo Goodstadt23,47, Chris P. Ponting23,47 & Eitan Winter23,47 for Department of Human Anatomy and Genetics, Diane M. Dunn24, Andrew C. von Niederhausern24 & Robert B. Weiss24 for Department of Human Genetics, Sean R. Eddy25,47, L. Steven Johnson25 & Thomas A. Jones25 for Howard Hughes Medical Institute and Department of Genetics, Laura Elnitski26,47 & Diana L. Kolbe26,47 for Departments of Biochemistry and Molecular Biology and Computer Science and Engineering, Pallavi Eswara27,47, Webb Miller27,47, Michael J. O'Connor27 & Scott Schwartz27,47 for Department of Computer Science and Engineering, Richard A. Gibbs28 & Donna M. Muzny28 for Baylor College of Medicine, Gustavo Glusman29,47 & Arian Smit29,47 for The Institute for Systems Biology, Eric D. Green30,47 for National Human Genome Research Institute, Ross C. Hardison31,47 & Shan Yang31 for Department of Biochemistry and Molecular Biology, David Haussler32,47 for Howard Hughes Medical Institute, Axin Hua33 & Bruce A. Roe33 for Department of Chemistry and Biochemistry, Raju S. Kucherlapati34 & Kate T. Montgomery34 for Departments of Genetics and Medicine and Harvard-Partners Center for Genetics and Genomics, Jia Li35,47 for Department of Statistics, Ming Li36,47 for Department of Computer Science, Susan Lucas37 for US DOE Joint Genome Institute, Bin Ma38,47 for Department of Computer Science, W. Richard McCombie39 for Cold Spring Harbor Laboratory, Michael Morgan40 for Wellcome Trust, Pavel Pevzner41,47 & Glenn Tesler41,47 for Department of Computer Science and Engineering, Jörg Schultz42,47 for Max Planck Institute for Molecular Genetics, Douglas R. Smith43 for Genome Therapeutics Corporation, John Tromp44,47 for Bioinformatics Solutions Inc., Kim C. Worley45,47 for Department of Molecular and Human Genetics, Eric S. Lander2,46,47 for Department of Biology & Josep F. Abril5,47, Pankaj Agarwal6,47, Marina Alexandersson8,47, Stylianos E. Antonarakis9,47, Robert Baertsch10,47, Eric Berry2,47, Ewan Birney3,47, Peer Bork11,47, Nicolas Bray13,47, Michael R. Brent14,47, Daniel G. Brown2,15,47, Jonathan Butler2,47, Carol Bult16,47, Francesca Chiaromonte19,47, Asif T. Chinwalla1,47, Deanna M. Church7,47, Michele Clamp4,47, Francis S. Collins20,47, Richard R. Copley21,47, Olivier Couronne13,47, Simon Cawley18,47, James Cuff4,47, Val Curwen4,47, Tim Cutts4,47, Mark Daly2,47, Emmanouil T. Dermitzakis9,47, Colin Dewey22,47, Nicholas J. Dickens23,47, Mark Diekhans10,47, Inna Dubchak13,47, Sean R. Eddy25,47, Laura Elnitski26,47, Richard D. Emes23,47, Pallavi Eswara27,47, Eduardo Eyras4,47, Adam Felsenfeld20,47, Paul Flicek14,47, Wayne N. Frankel16,47, Lucinda A. Fulton1,47, Terrence S. Furey10,47, Sante Gnerre2,47, Gustavo Glusman29,47, Nick Goldman3,47, Leo Goodstadt23,47, Eric D. Green30,47, Simon Gregory4,47, Roderic Guigó5,47, Ross C. Hardison31,47, David Haussler32,47, LaDeana W. Hillier1,47, Angela Hinrichs10,47, Wratko Hlavina7,47, Fan Hsu10,47, Tim Hubbard4,47, David B. Jaffe2,47, Michael Kamal2,47, Donna Karolchik10,47, Elinor K. Karlsson2,47, Arkadiusz Kasprzyk3,47, Evan Keibler14,47, W. James Kent10,47, Andrew Kirby2,47, Diana L. Kolbe26,47, Ian Korf14,47, Edward J. Kulbokas, III2,47, David Kulp18,47, Eric S. Lander2,46,47, Ivica Letunic11,47, Ming Li36,47, Kerstin Lindblad-Toh2,47, Bin Ma38,47, Donna R. Maglott7,47, Evan Mauceli2,47, Jill P. Mesirov2,47, Webb Miller27,47, Richard Mott21,47, James C. Mullikin4,47, Zemin Ning4,47, Lior Pachter8,47, Genís Parra5,47, Pavel Pevzner41,47, Alex Poliakov13,47, Chris P. Ponting23,47, Simon Potter4,47, Alexandre Reymond9,47, Krishna M. Roskin10,47, Victor Sapojnikov7,47, Jörg Schultz42,47, Matthias S. Schwartz10,47, Scott Schwartz27,47, Steve Searle4,47, Jonathan B. Singer2,47, Guy Slater3,47, Arian Smit29,47, Arne Stabenau3,47, Charles Sugnet10,47, Mikita Suyama11,47, Glenn Tesler41,47, David Torrents11,47, John Tromp44,47, Catherine Ucla9,47, Jade P. Vinson2,47, Claire M. Wade2,47, Ryan J. Weber10,47, Raymond Wheeler18,47, Eitan Winter23,47, Shiaw-Pyng Yang1,47, Evgeny M. Zdobnov11,47, Robert H. Waterston1,47, Simon Whelan3,47, Kim C. Worley45,47 & Michael C. Zody2,47 for Members of the Mouse Genome Analysis Group
Abstract
The sequence of the mouse genome is a key informational tool for understanding the contents of the human genome and a key experimental tool for biomedical research. Here, we report the results of an international collaboration to produce a high-quality draft sequence of the mouse genome. We also present an initial comparative analysis of the mouse and human genomes, describing some of the insights that can be gleaned from the two sequences. We discuss topics including the analysis of the evolutionary forces shaping the size, structure and sequence of the genomes; the conservation of large-scale synteny across most of the genomes; the much lower extent of sequence orthology covering less than half of the genomes; the proportions of the genomes under selection; the number of protein-coding genes; the expansion of gene families related to reproduction and immunity; the evolution of proteins; and the identification of intraspecies polymorphism.
- Genome Sequencing Center, Washington University School of Medicine, Campus Box 8501, 4444 Forest Park Avenue, St Louis, Missouri 63108, USA;
- Whitehead Institute/MIT Center for Genome Research, 320 Charles Street, Cambridge, Massachusetts 02141, USA;
- European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK;
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK;
- Research Group in Biomedical Informatics, Institut Municipal d'Investigacio, Medica/Universitat Pompeu Fabra, Centre de Regulacio Genomica, Barcelona, Catalonia, Spain;
- Bioinformatics, GlaxoSmithKline, UW2230, 709 Swedeland Road, King of Prussia, Pennsylvania 19406, USA;
- National Center for Biotechnology Information, National Institutes of Health, Bethesda, Maryland 20892, USA;
- Department of Mathematics, University of California at Berkeley, 970 Evans Hall, Berkeley, California 94720, USA;
- Division of Medical Genetics, University of Geneva Medical School, 1 rue Michel-Servet, CH-1211 Geneva, Switzerland;
- Center for Biomolecular Science and Engineering, University of California, Santa Cruz, California 95064, USA;
- EMBL, Meyerhofstrasse 1, Heidelberg 69117, Germany;
- UK MRC Mouse Sequencing Consortium, MRC Mammalian Genetics Unit, Harwell OX11 0RD, UK;
- Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Mailstop 84-171, Berkeley, California 94720, USA;
- Department of Computer Science, Washington University, Box 1045, St Louis, Missouri 63130, USA;
- School of Computer Science, University of Waterloo, Waterloo, Ontario N2L 3G1, Canada;
- The Jackson Laboratory, 600 Main Street, Bar Harbor, Maine 04609, USA;
- Laboratory for Genome Exploration, RIKEN Genomic Sciences Center, Yokohama Institute, 1-7-22 Suchiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan;
- Affymetrix Inc., Emeryville, California 94608, USA;
- Departments of Statistics and Health Evaluation Sciences, The Pennsylvania State University, University Park, Pennsylvania 16802, USA;
- National Human Genome Research Institute, National Institutes of Health, 31 Center Drive, Room 4B09, Bethesda, Maryland 20892, USA;
- Wellcome Trust Centre for Human Genetics, University of Oxford, Roosevelt Drive, Oxford OX3 7BN, UK;
- Department of Electrical Engineering, University of California, Berkeley, 231 Cory Hall, Berkeley, California 94720, USA;
- Department of Human Anatomy and Genetics, MRC Functional Genetics Unit, University of Oxford, South Parks Road, Oxford OX1 3QX, UK;
- Department of Human Genetics, University of Utah, Salt Lake City, Utah 84112, USA;
- Howard Hughes Medical Institute and Department of Genetics, Washington University School of Medicine, St Louis, Missouri 63110, USA;
- Departments of Biochemistry and Molecular Biology and Computer Science and Engineering, The Pennsylvania State University, University Park, Pennsylvania 16802, USA;
- Department of Computer Science and Engineering, The Pennsylvania State University, University Park, Pennsylvania 16802, USA;
- Baylor College of Medicine, Human Genome Sequencing Center, One Baylor Plaza, MSC-226, Houston, Texas 77030, USA;
- The Institute for Systems Biology, 1441 North 34th Street, Seattle, Washington 98103, USA;
- National Human Genome Research Institute, National Institutes of Health, 50 South Drive, Building 50, Room 5523, Bethesda, Maryland 20892, USA;
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA;
- Howard Hughes Medical Institute, University of California, Santa Cruz, California 95064, USA;
- Department of Chemistry and Biochemistry, University of Oklahoma Advanced Center for Genome Technology, University of Oklahoma, 620 Parrington Oval, Room 311, Norman, Oklahoma 73019, USA;
- Departments of Genetics and Medicine and Harvard-Partners Center for Genetics and Genomics, Harvard Medical School, Boston, Massachusetts 02115, USA;
- Department of Statistics, The Pennsylvania State University, University Park, Pennsylvania 16802, USA;
- Department of Computer Science, University of California, Santa Barbara, California 93106, USA;
- US DOE Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, California 94598, USA;
- Department of Computer Science, University of Western Ontario, London, Ontario N6A 5B7, Canada;
- Cold Spring Harbor Laboratory, PO Box 100, 1 Bungtown Road, Cold Spring Harbor, New York 11724, USA;
- Wellcome Trust, 183 Euston Road, London NW1 2BE, UK;
- Department of Computer Science and Engineering, University of California, San Diego, 9500 Gilman Drive, La Jolla, California 92093-0114, USA;
- Max Planck Institute for Molecular Genetics, Ihnestrasse 73, 14195 Berlin, Germany;
- Genome Therapeutics Corporation, 100 Beaver Street, Waltham, Massachusetts 02453, USA;
- Bioinformatics Solutions Inc., 145 Columbia Street W, Waterloo, Ontario N2L 3L2, Canada;
- Department of Molecular and Human Genetics, Baylor College of Medicine, Mailstop BCM226, Room 1419.01, One Baylor Plaza, Houston, Texas 77030, USA;
- Department of Biology, Massachusetts Institute of Technology, Cambridge, Massachusetts 02138, USA
- Members of the Mouse Genome Analysis Group
Correspondence to:
Correspondence and requests for materials should be addressed to R.H.W. (e-mail: Email: waterston@gs.washington.edu), K.L.T. (e-mail: Email: kersli@genome.wi.mit.edu) or E.S.L. (e-mail: Email: lander@genome.wi.mit.edu).
Authors' contributions: The following authors contributed to project leadership: R. H. Waterston, K. Lindblad-Toh, E. Birney, J. Rogers, M. R. Brent, F. S. Collins, R. Guigó, R. C. Hardison, D. Haussler, D. B. Jaffe, W. J. Kent, W. Miller, C. P. Ponting, A. Smit, M. C. Zody and E. S. Lander.

