Analysis
Nature 431, 927-930 (21 October 2004) | doi:10.1038/nature03062; Received 21 February 2004; Accepted 27 September 2004
Shotgun sequence assembly and recent segmental duplications within the human genome
Xinwei She1, Zhaoshi Jiang1, Royden A. Clark2, Ge Liu2, Ze Cheng1, Eray Tuzun1, Deanna M. Church3, Granger Sutton4, Aaron L. Halpern5 and Evan E. Eichler1
Abstract
Complex eukaryotic genomes are now being sequenced at an accelerated pace primarily using whole-genome shotgun (WGS) sequence assembly approaches. WGS assembly was initially criticized because of its perceived inability to resolve repeat structures within genomes. Here, we quantify the effect of WGS sequence assembly on large, highly similar repeats by comparison of the segmental duplication content of two different human genome assemblies. Our analysis shows that large (> 15 kilobases) and highly identical (> 97%) duplications are not adequately resolved by WGS assembly. This leads to significant reduction in genome length and the loss of genes embedded within duplications. Comparable analyses of mouse genome assemblies confirm that strict WGS sequence assembly will oversimplify our understanding of mammalian genome structure and evolution; a hybrid strategy using a targeted clone-by-clone approach to resolve duplications is proposed.
- Department of Genome Sciences, University of Washington School of Medicine, 1705 NE Pacific Street, Seattle, Washington 98195, USA
- Department of Genetics, Case Western Reserve University, Cleveland, Ohio 44106, USA
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, Maryland 20894, USA
- Applied Biosystems, 45 West Gude Drive, and
- The Center for the Advancement of Genomics, 1901 Research Boulevard, Suite 600, Rockville, Maryland 20850, USA
Correspondence to: Evan E. Eichler1 Correspondence and requests for materials should be addressed to E.E.E. (Email: eee@gs.washington.edu).
MORE ARTICLES LIKE THIS
These links to content published by NPG are automatically generated.
NEWS AND VIEWS
The rat comes cleanNature Genetics News and Views (01 May 2004)
Fly meets shotgun: shotgun winsNature Genetics News and Views (01 Apr 2000)
See all 8 matches for News And ViewsRESEARCH
Finishing the euchromatic sequence of the human genomeNature Article (21 Oct 2004)
The structure and evolution of centromeric transition regions within the human genomeNature Article (19 Aug 2004)
See all 22 matches for Research