Analysis of the DNA sequence and duplication history of human chromosome 15

Zody, Michael C.; Garber, Manuel; Sharpe, Ted; Young, Sarah K.; Rowen, Lee; O'Neill, Keith; Whittaker, Charles A.; Kamal, Michael; Chang, Jean L.; Cuomo, Christina A.; Dewar, Ken; FitzGerald, Michael G.; Kodira, Chinnappa D.; Madan, Anup; Qin, Shizhen; Yang, Xiaoping; Abbasi, Nissa; Abouelleil, Amr; Arachchi, Harindra M.; Baradarani, Lida; Birditt, Brian; Bloom, Scott; Bloom, Toby; Borowsky, Mark L.; Burke, Jeremy; Butler, Jonathan; Cook, April; DeArellano, Kurt; DeCaprio, David; Dorris, Lester; Dors, Monica; Eichler, Evan E.; Engels, Reinhard; Fahey, Jessica; Fleetwood, Peter; Friedman, Cynthia; Gearin, Gary; Hall, Jennifer L.; Hensley, Grace; Johnson, Ericka; Jones, Charlien; Kamat, Asha; Kaur, Amardeep; Locke, Devin P.; Madan, Anuradha; Munson, Glen; Jaffe, David B.; Lui, Annie; Macdonald, Pendexter; Mauceli, Evan; Naylor, Jerome W.; Nesbitt, Ryan; Nicol, Robert; O'Leary, Sinéad B.; Ratcliffe, Amber; Rounsley, Steven; She, Xinwei; Sneddon, Katherine M. B.; Stewart, Sandra; Sougnez, Carrie; Stone, Sabrina M.; Topham, Kerri; Vincent, Dascena; Wang, Shunguang; Zimmer, Andrew R.; Birren, Bruce W.; Hood, Leroy; Lander, Eric S.; Nusbaum, Chad

doi:10.1038/nature04601

Letter
Published: 30 March 2006

Analysis of the DNA sequence and duplication history of human chromosome 15

Michael C. Zody¹,
Manuel Garber¹,
Ted Sharpe¹,
Sarah K. Young¹,
Lee Rowen²,
Keith O'Neill¹,
Charles A. Whittaker¹^nAff6,
Michael Kamal¹,
Jean L. Chang¹,
Christina A. Cuomo¹,
Ken Dewar¹^nAff7,
Michael G. FitzGerald¹,
Chinnappa D. Kodira¹,
Anup Madan²^nAff8,
Shizhen Qin²,
Xiaoping Yang¹,
Nissa Abbasi²^nAff9,
Amr Abouelleil¹,
Harindra M. Arachchi¹,
Lida Baradarani²^nAff10,
Brian Birditt²^nAff11,
Scott Bloom²,
Toby Bloom¹,
Mark L. Borowsky¹,
Jeremy Burke²,
Jonathan Butler¹,
April Cook¹,
Kurt DeArellano¹,
David DeCaprio¹,
Lester Dorris III¹,
Monica Dors²,
Evan E. Eichler³,
Reinhard Engels¹,
Jessica Fahey²^nAff12,
Peter Fleetwood²^nAff13,
Cynthia Friedman⁴,
Gary Gearin¹,
Jennifer L. Hall¹,
Grace Hensley²^nAff14,
Ericka Johnson²^nAff15,
Charlien Jones¹,
Asha Kamat¹,
Amardeep Kaur²,
Devin P. Locke³,
Anuradha Madan²^nAff16,
Glen Munson¹,
David B. Jaffe¹,
Annie Lui¹,
Pendexter Macdonald¹,
Evan Mauceli¹,
Jerome W. Naylor¹,
Ryan Nesbitt²,
Robert Nicol¹,
Sinéad B. O'Leary¹,
Amber Ratcliffe²^nAff17,
Steven Rounsley¹,
Xinwei She³,
Katherine M. B. Sneddon⁵,
Sandra Stewart²,
Carrie Sougnez¹,
Sabrina M. Stone¹,
Kerri Topham¹,
Dascena Vincent²^nAff18,
Shunguang Wang¹,
Andrew R. Zimmer¹,
Bruce W. Birren¹,
Leroy Hood²,
Eric S. Lander¹ &
…
Chad Nusbaum¹

Nature volume 440, pages 671–675 (2006)Cite this article

8811 Accesses
57 Citations
3 Altmetric
Metrics details

Abstract

Here we present a finished sequence of human chromosome 15, together with a high-quality gene catalogue. As chromosome 15 is one of seven human chromosomes with a high rate of segmental duplication¹, we have carried out a detailed analysis of the duplication structure of the chromosome. Segmental duplications in chromosome 15 are largely clustered in two regions, on proximal and distal 15q; the proximal region is notable because recombination among the segmental duplications can result in deletions causing Prader-Willi and Angelman syndromes^2,3. Sequence analysis shows that the proximal and distal regions of 15q share extensive ancient similarity⁴. Using a simple approach, we have been able to reconstruct many of the events by which the current duplication structure arose. We find that most of the intrachromosomal duplications seem to share a common ancestry. Finally, we demonstrate that some remaining gaps in the genome sequence are probably due to structural polymorphisms between haplotypes; this may explain a significant fraction of the gaps remaining in the human genome.

You have full access to this article via your institution.

Download PDF

Complexity of avian evolution revealed by family-level genomes

Article 01 April 2024

Josefin Stiller, Shaohong Feng, … Guojie Zhang

The variation and evolution of complete human centromeres

Article Open access 03 April 2024

Glennis A. Logsdon, Allison N. Rozanski, … Evan E. Eichler

scGHOST: identifying single-cell 3D genome subcompartments

Article 08 April 2024

Kyle Xiong, Ruochi Zhang & Jian Ma

Main

The present work describes the completion of a physical map, high-quality finished sequence, and gene catalogue for the euchromatic q arm of human chromosome 15, representing 2.9% of the human genome. The finished sequence contains 81,871,010 bases and is interrupted by nine euchromatic gaps and one gap containing the heterochromatic p arm and centromere regions (Fig. 1). The total size of the euchromatic gaps is estimated at 544 kilobases (kb) (Methods and Supplementary Table S1). These gaps remain despite the screening of genomic libraries containing a combined ∼53-fold physical coverage, and are refractory to current cloning and mapping technology; six are within or adjacent to large duplicated regions. Of the finished sequence, 74% was generated by the Broad Institute of MIT and Harvard (formerly the Whitehead Institute/MIT Center for Genome Research (WICGR)), 25% by the Multimegabase Sequencing Center (initially at the University of Washington, currently at the Institute for Systems Biology), and the remaining ∼1% by three other groups (Supplementary Table S2). The analyses here are referenced to NCBI Build 35; however, we have slightly improved this sequence (including closing one of the euchromatic gaps), and provide the updated clone path in Supplementary Table S3. Details of construction of the clone map and sequencing are described in the Supplementary Information. The short arm of chromosome 15, as in other acrocentric human chromosomes (chromosomes 13, 14, 21 and 22), is heterochromatic and was not sequenced as part of the Human Genome Project; it is estimated at 17 Mb (ref. 5) and contains arrays of ribosomal RNA genes, satellite sequences and other repeated sequences⁶.

Figure 1: **Overview and duplication content of human chromosome 15.**

We assessed the local accuracy of the clone path by aligning paired-end sequences from a human fosmid library (designated WIBR2, representing 10 × physical coverage) to the finished sequence^7,8. This analysis revealed no aberrant clones. In addition, an independent quality assessment exercise commissioned by the National Human Genome Research Institute⁹ estimated the accuracy of the finished sequence to be better than one error in 100,000 bases (J. Schmutz, personal communication).

Several analyses suggest that nearly the entire euchromatic region of chromosome 15 is present and accurately represented in the finished sequence. All genes in the RefSeq¹⁰ database (596 loci, 742 transcripts) previously mapped to chromosome 15 are present and complete in the finished sequence. Furthermore, the finished sequence shows excellent alignment to genetic and radiation hybrid maps (Supplementary Fig. S1). The genetic map¹¹ shows perfect alignment, with no discrepancies among 125 sequence-based genetic markers (Supplementary Table S4). The radiation hybrid map¹² contains only local discrepancies, owing to its lower resolution (Supplementary Table S5). A large gap in the radiation hybrid coordinates (254–280 cR) at ∼74 Mb in the physical map, near a region where chromosome breakage has been observed independently in multiple mammalian lineages (see below), is probably the result of non-random breakage in the generation of the radiation hybrid panel.

We produced a manually curated⁸ catalogue of genes, containing 695 gene loci (including all genes in RefSeq) and 250 pseudogene loci on chromosome 15. Table 1 classifies the genes according to standardized categories. The 3% of genes in the ‘novel’ and ‘putative’ categories were annotated based only on spliced expressed-sequence-tag (EST) evidence; some of these may prove to be pseudogenes. The full-length transcripts of known genes have an average length of 3,267 bp, with an average of 11.6 exons. Internal exon lengths average 156 bp. Gene loci have an average of 4.6 distinct transcripts, with 66% having at least two transcripts. These gene statistics are similar to recent reports^{8,13,14,15,16}. Examples of genes that represent extremes of these distributions are described in the Supplementary Information. Most (74%) of the 250 pseudogenes are processed. In addition, we identified 9 transfer RNA genes (Supplementary Table S6) and found six known microRNAs mapping to chromosome 15 (Supplementary Table S7).

Table 1 Chromosome 15 gene content

Full size table

In most aspects of its landscape, chromosome 15 is close to genome-wide averages⁷. The overall gene density is 8.6 genes per Mb. There are 18 gene deserts (defined as 500 kb without an identified coding gene, Supplementary Table S8) comprising 14.9 Mb (∼18.3% of the chromosome). The overall G + C content is 42.2%, but varies substantially across the chromosome (Fig. 1b). Transposable element fossils cover 38.3%. Chromosome 15 is also typical in its content of non-coding sequence conservation (see Supplementary Information).

Chromosome 15 is, however, one of seven autosomes that are significantly enriched in segmental duplications (defined as regions >1 kb that are not high-copy repeats and have >90% identity to another region in the genome¹⁷), with 8.8% of its euchromatin composed of such sequence (Supplementary Fig. S2). As with other heavily duplicated chromosomes, chromosome 15 has a large fraction of intrachromosomal duplication: 50% is strictly intrachromosomal, 30% is both intra- and interchromosomal, and 20% is solely interchromosomal (largely in the proximal 1.5 Mb). The proportion of purely interchromosomal duplication might be even lower, as some undetected tandem duplication may exist near the centromere (see below). Recombination among segmental duplications within the region 15q11–q13 gives rise to deletions that are known to cause Prader-Willi and Angelman syndromes^2,3 (Supplementary Information).

We sought to investigate the duplication landscape of chromosome 15 by studying the relationships among the duplicated segments. Previous work has shown that a sequence within the Prader-Willi/Angelman syndrome region, termed LCR15 (ref. 4), is also duplicated on distal 15q (Supplementary Fig. S2). By extending our analysis to detect more ancient relationships (sequence identity less than 90%), we found much more extensive similarity among the duplicated sequences in both proximal and distal 15q (Fig. 1a). We clustered together segmental duplications containing related sequence (Methods) and found that most fell into a single large cluster, which we refer to as ‘class 1’. The class includes 67% of all bases in segmental duplications and 91% of all pairwise duplication events (as some bases reside within multiple independent events) (Supplementary Table S9).

Although the segmental duplications are related to one another in a complex fashion, we sought to identify a ‘core element’ that was present in many of the class 1 elements. We took the longest duplicated class 1 region (213 kb starting at 18.89 Mb, within the Prader-Willi/Angelman syndrome region) and aligned all duplicated regions of the chromosome to it, counting the number of different duplication regions that aligned to each base. We selected a core element that includes the highest peak of coverage (Supplementary Fig. S3); the element is 2,920 bp long and lies within the ∼15-kb LCR15 element.

The human genome contains 41 nearly full-length copies of the core element: there are 37 on chromosome 15, two on the Y chromosome, and one each on chromosomes 2 and 10. To understand the origins of the element, we compared the core element to the dog¹⁸ and mouse¹⁹ genomes. The dog and mouse genomes each contain a single copy of the element, which is orthologous to the copy on human chromosome 2. The similarity among the sequences is shown in a phylogenetic tree (Fig. 2, see Methods). The copy on chromosome 2 is at the root of the human duplications, closest to mouse and dog, as would be expected from conserved synteny. The duplications on chromosome 15 fall into two distinct and well-separated branches: a proximal branch containing all the elements in the Prader-Willi/Angelman syndrome region (chromosome position 18–32 Mb), and a distal branch containing all the elements from 73 to 88 Mb, with a tight clustering of elements around 80–83 Mb. A further two repeats in the subtelomeric region (98–100 Mb) are closely related to the proximal branch. Pairwise divergence between elements in the two branches is ∼11%, indicating that they share an ancient origin followed by local duplications, but with no recent interaction between branches.

Figure 2: **Phylogenetic tree of the 41 human copies and the unique dog and mouse copies of the conserved core element.**

From the tree, it is possible to reconstruct the likely history of the core element. The sequence on chromosome 2 lies in the 3′ untranslated region (UTR) of a splice variant of the gene intersectin 2 (ITSN2). This sequence seems to have moved by retroposition to chromosome 10 (at 30.68 Mb), inserting immediately downstream of the 5′ coding sequence of an interchromosomally duplicated copy of GOLGA2 (the origin of which is on chromosome 9). A combined unit (15 kb, consisting of GOLGA2 and the ITSN2 UTR) then was copied to chromosome 15, where it has duplicated extensively. Finally, two copies exist on the arms of a large palindrome on the Y chromosome, and seem to have moved to the Y chromosome by segmental duplication of ∼40 kb of chromosome 15 (at 82.7 Mb).

We next sought to understand why the large regions of segmental duplication in proximal 15q (denoted ‘A’) and distal 15q (denoted ‘C’) are separated by a large stretch that contains almost no duplicated sequence (denoted ‘B’). Analysis of conserved synteny with other species allows a reconstruction of the history of chromosome 15 (Fig. 3). Briefly, the three segments were adjacent in the boreoeutherian ancestor (the common ancestor of Euarchontoglires and Laurasiatheria), but were found in the order A–C–B. In the primate lineage, the chromosome apparently underwent a single large inversion that separated segments A and C. (Details of the reconstruction and comparison to recent reports^20,21 can be found in the Supplementary Information and Supplementary Fig. S4.) This suggests that the core element was transferred to chromosome 15 before the divergence of apes and Old World monkeys, and expanded locally (in the originally contiguous A–C region). The inversion subsequently separated regions A and C, and the element continued to expand separately in each region.

Figure 3: **History of the structural organization of human chromosome 15.**

To test this hypothesis, we examined the current draft assembly of the rhesus macaque genome (rheMac1; R. Gibbs, personal communication). We found at least 12 nearly full-length copies of the core element that we added to the evolutionary tree (Supplementary Fig. S5). We also found unique orthologues of the copies on human chromosomes 2 and 10. The remaining macaque elements were split between the proximal and distal clusters, confirming that the element had already appeared and begun to duplicate on chromosome 15 before the divergence of Old World monkeys and apes. The human and macaque elements are grouped into separate clusters in both the proximal and distal branches, indicating that local duplication has continued to occur in both the human and macaque lineages.

The analysis of conserved synteny also reveals that the segmental duplications are closely associated with chromosomal rearrangements. Chromosome 15 has 15 human-specific breakpoints of conserved synteny, all of which are inversions. Of these, 13 occur in regions containing class 1 duplications. This suggests that the segmental duplications may have mediated the inversions and that these inversions may have helped to disperse the elements.

The class 1 core element serves as a useful marker for tracing chromosomal history. However, the ubiquity of the core element raises the possibility that it had a causal role in the process of segmental duplication on chromosome 15. The element is derived from a UTR on chromosome 2, of which at least 500 bases are highly conserved across mammals and thus are presumably functional. Moreover, many of the copies on chromosome 15 are transcribed: 13 known genes on chromosome 15 (all golgins or golgin-like proteins) contain this duplicated UTR, and another 16 transcripts stop just short of it (Supplementary Table S10). It will be interesting to investigate whether functional properties of the fusion element on chromosome 15 promote local duplication, and to explore whether this had significant implications for primate evolution.

Finally, we note that the segmental duplications represent the main challenge in closing the remaining gaps in the sequence of chromosome 15. Build 35 contains ten gaps, seven of which lie within or immediately adjacent to class 1 duplications (Fig. 1). In some cases, the duplicated sequences flanking the gaps are so similar (> 99.7% identity) that they may represent allelic variants. Moreover, six of the seven duplication-associated gaps are adjacent to or within reported sites of copy-number polymorphism^22,23 (Supplementary Table S1). We have recently been able to close one gap (at 82.7 Mb) (decreasing the number of gaps to nine) by finding previously missed overlap between two flanking clones; another clone spanning this gap carries an alternative haplotype with an additional 100 kb, including an 80-kb near-perfect duplication. Examination of three of the other gaps suggests that they might also be due to structural variation, although more work will be required to confirm this.

The finished sequence of chromosome 15 offers a window into the natural history of segmental duplications and the structural history of chromosomes. Notably, most of the intrachromosomal duplication involves a single class of duplicons. On the basis of these results, we suggest an important role for such duplicons in structural evolution and gene diversification.

Methods

Production of gene catalogue and annotation

The gene catalogue was produced as described previously⁸. Gene symbols were assigned by the HUGO Gene Nomenclature Committee for biologically characterized loci. A complete list of gene symbols from this paper can be found in Supplementary Table S11. Annotation was performed as described previously⁸. Our annotations are available from the Vertebrate Genome Annotation database (VEGA, http://vega.sanger.ac.uk/Homo_sapiens).

Segmental duplications

Segmental duplications were defined as pairs of regions of 90% or greater identity (excluding repeat-masked bases) that extend for 1 kb or more. The map of segmental duplications was prepared using a method adapted from ref. 17, by concatenating all-against-all MegaBlast²⁴ alignments. A genome database was built using hard-masked sequence. This same hard-masked sequence was presented to MegaBlast as a probe, chromosome by chromosome. All alignments of 80% or better identity with expectation <10^-4 were kept. Alignments were then concatenated if they were contiguous except for masked repeats. Unmasked gaps could be crossed but were penalized to prevent over-merging by being treated as bases of 50% identity. Final segments meeting the 1-kb length and 90% identity criteria were retained.

Duplication class clustering

Pairwise intrachromosomal duplications were defined as above. A pairwise duplication A∼A′ was considered to be in the same class as another pairwise duplication B∼B′ if B or B′ overlapped A or A′ by 150 bp or more. We extended this by transitive closure to build maximally linked sets (that is, if A∼A′ linked to B∼B′ and C∼C′, all were clustered, even if B∼B′ did not overlap C∼C′). The number of duplications in a class is counted as the number of distinct pairwise alignments X∼X′ that were clustered. The number of bases in a class is counted as the number of distinct bases covered by at least one pairwise duplication in that class.

Construction of core element phylogeny

Full-length or nearly full-length copies of the core element in human were identified by MegaBlast (release 2.2.11). Copies in the mouse and dog genomes were identified by MegaBlast followed by blastn (release 2.2.11) to refine the boundaries and extend the regions. Multiple alignments of the elements were generated with ClustalW (v.1.83). Pairwise and multiple alignment parameters were adjusted by reducing the gap extension penalty to 0.1 and replacing the standard DNA matrix with a custom matrix scoring 10 for any match, -5 for any mismatch, and 0 for any alignment to an unknown base (N). The trees were output in phylip format and all gaps of length >1 converted to single indels by substitution of ‘?’ characters for all but the first ‘ - ’ in the gap to avoid generating disproportionately long branches for element copies with substantial deletion. Terminal gaps were also treated this way. Trees were built with the dnapars parsimony module of phylip (v.3.65)²⁵. The tree represented is the first of 15 equally likely trees that differ only in the leaf placement of the seven nearly identical copies of the element at 80 and 82 Mb on chromosome 15.

References

Bailey, J. A. et al. Recent segmental duplications in the human genome. Science 297, 1003–1007 (2002)
Article ADS CAS Google Scholar
Emanuel, B. S. & Shaikh, T. H. Segmental duplications: an ‘expanding’ role in genomic instability and disease. Nature Rev. Genet. 2, 791–800 (2001)
Article CAS Google Scholar
Soejima, H. & Wagstaff, J. Imprinting centers, chromatin structure, and disease. J. Cell. Biochem. 95, 226–233 (2005)
Article CAS Google Scholar
Pujana, M. A. et al. Additional complexity on human chromosome 15q: Identification of a set of newly recognized duplicons (LCR15) on 15q11-q13, 15q24, and 15q26. Genome Res. 11, 98–111 (2001)
Article CAS Google Scholar
Morton, N. Parameters of the human genome. Proc. Natl Acad. Sci. USA 88, 7474–7476 (1991)
Article ADS CAS Google Scholar
Kehrer-Sawatzki, H. et al. Mapping of members of the low-copy-number repetitive DNA sequence family from chAB4 within the p arms of human acrocentric chromosomes: characterization of Robertsonian translocations. Chromosome Res. 6, 429–435 (1998)
Article CAS Google Scholar
International Human Genome Sequencing Consortium. Finishing the euchromatic sequence of the human genome. Nature 431, 931–945 (2004)
Article ADS Google Scholar
Nusbaum, C. et al. DNA sequence and analysis of human chromosome 18. Nature 437, 551–555 (2005)
Article ADS CAS Google Scholar
Schmutz, J. et al. Quality assessment of the human genome sequence. Nature 429, 365–368 (2004)
Article ADS CAS Google Scholar
Pruitt, K. D., Tatusova, T. & Maglott, D. R. NCBI Reference Sequence (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins. Nucleic Acids Res. 33, D501–D504 (2005)
Article CAS Google Scholar
Kong, A. et al. A high-resolution recombination map of the human genome. Nature Genet. 31, 241–247 (2002)
Article CAS Google Scholar
Schuler, G. D. et al. A gene map of the human genome. Science 274, 540–546 (1996)
Article ADS CAS Google Scholar
Nusbaum, C. et al. DNA sequence and analysis of human chromosome 8. Nature 439, 331–335 (2006)
Article ADS CAS Google Scholar
Hillier, L. W. et al. Generation and annotation of the DNA sequences of human chromosomes 2 and 4. Nature 434, 724–731 (2005)
Article ADS CAS Google Scholar
Grimwood, J. et al. The DNA sequence and biology of human chromosome 19. Nature 428, 529–535 (2004)
Article ADS CAS Google Scholar
Deloukas, P. et al. The DNA sequence and comparative analysis of human chromosome 10. Nature 429, 375–381 (2004)
Article ADS CAS Google Scholar
Bailey, J. A. et al. Segmental duplications: organization and impact within the current human genome project assembly. Genome Res. 11, 1005–1017 (2001)
Article CAS Google Scholar
Lindblad-Toh, K. et al. Genome sequence, comparative analysis and haplotype structure of the domestic dog. Nature 438, 803–819 (2005)
Article ADS CAS Google Scholar
Mouse Genome Sequencing Consortium. Initial sequencing and comparative analysis of the mouse genome. Nature 420, 520–562 (2002)
Article Google Scholar
Murphy, W. J. et al. Dynamics of mammalian chromosome evolution inferred from multispecies comparative maps. Science 309, 613–617 (2005)
Article ADS CAS Google Scholar
Bourque, G., Zdobnov, E. M., Bork, P., Pevzner, P. A. & Tesler, G. Comparative architectures of mammalian and chicken genomes reveal highly variable rates of genomic rearrangements across different lineages. Genome Res. 15, 98–110 (2005)
Article CAS Google Scholar
Sharp, A. J. et al. Segmental duplications and copy-number variation in the human genome. Am. J. Hum. Genet. 77, 78–88 (2005)
Article CAS Google Scholar
Sebat, J. et al. Large-scale copy number polymorphism in the human genome. Science 305, 525–528 (2004)
Article ADS CAS Google Scholar
Altschul, S. F. et al. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. NAR 25, 3389 (1997)
Article CAS Google Scholar
Felsenstein, J. PHYLIP—Phylogeny Inference Package (version 3.2). Cladistics 5, 164–166 (1989)
Google Scholar

Download references

Acknowledgements

We thank L. Gaffney for help with figures and text. We are grateful to T. Furey for help with lists of genetic markers and placement of RefSeqs, and to K. Lindblad-Toh for sharing data from the genome projects of dog and opossum. Fluorescence in situ hybridization (FISH) data for opossum were provided by M. Breen. We thank the members of the Baylor College of Medicine Human Genome Sequencing Center, the J. Craig Venter Institute Joint Technology Center, and the Washington University Genome Sequencing Center for generation and early release of the assembly of the rhesus macaque genome. We thank the Sanger Institute for gap sizing by FISH. We also acknowledge the HUGO Gene Nomenclature Committee (S. Povey (chair), E. A. Bruford, V. K. Khodiyar, R. C. Lovering, M. J. Lush, T. P. Sneddon, C. C. Talbot Jr and M. W. Wright) for assigning official gene symbols. We are grateful to all members, present and past, of the Broad (and Whitehead) sequencing platform for their dedication and the consistent high quality of their data.

Author information

Charles A. Whittaker
Present address: MIT Center for Cancer Research, 77 Massachusetts Avenue E18-570, Cambridge, Massachusetts, 02139, USA
Ken Dewar
Present address: McGill University and Genome Quebec Innovation Centre, Montreal, Quebec, H3A 1A4, Canada
Anup Madan
Present address: Neurogenomics Research Lab, 200 B EMRB, University of Iowa, Iowa City, Iowa, 52242, USA
Nissa Abbasi
Present address: Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue North, Seattle, Washington, 98109, USA
Lida Baradarani
Present address: Blue Heron Technologies, Bothell, Washington, 98021, USA
Brian Birditt
Present address: Department of Microbiology, Box 358070, University of Washington, Seattle, Washington, 98195, USA
Jessica Fahey
Present address: Seattle University School of Nursing, Seattle, Washington, 98122, USA
Peter Fleetwood
Present address: Corbis Corporation, Seattle, Washington, 98104, USA
Grace Hensley
Present address: Geospiza, 100 West Harrison North Tower, Suite 330, Seattle, Washington, 98119, USA
Ericka Johnson
Present address: Division of Medical Genetics, Box 357720, University of Washington, Seattle, Washington, 98195, USA
Anuradha Madan
Present address: 3095 Medical Laboratories, Department of Neurosurgery, University of Iowa, Iowa City, Iowa, 52242, USA
Amber Ratcliffe
Present address: Nanostring Technologies, 201 Elliott Avenue West, Suite 300, Seattle, Washington, 98119, USA
Dascena Vincent
Present address: Genelex Corporation, 3000 1st Avenue, Suite 1, Seattle, Washington, 98121, USA

Authors and Affiliations

Broad Institute of MIT and Harvard, 320 Charles Street, Massachusetts, 02141, Cambridge, USA
Michael C. Zody, Manuel Garber, Ted Sharpe, Sarah K. Young, Keith O'Neill, Charles A. Whittaker, Michael Kamal, Jean L. Chang, Christina A. Cuomo, Ken Dewar, Michael G. FitzGerald, Chinnappa D. Kodira, Xiaoping Yang, Amr Abouelleil, Harindra M. Arachchi, Toby Bloom, Mark L. Borowsky, Jonathan Butler, April Cook, Kurt DeArellano, David DeCaprio, Lester Dorris III, Reinhard Engels, Gary Gearin, Jennifer L. Hall, Charlien Jones, Asha Kamat, Glen Munson, David B. Jaffe, Annie Lui, Pendexter Macdonald, Evan Mauceli, Jerome W. Naylor, Robert Nicol, Sinéad B. O'Leary, Steven Rounsley, Carrie Sougnez, Sabrina M. Stone, Kerri Topham, Shunguang Wang, Andrew R. Zimmer, Bruce W. Birren, Eric S. Lander & Chad Nusbaum
Institute for Systems Biology, 1441 North 34th Street, Washington, 98103, Seattle, USA
Lee Rowen, Anup Madan, Shizhen Qin, Nissa Abbasi, Lida Baradarani, Brian Birditt, Scott Bloom, Jeremy Burke, Monica Dors, Jessica Fahey, Peter Fleetwood, Grace Hensley, Ericka Johnson, Amardeep Kaur, Anuradha Madan, Ryan Nesbitt, Amber Ratcliffe, Sandra Stewart, Dascena Vincent & Leroy Hood
Department of Genome Sciences, University of Washington, Washington, 98195, Seattle, USA
Evan E. Eichler, Devin P. Locke & Xinwei She
Division of Human Biology, Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue North, Washington, 98109, Seattle, USA
Cynthia Friedman
HUGO Gene Nomenclature Committee (HGNC), The Galton Laboratory, Department of Biology, University College London, Wolfson House, 4 Stephenson Way, NW1 2HE, London, UK
Katherine M. B. Sneddon

Authors

Michael C. Zody
View author publications
You can also search for this author in PubMed Google Scholar
Manuel Garber
View author publications
You can also search for this author in PubMed Google Scholar
Ted Sharpe
View author publications
You can also search for this author in PubMed Google Scholar
Sarah K. Young
View author publications
You can also search for this author in PubMed Google Scholar
Lee Rowen
View author publications
You can also search for this author in PubMed Google Scholar
Keith O'Neill
View author publications
You can also search for this author in PubMed Google Scholar
Charles A. Whittaker
View author publications
You can also search for this author in PubMed Google Scholar
Michael Kamal
View author publications
You can also search for this author in PubMed Google Scholar
Jean L. Chang
View author publications
You can also search for this author in PubMed Google Scholar
Christina A. Cuomo
View author publications
You can also search for this author in PubMed Google Scholar
Ken Dewar
View author publications
You can also search for this author in PubMed Google Scholar
Michael G. FitzGerald
View author publications
You can also search for this author in PubMed Google Scholar
Chinnappa D. Kodira
View author publications
You can also search for this author in PubMed Google Scholar
Anup Madan
View author publications
You can also search for this author in PubMed Google Scholar
Shizhen Qin
View author publications
You can also search for this author in PubMed Google Scholar
Xiaoping Yang
View author publications
You can also search for this author in PubMed Google Scholar
Nissa Abbasi
View author publications
You can also search for this author in PubMed Google Scholar
Amr Abouelleil
View author publications
You can also search for this author in PubMed Google Scholar
Harindra M. Arachchi
View author publications
You can also search for this author in PubMed Google Scholar
Lida Baradarani
View author publications
You can also search for this author in PubMed Google Scholar
Brian Birditt
View author publications
You can also search for this author in PubMed Google Scholar
Scott Bloom
View author publications
You can also search for this author in PubMed Google Scholar
Toby Bloom
View author publications
You can also search for this author in PubMed Google Scholar
Mark L. Borowsky
View author publications
You can also search for this author in PubMed Google Scholar
Jeremy Burke
View author publications
You can also search for this author in PubMed Google Scholar
Jonathan Butler
View author publications
You can also search for this author in PubMed Google Scholar
April Cook
View author publications
You can also search for this author in PubMed Google Scholar
Kurt DeArellano
View author publications
You can also search for this author in PubMed Google Scholar
David DeCaprio
View author publications
You can also search for this author in PubMed Google Scholar
Lester Dorris III
View author publications
You can also search for this author in PubMed Google Scholar
Monica Dors
View author publications
You can also search for this author in PubMed Google Scholar
Evan E. Eichler
View author publications
You can also search for this author in PubMed Google Scholar
Reinhard Engels
View author publications
You can also search for this author in PubMed Google Scholar
Jessica Fahey
View author publications
You can also search for this author in PubMed Google Scholar
Peter Fleetwood
View author publications
You can also search for this author in PubMed Google Scholar
Cynthia Friedman
View author publications
You can also search for this author in PubMed Google Scholar
Gary Gearin
View author publications
You can also search for this author in PubMed Google Scholar
Jennifer L. Hall
View author publications
You can also search for this author in PubMed Google Scholar
Grace Hensley
View author publications
You can also search for this author in PubMed Google Scholar
Ericka Johnson
View author publications
You can also search for this author in PubMed Google Scholar
Charlien Jones
View author publications
You can also search for this author in PubMed Google Scholar
Asha Kamat
View author publications
You can also search for this author in PubMed Google Scholar
Amardeep Kaur
View author publications
You can also search for this author in PubMed Google Scholar
Devin P. Locke
View author publications
You can also search for this author in PubMed Google Scholar
Anuradha Madan
View author publications
You can also search for this author in PubMed Google Scholar
Glen Munson
View author publications
You can also search for this author in PubMed Google Scholar
David B. Jaffe
View author publications
You can also search for this author in PubMed Google Scholar
Annie Lui
View author publications
You can also search for this author in PubMed Google Scholar
Pendexter Macdonald
View author publications
You can also search for this author in PubMed Google Scholar
Evan Mauceli
View author publications
You can also search for this author in PubMed Google Scholar
Jerome W. Naylor
View author publications
You can also search for this author in PubMed Google Scholar
Ryan Nesbitt
View author publications
You can also search for this author in PubMed Google Scholar
Robert Nicol
View author publications
You can also search for this author in PubMed Google Scholar
Sinéad B. O'Leary
View author publications
You can also search for this author in PubMed Google Scholar
Amber Ratcliffe
View author publications
You can also search for this author in PubMed Google Scholar
Steven Rounsley
View author publications
You can also search for this author in PubMed Google Scholar
Xinwei She
View author publications
You can also search for this author in PubMed Google Scholar
Katherine M. B. Sneddon
View author publications
You can also search for this author in PubMed Google Scholar
Sandra Stewart
View author publications
You can also search for this author in PubMed Google Scholar
Carrie Sougnez
View author publications
You can also search for this author in PubMed Google Scholar
Sabrina M. Stone
View author publications
You can also search for this author in PubMed Google Scholar
Kerri Topham
View author publications
You can also search for this author in PubMed Google Scholar
Dascena Vincent
View author publications
You can also search for this author in PubMed Google Scholar
Shunguang Wang
View author publications
You can also search for this author in PubMed Google Scholar
Andrew R. Zimmer
View author publications
You can also search for this author in PubMed Google Scholar
Bruce W. Birren
View author publications
You can also search for this author in PubMed Google Scholar
Leroy Hood
View author publications
You can also search for this author in PubMed Google Scholar
Eric S. Lander
View author publications
You can also search for this author in PubMed Google Scholar
Chad Nusbaum
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Michael C. Zody or Chad Nusbaum.

Ethics declarations

Competing interests

Accession numbers for all clones contributing to the finished sequence of human chromosome 15 can be found in Supplementary Table S3. The updated human chromosome 15 sequence can be accessed through GenBank accession number NC_000015. Reprints and permissions information is available at npg.nature.com/reprintsandpermissions. The authors declare no competing financial interests.

Supplementary information

Supplementary Notes

This file contains the Supplementary Methods, Supplementary Data, Supplementary Discussion and additional references. (DOC 78 kb)

Supplementary Tables

This file contains Supplementary Tables 1–11. The Supplementary Tables cover: gaps, contributing centres, clones on the finished path, genetic markers, radiation hybrid markers, tRNA genes, known microRNA genes, gene deserts, duplication classes, genes in the class 1 core element, gene names and symbols. (DOC 3384 kb)

Supplementary Figure 1

Relationship of the finished sequence map to genetic and radiation hybrid maps of chromosome 15. (PDF 249 kb)

Supplementary Figure 2

Segmental duplication structure of human Chromosome 15. (PDF 171 kb)

Supplementary Figure 3

Density plot(s) of the proximal element (coverage map). (PDF 317 kb)

Supplementary Figure 4

Detailed history of the structural organization of human Chromosome 15. (PDF 250 kb)

Supplementary Figure 5

Phylogenetic tree of core element with macaque included. (PDF 221 kb)

Supplementary Figure 6

Segmental duplication structure of the Prader-Willi/Angelman syndrome region on proximal Chromosome 15. (PDF 1947 kb)

Supplementary Figure Legends

Text to accompany the above Supplementary Figures. (DOC 27 kb)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Zody, M., Garber, M., Sharpe, T. et al. Analysis of the DNA sequence and duplication history of human chromosome 15. Nature 440, 671–675 (2006). https://doi.org/10.1038/nature04601

Download citation

Received: 09 November 2005
Accepted: 26 January 2006
Issue Date: 30 March 2006
DOI: https://doi.org/10.1038/nature04601

This article is cited by

Rare partial octosomy and hexasomy of 15q11-q13 associated with intellectual impairment and development delay: report of two cases and review of literature
- Haiyu Li
- Juan Du
- Yue-Qiu Tan
Molecular Cytogenetics (2018)
Copy number changes and methylation patterns in an isodicentric and a ring chromosome of 15q11-q13: report of two cases and review of literature
- Qin Wang
- Weiqing Wu
- Jiansheng Xie
Molecular Cytogenetics (2015)
Palindromic GOLGA8 core duplicons promote chromosome 15q13.3 microdeletion and evolutionary instability
- Francesca Antonacci
- Megan Y Dennis
- Evan E Eichler
Nature Genetics (2014)
Copy Number Variations at the Prader‐Willi Syndrome Region on Chromosome 15 and associations with Obesity in Whites
- Yuan Chen
- Yong‐Jun Liu
- Hong‐Wen Deng
Obesity (2011)
The genomic distribution of intraspecific and interspecific sequence divergence of human segmental duplications relative to human/chimpanzee chromosomal rearrangements
- Tomàs Marques-Bonet
- Ze Cheng
- Arcadi Navarro
BMC Genomics (2008)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.