DNA sequence and analysis of human chromosome 8

Nusbaum, Chad; Mikkelsen, Tarjei S.; Zody, Michael C.; Asakawa, Shuichi; Taudien, Stefan; Garber, Manuel; Kodira, Chinnappa D.; Schueler, Mary G.; Shimizu, Atsushi; Whittaker, Charles A.; Chang, Jean L.; Cuomo, Christina A.; Dewar, Ken; FitzGerald, Michael G.; Yang, Xiaoping; Allen, Nicole R.; Anderson, Scott; Asakawa, Teruyo; Blechschmidt, Karin; Bloom, Toby; Borowsky, Mark L.; Butler, Jonathan; Cook, April; Corum, Benjamin; DeArellano, Kurt; DeCaprio, David; Dooley, Kathleen T.; Dorris, Lester; Engels, Reinhard; Glöckner, Gernot; Hafez, Nabil; Hagopian, Daniel S.; Hall, Jennifer L.; Ishikawa, Sabine K.; Jaffe, David B.; Kamat, Asha; Kudoh, Jun; Lehmann, Rüdiger; Lokitsang, Tashi; Macdonald, Pendexter; Major, John E.; Matthews, Charles D.; Mauceli, Evan; Menzel, Uwe; Mihalev, Atanas H.; Minoshima, Shinsei; Murayama, Yuji; Naylor, Jerome W.; Nicol, Robert; Nguyen, Cindy; O'Leary, Sinéad B.; O'Neill, Keith; Parker, Stephen C. J.; Polley, Andreas; Raymond, Christina K.; Reichwald, Kathrin; Rodriguez, Joseph; Sasaki, Takashi; Schilhabel, Markus; Siddiqui, Roman; Smith, Cherylyn L; Sneddon, Tam P.; Talamas, Jessica A.; Tenzin, Pema; Topham, Kerri; Venkataraman, Vijay; Wen, Gaiping; Yamazaki, Satoru; Young, Sarah K.; Zeng, Qiandong; Zimmer, Andrew R.; Rosenthal, Andre; Birren, Bruce W.; Platzer, Matthias; Shimizu, Nobuyoshi; Lander, Eric S.

doi:10.1038/nature04406

Letter
Published: 19 January 2006

DNA sequence and analysis of human chromosome 8

Chad Nusbaum¹,
Tarjei S. Mikkelsen¹,
Michael C. Zody¹,
Shuichi Asakawa²,
Stefan Taudien³,
Manuel Garber¹,
Chinnappa D. Kodira¹,
Mary G. Schueler⁴,
Atsushi Shimizu²,
Charles A. Whittaker¹^nAff7,
Jean L. Chang¹,
Christina A. Cuomo¹,
Ken Dewar¹^nAff8,
Michael G. FitzGerald¹,
Xiaoping Yang¹,
Nicole R. Allen¹,
Scott Anderson¹,
Teruyo Asakawa²,
Karin Blechschmidt³,
Toby Bloom¹,
Mark L. Borowsky¹,
Jonathan Butler¹,
April Cook¹,
Benjamin Corum¹,
Kurt DeArellano¹,
David DeCaprio¹,
Kathleen T. Dooley¹,
Lester Dorris III¹,
Reinhard Engels¹,
Gernot Glöckner³,
Nabil Hafez¹,
Daniel S. Hagopian¹,
Jennifer L. Hall¹,
Sabine K. Ishikawa²,
David B. Jaffe¹,
Asha Kamat¹,
Jun Kudoh²,
Rüdiger Lehmann³,
Tashi Lokitsang¹,
Pendexter Macdonald¹,
John E. Major¹,
Charles D. Matthews¹,
Evan Mauceli¹,
Uwe Menzel³^nAff9,
Atanas H. Mihalev¹,
Shinsei Minoshima²^nAff10,
Yuji Murayama²,
Jerome W. Naylor¹,
Robert Nicol¹,
Cindy Nguyen¹,
Sinéad B. O'Leary¹,
Keith O'Neill¹,
Stephen C. J. Parker¹^nAff11,
Andreas Polley³^nAff12,
Christina K. Raymond¹,
Kathrin Reichwald³^nAff13,
Joseph Rodriguez¹,
Takashi Sasaki²,
Markus Schilhabel³,
Roman Siddiqui³,
Cherylyn L Smith¹,
Tam P. Sneddon⁵,
Jessica A. Talamas¹,
Pema Tenzin¹,
Kerri Topham¹,
Vijay Venkataraman¹,
Gaiping Wen³^nAff14,
Satoru Yamazaki²,
Sarah K. Young¹,
Qiandong Zeng¹,
Andrew R. Zimmer¹,
Andre Rosenthal³^nAff15,
Bruce W. Birren¹,
Matthias Platzer³,
Nobuyoshi Shimizu² &
…
Eric S. Lander¹

Nature volume 439, pages 331–335 (2006)Cite this article

17k Accesses
92 Citations
34 Altmetric
Metrics details

Abstract

The International Human Genome Sequencing Consortium (IHGSC) recently completed a sequence of the human genome¹. As part of this project, we have focused on chromosome 8. Although some chromosomes exhibit extreme characteristics in terms of length, gene content, repeat content and fraction segmentally duplicated, chromosome 8 is distinctly typical in character, being very close to the genome median in each of these aspects. This work describes a finished sequence and gene catalogue for the chromosome, which represents just over 5% of the euchromatic human genome. A unique feature of the chromosome is a vast region of ∼15 megabases on distal 8p that appears to have a strikingly high mutation rate, which has accelerated in the hominids relative to other sequenced mammals. This fast-evolving region contains a number of genes related to innate immunity and the nervous system, including loci that appear to be under positive selection²—these include the major defensin (DEF) gene cluster^3,4 and MCPH1^5,6, a gene that may have contributed to the evolution of expanded brain size in the great apes. The data from chromosome 8 should allow a better understanding of both normal and disease biology and genome evolution.

You have full access to this article via your institution.

Download PDF

The structure, function and evolution of a complete human chromosome 8

Article Open access 07 April 2021

Pan-genomics in the human genome era

Article 07 February 2020

The complete sequence of a human Y chromosome

Article 23 August 2023

Main

The finished sequence of chromosome 8 contains 145,556,489 bases and is interrupted by only four euchromatic gaps, one gap at the 8p telomere and one gap containing the centromeric heterochromatin (Fig. 1 and Supplementary Table S1). These gaps are refractory to current cloning and mapping technology. The estimated total size of the euchromatic gaps is 427 kilobases (kb), based on direct sizing of three gaps and estimation of the remaining two gaps at the genome-wide average of ∼100 kb each. This corresponds to ∼0.3% of the euchromatic length of the chromosome, similar to the genome average^{1,7,8,9,10,11}. In all, 182.3 megabases (Mb) of finished sequence were generated by the Broad Institute of MIT and Harvard (formerly Whitehead Institute/MIT Center for Genome Research (WICGR)), 27.9 Mb by Keio University School of Medicine, 8.4 Mb by the Institute of Molecular Biotechnology in Jena, and 5.8 Mb by 10 other groups (Supplementary Tables S2 and S3). These sequences (which include overlap) were combined to yield the finished path (see Methods).

Figure 1: **Overview of human chromosome 8.**

We assessed the local accuracy of the clone path by aligning paired-end sequences from a human Fosmid library (WIBR2, representing ×10 physical coverage) to the finished sequence⁷. Errors in the clone path were detected by identifying discrepancies between the predicted and observed distances between Fosmid ends⁷. This revealed two deleted clones, which were replaced. Finally, an independent quality assessment exercise commissioned by NHGRI estimated the accuracy of the finished sequence at less than 1 error in 100,000 bases¹² (J. Schmutz, personal communication).

Several analyses support the idea that nearly the entire euchromatic region of chromosome 8 is present and accurately represented. From the well-curated RefSeq¹³ data set 681 transcripts (from 573 unique genes) mapped to chromosome 8. All but one of these are present and complete in the finished sequence. The finished sequence shows excellent co-linearity with the genetic map¹⁴ (Supplementary Fig. S1). Among 247 sequence-based genetic markers (Supplementary Table S4) there are six discrepancies. One discrepancy consists of eight markers and spans a region in 8p23 known to be the site of a polymorphic inversion in the human population^15,16 (see below). Five discrepancies each consist of single markers out of order by one position; all occur in small regions where the genetic map shows no recombination in one of the two sexes (Supplementary Table S4). The sequence also shows good agreement with the radiation hybrid (RH) map¹⁷ (Supplementary Table S5).

We produced a manually curated gene catalogue, containing 793 gene loci and 301 pseudogene loci (see Methods). The catalogue includes all previously known genes on chromosome 8 (Table 1). According to the Hawk2 categorization scheme¹⁸, there are 614 ‘known’ genes, 109 ‘novel CDS’, 43 ‘novel transcripts’, 14 ‘putatives’ and 13 ‘gene fragments’. The small set of novel and putative categories were annotated by spliced expressed sequence tag (EST) evidence only; some ‘putative novel’ loci may prove to be pseudogenes. Comparison of manual annotation performed at the Broad Institute of MIT and Harvard to manual annotation for specific regions done at Jena and Keio indicated that they were largely the same, and that virtually all differences could be attributable to edge effects (see Supplementary Information).

Table 1 Chromosome 8 gene content

Full size table

Full-length transcripts of known genes contain an average of 9.9 exons, comparable to recently published reports^8,9,10,11,19, have an average length of 3,056 base pairs (bp), and internal exons have an average length of 155 bp. There is evidence of extensive alternate splicing. Gene loci have an average of 4.1 distinct transcripts, with 63% having at least two transcripts, values that are similar to recent reports^8,9,11,20. Of the 301 pseudogenes on chromosome 8, ∼84% are processed pseudogenes arising from retrotransposition; the remaining 16% are unprocessed. We also identified 13 tRNA genes (Supplementary Table S6). Examples of genes that represent extremes from these averages are described in Supplementary Information.

Several aspects of the genome landscape are notable. The overall gene density is 5.6 genes Mb^-1, below the genome average of ∼10 genes Mb^-1. Gene distribution is highly heterogeneous, with 44 gene deserts (500 kb without a coding gene, Supplementary Table S7) that together comprise 41.9 Mb or ∼29% the total length. The overall G + C content is 39.2%, but varies substantially across the chromosome (Fig. 1). Nearly half of the chromosome is composed of repeat sequences, with transposable element fossils comprising 44.5%, low complexity sequence (including simple sequence repeats and satellite sequences) comprising 1.8%, and segmental duplications comprising ∼2.1% (with interchromosomal and intrachromosomal duplications at ∼1.5% each, with some sequence included in both categories) (E. Eichler and X. She, personal communication).

Chromosome 8 is the first human autosome and one of only two chromosomes (the other being chromosome X²⁰) for which sequences span the entire pericentromeric region. The regions on both arms stretch from unique euchromatin through pericentromeric satellites and into the higher-order alpha-satellite array (Fig. 2). Three variant higher-order repeat units populate the chromosome 8 higher-order array, D8Z2 (ref. 21 and Supplementary Information). The proximal termini of both the 8p and 8q sequence contigs are comprised of nine copies of the 1.9-kb unit. The p and q arm higher-order units are highly identical to each other (96–98%) and occur in the same head-to-tail orientation, indicating that these sequences sample the edges of the chromosome 8-specific array. Analysis of the finished pericentromeric sequence of chromosome 8 is essential to test and further develop primate centromere evolution hypotheses using an autosomal model.

**Figure 2: **8p and 8q pericentromeric contigs extend into chromosome 8-specific higher-order alpha satellite,** ***D8Z2***.**

The most striking feature on chromosome 8 emerges from evolutionary and population genetic comparisons (Fig. 3). The most distal 15 Mb on chromosome 8p show an extremely high divergence between human and chimpanzee (0.021 substitutions per site, 4.0 s.d. above the mean of 0.012). The region also shows a strikingly high polymorphism rate in the human population (0.0018, 3.2 s.d. above the mean of 0.0010). The peak divergence reaches 0.032 (8.6 s.d.), and diversity 0.0028 (7.1 s.d.), across a 1-Mb region (3.3–4.3 Mb) overlapping the CSMD1 gene. This is the highest divergence level seen across all autosomes and chromosome X. Only regions of chromosome Y may be more rapidly diverging, driven by the high mutation rate in the male germ line. We excluded trivial explanations for this observation, such as unresolved segmental duplications (Supplementary Information). Diversity is also locally high in the chimpanzee, although the data are more limited.

Figure 3: **Diversity and divergence on 8p.**

The high rate of divergence and diversity at distal 8p might reflect either an extraordinary mutation rate or population genetic history. The latter alternative would require an unusually long coalescence time to the most recent common ancestor over a very large region; this would be remarkable inasmuch as local coalescence times tend to be correlated over short distances, as the correlation falls below 0.5 within 20 kb (ref. ref. 22). We sought to resolve the issue by examining the divergence rates with more distant mammalian species, where the impact of population genetic history should be negligible.

Comparison of ancestral interspersed repeats in the human, dog²³ and mouse²⁴ genomes reveals that the region exhibits above-average lineage-specific divergence rates on all three lineages across 100 million years of evolution, but that the rate is the most elevated relative to the genome-wide mean in the lineage leading to humans. The greatest elevation is seen in the most distal 6 Mb of 8p, where the ancestral interspersed repeat divergence rates in the orthologous sequences have been 0.19 (3.3 s.d. above the mean of 0.14) on the human lineage and 0.41 (1.0 s.d. above the mean of 0.38) in the mouse lineage since the primate–rodent split, and 0.24 (1.9 s.d. above the mean of 0.20) in the dog lineage since the divergence from the common boreo-eutherian ancestor.

The biological basis for the apparently high mutation rate is unclear. Three major factors have been associated with high mutation rates in the human genome: proximity to telomeres, high recombination rate and high A + T content^25,26. The region on chromosome 8p has all three factors. The mean sex-averaged recombination rate across the first 6 Mb is 2.7 cM Mb^-1, with a 1-Mb window peak of 3.5, as compared to the genome-wide average of 1.2. The region from 2.5–6 Mb is 62% A + T, as compared to a genome-wide average of 59%. It is unusual in this regard, because subtelomeric regions with high recombination rates are typically (A + T)-poor. Notably, the region is not subtelomeric in the mouse, where the lowest rate elevation is observed.

The distal region on chromosome 8p also contains at least two loci that appear to be undergoing positive selection (Fig. 3). The first locus is the major cluster of defensin genes, which lies within the region of high mutation (5.5–7.5 Mb), although ∼2.5 Mb from the peak. The defensin genes express small cationic antimicrobial peptides crucial to the innate immune response²⁷. Studies^2,3 have suggested that defensins have been under positive selection, with a high ratio of non-synonymous to synonymous changes detected in the mature peptide coding exon. Moreover, gene and segmental duplication within the cluster have led to extensive copy number^28,29 and haplotype³⁰ polymorphism within and across populations, which are thought to influence variation in disease susceptibility and contribute to ongoing adaptive evolution in both the human and chimpanzee species. The second locus showing positive selection is MCPH1, mutations in which cause microcephaly (Online Mendelian Inheritance in Man (OMIM): 251200); there is clear evidence of accelerated non-synonymous divergence correlating with the expansion of brain size throughout the lineage from simian ancestors to the human and chimpanzee^4,5.

To investigate the diversity of copy number in the defensin clusters, we resequenced several dozen polymerase chain reaction (PCR) products from representative intervals from DEFB105A (beta-defensin cluster) and DEFA1 (alpha-defensin cluster) in 14 chimpanzees, 1 gibbon, 1 macaque and 4 breeds of dog (see Methods and Supplementary Information). In all species studied, the gene family has multiple members, and the members are more similar within a species than across species. Thus, the defensin clusters have either independently duplicated in each species or have undergone gene conversion events within species.

Finally, we note that the majority of the genes in the region of high divergence in distal 8p play important roles in development or signalling in the nervous system. Notably, the extremely large CSMD1 gene, which lies at the peak of divergence and diversity, is widely expressed in brain tissues. High regional mutation rates and positive selection are generally assumed to be distinct, but it is possible that the former may facilitate the latter by increasing the rate of appearance of potentially advantageous single, or interacting, alleles (see also ref. 31). It is intriguing to speculate whether the accelerated divergence rate of this region has contributed to the rapid expansion and evolution of the primate brain.

Methods

See Supplementary Information for details on clone path building, generation of sequence map, sizing of gaps and gene annotation. The final version of the clone path is available in AGP format (see http://www.ncbi.nlm.nih.gov/genome/guide/glossary.htm) at http://www.broad.mit.edu/tools/data/data-human.html.

Gene amplification and sequencing

TBLASTN (http://www.ncbi.nlm.nih.gov/BLAST) was used to identify DEFB105 and DEFA1 orthologues in 16 chimpanzees, 1 gibbon, 1 macaque and 4 dog breeds (akita, golden retriever, greyhound and mastiff). PCR primers for gene amplification were designed using Primer3 (http://frodo.wi.mit.edu/primer3) based on the species reference sequence. Human and macaque primers were used for gibbon. Amplified products were cloned, and for each individual/gene combination, 48 or 96 clones were sequenced.

Haplotype analysis

Neighbourhood Quality Standard³² (NQS) scores were computed for all sequenced products using the published constraints³². Reads were trimmed to the first and last three consecutive NQS bases, and aligned to the reference sequence using PatternHunter (http://www.bioinformaticssolutions.com). Multiple sequence alignments were built from the pairwise alignments and inspected to find SNPs that were: at NQS bases, supported by at least two reads, and in a ten base window where not more than two other variations were observed. To minimize false positives due to errors during PCR amplification, we restricted our analysis to haplotypes that differed in >3 bases.

References

International Human Genome Sequencing Consortium. Initial sequencing and analysis of the human genome. Nature 409, 860–921 (2001)
Article Google Scholar
Vallender, E. J. & Lahn, B. T. Positive selection on the human genome. Hum. Mol. Genet. 13 (suppl. 2), R245–R254 (2004)
Article CAS Google Scholar
Maxwell, A. I., Morrison, G. M. & Dorin, J. R. Rapid sequence divergence in mammalian β-defensins by adaptive evolution. Mol. Immunol. 40, 413–421 (2003)
Article CAS Google Scholar
Xiao, Y. et al. A genome-wide screen identifies a single β-defensin gene cluster in the chicken: implications for the origin and evolution of mammalian defensins. BMC Genom. 5, 56 (2004)
Article Google Scholar
Evans, P. D., Anderson, J. R., Vallender, E. J., Choi, S. S. & Lahn, B. T. Reconstructing the evolutionary history of microcephalin, a gene controlling human brain size. Hum. Mol. Genet. 13, 1139–1145 (2004)
Article CAS Google Scholar
Evans, P. D. et al. Microcephalin, a gene regulating brain size, continues to evolve adaptively in humans. Science 309, 1717–1720 (2005)
Article CAS ADS Google Scholar
International Human Genome Sequencing Consortium. Finishing the euchromatic sequence of the human genome. Nature 431, 931–945 (2004)
Article ADS Google Scholar
Grimwood, J. et al. The DNA sequence and biology of human chromosome 19. Nature 428, 529–535 (2004)
Article CAS ADS Google Scholar
Deloukas, P. et al. The DNA sequence and comparative analysis of human chromosome 10. Nature 429, 375–381 (2004)
Article CAS ADS Google Scholar
Martin, J. et al. The sequence and analysis of duplication-rich human chromosome 16. Nature 432, 988–994 (2004)
Article CAS ADS Google Scholar
Nusbaum, C. et al. DNA sequence and analysis of human chromosome 18. Nature 437, 551–555 (2005)
Article CAS ADS Google Scholar
Schmutz, J. et al. Quality assessment of the human genome sequence. Nature 429, 365–368 (2004)
Article CAS ADS Google Scholar
Pruitt, K. D., Tatusova, T. & Maglott, D. R. NCBI Reference Sequence (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins. Nucleic Acids Res. 33, D501–D504 (2005)
Article CAS Google Scholar
Kong, A. et al. A high-resolution recombination map of the human genome. Nature Genet. 31, 225–226 (2002)
Article Google Scholar
Giglio, S. et al. Olfactory receptor-gene clusters, genomic-inversion polymorphisms, and common chromosome rearrangements. Am. J. Hum. Genet. 68, 874–883 (2001)
Article CAS Google Scholar
Shimokawa, O. et al. Molecular characterization of inv dup del(8p): analysis of five cases. Am. J. Med. Genet. A 128, 133–137 (2004)
Article Google Scholar
Deloukas, P. et al. A physical map of 30,000 genes. Science 282, 744–746 (1998)
Article CAS ADS Google Scholar
Ashurst, J. L. et al. The Vertebrate Genome Annotation (Vega) database. Nucleic Acids Res. 33, D459–D465 (2005)
Article CAS Google Scholar
Hillier, L. W. et al. Generation and annotation of the DNA sequences of human chromosomes 2 and 4. Nature 434, 724–731 (2005)
Article CAS ADS Google Scholar
Ross, M. T. et al. The DNA sequence of the human X chromosome. Nature 434, 325–337 (2005)
Article CAS ADS Google Scholar
Ge, Y., Wagner, M. J., Siciliano, M. & Wells, D. E. Sequence, higher order repeat structure, and long-range organization of alpha satellite DNA specific to human chromosome 8. Genomics 13, 585–593 (1992)
Article CAS Google Scholar
Reich, D. E. et al. Human genome sequence variation and the influence of gene history, mutation and recombination. Nature Genet. 32, 135–142 (2002)
Article CAS Google Scholar
Lindblad-Toh, K. et al. Genome sequence, comparative analysis and haplotype structure of the domestic dog. Nature 438, 803–819 (2005)
Article CAS ADS Google Scholar
Mouse Genome Sequencing Consortium, Initial sequencing and comparative analysis of the mouse genome. Nature 420, 520–562 (2002)
Article Google Scholar
The Chimpanzee Sequencing and Analysis Consortium. Initial sequence of the chimpanzee genome and comparison with the human genome. Nature 437, 69–87 (2005)
Article Google Scholar
Hellmann, I. et al. Why do human diversity levels vary at a megabase scale? Genome Res. 15, 1222–1231 (2005)
Article CAS Google Scholar
Lehrer, R. I. Primate defensins. Nature Rev. Microbiol. 2, 727–738 (2004)
Article CAS Google Scholar
Hollox, E. J., Armour, J. A. & Barber, J. C. Extensive normal copy number variation of a β-defensin antimicrobial-gene cluster. Am. J. Hum. Genet. 73, 591–600 (2003)
Article CAS Google Scholar
Mars, W. M. et al. Inheritance of unequal numbers of the genes encoding the human neutrophil defensins HP-1 and HP-3. J. Biol. Chem. 270, 30371–30376 (1995)
Article CAS Google Scholar
Taudien, S. et al. Polymorphic segmental duplications at 8p23.1 challenge the determination of individual defensin gene repertoires and the assembly of a contiguous human reference sequence. BMC Genom. 5, 92 (2004)
Article Google Scholar
Wyckoff, G. J., Malcom, C. M., Vallender, E. J. & Lahn, B. T. A highly unexpected strong correlation between fixation probability of nonsynonymous mutations and mutation rate. Trends Genet. 21, 381–385 (2005)
Article CAS Google Scholar
Altshuler, D. et al. An SNP map of the human genome generated by reduced representation shotgun sequencing. Nature 407, 513–516 (2000)
Article CAS ADS Google Scholar

Download references

Acknowledgements

We thank L. Gaffney for help with figures and tables; L. French and her group at the Sanger Institute for attempting fibre FISH analysis to size some clone gaps in the tiling path of chromosome 8; E. Eichler and X. She for sharing their data on segmental duplications; T. Furey for help with lists of genetic markers and placement of RefSeq genes; M. Kamal for assistance and advice with synteny analysis; and K. Lindblad-Toh for sharing data from the dog genome project. We also acknowledge the HUGO Gene Nomenclature Committee (S. Povey, chair) for assigning official gene symbols. We are deeply grateful to all the members, present and past, of the Genome Sequencing Platform of the Broad Institute (and Whitehead Center for Genome Research), Keio University School of Medicine and the Institute of Molecular Biology at Jena for their dedication and for the consistent high quality of their data that made this work possible. This work was supported by grants from the National Human Genome Research Institute, RIKEN, the ‘Research for the Future’ Program from the Japan Society for the Promotion of Science (JSPS), the Ministry of Education, Culture, Sports, Science and Technology of Japan (MEXT), the Federal German Ministry of Education, Research and Technology, and the Thüringer Kultusministerium.

Author information

Charles A. Whittaker
Present address: MIT Center for Cancer Research, 77 Massachusetts Avenue, E18-570, Cambridge, Massachusetts, 02139, USA
Ken Dewar
Present address: McGill University and Genome Quebec Innovation Centre, Montreal, Quebec, H3A 1A4, Canada
Uwe Menzel
Present address: Department of Genetics and Pathology, Uppsala University, SE-751 85, Uppsala, Sweden
Shinsei Minoshima
Present address: Photon Medical Research Center, Hamamatsu University School of Medicine, Handayama, Hamamatsu, Shizuoka, 431-3192, Japan
Stephen C. J. Parker
Present address: Boston University Bioinformatics and Systems Biology Program, 24 Cummington St, Boston, Massachusetts, 02215, USA
Andreas Polley
Present address: TraitGenetics GmbH, Am Schwabeplan 1b, 06466, Gatersleben, Germany
Kathrin Reichwald
Present address: University Clinic for Child and Adolescent Psychiatry, University of Duisburg-Essen, Virchowstr. 174, 45147, Essen, Germany
Gaiping Wen
Present address: GSF-Forschungszentrum für Umwelt und Gesundheit, Ingolstädter Landstraße 1, 85674, Neuherberg, Germany
Andre Rosenthal
Present address: Signature Diagnostics AG, Voltaireweg 4B, 14469, Potsdam, Germany

Authors and Affiliations

Broad Institute of MIT and Harvard, 320 Charles St, Massachusetts, 02141, Cambridge, USA
Chad Nusbaum, Tarjei S. Mikkelsen, Michael C. Zody, Manuel Garber, Chinnappa D. Kodira, Charles A. Whittaker, Jean L. Chang, Christina A. Cuomo, Ken Dewar, Michael G. FitzGerald, Xiaoping Yang, Nicole R. Allen, Scott Anderson, Toby Bloom, Mark L. Borowsky, Jonathan Butler, April Cook, Benjamin Corum, Kurt DeArellano, David DeCaprio, Kathleen T. Dooley, Lester Dorris III, Reinhard Engels, Nabil Hafez, Daniel S. Hagopian, Jennifer L. Hall, David B. Jaffe, Asha Kamat, Tashi Lokitsang, Pendexter Macdonald, John E. Major, Charles D. Matthews, Evan Mauceli, Atanas H. Mihalev, Jerome W. Naylor, Robert Nicol, Cindy Nguyen, Sinéad B. O'Leary, Keith O'Neill, Stephen C. J. Parker, Christina K. Raymond, Joseph Rodriguez, Cherylyn L Smith, Jessica A. Talamas, Pema Tenzin, Kerri Topham, Vijay Venkataraman, Sarah K. Young, Qiandong Zeng, Andrew R. Zimmer, Bruce W. Birren & Eric S. Lander
Department of Molecular Biology, Keio University School of Medicine, 35 Shinanomachi, 160-8582, Shinjuku-ku, Tokyo, Japan
Shuichi Asakawa, Atsushi Shimizu, Teruyo Asakawa, Sabine K. Ishikawa, Jun Kudoh, Shinsei Minoshima, Yuji Murayama, Takashi Sasaki, Satoru Yamazaki & Nobuyoshi Shimizu
Genome Analysis, Institute of Molecular Biotechnology, Beutenbergstrasse 11, 07745, Jena, Germany
Stefan Taudien, Karin Blechschmidt, Gernot Glöckner, Rüdiger Lehmann, Uwe Menzel, Andreas Polley, Kathrin Reichwald, Markus Schilhabel, Roman Siddiqui, Gaiping Wen, Andre Rosenthal & Matthias Platzer
National Human Genome Research Institute, National Institutes of Health, 50 South Drive Rm 5529, Maryland, 20982, Bethesda, USA
Mary G. Schueler
HUGO Gene Nomenclature Committee (HGNC), The Galton Laboratory, Department of Biology, University College London, Wolfson House, 4 Stephenson Way, NW1 2HE, London, UK
Tam P. Sneddon

Authors

Chad Nusbaum
View author publications
You can also search for this author in PubMed Google Scholar
Tarjei S. Mikkelsen
View author publications
You can also search for this author in PubMed Google Scholar
Michael C. Zody
View author publications
You can also search for this author in PubMed Google Scholar
Shuichi Asakawa
View author publications
You can also search for this author in PubMed Google Scholar
Stefan Taudien
View author publications
You can also search for this author in PubMed Google Scholar
Manuel Garber
View author publications
You can also search for this author in PubMed Google Scholar
Chinnappa D. Kodira
View author publications
You can also search for this author in PubMed Google Scholar
Mary G. Schueler
View author publications
You can also search for this author in PubMed Google Scholar
Atsushi Shimizu
View author publications
You can also search for this author in PubMed Google Scholar
Charles A. Whittaker
View author publications
You can also search for this author in PubMed Google Scholar
Jean L. Chang
View author publications
You can also search for this author in PubMed Google Scholar
Christina A. Cuomo
View author publications
You can also search for this author in PubMed Google Scholar
Ken Dewar
View author publications
You can also search for this author in PubMed Google Scholar
Michael G. FitzGerald
View author publications
You can also search for this author in PubMed Google Scholar
Xiaoping Yang
View author publications
You can also search for this author in PubMed Google Scholar
Nicole R. Allen
View author publications
You can also search for this author in PubMed Google Scholar
Scott Anderson
View author publications
You can also search for this author in PubMed Google Scholar
Teruyo Asakawa
View author publications
You can also search for this author in PubMed Google Scholar
Karin Blechschmidt
View author publications
You can also search for this author in PubMed Google Scholar
Toby Bloom
View author publications
You can also search for this author in PubMed Google Scholar
Mark L. Borowsky
View author publications
You can also search for this author in PubMed Google Scholar
Jonathan Butler
View author publications
You can also search for this author in PubMed Google Scholar
April Cook
View author publications
You can also search for this author in PubMed Google Scholar
Benjamin Corum
View author publications
You can also search for this author in PubMed Google Scholar
Kurt DeArellano
View author publications
You can also search for this author in PubMed Google Scholar
David DeCaprio
View author publications
You can also search for this author in PubMed Google Scholar
Kathleen T. Dooley
View author publications
You can also search for this author in PubMed Google Scholar
Lester Dorris III
View author publications
You can also search for this author in PubMed Google Scholar
Reinhard Engels
View author publications
You can also search for this author in PubMed Google Scholar
Gernot Glöckner
View author publications
You can also search for this author in PubMed Google Scholar
Nabil Hafez
View author publications
You can also search for this author in PubMed Google Scholar
Daniel S. Hagopian
View author publications
You can also search for this author in PubMed Google Scholar
Jennifer L. Hall
View author publications
You can also search for this author in PubMed Google Scholar
Sabine K. Ishikawa
View author publications
You can also search for this author in PubMed Google Scholar
David B. Jaffe
View author publications
You can also search for this author in PubMed Google Scholar
Asha Kamat
View author publications
You can also search for this author in PubMed Google Scholar
Jun Kudoh
View author publications
You can also search for this author in PubMed Google Scholar
Rüdiger Lehmann
View author publications
You can also search for this author in PubMed Google Scholar
Tashi Lokitsang
View author publications
You can also search for this author in PubMed Google Scholar
Pendexter Macdonald
View author publications
You can also search for this author in PubMed Google Scholar
John E. Major
View author publications
You can also search for this author in PubMed Google Scholar
Charles D. Matthews
View author publications
You can also search for this author in PubMed Google Scholar
Evan Mauceli
View author publications
You can also search for this author in PubMed Google Scholar
Uwe Menzel
View author publications
You can also search for this author in PubMed Google Scholar
Atanas H. Mihalev
View author publications
You can also search for this author in PubMed Google Scholar
Shinsei Minoshima
View author publications
You can also search for this author in PubMed Google Scholar
Yuji Murayama
View author publications
You can also search for this author in PubMed Google Scholar
Jerome W. Naylor
View author publications
You can also search for this author in PubMed Google Scholar
Robert Nicol
View author publications
You can also search for this author in PubMed Google Scholar
Cindy Nguyen
View author publications
You can also search for this author in PubMed Google Scholar
Sinéad B. O'Leary
View author publications
You can also search for this author in PubMed Google Scholar
Keith O'Neill
View author publications
You can also search for this author in PubMed Google Scholar
Stephen C. J. Parker
View author publications
You can also search for this author in PubMed Google Scholar
Andreas Polley
View author publications
You can also search for this author in PubMed Google Scholar
Christina K. Raymond
View author publications
You can also search for this author in PubMed Google Scholar
Kathrin Reichwald
View author publications
You can also search for this author in PubMed Google Scholar
Joseph Rodriguez
View author publications
You can also search for this author in PubMed Google Scholar
Takashi Sasaki
View author publications
You can also search for this author in PubMed Google Scholar
Markus Schilhabel
View author publications
You can also search for this author in PubMed Google Scholar
Roman Siddiqui
View author publications
You can also search for this author in PubMed Google Scholar
Cherylyn L Smith
View author publications
You can also search for this author in PubMed Google Scholar
Tam P. Sneddon
View author publications
You can also search for this author in PubMed Google Scholar
Jessica A. Talamas
View author publications
You can also search for this author in PubMed Google Scholar
Pema Tenzin
View author publications
You can also search for this author in PubMed Google Scholar
Kerri Topham
View author publications
You can also search for this author in PubMed Google Scholar
Vijay Venkataraman
View author publications
You can also search for this author in PubMed Google Scholar
Gaiping Wen
View author publications
You can also search for this author in PubMed Google Scholar
Satoru Yamazaki
View author publications
You can also search for this author in PubMed Google Scholar
Sarah K. Young
View author publications
You can also search for this author in PubMed Google Scholar
Qiandong Zeng
View author publications
You can also search for this author in PubMed Google Scholar
Andrew R. Zimmer
View author publications
You can also search for this author in PubMed Google Scholar
Andre Rosenthal
View author publications
You can also search for this author in PubMed Google Scholar
Bruce W. Birren
View author publications
You can also search for this author in PubMed Google Scholar
Matthias Platzer
View author publications
You can also search for this author in PubMed Google Scholar
Nobuyoshi Shimizu
View author publications
You can also search for this author in PubMed Google Scholar
Eric S. Lander
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Chad Nusbaum or Nobuyoshi Shimizu.

Ethics declarations

Competing interests

Accession numbers for all clones contributing to the finished sequence of human chromosome 8 can be found in Supplementary Table S2. The updated human chromosome 8 sequence can be accessed through GenBank accession number NC_000008. Reprints and permissions information is available at npg.nature.com/reprintsandpermissions. The authors declare no competing financial interests.

Supplementary information

Supplementary Notes

This file contains Supplementary Methods (mapping, sequencing and annotation; Defensin gene sequencing), Supplementary Results (gaps in the clone path, extreme genes, CSMD gene family, structure of the chromosome 8 pericentromeric region) and a Supplementary Discussion of annotation rules and methods, comparison of manual annotations of selected regions of human chromosome 8 and alternative explanations for apparent rapid divergence on 8p22-23. (DOC 56 kb)

Supplementary Tables

This file contains Supplementary Tables 1–8. (DOC 2643 kb)

Supplementary Figure 1

Relationship of the finished sequence map to genetic and radiation hybrid maps of chromosome 8. (PDF 286 kb)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Nusbaum, C., Mikkelsen, T., Zody, M. et al. DNA sequence and analysis of human chromosome 8. Nature 439, 331–335 (2006). https://doi.org/10.1038/nature04406

Download citation

Received: 05 August 2005
Accepted: 06 October 2005
Issue Date: 19 January 2006
DOI: https://doi.org/10.1038/nature04406

This article is cited by

Mechanisms and regulation of defensins in host defense
- Jie Fu
- Xin Zong
- Yizhen Wang
Signal Transduction and Targeted Therapy (2023)
A generalizable deep learning framework for inferring fine-scale germline mutation rate maps
- Yiyuan Fang
- Shuyi Deng
- Cai Li
Nature Machine Intelligence (2022)
PVT1 signals an androgen-dependent transcriptional repression program in prostate cancer cells and a set of the repressed genes predicts high-risk tumors
- Alexandre Videira
- Felipe C. Beckedorff
- Sergio Verjovski-Almeida
Cell Communication and Signaling (2021)
High-resolution mapping of centromeric protein association using APEX-chromatin fibers
- Eftychia Kyriacou
- Patrick Heun
Epigenetics & Chromatin (2018)
Centromere evolution and CpG methylation during vertebrate speciation
- Kazuki Ichikawa
- Shingo Tomioka
- Shinich Morishita
Nature Communications (2017)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

DNA sequence and analysis of human chromosome 8

Abstract

Similar content being viewed by others

The structure, function and evolution of a complete human chromosome 8

Pan-genomics in the human genome era

The complete sequence of a human Y chromosome

Main

Methods

Gene amplification and sequencing

Haplotype analysis

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding authors

Ethics declarations

Competing interests

Supplementary information

Supplementary Notes

Supplementary Tables

Supplementary Figure 1

Rights and permissions

About this article

Cite this article

This article is cited by

Mechanisms and regulation of defensins in host defense

A generalizable deep learning framework for inferring fine-scale germline mutation rate maps

PVT1 signals an androgen-dependent transcriptional repression program in prostate cancer cells and a set of the repressed genes predicts high-risk tumors

High-resolution mapping of centromeric protein association using APEX-chromatin fibers

Centromere evolution and CpG methylation during vertebrate speciation

Comments

Search

Quick links

Abstract

Similar content being viewed by others

Main

Methods

Gene amplification and sequencing

Haplotype analysis

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding authors

Ethics declarations

Competing interests

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links