The DNA sequence and analysis of human chromosome 13

Dunham, A.; Matthews, L. H.; Burton, J.; Ashurst, J. L.; Howe, K. L.; Ashcroft, K. J.; Beare, D. M.; Burford, D. C.; Hunt, S. E.; Griffiths-Jones, S.; Jones, M. C.; Keenan, S. J.; Oliver, K.; Scott, C. E.; Ainscough, R.; Almeida, J. P.; Ambrose, K. D.; Andrews, D. T.; Ashwell, R. I. S.; Babbage, A. K.; Bagguley, C. L.; Bailey, J.; Bannerjee, R.; Barlow, K. F.; Bates, K.; Beasley, H.; Bird, C. P.; Bray-Allen, S.; Brown, A. J.; Brown, J. Y.; Burrill, W.; Carder, C.; Carter, N. P.; Chapman, J. C.; Clamp, M. E.; Clark, S. Y.; Clarke, G.; Clee, C. M.; Clegg, S. C. M.; Cobley, V.; Collins, J. E.; Corby, N.; Coville, G. J.; Deloukas, P.; Dhami, P.; Dunham, I.; Dunn, M.; Earthrowl, M. E.; Ellington, A. G.; Faulkner, L.; Frankish, A. G.; Frankland, J.; French, L.; Garner, P.; Garnett, J.; Gilbert, J. G. R.; Gilson, C. J.; Ghori, J.; Grafham, D. V.; Gribble, S. M.; Griffiths, C.; Hall, R. E.; Hammond, S.; Harley, J. L.; Hart, E. A.; Heath, P. D.; Howden, P. J.; Huckle, E. J.; Hunt, P. J.; Hunt, A. R.; Johnson, C.; Johnson, D.; Kay, M.; Kimberley, A. M.; King, A.; Laird, G. K.; Langford, C. J.; Lawlor, S.; Leongamornlert, D. A.; Lloyd, D. M.; Lloyd, C.; Loveland, J. E.; Lovell, J.; Martin, S.; Mashreghi-Mohammadi, M.; McLaren, S. J.; McMurray, A.; Milne, S.; Moore, M. J. F.; Nickerson, T.; Palmer, S. A.; Pearce, A. V.; Peck, A. I.; Pelan, S.; Phillimore, B.; Porter, K. M.; Rice, C. M.; Searle, S.; Sehra, H. K.; Shownkeen, R.; Skuce, C. D.; Smith, M.; Steward, C. A.; Sycamore, N.; Tester, J.; Thomas, D. W.; Tracey, A.; Tromans, A.; Tubby, B.; Wall, M.; Wallis, J. M.; West, A. P.; Whitehead, S. L.; Willey, D. L.; Wilming, L.; Wray, P. W.; Wright, M. W.; Young, L.; Coulson, A.; Durbin, R.; Hubbard, T.; Sulston, J. E.; Beck, S.; Bentley, D. R.; Rogers, J.; Ross, M. T.

doi:10.1038/nature02379

Article
Published: 01 April 2004

The DNA sequence and analysis of human chromosome 13

A. Dunham¹,
L. H. Matthews¹,
J. Burton¹,
J. L. Ashurst¹,
K. L. Howe¹,
K. J. Ashcroft¹,
D. M. Beare¹,
D. C. Burford¹,
S. E. Hunt¹,
S. Griffiths-Jones¹,
M. C. Jones¹,
S. J. Keenan¹,
K. Oliver¹,
C. E. Scott¹,
R. Ainscough¹,
J. P. Almeida¹,
K. D. Ambrose¹,
D. T. Andrews¹,
R. I. S. Ashwell¹,
A. K. Babbage¹,
C. L. Bagguley¹,
J. Bailey¹,
R. Bannerjee¹,
K. F. Barlow¹,
K. Bates¹,
H. Beasley¹,
C. P. Bird¹,
S. Bray-Allen¹,
A. J. Brown¹,
J. Y. Brown¹,
W. Burrill¹,
C. Carder¹,
N. P. Carter¹,
J. C. Chapman¹,
M. E. Clamp¹,
S. Y. Clark¹,
G. Clarke¹,
C. M. Clee¹,
S. C. M. Clegg¹,
V. Cobley¹,
J. E. Collins¹,
N. Corby¹,
G. J. Coville¹,
P. Deloukas¹,
P. Dhami¹,
I. Dunham¹,
M. Dunn¹,
M. E. Earthrowl¹,
A. G. Ellington¹,
L. Faulkner¹,
A. G. Frankish¹,
J. Frankland¹,
L. French¹,
P. Garner¹,
J. Garnett¹,
J. G. R. Gilbert¹,
C. J. Gilson¹,
J. Ghori¹,
D. V. Grafham¹,
S. M. Gribble¹,
C. Griffiths¹,
R. E. Hall¹,
S. Hammond¹,
J. L. Harley¹,
E. A. Hart¹,
P. D. Heath¹,
P. J. Howden¹,
E. J. Huckle¹,
P. J. Hunt¹,
A. R. Hunt¹,
C. Johnson¹,
D. Johnson¹,
M. Kay¹,
A. M. Kimberley¹,
A. King¹,
G. K. Laird¹,
C. J. Langford¹,
S. Lawlor¹,
D. A. Leongamornlert¹,
D. M. Lloyd¹,
C. Lloyd¹,
J. E. Loveland¹,
J. Lovell¹,
S. Martin¹,
M. Mashreghi-Mohammadi¹,
S. J. McLaren¹,
A. McMurray¹,
S. Milne¹,
M. J. F. Moore¹,
T. Nickerson¹,
S. A. Palmer¹,
A. V. Pearce¹,
A. I. Peck¹,
S. Pelan¹,
B. Phillimore¹,
K. M. Porter¹,
C. M. Rice¹,
S. Searle¹,
H. K. Sehra¹,
R. Shownkeen¹,
C. D. Skuce¹,
M. Smith¹,
C. A. Steward¹,
N. Sycamore¹,
J. Tester¹,
D. W. Thomas¹,
A. Tracey¹,
A. Tromans¹,
B. Tubby¹,
M. Wall¹,
J. M. Wallis¹,
A. P. West¹,
S. L. Whitehead¹,
D. L. Willey¹,
L. Wilming¹,
P. W. Wray¹,
M. W. Wright²,
L. Young¹,
A. Coulson¹,
R. Durbin¹,
T. Hubbard¹,
J. E. Sulston¹,
S. Beck¹,
D. R. Bentley¹,
J. Rogers¹ &
…
M. T. Ross¹

Nature volume 428, pages 522–528 (2004)Cite this article

15k Accesses
69 Citations
9 Altmetric
Metrics details

Abstract

Chromosome 13 is the largest acrocentric human chromosome. It carries genes involved in cancer including the breast cancer type 2 (BRCA2) and retinoblastoma (RB1) genes, is frequently rearranged in B-cell chronic lymphocytic leukaemia, and contains the DAOA locus associated with bipolar disorder and schizophrenia. We describe completion and analysis of 95.5 megabases (Mb) of sequence from chromosome 13, which contains 633 genes and 296 pseudogenes. We estimate that more than 95.4% of the protein-coding genes of this chromosome have been identified, on the basis of comparison with other vertebrate genome sequences. Additionally, 105 putative non-coding RNA genes were found. Chromosome 13 has one of the lowest gene densities (6.5 genes per Mb) among human chromosomes, and contains a central region of 38 Mb where the gene density drops to only 3.1 genes per Mb.

You have full access to this article via your institution.

Download PDF

Genome-wide association studies

Article 26 August 2021

Leveraging functional genomic annotations and genome coverage to improve polygenic prediction of complex traits within and between ancestries

Article Open access 30 April 2024

Single-cell analysis reveals context-dependent, cell-level selection of mtDNA

Article Open access 24 April 2024

Main

The draft sequence of the human genome¹ has provided the basis for a systematic effort to finish each chromosome^1,2,3,4,5 in order to produce an accurate and detailed description of the entire genome. In common with the other acrocentric autosomes (14, 15, 21, and 22) the short arm of chromosome 13 is heterochromatic and contains families of repeated sequences, including the ribosomal RNA gene arrays⁶. The long arm is euchromatic and contains most or all of the protein-coding genes of the chromosome. We have completed the euchromatic sequence and examined the characteristics of this gene-poor autosome in relation to other human chromosomes, and to the corresponding sequence in other species. The annotation of the sequence is available via the Vertebrate Genome Annotation (VEGA) database (http://vega.sanger.ac.uk/Homo_sapiens), and will provide a platform for the continued study of medical genetics, genome instability and evolution of human chromosomes.

Clone map and finished sequence

The physical map of the long arm of chromosome 13 comprises five contigs. A minimally overlapping set of 863 clones (the ‘tilepath’) was selected that contained bacterial artificial chromosome (BAC) clones, supplemented with three yeast artificial chromosome (YAC) clones where no bacterial clones could be found (see Supplementary Table S1). The sequence derived from the tilepath clones comprises 95,564,076 base pairs (bp), determined to an accuracy well above 99.99% using procedures described previously⁷. At the proximal end, the finished sequence stretches into the pericentromeric region. Here the sequence is highly similar (93.9% sequence identity) to the pericentromeric region of chromosome 21. As a result, we have not so far been able to extend the map further towards the centromere. At the distal end, the finished sequence extends to within 15 kilobases (kb) of the TTAGGG telomeric repeats (http://www.wistar.upenn.edu/Riethman). Six gaps remain in the tilepath despite our screening of genomic libraries containing a combined 87-fold coverage of the chromosome. All the gap sizes were estimated by fluorescence in situ hybridization (FISH) analysis and represent a combined total of 1,665 kb. The long arm of chromosome 13 therefore measures 97.2 Mb, and the finished sequence covers 98.3% of the total (see http://www.sanger.ac.uk/HGP/Chr13 and Supplementary Table S2).

The finished sequence contains all markers previously mapped to chromosome 13 in the deCODE genetic map⁸, 98% of those in the Marshfield genetic map⁹ and 98% of those in the Genemap ’99 radiation hybrid map¹⁰. Furthermore, there is excellent concordance between marker order on the maps and in the sequence. An additional check of the integrity of the finished sequence was obtained by examining the alignment of paired end-sequences of fosmid or BAC inserts in the finished sequence for evidence of deletions or mis-assemblies (D. Jaffe, personal communication). These alignments were consistent throughout the euchromatic sequence, with a single exception. One BAC (AL355611) was found to be ∼20 kb longer than three fosmids aligned to that region. Further examination of the physical map in this region confirmed that the fingerprint patterns of the other BACs in the map are entirely consistent with the sequence of AL355611. We therefore conclude that there is a length variation between the DNAs represented in the BAC versus the fosmid libraries.

Global analysis of the sequence was performed to provide data on distribution of genes, repeats, G + C content, CpG islands, single nucleotide polymorphisms (SNPs), and recombination rates. The complete analysis of the sequence can be seen in Supplementary Fig. 1. Some specific features are highlighted below.

Gene index

Gene structures were curated manually following analysis of the finished sequence by alignment to all publicly available expressed sequences and application of gene prediction algorithms. A total of 929 gene structures were identified and classified (according to the definitions described in Methods) as known genes (231), novel genes (97), novel transcripts (145), putative genes (160) and pseudogenes (296, of which 268 are processed and 28 are unprocessed) (see Table 1 and Supplementary Table S3). At the time of this analysis, one gene that was previously assigned to chromosome 13 (RHOK) was missing from the analysed sequence, and two other genes were only partially represented (ATP4B in AL4421245 and RASA3 in AL161774). The missing parts of ATP4B and RASA3 have since been identified in the sequences of two recently finished clones, BX537316 and BX537329, respectively. The sequence of BX537316, which lies adjacent to gap 4 in the chromosome map (Supplementary Table S2), also contains the first two exons of RHOK.

Table 1 Summary of the annotated genes on chromosome 13, excluding predicted ncRNA genes

Full size table

The largest gene on chromosome 13 spans 1,468,562 bp (GPC5). The largest exon is the single exon of the Spastic ataxia gene (SACS), which measures 12,865 bp. The average number of exons per gene is 5.19, ranging from single-exon genes, of which there are 44, to a gene containing 84 exons (MYCBP2). Predicted proteins were classified using the Interpro database (http://www.ebi.ac.uk/interpro/). The most populous protein families on chromosome 13 (Supplementary Table S4) broadly reflect those that are most populous in the genome as a whole¹. We searched for CpG islands 5 kb upstream and 1 kb downstream of each annotated gene (see Methods), and found that 67% of known and novel genes are associated with a CpG island, which is consistent with previous reports^2,5,11.

Owing to the absence of coding potential and the lack of primary sequence conservation, non-coding RNA (ncRNA) genes pose a significant challenge for computational prediction. Furthermore, the discrimination between genes and pseudogenes by computational methods has not been possible for most ncRNA genes. However, recent software advances and the development of a database of structural RNA alignments (Rfam)¹² now permit a more comprehensive search for ncRNA genes (see Supplementary Table S5). Chromosome 13 contains only five of the 616 transfer RNA genes that have been found in the human genome. In addition, two tRNA pseudogenes were found. Rfam analysis identifies 98 additional putative ncRNA genes on chromosome 13, including 37 spliceosomal RNAs and 20 Y RNAs (components of the ribonucleoprotein particle Ro). Two of these genes, a U6 and a U2 spliceosomal RNA, are confirmed on the basis of near identity to previously published sequences.

MicroRNAs (miRNAs) are approximately 21-bp double-stranded products excised from a hairpin precursor, which regulate the expression of other genes by complementary binding to untranslated regions of a messenger RNA target. The miRNA Registry (http://www.sanger.ac.uk/Software/Rfam/mirna/) lists 147 identified human miRNAs, eight of which are found in two clusters in the chromosome 13 sequence. miR-15 and miR-16 locate to 13q14 and are separated by only 100 bp. These genes are deleted or downregulated in patients with chronic lymphocytic leukaemia¹³. A second group of six miRNAs is clustered within an 800 bp region of 13q32. The clustering of these miRNAs suggests that each set may be processed from the same primary transcript¹⁴. Both miRNAs in the first cluster and five of the genes in the second cluster are conserved in order and orientation on mouse chromosome 14.

Chromosome landscape

Chromosome 13 shows striking features of low gene density compared to the other finished, annotated autosomes, and also a very variable gene distribution. The overall gene density is the lowest of all the sequenced autosomes (see Table 2), with an average (excluding pseudogenes and ncRNA genes) of 6.5 genes per Mb (refs 2 , 3 , 4 , 5, 15, 16). This analysis extends and confirms previous observations^10,17,18. Consistent with the low gene density, the G + C content of 38.5% and the predicted CpG island density of 5.4 Mb^-1 are considerably below the genome averages (Table 2). Exon coverage of chromosome 13 sequence is also substantially lower (1.3%) than that of other autosomes, except chromosome 7. However, the average gene length on chromosome 13 is 57 kb, which is almost double that of other chromosomes (31 and 26 kb, for chromosomes 6 and 22, respectively). As a result, the 633 genes cover 37% of the sequence, which is only slightly lower than that of the other finished, annotated autosomes (Table 2).

Table 2 Comparison of chromosome 13 features with those of other sequenced autosomes

Full size table

Gene density varies considerably along the chromosome, as do other characteristics of the gene-rich and gene-poor regions (see Fig. 1 and Supplementary Fig. 1). Here we define ‘gene-rich’ as containing more than 15 genes per Mb, and ‘gene-poor’ as containing fewer than 5 genes per Mb. A detailed picture of the characteristics of an example of each regional class is shown in Fig. 2. A comparison of the two regions can be seen in Table 3. There is a 37.8-Mb region, from 52.9–90.7 Mb, where the average gene density drops to 3 genes per Mb. This region actually comprises two gene-poor segments (52.9–71.9 Mb and 78.9–89.9 Mb) flanking a section with a gene density of 7 Mb^-1 (Fig. 1). The first gene-poor section contains a 3-Mb region with no genes (53.9–56.9 Mb).

**Figure 1: Genetic and physical characteristics of the chromosome 13 sequence.**

**Figure 2: Characteristics of a gene-rich (a) and a gene-poor (b) region.**

Table 3 Comparison of the gene-rich (17.9–21 Mb) and gene-poor (53–56 Mb) regions

Full size table

The two major gene-rich areas are at either end of the q arm, with 90 of the predicted coding genes lying within 3 Mb of either end of the euchromatic region. As observed with other completed human chromosomes, gene-poor regions have a low G + C content, a low SINE coverage and a high LINE coverage, relative to genome averages. These trends are reversed for the gene-rich regions. Figure 1 shows that between positions 90 and 100 Mb, although there are very few genes, a large percentage of the region is covered by transcribed gene structures (exons plus introns). This region contains the largest gene on the chromosome (GPC5) as well as two others (GPC6 and HS6ST3) each covering over 500 kb. In sharp contrast to the protein-coding genes, the ncRNA genes are distributed evenly between the gene-rich and gene-poor regions (see Fig. 1).

A total of 96,894 SNPs, from the dbSNP database, were mapped onto the sequence. The coding regions of the annotated genes contain 654 SNPs (1 SNP per 1.6 kb). These can be subdivided into 345 synonymous and 309 non-synonymous cSNPs. To analyse the overall distribution along the chromosome, a subset (38,069) of SNPs identified previously by alignment of random shotgun sequence to the draft sequence were plotted separately (see Fig. 2a, b and Supplementary Fig. 1). From this distribution plot, there is no obvious difference in the variation rate between the gene-rich and gene-poor areas. There is one region between positions 18.0 and 18.4 Mb (Figs 1 and 2a), where the SNP density is substantially higher than the average, reaching one SNP per 0.3 kb (1,329 SNPs in 400 kb). There is a known duplication with chromosome 21 in this region, and it is possible that the apparently high SNP density is due to the presence of paralogous sequence variants, as has been suggested previously¹⁶.

Around 5% of the human genome may be accounted for by segmental duplications, and this may play an important role in genetic disease and genome evolution^19,20. A study by Cheung and colleagues¹⁹, which classified regions as segmental duplications if they show at least 90% homology over a minimum of 5 kb, suggested that there is approximately 1.8 Mb of duplicated sequence on chromosome 13. This sequence comprises 0.9 Mb of intrachromosomal and 1.2 Mb of interchromosomal duplications²⁰. This includes 0.3 Mb of sequence common to both categories. For example, the TPIP gene on chromosome 13 has undergone both inter- and intrachromosomal duplications. Guipponi and colleagues²¹ described phylogenetic analysis of the TPTE gene family, of which TPIP is a member, and suggested that all family members originate from a common ancestor since the divergence of human and mouse lineages, because there is only a single Tpte gene in mouse. There are seven other genes surrounding the mouse Tpte gene on chromosome 8, each of which has a functional homologue on human chromosome 13. This observation and the fact that four of the genes have homology only to chromosome 13 suggest that the human orthologue of the Tpte gene lies on chromosome 13. There have been a number of duplication events resulting in one functional copy and four pseudogenes of TPIP on chromosome 13. In addition, there is another functional member of the gene family, TPTE, on chromosome 21 and a number of pseudogenes on chromosomes 3, 15, 22 and Y.

Comparison to the genetic map

Recombination rates in the human genome are higher on average in females than males, and vary considerably along each chromosome. The sex-average genetic length of 13q is 129.52 cM, equating to an average recombination rate of 1.3 cM Mb^-1 (slightly higher than the genome average of 1.13 cM Mb^-1). Overall female and male rates for chromosome 13 are 1.6 and 1.1 cM Mb^-1 respectively. Figure 1 illustrates male, female and sex-averaged recombination as a function of chromosome position, correlated with a range of other features of the sequence annotation described above. In the 2 Mb closest to the centromere, the recombination rate in females compared to males is high (4.4 versus 1.7 cM Mb^-1) (Supplementary Table S6). Near the telomere, by contrast, the male recombination rate is the higher of the two (4.5 compared to 1.9 cM Mb^-1 in females).

Recombination rate appears to be correlated with gene density (Fig. 1). The recombination rate in male meiosis is particularly low (0.36 cM Mb^-1) in the central 42 Mb portion of the chromosome between D13S1269 and D13S71, which corresponds to the region of lowest gene density on the chromosome. The female rate is 1.1 cM Mb^-1 over the same region. Within this region there is also a noticeable increase in female recombination rate in the region of higher gene density (at position 72–80 Mb).

From analysis of the draft human genome sequence, Yu²² predicted two recombination ‘deserts’ on chromosome 13, which have a sex-average recombination rate less than or equal to 0.3 cM Mb^-1 for physical distances up to 5 Mb in length, D13S164–D13S1228 (0.12 cM Mb^-1) and D13S1301–D13S233 (0.22 cM Mb^-1). Re-analysis of these regions using the finished sequence indicates that the former has a sex-average recombination rate of 0.84 cM Mb^-1, and so cannot be classified as a recombination desert. The latter, however, was contained within a 3.7-Mb region with a recombination rate of 0.16 cM Mb^-1, and so meets the criteria for a recombination desert. In the same study²², recombination ‘jungles’ were previously defined as having a sex-average recombination rate of 3 cM Mb^-1 or more in regions up to 5 Mb, but none were predicted on chromosome 13. In our analysis there is one region of 4.1 Mb that meets the criteria for a jungle of high recombination, D13S1315–D13S1825 (3.2 cM Mb^-1).

Comparative analysis

The availability of assembled genome sequences for Mus musculus (mouse), Rattus norvegicus (rat), Tetraodon nigroviridis (Tetraodon), Fugu rubripes (Fugu) and Danio rerio (zebrafish), allowed us to identify regions where there is conservation between them and human chromosome 13 (see Supplementary Table S7). In general, greater similarity is expected between human and mouse or rat compared to Fugu, Tetraodon and zebrafish. We observed that 96% of known and novel genes on human chromosome 13 have exons conserved in both rodents, whereas only 81% have exons conserved in all three fishes. Most (95%) of the regions conserved in humans and fish correspond to annotated exons. By contrast, only 25% of the regions conserved in both mouse and rat overlap annotated exons, and at least some of the remainder are expected to result from selection for functionally important non-coding regions. It is of interest that regions with the largest coverage of conserved sequence between humans and rodents are found in gene-poor areas of the chromosome (see mouse and rat homology tracks in Fig. 1).

To estimate the completeness of our annotation of exons in protein-coding genes, we adopted the approach of previous studies^2,5. 2,553 regions on chromosome 13 were found to be conserved in all six reference genomes analysed here. Of these, 2,441 share overlap with 2,337 annotated exons on 13q, while the remaining 112 do not. On this basis, at least 95.4% (2,337/2,337 + 112) of exons on chromosome 13 are included in our annotation.

There are 112 conserved regions that do not correspond to annotated exons. Some or all of them might be exons that remain unconfirmed owing to a lack of transcriptional evidence, or they might be conserved regulatory regions. Recent reports suggest that non-exonic sequences conserved across species may be regulatory or structural elements^23,24. Nobrega and colleagues showed that seven evolutionarily conserved sequences, from in and around the mouse Dach gene, enhanced expression of reporter genes in specific tissues²³. These regions were also identified in our analysis. In fact, 28 of the 112 conserved regions are located in the introns of the DACH1 gene. The MultiPipMaker program²⁵ was used to compare sequence from the DACH1 gene in human, mouse, rat, Fugu and zebrafish (Fig. 3). One of the regions of conservation has a 100% match to a putative miRNA²⁶. Further studies are required to discover whether other of the conserved intronic regions have any regulatory effect.

**Figure 3: MultiPipMaker analysis²⁵ of the human *DACH1* gene showing conserved regions in mouse, rat, *Fugu* and zebrafish.**

Medical implications

Forty-eight mendelian conditions listed in Online Mendelian Inheritance in Man (OMIM) have been linked to genes on chromosome 13 (Supplementary Table S8). Thirty-five of these genes have been cloned and can be positioned on the sequence. The map and sequence data for chromosome 13 have contributed to the identification of a number of these. Notably, an early phase of the sequencing led to characterization of the breast cancer type-2 gene BRCA2 (ref. 27). More recently, sequence from the 13q34 region was used in the identification of the DAOA locus, which is associated with schizophrenia and bipolar disorder^28,29.

The sequence is being applied to search for genes that are implicated in human disease but are as yet unidentified. B-cell chronic lymphocytic leukaemia (CLL) is one of the most common leukaemias in the western world and approximately 10% of CLL patients have a homozygous deletion in 13q14.3 (ref. 30): the chromosome 13 data have allowed further refinement of the region of minimal deletion³⁰. Follicular lymphoma (FL) accounts for approximately 25% of non-Hodgkin's lymphomas in the western world³¹. Using clones from the 13q32–q33 region of the tilepath, a region of recurrent amplification in FL patients has been identified³². Asthma and some forms of dermatitis are atopic or immunoglobulin E (IgE)-mediated diseases, which have shown consistent linkage to 13q14 (refs 33, 34). Zhang and colleagues have localized a quantitative trait locus influencing IgE levels and asthma using a dense SNP map of the region³⁵. In the same study, an association with severe clinical asthma was found with several alleles of the PHF11 gene.

In addition to BRCA2, a number of other genes involved in tumorigenesis have been identified on chromosome 13, including the retinoblastoma gene RB1 and the alveolar rhabdomyosarcoma gene FOXO1A. As described in Supplementary Table S8, several other such genes have been linked to chromosome 13 but remain to be identified. In the Mitleman database of recurrent chromosome aberrations in cancer (http://cgap.nci.nih.gov/Chromosomes/Mitelman), more than 1,400 cases affect chromosome 13. The availability of a tilepath of sequence clones will allow techniques such as array comparative genomic hybridization to refine the regions of rearrangement in these cases.

Methods

Using the Genemap ’99 radiation hybrid map¹⁰ as a starting point, a landmark mapping approach was used to seed and anchor BAC clone contigs³⁶. A total of 863 markers (8.8 markers Mb^-1) were used to screen the RPCI-11 BAC library³⁷, resulting in the identification of 3,354 clones. Using the whole genome fingerprint data from the International Human Genome Mapping Consortium³⁸, the BACs were built into sequence-ready contigs using a combination of shared HindIII fingerprint bands and marker content. Gaps in the map were closed by directed screening of BAC, P1-derived artificial chromosome (PAC) and yeast artificial chromosome (YAC) libraries with probes derived from contig ends. Additionally, end-sequence data from the WIBR whole-genomic fosmid library were used in an effort to identify further contig extensions. All the gaps were sized using fluorescent in situ hybridization on DNA fibres, apart from gap 1 (Supplementary Table S2), which was sized on metaphase chromosomes.

All the tilepath clones processed at the Sanger Institute were sequenced using a shotgun approach and assembled as previously reported⁵. All finished clones meet or exceed the agreed international finishing standards of 99.99% sequence accuracy.

The finished genomic sequence was analysed using an automatic ENSEMBL³⁹ pipeline with modifications to aid the manual curation process. The G + C content of each clone sequence was analysed and putative CpG islands marked. CpG islands were predicted using a modification of the CpG program (G. Micklem, personal communication). The identification of interspersed repeats using RepeatMasker; simple repeats using Tandem Repeat Finder⁴⁰; matches to vertebrate complementary DNAs and expressed sequence tags (ESTs) using WU-BLASTN (W. Gish 1996–2002, http://blast.wustl.edu) and EST_GENOME; and ab initio gene prediction using FGENESH and GENSCAN, were as described previously². A protein database combining non-redundant data from SwissProt⁴¹ and TrEMBL⁴¹ was searched using WU-BLASTX. ENSEMBL gene predictions, including the EST gene build, were displayed on each clone present in the finished sequence assembly. The predicted gene structures were manually annotated according to the human annotation workshop (HAWK) guidelines (http://www.sanger.ac.uk/HGP/havana/hawk.shtml). The gene categories used were as described on the VEGA website. Known genes are identical to known human cDNAs or protein sequences and should have an entry in Locuslink (http://www.ncbi.nlm.nih.gov/LocusLink). Novel genes have an open reading frame (ORF), are identical to spliced ESTs or have some similarity to other genes or proteins. Novel transcripts are similar to novel genes, but the ORF cannot be determined with confidence. Putative genes are identical to spliced human ESTs, but do not contain an ORF. Processed pseudogenes are non-functional copies of genes that lack introns. Unprocessed pseudogenes are non-functional copies of genes that contain introns. Where possible, gene symbols were approved by the HUGO Gene Nomenclature Committee⁴².

tRNA genes were predicted using tRNAscan-SE v1.23 (ref. 43) in eukaryotic mode with default parameters and a threshold of 20 bits. Other ncRNAs were predicted by searching the chromosome sequence against the Rfam database of RNA families (version 4.1)¹². Predicted ncRNA genes were compared by WU-BLASTN to reference sequences collected from the EMBL nucleotide sequence database⁴⁴, in an effort to discriminate between genes and pseudogenes. True ncRNA genes were operationally defined as BLAST hits with at least 95% sequence identity over 95% of the query length, as previously described¹. Published human microRNA gene sequences were downloaded from the miRNA Registry v2.0 and mapped to the chromosome sequence using BLASTN.

All markers from the deCODE genetic map were mapped back onto the finished sequence using a combination of electronic PCR⁴⁵ and SSAHA⁴⁶. For the comparative analysis with other genomes the following methods were used. The mouse and rat sequence were compared to the human sequence using BLASTZ⁴⁷. The programs axtBest and subsetAxt (W. J. Kent, http://www.soe.ucsc.edu/~kent/src) were used to post-process the resulting matches, as previously described⁴⁷, to select the best match and make the alignments relatively specific to exons by using a specific scoring matrix and threshold. The sequences from Tetraodon, Fugu and zebrafish were aligned to the chromosome using WU-TBLASTX, using the same scoring matrix, parameters and filtering strategy used in the Exofish procedure⁴⁸. The resources used for these comparisons are shown in the Supplementary Information.

References

International Human Genome Sequencing Consortium. Initial sequencing and analysis of the human genome. Nature 409, 860–921 (2001)
Article ADS Google Scholar
Deloukas, P. et al. The DNA sequence and comparative analysis of human chromosome 20. Nature 414, 865–871 (2001)
Article ADS CAS PubMed Google Scholar
Heilig, R. et al. The DNA sequence and analysis of human chromosome 14. Nature 421, 601–607 (2003)
Article ADS CAS PubMed Google Scholar
Hillier, L. W. et al. The DNA sequence of human chromosome 7. Nature 424, 157–164 (2003)
Article ADS CAS PubMed Google Scholar
Mungall, A. J. et al. The DNA sequence and analysis of human chromosome 6. Nature 425, 805–811 (2003)
Article ADS CAS PubMed Google Scholar
Kehrer-Sawatzki, H. et al. Mapping of members of the low-copy-number repetitive DNA sequence family chAB4 within the p arms of human acrocentric chromosomes: characterization of Robertsonian translocations. Chromosome Res. 6, 429–435 (1998)
Article CAS PubMed Google Scholar
Felsenfeld, A., Peterson, J., Schloss, J. & Guyer, M. Assessing the quality of the DNA sequence from the Human Genome Project. Genome Res. 9, 1–4 (1999)
Article CAS PubMed Google Scholar
Kong, A. et al. A high-resolution recombination map of the human genome. Nature Genet. 31, 241–247 (2002)
Article CAS PubMed Google Scholar
Broman, K. W., Murray, J. C., Sheffield, V. C., White, R. L. & Weber, J. L. Comprehensive human genetic maps: individual and sex-specific variation in recombination. Am. J. Hum. Genet. 63, 861–869 (1998)
Article CAS PubMed PubMed Central Google Scholar
Deloukas, P. et al. A physical map of 30,000 human genes. Science 282, 744–746 (1998)
Article ADS CAS PubMed Google Scholar
Dunham, I. et al. The DNA sequence of human chromosome 22. Nature 402, 489–495 (1999)
Article ADS CAS PubMed Google Scholar
Griffiths-Jones, S., Bateman, A., Marshall, M., Khanna, A. & Eddy, S. R. Rfam: an RNA family database. Nucleic Acids Res. 31, 439–441 (2003)
Article CAS PubMed PubMed Central Google Scholar
Calin, G. A. et al. Frequent deletions and down-regulation of micro-RNA genes miR15 and miR16 at 13q14 in chronic lymphocytic leukemia. Proc. Natl Acad. Sci. USA 99, 15524–15529 (2002)
Article ADS CAS PubMed PubMed Central Google Scholar
Lee, Y., Jeon, K., Lee, J. T., Kim, S. & Kim, V. N. MicroRNA maturation: stepwise processing and subcellular localization. EMBO J. 21, 4663–4670 (2002)
Article CAS PubMed PubMed Central Google Scholar
Collins, J. E. et al. Reevaluating human gene annotation: a second-generation analysis of chromosome 22. Genome Res. 13, 27–36 (2003)
Article CAS PubMed PubMed Central Google Scholar
Hattori, M. et al. The DNA sequence of human chromosome 21. Nature 405, 311–319 (2000)
Article ADS CAS PubMed Google Scholar
Saccone, S. et al. Correlations between isochores and chromosomal bands in the human genome. Proc. Natl Acad. Sci. USA 90, 11929–11933 (1993)
Article ADS CAS PubMed PubMed Central Google Scholar
Craig, J. M. & Bickmore, W. A. The distribution of CpG islands in mammalian chromosomes. Nature Genet. 7, 376–382 (1994)
Article CAS PubMed Google Scholar
Cheung, J. et al. Genome-wide detection of segmental duplications and potential assembly errors in the human genome sequence. Genome Biol. [online] 4, R25 (2003)
Article PubMed PubMed Central Google Scholar
Bailey, J. A. et al. Recent segmental duplications in the human genome. Science 297, 1003–1007 (2002)
Article ADS CAS PubMed Google Scholar
Guipponi, M. et al. The murine orthologue of the Golgi-localized TPTE protein provides clues to the evolutionary history of the human TPTE gene family. Hum. Genet. 109, 569–575 (2001)
Article CAS PubMed Google Scholar
Yu, A. et al. Comparison of human genetic and sequence-based physical maps. Nature 409, 951–953 (2001)
Article ADS CAS PubMed Google Scholar
Nobrega, M. A., Ovcharenko, I., Afzal, V. & Rubin, E. M. Scanning human gene deserts for long-range enhancers. Science 302, 413 (2003)
Article CAS PubMed Google Scholar
Glazko, G. V., Koonin, E. V., Rogozin, I. B. & Shabalina, S. A. A significant fraction of conserved noncoding DNA in human and mouse consists of predicted matrix attachment regions. Trends Genet. 19, 119–124 (2003)
Article CAS PubMed Google Scholar
Schwartz, S. et al. MultiPipMaker and supporting tools: Alignments and analysis of multiple genomic DNA sequences. Nucleic Acids Res. 31, 3518–3524 (2003)
Article CAS PubMed PubMed Central Google Scholar
Lim, L. P. et al. The microRNAs of Caenorhabditis elegans. Genes Dev. 17, 991–1008 (2003)
Article CAS PubMed PubMed Central Google Scholar
Wooster, R. et al. Identification of the breast cancer susceptibility gene BRCA2. Nature 378, 789–792 (1995)
Article ADS CAS PubMed Google Scholar
Hattori, E. et al. Polymorphisms at the G72/G30 gene locus, on 13q33, are associated with bipolar disorder in two independent pedigree series. Am. J. Hum. Genet. 72, 1131–1140 (2003)
Article CAS PubMed PubMed Central Google Scholar
Chumakov, I. et al. Genetic and physiological data implicating the new human gene G72 and the gene for D-amino acid oxidase in schizophrenia. Proc. Natl Acad. Sci. USA 99, 13675–13680 (2002)
Article ADS CAS PubMed PubMed Central Google Scholar
Rowntree, C. et al. Deletion analysis of chromosome 13q14.3 and characterisation of an alternative splice form of LEU1 in B cell chronic lymphocytic leukemia. Leukemia 16, 1267–1275 (2002)
Article CAS PubMed Google Scholar
Armitage, J. O. & Weisenburger, D. D. New approach to classifying non-Hodgkin's lymphomas: clinical features of the major histologic subtypes. Non-Hodgkin's Lymphoma Classification Project. J. Clin. Oncol. 16, 2780–2795 (1998)
Article CAS PubMed Google Scholar
Neat, M. J. et al. Localisation of a novel region of recurrent amplification in follicular lymphoma to an approximately 6.8 Mb region of 13q32–33. Genes Chromosom. Cancer 32, 236–243 (2001)
Article CAS PubMed Google Scholar
Beyer, K. et al. Association and linkage of atopic dermatitis with chromosome 13q12–14 and 5q31–33 markers. J. Invest. Dermatol. 115, 906–908 (2000)
Article CAS PubMed Google Scholar
Kimura, K. et al. Linkage and association of atopic asthma to markers on chromosome 13 in the Japanese population. Hum. Mol. Genet. 8, 1487–1490 (1999)
Article CAS PubMed Google Scholar
Zhang, Y. et al. Positional cloning of a quantitative trait locus on chromosome 13q14 that influences immunoglobulin E levels and asthma. Nature Genet. 34, 181–186 (2003)
Article CAS PubMed Google Scholar
Bentley, D. R. et al. The physical maps for sequencing human chromosomes 1, 6, 9, 10, 13, 20 and X. Nature 409, 942–943 (2001)
Article ADS CAS PubMed Google Scholar
Osoegawa, K. et al. A bacterial artificial chromosome library for sequencing the complete human genome. Genome Res. 11, 483–496 (2001)
Article CAS PubMed PubMed Central Google Scholar
International Human Genome Mapping Consortium. A physical map of the human genome. Nature 409, 934–941 (2001)
Article ADS Google Scholar
Hubbard, T. et al. The Ensembl genome database project. Nucleic Acids Res. 30, 38–41 (2002)
Article CAS PubMed PubMed Central Google Scholar
Benson, G. Tandem repeats finder: a program to analyze DNA sequences. Nucleic Acids Res. 27, 573–580 (1999)
Article CAS PubMed PubMed Central Google Scholar
Boeckmann, B. et al. The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003. Nucleic Acids Res. 31, 365–370 (2003)
Article CAS PubMed PubMed Central Google Scholar
Wain, H. M., Lush, M., Ducluzeau, F. & Povey, S. Genew: the human gene nomenclature database. Nucleic Acids Res. 30, 169–171 (2002)
Article CAS PubMed PubMed Central Google Scholar
Lowe, T. M. & Eddy, S. R. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 25, 955–964 (1997)
Article CAS PubMed PubMed Central Google Scholar
Stoesser, G. et al. The EMBL Nucleotide Sequence Database: major new developments. Nucleic Acids Res. 31, 17–22 (2003)
Article CAS PubMed PubMed Central Google Scholar
Schuler, G. D. Sequence mapping by electronic PCR. Genome Res. 7, 541–550 (1997)
Article CAS PubMed PubMed Central Google Scholar
Ning, Z., Cox, A. J. & Mullikin, J. C. SSAHA: a fast search method for large DNA databases. Genome Res. 11, 1725–1729 (2001)
Article CAS PubMed PubMed Central Google Scholar
Schwartz, S. et al. Human-mouse alignments with BLASTZ. Genome Res. 13, 103–107 (2003)
Article CAS PubMed PubMed Central Google Scholar
Roest Crollius, H. et al. Estimate of human gene number provided by genome-wide analysis using Tetraodon nigroviridis DNA sequence. Nature Genet. 25, 235–238 (2000)
Article CAS PubMed Google Scholar
Down, T. A. & Hubbard, T. J. Computational detection and location of transcription start sites in mammalian genomic DNA. Genome Res. 12, 458–461 (2002)
Article CAS PubMed PubMed Central Google Scholar
Sachidanandam, R. et al. A map of human genome sequence variation containing 1.42 million single nucleotide polymorphisms. Nature 409, 928–933 (2001)
Article ADS CAS PubMed Google Scholar

Download references

Acknowledgements

We thank H. M. Wain, R. C. Lovering, E. A. Bruford, M. J. Lush, V. K. Khodiyar, C. C. Talbot and S. Povey from the HUGO Gene Nomenclature Committee for official gene nomenclature, H. Reithman and R. Sudbrak for provision of clones, Y. Chen for assistance with the SNP mapping and the Wellcome Trust for financial support.

Author information

Authors and Affiliations

The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus Cambridgeshire, CB10 1SA, Hinxton, UK
A. Dunham, L. H. Matthews, J. Burton, J. L. Ashurst, K. L. Howe, K. J. Ashcroft, D. M. Beare, D. C. Burford, S. E. Hunt, S. Griffiths-Jones, M. C. Jones, S. J. Keenan, K. Oliver, C. E. Scott, R. Ainscough, J. P. Almeida, K. D. Ambrose, D. T. Andrews, R. I. S. Ashwell, A. K. Babbage, C. L. Bagguley, J. Bailey, R. Bannerjee, K. F. Barlow, K. Bates, H. Beasley, C. P. Bird, S. Bray-Allen, A. J. Brown, J. Y. Brown, W. Burrill, C. Carder, N. P. Carter, J. C. Chapman, M. E. Clamp, S. Y. Clark, G. Clarke, C. M. Clee, S. C. M. Clegg, V. Cobley, J. E. Collins, N. Corby, G. J. Coville, P. Deloukas, P. Dhami, I. Dunham, M. Dunn, M. E. Earthrowl, A. G. Ellington, L. Faulkner, A. G. Frankish, J. Frankland, L. French, P. Garner, J. Garnett, J. G. R. Gilbert, C. J. Gilson, J. Ghori, D. V. Grafham, S. M. Gribble, C. Griffiths, R. E. Hall, S. Hammond, J. L. Harley, E. A. Hart, P. D. Heath, P. J. Howden, E. J. Huckle, P. J. Hunt, A. R. Hunt, C. Johnson, D. Johnson, M. Kay, A. M. Kimberley, A. King, G. K. Laird, C. J. Langford, S. Lawlor, D. A. Leongamornlert, D. M. Lloyd, C. Lloyd, J. E. Loveland, J. Lovell, S. Martin, M. Mashreghi-Mohammadi, S. J. McLaren, A. McMurray, S. Milne, M. J. F. Moore, T. Nickerson, S. A. Palmer, A. V. Pearce, A. I. Peck, S. Pelan, B. Phillimore, K. M. Porter, C. M. Rice, S. Searle, H. K. Sehra, R. Shownkeen, C. D. Skuce, M. Smith, C. A. Steward, N. Sycamore, J. Tester, D. W. Thomas, A. Tracey, A. Tromans, B. Tubby, M. Wall, J. M. Wallis, A. P. West, S. L. Whitehead, D. L. Willey, L. Wilming, P. W. Wray, L. Young, A. Coulson, R. Durbin, T. Hubbard, J. E. Sulston, S. Beck, D. R. Bentley, J. Rogers & M. T. Ross
HUGO Gene Nomenclature Committee, The Galton Laboratory, Wolfson House, 4 Stephenson Way, London, NW1 2HE, UK
M. W. Wright

Authors

A. Dunham
View author publications
You can also search for this author in PubMed Google Scholar
L. H. Matthews
View author publications
You can also search for this author in PubMed Google Scholar
J. Burton
View author publications
You can also search for this author in PubMed Google Scholar
J. L. Ashurst
View author publications
You can also search for this author in PubMed Google Scholar
K. L. Howe
View author publications
You can also search for this author in PubMed Google Scholar
K. J. Ashcroft
View author publications
You can also search for this author in PubMed Google Scholar
D. M. Beare
View author publications
You can also search for this author in PubMed Google Scholar
D. C. Burford
View author publications
You can also search for this author in PubMed Google Scholar
S. E. Hunt
View author publications
You can also search for this author in PubMed Google Scholar
S. Griffiths-Jones
View author publications
You can also search for this author in PubMed Google Scholar
M. C. Jones
View author publications
You can also search for this author in PubMed Google Scholar
S. J. Keenan
View author publications
You can also search for this author in PubMed Google Scholar
K. Oliver
View author publications
You can also search for this author in PubMed Google Scholar
C. E. Scott
View author publications
You can also search for this author in PubMed Google Scholar
R. Ainscough
View author publications
You can also search for this author in PubMed Google Scholar
J. P. Almeida
View author publications
You can also search for this author in PubMed Google Scholar
K. D. Ambrose
View author publications
You can also search for this author in PubMed Google Scholar
D. T. Andrews
View author publications
You can also search for this author in PubMed Google Scholar
R. I. S. Ashwell
View author publications
You can also search for this author in PubMed Google Scholar
A. K. Babbage
View author publications
You can also search for this author in PubMed Google Scholar
C. L. Bagguley
View author publications
You can also search for this author in PubMed Google Scholar
J. Bailey
View author publications
You can also search for this author in PubMed Google Scholar
R. Bannerjee
View author publications
You can also search for this author in PubMed Google Scholar
K. F. Barlow
View author publications
You can also search for this author in PubMed Google Scholar
K. Bates
View author publications
You can also search for this author in PubMed Google Scholar
H. Beasley
View author publications
You can also search for this author in PubMed Google Scholar
C. P. Bird
View author publications
You can also search for this author in PubMed Google Scholar
S. Bray-Allen
View author publications
You can also search for this author in PubMed Google Scholar
A. J. Brown
View author publications
You can also search for this author in PubMed Google Scholar
J. Y. Brown
View author publications
You can also search for this author in PubMed Google Scholar
W. Burrill
View author publications
You can also search for this author in PubMed Google Scholar
C. Carder
View author publications
You can also search for this author in PubMed Google Scholar
N. P. Carter
View author publications
You can also search for this author in PubMed Google Scholar
J. C. Chapman
View author publications
You can also search for this author in PubMed Google Scholar
M. E. Clamp
View author publications
You can also search for this author in PubMed Google Scholar
S. Y. Clark
View author publications
You can also search for this author in PubMed Google Scholar
G. Clarke
View author publications
You can also search for this author in PubMed Google Scholar
C. M. Clee
View author publications
You can also search for this author in PubMed Google Scholar
S. C. M. Clegg
View author publications
You can also search for this author in PubMed Google Scholar
V. Cobley
View author publications
You can also search for this author in PubMed Google Scholar
J. E. Collins
View author publications
You can also search for this author in PubMed Google Scholar
N. Corby
View author publications
You can also search for this author in PubMed Google Scholar
G. J. Coville
View author publications
You can also search for this author in PubMed Google Scholar
P. Deloukas
View author publications
You can also search for this author in PubMed Google Scholar
P. Dhami
View author publications
You can also search for this author in PubMed Google Scholar
I. Dunham
View author publications
You can also search for this author in PubMed Google Scholar
M. Dunn
View author publications
You can also search for this author in PubMed Google Scholar
M. E. Earthrowl
View author publications
You can also search for this author in PubMed Google Scholar
A. G. Ellington
View author publications
You can also search for this author in PubMed Google Scholar
L. Faulkner
View author publications
You can also search for this author in PubMed Google Scholar
A. G. Frankish
View author publications
You can also search for this author in PubMed Google Scholar
J. Frankland
View author publications
You can also search for this author in PubMed Google Scholar
L. French
View author publications
You can also search for this author in PubMed Google Scholar
P. Garner
View author publications
You can also search for this author in PubMed Google Scholar
J. Garnett
View author publications
You can also search for this author in PubMed Google Scholar
J. G. R. Gilbert
View author publications
You can also search for this author in PubMed Google Scholar
C. J. Gilson
View author publications
You can also search for this author in PubMed Google Scholar
J. Ghori
View author publications
You can also search for this author in PubMed Google Scholar
D. V. Grafham
View author publications
You can also search for this author in PubMed Google Scholar
S. M. Gribble
View author publications
You can also search for this author in PubMed Google Scholar
C. Griffiths
View author publications
You can also search for this author in PubMed Google Scholar
R. E. Hall
View author publications
You can also search for this author in PubMed Google Scholar
S. Hammond
View author publications
You can also search for this author in PubMed Google Scholar
J. L. Harley
View author publications
You can also search for this author in PubMed Google Scholar
E. A. Hart
View author publications
You can also search for this author in PubMed Google Scholar
P. D. Heath
View author publications
You can also search for this author in PubMed Google Scholar
P. J. Howden
View author publications
You can also search for this author in PubMed Google Scholar
E. J. Huckle
View author publications
You can also search for this author in PubMed Google Scholar
P. J. Hunt
View author publications
You can also search for this author in PubMed Google Scholar
A. R. Hunt
View author publications
You can also search for this author in PubMed Google Scholar
C. Johnson
View author publications
You can also search for this author in PubMed Google Scholar
D. Johnson
View author publications
You can also search for this author in PubMed Google Scholar
M. Kay
View author publications
You can also search for this author in PubMed Google Scholar
A. M. Kimberley
View author publications
You can also search for this author in PubMed Google Scholar
A. King
View author publications
You can also search for this author in PubMed Google Scholar
G. K. Laird
View author publications
You can also search for this author in PubMed Google Scholar
C. J. Langford
View author publications
You can also search for this author in PubMed Google Scholar
S. Lawlor
View author publications
You can also search for this author in PubMed Google Scholar
D. A. Leongamornlert
View author publications
You can also search for this author in PubMed Google Scholar
D. M. Lloyd
View author publications
You can also search for this author in PubMed Google Scholar
C. Lloyd
View author publications
You can also search for this author in PubMed Google Scholar
J. E. Loveland
View author publications
You can also search for this author in PubMed Google Scholar
J. Lovell
View author publications
You can also search for this author in PubMed Google Scholar
S. Martin
View author publications
You can also search for this author in PubMed Google Scholar
M. Mashreghi-Mohammadi
View author publications
You can also search for this author in PubMed Google Scholar
S. J. McLaren
View author publications
You can also search for this author in PubMed Google Scholar
A. McMurray
View author publications
You can also search for this author in PubMed Google Scholar
S. Milne
View author publications
You can also search for this author in PubMed Google Scholar
M. J. F. Moore
View author publications
You can also search for this author in PubMed Google Scholar
T. Nickerson
View author publications
You can also search for this author in PubMed Google Scholar
S. A. Palmer
View author publications
You can also search for this author in PubMed Google Scholar
A. V. Pearce
View author publications
You can also search for this author in PubMed Google Scholar
A. I. Peck
View author publications
You can also search for this author in PubMed Google Scholar
S. Pelan
View author publications
You can also search for this author in PubMed Google Scholar
B. Phillimore
View author publications
You can also search for this author in PubMed Google Scholar
K. M. Porter
View author publications
You can also search for this author in PubMed Google Scholar
C. M. Rice
View author publications
You can also search for this author in PubMed Google Scholar
S. Searle
View author publications
You can also search for this author in PubMed Google Scholar
H. K. Sehra
View author publications
You can also search for this author in PubMed Google Scholar
R. Shownkeen
View author publications
You can also search for this author in PubMed Google Scholar
C. D. Skuce
View author publications
You can also search for this author in PubMed Google Scholar
M. Smith
View author publications
You can also search for this author in PubMed Google Scholar
C. A. Steward
View author publications
You can also search for this author in PubMed Google Scholar
N. Sycamore
View author publications
You can also search for this author in PubMed Google Scholar
J. Tester
View author publications
You can also search for this author in PubMed Google Scholar
D. W. Thomas
View author publications
You can also search for this author in PubMed Google Scholar
A. Tracey
View author publications
You can also search for this author in PubMed Google Scholar
A. Tromans
View author publications
You can also search for this author in PubMed Google Scholar
B. Tubby
View author publications
You can also search for this author in PubMed Google Scholar
M. Wall
View author publications
You can also search for this author in PubMed Google Scholar
J. M. Wallis
View author publications
You can also search for this author in PubMed Google Scholar
A. P. West
View author publications
You can also search for this author in PubMed Google Scholar
S. L. Whitehead
View author publications
You can also search for this author in PubMed Google Scholar
D. L. Willey
View author publications
You can also search for this author in PubMed Google Scholar
L. Wilming
View author publications
You can also search for this author in PubMed Google Scholar
P. W. Wray
View author publications
You can also search for this author in PubMed Google Scholar
M. W. Wright
View author publications
You can also search for this author in PubMed Google Scholar
L. Young
View author publications
You can also search for this author in PubMed Google Scholar
A. Coulson
View author publications
You can also search for this author in PubMed Google Scholar
R. Durbin
View author publications
You can also search for this author in PubMed Google Scholar
T. Hubbard
View author publications
You can also search for this author in PubMed Google Scholar
J. E. Sulston
View author publications
You can also search for this author in PubMed Google Scholar
S. Beck
View author publications
You can also search for this author in PubMed Google Scholar
D. R. Bentley
View author publications
You can also search for this author in PubMed Google Scholar
J. Rogers
View author publications
You can also search for this author in PubMed Google Scholar
M. T. Ross
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to A. Dunham.

Ethics declarations

Competing interests

The authors declare that they have no competing financial interests.

Supplementary information

Supplementary Figure S1

Figure showing annotation and analysis of complete sequence. (PDF 8408 kb)

Supplementary Figure S1 legend (DOC 21 kb)

Supplementary Table S1

Table showing clone libraries used in the tilepath. (DOC 40 kb)

Supplementary Table S2

Table showing contig and gap sizes. (DOC 32 kb)

Supplementary Table S3

Characteristics of the annotated genes on chromosome 13q. (DOC 50 kb)

Supplementary Table S4

Protein family clusters on chromosome 13. (DOC 39 kb)

Supplementary Table S5

Predicted ncRNA genes on chromosome 13q. (DOC 128 kb)

Supplementary Table S6

Comparison of genetic and physical distances on chromosome 13. (DOC 245 kb)

Supplementary Table S7

Fraction of human annotated gene structures overlapping regions conserved in other vertebrate genome sequences. (DOC 30 kb)

Supplementary Table S8

Disease genes on chromosome 13. (DOC 48 kb)

Supplementary Information: Resources

Genome assemblies used for the comparative analysis. (DOC 19 kb)

Supplementary References (DOC 19 kb)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Dunham, A., Matthews, L., Burton, J. et al. The DNA sequence and analysis of human chromosome 13. Nature 428, 522–528 (2004). https://doi.org/10.1038/nature02379

Download citation

Received: 05 December 2003
Accepted: 27 January 2004
Published: 01 April 2004
Issue Date: 01 April 2004
DOI: https://doi.org/10.1038/nature02379

This article is cited by

Retinoblastoma: present scenario and future challenges
- Vishnu Vardhan Byroju
- Aisha Shigna Nadukkandy
- Lekha Dinesh Kumar
Cell Communication and Signaling (2023)
Chromosomal instability (CIN) in HAP1 cell lines revealed by multiplex fluorescence in situ hybridisation (M-FISH)
- Ruby Banerjee
- Cibele G. Sotero-Caio
- Fengtang Yang
Molecular Cytogenetics (2022)
13q deletion syndrome resulting from balanced chromosomal rearrangement in father: the significance of parental karyotyping
- Sabine Dittner-Moormann
- Madlen Reschke
- Deniz Kanber
Molecular Cytogenetics (2020)
Chromosomal scan of single sperm cells by combining fluorescence-activated cell sorting and next-generation sequencing
- Quoc Ty Tran
- Tatjana Jatsenko
- Andres Salumets
Journal of Assisted Reproduction and Genetics (2019)
Have we done our last amniocentesis? Updates on cell-free DNA for Down syndrome screening
- Kathryn J. Gray
- Louise E. Wilkins-Haug
Pediatric Radiology (2018)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Abstract

Similar content being viewed by others

Main

Clone map and finished sequence

Gene index

Chromosome landscape

Comparison to the genetic map

Comparative analysis

Medical implications

Methods

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Competing interests

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links