The zebrafish reference genome sequence and its relationship to the human genome

Howe, Kerstin; Clark, Matthew D.; Torroja, Carlos F.; Torrance, James; Berthelot, Camille; Muffato, Matthieu; Collins, John E.; Humphray, Sean; McLaren, Karen; Matthews, Lucy; McLaren, Stuart; Sealy, Ian; Caccamo, Mario; Churcher, Carol; Scott, Carol; Barrett, Jeffrey C.; Koch, Romke; Rauch, Gerd-Jörg; White, Simon; Chow, William; Kilian, Britt; Quintais, Leonor T.; Guerra-Assunção, José A.; Zhou, Yi; Gu, Yong; Yen, Jennifer; Vogel, Jan-Hinnerk; Eyre, Tina; Redmond, Seth; Banerjee, Ruby; Chi, Jianxiang; Fu, Beiyuan; Langley, Elizabeth; Maguire, Sean F.; Laird, Gavin K.; Lloyd, David; Kenyon, Emma; Donaldson, Sarah; Sehra, Harminder; Almeida-King, Jeff; Loveland, Jane; Trevanion, Stephen; Jones, Matt; Quail, Mike; Willey, Dave; Hunt, Adrienne; Burton, John; Sims, Sarah; McLay, Kirsten; Plumb, Bob; Davis, Joy; Clee, Chris; Oliver, Karen; Clark, Richard; Riddle, Clare; Elliott, David; Threadgold, Glen; Harden, Glenn; Ware, Darren; Begum, Sharmin; Mortimore, Beverley; Kerry, Giselle; Heath, Paul; Phillimore, Benjamin; Tracey, Alan; Corby, Nicole; Dunn, Matthew; Johnson, Christopher; Wood, Jonathan; Clark, Susan; Pelan, Sarah; Griffiths, Guy; Smith, Michelle; Glithero, Rebecca; Howden, Philip; Barker, Nicholas; Lloyd, Christine; Stevens, Christopher; Harley, Joanna; Holt, Karen; Panagiotidis, Georgios; Lovell, Jamieson; Beasley, Helen; Henderson, Carl; Gordon, Daria; Auger, Katherine; Wright, Deborah; Collins, Joanna; Raisen, Claire; Dyer, Lauren; Leung, Kenric; Robertson, Lauren; Ambridge, Kirsty; Leongamornlert, Daniel; McGuire, Sarah; Gilderthorp, Ruth; Griffiths, Coline; Manthravadi, Deepa; Nichol, Sarah; Barker, Gary; Whitehead, Siobhan; Kay, Michael; Brown, Jacqueline; Murnane, Clare; Gray, Emma; Humphries, Matthew; Sycamore, Neil; Barker, Darren; Saunders, David; Wallis, Justene; Babbage, Anne; Hammond, Sian; Mashreghi-Mohammadi, Maryam; Barr, Lucy; Martin, Sancha; Wray, Paul; Ellington, Andrew; Matthews, Nicholas; Ellwood, Matthew; Woodmansey, Rebecca; Clark, Graham; Cooper, James D.; Tromans, Anthony; Grafham, Darren; Skuce, Carl; Pandian, Richard; Andrews, Robert; Harrison, Elliot; Kimberley, Andrew; Garnett, Jane; Fosker, Nigel; Hall, Rebekah; Garner, Patrick; Kelly, Daniel; Bird, Christine; Palmer, Sophie; Gehring, Ines; Berger, Andrea; Dooley, Christopher M.; Ersan-Ürün, Zübeyde; Eser, Cigdem; Geiger, Horst; Geisler, Maria; Karotki, Lena; Kirn, Anette; Konantz, Judith; Konantz, Martina; Oberländer, Martina; Rudolph-Geiger, Silke; Teucke, Mathias; Lanz, Christa; Raddatz, Günter; Osoegawa, Kazutoyo; Zhu, Baoli; Rapp, Amanda; Widaa, Sara; Langford, Cordelia; Yang, Fengtang; Schuster, Stephan C.; Carter, Nigel P.; Harrow, Jennifer; Ning, Zemin; Herrero, Javier; Searle, Steve M. J.; Enright, Anton; Geisler, Robert; Plasterk, Ronald H. A.; Lee, Charles; Westerfield, Monte; de Jong, Pieter J.; Zon, Leonard I.; Postlethwait, John H.; Nüsslein-Volhard, Christiane; Hubbard, Tim J. P.; Crollius, Hugues Roest; Rogers, Jane; Stemple, Derek L.

doi:10.1038/nature12111

Download PDF

Letter
Open access
Published: 17 April 2013

The zebrafish reference genome sequence and its relationship to the human genome

Kerstin Howe¹^na1,
Matthew D. Clark^1,2^na1,
Carlos F. Torroja^1,3,
James Torrance¹,
Camille Berthelot^4,5,6,
Matthieu Muffato⁷,
John E. Collins¹,
Sean Humphray^1,8,
Karen McLaren¹,
Lucy Matthews¹,
Stuart McLaren¹,
Ian Sealy¹,
Mario Caccamo²,
Carol Churcher¹,
Carol Scott¹,
Jeffrey C. Barrett¹,
Romke Koch⁹,
Gerd-Jörg Rauch¹⁰,
Simon White¹,
William Chow¹,
Britt Kilian¹,
Leonor T. Quintais⁷,
José A. Guerra-Assunção⁷,
Yi Zhou¹¹,
Yong Gu¹,
Jennifer Yen¹,
Jan-Hinnerk Vogel¹,
Tina Eyre¹,
Seth Redmond¹,
Ruby Banerjee¹,
Jianxiang Chi¹,
Beiyuan Fu¹,
Elizabeth Langley¹,
Sean F. Maguire¹,
Gavin K. Laird¹,
David Lloyd¹,
Emma Kenyon¹,
Sarah Donaldson¹,
Harminder Sehra¹,
Jeff Almeida-King¹,
Jane Loveland¹,
Stephen Trevanion¹,
Matt Jones¹,
Mike Quail¹,
Dave Willey¹,
Adrienne Hunt¹,
John Burton¹,
Sarah Sims¹,
Kirsten McLay¹,
Bob Plumb¹,
Joy Davis¹,
Chris Clee¹,
Karen Oliver¹,
Richard Clark¹,
Clare Riddle¹,
David Elliott¹,
Glen Threadgold¹,
Glenn Harden¹,
Darren Ware¹,
Sharmin Begum¹,
Beverley Mortimore¹,
Giselle Kerry¹,
Paul Heath¹,
Benjamin Phillimore¹,
Alan Tracey¹,
Nicole Corby¹,
Matthew Dunn¹,
Christopher Johnson¹,
Jonathan Wood¹,
Susan Clark¹,
Sarah Pelan¹,
Guy Griffiths¹,
Michelle Smith¹,
Rebecca Glithero¹,
Philip Howden¹,
Nicholas Barker¹,
Christine Lloyd¹,
Christopher Stevens¹,
Joanna Harley¹,
Karen Holt¹,
Georgios Panagiotidis¹,
Jamieson Lovell¹,
Helen Beasley¹,
Carl Henderson¹,
Daria Gordon¹,
Katherine Auger¹,
Deborah Wright¹,
Joanna Collins¹,
Claire Raisen¹,
Lauren Dyer¹,
Kenric Leung¹,
Lauren Robertson¹,
Kirsty Ambridge¹,
Daniel Leongamornlert¹,
Sarah McGuire¹,
Ruth Gilderthorp¹,
Coline Griffiths¹,
Deepa Manthravadi¹,
Sarah Nichol¹,
Gary Barker¹,
Siobhan Whitehead¹,
Michael Kay¹,
Jacqueline Brown¹,
Clare Murnane¹,
Emma Gray¹,
Matthew Humphries¹,
Neil Sycamore¹,
Darren Barker¹,
David Saunders¹,
Justene Wallis¹,
Anne Babbage¹,
Sian Hammond¹,
Maryam Mashreghi-Mohammadi¹,
Lucy Barr¹,
Sancha Martin¹,
Paul Wray¹,
Andrew Ellington¹,
Nicholas Matthews¹,
Matthew Ellwood¹,
Rebecca Woodmansey¹,
Graham Clark¹,
James D. Cooper¹,
Anthony Tromans¹,
Darren Grafham¹,
Carl Skuce¹,
Richard Pandian¹,
Robert Andrews¹,
Elliot Harrison¹,
Andrew Kimberley¹,
Jane Garnett¹,
Nigel Fosker¹,
Rebekah Hall¹,
Patrick Garner¹,
Daniel Kelly¹,
Christine Bird¹,
Sophie Palmer¹,
Ines Gehring¹⁰,
Andrea Berger¹⁰,
Christopher M. Dooley^1,10,
Zübeyde Ersan-Ürün¹⁰,
Cigdem Eser¹⁰,
Horst Geiger¹⁰,
Maria Geisler¹⁰,
Lena Karotki¹⁰,
Anette Kirn¹⁰,
Judith Konantz¹⁰,
Martina Konantz¹⁰,
Martina Oberländer¹⁰,
Silke Rudolph-Geiger¹⁰,
Mathias Teucke¹⁰,
Christa Lanz¹⁰,
Günter Raddatz¹⁰,
Kazutoyo Osoegawa¹²,
Baoli Zhu¹²,
Amanda Rapp¹³,
Sara Widaa¹,
Cordelia Langford¹,
Fengtang Yang¹,
Stephan C. Schuster¹⁰,
Nigel P. Carter¹,
Jennifer Harrow¹,
Zemin Ning¹,
Javier Herrero⁷,
Steve M. J. Searle¹,
Anton Enright⁷,
Robert Geisler^10,14,
Ronald H. A. Plasterk⁹,
Charles Lee¹⁵,
Monte Westerfield¹³,
Pieter J. de Jong¹²,
Leonard I. Zon¹¹,
John H. Postlethwait¹³,
Christiane Nüsslein-Volhard¹⁰,
Tim J. P. Hubbard¹,
Hugues Roest Crollius^4,5,6,
Jane Rogers^1,2 &
…
Derek L. Stemple¹

Nature volume 496, pages 498–503 (2013)Cite this article

175k Accesses
3161 Citations
739 Altmetric
Metrics details

Subjects

Comparative genomics

A Corrigendum to this article was published on 11 December 2013

Abstract

Zebrafish have become a popular organism for the study of vertebrate gene function^1,2. The virtually transparent embryos of this species, and the ability to accelerate genetic studies by gene knockdown or overexpression, have led to the widespread use of zebrafish in the detailed investigation of vertebrate gene function and increasingly, the study of human genetic disease^3,4,5. However, for effective modelling of human genetic disease it is important to understand the extent to which zebrafish genes and gene structures are related to orthologous human genes. To examine this, we generated a high-quality sequence assembly of the zebrafish genome, made up of an overlapping set of completely sequenced large-insert clones that were ordered and oriented using a high-resolution high-density meiotic map. Detailed automatic and manual annotation provides evidence of more than 26,000 protein-coding genes⁶, the largest gene set of any vertebrate so far sequenced. Comparison to the human reference genome shows that approximately 70% of human genes have at least one obvious zebrafish orthologue. In addition, the high quality of this genome assembly provides a clearer understanding of key genomic features such as a unique repeat content, a scarcity of pseudogenes, an enrichment of zebrafish-specific genes on chromosome 4 and chromosomal regions that influence sex determination.

Towards complete and error-free genome assemblies of all vertebrate species

Article Open access 28 April 2021

A map of cis-regulatory elements and 3D genome structures in zebrafish

Article 25 November 2020

Multiomic atlas with functional stratification and developmental dynamics of zebrafish cis-regulatory elements

Article Open access 04 July 2022

Main

The zebrafish (Danio rerio) was first identified as a genetically tractable organism in the 1980s. The systematic application of genetic screens led to the phenotypic characterization of a large collection of mutations^1,2. These mutations, when driven to homozygosity, can produce defects in a variety of organ systems with pathologies similar to human disease. Such investigations have also contributed notably to our understanding of basic vertebrate biology and vertebrate development. In addition to enabling the systematic definition of a large range of early developmental phenotypes, screens in zebrafish have contributed more generally to our understanding of the factors controlling the specification of cell types, organ systems and body axes of vertebrates^7,8,9.

Although its contributions have already been substantial, zebrafish research holds further promise to enhance our understanding of the detailed roles of specific genes in human diseases, both rare and common. Increasingly, zebrafish experiments are included in studies of human genetic disease, often providing independent verification of the activity of a gene implicated in a human disease^3,5,10. Essential to this enterprise is a high-quality genome sequence and complete annotation of zebrafish protein-coding genes with identification of their human orthologues.

The zebrafish genome-sequencing project was initiated at the Wellcome Trust Sanger Institute in 2001. We chose Tübingen as the zebrafish reference strain as it had been used extensively to identify mutations affecting embryogenesis². Our strategy resembled the clone-by-clone sequencing approach adopted previously for both the human and mouse genome projects. The Zv9 assembly is a hybrid of high-quality finished clone sequence (83%) and whole-genome shotgun (WGS) sequence (17%), with a total size of 1.412 gigabases (Gb) (Table 1). The clone and WGS sequence is tied to a high-resolution, high-density meiotic map called the Sanger AB Tübingen map (SATmap), named after the strains of zebrafish used to make the map (Supplementary Information).

Table 1 Assembly and annotation statistics for the Zv9 assembly

Full size table

Zebrafish are members of the teleostei infraclass, a monophyletic group that is thought to have arisen approximately 340 million years ago from a common ancestor¹¹. Compared to other vertebrate species, this ancestor underwent an additional round of whole-genome duplication (WGD) called the teleost-specific genome duplication (TSD)¹². Gene duplicates that result from this process are called ohnologues (after Susumu Ohno who suggested this mechanism of gene duplication)¹³. Zebrafish possess 26,206 protein-coding genes⁶, more than any previously sequenced vertebrate, and they have a higher number of species-specific genes in their genome than do human, mouse or chicken. Some of this increased gene number is likely to be a consequence of the TSD.

A direct comparison of the zebrafish and human protein-coding genes reveals a number of interesting features. First, 71.4% of human genes have at least one zebrafish orthologue, as defined by Ensembl Compara¹⁴ (Table 2). Reciprocally, 69% of zebrafish genes have at least one human orthologue. Among the orthologous genes, 47% of human genes have a one-to-one relationship with a zebrafish orthologue. The second largest orthology class contains human genes that are associated with many zebrafish genes (the ‘one-human-to-many-zebrafish’ class), with an average of 2.28 zebrafish genes for each human gene, and this probably reflects the TSD. A few notable human genes have no clearly identifiable zebrafish orthologue; for example, the leukaemia inhibitory factor (LIF), oncostatin M (OSM) or interleukin-6 (IL6) genes, although the receptors lifra, lifrb, osmr and il6r are clearly present in the zebrafish genome. It is possible that zebrafish proteins with functionally similar activities to LIF, OSM and IL-6 exist, but that their sequence divergence is so great that they cannot be recognized as orthologues. Similarly, the zebrafish genome has no BRCA1 orthologue, but does have an orthologue of the BRCA1-associated BARD1 gene, which encodes an associated and functionally similar protein and a brca2 gene, which plays an important role in oocyte development, probably reflecting its role in DNA damage repair¹⁵.

Table 2 Comparison of human and zebrafish protein-coding genes and their orthology relationships

Full size table

Zebrafish have been used successfully to understand the biological activity of genes orthologous to human disease-related genes in greater detail^3,4,5. To investigate the number of potential disease-related genes, we compared the list of human genes possessing at least one zebrafish orthologue with the 3,176 genes bearing morbidity descriptions that are listed in the Online Mendelian Inheritance in Man (OMIM) database. Of these morbid genes, 2,601 (82%) can be related to at least one zebrafish orthologue. A similar comparison identified at least one zebrafish orthologue for 3,075 (76%) of the 4,023 human genes implicated in genome wide association studies (GWAS).

Zv9 shows an overall repeat content of 52.2%, the highest reported so far in a vertebrate. All other sequenced teleost fish exhibit a much lower repeat content, with an average of less than 30%. This result suggests that the evolutionary path leading to the zebrafish experienced an expansion of repeats, possibly facilitated by a population bottleneck. Alternatively, the repeat content of the other sequenced teleost species may be under-represented, as these assemblies are mostly WGS¹⁶.

The majority of transposable elements found in the human genome are type I (retrotransposable elements), with more than 4.3 million placements covering 44% of the sequence, whereas only 11% of the zebrafish genome sequence is covered by type I elements in less than 500,000 instances. In contrast, the zebrafish genome contains a marked excess of type II DNA transposable elements. Indeed, 2.3 million instances of type II DNA transposable elements cover 39% of the zebrafish genome sequence (Supplementary Table 12), whereas type II repeats cover only 3.2% of the human genome.

This pronounced abundance of type II transposable elements is unique among the sequenced vertebrate genomes, and the genome sequence shows evidence of recently active type II transposable elements. The closest vertebrate species in terms of the abundance of type II transposable elements is Xenopus tropicalis (25% type II transposable elements), whereas the sequenced and annotated teleost fish (the pufferfish Takifugu and Tetraodon, the three-spined stickleback (Gasterosteus aculeatus) and the medaka (Oryzias latipes)) each possess type II transposable element coverage of less than 10%, which may relate to the fact that the zebrafish genome diverges basally from the other sequenced and annotated teleost genomes¹⁷. Zebrafish type II transposable elements are divided into 14 superfamilies with 401 repeat families in total (Supplementary Table 12). The DNA and hAT superfamilies are the most abundant and diverse in the zebrafish genome, together covering 28% of the sequence. The type II transposable element abundance of zebrafish, or lack of retrotransposable elements, may provide an explanation for the low zebrafish pseudogene content (Supplementary Table 14).

The long arm of chromosome 4 is unique among zebrafish genomic regions, owing to its relative lack of protein-coding genes and its extensive heterochromatin. Chromosome 4 is known to be late-replicating and hybridization studies suggest that genomic copies of 5S ribosomal DNA (rDNA), which are not notably present on any other chromosome, are scattered along the long arm at high redundancy¹⁸. Immediately after the presumed centromere at approximately 24 megabases (Mb), the sequence landscape (Fig. 1 and Supplementary Fig. A4) shows a remarkable increase in repeat content, which continues through to the telomere of the long arm. At approximately 27 Mb, the otherwise uniform presence of the satellite repeat SAT-2 on the long arm ends abruptly. This location is also the starting point of uniform MOSAT-2 distribution, a satellite repeat that is nearly absent from all other chromosomes but highly enriched on the long arm of chromosome 4. The subtelomeric region of the long arm shows a distinct distribution of repeat elements, with relatively fewer interspersed elements and an increased content of satellite, simple and tandem repeats that do not harbour 5S rDNA sequences. Moreover, the gene content is reduced on the long arm and the guanine–cytosine content is slightly increased.

**Figure 1: Landscape of chromosome 4.**

The long arm of chromosome 4 also has a special structure with respect to gene orthology and synteny. Approximately 80% of the genes present have no identifiable orthologues in human. In fact, 110 genes (out of 663) have no identifiable orthologues in any other sequenced teleost genome and indeed seem to be zebrafish-specific genes. The genes in this region are highly duplicated, with 31 ancestral gene families alone providing 77.5% of the genes, the largest of which contains no less than 109 duplicates in this region. The largest of these families correspond to NOD-like receptor proteins¹⁹ with putative roles in innate immunity and zinc finger proteins. We also observed a very high density of small nuclear RNAs (snRNAs) on chromosome 4, and in particular those that encode spliceosome components. The cohort of snRNAs carried on the long arm of chromosome 4 accounts for 53.2% of all snRNAs in the zebrafish genome. In addition, in a specific group of zebrafish derived recently from a natural population, the subtelomeric region of the long arm of chromosome 4 has been found to contain a major sex determinant with alleles that are 100% predictive of male development and 85% predictive of female development, suggesting that this chromosome may be, might have been, or may be becoming, a sex chromosome in this particular population²⁰.

In addition to the chromosome 4 sex determinant, three other separate genomic regions have been identified as influencing sex determination, and these vary between the strains and even within the families studied^20,21. Our meiotic map, SATmap, which was generated to anchor the genomic sequence, provided an opportunity to examine whether there are any strong signals for sex determination. To generate SATmap we took advantage of the fact that it is possible to create double haploid individuals that contain only maternally derived DNA, that are homozygous at every locus and that can be raised until they are fertile²² (Fig. 2a). To investigate the interesting finding that SATmap F₁ fish could be either male or female while being genetically identical and heterozygous at every polymorphic locus, we sought a genetic signal for sex determination in the F₂ generation, in which these polymorphisms segregate. Using morphological secondary sexual traits, we were able to score the sex of 332 genotyped F₂ individuals. Although most chromosomes showed no significant genetic bias for a particular sex, we found that most of chromosome 16 carried a strong signal (P = 9.1 × 10⁻⁷) with a broad peak around the centromere (Fig. 2b, c). Homozygotes for the Tübingen (grandmaternal) allele had a very high probability of being female, whereas homozygotes for the AB (grandpaternal) allele were very unlikely to be female (Fig. 2).

**Figure 2: Sex determination signal on chromosome 16.**

The number of protein-coding genes among vertebrates is relatively stable, although even closely related species may show great disparities in the nature of their protein-coding gene content. We carried out a four-way comparison between the proteome of two mammals (human and mouse), a bird (chicken) and the zebrafish to quantify the fraction of shared and species-specific genes present in each genome (Fig. 3a). A core group of 10,660 genes is found in all four species and probably approximates an essential set of vertebrate protein-coding genes. This number is somewhat less than the core set of 11,809 vertebrate genes identified previously as being common to three fish genomes (Tetraodon, medaka, zebrafish) and three amniotes (human, mouse, chicken)¹⁶, but the discrepancy probably reflects the improved annotation of these genomes that often results in fusing fragmented gene structures. Each taxon has between 2,596 and 3,634 species-specific genes. The notable excess observed in zebrafish may be a consequence of the WGD, because pairs of duplicated genes that arose from the WGD, but with no orthologue in amniotes, are counted as two specific genes. Furthermore, 2,059 genes are found in human, mouse and zebrafish but not in chicken, and this number is two times higher than the number of genes that are found in all amniotes but not in zebrafish (892). It is unclear whether these genes have been lost along the evolutionary branch leading to the chicken, or whether this is due to annotation or orthology assignation errors in the chicken genome.

**Figure 3: Evolutionary aspects of the zebrafish genome.**

We identified double-conserved synteny (DCS) blocks between all sequenced tetrapods and four fish genomes (zebrafish, medaka, stickleback and Tetraodon). DCS blocks are defined as runs of genes in the non-duplicated species that are found on two different chromosomes in the species that underwent a WGD²³, although the genes may not be adjacent in the duplicated species²⁴. The DCS between zebrafish and human are represented on either side of each human chromosome (Supplementary Fig. 15). Using DCS blocks, we identified zebrafish paralogous genes that are part of DCS blocks and consistent with the locally alternating chromosomes, hence with an origin at the TSD. We identified 3,440 pairs of such ohnologues (26% of the all genes), for a total of 8,083 genes when subsequent duplications are taken into account. It is notable that although true pairs of ohnologues may exist within the same chromosome owing to post-TSD rearrangements, we excluded such cases as we cannot reliably distinguish them from segmental duplications. This number of ancestral genes retained as duplicates in zebrafish is higher, both in absolute number and in proportion, than in other fish genomes (chi-squared test, all P < 3 × 10⁻⁵).

We compared the 8,083 zebrafish TSD ohnologues with human ohnologues originating from the two rounds of WGD that are common to all vertebrates and find that the two sets overlap strongly (chi-squared test, P <2 × 10⁻¹⁶). In general, zebrafish ohnologous pairs are enriched in specific functions (neural activity, transcription factors) and are orthologous to mammalian genes under stronger evolutionary constraint than genes that have lost their second copy.

A circular representation of ohnologue pairs (Fig. 3b) highlights chromosomes, or parts of chromosomes, that descended from the same pre-duplication ancestral chromosome (for example, chromosomes 3 and 12, 17 and 20, 16 and 19). Among zebrafish chromosomes, chromosome 16 and chromosome 19 are unique in their one-to-one conservation of synteny. Consistent with the conservation of synteny, chromosome 16 and chromosome 19 possess clusters of orthologues of genes associated with the mammalian major histocompatibility complex (MHC) as well as the hoxab and hoxaa clusters, respectively, which are each orthologous to the human HOXA cluster²⁵.

Since the earliest whole-genome shotgun-only assembly became public in 2002, the zebrafish reference genome sequence has enabled many new discoveries to be made, in particular the positional cloning of hundreds of genes from mutations affecting embryogenesis, behaviour, physiology, and health and disease. Moreover, the annotated reference genome has enabled the generation of accurate whole-exome enrichment reagents, which are accelerating both positional cloning projects and new genome-wide mutation discovery efforts^26,27. Although the zebrafish reference genome sequencing is complete, a few poorly assembled regions remain, which are being resolved by the Genome Reference Consortium (http://genomereference.org).

Methods Summary

We generated cloned libraries of large fragments of genomic DNA, assembled a physical map of large-insert clones and completely sequenced a set of minimally overlapping clones. In addition, we generated WGS sequences by end-sequencing a mixture of large- and short-insert libraries. Overlapping clone sequences were combined with WGS sequences and tied to the meiotic map, SATmap, which enabled independent placement and orientation of clones in the genome sequence. The sequence data can be found in the BioProject database, under accession number PRJNA11776.

To obtain evidence for a more complete description of protein-coding genes, we used high-throughput short-read complementary DNA sequencing and obtained a deep-coverage data set for messenger RNAs expressed in zebrafish at various stages of development and in adult tissues⁶. Finally, a standard Ensembl gene build, incorporating filtered elements from the complementary DNA sequencing gene build, was merged with the manually curated gene models to produce a comprehensive annotation in Ensembl version 67 (http://may2012.archive.ensembl.org/Danio_rerio/Info/Index). Detailed descriptions of all the methods used for this project are available in the Supplementary Information.

Accession codes

Primary accessions

BioProject

PRJNA11776

Data deposits

Sequence data have been submitted to the BioProject database under accession PRJNA11776.

References

Driever, W. et al. A genetic screen for mutations affecting embryogenesis in zebrafish. Development 123, 37–46 (1996)
CAS PubMed Google Scholar
Haffter, P. et al. The identification of genes with unique and essential functions in the development of the zebrafish, Danio rerio. Development 123, 1–36 (1996)
CAS PubMed Google Scholar
Golzio, C. et al. KCTD13 is a major driver of mirrored neuroanatomical phenotypes of the 16p11.2 copy number variant. Nature 485, 363–367 (2012)
Article ADS CAS PubMed PubMed Central Google Scholar
Panizzi, J. R. et al. CCDC103 mutations cause primary ciliary dyskinesia by disrupting assembly of ciliary dynein arms. Nature Genet. 44, 714–719 (2012)
Article CAS PubMed Google Scholar
Roscioli, T. et al. Mutations in ISPD cause Walker-Warburg syndrome and defective glycosylation of alpha-dystroglycan. Nature Genet. 44, 581–585 (2012)
Article CAS PubMed Google Scholar
Collins, J. E., White, S., Searle, S. M. & Stemple, D. L. Incorporating RNA-seq data into the zebrafish Ensembl genebuild. Genome Res. 22, 2067–2078 (2012)
Article CAS PubMed PubMed Central Google Scholar
Talbot, W. S. et al. A homeobox gene essential for zebrafish notochord development. Nature 378, 150–157 (1995)
Article ADS CAS PubMed Google Scholar
Gritsman, K. et al. The EGF-CFC protein one-eyed pinhead is essential for nodal signaling. Cell 97, 121–132 (1999)
Article CAS PubMed Google Scholar
Ober, E. A., Verkade, H., Field, H. A. & Stainier, D. Y. Mesodermal Wnt2b signalling positively regulates liver specification. Nature 442, 688–691 (2006)
Article ADS CAS PubMed Google Scholar
Tobin, D. M. et al. Host genotype-specific therapies can optimize the inflammatory response to mycobacterial infections. Cell 148, 434–446 (2012)
Article CAS PubMed PubMed Central Google Scholar
Amores, A., Catchen, J., Ferrara, A., Fontenot, Q. & Postlethwait, J. H. Genome evolution and meiotic maps by massively parallel DNA sequencing: spotted gar, an outgroup for the teleost genome duplication. Genetics 188, 799–808 (2011)
Article CAS PubMed PubMed Central Google Scholar
Meyer, A. & Schartl, M. Gene and genome duplications in vertebrates: the one-to-four (-to-eight in fish) rule and the evolution of novel gene functions. Curr. Opin. Cell Biol. 11, 699–704 (1999)
Article CAS PubMed Google Scholar
Wolfe, K. Robustness–it's not where you think it is. Nature Genet. 25, 3–4 (2000)
Article CAS PubMed Google Scholar
Vilella, A. J. et al. EnsemblCompara GeneTrees: complete, duplication-aware phylogenetic trees in vertebrates. Genome Res. 19, 327–335 (2009)
Article CAS PubMed PubMed Central Google Scholar
Rodríguez-Mari, A. et al. Roles of brca2 (fancd1) in oocyte nuclear architecture, gametogenesis, gonad tumors, and genome stability in zebrafish. PLoS Genet. 7, e1001357 (2011)
Article PubMed PubMed Central Google Scholar
Kasahara, M. et al. The medaka draft genome and insights into vertebrate genome evolution. Nature 447, 714–719 (2007)
Article ADS CAS PubMed Google Scholar
Postlethwait, J. H. The zebrafish genome in context: ohnologs gone missing. J. Exp. Zool. B 308, 563–577 (2007)
Article Google Scholar
Sola, L. & Gornung, E. Classical and molecular cytogenetics of the zebrafish, Danio rerio (Cyprinidae, Cypriniformes): an overview. Genetica 111, 397–412 (2001)
Article CAS PubMed Google Scholar
Stein, C., Caccamo, M., Laird, G. & Leptin, M. Conservation and divergence of gene families encoding components of innate immune response systems in zebrafish. Genome Biol. 8, R251 (2007)
Article PubMed PubMed Central Google Scholar
Anderson, J. L. et al. Multiple sex-associated regions and a putative sex chromosome in zebrafish revealed by RAD mapping and population genomics. PLoS ONE 7, e40701 (2012)
Article ADS CAS PubMed PubMed Central Google Scholar
Bradley, K. M. et al. An SNP-based linkage map for zebrafish reveals sex determination loci. G3 (Bethesda) 1, 3–9 (2011)
Article CAS Google Scholar
Streisinger, G., Walker, C., Dower, N., Knauber, D. & Singer, F. Production of clones of homozygous diploid zebra fish (Brachydanio rerio). Nature 291, 293–296 (1981)
Article ADS CAS PubMed Google Scholar
Kellis, M., Birren, B. W. & Lander, E. S. Proof and evolutionary analysis of ancient genome duplication in the yeast Saccharomyces cerevisiae. Nature 428, 617–624 (2004)
Article ADS CAS PubMed Google Scholar
Jaillon, O. et al. Genome duplication in the teleost fish Tetraodon nigroviridis reveals the early vertebrate proto-karyotype. Nature 431, 946–957 (2004)
Article ADS PubMed Google Scholar
Amores, A. et al. Developmental roles of pufferfish Hox clusters and genome evolution in ray-fin fish. Genome Res. 14, 1–10 (2004)
Article MathSciNet CAS PubMed PubMed Central Google Scholar
Kettleborough, R. N. W. et al. A systematic genome-wide analysis of zebrafish protein-coding gene function. Nature (in the press)
Varshney, G. K. et al. A large-scale zebrafish gene knockout resource for the genome-wide study of gene function. Genome Res. 23, 727–735 (2013)
Article CAS PubMed PubMed Central Google Scholar
Freeman, J. L. et al. Definition of the zebrafish genome using flow cytometry and cytogenetic mapping. BMC Genomics 8, 195 (2007)
Article PubMed PubMed Central Google Scholar
The 1000 Genomes Project Consortium An integrated map of genetic variation from 1,092 human genomes. Nature 491, 56–65 (2012)
Article PubMed Central Google Scholar
Krzywinski, M. et al. Circos: an information aesthetic for comparative genomics. Genome Res. 19, 1639–1645 (2009)
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We wish to thank R. Durbin, E. Birney, A. Scally, C. P. Ponting, E. Busch-Nentwich and R. Kettleborough for helpful discussions, as well as F. L. Marlow and P. Aanstad for critical reading and helpful comments on manuscripts. We thank the zebrafish information network (ZFIN) for funding part of the manual annotation of the zebrafish genome and the ZFIN staff for support with gene nomenclature and other genome issues. We also thank the Genome Reference Consortium for the maintenance and improvement of the zebrafish genome assembly. We are indebted to the Ensembl team for providing a browser and database that greatly facilitated the use and the analyses of the zebrafish genome. We thank A. Pirani at Affymetrix for genotyping advice support, and the Zebrafish International Resource Center (ZIRC) for distributing the SAT strain. J.H.P. was supported by the National Institutes of Health (NIH) grant R01 GM085318 (to J.H.P.), NIH grant P01 HD22486 (to J.H.P.) and R01 OD011116 (later changed to R01 RR020833) (to J.H.P.). We would like to acknowledge the support of the European Commission's Sixth Framework Programme (contract no. LSHG-CT-2003-503496, ZF-MODELS) and Seventh Framework Programme (grant no. HEALTH-F4-2010-242048, ZF-HEALTH). R.G. was supported by the German Human Genome Project (DHGP Grant 01 KW 9627 and 01 KW 9919). C.N.-V., G.-J.R. and R.G. were supported by the NIH (NIH grant 1 R01 DK55377-01A1). S.C.S. was supported by the German Research Foundation (DFG Grant NU 22/5). The Zebrafish Genome Project at the Wellcome Trust Sanger Institute was funded by Wellcome Trust grant number 098051.

Author information

Kerstin Howe and Matthew D. Clark: These authors contributed equally to this work.

Authors and Affiliations

Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK,
Kerstin Howe, Matthew D. Clark, Carlos F. Torroja, James Torrance, John E. Collins, Sean Humphray, Karen McLaren, Lucy Matthews, Stuart McLaren, Ian Sealy, Carol Churcher, Carol Scott, Jeffrey C. Barrett, Simon White, William Chow, Britt Kilian, Yong Gu, Jennifer Yen, Jan-Hinnerk Vogel, Tina Eyre, Seth Redmond, Ruby Banerjee, Jianxiang Chi, Beiyuan Fu, Elizabeth Langley, Sean F. Maguire, Gavin K. Laird, David Lloyd, Emma Kenyon, Sarah Donaldson, Harminder Sehra, Jeff Almeida-King, Jane Loveland, Stephen Trevanion, Matt Jones, Mike Quail, Dave Willey, Adrienne Hunt, John Burton, Sarah Sims, Kirsten McLay, Bob Plumb, Joy Davis, Chris Clee, Karen Oliver, Richard Clark, Clare Riddle, David Elliott, Glen Threadgold, Glenn Harden, Darren Ware, Sharmin Begum, Beverley Mortimore, Giselle Kerry, Paul Heath, Benjamin Phillimore, Alan Tracey, Nicole Corby, Matthew Dunn, Christopher Johnson, Jonathan Wood, Susan Clark, Sarah Pelan, Guy Griffiths, Michelle Smith, Rebecca Glithero, Philip Howden, Nicholas Barker, Christine Lloyd, Christopher Stevens, Joanna Harley, Karen Holt, Georgios Panagiotidis, Jamieson Lovell, Helen Beasley, Carl Henderson, Daria Gordon, Katherine Auger, Deborah Wright, Joanna Collins, Claire Raisen, Lauren Dyer, Kenric Leung, Lauren Robertson, Kirsty Ambridge, Daniel Leongamornlert, Sarah McGuire, Ruth Gilderthorp, Coline Griffiths, Deepa Manthravadi, Sarah Nichol, Gary Barker, Siobhan Whitehead, Michael Kay, Jacqueline Brown, Clare Murnane, Emma Gray, Matthew Humphries, Neil Sycamore, Darren Barker, David Saunders, Justene Wallis, Anne Babbage, Sian Hammond, Maryam Mashreghi-Mohammadi, Lucy Barr, Sancha Martin, Paul Wray, Andrew Ellington, Nicholas Matthews, Matthew Ellwood, Rebecca Woodmansey, Graham Clark, James D. Cooper, Anthony Tromans, Darren Grafham, Carl Skuce, Richard Pandian, Robert Andrews, Elliot Harrison, Andrew Kimberley, Jane Garnett, Nigel Fosker, Rebekah Hall, Patrick Garner, Daniel Kelly, Christine Bird, Sophie Palmer, Christopher M. Dooley, Sara Widaa, Cordelia Langford, Fengtang Yang, Nigel P. Carter, Jennifer Harrow, Zemin Ning, Steve M. J. Searle, Tim J. P. Hubbard, Jane Rogers & Derek L. Stemple
The Genome Analysis Centre, Norwich Research Park, Norwich NR4 7UH, UK,
Matthew D. Clark, Mario Caccamo & Jane Rogers
Bioinformatics Unit, Centro Nacional de Investigaciones Cardiovasculares, Madrid, 28029, Spain
Carlos F. Torroja
Ecole Normale Supérieure, Institut de Biologie de l’ENS, IBENS, 46 rue d’Ulm, Paris F-75005, France,
Camille Berthelot & Hugues Roest Crollius
INSERM, U1024, 46 rue d’Ulm, Paris, F-75005, France
Camille Berthelot & Hugues Roest Crollius
CNRS, UMR 8197, 46 rue d’Ulm, Paris, F-75005, France
Camille Berthelot & Hugues Roest Crollius
EMBL European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK,
Matthieu Muffato, Leonor T. Quintais, José A. Guerra-Assunção, Javier Herrero & Anton Enright
Illumina Cambridge, Chesterford Research Park, Little Chesterford, CB10 1XL, Saffron Walden, UK
Sean Humphray
Hubrecht Laboratory, Uppsalalaan 8, 3584 CT Utrecht, The Netherlands,
Romke Koch & Ronald H. A. Plasterk
Max Planck Institute for Developmental Biology, Spemannstraße 35, 72076 Tübingen, Germany,
Gerd-Jörg Rauch, Ines Gehring, Andrea Berger, Christopher M. Dooley, Zübeyde Ersan-Ürün, Cigdem Eser, Horst Geiger, Maria Geisler, Lena Karotki, Anette Kirn, Judith Konantz, Martina Konantz, Martina Oberländer, Silke Rudolph-Geiger, Mathias Teucke, Christa Lanz, Günter Raddatz, Stephan C. Schuster, Robert Geisler & Christiane Nüsslein-Volhard
Stem Cell Program and Division of Hematology and Oncology, Children's Hospital and Dana Farber Cancer Institute, 1 Blackfan Circle, Karp 7, Boston, Massachusetts 02115, USA,
Yi Zhou & Leonard I. Zon
Children's Hospital Oakland, 747 52nd Street, Oakland, 94609, California, USA
Kazutoyo Osoegawa, Baoli Zhu & Pieter J. de Jong
Institute of Neuroscience, University of Oregon, 1254 University of Oregon, 222 Huestis Hall, Eugene, Oregon 97403-1254, USA,
Amanda Rapp, Monte Westerfield & John H. Postlethwait
Karlsruhe Institute of Technology (KIT), Campus North, Institute of Toxicology and Gentics (ITG), Hermann-von-Helmholtz-Platz 1, 76344 Eggenstein-Leopoldshafen, Germany,
Robert Geisler
Department of Pathology, Brigham and Women's Hospital and Harvard Medical School, Boston, 02115, Massachusetts, USA
Charles Lee

Authors

Kerstin Howe
View author publications
You can also search for this author in PubMed Google Scholar
Matthew D. Clark
View author publications
You can also search for this author in PubMed Google Scholar
Carlos F. Torroja
View author publications
You can also search for this author in PubMed Google Scholar
James Torrance
View author publications
You can also search for this author in PubMed Google Scholar
Camille Berthelot
View author publications
You can also search for this author in PubMed Google Scholar
Matthieu Muffato
View author publications
You can also search for this author in PubMed Google Scholar
John E. Collins
View author publications
You can also search for this author in PubMed Google Scholar
Sean Humphray
View author publications
You can also search for this author in PubMed Google Scholar
Karen McLaren
View author publications
You can also search for this author in PubMed Google Scholar
Lucy Matthews
View author publications
You can also search for this author in PubMed Google Scholar
Stuart McLaren
View author publications
You can also search for this author in PubMed Google Scholar
Ian Sealy
View author publications
You can also search for this author in PubMed Google Scholar
Mario Caccamo
View author publications
You can also search for this author in PubMed Google Scholar
Carol Churcher
View author publications
You can also search for this author in PubMed Google Scholar
Carol Scott
View author publications
You can also search for this author in PubMed Google Scholar
Jeffrey C. Barrett
View author publications
You can also search for this author in PubMed Google Scholar
Romke Koch
View author publications
You can also search for this author in PubMed Google Scholar
Gerd-Jörg Rauch
View author publications
You can also search for this author in PubMed Google Scholar
Simon White
View author publications
You can also search for this author in PubMed Google Scholar
William Chow
View author publications
You can also search for this author in PubMed Google Scholar
Britt Kilian
View author publications
You can also search for this author in PubMed Google Scholar
Leonor T. Quintais
View author publications
You can also search for this author in PubMed Google Scholar
José A. Guerra-Assunção
View author publications
You can also search for this author in PubMed Google Scholar
Yi Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Yong Gu
View author publications
You can also search for this author in PubMed Google Scholar
Jennifer Yen
View author publications
You can also search for this author in PubMed Google Scholar
Jan-Hinnerk Vogel
View author publications
You can also search for this author in PubMed Google Scholar
Tina Eyre
View author publications
You can also search for this author in PubMed Google Scholar
Seth Redmond
View author publications
You can also search for this author in PubMed Google Scholar
Ruby Banerjee
View author publications
You can also search for this author in PubMed Google Scholar
Jianxiang Chi
View author publications
You can also search for this author in PubMed Google Scholar
Beiyuan Fu
View author publications
You can also search for this author in PubMed Google Scholar
Elizabeth Langley
View author publications
You can also search for this author in PubMed Google Scholar
Sean F. Maguire
View author publications
You can also search for this author in PubMed Google Scholar
Gavin K. Laird
View author publications
You can also search for this author in PubMed Google Scholar
David Lloyd
View author publications
You can also search for this author in PubMed Google Scholar
Emma Kenyon
View author publications
You can also search for this author in PubMed Google Scholar
Sarah Donaldson
View author publications
You can also search for this author in PubMed Google Scholar
Harminder Sehra
View author publications
You can also search for this author in PubMed Google Scholar
Jeff Almeida-King
View author publications
You can also search for this author in PubMed Google Scholar
Jane Loveland
View author publications
You can also search for this author in PubMed Google Scholar
Stephen Trevanion
View author publications
You can also search for this author in PubMed Google Scholar
Matt Jones
View author publications
You can also search for this author in PubMed Google Scholar
Mike Quail
View author publications
You can also search for this author in PubMed Google Scholar
Dave Willey
View author publications
You can also search for this author in PubMed Google Scholar
Adrienne Hunt
View author publications
You can also search for this author in PubMed Google Scholar
John Burton
View author publications
You can also search for this author in PubMed Google Scholar
Sarah Sims
View author publications
You can also search for this author in PubMed Google Scholar
Kirsten McLay
View author publications
You can also search for this author in PubMed Google Scholar
Bob Plumb
View author publications
You can also search for this author in PubMed Google Scholar
Joy Davis
View author publications
You can also search for this author in PubMed Google Scholar
Chris Clee
View author publications
You can also search for this author in PubMed Google Scholar
Karen Oliver
View author publications
You can also search for this author in PubMed Google Scholar
Richard Clark
View author publications
You can also search for this author in PubMed Google Scholar
Clare Riddle
View author publications
You can also search for this author in PubMed Google Scholar
David Elliott
View author publications
You can also search for this author in PubMed Google Scholar
Glen Threadgold
View author publications
You can also search for this author in PubMed Google Scholar
Glenn Harden
View author publications
You can also search for this author in PubMed Google Scholar
Darren Ware
View author publications
You can also search for this author in PubMed Google Scholar
Sharmin Begum
View author publications
You can also search for this author in PubMed Google Scholar
Beverley Mortimore
View author publications
You can also search for this author in PubMed Google Scholar
Giselle Kerry
View author publications
You can also search for this author in PubMed Google Scholar
Paul Heath
View author publications
You can also search for this author in PubMed Google Scholar
Benjamin Phillimore
View author publications
You can also search for this author in PubMed Google Scholar
Alan Tracey
View author publications
You can also search for this author in PubMed Google Scholar
Nicole Corby
View author publications
You can also search for this author in PubMed Google Scholar
Matthew Dunn
View author publications
You can also search for this author in PubMed Google Scholar
Christopher Johnson
View author publications
You can also search for this author in PubMed Google Scholar
Jonathan Wood
View author publications
You can also search for this author in PubMed Google Scholar
Susan Clark
View author publications
You can also search for this author in PubMed Google Scholar
Sarah Pelan
View author publications
You can also search for this author in PubMed Google Scholar
Guy Griffiths
View author publications
You can also search for this author in PubMed Google Scholar
Michelle Smith
View author publications
You can also search for this author in PubMed Google Scholar
Rebecca Glithero
View author publications
You can also search for this author in PubMed Google Scholar
Philip Howden
View author publications
You can also search for this author in PubMed Google Scholar
Nicholas Barker
View author publications
You can also search for this author in PubMed Google Scholar
Christine Lloyd
View author publications
You can also search for this author in PubMed Google Scholar
Christopher Stevens
View author publications
You can also search for this author in PubMed Google Scholar
Joanna Harley
View author publications
You can also search for this author in PubMed Google Scholar
Karen Holt
View author publications
You can also search for this author in PubMed Google Scholar
Georgios Panagiotidis
View author publications
You can also search for this author in PubMed Google Scholar
Jamieson Lovell
View author publications
You can also search for this author in PubMed Google Scholar
Helen Beasley
View author publications
You can also search for this author in PubMed Google Scholar
Carl Henderson
View author publications
You can also search for this author in PubMed Google Scholar
Daria Gordon
View author publications
You can also search for this author in PubMed Google Scholar
Katherine Auger
View author publications
You can also search for this author in PubMed Google Scholar
Deborah Wright
View author publications
You can also search for this author in PubMed Google Scholar
Joanna Collins
View author publications
You can also search for this author in PubMed Google Scholar
Claire Raisen
View author publications
You can also search for this author in PubMed Google Scholar
Lauren Dyer
View author publications
You can also search for this author in PubMed Google Scholar
Kenric Leung
View author publications
You can also search for this author in PubMed Google Scholar
Lauren Robertson
View author publications
You can also search for this author in PubMed Google Scholar
Kirsty Ambridge
View author publications
You can also search for this author in PubMed Google Scholar
Daniel Leongamornlert
View author publications
You can also search for this author in PubMed Google Scholar
Sarah McGuire
View author publications
You can also search for this author in PubMed Google Scholar
Ruth Gilderthorp
View author publications
You can also search for this author in PubMed Google Scholar
Coline Griffiths
View author publications
You can also search for this author in PubMed Google Scholar
Deepa Manthravadi
View author publications
You can also search for this author in PubMed Google Scholar
Sarah Nichol
View author publications
You can also search for this author in PubMed Google Scholar
Gary Barker
View author publications
You can also search for this author in PubMed Google Scholar
Siobhan Whitehead
View author publications
You can also search for this author in PubMed Google Scholar
Michael Kay
View author publications
You can also search for this author in PubMed Google Scholar
Jacqueline Brown
View author publications
You can also search for this author in PubMed Google Scholar
Clare Murnane
View author publications
You can also search for this author in PubMed Google Scholar
Emma Gray
View author publications
You can also search for this author in PubMed Google Scholar
Matthew Humphries
View author publications
You can also search for this author in PubMed Google Scholar
Neil Sycamore
View author publications
You can also search for this author in PubMed Google Scholar
Darren Barker
View author publications
You can also search for this author in PubMed Google Scholar
David Saunders
View author publications
You can also search for this author in PubMed Google Scholar
Justene Wallis
View author publications
You can also search for this author in PubMed Google Scholar
Anne Babbage
View author publications
You can also search for this author in PubMed Google Scholar
Sian Hammond
View author publications
You can also search for this author in PubMed Google Scholar
Maryam Mashreghi-Mohammadi
View author publications
You can also search for this author in PubMed Google Scholar
Lucy Barr
View author publications
You can also search for this author in PubMed Google Scholar
Sancha Martin
View author publications
You can also search for this author in PubMed Google Scholar
Paul Wray
View author publications
You can also search for this author in PubMed Google Scholar
Andrew Ellington
View author publications
You can also search for this author in PubMed Google Scholar
Nicholas Matthews
View author publications
You can also search for this author in PubMed Google Scholar
Matthew Ellwood
View author publications
You can also search for this author in PubMed Google Scholar
Rebecca Woodmansey
View author publications
You can also search for this author in PubMed Google Scholar
Graham Clark
View author publications
You can also search for this author in PubMed Google Scholar
James D. Cooper
View author publications
You can also search for this author in PubMed Google Scholar
Anthony Tromans
View author publications
You can also search for this author in PubMed Google Scholar
Darren Grafham
View author publications
You can also search for this author in PubMed Google Scholar
Carl Skuce
View author publications
You can also search for this author in PubMed Google Scholar
Richard Pandian
View author publications
You can also search for this author in PubMed Google Scholar
Robert Andrews
View author publications
You can also search for this author in PubMed Google Scholar
Elliot Harrison
View author publications
You can also search for this author in PubMed Google Scholar
Andrew Kimberley
View author publications
You can also search for this author in PubMed Google Scholar
Jane Garnett
View author publications
You can also search for this author in PubMed Google Scholar
Nigel Fosker
View author publications
You can also search for this author in PubMed Google Scholar
Rebekah Hall
View author publications
You can also search for this author in PubMed Google Scholar
Patrick Garner
View author publications
You can also search for this author in PubMed Google Scholar
Daniel Kelly
View author publications
You can also search for this author in PubMed Google Scholar
Christine Bird
View author publications
You can also search for this author in PubMed Google Scholar
Sophie Palmer
View author publications
You can also search for this author in PubMed Google Scholar
Ines Gehring
View author publications
You can also search for this author in PubMed Google Scholar
Andrea Berger
View author publications
You can also search for this author in PubMed Google Scholar
Christopher M. Dooley
View author publications
You can also search for this author in PubMed Google Scholar
Zübeyde Ersan-Ürün
View author publications
You can also search for this author in PubMed Google Scholar
Cigdem Eser
View author publications
You can also search for this author in PubMed Google Scholar
Horst Geiger
View author publications
You can also search for this author in PubMed Google Scholar
Maria Geisler
View author publications
You can also search for this author in PubMed Google Scholar
Lena Karotki
View author publications
You can also search for this author in PubMed Google Scholar
Anette Kirn
View author publications
You can also search for this author in PubMed Google Scholar
Judith Konantz
View author publications
You can also search for this author in PubMed Google Scholar
Martina Konantz
View author publications
You can also search for this author in PubMed Google Scholar
Martina Oberländer
View author publications
You can also search for this author in PubMed Google Scholar
Silke Rudolph-Geiger
View author publications
You can also search for this author in PubMed Google Scholar
Mathias Teucke
View author publications
You can also search for this author in PubMed Google Scholar
Christa Lanz
View author publications
You can also search for this author in PubMed Google Scholar
Günter Raddatz
View author publications
You can also search for this author in PubMed Google Scholar
Kazutoyo Osoegawa
View author publications
You can also search for this author in PubMed Google Scholar
Baoli Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Amanda Rapp
View author publications
You can also search for this author in PubMed Google Scholar
Sara Widaa
View author publications
You can also search for this author in PubMed Google Scholar
Cordelia Langford
View author publications
You can also search for this author in PubMed Google Scholar
Fengtang Yang
View author publications
You can also search for this author in PubMed Google Scholar
Stephan C. Schuster
View author publications
You can also search for this author in PubMed Google Scholar
Nigel P. Carter
View author publications
You can also search for this author in PubMed Google Scholar
Jennifer Harrow
View author publications
You can also search for this author in PubMed Google Scholar
Zemin Ning
View author publications
You can also search for this author in PubMed Google Scholar
Javier Herrero
View author publications
You can also search for this author in PubMed Google Scholar
Steve M. J. Searle
View author publications
You can also search for this author in PubMed Google Scholar
Anton Enright
View author publications
You can also search for this author in PubMed Google Scholar
Robert Geisler
View author publications
You can also search for this author in PubMed Google Scholar
Ronald H. A. Plasterk
View author publications
You can also search for this author in PubMed Google Scholar
Charles Lee
View author publications
You can also search for this author in PubMed Google Scholar
Monte Westerfield
View author publications
You can also search for this author in PubMed Google Scholar
Pieter J. de Jong
View author publications
You can also search for this author in PubMed Google Scholar
Leonard I. Zon
View author publications
You can also search for this author in PubMed Google Scholar
John H. Postlethwait
View author publications
You can also search for this author in PubMed Google Scholar
Christiane Nüsslein-Volhard
View author publications
You can also search for this author in PubMed Google Scholar
Tim J. P. Hubbard
View author publications
You can also search for this author in PubMed Google Scholar
Hugues Roest Crollius
View author publications
You can also search for this author in PubMed Google Scholar
Jane Rogers
View author publications
You can also search for this author in PubMed Google Scholar
Derek L. Stemple
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

K.H., M.D.C., D.L.S., C.B., H.R.C., A.E. and K.M. wrote the manuscript and Supplementary Information. M.D.C., C.F.T., I.S., J.C.B., A.R., S.W. and C.Lang. produced the SATmap. Z.N. and Y.G. produced the WGS31 assembly. J.T., W.C. and C.F.T. generated the Zv9 assembly. Previous assemblies were produced by M.C., who developed the first assembly integration process, and by S.R., T.E. and I.S. coordinated by K.H. The analyses and figures for the manuscript were produced by J.T., K.H., C.B., M.M., J.H., L.T.Q., J.A.G.-A. and J.Y. K.A., J.W., S.P., J.C., G.T., G.H., G.G., P.H. and B.K. are involved in the ongoing improvement of the zebrafish genome assembly. Manual annotation was produced by G.K.L., D.L., E.K., S.D., H.S., J.A.-K. and J.L. and coordinated by J.H. and M.W. Automated annotation (Ensembl) was provided by J.E.C., S.W., J.-H.V., S.T. and S.M.J.S. The genome sequencing was carried out by C.C., K.M., S.M., C.S., J.C., B.F., E.L., S.F.M., M.J., M.Q., D.W., A.H., J.B., S.S., K.M., B.P., J..D., C.C., K.O., B.M., G.K., B.P., A.T., N.C., C.J., S.C., M.S., R.G., P.H., N.B., C.Lanz, C.S., J.H., K.H., G.P., J.L., H.B., C.H., D.G., D.W., C.R., L.D., K.L., L.R., K.A., D.L., S.M., R.G., C.G., D.M., S.N., G.B., S.W., M.K., J.B., C.M., E.G., M.H., N.S., D.B., D.S., J.W., A.B., S.H., K.O., M.M.-M., L.B., S.M., P.W., A.E., N.M., M.E., R.W., G.C., J.C., A.T., D.G., C.S., R.P., R.A., E.H., A.K., J.G., N.F., R.H., P.G., D.K., C.B. and S.P. The generation of maps used in the initial assemblies and the production of clone tiling paths were carried out by R.K., S.H., G.-J.R., Y.Z., C.R., R.C., D.E., D.W., S.B., L.M., M.D., I.G., A.B., C.M.D., Z.E.-Ü., C.E., H.G., M.G., L.K., A.K., J.K., M.K., M.O., S.R.-G., M.T., C.Lanz, G.R., S.C.S., R.B., F.Y., N.P.C., R.G., R.H.A.P. and C.Lee. K.O., B.Z. and P.J.d.J. generated and provided clone libraries. The Zebrafish Genome Project was coordinated by L.I.Z., J.H.P., C.N.-V., T.J.P.H., J.R. and D.L.S.

Corresponding author

Correspondence to Derek L. Stemple.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Supplementary information

Supplementary Information

This file contains Supplementary Text, Supplementary Tables 1-18, and Supplementary Figures 1-25 and A1-25- see Contents list for more details. (PDF 4709 kb)

PowerPoint slides

PowerPoint slide for Fig. 1

PowerPoint slide for Fig. 2

PowerPoint slide for Fig. 3

Rights and permissions

This article is distributed under the terms of the Creative Commons Attribution-Non-Commercial-Share Alike licence (http://creativecommons.org/licenses/by-nc-sa/3.0/), which permits distribution, and reproduction in any medium, provided the original author and source are credited. This licence does not permit commercial exploitation, and derivative works must be licensed under the same or similar licence.

Reprints and permissions

About this article

Cite this article

Howe, K., Clark, M., Torroja, C. et al. The zebrafish reference genome sequence and its relationship to the human genome. Nature 496, 498–503 (2013). https://doi.org/10.1038/nature12111

Download citation

Received: 23 August 2012
Accepted: 21 March 2013
Published: 17 April 2013
Issue Date: 25 April 2013
DOI: https://doi.org/10.1038/nature12111

This article is cited by

Understanding the pathophysiology of acute critical illness: translational lessons from zebrafish models
- Kensuke Fujii
- Kazuma Yamakawa
- Fumihito Ono
Intensive Care Medicine Experimental (2024)
Zebrafish xenograft as a tool for the study of colorectal cancer: a review
- Camilla Maria Fontana
- Hien Van Doan
Cell Death & Disease (2024)
Aiouea padiformis extract exhibits anti-inflammatory effects by inhibiting the ATPase activity of NLRP3
- Sumin Lee
- Qianying Ye
- Yong Hwan Park
Scientific Reports (2024)
Myeloid differentiation factor-2/LY96, a potential predictive biomarker of metastasis and poor outcomes in prostate cancer: clinical implications as a potential therapeutic target
- Marina G. Ferrari
- Alexis P. Jimenez-Uribe
- Adrian P. Mansini
Oncogene (2024)
Establishment of a zebrafish inbred strain, M-AB, capable of regular breeding and genetic manipulation
- Kenichiro Sadamitsu
- Fabien Velilla
- Noriyoshi Sakai
Scientific Reports (2024)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.