A systematic genome-wide analysis of zebrafish protein-coding gene function

Kettleborough, Ross N. W.; Busch-Nentwich, Elisabeth M.; Harvey, Steven A.; Dooley, Christopher M.; de Bruijn, Ewart; van Eeden, Freek; Sealy, Ian; White, Richard J.; Herd, Colin; Nijman, Isaac J.; Fényes, Fruzsina; Mehroke, Selina; Scahill, Catherine; Gibbons, Richard; Wali, Neha; Carruthers, Samantha; Hall, Amanda; Yen, Jennifer; Cuppen, Edwin; Stemple, Derek L.

doi:10.1038/nature11992

Letter
Published: 17 April 2013

Nature volume 496, pages 494–497 (2013)

Subjects

Development

Abstract

Since the publication of the human reference genome, the identities of specific genes associated with human diseases are being discovered at a rapid rate. A central problem is that the biological activity of these genes is often unclear. Detailed investigations in model vertebrate organisms, typically mice, have been essential for understanding the activities of many orthologues of these disease-associated genes. Although gene-targeting approaches^1,2,3 and phenotype analysis have led to a detailed understanding of nearly 6,000 protein-coding genes^3,4, this number falls considerably short of the more than 22,000 mouse protein-coding genes^5. Similarly, in zebrafish genetics, one-by-one gene studies using positional cloning^6, insertional mutagenesis^7,8,9, antisense morpholino oligonucleotides^10, targeted re-sequencing^11,12,13, and zinc finger and TAL endonucleases^14,15,16,17 have made substantial contributions to our understanding of the biological activity of vertebrate genes, but again the number of genes studied falls well short of the more than 26,000 zebrafish protein-coding genes^18. Importantly, for both mice and zebrafish, none of these strategies are particularly suited to the rapid generation of knockouts in thousands of genes and the assessment of their biological activity. Here we describe an active project that aims to identify and phenotype the disruptive mutations in every zebrafish protein-coding gene, using a well-annotated zebrafish reference genome sequence^18,19, high-throughput sequencing and efficient chemical mutagenesis. So far we have identified potentially disruptive mutations in more than 38% of all known zebrafish protein-coding genes. We have developed a multi-allelic phenotyping scheme to efficiently assess the effects of each allele during embryogenesis and have analysed the phenotypic consequences of over 1,000 alleles. All mutant alleles and data are available to the community and our phenotyping scheme is adaptable to phenotypic analysis beyond embryogenesis.

**Figure 3: Phenotypic analysis of alleles.**

**Figure 4: Confirmation of causality through complementation crosses.**

Accession codes

Accessions

European Nucleotide Archive

ERP000426

Data deposits

Detailed information on alleles and their availability can be found online (http://www.sanger.ac.uk/Projects/D_rerio/zmp/). All sequencing data are deposited in the European Molecular Biology Laboratory (EMBL) European Nucleotide Archive under accession ERP000426.

References

1. Gossler, A., Joyner, A. L., Rossant, J. & Skarnes, W. C. Mouse embryonic stem cells and reporter constructs to detect developmentally regulated genes. Science 244, 463–465 (1989)
2. Skarnes, W. C., Auerbach, B. A. & Joyner, A. L. A gene trap approach in mouse embryonic stem cells: the lacZ reported is activated by splicing, reflects endogenous gene expression, and is mutagenic in mice. Genes Dev. 6, 903–918 (1992)
3. Ringwald, M. et al. The IKMC web portal: a central point of entry to data and resources from the International Knockout Mouse Consortium. Nucleic Acids Res. 39, D849–D855 (2011)
4. Church, D. M. et al. Lineage-specific biology revealed by a finished genome assembly of the mouse. PLoS Biol. 7, e1000112 (2009)
5. Chinwalla, A. T. et al. Initial sequencing and comparative analysis of the mouse genome. Nature 420, 520–562 (2002)
6. Zhang, J., Talbot, W. S. & Schier, A. F. Positional cloning identifies zebrafish one-eyed pinhead as a permissive EGF-related ligand required during gastrulation. Cell 92, 241–251 (1998)
7. Golling, G. et al. Insertional mutagenesis in zebrafish rapidly identifies genes essential for early vertebrate development. Nature Genet. 31, 135–140 (2002)
8. Amsterdam, A. et al. Identification of 315 genes essential for early zebrafish development. Proc. Natl Acad. Sci. USA 101, 12792–12797 (2004)
9. Sun, Z. et al. A genetic screen in zebrafish identifies cilia genes as a principal cause of cystic kidney. Development 131, 4085–4093 (2004)
10. Nasevicius, A. & Ekker, S. C. Effective targeted gene 'knockdown' in zebrafish. Nature Genet. 26, 216–220 (2000)
11. Kettleborough, R. N., Bruijn, E., Eeden, F., Cuppen, E. & Stemple, D. L. High-throughput target-selected gene inactivation in zebrafish. Methods Cell Biol. 104, 121–127 (2011)
12. Sood, R. et al. Methods for reverse genetic screening in zebrafish by resequencing and TILLING. Methods 39, 220–227 (2006)
13. Wienholds, E. et al. Efficient target-selected mutagenesis in zebrafish. Genome Res. 13, 2700–2707 (2003)
14. Meng, X., Noyes, M. B., Zhu, L. J., Lawson, N. D. & Wolfe, S. A. Targeted gene inactivation in zebrafish using engineered zinc-finger nucleases. Nature Biotechnol. 26, 695–701 (2008)
15. Doyon, Y. et al. Heritable targeted gene disruption in zebrafish using designed zinc-finger nucleases. Nature Biotechnol. 26, 702–708 (2008)
16. Huang, P. et al. Heritable gene targeting in zebrafish using customized TALENs. Nature Biotechnol. 29, 699–700 (2011)
17. Sander, J. D. et al. Targeted gene disruption in somatic zebrafish cells using engineered TALENs. Nature Biotechnol. 29, 697–698 (2011)
18. Howe, K. et al. The zebrafish reference genome sequence and its relationship to the human genome. Nature (in the press)
19. Collins, J. E., White, S., Searle, S. M. & Stemple, D. L. Incorporating RNA-seq data into the zebrafish ensembl gene build. Genome Res. 22, 2067–2078 (2012)
20. Stemple, D. L. TILLING–a high-throughput harvest for functional genomics. Nature Rev. Genet. 5, 145–150 (2004)
21. The 1000 Genome Project Consortium. A map of human genome variation from population-scale sequencing. Nature 467, 1061–1073 (2011)
22. The 1000 Genome Project Consortium. An integrated map of genetic variation from 1,092 human genomes. Nature 491, 56–65 (2012)
23. Ayadi, A. et al. Mouse large-scale phenotyping initiatives: overview of the European Mouse Disease Clinic (EUMODIC) and of the Wellcome Trust Sanger Institute Mouse Genetics Project. Mamm. Genome 23, 600–610 (2012)
24. Seiler, C. et al. Myosin VI is required for structural integrity of the apical surface of sensory hair cells in zebrafish. Dev. Biol. 272, 328–338 (2004)
25. Manfroid, I. et al. Zebrafish sox9b is crucial for hepatopancreatic duct development and pancreatic endocrine cell regeneration. Dev. Biol. 366, 268–278 (2012)
26. Steffen, L. S. et al. The zebrafish runzel muscular dystrophy is linked to the titin gene. Dev. Biol. 309, 180–192 (2007)
27. Haffter, P. et al. The identification of genes with unique and essential functions in the development of the zebrafish, Danio rerio. Development 123, 1–36 (1996)
28. Driever, W. et al. A genetic screen for mutations affecting embryogenesis in zebrafish. Development 123, 37–46 (1996)
29. Nielsen, R., Paul, J. S., Albrechtsen, A. & Song, Y. S. Genotype and SNP calling from next-generation sequencing data. Nature Rev. Genet. 12, 443–451 (2011)
30. McLaren, W. et al. Deriving the consequences of genomic variants with the Ensembl API and SNP Effect Predictor. Bioinformatics 26, 2069–2070 (2010)

Acknowledgements

We thank P. Ellis, E. Markham, H. van Roekel, P. Toonen, J. van de Belt, J. Mudde and S. Widaa for technical assistance, and B. Novak, Y. Yi and E. LeProust from Agilent Technologies. We thank everyone at The Zebrafish International Resource Center and the European Zebrafish Resource Center for stocking and distributing alleles. We thank members of the Wellcome Trust Sanger Institute RSF and DNA pipelines. We also thank G. Powell, J. Collins and F. L. Marlow for critical reading of the manuscript. This work was funded through a core grant to the Sanger Institute by the Wellcome Trust (grant number 098051), the US National Institutes of Health (5R01HG004819), the EU Sixth Framework Programme (ZF-MODELS, contract number LSHG-CT-2003-503496) and the EU Seventh Framework Programme (ZF-HEALTH). F.v.E. is supported by the UK Medical Research Council (grant number G0777791) and E.C. is supported by the SmartMix program (SSM06010) from the Dutch government.

Author information

Ross N. W. Kettleborough, Elisabeth M. Busch-Nentwich and Steven A. Harvey: These authors contributed equally to this work.

Authors and Affiliations

Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, CB10 1SA, Cambridge, UK
Ross N. W. Kettleborough, Elisabeth M. Busch-Nentwich, Steven A. Harvey, Christopher M. Dooley, Ian Sealy, Richard J. White, Colin Herd, Fruzsina Fényes, Selina Mehroke, Catherine Scahill, Richard Gibbons, Neha Wali, Samantha Carruthers, Amanda Hall, Jennifer Yen & Derek L. Stemple
Hubrecht Institute, KNAW and University Medical Center Utrecht, Uppsalalaan 8, 3584 CT Utrecht, The Netherlands
Ewart de Bruijn, Isaac J. Nijman & Edwin Cuppen
MRC-CDBG/Department of Biomedical Science, The University of Sheffield, Western Bank, S10 2TN, Sheffield, UK
Freek van Eeden

Contributions

R.N.W.K., E.M.B.-N. and S.A.H. initiated, organized and executed the work (equal contributions). R.N.W.K., E.M.B.-N., S.A.H. and D.L.S. designed the experiments. R.N.W.K., E.M.B.-N. and S.A.H. wrote the manuscript with assistance from I.S., R.J.W., C.M.D. and D.L.S. Mutagenesis was carried out by R.N.W.K., F.v.E. and E.d.B. The mutation analysis pipeline was developed by I.S. and I.J.N., and maintained by R.J.W. C.H. and F.F. implemented and improved genotyping procedures; F.F. and E.M.B.-N. developed the cryopreservation procedure; S.A.H., S.M., C.S., C.M.D. and N.W. carried out the phenotyping; J.Y. designed and tested the first Agilent SureSelect exome set; and R.G. helped to maintain and distribute alleles. S.C. and A.H. provided assistance for cryopreservation and genotyping. E.C. and D.L.S. collaborated in the initiation, design and process development of the project. All authors read the manuscript and provided comments.

Corresponding authors

Correspondence to Edwin Cuppen or Derek L. Stemple.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Supplementary information

Supplementary Information

This file contains Supplementary Tables 1-3 and guidance notes for the Zebrafish Mutation Project website. (PDF 346 kb)

PowerPoint slides

PowerPoint slide for Fig. 1

PowerPoint slide for Fig. 2

PowerPoint slide for Fig. 3

PowerPoint slide for Fig. 4

About this article

Cite this article

Kettleborough, R., Busch-Nentwich, E., Harvey, S. et al. A systematic genome-wide analysis of zebrafish protein-coding gene function. Nature 496, 494–497 (2013). https://doi.org/10.1038/nature11992

Received: 18 August 2012
Accepted: 07 February 2013
Published: 17 April 2013
Issue Date: 25 April 2013
DOI: https://doi.org/10.1038/nature11992
