Estimate of human gene number provided by genome-wide analysis using Tetraodon nigroviridis DNA sequence

Roest Crollius, Hugues; Jaillon, Olivier; Bernot, Alain; Dasilva, Corinne; Bouneau, Laurence; Fischer, Cécile; Fizames, Cécile; Wincker, Patrick; Brottier, Philippe; Quétier, Francis; Saurin, William; Weissenbach, Jean

doi:10.1038/76118

Letter
Published: 01 June 2000

Estimate of human gene number provided by genome-wide analysis using Tetraodon nigroviridis DNA sequence

Hugues Roest Crollius¹,
Olivier Jaillon¹,
Alain Bernot¹,
Corinne Dasilva¹,
Laurence Bouneau¹,
Cécile Fischer¹,
Cécile Fizames¹,
Patrick Wincker¹,
Philippe Brottier¹,
Francis Quétier¹,
William Saurin¹ &
…
Jean Weissenbach¹

Nature Genetics volume 25, pages 235–238 (2000)Cite this article

611 Accesses
244 Citations
19 Altmetric
Metrics details

Abstract

The number of genes in the human genome is unknown, with estimates ranging from 50,000 to 90,000 (refs 1, 2), and to more than 140,000 according to unpublished sources. We have developed ‘Exofish’, a procedure based on homology searches, to identify human genes quickly and reliably. This method relies on the sequence of another vertebrate, the pufferfish Tetraodon nigroviridis, to detect conserved sequences with a very low background. Similar to Fugu rubripes , a marine pufferfish proposed by Brenner et al.³ as a model for genomic studies, T. nigroviridis is a more practical alternative⁴ with a genome also eight times more compact than that of human. Many comparisons have been made between F. rubripes and human DNA that demonstrate the potential of comparative genomics using the pufferfish genome⁵. Application of Exofish to the December version of the working draft sequence of the human genome and to Unigene showed that the human genome contains 28,000–34,000 genes, and that Unigene contains less than 40% of the protein-coding fraction of the human genome.

Access through your institution

Buy or subscribe

This is a preview of subscription content, access via your institution

Access options

Access through your institution

Buy this article

Purchase on Springer Link
Instant access to full article PDF

Buy now

Prices may be subject to local taxes which are calculated during checkout

**Figure 3: Examples of chromosome 22 results.**

**Figure 4: Distribution of gene and ecores on individual human chromosomes according to the EST physical map⁸ and Exofish.**

Gap-free genome assembly of anadromous Coilia nasus

Article Open access 06 June 2023

Fengjiao Ma, Yinping Wang, … Kai Liu

Genome sequences of Tropheus moorii and Petrochromis trewavasae, two eco-morphologically divergent cichlid fishes endemic to Lake Tanganyika

Article Open access 22 February 2021

C. Fischer, S. Koblmüller, … C. Sturmbauer

Widespread patterns of gene loss in the evolution of the animal kingdom

Article 24 February 2020

Cristina Guijarro-Clarke, Peter W. H. Holland & Jordi Paps

Accession codes

Accessions

GenBank/EMBL/DDBJ

References

Fields, C., Adams, M.D., White, O. & Venter, J.C. How many genes in the human genome? Nature Genet. 7, 345 –346 (1994).
Article CAS Google Scholar
Antequera, F. & Bird, A. Number of CpG islands and genes in human and mouse. Proc. Natl Acad. Sci. USA 90, 11995–11999 (1993).
Article CAS Google Scholar
Brenner, S. et al. Characterization of the pufferfish (Fugu) genome as a compact model vertebrate genome. Nature 366, 265 –268 (1993).
Article CAS Google Scholar
Crnogorac-Jurcevic, T., Brown, J.R., Lehrach, H. & Schalkwyk, L.C. Tetraodon fluviatilis, a new puffer fish model for genome studies. Genomics 41, 177–184 ( 1997).
Article CAS Google Scholar
Elgar, G. et al. Generation and analysis of 25 Mb of genomic DNA from the pufferfish Fugu rubripes by sequence scanning. Genome Res. 9, 960–971 (1999).
Article Google Scholar
Schuler, G.D. et al. A gene map of the human genome. Science 274, 540–546 (1996).
Article CAS Google Scholar
Dunham, I. et al. The DNA sequence of human chromosome 22. Nature 402, 489–495 (1999).
Article CAS Google Scholar
Deloukas, P. et al. A physical map of 30,000 human genes. Science 282, 744–746 (1998).
Article CAS Google Scholar
Roest Crollius, H. et al. Characterization and repeat analysis of the compact genome of the freswater pufferfish Tetraodon nigroviridis. Genome Res . (in press).
Jin, L., Zhong, Y. & Chakraborty, R. The exact numbers of possible microsatellite motifs . Am. J. Hum. Genet. 55, 582– 583 (1994).
CAS PubMed PubMed Central Google Scholar
Benson, G. Tandem repeats finder: a program to analyze DNA sequences. Nucleic Acids Res. 27, 573–580 ( 1999).
Article CAS Google Scholar
Altschul, S.F., Gish, W., Miller, W., Myers, E.W. & Lipman, D.J. Basic local alignment search tool. J. Mol. Biol. 215, 403–410 ( 1990).
Article CAS Google Scholar
Smith, T.F. & Waterman, M.S. Identification of common molecular subsequences. J. Mol. Biol. 147, 195– 197 (1981).
Article CAS Google Scholar
Glemet, E. & Codani, J. LASSAP, a large scale sequence comparisons package. Comput. Appl. Biosci. 13, 137– 143 (1997).
CAS PubMed Google Scholar

Download references

Acknowledgements

We thank the sequencing and template preparation team at Genoscope; Sun Microsystems for access to the SUN benchmark centre; and F. Francis for critical reading of the manuscript. This work would not have been possible without the public availability of a large fraction of the sequence of the human genome, and we thank all contributing genome centres.

Author information

Authors and Affiliations

Genoscope and CNRS FRE2231, Evry cedex, France
Hugues Roest Crollius, Olivier Jaillon, Alain Bernot, Corinne Dasilva, Laurence Bouneau, Cécile Fischer, Cécile Fizames, Patrick Wincker, Philippe Brottier, Francis Quétier, William Saurin & Jean Weissenbach

Authors

Hugues Roest Crollius
View author publications
You can also search for this author in PubMed Google Scholar
Olivier Jaillon
View author publications
You can also search for this author in PubMed Google Scholar
Alain Bernot
View author publications
You can also search for this author in PubMed Google Scholar
Corinne Dasilva
View author publications
You can also search for this author in PubMed Google Scholar
Laurence Bouneau
View author publications
You can also search for this author in PubMed Google Scholar
Cécile Fischer
View author publications
You can also search for this author in PubMed Google Scholar
Cécile Fizames
View author publications
You can also search for this author in PubMed Google Scholar
Patrick Wincker
View author publications
You can also search for this author in PubMed Google Scholar
Philippe Brottier
View author publications
You can also search for this author in PubMed Google Scholar
Francis Quétier
View author publications
You can also search for this author in PubMed Google Scholar
William Saurin
View author publications
You can also search for this author in PubMed Google Scholar
Jean Weissenbach
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Jean Weissenbach.

Supplementary information

Table 1 (PDF 34 kb)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Roest Crollius, H., Jaillon, O., Bernot, A. et al. Estimate of human gene number provided by genome-wide analysis using Tetraodon nigroviridis DNA sequence. Nat Genet 25, 235–238 (2000). https://doi.org/10.1038/76118

Download citation

Received: 10 March 2000
Accepted: 02 May 2000
Issue Date: 01 June 2000
DOI: https://doi.org/10.1038/76118

This article is cited by

Fish genomics and its impact on fundamental and applied research of vertebrate biology
- Syed Farhan Ahmad
- Maryam Jehangir
- Cesar Martins
Reviews in Fish Biology and Fisheries (2022)
The Tetraodon nigroviridis reference transcriptome: developmental transition, length retention and microsynteny of long non-coding RNAs in a compact vertebrate genome
- Swaraj Basu
- Yavor Hadzhiev
- Ferenc Müller
Scientific Reports (2016)
Macroglial cells of the teleost central nervous system: a survey of the main types
- Barbara Cuoghi
- Lucrezia Mola
Cell and Tissue Research (2009)
Addressing chromosome evolution in the whole-genome sequence era
- Thomas Faraut
Chromosome Research (2008)
The grapevine genome sequence suggests ancestral hexaploidization in major angiosperm phyla

Nature (2007)

Estimate of human gene number provided by genome-wide analysis using Tetraodon nigroviridis DNA sequence

Abstract

Access options

Similar content being viewed by others

Gap-free genome assembly of anadromous Coilia nasus

Genome sequences of Tropheus moorii and Petrochromis trewavasae, two eco-morphologically divergent cichlid fishes endemic to Lake Tanganyika

Widespread patterns of gene loss in the evolution of the animal kingdom

Accession codes

Accessions

GenBank/EMBL/DDBJ

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Supplementary information

Table 1 (PDF 34 kb)

Rights and permissions

About this article

Cite this article

This article is cited by

Fish genomics and its impact on fundamental and applied research of vertebrate biology

The Tetraodon nigroviridis reference transcriptome: developmental transition, length retention and microsynteny of long non-coding RNAs in a compact vertebrate genome

Macroglial cells of the teleost central nervous system: a survey of the main types

Addressing chromosome evolution in the whole-genome sequence era

The grapevine genome sequence suggests ancestral hexaploidization in major angiosperm phyla

How to count…human genes

Search

Quick links

Abstract

Access options

Similar content being viewed by others

Accession codes

Accessions

GenBank/EMBL/DDBJ

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Quick links