Published online 4 November 2009 | Nature 462, 21 (2009) | doi:10.1038/462021a


10,000 genomes to come

Vertebrates in line for massive sequencing project.

The anteater could have its genome sequenced.The anteater could have its genome sequenced.A. SHAH/NATUREPL.COM

Results are just beginning to arrive for the 1000 Genomes Project, a genomic study of human diversity. However, an international group is already planning something even more ambitious — a 10,000 genomes project. The initiative, called Genome 10K, aims to tackle thousands of vertebrate species.

Project members write this week in the Journal of Heredity that the effort will provide an unprecedented look at the genomic mechanisms for generating diversity in a group of animals with different lifestyles and adaptations.

"We see this stunning diversity of forms in vertebrates, from manatees to anteaters," says David Haussler of the University of California, Santa Cruz, one of the three masterminds behind the project, which involves 68 scientists from five continents. "What continually strikes me is the unbelievable malleability and adaptability of the vertebrate genome," he says, "and we have an enormous amount to learn about the genetic roots of that."

Haussler, Stephen O'Brien of the National Cancer Institute in Frederick, Maryland, and Oliver Ryder, director of genetics at the San Diego Zoo's Institute for Conservation Research in Escondido, California, decided to organize the project after realizing that one of the major obstacles to non-human sequencing projects has been collecting and organizing specimens. After an April meeting in Santa Cruz to bring in other scientists, the team now has a database of samples from more than 16,000 species from 50 institutions. The scientists are also planning a pilot project to sample portions of the genomes of a small subset of these species.

The group is looking for funding for the main phase of the project, which could cost anywhere from US$10 million to $100 million, depending on the costs to process and sequence each sample. The team anticipates that sequencing costs will drop below $10,000 per genome within a few years, making it feasible to sequence the entire genomes of 10,000 vertebrates within this budget.

However, analysing all the data will be an enormous challenge, because it is still not easy to assemble new genomes from the short readouts of DNA delivered by current sequencing technologies.

Other scientists say that the project sounds exciting, but question the decision to sequence so deeply solely among vertebrates.

David Maddison, who studies beetle phylogeny at Oregon State University in Corvallis, points out that, so far, only one beetle species has had its genome sequenced, despite there being about six times as many beetle species as there are vertebrates.

"My biggest concern is that if one were to decide where our funds should go — to 10,000 vertebrates or more generally scattered across organismal diversity — I would very strongly argue for the latter," says Maddison, who conceived the Tree of Life project, a web-based catalogue of biodiversity.

O'Brien agrees that invertebrate-genome sequencing is "a valuable area that should also be considered for whole-genome-sequence assessment". Other scientists are already planning a large invertebrate sequencing project.


Indeed, O'Brien expects Genome 10K to be the first study in what he anticipates will be a larger shift toward using sequencing technology to study biodiversity. He compares it to the culture shift that accompanied the invention of the printing press, which was first used to print the Bible and then for broader purposes.

"If the Human Genome Project is the Bible," he says, "then the [Genome 10K] is a library that gets filled up with other books." 


If you find something abusive or inappropriate or which does not otherwise comply with our Terms or Community Guidelines, please select the relevant 'Report this comment' link.

Comments on this thread are vetted after posting.

  • #60095

    Fun part about this is the whole certainty aspect of it – you are supposed to find the oldest ancestor. But the parent/child mutation is completely symetric, so the best you can do is assymtopically approach 0.33* certainty for the three oldest bitvectors.I had worked on this one when it first came out.

Commenting is now closed.