Of trees and networks

Vernikos, Georgios S.

doi:10.1038/nrmicro2227

Genome Watch
Published: October 2009

Georgios S. Vernikos¹

Nature Reviews Microbiology volume 7, page 691 (2009)

Abstract

This month's Genome Watch discusses the limitations of strictly binary classification models in the study of bacterial populations.

Main

When the first steps were made towards understanding the principles that govern biological systems, scientists made simplistic assumptions about biological concepts to reduce the complexity of their hypotheses and draw interpretable conclusions. In the past few years, the transition from single-isolate genomics to comparative genomics of entire microbial populations has introduced new parameters that question or even threaten to reject those initial assumptions. For example, the current recognition of increased microbial genome fluidity indicates that the fundamental definition of a biological species¹ fails in some cases to provide a realistic description of the dynamic relationships that shape microbial evolution. These findings do not support the strictly bifurcating tree of life as a means of phylogenetic analysis and instead favour the more realistic model of a phylogenetic network², which better represents the true relationships among species that are characterized by high rates of DNA exchange³,⁴,⁵,⁶.

The first data to support this model came from the genomic analysis of the obligate intracellular bacterium Wolbachia pipientis. Klasson et al.⁷ compared 450 genes shared by three W. pipientis strains (W. pipientis wRi, W. pipientis wMel and W. pipientis wUni) that infect Drosophila simulans, Drosophila melanogaster and Muscidifurax uniraptor, respectively. Approximately 30% of core genes indicated that W. pipientis wMel and W. pipientis wRi are sister lineages, a different ∼30% supported the W. pipientis wMel and W. pipientis wUni sister phylogeny and 20% showed that W. pipientis wRi and W. pipientis wUni are the more closely related pair. The authors concluded that the high rates of intra-species recombination in W. pipientis do not allow a one-to-one relationship between gene history, genome history and strain phenotype. This suggests that W. pipientis is a mixture of subpopulations, and that strains in the same subpopulation recombine more frequently with each other than with strains outside it.
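The gene-by-gene tally described above can be sketched in a few lines of Python. This is only a minimal illustration and not the procedure used by Klasson et al.: it assumes that pairwise distances among the three strains are already available for each shared gene, and it simply treats the closest pair as the sister lineages. The function names and the toy distances are hypothetical.

```python
from collections import Counter


def sister_pair(distances):
    """Return the strain pair with the smallest pairwise distance.

    `distances` maps a (strain, strain) tuple to a distance for one gene.
    With three taxa, the closest pair is taken as the sister pair
    (a simplifying assumption; ties go to the first pair encountered).
    """
    return min(distances, key=distances.get)


def topology_support(genes):
    """Fraction of genes that support each possible sister pair."""
    counts = Counter(sister_pair(d) for d in genes.values())
    total = sum(counts.values())
    return {pair: n / total for pair, n in counts.items()}


# Hypothetical toy distances for four genes; NOT data from the study.
toy_genes = {
    "geneA": {("wMel", "wRi"): 0.01, ("wMel", "wUni"): 0.03, ("wRi", "wUni"): 0.04},
    "geneB": {("wMel", "wRi"): 0.02, ("wMel", "wUni"): 0.05, ("wRi", "wUni"): 0.05},
    "geneC": {("wMel", "wRi"): 0.04, ("wMel", "wUni"): 0.01, ("wRi", "wUni"): 0.03},
    "geneD": {("wMel", "wRi"): 0.05, ("wMel", "wUni"): 0.04, ("wRi", "wUni"): 0.02},
}

support = topology_support(toy_genes)
for pair, fraction in sorted(support.items()):
    print(pair, fraction)
```

When no single pairing is supported by a clear majority of genes, as in the W. pipientis comparison, a single bifurcating tree cannot summarize the history of all genes at once.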

In the second example, Didelot et al.⁸ compared the genomes of eight serovars of Salmonella enterica to identify blocks of high or low similarity. Their data showed that in all but one pairwise comparison the distribution of sequence divergence is unimodal. However, in the case of S. enterica subsp. enterica serovar Paratyphi A and S. enterica subsp. enterica serovar Typhi, the distribution showed two peaks corresponding to regions of high (1.2%) and low (0.18%) sequence divergence. Overall, in 75% of their DNA sequences the two serovars appear to be distantly related isolates of S. enterica, and in the remaining 25% they resemble sister lineages. The authors suggest that this apparent relatedness is the result of more than 100 recombination events that took place over a recent, restricted time span.
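The bimodal pattern can be made concrete with a small sketch. The code below is an illustration under stated assumptions and not the statistical model of Didelot et al.: it assigns each genome block to whichever of the two reported peaks (1.2% or 0.18% divergence) is nearer, and it computes the genome-wide average implied by a 75%/25% mixture of the two peaks. The block values and function names are hypothetical.

```python
HIGH_PEAK = 1.2   # % sequence divergence of the high peak (from the text)
LOW_PEAK = 0.18   # % sequence divergence of the low peak (from the text)


def nearest_peak(divergence):
    """Label a block 'low' or 'high' according to the nearer peak."""
    if abs(divergence - LOW_PEAK) < abs(divergence - HIGH_PEAK):
        return "low"
    return "high"


def peak_fractions(block_divergences):
    """Fraction of blocks assigned to each peak."""
    labels = [nearest_peak(d) for d in block_divergences]
    return {name: labels.count(name) / len(labels) for name in ("high", "low")}


def mixture_mean(frac_high=0.75, frac_low=0.25):
    """Average divergence implied by mixing the two peaks."""
    return frac_high * HIGH_PEAK + frac_low * LOW_PEAK


# Hypothetical per-block divergences (%); NOT data from the study.
toy_blocks = [1.1, 1.3, 0.2, 1.25, 1.0, 0.15, 1.4, 1.2]

print(peak_fractions(toy_blocks))
print(round(mixture_mean(), 3))
```

The mixture average (0.75 × 1.2% + 0.25 × 0.18% ≈ 0.945%) matches neither peak, which is why a single genome-wide distance would misrepresent both classes of sequence.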

A similar pattern of genome mosaicism is seen in Pseudomonas fluorescens. Silby et al.⁹ sequenced the genomes of two P. fluorescens strains (SBW25 and Pf0-1) and compared them with that of P. fluorescens Pf-5. The comparison yielded a shared core set of ∼3,600 protein-coding genes, which corresponds to only ∼60% of genes in each of the three genomes. By contrast, a similar analysis of five isolates of Pseudomonas aeruginosa gave a core set of almost 5,000 genes, with only 1–8% of protein-coding genes being strain specific. Despite this diversity, a comparison of the three P. fluorescens strains and P. aeruginosa PAO1 showed that almost 24% and 35% of the genes place P. fluorescens SBW25 closest to P. fluorescens Pf-5 and P. fluorescens Pf0-1, respectively, and 37% put P. fluorescens Pf0-1 in the same node as P. fluorescens Pf-5, suggesting that there has been extensive genetic recombination between these strains despite their extreme diversity.
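The notion of a shared core set can be illustrated as a set intersection. The sketch below is not the orthology pipeline of Silby et al.; it assumes that genes have already been grouped into families, so that each genome can be represented as a set of family identifiers. The identifiers and function names are hypothetical.

```python
def core_genome(genomes):
    """Gene families present in every genome.

    `genomes` maps a strain name to a set of gene-family identifiers.
    """
    return set.intersection(*genomes.values())


def core_fraction(genomes):
    """Fraction of each genome's gene families that belong to the core."""
    core = core_genome(genomes)
    return {name: len(core) / len(families) for name, families in genomes.items()}


# Hypothetical toy genomes; NOT the real gene content of these strains.
toy_genomes = {
    "SBW25": {"f1", "f2", "f3", "f4", "f5"},
    "Pf0-1": {"f1", "f2", "f3", "f6", "f7"},
    "Pf-5": {"f1", "f2", "f3", "f8", "f9", "f10"},
}

print(sorted(core_genome(toy_genomes)))
print(core_fraction(toy_genomes))
```

A low core fraction, such as the ∼60% reported for P. fluorescens, means that a large part of each genome is absent from at least one of the other strains.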

These three examples show that, in the case of highly mosaic genomes, traditional models for analysing the history of microorganisms are not applicable. Methodologies that tailor the model to the data, rather than the data to the model, offer a more realistic approximation of microbial diversity and complexity.

References

1. Mayr, E. Systematics and the Origin of Species (Columbia University Press, New York, 1942).
2. Huson, D. H. & Bryant, D. Application of phylogenetic networks in evolutionary studies. Mol. Biol. Evol. 23, 254–267 (2006).
3. Doolittle, W. F. Lateral genomics. Trends Cell Biol. 9, M5–M8 (1999).
4. Doolittle, W. F. Phylogenetic classification and the universal tree. Science 284, 2124–2129 (1999).
5. Gogarten, J. P. & Townsend, J. P. Horizontal gene transfer, genome innovation and evolution. Nature Rev. Microbiol. 3, 679–687 (2005).
6. Kunin, V., Goldovsky, L., Darzentas, N. & Ouzounis, C. A. The net of life: reconstructing the microbial phylogenetic network. Genome Res. 15, 954–959 (2005).
7. Klasson, L. et al. The mosaic genome structure of the Wolbachia wRi strain infecting Drosophila simulans. Proc. Natl Acad. Sci. USA 106, 5725–5730 (2009).
8. Didelot, X., Achtman, M., Parkhill, J., Thomson, N. R. & Falush, D. A bimodal pattern of relatedness between the Salmonella Paratyphi A and Typhi genomes: convergence or divergence by homologous recombination? Genome Res. 17, 61–68 (2007).
9. Silby, M. W. et al. Genomic and genetic analyses of diversity and plant interactions of Pseudomonas fluorescens. Genome Biol. 10, R51 (2009).

Author information

Authors and Affiliations

Georgios S. Vernikos is at the Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK. microbes@sanger.ac.uk

About this article

Cite this article

Vernikos, G. Of trees and networks. Nat Rev Microbiol 7, 691 (2009). https://doi.org/10.1038/nrmicro2227

Issue Date: October 2009
DOI: https://doi.org/10.1038/nrmicro2227

Related links

DATABASES

Entrez Genome Project
