Unusual biology across a group comprising more than 15% of domain Bacteria

Brown, Christopher T.; Hug, Laura A.; Thomas, Brian C.; Sharon, Itai; Castelle, Cindy J.; Singh, Andrea; Wilkins, Michael J.; Wrighton, Kelly C.; Williams, Kenneth H.; Banfield, Jillian F.

doi:10.1038/nature14486

Letter
Published: 15 June 2015

Unusual biology across a group comprising more than 15% of domain Bacteria

Christopher T. Brown¹,
Laura A. Hug²,
Brian C. Thomas²,
Itai Sharon²,
Cindy J. Castelle²,
Andrea Singh²,
Michael J. Wilkins^3,4,
Kelly C. Wrighton⁴,
Kenneth H. Williams⁵ &
…
Jillian F. Banfield^2,5,6

Nature volume 523, pages 208–211 (2015)Cite this article

44k Accesses
686 Citations
315 Altmetric
Metrics details

Subjects

This article has been updated

Abstract

A prominent feature of the bacterial domain is a radiation of major lineages that are defined as candidate phyla because they lack isolated representatives. Bacteria from these phyla occur in diverse environments¹ and are thought to mediate carbon and hydrogen cycles². Genomic analyses of a few representatives suggested that metabolic limitations have prevented their cultivation^2,3,4,5,6. Here we reconstructed 8 complete and 789 draft genomes from bacteria representing >35 phyla and documented features that consistently distinguish these organisms from other bacteria. We infer that this group, which may comprise >15% of the bacterial domain, has shared evolutionary history, and describe it as the candidate phyla radiation (CPR). All CPR genomes are small and most lack numerous biosynthetic pathways. Owing to divergent 16S ribosomal RNA (rRNA) gene sequences, 50–100% of organisms sampled from specific phyla would evade detection in typical cultivation-independent surveys. CPR organisms often have self-splicing introns and proteins encoded within their rRNA genes, a feature rarely reported in bacteria. Furthermore, they have unusual ribosome compositions. All are missing a ribosomal protein often absent in symbionts, and specific lineages are missing ribosomal proteins and biogenesis factors considered universal in bacteria. This implies different ribosome structures and biogenesis mechanisms, and underlines unusual biology across a large part of the bacterial domain.

Access through your institution

Buy or subscribe

This is a preview of subscription content, access via your institution

Access options

Access through your institution

Buy this article

Purchase on Springer Link
Instant access to full article PDF

Buy now

Prices may be subject to local taxes which are calculated during checkout

**Figure 1: Phylogeny and genomic sampling of the CPR.**

**Figure 2: Features of insertions encoded within CPR 16S rRNA genes.**

**Figure 3: Intron-encoding 16S rRNA gene from complete Microgenomates genome.**

Phylogenomics and the rise of the angiosperms

Article Open access 24 April 2024

Unveiling microbial diversity: harnessing long-read sequencing technology

Article 30 April 2024

Genomes of multicellular algal sisters to land plants illuminate signaling network evolution

Article Open access 01 May 2024

Accession codes

Primary accessions

BioProject

PRJNA273161

Sequence Read Archive

SRP050083

Data deposits

DNA and RNA sequences have been deposited in the NCBI Sequence Read Archive under accession number SRP050083, and genome sequences have been deposited in NCBI BioProject under accession number PRJNA273161 (first versions described here). Genomes are also available through ggKbase: http://ggkbase.berkeley.edu/CPR-complete-draft/organisms. ggKbase is a ‘live data’ site, thus annotations and genomes may be improved after publication.

Change history

29 January 2016
Extended Data Table 1 was corrected on 25 January 2016

References

Harris, J. K., Kelley, S. T. & Pace, N. R. New perspective on uncultured bacterial phylogenetic division OP11. Appl. Environ. Microbiol. 70, 845–849 (2004).
Article CAS PubMed PubMed Central Google Scholar
Wrighton, K. C. et al. Fermentation, hydrogen, and sulfur metabolism in multiple uncultivated bacterial phyla. Science 337, 1661–1665 (2012).
Article ADS CAS PubMed Google Scholar
Kantor, R. S. et al. Small genomes and sparse metabolisms of sediment-associated bacteria from four candidate phyla. MBio 4, e00708–e00713 (2013).
Article CAS PubMed PubMed Central Google Scholar
Wrighton, K. C. et al. Metabolic interdependencies between phylogenetically novel fermenters and respiratory organisms in an unconfined aquifer. ISME J. 8, 1452–1463 (2014).
Article CAS PubMed PubMed Central Google Scholar
Rinke, C. et al. Insights into the phylogeny and coding potential of microbial dark matter. Nature 499, 431–437 (2013).
Article ADS CAS PubMed Google Scholar
Albertsen, M. et al. Genome sequences of rare, uncultured bacteria obtained by differential coverage binning of multiple metagenomes. Nature Biotechnol. 31, 533–538 (2013).
Article CAS Google Scholar
Castelle, C. J. et al. Genomic expansion of domain archaea highlights roles for organisms from new phyla in anaerobic carbon cycling. Curr. Biol. 25, 690–701 (2015).
Article CAS PubMed Google Scholar
Luef, B. et al. Diverse, uncultivated ultra-small bacterial cells in groundwater. Nature Commun. 6, 6372 (2015).
Article ADS CAS Google Scholar
Burt, A. & Koufopanou, V. Homing endonuclease genes: the rise and fall and rise again of a selfish element. Curr. Opin. Genet. Dev. 14, 609–615 (2004).
Article CAS PubMed Google Scholar
Salman, V., Amann, R., Shub, D. A. & Schulz-Vogt, H. N. Multiple self-splicing introns in the 16S rRNA genes of giant sulfur bacteria. Proc. Natl Acad. Sci. USA 109, 4203–4208 (2012).
Article ADS PubMed PubMed Central Google Scholar
Quast, C. et al. The SILVA ribosomal RNA gene database project: improved data processing and web-based tools. Nucleic Acids Res. 41, D590–D596 (2013).
Article CAS PubMed Google Scholar
Evguenieva-Hackenberg, E. Bacterial ribosomal RNA in pieces. Mol. Microbiol. 57, 318–325 (2005).
Article CAS PubMed Google Scholar
Raghavan, R., Hicks, L. D. & Minnick, M. F. Toxic introns and parasitic intein in Coxiella burnetii: legacies of a promiscuous past. J. Bacteriol. 190, 5934–5943 (2008).
Article CAS PubMed PubMed Central Google Scholar
Baker, B. J., Hugenholtz, P., Dawson, S. C. & Banfield, J. F. Extremely acidophilic protists from acid mine drainage host Rickettsiales-lineage endosymbionts that have intervening sequences in their 16S rRNA genes. Appl. Environ. Microbiol. 69, 5512–5518 (2003).
Article CAS PubMed PubMed Central Google Scholar
Gong, J., Qing, Y., Guo, X. & Warren, A. ‘Candidatus Sonnebornia yantaiensis’, a member of candidate division OD1, as intracellular bacteria of the ciliated protist Paramecium bursaria (Ciliophora, Oligohymenophorea). Syst. Appl. Microbiol. 37, 35–41 (2014).
Article CAS PubMed Google Scholar
Caporaso, J. G. et al. Ultra-high-throughput microbial community analysis on the Illumina HiSeq and MiSeq platforms. ISME J. 6, 1621–1624 (2012).
Article CAS PubMed PubMed Central Google Scholar
Nawrocki, E. P. in Structural RNA Homology Search and Alignment using Covariance Models (ed. Eddy, S. R. et al.) (Washington Univ. in Saint Louis, 2009).
Google Scholar
Baker, B. J. & Dick, G. J. Omic approaches in microbial ecology: charting the unknown. Microbe 8, 353–360 (2013).
Google Scholar
Yarza, P. et al. Uniting the classification of cultured and uncultured bacteria and archaea using 16S rRNA gene sequences. Nature Rev. Microbiol. 12, 635–645 (2014).
Article CAS Google Scholar
Akanuma, G. et al. Inactivation of ribosomal protein genes in Bacillus subtilis reveals importance of each ribosomal protein for cell proliferation and cell differentiation. J. Bacteriol. 194, 6282–6291 (2012).
Article CAS PubMed PubMed Central Google Scholar
Lecompte, O. Comparative analysis of ribosomal proteins in complete genomes: an example of reductive evolution at the domain scale. Nucleic Acids Res. 30, 5382–5390 (2002).
Article CAS PubMed PubMed Central Google Scholar
Lagkouvardos, I., Jehl, M.-A., Rattei, T. & Horn, M. Signature protein of the PVC superphylum. Appl. Environ. Microbiol. 80, 440–445 (2014).
Article CAS PubMed PubMed Central Google Scholar
Yutin, N., Puigbò, P., Koonin, E. V. & Wolf, Y. I. Phylogenomics of prokaryotic ribosomal proteins. PLoS ONE 7, e36972 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Nowotny, V. & Nierhaus, K. H. Initiator proteins for the assembly of the 50S subunit from Escherichia coli ribosomes. Proc. Natl Acad. Sci. USA 79, 7238–7242 (1982).
Article ADS CAS PubMed PubMed Central Google Scholar
Atkins, J. F. & Björk, G. R. A gripping tale of ribosomal frameshifting: extragenic suppressors of frameshift mutations spotlight P-site realignment. Microbiol. Mol. Biol. Rev. 73, 178–210 (2009).
Article CAS PubMed PubMed Central Google Scholar
Schuwirth, B. S. Structures of the bacterial ribosome at 3.5 Å resolution. Science 310, 827–834 (2005).
Article ADS CAS PubMed Google Scholar
Nevskaya, N. Ribosomal protein L1 recognizes the same specific structural motif in its target sites on the autoregulatory mRNA and 23S rRNA. Nucleic Acids Res. 33, 478–485 (2005).
Article CAS PubMed PubMed Central Google Scholar
Shajani, Z., Sykes, M. T. & Williamson, J. R. Assembly of bacterial ribosomes. Annu. Rev. Biochem. 80, 501–526 (2011).
Article CAS PubMed Google Scholar
Luef, B. et al. Iron-reducing bacteria accumulate ferric oxyhydroxide nanoparticle aggregates that may support planktonic growth. ISME J. 7, 338–350 (2013).
Article CAS PubMed Google Scholar
Williams, K. H. et al. Acetate availability and its influence on sustainable bioremediation of uranium-contaminated groundwater. Geomicrobiol. J. 28, 519–539 (2011).
Article CAS Google Scholar
Peng, Y., Leung, H. C. M., Yiu, S. M. & Chin, F. Y. L. IDBA-UD: a de novo assembler for single-cell and metagenomic sequencing data with highly uneven depth. Bioinformatics 28, 1420–1428 (2012).
Article CAS PubMed Google Scholar
Langmead, B. & Salzberg, S. L. Fast gapped-read alignment with Bowtie 2. Nature Methods 9, 357–359 (2012).
Article CAS PubMed PubMed Central Google Scholar
Hyatt, D. et al. Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC Bioinformatics 11, 119 (2010).
Article CAS PubMed PubMed Central Google Scholar
Edgar, R. C. Search and clustering orders of magnitude faster than BLAST. Bioinformatics 26, 2460–2461 (2010).
Article CAS PubMed Google Scholar
Suzek, B. E., Huang, H., McGarvey, P., Mazumder, R. & Wu, C. H. UniRef: comprehensive and non-redundant UniProt reference clusters. Bioinformatics 23, 1282–1288 (2007).
Article CAS PubMed Google Scholar
Kanehisa, M., Goto, S., Sato, Y., Furumichi, M. & Tanabe, M. KEGG for integration and interpretation of large-scale molecular data sets. Nucleic Acids Res. 40, D109–D114 (2012).
Article CAS PubMed Google Scholar
Kanehisa, M. & Goto, S. KEGG: Kyoto Encyclopedia of Genes and Genomes. Nucleic Acids Res. 28, 27 (2000).
Article CAS PubMed PubMed Central Google Scholar
Hug, L. A. et al. Community genomic analyses constrain the distribution of metabolic traits across the Chloroflexi phylum and indicate roles in sediment carbon cycling. Microbiome 1, 22 (2013).
Article PubMed PubMed Central Google Scholar
Castelle, C. J. et al. Extraordinary phylogenetic diversity and metabolic versatility in aquifer sediment. Nature Commun. 4, 2120 (2013).
Article ADS Google Scholar
Dick, G. J. et al. Community-wide analysis of microbial genome sequence signatures. Genome Biol. 10, R85 (2009).
Article CAS PubMed PubMed Central Google Scholar
Raes, J., Korbel, J. O., Lercher, M. J., von Mering, C. & Bork, P. Prediction of effective genome size in metagenomic samples. Genome Biol. 8, R10 (2007).
Article CAS PubMed PubMed Central Google Scholar
Altschul, S. F., Gish, W., Miller, W., Meyers, E. W. & Lipman, D. J. Basic local alignment search tool. J. Mol. Biol. 215, 403–410 (1990).
Article CAS PubMed Google Scholar
McLean, J. S. et al. Candidate phylum TM6 genome recovered from a hospital sink biofilm provides genomic insights into this uncultivated phylum. Proc. Natl Acad. Sci. USA 110, E2390–E2399 (2013).
Article PubMed PubMed Central Google Scholar
Podar, M. et al. Targeted access to the genomes of low-abundance organisms in complex microbial communities. Appl. Environ. Microbiol. 73, 3205–3214 (2007).
Article CAS PubMed PubMed Central Google Scholar
Marcy, Y. et al. Dissecting biological ‘dark matter’ with single-cell genetic analysis of rare and uncultivated TM7 microbes from the human mouth. Proc. Natl Acad. Sci. USA 104, 11889–11894 (2007).
Article ADS CAS PubMed PubMed Central Google Scholar
Nawrocki, E. P., Kolbe, D. L. & Eddy, S. R. Infernal 1.0: inference of RNA alignments. Bioinformatics 25, 1335–1337 (2009).
Article CAS PubMed PubMed Central Google Scholar
Cannone, J. J. et al. The Comparative RNA Web (CRW) Site: an online database of comparative sequence and structure information for ribosomal, intron, and other RNAs. BMC Bioinformatics 3, 2 (2002).
Article PubMed PubMed Central Google Scholar
Burge, S. W. et al. Rfam 11.0: 10 years of RNA families. Nucleic Acids Res. 41, D226–D232 (2013).
Article CAS PubMed Google Scholar
Andronescu, M., Condon, A., Hoos, H. H., Mathews, D. H. & Murphy, K. P. Efficient parameter estimation for RNA secondary structure prediction. Bioinformatics 23, i19–i28 (2007).
Article CAS PubMed Google Scholar
Kearse, M. et al. Geneious Basic: an integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics 28, 1647–1649 (2012).
Article PubMed PubMed Central Google Scholar
Finn, R. D. et al. Pfam: the protein families database. Nucleic Acids Res. 42, D222–D230 (2014).
Article CAS PubMed Google Scholar
Kelley, L. A. & Sternberg, M. J. E. Protein structure prediction on the Web: a case study using the Phyre server. Nature Protocols 4, 363–371 (2009).
Article CAS PubMed Google Scholar
Gilbert, J. A. et al. Meeting report: the terabase metagenomics workshop and the vision of an Earth microbiome project. Stand. Genomic Sci. 3, 243–248 (2010).
Article MathSciNet PubMed PubMed Central Google Scholar
Walters, W. A. et al. PrimerProspector: de novo design and taxonomic analysis of barcoded polymerase chain reaction primers. Bioinformatics 27, 1159–1161 (2011).
Article CAS PubMed PubMed Central Google Scholar
Stamatakis, A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 30, 1312–1313 (2014).
CAS PubMed PubMed Central Google Scholar
Eddy, S. R. Accelerated profile HMM searches. PLOS Comput. Biol. 7, e1002195 (2011).
Article ADS MathSciNet CAS PubMed PubMed Central Google Scholar
Price, M. N., Dehal, P. S. & Arkin, A. P. FastTree 2—approximately maximum-likelihood trees for large alignments. PLoS ONE 5, e9490 (2010).
Article ADS CAS PubMed PubMed Central Google Scholar
Edgar, R. C. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 32, 1792–1797 (2004).
Article CAS PubMed PubMed Central Google Scholar
Abascal, F., Zardoya, R. & Posada, D. ProtTest: selection of best-fit models of protein evolution. Bioinformatics 21, 2104–2105 (2005).
Article CAS PubMed Google Scholar
Huson, D. H. & Scornavacca, C. Dendroscope 3: an interactive tool for rooted phylogenetic trees and networks. Syst. Biol. 61, 1061–1067 (2012).
Article PubMed Google Scholar
Zerbino, D. R. & Birney, E. Velvet: algorithms for de novo short read assembly using de Bruijn graphs. Genome Res. 18, 821–829 (2008).
Article CAS PubMed PubMed Central Google Scholar
Ultsch, A. & Moerchen, F. ESOM-Maps: tools for clustering, visualization, and classification with Emergent SOM. Technical Report no. 46 (Dept. of Mathematics and Computer Science, University of Marburg, Germany, 2005).

Download references

Acknowledgements

We thank J. Cate and S. Moore for input into the ribosomal protein analysis, J. Doudna and E. Nawrocki for suggestions on the rRNA insertion analysis, and M. Markillie and R. Taylor for assistance with RNA sequencing. Research was supported by the US Department of Energy (DOE), Office of Science, Office of Biological and Environmental Research under award number DE-AC02-05CH11231 (Sustainable Systems Scientific Focus Area and DOE-JGI) and award number DE-SC0004918 (Systems Biology Knowledge Base Focus Area). L.A.H. was partially supported by a Natural Sciences and Engineering Research Council postdoctoral fellowship. DNA sequencing was conducted at the DOE Joint Genome Institute, a DOE Office of Science User Facility, via the Community Science Program. RNA sequencing was performed at the DOE-supported Environmental Molecular Sciences Laboratory at Pacific Northwest National Laboratory.

Author information

Authors and Affiliations

Department of Plant and Microbial Biology, University of California, Berkeley, 94720, California, USA
Christopher T. Brown
Department of Earth and Planetary Science, University of California, Berkeley, 94720, California, USA
Laura A. Hug, Brian C. Thomas, Itai Sharon, Cindy J. Castelle, Andrea Singh & Jillian F. Banfield
School of Earth Sciences, The Ohio State University, Columbus, 43210, Ohio, USA
Michael J. Wilkins
Department of Microbiology, The Ohio State University, Columbus, 43210, Ohio, USA
Michael J. Wilkins & Kelly C. Wrighton
Earth Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, 94720, California, USA
Kenneth H. Williams & Jillian F. Banfield
Department of Environmental Science, Policy, and Management, University of California, Berkeley, 94720, California, USA
Jillian F. Banfield

Authors

Christopher T. Brown
View author publications
You can also search for this author in PubMed Google Scholar
Laura A. Hug
View author publications
You can also search for this author in PubMed Google Scholar
Brian C. Thomas
View author publications
You can also search for this author in PubMed Google Scholar
Itai Sharon
View author publications
You can also search for this author in PubMed Google Scholar
Cindy J. Castelle
View author publications
You can also search for this author in PubMed Google Scholar
Andrea Singh
View author publications
You can also search for this author in PubMed Google Scholar
Michael J. Wilkins
View author publications
You can also search for this author in PubMed Google Scholar
Kelly C. Wrighton
View author publications
You can also search for this author in PubMed Google Scholar
Kenneth H. Williams
View author publications
You can also search for this author in PubMed Google Scholar
Jillian F. Banfield
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Samples and geochemical measurements were taken by M.J.W., K.C.W. and K.H.W. B.C.T. assembled the metagenome data. I.S. implemented the ABAWACA algorithm. C.T.B. and J.F.B. binned the data and carried out the ESOM binning validation. J.F.B. closed and curated the complete genomes. C.T.B., L.A.H. and B.C.T. conducted the rRNA gene insertion analysis. C.T.B. and L.A.H. performed phylogenetic analyses. M.J.W. and K.C.W. conducted the RNA sequencing. C.T.B. carried out the 16S rRNA gene copy number, primer binding and transcript analyses. C.T.B. and J.F.B. carried out the ribosomal protein analyses. C.T.B., L.A.H., C.J.C. and J.F.B. conducted the metabolic analysis. A.S. and B.C.T. provided bioinformatics support. C.T.B. and J.F.B. drafted the manuscript. All authors reviewed the results and approved the manuscript.

Corresponding author

Correspondence to Jillian F. Banfield.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Extended data figures and tables

Extended Data Figure 1 Sampling and geochemical measurements from acetate amendment field experiment conducted in aquifer well CD-01 at the Rifle IFRC site.

a, b, Samples were collected for metagenomics and metatranscriptomics at six time points (A–F) spanning several redox transitions during acetate stimulation of groundwater microbial communities. a, Groundwater was pumped from the alluvial aquifer and filtered through serial 1.2, 0.2 and 0.1 μm filters. DNA was extracted and sequenced from both the 0.2 and 0.1 μm filters, and RNA extracted and sequenced from the 0.2 μm filters (aerial image provided by S. M. Stoller for the US DOE under contract DE-AM01-07LM00060). b, Geochemical measurements were taken throughout the time series, showing a transition from dominant iron reduction to sulfate reduction through to methane production in the sampling environment.

Extended Data Figure 2 Validation of 20 draft-quality genomes by ESOM clustering of genome fragments based on tetranucleotide sequence composition.

For validation, 20 draft genomes from a sample with a high proportion of CPR genomes (GWA2) were chosen at random. Each data point represents a 5–10 kb genome fragment. The ESOM was trained for 100 epochs with normalized tetranucleotide frequencies. Dark lines between data points indicate strong separation between regions. Data points are coloured based on the genome the fragment originated from. The ESOM shows well-delineated clusters for most of the 20 draft genomes, with few sequence fragments falling outside of these clusters. Two genomes from the same Microgenomates (OP11) phylum were not well delineated in the tetranucleotide-based ESOM (genomes 18 and 19). This shows how the method we used for binning, which takes into account abundance patterns in addition to sequence signatures, provides more accurate genome reconstructions. The white box distinguishes a single period on the repeating map. Genomes split into multiple clusters are labelled in red.

Extended Data Figure 3 Relative abundance of bacterial community members during acetate amendment.

a, b, Relative abundance was calculated based on stringent mapping of paired-read sequences from each sample to 16S rRNA gene sequences assembled from all samples. Relative abundance of cells from 0.2 μm filters (a) and from 0.1 μm (b) filters. Enrichment of CPR organisms in the 0.2 μm filtrate indicates that these organisms have ultra-small cell sizes.

Extended Data Figure 4 Features of insertion sequences encoded within 16S rRNA genes from the Silva database.

The non-redundant Silva 16S rRNA gene database (v. 115) was analysed to assess the prevalence of insertions. Only 761 of the 418,498 16S rRNA gene sequences from bacteria encode insertions. While many small insertions were identified, unlike the 16S rRNA gene sequences assembled from groundwater, these sequences (1) rarely encode large insertions, (2) do not contain both ORFs and introns, (3) do not encode ORFs that could be assigned to Pfam families, and (4) may be found in one of multiple copies of the 16S rRNA gene.

Extended Data Figure 5 16S rRNA gene copy number estimations for genomes reconstructed from groundwater metagenomics.

a, b, 16S rRNA gene copy number was estimated for all draft CPR genomes and genome bins for organisms outside the CPR. This was achieved by comparing the coverage of 16S rRNA gene regions to the coverage of the rest of the genome. Importantly, coverage was calculated only with stringently mapped reads (no mismatches were allowed) to improve the accuracy of coverage calculations. a, Histogram of the number of 16S rRNA gene sequence copies estimated for each genome by calculating (16S rRNA gene coverage)/(genome coverage). Several WWE3 genomes were estimated to have high 16S rRNA gene copy number (Supplementary Table 7), but it was later determined that these estimates were skewed by the presence of a highly abundant closely related strain. The complete WWE3 genome assembled previously³ has an identical 16S rRNA gene and confirms that it is found in only one copy for this genotype. Thus, we removed these estimates from subsequent copy number analysis. b, Density plot comparing estimated copy number of genomes for organisms found within and outside the CPR, where the longer tail for non-CPR genomes depicts the propensity for multiple 16S rRNA copies, a trait absent from the CPR.

Extended Data Figure 6 Features of insertion sequences encoded within 23S rRNA genes recovered from groundwater-associated bacteria.

Bacteria associated with the CPR encode insertions within their 23S rRNA genes (Supplementary Table 5). These insertions share many features with those identified in 16S rRNA gene sequences from CPR bacteria. Taxonomy was determined by inclusion in a genome with an established phylogeny.

Extended Data Figure 7 Analysis of the ability of PCR primers 515F and 806R to bind to recovered groundwater-associated 16S rRNA gene sequences.

a, b, PrimerProspector was used to assess the ability of primers 515F and 806R to bind a non-redundant set of assembled near-complete 16S rRNA gene sequences (clustered at 97% sequence identity). The percentage of sequences that would be amplified by these primers is shown on the left axis, the total number of sequences analysed is on the top of each bar, and the number of sequences these primers would not bind to is indicated by the shading. Many assembled groundwater-associated 16S rRNA gene sequences would evade amplification by PCR primers 515F and 806R. Results of the analysis are shown at the domain (a) and superphylum or phylum (b) levels.

Extended Data Figure 8 Metabolic potential and ribosomal protein analysis of genomes from CPR and TM6 organisms.

Assembled genomes were analysed using ggKbase (Supplementary Data 4). Shown here is a non-redundant set of complete and near-complete genomes (≥75% of single copy genes, ≤1.125 copies) organized based on a subset of a maximum-likelihood 16S rRNA gene phylogeny (Supplementary Fig. 1). CPR organisms have partial tricarboxylic acid (TCA) cycles and lack electron transport chain (ETC) complexes. In addition, they have incomplete biosynthetic pathways for nucleotides and amino acids. The Peregrinibacteria are a notable exception to some of these limitations. Several Parcubacteria exhibit a complete ubiquinol (cytochrome b_o) oxidase operon, as previously seen in Saccharibacteria³. However, lack of NADH dehydrogenase and other ETC components suggests that this enzyme is involved in oxygen scavenging/detoxification rather than energy production. AA Syn., amino acid synthesis; PP, pentose phosphate pathway.

Extended Data Table 1 Proposed names for CPR phyla based on microbiology lifetime achievement award recipients

Full size table

Supplementary information

Supplementary Information

This file contains a guide to Supplementary Figure 1, Supplementary Tables 1-10 and the Supplementary Data (see separate files). (PDF 129 kb)

Supplementary Figure

This file contains Supplementary Figure 1 (see the Supplementary Information file for details). (PDF 1414 kb)

Supplementary Tables

This file contains Supplementary Tables 1-10 (see the Supplementary Information file for details). (XLSX 3243 kb)

Supplementary Data

This zipped file contains the Supplementary Data (see the Supplementary Information file for details). (ZIP 14107 kb)

PowerPoint slides

PowerPoint slide for Fig. 1

PowerPoint slide for Fig. 2

PowerPoint slide for Fig. 3

Rights and permissions

Reprints and permissions

About this article

Cite this article

Brown, C., Hug, L., Thomas, B. et al. Unusual biology across a group comprising more than 15% of domain Bacteria. Nature 523, 208–211 (2015). https://doi.org/10.1038/nature14486

Download citation

Received: 17 December 2014
Accepted: 10 April 2015
Published: 15 June 2015
Issue Date: 09 July 2015
DOI: https://doi.org/10.1038/nature14486

This article is cited by

Unraveling the phylogenomic diversity of Methanomassiliicoccales and implications for mitigating ruminant methane emissions
- Fei Xie
- Shengwei Zhao
- Shengyong Mao
Genome Biology (2024)
COBRA improves the completeness and contiguity of viral genomes assembled from metagenomes
- LinXing Chen
- Jillian F. Banfield
Nature Microbiology (2024)
Microbial decomposition of biodegradable plastics on the deep-sea floor
- Taku Omura
- Noriyuki Isobe
- Tadahisa Iwata
Nature Communications (2024)
The oral microbiome: diversity, biogeography and human health
- Jonathon L. Baker
- Jessica L. Mark Welch
- Xuesong He
Nature Reviews Microbiology (2024)
Biodiversity patterns of cyanobacterial oligotypes in lakes and rivers: results of a large-scale metabarcoding survey in the Alpine region
- Nico Salmaso
- Serena Bernabei
- Rainer Kurmayer
Hydrobiologia (2024)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.