The genome sequence of the thermoacidophilic scavenger Thermoplasma acidophilum

Ruepp, Andreas; Graml, Werner; Santos-Martinez, Martha-Leticia; Koretke, Kristin K.; Volker, Craig; Mewes, H. Werner; Frishman, Dmitrij; Stocker, Susanne; Lupas, Andrei N.; Baumeister, Wolfgang

doi:10.1038/35035069

Letter
Published: 28 September 2000

The genome sequence of the thermoacidophilic scavenger Thermoplasma acidophilum

Andreas Ruepp¹,
Werner Graml¹,
Martha-Leticia Santos-Martinez¹,
Kristin K. Koretke²,
Craig Volker²,
H. Werner Mewes³,
Dmitrij Frishman³,
Susanne Stocker³,
Andrei N. Lupas² &
…
Wolfgang Baumeister¹

Nature volume 407, pages 508–513 (2000)Cite this article

5779 Accesses
335 Citations
9 Altmetric
Metrics details

Abstract

Thermoplasma acidophilum is a thermoacidophilic archaeon that thrives at 59 °C and pH 2, which was isolated from self-heating coal refuse piles and solfatara fields^1,2. Species of the genus Thermoplasma do not possess a rigid cell wall, but are only delimited by a plasma membrane. Many macromolecular assemblies from Thermoplasma , primarily proteases and chaperones, have been pivotal in elucidating the structure and function of their more complex eukaryotic homologues^3,4. Our interest in protein folding and degradation led us to seek a more complete representation of the proteins involved in these pathways by determining the genome sequence of the organism. Here we have sequenced the 1,564,905-base-pair genome in just 7,855 sequencing reactions by using a new strategy. The 1,509 open reading frames identify Thermoplasma as a typical euryarchaeon with a substantial complement of bacteria-related genes; however, evidence indicates that there has been much lateral gene transfer between Thermoplasma and Sulfolobus solfataricus, a phylogenetically distant crenarchaeon inhabiting the same environment. At least 252 open reading frames, including a complete protein degradation pathway and various transport proteins, resemble Sulfolobus proteins most closely.

You have full access to this article via your institution.

Download PDF

Recovery of Lutacidiplasmatales archaeal order genomes suggests convergent evolution in Thermoplasmatota

Article Open access 15 July 2022

Genomic evolution of the class Acidithiobacillia: deep-branching Proteobacteria living in extreme acidic conditions

Article Open access 18 May 2021

The metabolic network of the last bacterial common ancestor

Article Open access 26 March 2021

Main

Two basic approaches have been taken to genome sequencing. The statistical approach (‘shotgun sequencing’) relies on the determination of a highly redundant set of random genomic DNA sequences, which are assembled in the computer, the gaps remaining to be closed by other methods⁵. This approach rapidly yields 90–98% of the genomic information, but requires an extensive robotic infrastructure. The directed approach relies on a complete mapping of the genome, followed by the sequencing of large overlapping genomic fragments by shotgun sequencing or primer walking⁶. This approach reduces the infrastructure requirements but is slowed down by the need for a genetic map.

One of our aims was to establish a strategy for sequencing microbial genomes in reasonable time without extensive infrastructure. This strategy (‘shotgun primer walking’) combines features of the statistical and directed methods. After construction of several phagemid libraries and one cosmid library, inserts from randomly chosen clones were sequenced from the ends by primer walking (see Supplementary Information). Sequencing was stopped in regions that were redundant with already determined sequences. In total, 400 phagemids, covering 850 kilobase-pairs (kbp) (54%), and 469 cosmids, covering 1,533 kbp (98%), were partially or fully sequenced. With an average insert length of 40 kbp, the clones of the cosmid library statistically covered the genome 12 times. Because cosmids are susceptible to recombination events, the reverse strand of cosmid DNA was always sequenced with DNA templates from other cosmid clones covering the same region, or from polymerase chain reaction (PCR) fragments generated from genomic DNA. As the project advanced, fragments were assembled into progressively larger contigs, until 4 gaps of 33 kbp in total were left. These were closed by four PCR fragments, which were sequenced by primer walking. By this method, we completed the 1.56 Mbp genome of T. acidophilum with 7,855 sequencing reactions. For comparison, shotgun sequencing of the 1.86 Mbp Thermotoga maritima genome, which has the same G+C content, required over 30,000 sequencing reactions.

During the annotation process, we detected ten open reading frames (ORFs) with putative frameshifts. The corresponding genomic regions were amplified by PCR and resequenced with the primers obtained during primer walking. Four of the frameshifts were confirmed, and six had to be corrected. All errors occurred in phagemids, which, unlike cosmids, were sequenced on both strands but without using template DNA from other sources. This suggests that in genome projects operating with low sequence coverage the two strands should be sequenced with template DNA from different sources. It also shows, however, that high-quality data can be obtained with only twofold sequence coverage.

Beyond the advantages discussed here, shotgun primer walking generates an extensive primer set, covering the whole genome. This represents a powerful tool, which can be used to determine rapidly from PCR fragments differences in related DNA sequences, for example between strains of pathogenic bacteria or between individuals in human populations.

The genome of T. acidophilum DSM 1728, one of the smallest among free-living organisms, consists of a single circular chromosome of 1,564,905 bp (Table 1). We did not detect any plasmids, either by biochemical methods or through DNA sequencing. A single plasmid of 15.2 kbp has previously been reported in other isolates of T. acidophilum⁷. The origin of replication was estimated using the cumulative skew of the orientation of ORFs as well as different nucleotide skews⁸. The centre of a 477-bp intergenic region in this area was chosen as nucleotide 1.

Table 1 Genome features

Full size table

Thermoplasma contains one copy of each of the ribosomal RNAs, which are dispersed in the genome⁹ and separated by at least 52 kbp. This is different from all other known archaea, where at least two of the three genes are contiguous on the chromosome.

We identified 1,509 ORFs in the genome (see Supplementary Information), of which over one-third have homologues in all three domains of life (see Supplementary Information). After manual curation, 823 ORFs (55%) were considered similar to functionally annotated proteins, 446 (29%) resembled hypothetical ORFs in other organisms, and 240 (16%) were not recognizably similar to other sequences in current databases (‘singletons’). In total, we could assign 685 ORFs (46%) to categories of the MIPS Functional Catalogue (see Supplementary Information ). The only category that deviates significantly from other archaea is transport facilitation, which occupies a larger proportion of the genome than in any other archaeon, but is not exceptional when compared with some bacteria (Thermotoga, Deinococcus). By conservative search methods, we could match 620 domains of 537 ORFs (35.6%) to proteins of known structure (Table 1; Supplementary Information ). The distribution of structural classes resembles that of other prokaryotic genomes, but shows a striking overrepresentation of three-layer (αβα) sandwich folds (43% of all matches), which primarily encompass enzymes.

About one-third of the ORFs (484; 32%) occurred in 139 conserved gene clusters, as judged relative to a set of 7 bacterial and 6 archaeal genomes (see Methods). Twenty-two of these clusters occurred only in Thermoplasma and bacteria and are presumably due to lateral transfer. The clusters occur in ‘hot spots’ on the chromosome (Fig. 1), suggesting that entire gene regions may have been acquired in discrete genetic events. The largest detected clusters encode ribosomal proteins (Ta1249–Ta1271), NADH dehydrogenase subunits (Ta0959–Ta0970), precorrin biosynthesis proteins (Ta0653–Ta0660) and flagellar proteins (Ta0553–Ta0560).

**Figure 1: Chromosomal distribution of *Thermoplasma* ORFs most similar to *Sulfolobus* (top panel), and of ORFs found in conserved gene clusters (bottom panel).**

In the thermoacidophiles Thermoplasma and Sulfolobus, glucose degradation proceeds by a non-phosphorylated variant of the Entner–Doudoroff pathway¹⁰ (see Supplementary Information), in which the first step is catalysed by glucose dehydrogenase. The acetyl-CoA produced in this pathway enters the oxidative tricarboxylic acid (TCA) cycle. A complete set of TCA enzymes is recognizable in Thermoplasma, many of which have been studied experimentally. In the glycolysis/gluconeogenesis pathway (Embden–Meyerhof–Parnas; EMP), homologues are recognizable for most enzymes, but not for phosphofructokinase and fructose bisphosphate aldolase, so the presence of a working EMP pathway¹¹ cannot be confirmed.

Thermoplasma is microaerophilic and contains several respiratory chain proteins, such as electron transfer flavoproteins (Ta0429, Ta0329 and Ta0212) and cytochrome b homologues (Ta1222, Ta1228 and Ta1003). Two of the latter are found in gene clusters with Rieske iron–sulphur proteins (Ta1223 and Ta1229). In addition, we identified two homologues of a cytochrome bd quinol oxidase¹² (Ta1484 and Ta0992).

Thermoplasma is also able to gain energy anaerobically by sulphur respiration². Unexpectedly, no homologues of the genes that mediate sulphur respiration in Archaeoglobus fulgidus¹³ were found. Instead, Thermoplasma contains homologues of AsrA (Ta0046) and AsrB (Ta0047), which mediate dissimilatory sulphur reduction in Salmonella typhimurium¹⁴. A sulphide-quinone reductase homologue (Ta1129) may also be involved in sulphur respiration.

Most studied archaea are motile and previous work suggested that they use a chemotaxis signal transduction pathway resembling that of bacteria¹⁵. However, Thermoplasma is flagellated and motile², yet lacks recognizable chemotaxis proteins. This situation is also found in other archaea (Methanococcus jannaschii, Aeropyrum pernix) and bacteria (Aquifex aeolicus), suggesting the presence of an alternative, unrelated signal transduction pathway. Another puzzle concerning motility is the lack of a cell wall, which anchors the flagella in prokaryotes. It is unclear what structure could function as the stator for flagellar rotation in Thermoplasma.

Two chaperones of Thermoplasma have been studied experimentally, the thermosome³ and the protein ‘unfoldase’ VAT^16,17. Additional chaperones recognizable in the genome ( Table 2) include three Hsp20-like proteins (Ta0437, Ta0471 and Ta0864) and a complete Hsp60 system, consisting of two thermosome subunits (α, Ta0980; and β, Ta1276) and two prefoldin/Gim subunits (α, Ta1076; and β, Ta1137). Unlike many archaea, Thermoplasma also contains an Hsp70 system (DnaK, Ta1087; DnaJ, Ta1088; and GrpE, Ta1086), whose genes are clustered on the chromosome in contrast to the endogenous Hsp60 genes.

Table 2 The chaperone complement of T. acidophilum

Full size table

Thermoplasma contains several ‘unfoldases’ (AAA+ proteins)¹⁷, which are frequently associated with proteolytic degradation¹⁸. These are the Cdc48 homologues VAT (Ta0840) and VAT-2 (Ta1175), an archaeal-type Lon protease (Ta1081) and a Lon-related ATPase lacking the protease domain (Ta0098). Surprisingly, Thermoplasma is missing the proteasomal AAA ATPase (PAN; ref. 19), which has hitherto been found in all archaea. It is conceivable that Ta0098, VAT-2 or VAT, which has protein unfolding activity¹⁷, can substitute for this function.

Other chaperones include two protein disulphide isomerases (Ta0125 and Ta0866), a peptidyl-prolyl isomerase of the FKBP family (Ta1011) and two homologues of cold-shock protein CsaA (Ta1284, Ta1046). Ta1284 is most similar to the carboxy-terminal domain of archaeal methionyl-tRNA synthetases (MetRS); this domain is missing from Thermoplasma MetRS. Overall, a surprising number of chaperones from Thermoplasma appear to have been acquired from bacteria (Table 2).

The pathway for polypeptide degradation in Thermoplasma is well understood²⁰. Unfolded proteins are cleaved into fragments of 6–15 residues by the proteasome (α, Ta1288; and β, Ta0612), whose structure and activity have been extensively studied⁴. These fragments are further degraded by the tricorn peptidase (Ta1490) to di-, tri- and tetrapeptides, which are hydrolysed to free amino acids by three tricorn-interacting factors F1 (Ta0830), F2 (Ta0301) and F3 (Ta0815). It is, however, unclear how proteins are targeted to this degradation pathway, because ubiquitin, whose existence in T. acidophilum had been proposed on the basis of peptide sequences²¹, is not present in this genome (or indeed in any other archaeon). It is also unclear how proteins are unfolded before degradation by the proteasome, as Thermoplasma is missing a proteasomal AAA ATPase.

In addition to the 6 proteins involved in this degradation pathway, Thermoplasma contains at least 23 putative proteases ( Table 3, Fig. 2), as well as several α/β hydrolases of unknown specificity and a broad-spectrum deacetylase with potential glutamate carboxypeptidase activity (Ta0934). Few proteases seem to be of bacterial origin; however, nine are most similar to proteins of the phylogenetically distant crenarchaeon Sulfolobus solfataricus (http://niji.imb.nrc.ca/sulfhome/results.html). These proteins are tricorn, its interacting factors and five extracellular, multidomain acid proteases (Ta1403, Ta0167, Ta0376, Ta0741 and Ta0976), three of which are membrane-anchored by C-terminal transmembrane sequences. It is unclear how Thermoplasma retains unanchored secreted proteins without a cell wall, but we note that the two secreted proteases and one of the anchored ones (Ta0167) contain a domain conserved in many archaeal S-layer proteins, which may serve as a scaffold.

Table 3 The protease complement of T. acidophilum

Full size table

**Figure 2: Overview of the main proteolytic pathways in *Thermoplasma*.**

Overall, ten Thermoplasma proteases are probably extracellular. These include two membrane-anchored signal peptidase I homologues (Ta0151 and Ta0378), which are closely related to their eukaryotic homologues and may yield useful model systems for the more elaborate eukaryotic complex, a membrane-anchored serine peptidase of the S49 family (Ta1083), a probable O-sialoglycoprotein endopeptidase (Ta0324) and a metalloprotease of the M6 family (Ta0728). The two latter proteins are probably secreted, on the basis of experimental findings from other organisms. Like their homologues²², however, they lack recognizable signal sequences. In total, 11 Thermoplasma proteases are membrane-associated, including the archaeal Lon protein, which is anchored to the membrane from the cytoplasmic side by a helical hairpin, a structure present in all archaea but missing in bacterial proteins. As the membrane-bound protease FtsH, which is essential in bacteria, is absent from archaea, its function may be assumed by the archaeal Lon protease.

As discussed, Thermoplasma surprisingly lacks ubiquitin, a proteasomal ATPase and an archaeal-type sulphur respiration system. We searched systematically for further differences to other archaeal genomes using clusters of orthologous groups (COGs; http://www.ncbi.nlm.nih.gov/COG). The first four published archaeal genomes share a ‘stable core’ of 543 COGs; thirty-six of these appear to be missing in Thermoplasma (http://www.biochem.mpg.de/baumeister/genome). The most notable are DNA topoisomerase VI and eukaryal-like histones. Topoisomerase VI (ref. 23) is a divergent type II topoisomerase present only in archaea. In its place, Thermoplasma contains a bacterial-type DNA gyrase (GyrA, Ta1054; and GyrB, Ta1055).

Thermoplasma also contains proteins not present in any other archaeal genome (68, after elimination of singletons; http://www.biochem.mpg.de/baumeister/genome). Most noteworthy was Hta (Ta0093), the first archaeal DNA-binding protein to be investigated²⁴, which is closely related to bacterial proteins and appears to substitute functionally for the missing histones. Other proteins include an egghead homologue (Ta0213), a membrane-anchored protein kinase (Ta0488) and a Ras-like GTPase (Ta1192).

In the past, a Thermoplasma-like organism has been debated as a possible ancestor of the eukaryotic cytoplasm. The genome sequence, however, shows that Thermoplasma is a typical archaeon with a fairly large protein complement of bacterial origin. We could not identify any of the proteins peculiar to eukaryotes, such as subunits of the nuclear pore complex, the exocyst or the cytoskeleton. However, Thermoplasma has been reported to contain a cytoskeleton²⁵, and we identified in genome searches a cytosolic coiled-coil protein (Ta1488), which is conserved in all archaea and seems analogous to intermediate filament proteins.

During the comparison to other archaea we noted that 252 Thermoplasma ORFs (17%) were most similar to proteins of S. solfataricus, a higher number than for any other organism in our reference set—even though this genome sequence is still incomplete. Of these, 60 appear to function in transport and are frequently clustered (for example, Ta1325–Ta1329, Ta0261–Ta0275 and Ta0126–Ta0146). Many others are involved in metabolism (including energy metabolism) or in proteolysis, or are secreted or membrane-bound ORFs of unknown function. With a few notable exceptions, such as DNA polymerase (Ta0450) or Rad50 (Ta0157), hardly any are involved in information processes (replication, transcription and translation). Although it has been observed previously that in prokaryotes core information processing generally tracks organism phylogeny, whereas metabolism is strongly affected by lateral transfer, it is highly unusual to see such a large number of genes transferred between two phylogenetically distant organisms. We propose that the adaptation to an extreme environment shared by few other organisms has led to substantial genetic exchange. This may have proceeded by only a few large genetic events, resulting in the skewed distribution of Sulfolobus -like genes in the Thermoplasma genome (Fig. 1).

Thermoplasma inhabits a hot and highly acidic environment, sometimes as low as pH 0.5, in which few organisms are viable. It has adapted to scavenging nutrients from the decomposition of organisms killed by the extreme acidity and requires yeast, bacterial or meat extract when grown in culture. There is an absolute growth requirement for basic oligopeptides²⁶ and analysis of the genome shows the presence of a complete degradation chain for exogenous polypeptides, starting with a set of large extracellular proteases, over two oligopeptide transport systems, to a cytosolic degradation machine consisting of tricorn and its cofactors (Fig. 2). All proteins of this degradation chain are most similar to cognate proteins of Sulfolobus, as are many other proteins involved in transport and metabolism. Thus, it would seem that two classes of genes can be distinguished in Thermoplasma: One is primarily composed of constitutive, ‘housekeeping’ genes, which generally reflect the phylogenetic origin of the organism. The other mostly contains ‘life-style’ genes, including but not limited to metabolism, which are tailored to a specific environment and are shared between phylogenetically distant organisms within one ecological niche.

Methods

Genome sequencing analysis methods

Analysis methods are given in the Supplementary Information.

Annotation

In a first approximation, gene prediction was performed automatically by the Orpheus software²⁷ allowing for ORFs longer than 150 bases and for overlaps between ORFs no longer than 30% of the length of the shorter overlapping ORF. As an additional precaution we modified the algorithm to keep all ORFs longer than 450 bases as part of the preliminary ORF set destined for manual inspection. Selection of putative gene starts was assisted by ribosome-binding-site detection; however, the information content of these sites in T. acidophilum proved to be insufficient for improving prediction results. The predicted ORF complement was then manually refined on the basis of extrinsic evidence.

The main vehicle for automatic and manual annotation of gene products was the PEDANT software suite²⁸. Complete annotation of the genome, including DNA and protein viewers, extensive protein reports, multiple functional and structural categories and search capabilities is available at http://pedant.mips.biochem.mpg.de.

Global comparisons of gene complement, searches for conserved gene clusters, and in-depth annotation of protein families were performed in the SmithKline Beecham AiBi Toolkit, a suite of software tools running on top of a relational database containing 68 partial and complete genomes of archaeal (8), bacterial (55) and eukaryotic origin (5), in addition to the non-redundant public sequence database from NCBI (http://www.ncbi.nlm.nih.gov). Sequence comparisons were made using BLAST2 and PSI-BLAST²⁹, with low-complexity regions (including coiled coils and transmembrane regions) masked out and a significance threshold of e^-3. Conserved gene clusters were identified by comparing pairs of ORFs separated by at most three ORFs between Thermoplasma and a reference set of six archaeal and seven bacterial genomes, chosen to broadly represent archaea and deep-branching bacteria. The archaeal genomes were Methanococcus jannaschii, Methanobacterium thermoautotrophicum , Archaeoglobus fulgidus, Pyrococcus abyssi, Pyrococcus furiosus and Aeropyrum pernix; and the bacterial genomes were Escherichia coli, Bacillus subtilis, Clostridium acetobutylicum , Thermotoga maritima, Aquifex aeolicus, Deinococcus radiodurans and Synechocystis sp. For opening a cluster, both members of the ORF pair had to have BLAST2 matches in chromosomal vicinity at e^-10 in at least one of the reference genomes. Clusters were then extended by relaxing the significance cutoff to e^-3 for proteins satisfying the distance cutoff. Finally, clusters sharing at least one ORF were merged.

Protein fold predictions were made using the PSI-Blast-based search routine SENSER³⁰, and results were compared against the CATH classification (see Supplementary Information). We report only the PSI-Blast results because, with the chosen settings, they have an estimated error rate of less than 5%. However, SENSER returned predictions for an additional 152 proteins with stringent settings, and 205 proteins with relaxed settings, for a total of 727 proteins (48% of the ORF complement; http://www.biochem.mpg.de/baumeister/genome). Coiled coils were predicted with the COILS program (http://www.ch.embnet.org/software/COILS_form.html), and signal sequences with SignalP (http://www.cbs.dtu.dk/services/SignalP/) and PHYSEAN.

References

Darland, G., Brock, T. D., Samsonoff, W. & Conti, S. F. A thermophilic acidophilic mycoplasm isolated from a coal refuse pile. Science 170, 1416–1418 ( 1970).
Article ADS CAS Google Scholar
Segerer, A. & Stetter, K. O. in The Prokaryotes (eds Balows, A., Trüper, H. G., Dworkin, M., Harder, W. & Schleifer, K. H.) 712–718 (Springer, New York, 1992).
Google Scholar
Gutsche, I., Essen, L. O. & Baumeister, W. Group II chaperonins: New TRiC(k)s and Turns of a Protein Folding Machine. J. Mol. Biol. 293, 295– 312 (1999).
Article CAS Google Scholar
Voges, D., Zwickl, P. & Baumeister, W. The 26S proteasome: A molecular machine designed for controlled proteolysis. Annu. Rev. Biochem. 68, 1015–1068 (1999).
Article CAS Google Scholar
Fleischmann, R. D. et al. Whole-genome random sequencing and assembly of Haemophilus influenzae Rd. Science 269, 496– 512 (1995).
Article ADS CAS Google Scholar
Himmelreich, R. et al. Complete sequence analysis of the genome of the bacterium Mycoplasma pneumoniae. Nucleic Acids Res. 24, 4420–4449 (1996).
Article CAS Google Scholar
Yasuda, M., Yamagishi, A. & Oshima, T. The plasmids found in isolates of the acidothermophilic archaebacterium Thermoplasma acidophilum. FEMS Microbiol. Lett. 128, 157–161 ( 1995).
Article CAS Google Scholar
Lopez, P., Philippe, H., Myllykallio, H. & Forterre, P. Identification of putative chromosomal origins of replication in Archaea. Mol. Microbiol. 32, 881– 891 (1999).
Article Google Scholar
Ree, H. K. R. & Zimmermann, R. A. Organization and expression of the 16S, 23S and 5S ribosomal RNA genes from the archaebacterium Thermoplasma acidophilum. Nucleic Acids Res. 18, 4471–4478 (1990).
Article CAS Google Scholar
Budgen, N. & Danson, M. J. Metabolism of glucose via a modified Entner–Doudoroff pathway in the thermoacidophilic archaebacterium Thermoplasma acidophilum. FEBS Lett. 196, 207–210 (1986).
Article CAS Google Scholar
Searcy, D. G. & Whatley, F. R. Thermoplasma acidophilum: Glucose degradative pathways and respiratory activities. Syst. Appl. Microbiol. 5, 30–40 (1984).
Article CAS Google Scholar
Luebben, M. Cytochromes of archaeal electron transfer chains. Biochim. Biophys. Acta 1229, 1–22 ( 1995).
Article Google Scholar
Klenk, H. P. et al. The complete genome sequence of the hyperthermophilic, sulphate-reducing archaeon Archaeoglobus fulgidus. Nature 390, 364–370 (1997).
Article ADS CAS Google Scholar
Huang, C. J. & Barret, E. L. Sequence Analysis and Expression of the Salmonella typhimurium asr Operon Encoding Production of Hydrogen Sulfide from Sulfite. J. Bacteriol. 173, 1544–1553 (1991).
Article CAS Google Scholar
Rudolph, J. & Oesterhelt, D. Chemotaxis and phototaxis require a CheA histidine kinase in the archaeon Halobacterium salinarium. EMBO J. 14, 667–673 ( 1995).
Article CAS Google Scholar
Coles, M . et al. The solution structure of VAT-N reveals a ‘missing link’ in the evolution of complex enzymes from a simple beta alpha beta beta element. Curr. Biol. 9, 1158–1168 (1999).
Article CAS Google Scholar
Golbik, R., Lupas, A. N., Koretke, K. K., Baumeister, W. & Peters, J. The janus face of the archaeal Cdc48/p97 homologue VAT: Protein folding versus unfolding. Biol. Chem. 380, 1049–1062 (1999).
Article CAS Google Scholar
Lupas, A., Flanagan, J. M., Tamura, T. & Baumeister, W. Self-compartmentalizing proteases. Trends Biochem. Sci. 22, 399–404 (1997).
Article CAS Google Scholar
Zwickl, P., Ng, D., Woo, K. M., Klenk, H. P. & Goldberg, A. L. An archaebacterial ATPase, homologous to ATPases in the eukaryotic 26 S proteasome, activates protein breakdown by 20 S proteasomes. J. Biol. Chem. 274, 26008– 26014 (1999).
Article CAS Google Scholar
Tamura, N., Lottspeich, F., Baumeister, W. & Tamura, T. The role of Tricorn protease and its aminopeptidase-interacting factors in cellular protein degradation. Cell 95, 637 –648 (1998).
Article CAS Google Scholar
Wolf, S., Lottspeich, F. & Baumeister, W. Ubiquitin found in the archaebacterium Thermoplasma acidophilum. FEBS Lett. 326, 42– 44 (1993).
Article CAS Google Scholar
Barrett, A. J., Rawlings, N. D. & Woessner, J. F. Handbook of proteolytic enzymes (Academic, San Diego, CA, 1999).
Google Scholar
Bergerat, A. et al. An atypical topoisomerase II from archaea with implications for meiotic recombination. Nature 386, 414 –417 (1997).
Article ADS CAS Google Scholar
Stein, D. B. & Searcy, D. G. Physiologically important stabilization of DNA by a prokaryotic histone-like protein. Science 202, 219–221 (1978).
Article ADS CAS Google Scholar
Hixon, W. G. & Searcy, D. G. Cytoskeleton in the archaebacterium Thermoplasma acidophilum? Viscosity increase in soluble extracts. BioSystems 29, 151–160 ( 1993).
Article CAS Google Scholar
Smith, P. F., Langworthy, T. A. & Smith, M. R. Polypeptide nature of growth requirement in yeast extract for Thermoplasma acidophilum. J. Bacteriol. 124, 884–892 (1975).
CAS PubMed PubMed Central Google Scholar
Frishman, D., Mironov, A., Mewes, H. W. & Gelfand, M. Combining diverse evidence for gene recognition in completely sequenced bacterial genomes. Nucleic Acids Res. 26, 2941– 2947 (1998).
Article CAS Google Scholar
Frishman, D. & Mewes, H. W. PEDANTic genome analysis. Trends Genet. 13, 415–416 (1997).
Article CAS Google Scholar
Altschul, S. F. et al. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 25, 3389–3402 (1997).
Article CAS Google Scholar
Koretke, K. K., Russell, R. B., Copley, R. R. & Lupas, A. N. Fold recognition using sequence and secondary structure information. Proteins Struct. Funct. Genet. 37, 141– 148 (1999).
Article Google Scholar

Download references

Acknowledgements

We thank P. Forterre and P. Lopez for helping to define the origin of replication; M. Boicu and C. Czoppelt for sequencing; G. Mannhaupt for annotating part of the ORFs; I. Echabe for preparing template DNA and sequencing; and B. Marshall for developing software for gene cluster analysis and for data management.

Author information

Authors and Affiliations

Max-Planck-Institut für Biochemie , Am Klopferspitz 18a, Martinsried , D-82152, Germany
Andreas Ruepp, Werner Graml, Martha-Leticia Santos-Martinez & Wolfgang Baumeister
Bioinformatics, Smith Kline Beecham Pharmaceuticals, Collegeville, 19426, Pennsylvania, USA
Kristin K. Koretke, Craig Volker & Andrei N. Lupas
GSF-Forschungszentrum für Umwelt und Gesundheit, Munich Information Center for Protein Sequences, Am Klopferspitz 18a, Martinsried, D-82152, Germany
H. Werner Mewes, Dmitrij Frishman & Susanne Stocker

Authors

Andreas Ruepp
View author publications
You can also search for this author in PubMed Google Scholar
Werner Graml
View author publications
You can also search for this author in PubMed Google Scholar
Martha-Leticia Santos-Martinez
View author publications
You can also search for this author in PubMed Google Scholar
Kristin K. Koretke
View author publications
You can also search for this author in PubMed Google Scholar
Craig Volker
View author publications
You can also search for this author in PubMed Google Scholar
H. Werner Mewes
View author publications
You can also search for this author in PubMed Google Scholar
Dmitrij Frishman
View author publications
You can also search for this author in PubMed Google Scholar
Susanne Stocker
View author publications
You can also search for this author in PubMed Google Scholar
Andrei N. Lupas
View author publications
You can also search for this author in PubMed Google Scholar
Wolfgang Baumeister
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Wolfgang Baumeister.

Supplementary information

Sequencing methods (DOC 34 kb)

Gene list (XLS 126 kb)

Figure 1 (PDF 201 kb)

Figure 2 (PDF 13 kb)

Figure legends (DOC 35 kb)

Energy metabolism (DOC 70 kb)

Energy classification (XLS 11 kb)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Ruepp, A., Graml, W., Santos-Martinez, ML. et al. The genome sequence of the thermoacidophilic scavenger Thermoplasma acidophilum. Nature 407, 508–513 (2000). https://doi.org/10.1038/35035069

Download citation

Received: 13 April 2000
Accepted: 06 July 2000
Issue Date: 28 September 2000
DOI: https://doi.org/10.1038/35035069

This article is cited by

Microbial carbon, sulfur, iron, and nitrogen cycling linked to the potential remediation of a meromictic acidic pit lake
- Diana Ayala-Muñoz
- Jennifer L Macalady
- William D Burgos
The ISME Journal (2022)
Community structure and activity potentials of archaeal communities in hadal sediments of the Mariana and Mussau trenches
- Zixuan Wang
- Li Wang
- Zhe Xie
Marine Life Science & Technology (2022)
Genomic attributes of thermophilic and hyperthermophilic bacteria and archaea
- Digvijay Verma
- Vinay Kumar
- Tulasi Satyanarayana
World Journal of Microbiology and Biotechnology (2022)
Geobiological feedbacks and the evolution of thermoacidophiles
- Daniel R Colman
- Saroj Poudel
- Eric S Boyd
The ISME Journal (2018)
Microbial communities and their predicted metabolic functions in a desiccating acid salt lake
- Elena Zaikova
- Kathleen C. Benison
- Sarah Stewart Johnson
Extremophiles (2018)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

The genome sequence of the thermoacidophilic scavenger Thermoplasma acidophilum

Abstract

Similar content being viewed by others

Recovery of Lutacidiplasmatales archaeal order genomes suggests convergent evolution in Thermoplasmatota

Genomic evolution of the class Acidithiobacillia: deep-branching Proteobacteria living in extreme acidic conditions

The metabolic network of the last bacterial common ancestor

Main

Methods

Genome sequencing analysis methods

Annotation

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Supplementary information

Sequencing methods (DOC 34 kb)

Gene list (XLS 126 kb)

Figure 1 (PDF 201 kb)

Figure 2 (PDF 13 kb)

Figure legends (DOC 35 kb)

Energy metabolism (DOC 70 kb)

Energy classification (XLS 11 kb)

Rights and permissions

About this article

Cite this article

This article is cited by

Microbial carbon, sulfur, iron, and nitrogen cycling linked to the potential remediation of a meromictic acidic pit lake

Community structure and activity potentials of archaeal communities in hadal sediments of the Mariana and Mussau trenches

Genomic attributes of thermophilic and hyperthermophilic bacteria and archaea

Geobiological feedbacks and the evolution of thermoacidophiles

Microbial communities and their predicted metabolic functions in a desiccating acid salt lake

Comments

Use your neighbour's genes

Search

Quick links

Abstract

Similar content being viewed by others

Main

Methods

Genome sequencing analysis methods

Annotation

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links