Fatty acid metabolism and its regulation are known to play important roles in bacteria and eukaryotes. By contrast, although certain archaea appear to metabolize fatty acids, the regulation of the underlying pathways in these organisms remains unclear. Here, we show that a TetR-family transcriptional regulator (FadRSa) is involved in regulation of fatty acid metabolism in the crenarchaeon Sulfolobus acidocaldarius. Functional and structural analyses show that FadRSa binds to DNA at semi-palindromic recognition sites in two distinct stoichiometric binding modes depending on the operator sequence. Genome-wide transcriptomic and chromatin immunoprecipitation analyses demonstrate that the protein binds to only four genomic sites, acting as a repressor of a 30-kb gene cluster comprising 23 open reading frames encoding lipases and β-oxidation enzymes. Fatty acyl-CoA molecules cause dissociation of FadRSa binding by inducing conformational changes in the protein. Our results indicate that, despite its similarity in overall structure to bacterial TetR-family FadR regulators, FadRSa displays a different acyl-CoA binding mode and a distinct regulatory mechanism.
The phylogenetic classification of archaea as a domain of life distinct from bacteria1 is supported by the nature of their membrane lipids having isoprenoid-based hydrocarbon chains instead of fatty acids2. Despite the absence in archaeal membrane lipids, small amounts of fatty acids and derivatives have been detected in archaeal cells3,4,5,6,7,8,9. The role of fatty acids for archaeal cellular physiology is not yet clear and a controversial issue of debate10, although an involvement in the acylation or stabilization of membrane-bound energy-conversion proteins such as rhodopsin or cytochromes has been postulated10,11,12. Many archaeal genomes have extensive sets of typical bacterial-like genes encoding fatty acid synthase type II (FAS-II) complex and β-oxidation enzymes10,13,14. An outstanding question is whether these fatty acid metabolism genes perform anabolic or catabolic reactions, or both9,10. Given the absence of genes encoding acyl-carrier protein (ACP) or ACP synthase13, it has been postulated that a β-oxidation pathway might operate in the reverse direction in conjunction with acetyl-CoA C-acetyltransferase enzymes10. These are abundantly encoded in archaeal genomes, sometimes in the direct neighborhood of β-oxidation genes10,14.
Despite the abundance of fatty acid metabolism genes in many genomes, nothing is known about how their expression is regulated in archaea. In contrast, this is well characterized in bacteria, in which a tight regulation of the synthesis and degradation of fatty acids involves multiple transcription regulators that act in response to intracellular fatty acid-related metabolic signals15. In Gram-negative bacteria a GntR-family regulator FadR has a dual role by coordinately repressing β-oxidation genes while activating FAS-II genes in response to acyl-CoA molecules16,17, whereas a TetR-family malonyl-CoA-dependent regulator FabR controls the ratio between mono-unsaturated and saturated fatty acids18,19,20. Gram-positive bacteria such as Bacillus subtilis use an identically named transcription factor FadR that belongs to the TetR family for the acyl-CoA dependent regulation of β-oxidation degradation21 and a DeoR family member FapR that regulates biosynthesis of saturated fatty acids and phospholipids22. The mechanism of action of the bacterial acyl-CoA responsive TetR-like regulator has been unraveled by analysis of apo, ligand-bound, and DNA-bound crystal structures23,24,25,26.
In this work, we focus on characterizing the transcriptional regulation of genes encoding fatty acid metabolism functions in the thermoacidophilic crenarchaeon Sulfolobus acidocaldarius, which is genetically tractable and considered to be a major archaeal model organism27,28. S. acidocaldarius has an extensive gene cluster, comprising genes Saci_1103 until Saci_1126, encoding homologs of the three β-oxidation enzymes acyl-CoA dehydrogenase, enoyl-CoA hydratase, and hydroxyacyl-CoA dehydrogenase. Also, genes encoding members of the thiolase superfamily presumably catalyzing the last step of the β-oxidation cycle, i.e., ketoacyl-CoA thiolases as well as acetyl-CoA acetyltransferases were identified within the cluster10. In addition, genes encoding lipid degradation functions are present in this genomic region. Concerning these latter functions, Saci_1105 and Saci_1116 code for enzymes that were experimentally shown to display esterase activity29. The Saci_1103-Saci_1126 gene cluster also comprises a gene, Saci_1107, encoding a predicted TetR-like transcription factor for which we hypothesized that it might be involved in regulating the expression of the gene cluster. We aim at performing structural, biochemical, genetic and genomic analyses of this regulator, named FadRSa, thereby unveiling the function and mode of action of an acyl-CoA-responsive transcriptional regulator in an archaeal microorganism.
S. acidocaldarius harbors a 30-kb gene cluster consisting of genes encoding enzymes involved in lipid and fatty acid metabolism and a putative regulator (Saci_1107, Fig. 1a). As a first step towards functional characterization of this regulator, we performed a crystallographic analysis of the protein encoded by Saci_1107 (Figure 1b and Table 1). Size exclusion chromatography (SEC) indicated that the purified recombinant protein behaves as a homogenous population of 44-kDa sized dimers (Supplementary Figure 1). The asymmetric unit of the 2.4-Å resolution crystal structure also contains a homodimer with an exclusive alpha-helical structure. Each subunit displays two functional domains: an N-terminal helix-turn-helix (HTH) DNA-binding domain (α1–α3) and a C-terminal domain (α4–α9) of which α8 and α9 stabilize dimerization. The overall Ω-shape structure of the dimer validates its classification as a TetR family member30.
Although BLAST analyses initially did not reveal which bacterial regulators could be considered as potential functional homologs for the protein encoded by Saci_1107, a superposition revealed structural similarity with the previously characterized TetR-family FadR transcription regulators in Bacillus sp., FadRBs (RMSD = 4.23 Å) and FadRBh21,26 (RMSD = 5.88 Å), and Thermus thermophilus, FadRTt24 (RMSD = 11.85 Å) (Fig. 1c). Conservation is significantly higher for the N-terminal than for the C-terminal domains (Supplementary Table 1) as also confirmed by a structure-based sequence alignment (Fig. 1d). This structural similarity led us to propose to name this protein FadRSa accordingly.
Upon solving the FadRSa crystal structure, one of the subunits (subunit B) was found to have additional unassigned electron density in the C-terminal domain. This could be explained by fitting it with an acyl-CoA molecule (Fig. 1b), which was likely derived from Escherichia coli during heterologous overexpression. The best fit was obtained with heptanoyl-CoA. Given the low intracellular abundance of odd-chained acyl-CoA molecules it is possible that a mixture of even-chained short-chain acyl-CoA molecules was present in the ligand binding pockets of different protein molecules packed in the crystal. The unintended cocrystallization of acyl-CoA with FadRSa (Fig. 1b) suggests that it is a specific ligand of the protein. This further supports the hypothesis that the regulatory role of this transcription factor is connected to acyl-CoA metabolism.
Genome-wide DNA-interaction map of FadRSa
As a next step toward unraveling FadRSa function, we employed chromatin immunoprecipitation (ChIP) in combination with next-generation sequencing (ChIP-seq). A total of 14 significant and reproducible in vivo-associated genomic loci were identified (Fig. 2a and Supplementary Table 2). The two highest enrichments were observed within the Saci_1103-Saci_1126 gene cluster. More specifically, both high-enrichment binding regions were located within the intergenic region of the divergently organized operon encoding the fadRSa gene itself and a putative esterase-encoding gene (peaks 1 and 2). Within the Saci_1103-Saci_1126 gene cluster, two additional low-enrichment binding regions were observed within the coding sequence of gene Saci_1115 and in the intergenic region separating a divergently encoded β-oxidation operon and a putative transcription factor gene, respectively (peaks 3 and 4). Targeted chromatin immunoprecipitation quantitative polymerase chain reaction (ChIP-qPCR) validated the observed enrichments (Fig. 2b), which were not observed anymore upon deleting fadRSa (Supplementary Figure 2). All sequences enriched in the ChIP-seq analysis were subjected to a computational binding motif prediction, yielding a 16-base pair (bp) motif with dyad symmetry that is conserved in 13 of 14 binding regions (Fig. 2c and Supplementary Table 2). Besides the four experimentally identified binding sites, an in silico screening revealed three additional putative binding sites in the gene cluster, of which one is located within the open reading frame (ORF) of Saci_1106 (Supplementary Figure 3, Supplementary Table 3). Possibly they were not captured in the ChIP-seq analysis but are functional in other conditions.
Electrophoretic mobility shift assays (EMSAs) with DNA probes encompassing the centers of the binding regions verified that the observed ChIP-seq enrichment regions represent direct and specific FadRSa–DNA interactions (Fig. 2d, Supplementary Table 2 and Supplementary Figure 4). Densitometric analysis of EMSA autoradiographs performed with the high-enrichment targets revealed a formation of two electrophoretically distinct FadRSa–DNA complexes with high affinity and positive-binding cooperativity. In contrast, with the low-enrichment targets only a single complex was formed with lower affinity and without apparent cooperativity (Fig. 2d and Supplementary Figure 4). These findings suggest that for different genomic targets, FadRSa is capable of using distinct DNA-interaction modes differing in binding stoichiometry.
Mechanisms of DNA binding
To further unravel mechanisms of DNA binding by FadRSa, we determined the cocrystal structure of the protein–DNA complex to a resolution of 3.29 Å using a duplex DNA containing the predicted FadRSa binding motif in the control region of the fadRSa gene itself, corresponding to ChIP-seq peak 2 (Table 1 and Fig. 3a, b). The asymmetric unit contained six FadRSa subunits, organized as three dimers, and two DNA duplex molecules thus representing a nonbiological assembly (Fig. 3a), although the protein–DNA molecular interactions within this structure are representative of the biologically relevant complexes (see below, DNA-binding stoichiometry of FadRSa).
In each FadRSa subunit in the cocrystal structure, residues of the recognition helix α3 and the α2–α3 loop interact with the major groove of DNA with the establishment of an extensive number of contacts (Fig. 3b, Supplementary Note 1 and Supplementary Data 1). Base-specific contacts mainly consist of hydrophobic interactions between FadRSa residues Tyr47, Leu49, Tyr51, and Phe52 and methyl groups of thymines (Fig. 3c) similar as in other TetR-like regulators26,30 (Supplementary Figure 5), in addition to electrostatic interactions between Gly48 and the N7 group of guanines. The role of these residues for DNA binding was further investigated by performing site-directed alanine substitution and analyzing the mutant proteins in EMSA (Supplementary Figure 6a). FadRSaY47A, FadRSaY51A, and FadRSaY53A are all negatively affected in DNA-binding affinity and cooperativity. With FadRSaG48A, no DNA binding was observed at all demonstrating that Gly48 is a crucial residue (Supplementary Figure 6a). Besides protein–DNA contacts, a weak electrostatic protein–protein contact was also observed between Asn37 residues of FadRSa dimers bound on different sides of the DNA helix (Fig. 3d and Supplementary Note 1).
DNA-binding stoichiometry of FadRSa
To dissect the stoichiometric nature of the electrophoretically distinct FadRSa–DNA complexes, SEC was performed with the different molecular species (Fig. 4a). With a homogenous population of dimers in solution (Fig. 4a and Supplementary Figure 1) and the FadRSa–Saci_1123 complex B having an apparent molecular weight (MW) measured in SEC of 140 kDa that is only minimally exceeding that of free DNA (119 kDa), it can be concluded that FadRSa binds the Saci_1123 operator as a single dimer. The observation that the relative mobility in EMSA of FadRSa–Saci_1123 complex B is highly similar to that of the FadRSa–fadRSa complex B1 (Fig. 4b), led us to postulate that the transitional FadRSa–fadRSa complex B1 has a stoichiometry similar as for the sole FadRSa–Saci_1123 complex. In contrast, the dominantly formed complex B2 with the fadRSa operator has a larger apparent MW (179 kDa) (Fig. 4a): it can be assumed that the apparent MW attributed by the FadRSa protein itself is similar for measurements of free protein and of FadRSa–DNA complexes and that the FadRSa–fadRSa complex B2 has a stoichiometry that is twice as large as that of complex B1, thus harboring two dimers. SEC experiments with lower protein:DNA molar ratios indicate that the entire amount of FadRSa in the preparation is capable of binding DNA (Supplementary Figure 7). This excludes the possibility that a subpopulation of the protein is in a ligand-induced state lacking DNA-binding activity as suggested by the observation of acyl-CoA cocrystallizing with the protein in the apo crystal structure (Fig. 1b), assuming that acyl-CoA binding causes DNA dissociation like in bacterial FadR regulators.
Next, footprinting experiments were performed for the FadRSa–DNA complexes B1 and B2 observed in EMSAs with fadRSa and quasi-identical Saci_1106 operator probes (representing ChIP-seq peaks 1 and 2, respectively) (Fig. 4c and Supplementary Figure 8). Chemical “in gel” Cu–phenantroline footprinting demonstrated that for the fadRSa operator probe, both complexes are characterized by a similar protection zone roughly restricted to the predicted binding motif with a small stretch of additional protection extending upstream of the motif in complex B2 (Fig. 4c). This upstream extension was also observed in DNase I footprinting experiments for both operator probes (Fig. 4c and Supplementary Figure 8). In contrast, the protection zones observed in footprinting experiments with the Saci_1123 operator probe (representing ChIP-seq peak 4) are smaller and only correspond to the binding motif (Supplementary Figure 9). These results support the notion that in the FadRSa–Saci_1123 complex B and the FadRSa–fadRSa complex B1 a single-FadRSa dimer interacts with the predicted semipalindromic binding motif. A second dimer interacts with the upstream (left) side of the inverted repeat in FadRSa–fadRSa complex B2, suggesting a dimer-of-dimer interaction mode similar to that observed for a bacterial subclass of TetR-like regulators prototyped by QacR in Staphylococcus aureus, in which two overlapping inverted repeats are bound by dimers located on opposite sides of the DNA helix31.
To relate the molecular interactions and binding architecture in the FadRSa–DNA cocrystal structure with the biologically relevant stoichiometries, we mapped the contacts on the fadRSa and Saci_1123 operator sequences that could be hypothesized to exist based on the cocrystal structure (Fig. 5a). The presence of a G–C bp 11 positions upstream of the symmetrical C–G in the pseudopalindromic site appears to be the sole explanation of the preference of a second FadRSa dimer to establish an interaction with the upstream (left) and not downstream (right) side of the pseudopalindromic site. Similarly, the absence of purine–pyrimidine and pyrimidine–purine bps on appropriate positions up- and downstream of the inverted repeat in the Saci_1123 operator appears to explain why only a single dimer is bound. Based on the cocrystal structure, the G base of this G–C bp in the fadRSa operator can be assumed to be contacted by Gly48 that is crucial for DNA binding (Fig. 5b and Supplementary Figure 6a) with the dimer-of-dimer-complex architecture reflecting that of the architecture of the AB dimer and subunit E of the DE dimer interacting with a single-DNA duplex in the cocrystal structure (Fig. 3a). This reasoning is underscored by the observation that the introduction of a G–C and C–G bp at the indicated positions of the Saci_1123 operator causes the formation of two instead of one nucleoprotein complex (Fig. 5c). Given that adenines also have an N7 group, the base specificity of the Gly48–guanine interaction is possibly explained by an indirect readout of the preceding thymidine or cytosine residue in the light of YpG base pair steps being more prone to unstacking and commonly involved in sequence-specific protein–DNA interactions in a combined direct and indirect readout32. In addition, the mutation of Asn37 causes a diminished cooperativity in the formation of the dimer-of-dimer complex B2 with the mutated Saci_1123 operator (Fig. 5c), proving the involvement of this residue in a protein–protein interaction. This supports the notion that established FadRSa–DNA contacts are similar in the complex in solution as in the portion of the cocrystal structure harboring dimer AB and subunit E of dimer EF and that the relative positioning of the two dimers in the biologically relevant complex is similar to that of dimers AB and EF in the cocrystal structure.
In conclusion, while bacterial TetR proteins are subdivided in two classes depending on whether they employ a dimer or dimer-of-dimer DNA-binding mode26,30, the archaeal FadRSa regulator is capable of using both interaction modes depending on the operator sequence. A Gly48–guanine interaction and Asn37-mediated protein–protein contacts, of which the latter have never before been observed for bacterial dimer-of-dimer binding TetR-like proteins, assist in the dimer-of-dimer interaction mode.
Determination of the FadRSa regulon
To infer whether or not the observed genomic binding of FadRSa leads to transcriptional regulation, a comparative transcriptomic analysis was performed for the fadRSa deletion mutant versus the isogenic WT strain using an RNA-seq approach (Fig. 6a, Supplementary Note 2, Supplementary Figure 10, Supplementary Table 4 and Supplementary Data 2). The deletion of fadRSa did not affect cell morphology and growth in a medium containing sucrose and NZamine as carbon and energy sources (Supplementary Figure 11). RNA-seq analysis revealed that thirteen genes are differentially expressed, which all belong to the Saci_1103-Saci_1126 gene cluster. Moreover, as confirmed by quantitative reverse transcriptase PCR (qRT-PCR), all other genes of the gene cluster appear to be expressed at slightly higher levels in the mutant strain as well (Fig. 6a–b and Supplementary Data 2). We therefore conclude that FadRSa is a local repressor of the entire Saci_1103-Saci_1126 gene cluster, which is predicted to harbor genes for a complete β-oxidation pathway (Fig. 6a).
All but one of these FadRSa binding sites are located at too large distances (>130 bp) from their corresponding promoters to hypothesize a classical repression mechanism that involves direct interaction with the components of the basal transcription machinery. As an exception, the fadRSa control region harbors a binding site just downstream of the transcription start site. For this target, it is shown that FadRSa binding stimulates the interaction with basal transcription factors TATA binding protein (TBP) and transcription factor B (TFB) (Supplementary Figure 12), pointing to a direct repression mechanism occurring at later stages of transcription initiation than during TBP and TFB recruitment.
Besides the local regulon, several other genes, including an operon encoding a putative sulfate reduction pathway and cytochrome-encoding genes, were found to have slightly lower expression levels in the ΔfadRSa strain pointing to an indirect activation effect (Fig. 6b, Supplementary Note 2 and Supplementary Data 2). The suggestion of this effect being indirect is corroborated by the prediction of only a limited number of putative FadRSa binding sites in the genomic regions surrounding the downregulated genes, which are characterized by relative high p values (>1.00E−05) (Supplementary Table 3) and which were not captured by ChIP-seq. Furthermore, these indirect regulatory effects hint at a reversely correlated link between the fatty acid metabolism catalyzed by the enzymes encoded in the Saci_1103-Saci_1126 gene cluster on one hand and sulfur metabolism and cytochrome-containing membrane complexes on the other hand.
The observed transcriptional regulation of the Saci_1103-Saci_1126 gene cluster strongly suggests that FadRSa is implicated in the regulation of fatty acid and lipid metabolism. Since it was previously observed that simultaneous deletion of both esterase-encoding genes in the gene cluster (Saci_1105 and Saci_1116) led to a phenotype lacking the ability to perform tributyrin hydrolysis29, we performed a similar phenotypic assay with the fadRSa deletion mutant (Supplementary Figure 13a). Despite the higher expression levels of both esterase genes in the fadRSa deletion mutant (Fig. 6a–b), we did not observe a difference in time-dependent halo formation upon growth on tributyrin (Supplementary Figure 13a). In contrast, upon growing S. acidocaldarius in a liquid medium containing hexanoate as a sole carbon and energy source, the fadRSa deletion mutant displayed a significantly higher growth rate in exponential growth phase with respect to the isogenic WT strain (doubling times Tds of 20.5 and 26.3 h, respectively; Fig. 6c). As this effect was not observed during growth on the shorter-chain butyrate (Supplementary Figure 13b), it correlates to fatty acid chain length. These experiments support that S. acidocaldarius is capable of degrading fatty acids to sustain growth and that this catabolic metabolism may be at least partly catalyzed by enzymes encoded in the Saci_1103-Saci_1126 gene cluster. Furthermore, the FadRSa regulator represses this catabolic fatty acid metabolism as its deletion, thereby causing a derepression of the gene cluster, results in a faster growth rate (Fig. 6c). The observation of this difference can be explained by hexanoate only causing a partial FadRSa-mediated derepression given the relative short-chain length of these acyl-CoA molecules (see below, “FadRSa–ligand interactions”).
Next, we prepared FadRSa crystals in the presence of lauroyl–CoA (C12:0-CoA) and solved the FadRSa–lauroyl–CoA cocrystal structure to a resolution of 1.90 Å (Table 1). In contrast to the initial structure showing an acyl-CoA derivative bound to only one of the subunits, in this structure, both subunits of the dimer harbor a lauroyl–CoA molecule (Supplementary Figure 14a). The orientation of the ligand-binding pockets within the protein and the binding mode of the ligand is completely different in FadRSa as compared to the characterized bacterial FadR proteins24,25,26 (Fig. 7a). In contrast to the ligand entering the pocket from within the dimer interface as in bacterial FadR regulators, in the FadRSa structure the ligand enters the protein from the outer surface of the protein completely opposite to the dimer interface. Further, for each ligand-binding pocket only a single FadRSa subunit is involved in ligand interaction in contrast to two subunits in the bacterial FadR regulators. Consequently, ligand conformation is different and the acyl chain has a rather straight conformation in FadRSa while it is bent in FadRBs (Supplementary Figure 14b).
Upon zooming into the ligand-binding pocket, a large number of specific lauroyl–CoA–FadRSa interactions are observed (Fig. 7b and Supplementary Data 3). Whereas the adenine moiety is located on the outside of α4 and appears not to be contacted by the protein, the remainder of the lauroyl–CoA molecule enters the protein in between α4 and α7 with the establishment of electrostatic interactions between polar residues (a.o. Arg73, Lys77, Arg86, and Arg132) and the CoA moiety, especially with the CoA phosphate groups (Fig. 7b). Upon comparison of residue conformations in the DNA-bound and lauroyl–CoA structures, it became apparent that the orientation of the α5 residue Met101 is significantly altered (Supplementary Figure 15a). The lauroyl chain is deeply buried into a tunnel-like binding pocket formed in between helices α4–α7 and is entirely surrounded by hydrophobic residues such as Phe68, Phe97, Phe111, and Phe126. Although the nature of these ligand-interaction residues (polar residues for CoA-interactions, hydrophobic residues for side chain interactions) is similar as in bacterial TetR-like FadR regulators, they are not homologous as shown on a structure-based sequence alignment (Fig. 1d). Furthermore, FadRSa does not contain a hydrophilic patch in the acyl-binding pocket similarly to the Bacillus counterparts in which it affects ligand-binding specificity25,26.
EMSAs demonstrated that acyl-CoA molecules, but not acetyl-CoA, CoA and free fatty acids, strongly abrogate FadRSa–DNA complex formation with the extent of the abrogation effect correlating with the length of the acyl chain (Fig. 7d and Supplementary Figure 16a). Competition assays confirmed that the addition of acetyl-CoA or hexanoyl-CoA does not affect sensitivity of the protein to oleoyl-CoA and thus that the inhibition effect reflects binding specificity (Supplementary Figure 16b). Alanine substitution of the CoA-interacting residues Arg73 and Arg86 confirmed their importance for the ligand response (Fig. 7e and Supplementary Figure 6b). Likewise, alanine mutation of the allosterically altered Met101 residue desensitizes the protein to oleoyl-CoA. In conclusion, acyl-CoA binds the regulator thereby causing dissociation of FadRSa–DNA complexes with the affinity and extent of the effect correlating to the acyl chain length.
Molecular mechanism of ligand response
To learn more about the allosteric regulatory mechanism employed by FadRSa, we compared the ligand-bound and DNA-bound structures (Fig. 7c and Supplementary Figure 15b). Both structures were superimposed with an RMSD of 1.01 Å (Supplementary Figure 15b). Subtle differences were noted in the relative orientation of the HTH motifs within a dimer with the binding of lauroyl–CoA causing the distance between the two α3 helices to be enlarged from an average 37.0–43.2 Å (measured as the Cα–Cα distance of Tyr51 residues located in α3). As a consequence, the increased distance between the α3 helices makes the dimeric FadRSa conformation suboptimal for interaction in consecutive major groove segments. Besides the α3 helix, the α2 helix is shifted by an average distance of 7.9 Å and the α4 helix is displaced with an angle of 9° (Fig. 7c). Intriguingly, one of the three dimers in the FadRSa–DNA cocrystal structure appeared to have a ligand-bound conformation distinct from the other two dimers; this is reflected by the distance between the two α3 helices being 45.3 Å for this central dimer EF versus an average of 37.0 Å for the flanking dimers (Fig. 3a; Supplementary Note 3). In conclusion, ligand binding allosterically opens up the dimer thereby causing it to dissociate from the DNA, similarly as the mechanism observed for FadRBh26.
Occurence of FadR in archaea
FadRSa is not restricted to S. acidocaldarius but is also represented in all other Sulfolobus species, in other Crenarchaeota belonging to Sulfolobales (Acidianus spp.) and Thermoproteales (Cuniculiplasma divulgatum), and also in species belonging to Euryarchaeota (Thermoplasmatales) and in the recently discovered Marsarchaeota33 (Fig. 8). These organisms have in common that they are all thermophiles, some with a (facultative) aerobic metabolism, others with an anaerobic metabolism. Conservation of residues that are involved in DNA or ligand binding indicate that these proteins are FadRSa orthologs with similar functional characteristics (Fig. 8a). Moreover, as is the case for FadRSa, several of these homologs are encoded in genomic environments abundant in genes coding for enzymes involved in fatty acid metabolism or for enzymes with lipase functions (Fig. 8b). Although gene synteny is not strictly conserved, the extent of some of these gene clusters, especially in other Sulfolobus species, suggests the potential existence of similar FadR-mediated acyl-CoA responsive repression. This hypothesis is supported by the prediction of putative FadR binding sites in the neighborhood of fadR promoters and at distant locations, either in intergenic regions or in ORFs, for the gene clusters in other Sulfolobus species (Fig. 8b).
It is well-established that archaea harbor typical bacterial-like transcription regulators34,35, which are proposed to result from shared ancestry as well as from extensive horizontal gene transfers, especially from bacteria to archaea36. FadRSa, of which we show that it displays structural similarities with bacterial TetR-like FadR regulators, is an archaeal member of the widespread prokaryotic TetR family. Despite these similarities, there are pronounced differences between the archaeal FadR regulator and the bacterial counterparts which point to a complete absence of shared ancestry. For example, the acyl-CoA binding function of FadRSa appears to have arisen through convergent evolution with respect to the bacterial regulators. In contrast to FadRBs and FadRTt, in which the crystal structure revealed medium-chain lauroyl–CoA in one of the two dimeric subunits24,25, we observed the presence of a short-chain acyl-CoA in the native FadRSa structure. This difference likely reflects different acyl-CoA-binding specificities and might be explained by the absence of a hydrophilic patch in the part of the ligand-binding pocket that surrounds the first 10–12 carbon atoms of the acyl chain, as observed for the Bacillus counterparts25,26. As a consequence, FadRSa has a different ligand specificity, which is expected to have consequences for the biological function and to reflect different biological roles of fatty acids for cellular physiology in bacteria and archaea. Interestingly, the observation of an inverse correlation between the expression of the Saci_1103-Saci_1126 gene cluster and that of cytochrome-encoding genes further supports the suggested function of fatty acids stabilizing membrane complexes in archaea10.
FadRSa uses two distinct DNA-binding modes operator-dependently with a Gly–guanine interaction being the crucial determinant and that dimer-of-dimer complex formation with the quasi-identical high-affinity fadRSa and Saci_1106 operators occurs in a cooperative manner with the transitional formation of a dimer-bound complex in which FadRSa-mediated DNA bending is more pronounced than in the dimer-of-dimer complex. By binding to a total of only four binding sites in the Saci_1103-Saci_1126 gene cluster, FadRSa is capable of repressing transcription of the entire 23-gene cluster containing at least 17 transcription units37. Furthermore, with the exception of the autoregulatory binding site that is located just downstream of the TTS and that is expected to result in repression through direct interaction with the basal transcription initiation machinery, the other binding sites are located at least 130 bp upstream of the corresponding promoters, which is a noncanonical position as compared to most previously characterized archaeal repressors38. FadRSa thus employs a nonparadigmatic repression mechanism that could be hypothesized to be dependent on long-range interactions. The observation of cocrystallization of FadRSa with two individual DNA duplexes (Fig. 3a), as was also observed for the bacterial TetR member CgmR in Corynebacterium glutamicum39, further supports the possibility that the regulator is capable of co-associating with different DNA segments.
The finding that FadRSa represses the Saci_1103-Saci_1126 gene cluster and that it is responsive to acyl-CoA molecules acting as inducers in vitro strongly suggests that intracellularly present acyl-CoA molecules cause a derepression and thus higher transcriptional expression of the gene cluster in vivo. The observation that deletion of the regulator causes cells to display a faster growth on hexanoate as sole energy and carbon source supports that the β-oxidation enzymes encoded in this gene cluster minimally have a degradation function; this is in line with the logic behind the regulatory strategy. A catabolic function of the β-oxidation enzymes is also in agreement with the function of the co-regulated esterase enzymes encoded by Saci_1105 and Saci_1116, which enable cells to grow on lipids29. Fatty acid oxidation adds to the chemoorganotrophic capabilities of Sulfolobus spp. that appear more important than the originally described chemolithotrophic sulfur-oxidizing metabolism40. A full picture of the functioning of fatty acid metabolism in Sulfolobus, and whether the enzymes encoded in the Saci_1103–1126 function only the catabolic or also anabolic direction, awaits the biochemical and genetic characterization of the enzymes. An intriguing hypothesis has been put forward stating that fatty acid metabolism enzymes in archaea do not have a catalytic bias but are instead regulated by the relative substrate and product concentrations9,10. Possibly, acyl-CoA-responsive FadRSa-mediated induction of the expression of the Saci_1103-Saci_1126 gene cluster is only a single element in a more complex regulatory system in which other regulatory mechanisms are also in place to enable the fine-tuned expression and activity of promiscuous enzymes in response to relative concentrations of a variety of fatty acid-related metabolic signals. This postulated regulatory complexity might reflect the employment of the same enzymes for catabolic and anabolic reactions instead of the use of distinct dedicated pathways as in bacteria and eukaryotes. Furthermore, based on the occurrence of FadRSa orthologs in other thermophilic archaea, such as Thermoplasmatales and Marsarchaeota, it can be assumed that this type of transcriptional regulation is not restricted to Sulfolobus spp.
Microbial strains and growth conditions
S. acidocaldarius strains MW001 and its derivatives were cultivated at 75 °C in Brock basal salts medium40 supplemented with 0.2 (w/v)% sucrose, 0.2 (w/v)% NZamine and 10 μg ml−1 uracil. For ChIP experiments, a 200 ml S. acidocaldarius DSM639 culture was grown at 80 °C in Brock basal salts medium supplemented with 0.1% tryptone. The pH of the medium was adjusted to 3.5 by addition of sulfuric acid. For growth experiments in the presence of fatty acids, precultures of S. acidocaldarius MW001 and its derivatives were first grown in Brock medium supplemented with 0.2 (w/v)% NZamine followed by multiple transfers to Brock medium supplemented with 2 mM butyrate or hexanoate. Cultivations of the third and fourth generation were considered to lack residual NZamine and to be representative for growth on fatty acids as sole carbon source. For growth on plates, Brock medium was solidified by the addition of 0.6 (w/v)% gelrite, 10 mM MgCl2 and 3 mM CaCl2. To detect the hydrolysis of tributyrin, cells were inoculated on tributyrin-containing plates composed of 1.4% gelrite dissolved in Brock basal salt medium supplemented with 20 mM MgCl2 × 6H2O, 6 mM CaCl2, 0.2% (w/v) NZamine, 0.4% (w/v) dextrine and 1% (v/v) tributyrin29. Cavities were made into the plates using a 5 ml pipette tip followed by addition of 20 µl of cell culture grown up to an optical density measured at 600 nm (OD600) of 0.7. The plates were incubated up to 7 days.
E. coli strains DH5α and Rosetta DE3 were used for the propagation of plasmid DNA constructs and heterologous protein overexpression, respectively, and were grown at 37 °C in lysogeny broth (LB) medium supplemented with 50 μg ml−1 ampicillin (DH5α) or with 50 µg ml−1 ampicillin and 34 μg ml−1 chloramphenicol (Rosetta (DE3)). An overview of all strains used in this work is given in Supplementary Table 5.
A markerless in-frame fadRSa deletion strain (S. acidocaldarius ΔfadRSa) was generated of the uracil auxotrophic S. acidocaldarius strain (MW001) using the classical “pop-in pop-out” method with a suicide disruption vector containing up- and downstream flanking regions and the gene-of-interest region besides the pyrEF selection marker genes (pSVA406xΔfadR)28. An overview of all plasmid vectors and oligonucleotide sequences is given in Supplementary Table 6 and Supplementary Data 4, respectively. Successful deletion of the gene was confirmed by PCR analysis and sequencing of genomic DNA (gDNA) (Supplementary Figure 2).
Protein expression and purification
The fadRSa coding region was amplified by PCR from S. acidocaldarius gDNA and cloned into a pET45b expression vector using BamHI and HindIII restriction sites (pET45bxfadRSa) resulting in an N-terminally His-tagged construct. Site-directed mutagenesis was performed with the overlap PCR mutagenesis approach41 using pET45bxfadRSa as a template and complementary mutagenic primers to remove an NdeI restriction site in the fadRSa ORF with a silent mutation. This enabled cloning in the NdeI/XhoI sites of pET24a yielding a C-terminally His-tagged construct (pET24axfadRSaNdenull). For the construction of N37A, Y47A, G48A, Y51A, Y53A, R73A, R86A, and M101A variants of FadRSa, site-directed mutagenesis was performed in an identical approach using pET24axfadRSaNdenull as a template.
Heterologous overexpression of the recombinant FadRSa proteins was accomplished in E. coli Rosetta (DE3) by growing a culture until reaching an OD600 between 0.6 and 0.7, incubating the cells on ice during 30 min and inducing gene expression by the addition of 0.4 mM isopropyl β-d-1-thiogalactopyranoside (IPTG). Subsequently, the culture was further incubated at 37 °C during 20 h, pelleted by centrifugation, resuspended in binding buffer (20 mM sodium phosphate pH 7.4, 0.5 M NaCl, 20 mM imidazole) and lysed by sonication. Finally, the cell extract was subjected to heat treatment (80 °C during 30 min) followed by centrifugation at 23,000×g during 45 min. Purification of His-tagged FadRSa was performed with immobilized metal ion affinity chromatography by applying the supernatant to a His GraviTrap system (GE Healthcare) equilibrated with binding buffer (20 mM sodium phosphate (pH 7.4), 0.5 M NaCl and 20 mM imidazole). The column was washed with binding buffer where after the protein was eluted with 3 × 1 ml of 20 mM sodium phosphate (pH 7.4), 0.5 M NaCl and 500 mM imidazole. N- and C-terminally His-tagged proteins behave similar in DNA-binding assays.
For the preparation of selenomethionine (SeMet)-substituted FadRSa protein, cells were allowed to grow overnight at 37 °C in 2 ml 2 × YT medium containing 50 µg ml−1 ampicillin. The overnight culture was used to inoculate 50 ml M9 minimal medium (50 mM Na2HPO4, 3 g l−1 KH2PO4, 0.5 g l−1 NaCl, 1 g l−1 NH4Cl) and growth was continued at 37 °C until an OD600 of 0.5 was reached. Thereafter, 50 ml of the culture was added to 700 ml M9 medium supplemented with 2 mM MgSO4, 0.1 mM CaCl2, 0.4% glucose, and 50 µg ml−1 ampicillin, and continue growth at 37 °C until an OD600 of 0.4–0.6. Following l-amino acids were then added: Lys, Phe, and Tyr (100 mg l−1), Leu, Ile, and Thr (50 mg l−1) and l-selenomethionine (Acros Organics; final concentration of 60 mg l−1). At an OD600 of 0.6–0.8, protein expression was induced by adding 0.5 mM IPTG (isopropyl β-d-1-thiogalactopyranodside) followed by overnight incubation. SeMet-substituted protein was purified according to the same procedure as described above using a pET45bxfadRSa harboring strain. All FadRSa protein preparations were essentially pure, as judged by SDS-polyacrylamide gel electrophoresis (PAGE) and by SEC (Supplementary Figure 1).
The ORFs of TBP (Saci_1336) and TFB1 (Saci_0866) were PCR-amplified, digested and ligated into pET30a (Novagen) using the restriction exonucleases NdeI and XhoI and transformed into E. coli Rosetta 2 (DE3). Cells harboring pET30axtbp were grown until reaching an OD600 of approximately 0.6, followed by an induction with 1 mM IPTG and further growth during 4 h at 37 °C. Cells were harvested by centrifugation and resuspended in lysis buffer (25 mM Tris-HCl pH 7.5, 300 mM NaCl) supplemented with protease inhibitor cocktail (Roche) and disrupted via French pressure cell (Thermo Electron Corporation, USA) for three passages at 12,000 psi followed by ultracentrifugation (30,000×g during 45 min). A heat treatment (80 °C during 15 min) was again followed by ultracentrifugation and TBP protein was further purified by anion exchange chromatography using a ResourceQ column (GE Healthcare) with a salt gradient up to 1 M NaCl and SEC on a HiLoad superdex 26/60 75 prep grade column (GE Healthcare) using 25 mM Tris-HCl pH 7.5, 300 mM NaCl. Similarly, E. coli Rosetta 2 (DE3) cells expressing C-terminally hexahistidine-tagged TFB1 were grown in medium supplemented with 10 mM MgCl2 and 100 µM ZnSO4, induced at an OD600 of approximately 0.6 by adding 0.2 mM IPTG and followed by further growth at 23 °C overnight. Cells were harvested by centrifugation and subsequently resuspended in modified N-buffer (25 mM Tris-HCl pH 7.5, 10 mM MgCl2, 100 µM ZnSO4, 1 mM tris(2-carboxyethyl)phosphine (TCEP)) supplemented with 1 M NaCl and protease inhibitor. After cell disruption via French pressure cell (three passages at 12,000 psi), ultracentrifugation (30,000×g during 45 min) and heat treatment at 75 °C for 15 min the cleared lysate was applied to a Ni-TED column (Macherey and Nagel) for affinity purification. Modified N-buffer containing 300 mM NaCl was used for equilibration and washing, the bound target protein was eluted with the same buffer containing 250 mM imidazole. Pure TFB1 protein was finally obtained by performing SEC with a HiLoad superdex 26/60 200 prep grade gel filtration column (GE Healthcare). TBP and TFB1 were used in EMSAs (Supplementary Figure 12).
Crystallization and data collection
Crystallization of SeMet-derivated FadRSa was performed at 20 °C using the hanging-drop vapor diffusion method by mixing equal volumes of protein solution (3 mg ml−1) and reservoir solution consisting of 20% (w/v) PEG3350, 0.2 M sodium nitrate and 0.1 M Bis–Tris propane, pH 8.5. Appropriately sized crystals were obtained after 6–8 weeks. The crystals belong to space group P21, with unit-cell parameters a = 41.9, b = 98.7, c = 55.9 Å, and β = 106.4°, and two molecules per asymmetric unit, giving a Matthews coefficient of 2.51 Å3 Da−1 and 51% solvent content. The complexes of FadRSa:DNA and FadRSa:lauroyl–CoA were obtained by cocrystallization of FadRSa with a 21-bp duplex DNA and 1 mM lauroyl–CoA (lithium salt), respectively. Prior to data collection, crystals were soaked in a cryo-solution containing 20% glycerol, 10% (w/v) PEG3350, 0.1 M sodium nitrate and 0.05 M Bis–Tris propane, pH 8.5 followed by immediate flash-cooling in liquid nitrogen.
High-resolution X-ray data (Table 1) were collected at 100 K at European Synchroton Radiation Facility (ESRF) beamlines ID23-1 (SeMet-substituted FadRSa) and ID29 (FadRSa:DNA and FadRSa:lauroyl–CoA).
Structure determination and refinement
The structure of FadRSa was determined using the SAD method with selenomethionine-substituted protein. Diffraction data were processed and scaled using the XDS program package42. A set of 5% of the reflections was set aside and used to calculate the quality factor Rfree43. The structure was solved using AutoSol in PHENIX44. Refinement was performed with PHENIX altered with manual rebuilding in O45. The structure was refined to Rfac = 20.3% and Rfree = 25.3%, respectively (Table 1). The structure of the dsDNA and lauroyl–CoA complexes were solved by molecular replacement with Phaser46, using the SeMet-subsituted FadRSa structure as a model. All structures were evaluated using wwPDB Validation Server47. Refinement statistics are presented in Table 1. The coordinates and structure factors have been deposited in the PDB database with accession codes 5MWR, 6EN8, and 6EL2. PDBsum48 was used to identify protein–DNA and protein–ligand interactions, supplemented with a manual inspection employing PyMOL49. All figures displaying protein structures were prepared with PyMOL49.
ChIP was performed by growing S. acidocaldarius DSM639 to early exponential growth phase (an OD600 between 0.2 and 0.3) and adding formaldehyde to the culture to a final concentration of 1%50,51. After a 5-min incubation, glycine was added to a final concentration of 125 mM. Fixed cells were harvested by centrifugation at 8000×g during 10 min and the pellet was resuspended in 3 ml IP buffer (50 mM Hepes-KOH pH 7.5, 150 mM NaCl, 1 mM EDTA, 1% Triton X-100, 0.1% sodium deoxycholate, 0.1% SDS, protease inhibitor cocktail (Roche Applied Science)). Subsequently, cells were sonicated on ice until DNA fragments were obtained with an average size around 250 bp. After centrifugation during 20 min at 17,000×g, 100 µl of the sheared DNA-containing supernatant was kept apart to use as input control and the remaining sample was divided into two aliquots. One aliquot was incubated with anti-FadRSa antibody coated M-280 Sheep Anti-Rabbit Dynabeads (Invitrogen) and the other was incubated with pre-immune serum coated Dynabeads, which served as a nonspecific binding control (mock control). The bead-antibody complexes were prepared by mixing 80 µl Dynabeads with either FadRSa-specific antibodies (produced by immunizing a rabbit with purified recombinant FadRSa (Innovagen)) or rabbit pre-immune serum. Precipitation reactions were performed according to manufacturer’s instruction. After overnight incubation at 4 °C, the Dynabeads were collected and the captured DNA was eluted and purified by using the iPURE DNA extraction kit (Diagenode) according to the manufacturer’s instruction.
DNA purified from ChIP, input and mock samples were sequenced (1 × 51 bp) by a Miseq sequencer (Illumina) at ScilifeLab, Stockholm, Sweden. Sequence reads were mapped to the S. acidocaldarius DSM639 genome (NC_007181.1) with Burrows–Wheeler Aligner (BWA 0.7.10)52 using default settings and MACS2 (2.1.0)53 was employed for peak calling. To generate sufficient sequencing reads, reads of two mock samples were combined before performing the analysis. The ChIP-seq experiment was done in biological duplicate and only peaks called in both experiments were retained; this was followed by a manual curation. Finally, ChIP-seq results were visualized by IGV version 2.3.5954. DNA sequences of enriched regions were extracted by BEDTools’ getfasta function55 and subjected to a binding motif search with MEMEsuite (4.10.0)56. The FIMO tool of MEMEsuite was used for the prediction of binding motifs in other genomic regions.
For ChIP-qPCR, 20-ml cultures of S. acidocaldarius MW001 and S. acidocaldarius MW001ΔfadRSa were crosslinked and harvested in mid exponential growth phase (OD600 of about 0.4) and ChIP was performed as described above. Primers were designed with Primer3 Plus software57 (Supplementary Data 5) and were chosen to amplify fragments around the ChIP-seq peak summit regions and with a length between 150 and 200 bp. qPCR was performed with a My-iQTM Single color Real-time PCR system (Bio-Rad) and GoTaq qPCR Master Mix (Promega) was done with thermal cycling conditions: 10 min at 94 °C and 40 cycles of 30 s at 94 °C and 30 s at 60 °C. Fold-enrichment calculations were performed with the 2−ΔΔCt method58 using an irrelevant genomic region (the ORF of Saci_1336) as a nonbinding reference. Cultures were assayed in biological duplicate and the S. acidocaldarius ΔfadRSa strain was used as mock experiment.
Electrophoretic mobility shift and footprinting assays
32P-labeled DNA was prepared by 5′-end-labeling of oligonucleotides using [γ-32P]-ATP (Perkin Elmer) and T4 polynucleotide kinase (Thermo Scientific). Each of these labeled oligonucleotides were then used together with a non-labeled oligonucleotide (Supplementary Data 4) in a PCR reaction with S. acidocaldarius gDNA as a template or, in case of 45-bp probes, in a hybridization reaction with the nonlabeled reverse complementary oligonucleotide. Labeled DNA fragments were subsequently purified by polyacrylamide gel electrophoresis. EMSAs were performed59 with approximately 0.1 nM 32P-labeled DNA probe and an excess of nonspecific competitor DNA. Dimethylsulfoxide was used as a solvent to dissolve acyl-CoA, but did not affect the protein–DNA interaction (Supplementary Figure 16). Binding reactions were prepared in binding buffer (20 mM Tris-HCl (pH 8.0), 1 mM MgCl2, 0.1 mM dithiothreitol (DTT), 12.5% glycerol, 50 mM NaCl, 0.4 mM EDTA) and allowed to equilibrate at 37 °C prior to electrophoresis on 6% acrylamide gels in TEB buffer (89 mM Tris, 2.5 mM EDTA, and 89 mM boric acid).
DNase I footprinting was performed by the method of Galas and Schmitz60 in the same binding buffer as used for EMSA and with Maxam–Gilbert treated samples as sequencing ladders61. “In gel” Cu–phenantroline (OP) chemical footprinting experiments, enabling to analyze distinct populations of nucleoprotein complexes separately, were performed by performing EMSA as described above and immersing an entire EMSA acrylamide gel in 200 ml of 10 mM Tris (pH 8.0)59. To initiate chemical cleavage reactions, 20 ml of solution A (2 mM OP, 0.45 mM CuSO4) was added followed by 20 ml of solution B (58 mM mercaptopropionic acid). After 15 min, reactions were quenched by the addition of 20 ml 30 mM neocuprine hydrate. This mixture was allowed to equilibrate during 5 min, after which the gel was thoroughly rinsed with distilled H2O and exposed to an autoradiograph film. Different DNA populations were excised from the gel, recovered by precipitation and analyzed by denaturing acrylamide gel electrophoresis with Maxam–Gilbert treated samples as sequencing ladders61.
For stoichiometry experiments of FadRSa–DNA complexes, SEC was performed on a Superdex 200 Increase10/30 GL column with an ÄKTA FPLC system (GE Healthcare Life Sciences). A total of 4–40 nM FadRSa protein was mixed with 0.2–1 nM 45-bp DNA fragments encompassing the fadRSa operator or Saci_1123 operator, respectively, which were prepared before by hybridization. After an incubation of the reaction mixtures in 20 mM Na2HPO4 (pH 7.4), 150 mM NaCl during 25 min at 37 °C, they were loaded onto the column with the same buffer as mobile phase buffer. Calibration for MW calculation was performed with ribonuclease A (13.7 kDa), carbonic anhydrase (29 kDa), conalbumin (75 kDa), and ferritin (440 kDa), all from a Gel Filtration Calibration Kit (GE Healthcare Life Sciences).
Total RNA was prepared from duplicate MW001 and MW001ΔfadRSa cultures in early exponential growth phase at an OD600 between 0.2 and 0.3 using a miRNeasy Mini Kit (Qiagen). Libraries were prepared with a TruSeq Stranded mRNA Library Prep Kit (Illumina). Paired-end (2 × 125 bp) RNA sequencing was performed using a Hiseq2500 system (Illumina) at SciLifeLab, Stockholm, Sweden. Sequence reads were first trimmed to remove sequencing adapters by cutadapt 1.9.162 and reads shorter than 20 nt were discarded. Processed reads were then mapped to the S. acidocaldarius DSM639 genome (NC_007181.1)63 with Tophat 2.0.1264. For each gene, the FPKM was calculated with Cufflinks 2.2.165. Finally, read counts were obtained by the featureCounts function in the Subread package 1.5.066 and only genes having at least one count in all samples were used for differential gene expression analysis with DESeq267.
RNA was extracted at an OD600 between 0.2 and 0.3 by stabilization with RNA Protect Reagens (Qiagen) and by using an SV Total RNA Isolation System (Promega). Residual gDNA was removed by treatment with TURBO DNase (Ambion Life Technologies). cDNA was prepared from 1 μg RNA with an iScript™ Select cDNA Synthesis Kit (Bio-Rad). All qRT-PCR oligonucleotides (Supplementary Data 4) were designed with Primer3 Plus software57. qRT-PCR analysis was performed in a Bio-Rad iCycler using SYBR Green Master Mix (Bio-Rad) for amplification and detection. Each reaction mixture contained approximately 10 ng of template and 200 nM of each primer in a total volume of 25 µl. The temperature program was as follow: 10 min at 94 °C and 40 cycles of 30 s at 94 °C, 30 s at 60 °C50. Relative expression ratios were calculated using the delta–delta Ct method58 for biological triplicates and by normalizing with respect to the tbp reference gene. Data were statistically analyzed by performing a t test with the software package Prism 6.0 (GraphPad).
Further information on experimental design is available in the Nature Research Reporting Summary linked to this article.
All crystal structures presented in this work have been deposited in the Protein Data Bank (PDB) and are available with accession codes 5MWR, [has been superseded with 6EL2] (native FadRSa structure), 6EL2 (lauroyl–CoA-bound FadRSa structure) and 6EN8 (DNA-bound FadRSa structure). All raw data for the ChIP-seq and RNA-seq studies presented in this work have been deposited in the Gene Expression Omnibus (GEO) databank with accession codes GSE108039 and GSE108018, respectively.
Woese, C. R. & Fox, G. E. Phylogenetic structure of the prokaryotic domain: the primary kingdoms. Proc. Natl Acad. Sci. USA 74, 5088–5090 (1977).
De Rosa, M., Gambacorta, A. & Gliozzi, A. Structure, biosynthesis, and physicochemical properties of archaebacterial lipids. Microbiol. Rev. 50, 70–80 (1986).
Kates, M., Wassef, M. K. & Kushner, D. J. Radioisotopic studies on the biosynthesis of the glyceryl diether lipids of Halobacterium cutirubrum. Can. J. Biochem. 46, 971–977 (1968).
Langworthy, T. A., Mayberry, W. R. & Smith, P. F. Long-chain glycerol diether and polyol dialkyl glycerol triether lipids of Sulfolobus acidocaldarius. J. Bacteriol. 119, 106–116 (1974).
Tornabene, T. G. et al. Phytanyl-glycerol ethers and squalenes in the archaebacterium Methanobacterium thermoautotrophicum. J. Mol. Evol. 11, 259–266 (1978).
Carballeira, N. M. et al. Unusual fatty acid compositions of the hyperthermophilic archaeon Pyrococcus furiosus and the bacterium Thermotoga maritima. J. Bacteriol. 179, 2766–2768 (1997).
Nishihara, M., Nagahama, S., Ohga, M. & Koga, Y. Straight-chain fatty alcohols in the hyperthermophilic archaeon Pyrococcus furiosus. Extremophiles 4, 275–277 (2000).
Gattinger, A., Schloter, M. & Munch, J. C. Phospholipid etherlipid and phospholipid fatty acid fingerprints in selected euryarchaeotal monocultures for taxonomic profiling. FEMS Microbiol. Lett. 213, 133–139 (2002).
Hamerly, T. et al. Characterization of fatty acids in Crenarchaeota by GC-MS and NMR. Archaea 2015, 472726 (2015).
Dibrova, D. V., Galperin, M. Y. & Mulkidjanian, A. Y. Phylogenomic reconstruction of archaeal fatty acid metabolism. Environ. Microbiol. 16, 907–918 (2014).
Corcelli, A. et al. Role of palmitic acid on the isolation and properties of halorhodopsin. Biochim. Biophys. Acta 1281, 173–181 (1996).
Colella, M., Lobasso, S., Babudri, F. & Corcelli, A. Palmitic acid is associated with halorhodopsin as a free fatty acid. Radiolabeling of halorhodopsin with 3H-palmitic acid and chemical analysis of the reaction products of purified halorhodopsin with thiols and NaBH4. Biochim. Biophys. Acta 1370, 273–279 (1998).
Lombard, J., López-García, P. & Moreira, D. An ACP-independent fatty acid synthesis pathway in archaea: implications for the origin of phospholipids. Mol. Biol. Evol. 29, 3261–3265 (2012).
Villanueva, L., Schouten, S. & Damsté, J. S. S. Phylogenomic analysis of lipid biosynthetic genes of Archaea shed light on the ‘lipid divide’. Environ. Microbiol. 19, 54–69 (2017).
Fujita, Y., Matsuoka, H. & Hirooka, K. Regulation of fatty acid metabolism in bacteria. Mol. Microbiol. 66, 829–839 (2007).
Henry, M. F. & Cronan, J. E. Escherichia coli transcription factor that both activates fatty acid synthesis and represses fatty acid degradation. J. Mol. Biol. 222, 843–849 (1991).
My, L., Ghandour Achkar, N., Viala, J. P. & Bouveret, E. Reassessment of the genetic regulation of fatty acid synthesis in Escherichia coli: global positive control by the dual functional regulator FadR. J. Bacteriol. 197, 1862–1872 (2015).
Zhang, Y.-M., Marrakchi, H. & Rock, C. O. The FabR (YijC) transcription factor regulates unsaturated fatty acid biosynthesis in Escherichia coli. J. Biol. Chem. 277, 15558–15565 (2002).
Zhu, K., Zhang, Y.-M. & Rock, C. O. Transcriptional regulation of membrane lipid homeostasis in Escherichia coli. J. Biol. Chem. 284, 34880–34888 (2009).
Feng, Y. & Cronan, J. E. Complex binding of the FabR repressor of bacterial unsaturated fatty acid biosynthesis to its cognate promoters. Mol. Microbiol. 80, 195–218 (2011).
Matsuoka, H., Hirooka, K. & Fujita, Y. Organization and function of the YsiA regulon of Bacillus subtilis involved in fatty acid degradation. J. Biol. Chem. 282, 5180–5194 (2007).
Schujman, G. E., Paoletti, L., Grossman, A. D. & de Mendoza, D. FapR, a bacterial transcription factor involved in global regulation of membrane lipid biosynthesis. Dev. Cell 4, 663–672 (2003).
Badger, J. et al. Structural analysis of a set of proteins resulting from a bacterial genomics project. Proteins 60, 787–796 (2005).
Agari, Y., Agari, K., Sakamoto, K., Kuramitsu, S. & Shinkai, A. TetR-family transcriptional repressor Thermus thermophilus FadR controls fatty acid degradation. Microbiology 157, 1589–1601 (2011).
Fujihashi, M. et al. Structural characterization of a ligand‐bound form of Bacillus subtilis FadR involved in the regulation of fatty acid degradation. Proteins 82, 1301–1310 (2014).
Yeo, H. K., Park, Y. W. & Lee, J. Y. Structural basis of operator sites recognition and effector binding in the TetR family transcription regulator FadR. Nucleic Acids Res. 45, 4244–4254 (2017).
Leigh, J. A., Albers, S.-V., Atomi, H. & Allers, T. Model organisms for genetics in the domain Archaea: methanogens, halophiles, Thermococcales and Sulfolobales. FEMS Microbiol. Rev. 35, 577–608 (2011).
Wagner, M. et al. Versatile genetic tool box for the crenarchaeote Sulfolobus acidocaldarius. Front. Microbiol. 3, 214 (2012).
Zweerink, S. et al. Activity-based protein profiling as a robust method for enzyme identification and screening in extremophilic Archaea. Nat. Commun. 8, 15352 (2017).
Ramos, J. L. et al. The TetR family of transcriptional repressors. Microbiol. Mol. Biol. Rev. 69, 326–356 (2005).
Schumacher, M. A. et al. Structural basis for cooperative DNA binding by two dimers of the multidrug-binding protein QacR. EMBO J. 21, 1210–1218 (2002).
Lamoureux, J., Maynes, J. T. & Glover, J. N. M. Recognition of 5’-YpG-3’ sequences by coupled stacking/hydrogen bonding interactions with amino acid residues. J. Mol. Biol. 335, 399–408 (2004).
Jay, Z. J. et al. Marsarchaeota are an aerobic archaeal lineage abundant in geothermal iron oxide microbial mats. Nat. Microbiol 3, 732–740 (2018).
Kyrpides, N. C. & Ouzounis, C. A. Transcription in archaea. Proc. Natl Acad. Sci. USA 96, 8545–8550 (1999).
Bell, S. D. & Jackson, S. P. Mechanism and regulation of transcription in archaea. Curr. Opin. Microbiol. 4, 208–213 (2001).
Nelson-Sathi, S. et al. Origins of major archaeal clades correspond to gene acquisitions from bacteria. Nature 517, 77–80 (2015).
Cohen, O. et al. Comparative transcriptomics across the prokaryotic tree of life. Nucleic Acids Res. 44, W46–W53 (2016).
Peeters, E., Peixeiro, N. & Sezonov, G. Cis-regulatory logic in archaeal transcription. Biochem. Soc. Trans. 41, 326–331 (2013).
Itou, H., Watanabe, N., Yao, M., Shirakihara, Y. & Tanaka, I. Crystal structures of the multidrug binding repressor Corynebacterium glutamicum CgmR in complex with inducers and with an operator. J. Mol. Biol. 403, 174–184 (2010).
Brock, T. D., Brock, K. M., Belly, R. T. & Weiss, R. L. Sulfolobus: a new genus of sulfur-oxidizing bacteria living at low pH and high temperature. Arch. Mikrobiol. 84, 54–68 (1972).
Higuchi, R., Krummel, B. & Saiki, R. K. A general method of in vitro preparation and specific mutagenesis of DNA fragments: study of protein and DNA interactions. Nucleic Acids Res. 16, 7351–7367 (1988).
Kabsch, W. XDS. Acta Crystallogr. D Biol. Crystallogr. 66, 125–132 (2010).
Brünger, A. T. Free R value: a novel statistical quantity for assessing the accuracy of crystal structures. Nature 355, 472–475 (1992).
Terwilliger, T. C. et al. Iterative model building, structure refinement and density modification with the PHENIX AutoBuild wizard. Acta Crystallogr. D Biol. Crystallogr. 64, 61–69 (2008).
Jones, T. A., Zou, J. Y., Cowan, S. W. & Kjeldgaard, M. Improved methods for building protein models in electron density maps and the location of errors in these models. Acta Crystallogr. A Found. Crystallogr. 47, 110–119 (1991). (Pt 2).
McCoy, A. J. et al. Phaser crystallographic software. J. Appl. Crystallogr. 40, 658–674 (2007).
Berman, H., Henrick, K. & Nakamura, H. Announcing the worldwide Protein Data Bank. Nat. Struct. Biol. 10, 980–980 (2003).
Laskowski, R. A., Jabłońska, J., Pravda, L., Vařeková, R. S. & Thornton, J. M. PDBsum: Structural summaries of PDB entries. Protein Sci. 112, 535 (2017).
DeLano, W. L. PyMOL. (2002).
Nguyen-Duc, T., Peeters, E., Muyldermans, S., Charlier, D. & Hassanzadeh-Ghassabeh, G. Nanobody(R)-based chromatin immunoprecipitation/micro-array analysis for genome-wide identification of transcription factor DNA binding sites. Nucleic Acids Res. 41, e59–e59 (2013).
Liu, H., Wang, K., Lindås, A.-C. & Peeters, E. The genome-scale DNA-binding profile of BarR, a β-alanine responsive transcription factor in the archaeon Sulfolobus acidocaldarius. BMC Genom. 17, 569 (2016).
Li, H. & Durbin, R. Fast and accurate long-read alignment with Burrows-Wheeler transform. Bioinformatics 26, 589–595 (2010).
Zhang, Y. et al. Model-based analysis of ChIP-Seq (MACS). Genome Biol. 9, R137 (2008).
Robinson, J. T. et al. Integrative genomics viewer. Nat. Biotechnol. 29, 24–26 (2011).
Quinlan, A. R. & Hall, I. M. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 26, 841–842 (2010).
Bailey, T. L. & Elkan, C. Fitting a mixture model by expectation maximization to discover motifs in biopolymers. Proc. Int. Conf. Intell. Syst. Mol. Biol. 2, 28–36 (1994).
Rozen, S. & Skaletsky, H. Primer3 on the WWW for general users and for biologist programmers. Methods Mol. Biol. 132, 365–386 (2000).
Livak, K. J. & Schmittgen, T. D. Analysis of relative gene expression data using real-time quantitative PCR and the 2−ΔΔCT method. Methods 25, 402–408 (2001).
Peeters, E., Thia-Toong, T.-L., Gigot, D., Maes, D. & Charlier, D. Ss-LrpB, a novel Lrp-like regulator of Sulfolobus solfataricus P2, binds cooperatively to three conserved targets in its own control region. Mol. Microbiol. 54, 321–336 (2004).
Galas, D. J. & Schmitz, A. DNase footprinting: a simple method for the detection of protein-DNA binding specificity. Nucleic Acids Res. 5, 3157–3170 (1978).
Maxam, A. M. & Gilbert, W. Sequencing end-labeled DNA with base-specific chemical cleavages. Meth. Enzymol. 65, 499–560 (1980).
Martin, M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet. J. 17, 10–12 (2011).
Chen, L. et al. The genome of Sulfolobus acidocaldarius, a model organism of the Crenarchaeota. J. Bacteriol. 187, 4992–4999 (2005).
Kim, D. et al. TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions. Genome Biol. 14, R36 (2013).
Trapnell, C. et al. Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. Nat. Biotechnol. 28, 511–515 (2010).
Liao, Y., Smyth, G. K. & Shi, W. featureCounts: an efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics 30, 923–930 (2014).
Love, M. I., Huber, W. & Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 15, 550 (2014).
Notredame, C., Higgins, D. G. & Heringa, J. T-Coffee: a novel method for fast and accurate multiple sequence alignment. J. Mol. Biol. 302, 205–217 (2000).
Nguyen, N. T. T. et al. RSAT 2018: regulatory sequence analysis tools 20th anniversary. Nucl. Acids Res. 46, W209–W214.
We are grateful to Daniël Charlier for assistance with footprinting experiments and critical reading of the manuscript. We also would like to acknowledge support from Science for Life Laboratory, the National Genomics Infrastructure (NGI) and Uppmax in Stockholm/Uppsala, Sweden, for providing assistance in massive parallel sequencing and computational infrastructure. Research in the laboratory of E.P. was supported by the Research Council of the Vrije Universiteit Brussel (start-up funds) and by the Research Foundation Flanders (FWO-Vlaanderen) (research project G021118 and research grant 1526418N). D.S. is a recipient of an FWO-SB PhD fellowship of the Research Foundation Flanders (FWO-Vlaanderen). Research in the laboratory of A.C.L. was supported by the Swedish Research Council, Grant 621-2013-4685.
The authors declare no competing interests.
Journal peer review information: Nature Communications thanks the anonymous reviewers for their contribution to the peer review of this work. Peer reviewer reports are available.
Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Wang, K., Sybers, D., Maklad, H.R. et al. A TetR-family transcription factor regulates fatty acid metabolism in the archaeal model organism Sulfolobus acidocaldarius. Nat Commun 10, 1542 (2019). https://doi.org/10.1038/s41467-019-09479-1
This article is cited by
A TetR family transcriptional regulator, SP_2854 can affect the butenyl-spinosyn biosynthesis by regulating glucose metabolism in Saccharopolyspora pogona
Microbial Cell Factories (2022)