All known triterpenes are generated by triterpene synthases (TrTSs) from squalene or oxidosqualene1. This approach is fundamentally different from the biosynthesis of short-chain (C10–C25) terpenes that are formed from polyisoprenyl diphosphates2,3,4. In this study, two fungal chimeric class I TrTSs, Talaromyces verruculosus talaropentaene synthase (TvTS) and Macrophomina phaseolina macrophomene synthase (MpMS), were characterized. Both enzymes use dimethylallyl diphosphate and isopentenyl diphosphate or hexaprenyl diphosphate as substrates, representing the first examples, to our knowledge, of non-squalene-dependent triterpene biosynthesis. The cyclization mechanisms of TvTS and MpMS and the absolute configurations of their products were investigated in isotopic labelling experiments. Structural analyses of the terpene cyclase domain of TvTS and full-length MpMS provide detailed insights into their catalytic mechanisms. An AlphaFold2-based screening platform was developed to mine a third TrTS, Colletotrichum gloeosporioides colleterpenol synthase (CgCS). Our findings identify a new enzymatic mechanism for the biosynthesis of triterpenes and enhance understanding of terpene biosynthesis in nature.
Triterpenoids are a large class of natural products with a broad range of bioactivities5 known from all kingdoms of life6,7. Their biosynthesis starts with dimerization of farnesyl diphosphate (FPP) by squalene synthase into squalene8. The cyclization is catalysed by TrTSs such as squalene hopene cyclase (SHC) to yield pentacyclic hopene, the precursor of hopanoids9. After oxidation to (S)-2,3-epoxysqualene, lanosterol synthase (LS) promotes the biosynthesis of lanosterol, the precursor to steroids and saponins10 (Fig. 1). These enzymes are classified as class II terpene synthases (TSs) with a conserved DXDD motif for substrate activation through protonation. By contrast, class I TSs have two conserved DDXX(D/E) and NSE/DTE motifs for binding of a (Mg2+)3 cluster that in turn binds the substrate’s diphosphate11. In cooperation with the diphosphate sensor, a conserved arginine forming hydrogen bridges to the diphosphate, and an effector residue at a helix break12 the Lewis acidity of Mg2+ leads to substrate ionization through diphosphate abstraction. Many examples are known of class I TSs13 that convert substrates with chain lengths between C10 and C25 into usually (poly)cyclic terpenes. By contrast, class I cyclization of hexaprenyl diphosphate (HexPP; C30) into triterpenes is unknown, although HexPP is widely distributed in nature14,15,16 and serves as a precursor for ubiquinone (Q6) in yeast and for highly bioactive meroterpenoids in sponges17. Bifunctional enzymes harbouring a prenyl transferase (PT) and terpene cyclase (TC) domain (PTTCs) have recently attracted attention3,18,19. Their PT domains convert dimethylallyl diphosphate (DMAPP) and isopentenyl diphosphate (IPP) into polyisoprenyl diphosphates, and the TC domains carry out cyclization into di- and sesterterpenes. Thus far, no such enzyme is known to produce triterpenes. Here we describe the discovery of chimeric fungal class I TrTSs for the production and conversion of HexPP into triterpenes.
Using our yeast-based genome mining platform20, two putative bifunctional TSs with PT and TC domains emerged from the endophyte T. verruculosus TS63-9 (ref. 21; ZTR_06220) and the plant pathogen M. phaseolina MS6 (ref. 22; MPH_02178) (Supplementary Figs. 1 and 2). These enzymes showed 40.4% and 33.4% amino acid sequence identity with the Phomopsis amygdali fusicoccadiene synthase23 (PaFS) and the Aspergillus clavatus ophiobolin F synthase24 (AcOS), respectively, and are located within clades II-B and II-C of fungal bifunctional TSs (Supplementary Fig. 3). Thus, both enzymes are predicted to follow a C1-III-IV cyclization mode with attack of the third double bond at C1 followed by attack of the fourth double bond25.
Expression of ZTR_06220 in Saccharomyces cerevisiae YZL141 (ref. 20) yielded the triterpene hydrocarbon talaropentaene (1) (Supplementary Table 3, Fig. 2a, d and Supplementary Figs. 4–12), characterizing the enzyme as TvTS. Transcriptional analysis showed that TvTS was not expressed in its native host, but placing the gene under the control of the amyB promoter resulted in the production of 1 and confirmed its natural function (Fig. 2b, d). Because recombinant full-length TvTS was insoluble, for in vitro assays the TvTS-PT and TvTS-TC domains were expressed individually (Supplementary Fig. 14). Incubation of TvTS-PT with DMAPP and IPP yielded HexPP (Fig. 2c), which was converted into 1 by TvTS-TC (Fig. 2d). The combination of TvTS-PT and TvTS-TC also accepted FPP and IPP, while TvTS-TC alone was sufficient to cyclize HexPP (Supplementary Scheme 1) to 1 (Supplementary Fig. 15).
The absolute configuration of 1 was determined through stereoselective deuteration (Supplementary Table 5)26. Conversion of geranyl diphosphate (GPP) and (E)- or (Z)-(4-13C,4-2H)IPP27 with TvTS-PT and TvTS-TC indicated an absolute configuration of (10R,11S)-1 (Supplementary Fig. 16). Similar experiments using GPP and (R)- or (S)-(1-13C,1-2H)IPP28 with TvTS-PT and TvTS-TC confirmed this assignment for 1 (Supplementary Fig. 17). Compound 1 is structurally similar to (–)-δ-araneosene from the fungal clade II-D TS C. gloeosporioides dolasta-1(15),8-diene synthase (CgDS), including corresponding absolute configurations (Fig. 2e)25.
The proposed cyclization mechanism for 1 starts with diphosphate abstraction from HexPP, followed by C1-III-IV cyclization to A. A subsequent 1,2-hydride shift to B and deprotonation generate 1 (Fig. 2e). Experimental proof for the 1,2-hydride shift was obtained through incubation of (3-13C,2-2H)FPP29 and IPP with TvTS-PT and TvTS-TC, yielding (15-13C,15-2H)-1. This product showed an intensive upfield-shifted triplet (Δδ = –0.45 ppm, 1JC,D = 18.9 Hz) for C15 indicating a direct 13C–2H bond (Supplementary Fig. 18). Incubation of GPP and (R)- or (S)-(1-13C,1-2H)IPP with TvTS-PT and TvTS-TC resulted in specific loss of the 13-pro-R hydrogen of B in the deprotonation to 1 (Supplementary Fig. 19).
Expression of MPH_02178 in S. cerevisiae resulted in macrophomene (2), albeit with low yield, but DMAPP and IPP were efficiently converted into the same triterpene using purified recombinant enzyme (Supplementary Figs. 20–22). Consequently, the TrTS was identified as MpMS. Analysis of an MpMS D114A/N115A variant with an inactivated TC domain demonstrated that HexPP was produced by the PT domain (Fig. 2c). In addition, GPP, FPP, geranylgeranyl diphosphate (GGPP) and geranylfarnesyl diphosphate (GFPP) in combination with IPP and synthetic HexPP were also accepted by MpMS to yield 2. For structure elucidation of 2 (Supplementary Table 6 and Supplementary Figs. 23–30), the MpMS E104Y variant was selected30 (Supplementary Fig. 31). To enable full assignment of the nuclear magnetic resonance (NMR) data, which was prevented by multiple peak overlaps for unlabelled 2, isotopic labelling experiments with 13C-labelled precursors were performed (Supplementary Fig. 32a–e).
Notably, 2 is structurally similar to casbene. Both compounds are bicyclic with one macrocyclic ring and one three-membered ring, except the ring fusion is trans for 2 and cis for casbene (Fig. 2f)31. To our knowledge, the 22-membered ring in macrophomene represents the largest macrocycle discovered in terpenes so far and is only reflected by 22-membered rings in nostocyclophanes, a class of oxidatively dimerized polyketides from cyanobacteria32. The proposed cyclization mechanism from HexPP to 2 starts with diphosphate abstraction and 1,22-cyclization to C, followed by deprotonation at C1 to close the cyclopropane ring. This mechanism represents an unprecedented C1-VI cyclization mode (Supplementary Fig. 3, designated type C) for chimeric TSs. The stereochemical fate of the geminal methyl groups at C24 and C25 of HexPP was addressed by incubating (12-13C)FPP33 and (9-13C)GPP25 with IPP and MpMS. Product analysis by 13C-NMR showed a clear stereochemical course for the attack at C23 in cyclopropane ring closure (Supplementary Fig. 33). The stereochemistry of the deprotonation was addressed by incubation with GFPP and (R)-(1-2H)IPP or (S)-(1-2H)IPP34, with deuterium retained for the (R) enantiomer and lost from the (S) enantiomer (Supplementary Fig. 34). Taken together, the face selectivity at C23 and the stereospecificity of the deprotonation at C1 suggest an absolute configuration of (1R,22R)-2, which was confirmed by stereoselective deuteration through incubation of MpMS with (R)-(1-13C,1-2H)IPP and (S)-(1-13C,1-2H)IPP (Supplementary Fig. 35).
The structural basis of the cyclization mechanism was investigated by determination of the crystal structure of apo TvTS-TC by soaking with the non-reactive substrate surrogate 2,3-dihydro-HexPP (Extended Data Table 1 and Supplementary Scheme 2). Similarly to PaFS-TC35, TvTS-TC adopts the characteristic class I TS fold12,36 (Extended Data Fig. 1a, b), but TvTS-TC possesses a larger active site to accommodate HexPP. Although 2,3-dihydro-HexPP was not clearly observed and we cannot completely exclude the possibility that the observed electron density originated from polyethylene glycol (PEG) used in the crystallization buffer (Supplementary Fig. 36), the disordered regions in the apo structure, especially the DDXXD motif and the active site loop D173–D182, appeared clearly structured following soaking with the substrate surrogate. This observation suggests that a major conformational change occurs in TvTS following substrate binding that facilitates active site closure (Extended Data Fig. 1a, c). The docking model of 2,3-dihydro-HexPP based on the observed electron density suggested two possible conformers (Fig. 3a and Supplementary Fig. 36e), one in which HexPP is stretched out across the active site (yellow) and the other in which HexPP is prefolded (purple) for C1-III-IV cyclization with C1–C11 and C10–C14 distances of 5.4 and 3.2 Å, respectively.
The TvTS active site contains the conserved DDXXD and NSE motifs and other residues that interact with the substrate’s diphosphate (Fig. 3a, b). Aromatic residues in TvTS-TC (F65, F89, F187, W188 and W310) are observed in positions analogous to those in PaFS (F65, F89, F196, W197 and W319), while the PaFS residues at the bottom of the active site (M221, W225, M309, S312 and N316) are substituted with smaller residues in TvTS (A212, Y216, G300, G303 and S307; Fig. 3a, b). These differences create a larger pocket in TvTS-TC that is composed of a tunnel to accommodate two additional (non-reacting) isoprene units of HexPP, sticking out from a ball-shaped cavity in which the four reacting isoprene units are located. This ball-shaped part is similar in shape and size to the PaFS active site that houses GGPP.
Structure-based site-directed mutagenesis experiments (Extended Data Fig. 2) included several small-to-large substitutions in TvTS near C15 to C20 of 2,3-dihydro-HexPP (G184F, A219L, G220L and S307F). The enzyme variants showed strongly reduced production of 1, suggesting that the available space in this region is important for correct substrate folding. In addition, the exchange of conserved aromatic residues for alanine or leucine (F65A/L, F89A/L, F187A/L, W188A/L and W310A) abolished or decreased enzyme activity, indicating that these TvTS residues have an important role in shaping the active site cavity and/or stabilizing intermediates through cation−π interactions (Fig. 2e)6. The small residues that widen the binding pocket in comparison with PaFS were substituted with the corresponding residues in PaFS-TC. While the G303S and S307N substitutions had little effect, the A212M and Y216W variants retained only 15% and 28% of wild-type activity for the generation of 1 and production with the G300M variant was completely disrupted, in support of the hypothesis that the larger steric bulk introduced in these enzyme variants does not allow uptake of HexPP.
Further insights into triterpene biosynthesis by PT–TC chimeric enzymes were obtained by cryo-electron microscopy (cryo-EM) of MpMS. The data obtained for full-length MpMS showed that the enzyme exclusively forms a hexamer, whereas PaFS forms octamers and hexamers with a ratio of 9:1 (Fig. 3d, Supplementary Figs. 37 and 38 and Extended Data Table 2)37. A three-dimensional (3D) reconstruction of unliganded MpMS could be established for only the PT domain, as in the cryo-EM analysis of PaFS37. The overall structure of the MpMS-PT monomer is highly similar to the PaFS-PT cryo-EM and crystal structures, and the hexamer of MpMS-PT also superimposes well with the PaFS hexamer (root mean square deviation of 1.4–1.6 Å for 274 Cα atoms). Moreover, the active site of MpMS-PT is of suitable size for its product HexPP (Extended Data Fig. 3).
Interactions between the PT and TC domains in MpMS were further investigated by cross-linking the domains with glutaraldehyde38. The obtained structural data for cross-linked MpMS were substantially different from those for non-cross-linked MpMS (Supplementary Figs. 39 and 40 and Extended Data Table 2). Additional electron densities were observed in cross-linked MpMS at each vertex of the PT hexamer with a different occupancy, suggesting that the orientation of the TC domain is flexible although its position was fixed by the cross-linking. However, because the local resolution was low and fragmented, a model for only one helix of the TC domain in addition to the PT domain could be built (Fig. 3e). Fitting of a TC domain homology model into the density map (Fig. 3f) suggested that the α5 helix region (residues 146–163) participates through cross-linking of K150 to K442 of the PT domain (the distance between these residues is only 3.0 Å). Moreover, TC residues T147, K150, N151 and K155 on helix α5 form hydrogen bonds with PT domain residues on helices α2 (419–429) and α14 (686–695; Extended Data Fig. 4).
Bifunctional TSs with an αα domain architecture may have a catalytic advantage via proximity channelling if the active sites of the interacting domains are properly faced towards each other39,40,41. In the PaFS octamer, this is ideally realized, as the TC domains of PaFS are oriented towards the central pores of the PT domains, which facilitates product channelling from the PT domain to the TC domain37. Additionally, in the hexameric structure of MpMS, each TC domain is located close to a PT domain, with their active sites facing each other (Extended Data Fig. 5), which allows for direct transfer of HexPP from the PT domain to an adjacent TC domain.
The MpMS-TC model42 does not show the same tunnel for uptake of two non-reacting isoprene units as was observed for TvTS, as the TvTS positions of A212, G300 and G303 are occupied by bulky residues in MpMS (L238, L318 and M321). By contrast, the ball-shaped part of the active site of MpMS is wider because residues F187 and W188 of TvTS are substituted by smaller ones in MpMS (V206 and A207; Fig. 3a, c). These observations may explain why TvTS-TC with its initial C1-III-IV cyclization behaves like a diterpene synthase (DTS), acting on only the first four isoprene units of HexPP, while the two isoprene units at the end of the HexPP chain stick out into a tunnel and do not participate in terpene cyclization. By contrast, MpMS lacks this tunnel but has an overall larger reaction chamber for full uptake of HexPP, thus allowing its unusual C1-VI cyclization. In agreement with this hypothesis, MpMS V206F and A207W variants showed nearly or completely abolished production of 2 (Extended Data Fig. 6).
The high similarity between the AlphaFold2-predicted model and the crystal structure of TvTS-TC offers a basis for the discovery of additional chimeric fungal TrTSs through structural prediction (Fig. 4a and Supplementary Fig. 41), which is not possible from amino acid sequence analyses. Such structure predictions with docking of HexPP were performed for ten chimeric TSs with low sequence similarity to previously characterized enzymes. Six of these enzymes (PTTC027, PTTC044, PTTC060, PTTC074, PTTC114 and Cgl13855) contained a pocket that may be sufficiently large for HexPP binding (Supplementary Figs. 42 and 43 and Supplementary Table 7). Functional characterization through expression in yeast indeed resulted in colleterpenol (3) for Cgl13855 (NMDCN0000R73) from the endophyte C. gloeosporioides ES026 (ref. 43) (Fig. 4c, Extended Data Fig. 7 and Supplementary Fig. 44), characterizing the TrTS as CgCS. HexPP docking identified an active site cavity for Cgl13855-TC similar to that of TvTS, explaining the C1-III-IV cyclization with two isoprene units sticking out into a tunnel (Fig. 4b).
The planar structure of 3 was identified by NMR (Supplementary Table 8 and Supplementary Figs. 45–59), showing structural similarity to the known 15-hydroxy-α-cericerene44. Solving the relative configuration of 3 through nuclear Overhauser enhanced spectroscopy (NOESY) was difficult because of its conformational flexibility. A comparison of calculated to measured NMR data favoured the structure (14R*,15S*)-3, which was further supported by a comparison of the 13C-NMR data to those reported for 15-hydroxy-α-cericerene44 with the same relative configuration (Supplementary Table 8). The absolute configuration of (14R,15S)-3 was then determined through comparison of measured and calculated electronic circular dichroism (ECD) curves (Supplementary Fig. 47). Notably, 3 formation requires syn addition of C1 and water to the C14=C15 alkene in HexPP (Fig. 4d). Additionally, PTTC074 produced a similar triterpene compound as CgCS (Extended Data Fig. 7 and Supplementary Fig. 44), which implies the existence of more TrTSs to be explored.
In summary, a novel family of chimeric class I TrTSs converting HexPP into triterpenes was identified for which two subclasses can be distinguished. Some TrTS-TCs provide one large cavity for HexPP uptake to form macrocycles. Other enzymes have a ball-shaped cavity of similar size to that in DTSs with an adjacent tunnel that accommodates two ‘spectator’ isoprene units, with the consequence that only the first four units participate in cyclization. This reflects the situation for the C1-III-IV and C1-IV-V cyclizing subclasses of sesterterpene synthases, but structural insights are lacking to understand these different modes. The findings described here show expanded product boundaries for class I TSs and enrich understanding of terpene biosynthesis in nature.
Strains and culture conditions
Cloning was performed in Escherichia coli DH10B using standard recombinant DNA techniques. E. coli BL21(DE3) (Invitrogen) was used for protein expression. Both E. coli strains were grown in LB (10.0 g l–1 tryptone, 5.0 g l–1 yeast extract, 5.0 g l–1 NaCl, pH 7.0) at 37 °C. S. cerevisiae YZL141 (ref. 45) was used for the heterologous expression of ZTR_06220 (KUL85185) from T. verruculosus TS63-9 (ref. 21), MPH_02178 (EKG20455) from M. phaseolina MS6 (ref. 22) and Cgl13855 (NMDCN0000R73) from C. gloeosporioides ES026 (ref. 43) and grown in YPD medium (20 g l–1 tryptone, 10 g l–1 yeast extract, 20 g l–1 glucose, pH 7.2) at 30 °C. T. verruculosus TS63-9 was selected for functional verification of TvTS in vivo and grown in PDB medium (20 g l–1 potato extract, 20 g l–1 glucose, pH 6.5).
Phylogenetic analysis of TvTS and MpMS
To characterize the evolutionary relationships of TvTS, MpMS and CgCS, 56 characterized fungal chimeric class I TSs were selected and multiple-sequence alignment was performed using ClustalW (version 2.0.12). The Poisson correction model based on the maximum-likelihood method was used to infer the evolutionary history of these enzymes, and MEGA7 was used to conduct the evolutionary analysis46. Bootstrap values were obtained using 1,000 replications. The initial tree for the heuristic search was acquired automatically by applying the Neighbor-Join and BioNJ algorithms to a matrix of pairwise distances estimated using a JTT model and then selecting the topology with a superior log-likelihood value. After eliminating all positions that contained gaps and missing data, 186 positions were left in the final dataset. The plant-derived sesterterpene synthase AtTPS25 (ref. 47) was selected as an outgroup.
Functional characterization of TrTS candidates in vivo
Synthetic genes encoding TvTS (ZTR_06220), MpMS (MPH_02178), CgCS and the nine remaining candidates (Supplementary Table 7), codon optimized for expression in yeast, were obtained from GenScript. After digestion with HindIII and ScaI, the genes were cloned into HindIII- and ScaI-digested pYJ117 plasmid20 to produce plasmids pXM139 (from ZTR_06220), pXM018 (from MPH_02178), pRC181 (from Cgl13855) and five additional plasmids. The plasmids were individually transformed into S. cerevisiae YZL141 to yield corresponding strains, which were grown in 5 ml YPD medium (2% glucose) at 30 °C overnight. From this starter culture, 5 ml was added to a 500-ml flask containing 200 ml YPD (2% glucose and 1% galactose), followed by growth at 30 °C with shaking at 220 r.p.m. for 3 d. Strains were collected, and the mycelium was collected and extracted with hexane/ethyl acetate (4:1). The organic layers were combined for gas chromatography mass spectrometry (GC–MS) analysis.
Fermentation of engineered S. cerevisiae strains and isolation of 1 and 3
For isolation of 1 and 3, S. cerevisiae XM139 and RC181 strains were scaled up in 2-litre shaker flasks containing 1 litre YPD (2% glucose and 1% galactose). Subsequently, the mycelium was collected and extracted with acetone three times. The acetone-extracted layer was distilled under reduced pressure to remove acetone and then partitioned between ethyl acetate and water to afford the ethyl acetate fraction (1.7 g for 1 and 1.56 g for 3). For 1, the residue was subjected to a silica gel column (80–100 mesh) and elution with petroleum ether/ethyl acetate (100:0, 99.8:0.2, 99:1, 98:2, 95:5, 0:100) to give fractions A–F. Following this, 1 was identified in fraction B by GC–MS detection and further purified by semi-preparative HPLC (Ultimate 3000 HPLC equipped with an XBridge Prep C18 column (Waters, 113; 10 × 250 mm, 5 μm)) to afford compound 1 (7.2 mg). For 3, the crude extract was dissolved in methanol/DMSO (10:1, vol/vol) and subjected to semi-preparative HPLC (column: Agilent ZORBAX BS-C18, 5 μm, 9.6 × 250 mm internal diameter; solvent: acetonitrile/H2O, 99:1; flow: 3 ml min–1; detector: 210, 230 nm) to yield compound 3 (30.5 mg, tR = 24.2 min).
Plasmid construction for in vitro and labelling experiments
For expression of recombinant TvTS and MpMS in E. coli, codon-optimized synthetic genes were obtained from GenScript. The synthetic gene for MpMS was digested with NdeI and EcoRI and cloned into pET28a to produce pRC088. The sequences encoding the PT and TC domains of TvTS were amplified separately by PCR with Phusion DNA polymerase using primer pairs P1/P2 and P3/P4 (Supplementary Table 2). The amplified nucleotide sequence for the PT domain was digested with NdeI and EcoRI and cloned into pET28a to produce plasmid pRC009, and the amplified nucleotide sequence for the TC domain was digested with HindIII and XhoI and cloned into pET21a to generate plasmid pRC041. The sequence encoding the TC domain-inactivated MpMS D114A/N115A variant was amplified from pRC088 using primer pairs P5/P6 and P7/P8 (Supplementary Table 2) and then assembled by overlap extension PCR and cloned into NdeI- and EcoRI-digested pET28a to produce pRC088-D114A/N115A. Correct gene insertion was verified by sequencing, and plasmids were used to transform E. coli BL21(DE3) competent cells using a calcium-based protocol.
Protein expression and purification for in vitro assays
For gene expression, a fresh LB culture of E. coli BL21(DE3) transformants (containing 100 mg l–1 ampicillin for pRC041 (TvTS-TC) and 50 mg l–1 kanamycin for pRC009 (TvTS-PT), pRC088 (MpMS) and pRC088-D114A/N115A) was inoculated from a glycerol stock and grown overnight. The precultures were used to inoculate the desired volume of LB (1 ml per litre) amended with the appropriate antibiotic. Cultures were grown at 37 °C with shaking until an OD600 of 0.6–0.8 was reached. The cultures were cooled to 16 °C, and isopropyl β-d-1-thiogalactopyranoside (0.1 mM) was added to induce expression. Proteins were expressed overnight (~20 h) at 16 °C with shaking. Cells were collected by centrifugation (8,000g, 5 min). The supernatant was discarded, and the cell pellet was resuspended in buffer A (50 mM Tris-HCl, 300 mM NaCl, 4 mM β-mercaptoethanol, pH 7.6; 10 ml per litre of culture). Cells were lysed by sonication on ice (5 × 30 s). The cellular debris was removed by centrifugation (14,710g, 2 × 7 min), and the supernatant was subjected to Ni-NTA affinity chromatography (Protino Ni-NTA, Macherey-Nagel) through a syringe filter. The resin was washed with buffer A (20 ml per litre of culture), followed by elution of the His6-tagged proteins using elution buffer (buffer A + 300 mM imidazole; 10 ml per litre of culture). The proteins were concentrated using centrifugal filters (10-kDa cut-off; 5,000g, 4 °C; Amicon Ultra-15 (Millipore) or Vivaspin 20 (Sartorius)) and diluted with incubation buffer (50 mM Na2HPO4, 10% glycerol, 2 mM MgCl2). Enzyme concentrations were determined by Bradford assay and adjusted to 20 μΜ. Incubation experiments were carried out at 30 °C overnight using combinations of enzymes and substrates as listed in Supplementary Table 5. For experiments with unlabelled substrates, hexane (650 μl) was used for extraction, whereas for experiments with labelled substrates extraction was performed using C6D6 (650 μl). Samples were directly analysed by GC–MS and/or NMR spectroscopy.
In vitro enzyme assays for TvTS-PT and MpMS D114A/N115A and detection of HexPP
Reactions were carried out using substrate (IPP and DMAPP, 100 μM each), 2 mM Mg2+, 10% glycerol and 10 μM enzyme (TvTS-PT or MpMS-PT D114A/N115A) in Tris-HCl buffer (200 μl; 50 mM, pH 7.6) at 30 °C overnight. The resulting HexPP was extracted with acetonitrile and analysed by liquid chromatography mass spectrometry (LC–MS). For high-resolution MS analysis of HexPP, an LTQ Orbitrap Elite instrument coupled to a Thermo Scientific Ultimate 3000 RSLC HPLC system and an ACE UltraCore 2.5 SuperC18 (2.1 × 100 mm) column was used for compound separation at 35 °C. The mobile phase (pH 9.5) containing 5 mM ammonium bicarbonate in water as solvent A and acetonitrile as solvent B was set to a flow rate of 0.2 ml min–1. The gradient programme was as follows: 98–10% solvent A (0–10 min), 10–0% solvent A (10–15 min), 0% solvent A (15–17 min), 0–98% solvent A (17–18 min) and 98% solvent A (18–22 min). EI was used in negative mode for detection. The ion source parameters were as follows: sheath gas, 40 arb; auxiliary gas, 5 arb; spray voltage, 3.1 kV; capillary temperature, 270 °C; S-lens RF level, 65 kV; auxiliary gas heater temperature, 250 °C. Full-scan MS mode with a resolution of 60,000 was used for qualitative analysis.
Site-directed mutagenesis and in vitro analysis of enzyme variants
Site-directed mutagenesis for the construction of MpMS and TvTS enzyme variants was performed with the PCR-based QuikChange Site-Directed Mutagenesis kit (Stratagene) according to the manufacturer’s protocol, using Phusion DNA polymerase and the mutational primers listed in Supplementary Table 2. Plasmids pRC088 (containing the full-length gene for MpMS) and pMBP139 (containing the full-length gene for TvTS) were used as template. All mutants were verified by gene sequencing. Reactions for TvTS, MpMS and the variants were carried out using substrate (DMAPP and IPP, 100 μM and 500 μM, respectively), 2 mM Mg2+, 5% glycerol and 10 μM enzyme in Tris-HCl buffer (200 μl; 50 mM, pH 7.5) at 30 °C overnight. The resulting product was extracted with ethyl acetate and analysed by GC–MS. All experiments were performed in three biological replicates.
Isolation of macrophomene (2) from incubation with MpMS E104Y
MpMS E104Y was expressed in E. coli and purified by Ni-NTA affinity chromatography. The pooled enzyme fractions were concentrated, further purified by size-exclusion chromatography and concentrated again. Incubation was performed using GPP (30 mg, 0.08 mM) and IPP (100 mg, 0.34 mM) dissolved in NH4HCO3 buffer (25 mM, 20 ml), which was added to the enzyme solution in incubation buffer (100 ml) over a period of 1 h using a syringe pump. The mixture was incubated overnight and extracted with n-hexane (150 ml) three times. The organic layers were dried with MgSO4 and concentrated under reduced pressure. The residue was purified by column chromatography on SiO2 (pentane) to yield macrophomene as a colourless oil.
Construction of expression plasmids for protein crystallization
To increase the solubility of protein in E. coli, the coding sequence for TvTS was amplified using primer pair P21/P22 from the codon-optimized synthetic gene and ligated into NdeI- and HindIII-digested pET28-MBP-TEV, using the In-Fusion HD Cloning kit (TaKaRa), yielding plasmid pMBP139. For expression of protein for crystallization, the coding sequence for TvTS-TC was amplified from pRC041 using primer pair P23/P24. The amplified sequence was cloned into pET-SUMO, which was itself amplified with primer pair P25/P26, using the In-Fusion HD Cloning kit, resulting in plasmid pSUMO041.
Protein expression and purification for crystallization
Protein expression was performed with E. coli BL21(DE3) harbouring pSUMO041 using the same protocol as described above. For crystallization, after Ni-NTA purification, SUMO–TvTS-TC was dialysed against 2 × 1 litre of Tris-HCl buffer (pH 8.0) containing 5% (vol/vol) glycerol and 300 mM NaCl. After dialysis, SUMO–TvTS-TC was treated with SUMO protease Ulp1403-621 (ref. 48; prepared as previously described, 0.87 μM) in the presence of dithiothreitol (DTT, 1 mM) at 4 °C overnight. The protein solution was loaded onto a column filled with Ni-NTA resin. The His6–SUMO fragment and protease were then captured by the Ni-NTA resin, leaving TvTS-TC in the flow-through, and the remaining protein on the column was eluted with Tris-HCl buffer (pH 8.0) containing 5% (vol/vol) glycerol, 10 mM imidazole and 300 mM NaCl. The collected protein solution was incubated with 10 mM EDTA (pH 8.0) at 4 °C for 1 h. To obtain a protein preparation of high purity, further purification was performed using anion-exchange chromatography on a Resource Q column (Cytiva) by linearly increasing the salt concentration from 0 M NaCl to 1 M NaCl in buffer (50 mM Tris-HCl (pH 8.0), 1 mM DTT, 5% glycerol) over 20 column volumes. The desired protein was collected and then purified to homogeneity by size-exclusion chromatography on a HiLoad 16/600 Superdex 200 pg column (Cytiva) and eluted with a solution containing 20 mM Tris-HCl (pH 8.0), 100 mM NaCl, 1 mM DTT and 5% glycerol. The resulting eluate was concentrated to 10 mg ml–1, using an Amicon Ultra-4 filter (molecular weight cut-off of 30 kDa) at 4 °C. The purity of the proteins was monitored by SDS–PAGE, and protein concentrations were determined with a SimpliNano microvolume spectrophotometer.
Crystallization and structure determination
Crystals of TvTS-TC were obtained after 1 d at 10 °C by using the sitting-drop vapor-diffusion method. Before crystallization, 125 μM of protein was incubated with 2 mM MgCl2 on ice for 30 min, and 0.5 μl of protein solution was then mixed with 0.5 μl of reservoir solution containing 0.1 M Tris-HCl (pH 8.5), 0.1 M MgCl2, 30% PEG 4000 and 0.2 M NDSB-211. Crystals of TvTS-TC in complex with 2,3-dihydro-HexPP were obtained by incubation of TvTS-TC crystals with 10 mM 2,3-dihydro-HexPP in the crystallization drop at 10 °C for 14 h. The crystals were transferred to cryoprotectant solution (reservoir solution with 25% (vol/vol) glycerol) and then flash cooled at –173 °C in a nitrogen gas stream. The X-ray diffraction datasets were collected at X06SA (Paul Scherrer Institut, Villigen, Switzerland) for the apo TvTS-TC structure and at BL-1A (Photon Factory, Tsukuba, Japan) for the structure of TvTS-TC in complex with 2,3-dihydro-HexPP, using a beam wavelength of 1.0 and 1.1 Å, respectively. The diffraction datasets for TvTS-TC were processed and scaled using the XDS package49 and Aimless50. The initial phase of the TvTS-TC structure was determined by molecular replacement, using PaFS (PDB, 5ER8) as the search model. Molecular replacement was performed with Phaser in PHENIX (version 1.19.2-4158-000)51,52. The initial phase was further calculated with AutoBuild in PHENIX52. The TvTS-TC structures were modified manually with Coot53 and refined with PHENIX.refine54. The cif parameters of the ligands for the energy minimization calculations were obtained by using the PRODRG server55. After soaking with 2,3-dihydro-HexPP, strong additional electron densities were observed close to the DDXXD motif and in the active site cavity (Supplementary Fig. 36a, b). Modelling of 2,3-dihydro-HexPP to the observed density was partly satisfactory, but some of the methyl groups along the isoprenoid chain stuck out (Supplementary Fig. 36c). For an alternative explanation, PEG used in the crystallization buffer was modelled to the density, but in this case large unassigned densities close to the DDXXD motif remained in the refined structure (Supplementary Fig. 36d). Thus, it is possible that the observed density originated from both 2,3-dihydro-HexPP and PEG with low occupancies. The final crystal data and intensity statistics are summarized in Extended Data Table 1. The Ramachandran statistics were as follows: 97.6% favoured and 2.4% allowed for apo TvTS-TC, 98.9% favoured and 1.1% allowed for TvTS-TC soaked with 2,3-dihydro-HexPP. Although the ligand was not assigned, the conformations of PEG and 2,3-dihydro-HexPP in the active site should be similarly defined by active site residues. Therefore, a docking model of 2,3-dihydro-HexPP based on the observed density was developed. All crystallographic figures were prepared with PyMOL (DeLano Scientific; http://www.pymol.org).
Purification of MpMS and cross-linking of the PT and TC domains
Protein expression was performed with E. coli BL21(DE3) harbouring pRC088 using the same protocol as described above. After Ni-NTA purification, MpMS protein solution was further purified using anion-exchange chromatography on a Resource Q column (Cytiva) by linearly increasing the salt concentration from 20 mM NaCl to 1 M NaCl in buffer (50 mM HEPES (pH 7.5) and 1 mM DTT) over 20 column volumes. The desired protein was collected and then purified by size-exclusion chromatography on a Superose 6 10/300 column (Cytiva) using buffers containing 20 mM HEPES (pH 7.5), 150 mM NaCl and 1 mM DTT. The resulting eluate was concentrated to 50 μM, using an Amicon Ultra-4 filter (molecular weight cut-off of 100 kDa) at 4 °C. The purity of the proteins was monitored by SDS–PAGE. To obtain the structure of full-length MpMS, cross-linking of the PT and TC domains via glutaraldehyde (25% in water; Nacalai Tesque) was performed. First, MpMS protein was purified by Ni-NTA and anion-exchange chromatography, as described above, to obtain pure protein. Subsequently, cross-linking was performed on ice by incubating 1 mg ml–1 MpMS protein with 0.06% glutaraldehyde for 10 min at a 100-μl scale. Reactions were quenched by adding 10 μl of 1.0 M Tris-HCl (pH 8.0). Reaction mixtures were pooled and further purified by size-exclusion chromatography on a Superose 6 10/300 column (Cytiva) to exclude aggregations. The resulting eluate was concentrated to 50 μM, using an Amicon Ultra-4 filter (molecular weight cut-off of 100 kDa) at 4 °C.
Cryo-EM sample preparation and data acquisition
For cryo-grid preparation, 3 μl of sample was applied to a holey carbon grid (Quantifoil, Cu, R1.2/1.3, 300 mesh). The grid was rendered hydrophilic by a 30-s glow discharge in air (11-mA current) with PIB-10 (Vacuum Device). The grid was blotted for 20 s (blot force of 0) at 18 °C and 100% humidity and then flash frozen in liquid ethane using a Vitrobot Mark IV (Thermo Fisher Scientific). For the MpMS-PT and MpMS cross-linking datasets, 1,888 and 1,529 movies were acquired, respectively, on a Talos Arctica (FEI) microscope operating at 200 kV in nanoprobe mode using EPU software for automated data collection. The movies were collected on a 4,000 × 4,000 grid using a Falcon 3EC direct electron detector (electron counting mode) at a nominal magnification of 120,000 (0.88 Å per pixel). Fifty movie fractions were recorded at an exposure of 1.00 electrons per Å2 per fraction, corresponding to a total exposure of 50 electrons per Å2. The defocus steps used were –1.0, –1.5, –2.0 and –2.5 μm. The movie fractions were aligned, dose weighted and averaged using RELION’s own implementation on 5 × 5 tiled fractions with a B-factor of 300. The non-weighted movie sums were used for contrast transfer function (CTF) estimation with Gctf 56. The dose-weighted sums were used for all subsequent steps of image processing. The subsequent processes of particle picking, two-dimensional classification, ab initio reconstruction, 3D classification, 3D refinement, CTF refinement and Bayesian polishing were performed using RELION-3.18 (ref. 57). For details of the cryo-EM data processing, see Supplementary Figs. 37–40.
AlphaFold2 prediction and docking analysis
UCSF Chimera58 (version 1.12) and AutoDock Vina59 (version 1.1.2) were used to perform receptor and ligand preparation and molecular docking analysis. Before the docking procedure, the receptor structures predicted by AlphaFold2 and ligands were processed as follows. Metal ions were added to the TC domain binding site of receptors, referring to the homologous protein in PDB for the coordinates of the three Mg2+ ions. With the exception of Cgl13855 and FgMS, for which 5IMP and 5ER8 were used as the reference for metal ions, respectively, 6VYD was used as the reference for the other PTTCs. The receptor structures containing metal ions were then processed using the Chimera tool Dock Prep, in which hydrogen atoms and charges were added and other parameters were set as default. The energy-minimization molecule models included in Chimera were used to minimize the energy for the structures from the previous step. The ligand structures were drawn, charge was added and the structures were transformed to 3D conformations. The binding site of the TC domain was determined by referring to the crystal structures for homologous proteins. The grid box in the docking procedure was defined to include the metal ions, and corresponding residues appeared in the binding site of crystal structures for homologous proteins. Receptor and ligand options in AutoDock Vina were set as default. The number of binding modes, exhaustiveness of search and maximum energy difference (kcal mol–1) parameters were set as 9, 8 and 3, respectively.
Further information on research design is available in the Nature Research Reporting Summary linked to this paper.
The authors declare that the main data supporting the findings of this study are available within the article and its Supplementary Information. Original data can be obtained from the corresponding authors on reasonable request. The coordinates and structure factor amplitudes for the apo structure of TvTS-TC and the structure of TvTS-TC after soaking with 2,3-dihydro-HexPP have been deposited to PDB under accession codes 7VTA and 7VTB, respectively. The cryo-EM maps and atomic coordinates for MpMS-PT and cross-linked MpMS have been deposited in the Electron Microscopy Data Bank (EMDB; https://www.ebi.ac.uk/pdbe/emdb/) and PDB with accession codes EMD-32531 and EMD-32532 and 7WIJ, respectively. The accession numbers (NMDCN0000RG9, NMDCN0000RGA, NMDCN0000RGB, NMDCN0000R73, KFX89132, KAH9237577, KIK55704, KAF2708718, CRG86078, QIH97829) corresponding to the PTTC candidates for AlphaFold2 prediction have been deposited in the National Microbiology Data Center (https://nmdc.cn/en) and are listed in the Supplementary Information.
Abe, I. Enzymatic synthesis of cyclic triterpenes. Nat. Prod. Rep. 24, 1311–1331 (2007).
Degenhardt, J., Köllner, T. G. & Gershenzon, J. Monoterpene and sesquiterpene synthases and the origin of terpene skeletal diversity in plants. Phytochemistry 70, 1621–1637 (2009).
Minami, A., Ozaki, T., Liu, C. & Oikawa, H. Cyclopentane-forming di/sesterterpene synthases: widely distributed enzymes in bacteria, fungi, and plants. Nat. Prod. Rep. 35, 1330–1346 (2018).
Karunanithi, P. S. & Zerbe, P. Terpene synthases as metabolic gatekeepers in the evolution of plant terpenoid chemical diversity. Front. Plant Sci. 10, 1166 (2019).
Dzubak, P. et al. Pharmacological activities of natural triterpenoids and their therapeutic implications. Nat. Prod. Rep. 23, 394–411 (2006).
Quin, M. B., Flynn, C. M. & Schmidt-Dannert, C. Traversing the fungal terpenome. Nat. Prod. Rep. 31, 1449–1473 (2014).
Thimmappa, R., Geisler, K., Louveau, T., O’Maille, P. & Osbourn, A. Triterpene biosynthesis in plants. Annu. Rev. Plant Biol. 65, 225–257 (2014).
Beytia, E., Qureshi, A. A. & Porter, J. W. Squalene synthetase: III. Mechanism of the reaction. J. Biol. Chem. 248, 1856–1867 (1973).
Hoshino, T. & Sato, T. Squalene-hopene cyclase: catalytic mechanism and substrate recognition. Chem. Commun. 4, 291–301 (2002).
Phillips, D. R., Rasbery, J. M., Bartel, B. & Matsuda, S. P. Biosynthetic diversity in plant triterpene cyclization. Curr. Opin. Plant Biol. 9, 305–314 (2006).
Christianson, D. W. Structural and chemical biology of terpenoid cyclases. Chem. Rev. 117, 11570–11648 (2017).
Baer, P. et al. Induced‐fit mechanism in class I terpene cyclases. Angew. Chem. Int. Ed. 53, 7652–7656 (2014).
Dickschat, J. S. Bacterial terpene cyclases. Nat. Prod. Rep. 33, 87–110 (2016).
Kawamukai, M. Biosynthesis of coenzyme Q in eukaryotes. Biosci. Biotechnol., Biochem. 80, 23–33 (2016).
Sasaki, D. et al. Crystal structure of heterodimeric hexaprenyl diphosphate synthase from Micrococcus luteus BP 26 reveals that the small subunit is directly involved in the product chain length regulation. J. Biol. Chem. 286, 3729–3740 (2011).
Zhang, Y. W., Li, X. Y. & Koyama, T. Chain length determination of prenyltransferases: both heteromeric subunits of medium-chain (E)-prenyl diphosphate synthase are involved in the product chain length determination. Biochemistry 39, 12717–12722 (2000).
Smith, T. E. Biogenetic relationships of bioactive sponge merotriterpenoids. Mar. Drugs 15, 285 (2017).
Mitsuhashi, T., Okada, M. & Abe, I. Identification of chimeric αβγ diterpene synthases possessing both type II terpene cyclase and prenyltransferase activities. ChemBioChem 18, 2104–2109 (2017).
Faylo, J. L., Ronnebaum, T. A. & Christianson, D. W. Assembly-line catalysis in bifunctional terpene synthases. Acc. Chem. Res. 54, 3780–3791 (2021).
Chen, R. et al. Systematic mining of fungal chimeric terpene synthases using an efficient precursor-providing yeast chassis. Proc. Natl Acad. Sci. USA 118, e2023247118 (2021).
Hu, L. et al. Draft genome sequence of Talaromyces verruculosus (“Penicillium verruculosum”) strain TS63-9, a fungus with great potential for industrial production of polysaccharide-degrading enzymes. J. Biotechnol. 219, 5–6 (2016).
Islam, M. S. et al. Tools to kill: genome of one of the most destructive plant pathogenic fungi Macrophomina phaseolina. BMC Genomics 13, 493 (2012).
Toyomasu, T. et al. Fusicoccins are biosynthesized by an unusual chimera diterpene synthase in fungi. Proc. Natl Acad. Sci. USA 104, 3084–3088 (2007).
Chiba, R., Minami, A., Gomi, K. & Oikawa, H. Identification of ophiobolin F synthase by a genome mining approach: a sesterterpene synthase from Aspergillus clavatus. Org. Lett. 15, 594–597 (2013).
Bian, G. et al. A clade II‐D fungal chimeric diterpene synthase from Colletotrichum gloeosporioides produces dolasta‐1 (15), 8‐diene. Angew. Chem. Int. Ed. 57, 15887–15890 (2018).
Rabe, P. et al. Mechanistic investigations of two bacterial diterpene cyclases: spiroviolene synthase and tsukubadiene synthase. Angew. Chem. Int. Ed. 56, 2776–2779 (2017).
Lauterbach, L., Rinkel, J. & Dickschat, J. S. Two bacterial diterpene synthases from Allokutzneria albata for bonnadiene and for phomopsene and allokutznerene. Angew. Chem. Int. Ed. 57, 8280–8283 (2018).
Rinkel, J. & Dickschat, J. S. Addressing the chemistry of germacrene A by isotope labeling experiments. Org. Lett. 21, 2426–2429 (2019).
Rinkel, J., Lauterbach, L., Rabe, P. & Dickschat, J. S. Two diterpene synthases for spiroalbatene and cembrene A from Allokutzneria albata. Angew. Chem. Int. Ed. 57, 3238–3241 (2018).
Crooks, G. E., Hon, G., Chandonia, J. M. & Brenner, S. E. WebLogo: a sequence logo generator. Genome Res. 14, 1188–1190 (2004).
Robinson, D. R. & West, C. A. Biosynthesis of cyclic diterpenes in extracts from seedlings of Ricinus communis. I. Identification of diterpene hydrocarbons formed from mevalonate. Biochemistry 9, 70–79 (1970).
Chen, J. L., Moore, R. E. & Patterson, G. M. Structures of nostocyclophanes AD. J. Org. Chem. 56, 4360–4364 (1991).
Rabe, P. et al. Conformational analysis, thermal rearrangement, and EI–MS fragmentation mechanism of (1(10)E, 4E, 6S, 7R)‐germacradien‐6‐ol by 13C‐labeling experiments. Angew. Chem. Int. Ed. 54, 13448–13451 (2015).
Rinkel, J., Lauterbach, L. & Dickschat, J. S. A branched diterpene cascade: the mechanism of spinodiene synthase from Saccharopolyspora spinosa. Angew. Chem. Int. Ed. 58, 452–455 (2018).
Chen, M., Chou, W. K., Toyomasu, T., Cane, D. E. & Christianson, D. W. Structure and function of fusicoccadiene synthase, a hexameric bifunctional diterpene synthase. ACS Chem. Biol. 11, 889–899 (2016).
Tarshis, L., Yan, M., Poulter, C. D. & Sacchettini, J. C. Crystal structure of recombinant farnesyl diphosphate synthase at 2.6-Å resolution. Biochemistry 33, 10871–10877 (1994).
Faylo, J. L. et al. Structural insight on assembly-line catalysis in terpene biosynthesis. Nat. Commun. 12, 3487 (2021).
Migneault, I., Dartiguenave, C., Bertrand, M. J. & Waldron, K. C. Glutaraldehyde: behavior in aqueous solution, reaction with proteins, and application to enzyme crosslinking. Biotechniques 37, 790–802 (2004).
Bauler, P., Huber, G., Leyh, T. & McCammon, J. A. Channeling by proximity: the catalytic advantages of active site colocalization using Brownian dynamics. J. Phys. Chem. Lett. 1, 1332–1335 (2010).
Chen, M., Harris, G. G., Pemberton, T. A. & Christianson, D. W. Multi-domain terpenoid cyclase architecture and prospects for proximity in bifunctional catalysis. Curr. Opin. Struct. Biol. 41, 27–37 (2016).
Pemberton, T. A. et al. Exploring the influence of domain architecture on the catalytic function of diterpene synthases. Biochemistry 56, 2010–2023 (2017).
Jumper, J. et al. Highly accurate protein structure prediction with AlphaFold. Nature 596, 583–589 (2021).
Zhao, X. M. et al. Ethanol and methanol can improve huperzine A production from endophytic Colletotrichum gloeosporioides ES026. PLoS ONE 8, e61777 (2013).
Chen, Y. G. et al. A cryptic plant terpene cyclase producing unconventional 18‐ and 14‐membered macrocyclic C25 and C20 terpenoids with immunosuppressive activity. Angew. Chem. Int. Ed. 60, 25468–25476 (2021).
Bian, G. et al. Metabolic engineering-based rapid characterization of a sesquiterpene cyclase and the skeletons of fusariumdiene and fusagramineol from Fusarium graminearum. Org. Lett. 20, 1626–1629 (2018).
Kumar, S., Stecher, G. & Tamura, K. MEGA7: Molecular Evolutionary Genetics Analysis version 7.0 for bigger datasets. Mol. Biol. Evol. 33, 1870–1874 (2016).
Huang, A. C. et al. Unearthing a sesterterpene biosynthetic repertoire in the Brassicaceae through genome mining reveals convergent evolution. Proc. Natl Acad. Sci. USA 114, 6005–6014 (2017).
Guerrero, F., Ciragan, A. & Iwaï, H. Tandem SUMO fusion vectors for improving soluble protein expression and purification. Protein Expr. Purif. 116, 42–49 (2015).
Kabsch, W. Xds. Acta Crystallogr. D 66, 125–132 (2010).
Evans, P. R. & Murshudov, G. N. How good are my data and what is the resolution? Acta Crystallogr. D 69, 1204–1214 (2013).
McCoy, A. J. et al. Phaser crystallographic software. J. Appl. Crystallogr. 40, 658–674 (2007).
Adams, P. D. et al. PHENIX: a comprehensive Python-based system for macromolecular structure solution. Acta Crystallogr. D 66, 213–221 (2010).
Emsley, P. & Cowtan, K. Coot: model-building tools for molecular graphics. Acta Crystallogr. D 60, 2126–2132 (2004).
Afonine, P. V. et al. Towards automated crystallographic structure refinement with phenix. refine. Acta Crystallogr. D 68, 352–367 (2012).
Schüttelkopf, A. W. & Van Aalten, D. M. PRODRG: a tool for high-throughput crystallography of protein–ligand complexes. Acta Crystallogr. D 60, 1355–1363 (2004).
Zhang, K. Gctf: real-time CTF determination and correction. J. Struct. Biol. 193, 1–12 (2016).
Zivanov, J. et al. New tools for automated high-resolution cryo-EM structure determination in RELION-3. eLife 7, e42166 (2018).
Pettersen, E. F. et al. UCSF Chimera—a visualization system for exploratory research and analysis. J. Comput. Chem. 25, 1605–1612 (2004).
Trott, O. & Olson, A. J. AutoDock Vina: improving the speed and accuracy of docking with a new scoring function, efficient optimization, and multithreading. J. Comput. Chem. 31, 455–461 (2010).
This work was funded by the National Key R&D Program of China (2018YFA0900400 and 2021YFC2102600), the National Natural Science Foundation of China (31670090 and 31800032) and the German Research Foundation (DFG; DI1536/7-2). The numerical calculations in this paper were carried out on the supercomputing system in the Supercomputing Center of Wuhan University. The synchrotron radiation experiments were performed at BL-1A of the Photon Factory (proposal number 2019G038) and X06SA of the SLS (proposal numbers 20191094 and 20191134). This work was also supported in part by the Platform Project for Supporting Drug Discovery and Life Science Research (Basis of Supporting Innovative Drug Discovery and Life Science Research) from AMED (JP21am0101071 (support number 1553)), Grants-in-Aid for Scientific Research from the Ministry of Education, Culture, Sports, Science and Technology, Japan (JSPS KAKENHI grants JP16H06443, JP19K15703, JP20H00490, JP20KK0173, JP21K18246), the New Energy and Industrial Technology Development Organization (NEDO, grant JPNP20011), the PRESTO program from Japan Science and Technology Agency (JPMJPR20DA) and AMED (grant JP21ak0101164). H.T. is a recipient of the JSPS Postdoctoral Fellowship for Foreign Researchers (P18404).
T.L. has filed a patent application relating to the function of TvTS, MpMS and CgCS and the resulting structures talaropentaene and colleterpenol. The other authors declare no competing interests.
Peer review information
Nature thanks the anonymous reviewers for their contribution to the peer review of this work. Peer reviewer reports are available.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Extended data figures and tables
a, the apo-form of TvTS-TC (green). The DDXXD, NSE, and RY motifs (red box) are conserved, and the active site forming regions, including aspartic-rich metal binding DDXXD motif, the region (V234-V243) after the NSE motif, and the A156-C183 region are disordered; b, PaFS-TC in complex with neridronate (cyan cartoon with magenta sticks, PDB: 5ER835, in comparison to TvTS-TS: rmsd values of 1.6 Å for Cα-atoms, 47% amino acid sequence identity); c, partially closed conformation of TvTS-TC. The disordered regions in the apo structure, especially the DDXXD motif and the active site loop D173-D182 (shown in salmon), appear clearly structured after soaking with 2,3-dihydro-HexPP. The docking model of 2,3-dihydro-HexPP was constructed based on the observed density and two possible conformers are shown by yellow and purple sticks.
a, Active site cavity of TvTS-TC. The docking model of 2,3-dihydro-HexPP was constructed based on the observed density and two possible conformers were shown as yellow and purple sticks. b, In vitro activities of wild-type TvTS and its variants for production of 1. Full-length TvTS and its variants were expressed with a maltose binding protein (MBP) fused at the N-terminus. Peak integrals for the ion chromatogram of m/z 408 were used for quantification. Wild-type production was set to 100%, bars and error bars show mean and s.d. from three biological replicates, respectively.
a, noncrosslinked MpMS-PT; b, crosslinked MpMS-PT; c, crystal structure of PaFS-PT complexed with cobalt ions and pamidronate (PDB ID: 5ERO); d, superimposed crosslinked MpMS-PT (cyan) and PaFS-PT (wheat) crystal structure; e, active site cleft of MpMS-PT; f, active site cavity of PaFS-PT.
a, The residues on α5 of TC domain (green) interact with the residues on α2 (419-429) and C-terminal α14 (686-695) of the PT domain (cyan); b, close-up view of the interface between the PT domain and fitted model of TC domain. Distances indicated by grey dashed lines are given in Å.
a, top view; b, side view.
Ion chromatograms (m/z = 408) of wildtype MpMS (black) and its enzyme variants V206F (blue) and A207W (magenta).
Extended Data Fig. 7 Product detection of triterpenes produced by chimeric class I TrTS in S. cerevisiae.
Engineered S. cerevisiae containing i) PTTC027, ii) PTTC035, iii) PTTC044, iv) PTTC58, v) PTTC59, vi) PTTC60, vii) PTTC62, viii) PTTC74, ix) PTTC114, x) Cgl13855 overexpressed under the control of GAL10 promoter; xi) blank, engineered efficient terpene precursor providing chassis S. cerevisiae YZL141. Red asterisks indicated the presence of compound 3, white asterisks indicate the similar peak produced by PTTC074.
This file contains Supplementary Methods, Supplementary Figs. 1–59, Supplementary Tables 1–9, Supplementary Schemes 1 and 2, the synthetic procedure for compounds S1–S10, the synthetic procedure for compounds S11–S15, the amino acid sequences for AlphaFold2 prediction and supplementary references.
This file contains Supplementary Tables 10–111.
About this article
Cite this article
Tao, H., Lauterbach, L., Bian, G. et al. Discovery of non-squalene triterpenes. Nature 606, 414–419 (2022). https://doi.org/10.1038/s41586-022-04773-3