Discovery of non-squalene triterpenes

Tao, Hui; Lauterbach, Lukas; Bian, Guangkai; Chen, Rong; Hou, Anwei; Mori, Takahiro; Cheng, Shu; Hu, Ben; Lu, Li; Mu, Xin; Li, Min; Adachi, Naruhiko; Kawasaki, Masato; Moriya, Toshio; Senda, Toshiya; Wang, Xinghuan; Deng, Zixin; Abe, Ikuro; Dickschat, Jeroen S.; Liu, Tiangang

doi:10.1038/s41586-022-04773-3

Download PDF

Article
Open access
Published: 01 June 2022

Discovery of non-squalene triterpenes

Nature volume 606, pages 414–419 (2022)Cite this article

31k Accesses
89 Citations
160 Altmetric
Metrics details

Subjects

Abstract

All known triterpenes are generated by triterpene synthases (TrTSs) from squalene or oxidosqualene¹. This approach is fundamentally different from the biosynthesis of short-chain (C₁₀–C₂₅) terpenes that are formed from polyisoprenyl diphosphates^2,3,4. In this study, two fungal chimeric class I TrTSs, Talaromyces verruculosus talaropentaene synthase (TvTS) and Macrophomina phaseolina macrophomene synthase (MpMS), were characterized. Both enzymes use dimethylallyl diphosphate and isopentenyl diphosphate or hexaprenyl diphosphate as substrates, representing the first examples, to our knowledge, of non-squalene-dependent triterpene biosynthesis. The cyclization mechanisms of TvTS and MpMS and the absolute configurations of their products were investigated in isotopic labelling experiments. Structural analyses of the terpene cyclase domain of TvTS and full-length MpMS provide detailed insights into their catalytic mechanisms. An AlphaFold2-based screening platform was developed to mine a third TrTS, Colletotrichum gloeosporioides colleterpenol synthase (CgCS). Our findings identify a new enzymatic mechanism for the biosynthesis of triterpenes and enhance understanding of terpene biosynthesis in nature.

Molecular insights into the catalytic promiscuity of a bacterial diterpene synthase

Article Open access 06 July 2023

Discovery and characterization of a terpene biosynthetic pathway featuring a norbornene-forming Diels-Alderase

Article Open access 11 May 2022

Molecular networking and computational NMR analyses uncover six polyketide-terpene hybrids from termite-associated Xylaria isolates

Article Open access 07 June 2024

Main

Triterpenoids are a large class of natural products with a broad range of bioactivities⁵ known from all kingdoms of life^6,7. Their biosynthesis starts with dimerization of farnesyl diphosphate (FPP) by squalene synthase into squalene⁸. The cyclization is catalysed by TrTSs such as squalene hopene cyclase (SHC) to yield pentacyclic hopene, the precursor of hopanoids⁹. After oxidation to (S)-2,3-epoxysqualene, lanosterol synthase (LS) promotes the biosynthesis of lanosterol, the precursor to steroids and saponins¹⁰ (Fig. 1). These enzymes are classified as class II terpene synthases (TSs) with a conserved DXDD motif for substrate activation through protonation. By contrast, class I TSs have two conserved DDXX(D/E) and NSE/DTE motifs for binding of a (Mg²⁺)₃ cluster that in turn binds the substrate’s diphosphate¹¹. In cooperation with the diphosphate sensor, a conserved arginine forming hydrogen bridges to the diphosphate, and an effector residue at a helix break¹² the Lewis acidity of Mg²⁺ leads to substrate ionization through diphosphate abstraction. Many examples are known of class I TSs¹³ that convert substrates with chain lengths between C₁₀ and C₂₅ into usually (poly)cyclic terpenes. By contrast, class I cyclization of hexaprenyl diphosphate (HexPP; C₃₀) into triterpenes is unknown, although HexPP is widely distributed in nature^14,15,16 and serves as a precursor for ubiquinone (Q₆) in yeast and for highly bioactive meroterpenoids in sponges¹⁷. Bifunctional enzymes harbouring a prenyl transferase (PT) and terpene cyclase (TC) domain (PTTCs) have recently attracted attention^3,18,19. Their PT domains convert dimethylallyl diphosphate (DMAPP) and isopentenyl diphosphate (IPP) into polyisoprenyl diphosphates, and the TC domains carry out cyclization into di- and sesterterpenes. Thus far, no such enzyme is known to produce triterpenes. Here we describe the discovery of chimeric fungal class I TrTSs for the production and conversion of HexPP into triterpenes.

Using our yeast-based genome mining platform²⁰, two putative bifunctional TSs with PT and TC domains emerged from the endophyte T. verruculosus TS63-9 (ref. ²¹; ZTR_06220) and the plant pathogen M. phaseolina MS6 (ref. ²²; MPH_02178) (Supplementary Figs. 1 and 2). These enzymes showed 40.4% and 33.4% amino acid sequence identity with the Phomopsis amygdali fusicoccadiene synthase²³ (PaFS) and the Aspergillus clavatus ophiobolin F synthase²⁴ (AcOS), respectively, and are located within clades II-B and II-C of fungal bifunctional TSs (Supplementary Fig. 3). Thus, both enzymes are predicted to follow a C1-III-IV cyclization mode with attack of the third double bond at C1 followed by attack of the fourth double bond²⁵.

Expression of ZTR_06220 in Saccharomyces cerevisiae YZL141 (ref. ²⁰) yielded the triterpene hydrocarbon talaropentaene (1) (Supplementary Table 3, Fig. 2a, d and Supplementary Figs. 4–12), characterizing the enzyme as TvTS. Transcriptional analysis showed that TvTS was not expressed in its native host, but placing the gene under the control of the amyB promoter resulted in the production of 1 and confirmed its natural function (Fig. 2b, d). Because recombinant full-length TvTS was insoluble, for in vitro assays the TvTS-PT and TvTS-TC domains were expressed individually (Supplementary Fig. 14). Incubation of TvTS-PT with DMAPP and IPP yielded HexPP (Fig. 2c), which was converted into 1 by TvTS-TC (Fig. 2d). The combination of TvTS-PT and TvTS-TC also accepted FPP and IPP, while TvTS-TC alone was sufficient to cyclize HexPP (Supplementary Scheme 1) to 1 (Supplementary Fig. 15).

**Fig. 2: Characterization of TvTS and MpMS.**

The absolute configuration of 1 was determined through stereoselective deuteration (Supplementary Table 5)²⁶. Conversion of geranyl diphosphate (GPP) and (E)- or (Z)-(4-¹³C,4-²H)IPP²⁷ with TvTS-PT and TvTS-TC indicated an absolute configuration of (10R,11S)-1 (Supplementary Fig. 16). Similar experiments using GPP and (R)- or (S)-(1-¹³C,1-²H)IPP²⁸ with TvTS-PT and TvTS-TC confirmed this assignment for 1 (Supplementary Fig. 17). Compound 1 is structurally similar to (–)-δ-araneosene from the fungal clade II-D TS C. gloeosporioides dolasta-1(15),8-diene synthase (CgDS), including corresponding absolute configurations (Fig. 2e)²⁵.

The proposed cyclization mechanism for 1 starts with diphosphate abstraction from HexPP, followed by C1-III-IV cyclization to A. A subsequent 1,2-hydride shift to B and deprotonation generate 1 (Fig. 2e). Experimental proof for the 1,2-hydride shift was obtained through incubation of (3-¹³C,2-²H)FPP²⁹ and IPP with TvTS-PT and TvTS-TC, yielding (15-¹³C,15-²H)-1. This product showed an intensive upfield-shifted triplet (Δδ = –0.45 ppm, ¹J_C,D = 18.9 Hz) for C15 indicating a direct ¹³C–²H bond (Supplementary Fig. 18). Incubation of GPP and (R)- or (S)-(1-¹³C,1-²H)IPP with TvTS-PT and TvTS-TC resulted in specific loss of the 13-pro-R hydrogen of B in the deprotonation to 1 (Supplementary Fig. 19).

Expression of MPH_02178 in S. cerevisiae resulted in macrophomene (2), albeit with low yield, but DMAPP and IPP were efficiently converted into the same triterpene using purified recombinant enzyme (Supplementary Figs. 20–22). Consequently, the TrTS was identified as MpMS. Analysis of an MpMS D114A/N115A variant with an inactivated TC domain demonstrated that HexPP was produced by the PT domain (Fig. 2c). In addition, GPP, FPP, geranylgeranyl diphosphate (GGPP) and geranylfarnesyl diphosphate (GFPP) in combination with IPP and synthetic HexPP were also accepted by MpMS to yield 2. For structure elucidation of 2 (Supplementary Table 6 and Supplementary Figs. 23–30), the MpMS E104Y variant was selected³⁰ (Supplementary Fig. 31). To enable full assignment of the nuclear magnetic resonance (NMR) data, which was prevented by multiple peak overlaps for unlabelled 2, isotopic labelling experiments with ¹³C-labelled precursors were performed (Supplementary Fig. 32a–e).

Notably, 2 is structurally similar to casbene. Both compounds are bicyclic with one macrocyclic ring and one three-membered ring, except the ring fusion is trans for 2 and cis for casbene (Fig. 2f)³¹. To our knowledge, the 22-membered ring in macrophomene represents the largest macrocycle discovered in terpenes so far and is only reflected by 22-membered rings in nostocyclophanes, a class of oxidatively dimerized polyketides from cyanobacteria³². The proposed cyclization mechanism from HexPP to 2 starts with diphosphate abstraction and 1,22-cyclization to C, followed by deprotonation at C1 to close the cyclopropane ring. This mechanism represents an unprecedented C1-VI cyclization mode (Supplementary Fig. 3, designated type C) for chimeric TSs. The stereochemical fate of the geminal methyl groups at C24 and C25 of HexPP was addressed by incubating (12-¹³C)FPP³³ and (9-¹³C)GPP²⁵ with IPP and MpMS. Product analysis by ¹³C-NMR showed a clear stereochemical course for the attack at C23 in cyclopropane ring closure (Supplementary Fig. 33). The stereochemistry of the deprotonation was addressed by incubation with GFPP and (R)-(1-²H)IPP or (S)-(1-²H)IPP³⁴, with deuterium retained for the (R) enantiomer and lost from the (S) enantiomer (Supplementary Fig. 34). Taken together, the face selectivity at C23 and the stereospecificity of the deprotonation at C1 suggest an absolute configuration of (1R,22R)-2, which was confirmed by stereoselective deuteration through incubation of MpMS with (R)-(1-¹³C,1-²H)IPP and (S)-(1-¹³C,1-²H)IPP (Supplementary Fig. 35).

The structural basis of the cyclization mechanism was investigated by determination of the crystal structure of apo TvTS-TC by soaking with the non-reactive substrate surrogate 2,3-dihydro-HexPP (Extended Data Table 1 and Supplementary Scheme 2). Similarly to PaFS-TC³⁵, TvTS-TC adopts the characteristic class I TS fold^12,36 (Extended Data Fig. 1a, b), but TvTS-TC possesses a larger active site to accommodate HexPP. Although 2,3-dihydro-HexPP was not clearly observed and we cannot completely exclude the possibility that the observed electron density originated from polyethylene glycol (PEG) used in the crystallization buffer (Supplementary Fig. 36), the disordered regions in the apo structure, especially the DDXXD motif and the active site loop D173–D182, appeared clearly structured following soaking with the substrate surrogate. This observation suggests that a major conformational change occurs in TvTS following substrate binding that facilitates active site closure (Extended Data Fig. 1a, c). The docking model of 2,3-dihydro-HexPP based on the observed electron density suggested two possible conformers (Fig. 3a and Supplementary Fig. 36e), one in which HexPP is stretched out across the active site (yellow) and the other in which HexPP is prefolded (purple) for C1-III-IV cyclization with C1–C11 and C10–C14 distances of 5.4 and 3.2 Å, respectively.

**Fig. 3: Structures of TvTS and MpMS.**

The TvTS active site contains the conserved DDXXD and NSE motifs and other residues that interact with the substrate’s diphosphate (Fig. 3a, b). Aromatic residues in TvTS-TC (F65, F89, F187, W188 and W310) are observed in positions analogous to those in PaFS (F65, F89, F196, W197 and W319), while the PaFS residues at the bottom of the active site (M221, W225, M309, S312 and N316) are substituted with smaller residues in TvTS (A212, Y216, G300, G303 and S307; Fig. 3a, b). These differences create a larger pocket in TvTS-TC that is composed of a tunnel to accommodate two additional (non-reacting) isoprene units of HexPP, sticking out from a ball-shaped cavity in which the four reacting isoprene units are located. This ball-shaped part is similar in shape and size to the PaFS active site that houses GGPP.

Structure-based site-directed mutagenesis experiments (Extended Data Fig. 2) included several small-to-large substitutions in TvTS near C15 to C20 of 2,3-dihydro-HexPP (G184F, A219L, G220L and S307F). The enzyme variants showed strongly reduced production of 1, suggesting that the available space in this region is important for correct substrate folding. In addition, the exchange of conserved aromatic residues for alanine or leucine (F65A/L, F89A/L, F187A/L, W188A/L and W310A) abolished or decreased enzyme activity, indicating that these TvTS residues have an important role in shaping the active site cavity and/or stabilizing intermediates through cation−π interactions (Fig. 2e)⁶. The small residues that widen the binding pocket in comparison with PaFS were substituted with the corresponding residues in PaFS-TC. While the G303S and S307N substitutions had little effect, the A212M and Y216W variants retained only 15% and 28% of wild-type activity for the generation of 1 and production with the G300M variant was completely disrupted, in support of the hypothesis that the larger steric bulk introduced in these enzyme variants does not allow uptake of HexPP.

Further insights into triterpene biosynthesis by PT–TC chimeric enzymes were obtained by cryo-electron microscopy (cryo-EM) of MpMS. The data obtained for full-length MpMS showed that the enzyme exclusively forms a hexamer, whereas PaFS forms octamers and hexamers with a ratio of 9:1 (Fig. 3d, Supplementary Figs. 37 and 38 and Extended Data Table 2)³⁷. A three-dimensional (3D) reconstruction of unliganded MpMS could be established for only the PT domain, as in the cryo-EM analysis of PaFS³⁷. The overall structure of the MpMS-PT monomer is highly similar to the PaFS-PT cryo-EM and crystal structures, and the hexamer of MpMS-PT also superimposes well with the PaFS hexamer (root mean square deviation of 1.4–1.6 Å for 274 Cα atoms). Moreover, the active site of MpMS-PT is of suitable size for its product HexPP (Extended Data Fig. 3).

Interactions between the PT and TC domains in MpMS were further investigated by cross-linking the domains with glutaraldehyde³⁸. The obtained structural data for cross-linked MpMS were substantially different from those for non-cross-linked MpMS (Supplementary Figs. 39 and 40 and Extended Data Table 2). Additional electron densities were observed in cross-linked MpMS at each vertex of the PT hexamer with a different occupancy, suggesting that the orientation of the TC domain is flexible although its position was fixed by the cross-linking. However, because the local resolution was low and fragmented, a model for only one helix of the TC domain in addition to the PT domain could be built (Fig. 3e). Fitting of a TC domain homology model into the density map (Fig. 3f) suggested that the α5 helix region (residues 146–163) participates through cross-linking of K150 to K442 of the PT domain (the distance between these residues is only 3.0 Å). Moreover, TC residues T147, K150, N151 and K155 on helix α5 form hydrogen bonds with PT domain residues on helices α2 (419–429) and α14 (686–695; Extended Data Fig. 4).

Bifunctional TSs with an αα domain architecture may have a catalytic advantage via proximity channelling if the active sites of the interacting domains are properly faced towards each other^39,40,41. In the PaFS octamer, this is ideally realized, as the TC domains of PaFS are oriented towards the central pores of the PT domains, which facilitates product channelling from the PT domain to the TC domain³⁷. Additionally, in the hexameric structure of MpMS, each TC domain is located close to a PT domain, with their active sites facing each other (Extended Data Fig. 5), which allows for direct transfer of HexPP from the PT domain to an adjacent TC domain.

The MpMS-TC model⁴² does not show the same tunnel for uptake of two non-reacting isoprene units as was observed for TvTS, as the TvTS positions of A212, G300 and G303 are occupied by bulky residues in MpMS (L238, L318 and M321). By contrast, the ball-shaped part of the active site of MpMS is wider because residues F187 and W188 of TvTS are substituted by smaller ones in MpMS (V206 and A207; Fig. 3a, c). These observations may explain why TvTS-TC with its initial C1-III-IV cyclization behaves like a diterpene synthase (DTS), acting on only the first four isoprene units of HexPP, while the two isoprene units at the end of the HexPP chain stick out into a tunnel and do not participate in terpene cyclization. By contrast, MpMS lacks this tunnel but has an overall larger reaction chamber for full uptake of HexPP, thus allowing its unusual C1-VI cyclization. In agreement with this hypothesis, MpMS V206F and A207W variants showed nearly or completely abolished production of 2 (Extended Data Fig. 6).

The high similarity between the AlphaFold2-predicted model and the crystal structure of TvTS-TC offers a basis for the discovery of additional chimeric fungal TrTSs through structural prediction (Fig. 4a and Supplementary Fig. 41), which is not possible from amino acid sequence analyses. Such structure predictions with docking of HexPP were performed for ten chimeric TSs with low sequence similarity to previously characterized enzymes. Six of these enzymes (PTTC027, PTTC044, PTTC060, PTTC074, PTTC114 and Cgl13855) contained a pocket that may be sufficiently large for HexPP binding (Supplementary Figs. 42 and 43 and Supplementary Table 7). Functional characterization through expression in yeast indeed resulted in colleterpenol (3) for Cgl13855 (NMDCN0000R73) from the endophyte C. gloeosporioides ES026 (ref. ⁴³) (Fig. 4c, Extended Data Fig. 7 and Supplementary Fig. 44), characterizing the TrTS as CgCS. HexPP docking identified an active site cavity for Cgl13855-TC similar to that of TvTS, explaining the C1-III-IV cyclization with two isoprene units sticking out into a tunnel (Fig. 4b).

**Fig. 4: AlphaFold2-based genome mining of CgCS.**

The planar structure of 3 was identified by NMR (Supplementary Table 8 and Supplementary Figs. 45–59), showing structural similarity to the known 15-hydroxy-α-cericerene⁴⁴. Solving the relative configuration of 3 through nuclear Overhauser enhanced spectroscopy (NOESY) was difficult because of its conformational flexibility. A comparison of calculated to measured NMR data favoured the structure (14R*,15S*)-3, which was further supported by a comparison of the ¹³C-NMR data to those reported for 15-hydroxy-α-cericerene⁴⁴ with the same relative configuration (Supplementary Table 8). The absolute configuration of (14R,15S)-3 was then determined through comparison of measured and calculated electronic circular dichroism (ECD) curves (Supplementary Fig. 47). Notably, 3 formation requires syn addition of C1 and water to the C14=C15 alkene in HexPP (Fig. 4d). Additionally, PTTC074 produced a similar triterpene compound as CgCS (Extended Data Fig. 7 and Supplementary Fig. 44), which implies the existence of more TrTSs to be explored.

In summary, a novel family of chimeric class I TrTSs converting HexPP into triterpenes was identified for which two subclasses can be distinguished. Some TrTS-TCs provide one large cavity for HexPP uptake to form macrocycles. Other enzymes have a ball-shaped cavity of similar size to that in DTSs with an adjacent tunnel that accommodates two ‘spectator’ isoprene units, with the consequence that only the first four units participate in cyclization. This reflects the situation for the C1-III-IV and C1-IV-V cyclizing subclasses of sesterterpene synthases, but structural insights are lacking to understand these different modes. The findings described here show expanded product boundaries for class I TSs and enrich understanding of terpene biosynthesis in nature.

Methods

Strains and culture conditions

Cloning was performed in Escherichia coli DH10B using standard recombinant DNA techniques. E. coli BL21(DE3) (Invitrogen) was used for protein expression. Both E. coli strains were grown in LB (10.0 g l^–1 tryptone, 5.0 g l^–1 yeast extract, 5.0 g l^–1 NaCl, pH 7.0) at 37 °C. S. cerevisiae YZL141 (ref. ⁴⁵) was used for the heterologous expression of ZTR_06220 (KUL85185) from T. verruculosus TS63-9 (ref. ²¹), MPH_02178 (EKG20455) from M. phaseolina MS6 (ref. ²²) and Cgl13855 (NMDCN0000R73) from C. gloeosporioides ES026 (ref. ⁴³) and grown in YPD medium (20 g l^–1 tryptone, 10 g l^–1 yeast extract, 20 g l^–1 glucose, pH 7.2) at 30 °C. T. verruculosus TS63-9 was selected for functional verification of TvTS in vivo and grown in PDB medium (20 g l^–1 potato extract, 20 g l^–1 glucose, pH 6.5).

Phylogenetic analysis of TvTS and MpMS

To characterize the evolutionary relationships of TvTS, MpMS and CgCS, 56 characterized fungal chimeric class I TSs were selected and multiple-sequence alignment was performed using ClustalW (version 2.0.12). The Poisson correction model based on the maximum-likelihood method was used to infer the evolutionary history of these enzymes, and MEGA7 was used to conduct the evolutionary analysis⁴⁶. Bootstrap values were obtained using 1,000 replications. The initial tree for the heuristic search was acquired automatically by applying the Neighbor-Join and BioNJ algorithms to a matrix of pairwise distances estimated using a JTT model and then selecting the topology with a superior log-likelihood value. After eliminating all positions that contained gaps and missing data, 186 positions were left in the final dataset. The plant-derived sesterterpene synthase AtTPS25 (ref. ⁴⁷) was selected as an outgroup.

Functional characterization of TrTS candidates in vivo

Synthetic genes encoding TvTS (ZTR_06220), MpMS (MPH_02178), CgCS and the nine remaining candidates (Supplementary Table 7), codon optimized for expression in yeast, were obtained from GenScript. After digestion with HindIII and ScaI, the genes were cloned into HindIII- and ScaI-digested pYJ117 plasmid²⁰ to produce plasmids pXM139 (from ZTR_06220), pXM018 (from MPH_02178), pRC181 (from Cgl13855) and five additional plasmids. The plasmids were individually transformed into S. cerevisiae YZL141 to yield corresponding strains, which were grown in 5 ml YPD medium (2% glucose) at 30 °C overnight. From this starter culture, 5 ml was added to a 500-ml flask containing 200 ml YPD (2% glucose and 1% galactose), followed by growth at 30 °C with shaking at 220 r.p.m. for 3 d. Strains were collected, and the mycelium was collected and extracted with hexane/ethyl acetate (4:1). The organic layers were combined for gas chromatography mass spectrometry (GC–MS) analysis.

Fermentation of engineered S. cerevisiae strains and isolation of 1 and 3

For isolation of 1 and 3, S. cerevisiae XM139 and RC181 strains were scaled up in 2-litre shaker flasks containing 1 litre YPD (2% glucose and 1% galactose). Subsequently, the mycelium was collected and extracted with acetone three times. The acetone-extracted layer was distilled under reduced pressure to remove acetone and then partitioned between ethyl acetate and water to afford the ethyl acetate fraction (1.7 g for 1 and 1.56 g for 3). For 1, the residue was subjected to a silica gel column (80–100 mesh) and elution with petroleum ether/ethyl acetate (100:0, 99.8:0.2, 99:1, 98:2, 95:5, 0:100) to give fractions A–F. Following this, 1 was identified in fraction B by GC–MS detection and further purified by semi-preparative HPLC (Ultimate 3000 HPLC equipped with an XBridge Prep C18 column (Waters, 113; 10 × 250 mm, 5 μm)) to afford compound 1 (7.2 mg). For 3, the crude extract was dissolved in methanol/DMSO (10:1, vol/vol) and subjected to semi-preparative HPLC (column: Agilent ZORBAX BS-C18, 5 μm, 9.6 × 250 mm internal diameter; solvent: acetonitrile/H₂O, 99:1; flow: 3 ml min^–1; detector: 210, 230 nm) to yield compound 3 (30.5 mg, t_R = 24.2 min).

Plasmid construction for in vitro and labelling experiments

For expression of recombinant TvTS and MpMS in E. coli, codon-optimized synthetic genes were obtained from GenScript. The synthetic gene for MpMS was digested with NdeI and EcoRI and cloned into pET28a to produce pRC088. The sequences encoding the PT and TC domains of TvTS were amplified separately by PCR with Phusion DNA polymerase using primer pairs P1/P2 and P3/P4 (Supplementary Table 2). The amplified nucleotide sequence for the PT domain was digested with NdeI and EcoRI and cloned into pET28a to produce plasmid pRC009, and the amplified nucleotide sequence for the TC domain was digested with HindIII and XhoI and cloned into pET21a to generate plasmid pRC041. The sequence encoding the TC domain-inactivated MpMS D114A/N115A variant was amplified from pRC088 using primer pairs P5/P6 and P7/P8 (Supplementary Table 2) and then assembled by overlap extension PCR and cloned into NdeI- and EcoRI-digested pET28a to produce pRC088-D114A/N115A. Correct gene insertion was verified by sequencing, and plasmids were used to transform E. coli BL21(DE3) competent cells using a calcium-based protocol.

Protein expression and purification for in vitro assays

For gene expression, a fresh LB culture of E. coli BL21(DE3) transformants (containing 100 mg l^–1 ampicillin for pRC041 (TvTS-TC) and 50 mg l^–1 kanamycin for pRC009 (TvTS-PT), pRC088 (MpMS) and pRC088-D114A/N115A) was inoculated from a glycerol stock and grown overnight. The precultures were used to inoculate the desired volume of LB (1 ml per litre) amended with the appropriate antibiotic. Cultures were grown at 37 °C with shaking until an OD₆₀₀ of 0.6–0.8 was reached. The cultures were cooled to 16 °C, and isopropyl β-d-1-thiogalactopyranoside (0.1 mM) was added to induce expression. Proteins were expressed overnight (~20 h) at 16 °C with shaking. Cells were collected by centrifugation (8,000g, 5 min). The supernatant was discarded, and the cell pellet was resuspended in buffer A (50 mM Tris-HCl, 300 mM NaCl, 4 mM β-mercaptoethanol, pH 7.6; 10 ml per litre of culture). Cells were lysed by sonication on ice (5 × 30 s). The cellular debris was removed by centrifugation (14,710g, 2 × 7 min), and the supernatant was subjected to Ni-NTA affinity chromatography (Protino Ni-NTA, Macherey-Nagel) through a syringe filter. The resin was washed with buffer A (20 ml per litre of culture), followed by elution of the His₆-tagged proteins using elution buffer (buffer A + 300 mM imidazole; 10 ml per litre of culture). The proteins were concentrated using centrifugal filters (10-kDa cut-off; 5,000g, 4 °C; Amicon Ultra-15 (Millipore) or Vivaspin 20 (Sartorius)) and diluted with incubation buffer (50 mM Na₂HPO₄, 10% glycerol, 2 mM MgCl₂). Enzyme concentrations were determined by Bradford assay and adjusted to 20 μΜ. Incubation experiments were carried out at 30 °C overnight using combinations of enzymes and substrates as listed in Supplementary Table 5. For experiments with unlabelled substrates, hexane (650 μl) was used for extraction, whereas for experiments with labelled substrates extraction was performed using C₆D₆ (650 μl). Samples were directly analysed by GC–MS and/or NMR spectroscopy.

In vitro enzyme assays for TvTS-PT and MpMS D114A/N115A and detection of HexPP

Reactions were carried out using substrate (IPP and DMAPP, 100 μM each), 2 mM Mg²⁺, 10% glycerol and 10 μM enzyme (TvTS-PT or MpMS-PT D114A/N115A) in Tris-HCl buffer (200 μl; 50 mM, pH 7.6) at 30 °C overnight. The resulting HexPP was extracted with acetonitrile and analysed by liquid chromatography mass spectrometry (LC–MS). For high-resolution MS analysis of HexPP, an LTQ Orbitrap Elite instrument coupled to a Thermo Scientific Ultimate 3000 RSLC HPLC system and an ACE UltraCore 2.5 SuperC18 (2.1 × 100 mm) column was used for compound separation at 35 °C. The mobile phase (pH 9.5) containing 5 mM ammonium bicarbonate in water as solvent A and acetonitrile as solvent B was set to a flow rate of 0.2 ml min^–1. The gradient programme was as follows: 98–10% solvent A (0–10 min), 10–0% solvent A (10–15 min), 0% solvent A (15–17 min), 0–98% solvent A (17–18 min) and 98% solvent A (18–22 min). EI was used in negative mode for detection. The ion source parameters were as follows: sheath gas, 40 arb; auxiliary gas, 5 arb; spray voltage, 3.1 kV; capillary temperature, 270 °C; S-lens RF level, 65 kV; auxiliary gas heater temperature, 250 °C. Full-scan MS mode with a resolution of 60,000 was used for qualitative analysis.

Site-directed mutagenesis and in vitro analysis of enzyme variants

Site-directed mutagenesis for the construction of MpMS and TvTS enzyme variants was performed with the PCR-based QuikChange Site-Directed Mutagenesis kit (Stratagene) according to the manufacturer’s protocol, using Phusion DNA polymerase and the mutational primers listed in Supplementary Table 2. Plasmids pRC088 (containing the full-length gene for MpMS) and pMBP139 (containing the full-length gene for TvTS) were used as template. All mutants were verified by gene sequencing. Reactions for TvTS, MpMS and the variants were carried out using substrate (DMAPP and IPP, 100 μM and 500 μM, respectively), 2 mM Mg²⁺, 5% glycerol and 10 μM enzyme in Tris-HCl buffer (200 μl; 50 mM, pH 7.5) at 30 °C overnight. The resulting product was extracted with ethyl acetate and analysed by GC–MS. All experiments were performed in three biological replicates.

Isolation of macrophomene (2) from incubation with MpMS E104Y

MpMS E104Y was expressed in E. coli and purified by Ni-NTA affinity chromatography. The pooled enzyme fractions were concentrated, further purified by size-exclusion chromatography and concentrated again. Incubation was performed using GPP (30 mg, 0.08 mM) and IPP (100 mg, 0.34 mM) dissolved in NH₄HCO₃ buffer (25 mM, 20 ml), which was added to the enzyme solution in incubation buffer (100 ml) over a period of 1 h using a syringe pump. The mixture was incubated overnight and extracted with n-hexane (150 ml) three times. The organic layers were dried with MgSO₄ and concentrated under reduced pressure. The residue was purified by column chromatography on SiO₂ (pentane) to yield macrophomene as a colourless oil.

Construction of expression plasmids for protein crystallization

To increase the solubility of protein in E. coli, the coding sequence for TvTS was amplified using primer pair P21/P22 from the codon-optimized synthetic gene and ligated into NdeI- and HindIII-digested pET28-MBP-TEV, using the In-Fusion HD Cloning kit (TaKaRa), yielding plasmid pMBP139. For expression of protein for crystallization, the coding sequence for TvTS-TC was amplified from pRC041 using primer pair P23/P24. The amplified sequence was cloned into pET-SUMO, which was itself amplified with primer pair P25/P26, using the In-Fusion HD Cloning kit, resulting in plasmid pSUMO041.

Protein expression and purification for crystallization

Protein expression was performed with E. coli BL21(DE3) harbouring pSUMO041 using the same protocol as described above. For crystallization, after Ni-NTA purification, SUMO–TvTS-TC was dialysed against 2 × 1 litre of Tris-HCl buffer (pH 8.0) containing 5% (vol/vol) glycerol and 300 mM NaCl. After dialysis, SUMO–TvTS-TC was treated with SUMO protease Ulp1403-621 (ref. ⁴⁸; prepared as previously described, 0.87 μM) in the presence of dithiothreitol (DTT, 1 mM) at 4 °C overnight. The protein solution was loaded onto a column filled with Ni-NTA resin. The His₆–SUMO fragment and protease were then captured by the Ni-NTA resin, leaving TvTS-TC in the flow-through, and the remaining protein on the column was eluted with Tris-HCl buffer (pH 8.0) containing 5% (vol/vol) glycerol, 10 mM imidazole and 300 mM NaCl. The collected protein solution was incubated with 10 mM EDTA (pH 8.0) at 4 °C for 1 h. To obtain a protein preparation of high purity, further purification was performed using anion-exchange chromatography on a Resource Q column (Cytiva) by linearly increasing the salt concentration from 0 M NaCl to 1 M NaCl in buffer (50 mM Tris-HCl (pH 8.0), 1 mM DTT, 5% glycerol) over 20 column volumes. The desired protein was collected and then purified to homogeneity by size-exclusion chromatography on a HiLoad 16/600 Superdex 200 pg column (Cytiva) and eluted with a solution containing 20 mM Tris-HCl (pH 8.0), 100 mM NaCl, 1 mM DTT and 5% glycerol. The resulting eluate was concentrated to 10 mg ml^–1, using an Amicon Ultra-4 filter (molecular weight cut-off of 30 kDa) at 4 °C. The purity of the proteins was monitored by SDS–PAGE, and protein concentrations were determined with a SimpliNano microvolume spectrophotometer.

Crystallization and structure determination

Crystals of TvTS-TC were obtained after 1 d at 10 °C by using the sitting-drop vapor-diffusion method. Before crystallization, 125 μM of protein was incubated with 2 mM MgCl₂ on ice for 30 min, and 0.5 μl of protein solution was then mixed with 0.5 μl of reservoir solution containing 0.1 M Tris-HCl (pH 8.5), 0.1 M MgCl₂, 30% PEG 4000 and 0.2 M NDSB-211. Crystals of TvTS-TC in complex with 2,3-dihydro-HexPP were obtained by incubation of TvTS-TC crystals with 10 mM 2,3-dihydro-HexPP in the crystallization drop at 10 °C for 14 h. The crystals were transferred to cryoprotectant solution (reservoir solution with 25% (vol/vol) glycerol) and then flash cooled at –173 °C in a nitrogen gas stream. The X-ray diffraction datasets were collected at X06SA (Paul Scherrer Institut, Villigen, Switzerland) for the apo TvTS-TC structure and at BL-1A (Photon Factory, Tsukuba, Japan) for the structure of TvTS-TC in complex with 2,3-dihydro-HexPP, using a beam wavelength of 1.0 and 1.1 Å, respectively. The diffraction datasets for TvTS-TC were processed and scaled using the XDS package⁴⁹ and Aimless⁵⁰. The initial phase of the TvTS-TC structure was determined by molecular replacement, using PaFS (PDB, 5ER8) as the search model. Molecular replacement was performed with Phaser in PHENIX (version 1.19.2-4158-000)^51,52. The initial phase was further calculated with AutoBuild in PHENIX⁵². The TvTS-TC structures were modified manually with Coot⁵³ and refined with PHENIX.refine⁵⁴. The cif parameters of the ligands for the energy minimization calculations were obtained by using the PRODRG server⁵⁵. After soaking with 2,3-dihydro-HexPP, strong additional electron densities were observed close to the DDXXD motif and in the active site cavity (Supplementary Fig. 36a, b). Modelling of 2,3-dihydro-HexPP to the observed density was partly satisfactory, but some of the methyl groups along the isoprenoid chain stuck out (Supplementary Fig. 36c). For an alternative explanation, PEG used in the crystallization buffer was modelled to the density, but in this case large unassigned densities close to the DDXXD motif remained in the refined structure (Supplementary Fig. 36d). Thus, it is possible that the observed density originated from both 2,3-dihydro-HexPP and PEG with low occupancies. The final crystal data and intensity statistics are summarized in Extended Data Table 1. The Ramachandran statistics were as follows: 97.6% favoured and 2.4% allowed for apo TvTS-TC, 98.9% favoured and 1.1% allowed for TvTS-TC soaked with 2,3-dihydro-HexPP. Although the ligand was not assigned, the conformations of PEG and 2,3-dihydro-HexPP in the active site should be similarly defined by active site residues. Therefore, a docking model of 2,3-dihydro-HexPP based on the observed density was developed. All crystallographic figures were prepared with PyMOL (DeLano Scientific; http://www.pymol.org).

Purification of MpMS and cross-linking of the PT and TC domains

Protein expression was performed with E. coli BL21(DE3) harbouring pRC088 using the same protocol as described above. After Ni-NTA purification, MpMS protein solution was further purified using anion-exchange chromatography on a Resource Q column (Cytiva) by linearly increasing the salt concentration from 20 mM NaCl to 1 M NaCl in buffer (50 mM HEPES (pH 7.5) and 1 mM DTT) over 20 column volumes. The desired protein was collected and then purified by size-exclusion chromatography on a Superose 6 10/300 column (Cytiva) using buffers containing 20 mM HEPES (pH 7.5), 150 mM NaCl and 1 mM DTT. The resulting eluate was concentrated to 50 μM, using an Amicon Ultra-4 filter (molecular weight cut-off of 100 kDa) at 4 °C. The purity of the proteins was monitored by SDS–PAGE. To obtain the structure of full-length MpMS, cross-linking of the PT and TC domains via glutaraldehyde (25% in water; Nacalai Tesque) was performed. First, MpMS protein was purified by Ni-NTA and anion-exchange chromatography, as described above, to obtain pure protein. Subsequently, cross-linking was performed on ice by incubating 1 mg ml^–1 MpMS protein with 0.06% glutaraldehyde for 10 min at a 100-μl scale. Reactions were quenched by adding 10 μl of 1.0 M Tris-HCl (pH 8.0). Reaction mixtures were pooled and further purified by size-exclusion chromatography on a Superose 6 10/300 column (Cytiva) to exclude aggregations. The resulting eluate was concentrated to 50 μM, using an Amicon Ultra-4 filter (molecular weight cut-off of 100 kDa) at 4 °C.

Cryo-EM sample preparation and data acquisition

For cryo-grid preparation, 3 μl of sample was applied to a holey carbon grid (Quantifoil, Cu, R1.2/1.3, 300 mesh). The grid was rendered hydrophilic by a 30-s glow discharge in air (11-mA current) with PIB-10 (Vacuum Device). The grid was blotted for 20 s (blot force of 0) at 18 °C and 100% humidity and then flash frozen in liquid ethane using a Vitrobot Mark IV (Thermo Fisher Scientific). For the MpMS-PT and MpMS cross-linking datasets, 1,888 and 1,529 movies were acquired, respectively, on a Talos Arctica (FEI) microscope operating at 200 kV in nanoprobe mode using EPU software for automated data collection. The movies were collected on a 4,000 × 4,000 grid using a Falcon 3EC direct electron detector (electron counting mode) at a nominal magnification of 120,000 (0.88 Å per pixel). Fifty movie fractions were recorded at an exposure of 1.00 electrons per Å² per fraction, corresponding to a total exposure of 50 electrons per Å². The defocus steps used were –1.0, –1.5, –2.0 and –2.5 μm. The movie fractions were aligned, dose weighted and averaged using RELION’s own implementation on 5 × 5 tiled fractions with a B-factor of 300. The non-weighted movie sums were used for contrast transfer function (CTF) estimation with Gctf ⁵⁶. The dose-weighted sums were used for all subsequent steps of image processing. The subsequent processes of particle picking, two-dimensional classification, ab initio reconstruction, 3D classification, 3D refinement, CTF refinement and Bayesian polishing were performed using RELION-3.18 (ref. ⁵⁷). For details of the cryo-EM data processing, see Supplementary Figs. 37–40.

AlphaFold2 prediction and docking analysis

UCSF Chimera⁵⁸ (version 1.12) and AutoDock Vina⁵⁹ (version 1.1.2) were used to perform receptor and ligand preparation and molecular docking analysis. Before the docking procedure, the receptor structures predicted by AlphaFold2 and ligands were processed as follows. Metal ions were added to the TC domain binding site of receptors, referring to the homologous protein in PDB for the coordinates of the three Mg²⁺ ions. With the exception of Cgl13855 and FgMS, for which 5IMP and 5ER8 were used as the reference for metal ions, respectively, 6VYD was used as the reference for the other PTTCs. The receptor structures containing metal ions were then processed using the Chimera tool Dock Prep, in which hydrogen atoms and charges were added and other parameters were set as default. The energy-minimization molecule models included in Chimera were used to minimize the energy for the structures from the previous step. The ligand structures were drawn, charge was added and the structures were transformed to 3D conformations. The binding site of the TC domain was determined by referring to the crystal structures for homologous proteins. The grid box in the docking procedure was defined to include the metal ions, and corresponding residues appeared in the binding site of crystal structures for homologous proteins. Receptor and ligand options in AutoDock Vina were set as default. The number of binding modes, exhaustiveness of search and maximum energy difference (kcal mol^–1) parameters were set as 9, 8 and 3, respectively.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this paper.

Data availability

The authors declare that the main data supporting the findings of this study are available within the article and its Supplementary Information. Original data can be obtained from the corresponding authors on reasonable request. The coordinates and structure factor amplitudes for the apo structure of TvTS-TC and the structure of TvTS-TC after soaking with 2,3-dihydro-HexPP have been deposited to PDB under accession codes 7VTA and 7VTB, respectively. The cryo-EM maps and atomic coordinates for MpMS-PT and cross-linked MpMS have been deposited in the Electron Microscopy Data Bank (EMDB; https://www.ebi.ac.uk/pdbe/emdb/) and PDB with accession codes EMD-32531 and EMD-32532 and 7WIJ, respectively. The accession numbers (NMDCN0000RG9, NMDCN0000RGA, NMDCN0000RGB, NMDCN0000R73, KFX89132, KAH9237577, KIK55704, KAF2708718, CRG86078, QIH97829) corresponding to the PTTC candidates for AlphaFold2 prediction have been deposited in the National Microbiology Data Center (https://nmdc.cn/en) and are listed in the Supplementary Information.

References

Abe, I. Enzymatic synthesis of cyclic triterpenes. Nat. Prod. Rep. 24, 1311–1331 (2007).
Article CAS PubMed Google Scholar
Degenhardt, J., Köllner, T. G. & Gershenzon, J. Monoterpene and sesquiterpene synthases and the origin of terpene skeletal diversity in plants. Phytochemistry 70, 1621–1637 (2009).
Article CAS PubMed Google Scholar
Minami, A., Ozaki, T., Liu, C. & Oikawa, H. Cyclopentane-forming di/sesterterpene synthases: widely distributed enzymes in bacteria, fungi, and plants. Nat. Prod. Rep. 35, 1330–1346 (2018).
Article CAS PubMed Google Scholar
Karunanithi, P. S. & Zerbe, P. Terpene synthases as metabolic gatekeepers in the evolution of plant terpenoid chemical diversity. Front. Plant Sci. 10, 1166 (2019).
Article PubMed PubMed Central Google Scholar
Dzubak, P. et al. Pharmacological activities of natural triterpenoids and their therapeutic implications. Nat. Prod. Rep. 23, 394–411 (2006).
Article CAS PubMed Google Scholar
Quin, M. B., Flynn, C. M. & Schmidt-Dannert, C. Traversing the fungal terpenome. Nat. Prod. Rep. 31, 1449–1473 (2014).
Article CAS PubMed PubMed Central Google Scholar
Thimmappa, R., Geisler, K., Louveau, T., O’Maille, P. & Osbourn, A. Triterpene biosynthesis in plants. Annu. Rev. Plant Biol. 65, 225–257 (2014).
Article CAS PubMed Google Scholar
Beytia, E., Qureshi, A. A. & Porter, J. W. Squalene synthetase: III. Mechanism of the reaction. J. Biol. Chem. 248, 1856–1867 (1973).
Article CAS PubMed Google Scholar
Hoshino, T. & Sato, T. Squalene-hopene cyclase: catalytic mechanism and substrate recognition. Chem. Commun. 4, 291–301 (2002).
Article CAS Google Scholar
Phillips, D. R., Rasbery, J. M., Bartel, B. & Matsuda, S. P. Biosynthetic diversity in plant triterpene cyclization. Curr. Opin. Plant Biol. 9, 305–314 (2006).
Article CAS PubMed Google Scholar
Christianson, D. W. Structural and chemical biology of terpenoid cyclases. Chem. Rev. 117, 11570–11648 (2017).
Article CAS PubMed PubMed Central Google Scholar
Baer, P. et al. Induced‐fit mechanism in class I terpene cyclases. Angew. Chem. Int. Ed. 53, 7652–7656 (2014).
Article CAS Google Scholar
Dickschat, J. S. Bacterial terpene cyclases. Nat. Prod. Rep. 33, 87–110 (2016).
Article CAS PubMed Google Scholar
Kawamukai, M. Biosynthesis of coenzyme Q in eukaryotes. Biosci. Biotechnol., Biochem. 80, 23–33 (2016).
Article CAS Google Scholar
Sasaki, D. et al. Crystal structure of heterodimeric hexaprenyl diphosphate synthase from Micrococcus luteus BP 26 reveals that the small subunit is directly involved in the product chain length regulation. J. Biol. Chem. 286, 3729–3740 (2011).
Article CAS PubMed Google Scholar
Zhang, Y. W., Li, X. Y. & Koyama, T. Chain length determination of prenyltransferases: both heteromeric subunits of medium-chain (E)-prenyl diphosphate synthase are involved in the product chain length determination. Biochemistry 39, 12717–12722 (2000).
Article CAS PubMed Google Scholar
Smith, T. E. Biogenetic relationships of bioactive sponge merotriterpenoids. Mar. Drugs 15, 285 (2017).
Article PubMed Central CAS Google Scholar
Mitsuhashi, T., Okada, M. & Abe, I. Identification of chimeric αβγ diterpene synthases possessing both type II terpene cyclase and prenyltransferase activities. ChemBioChem 18, 2104–2109 (2017).
Article CAS PubMed Google Scholar
Faylo, J. L., Ronnebaum, T. A. & Christianson, D. W. Assembly-line catalysis in bifunctional terpene synthases. Acc. Chem. Res. 54, 3780–3791 (2021).
Article CAS PubMed Google Scholar
Chen, R. et al. Systematic mining of fungal chimeric terpene synthases using an efficient precursor-providing yeast chassis. Proc. Natl Acad. Sci. USA 118, e2023247118 (2021).
Article CAS PubMed PubMed Central Google Scholar
Hu, L. et al. Draft genome sequence of Talaromyces verruculosus (“Penicillium verruculosum”) strain TS63-9, a fungus with great potential for industrial production of polysaccharide-degrading enzymes. J. Biotechnol. 219, 5–6 (2016).
Article CAS PubMed Google Scholar
Islam, M. S. et al. Tools to kill: genome of one of the most destructive plant pathogenic fungi Macrophomina phaseolina. BMC Genomics 13, 493 (2012).
Article CAS PubMed PubMed Central Google Scholar
Toyomasu, T. et al. Fusicoccins are biosynthesized by an unusual chimera diterpene synthase in fungi. Proc. Natl Acad. Sci. USA 104, 3084–3088 (2007).
Article ADS CAS PubMed PubMed Central Google Scholar
Chiba, R., Minami, A., Gomi, K. & Oikawa, H. Identification of ophiobolin F synthase by a genome mining approach: a sesterterpene synthase from Aspergillus clavatus. Org. Lett. 15, 594–597 (2013).
Article CAS PubMed Google Scholar
Bian, G. et al. A clade II‐D fungal chimeric diterpene synthase from Colletotrichum gloeosporioides produces dolasta‐1 (15), 8‐diene. Angew. Chem. Int. Ed. 57, 15887–15890 (2018).
Article CAS Google Scholar
Rabe, P. et al. Mechanistic investigations of two bacterial diterpene cyclases: spiroviolene synthase and tsukubadiene synthase. Angew. Chem. Int. Ed. 56, 2776–2779 (2017).
Article CAS Google Scholar
Lauterbach, L., Rinkel, J. & Dickschat, J. S. Two bacterial diterpene synthases from Allokutzneria albata for bonnadiene and for phomopsene and allokutznerene. Angew. Chem. Int. Ed. 57, 8280–8283 (2018).
Article CAS Google Scholar
Rinkel, J. & Dickschat, J. S. Addressing the chemistry of germacrene A by isotope labeling experiments. Org. Lett. 21, 2426–2429 (2019).
Article CAS PubMed Google Scholar
Rinkel, J., Lauterbach, L., Rabe, P. & Dickschat, J. S. Two diterpene synthases for spiroalbatene and cembrene A from Allokutzneria albata. Angew. Chem. Int. Ed. 57, 3238–3241 (2018).
Article CAS Google Scholar
Crooks, G. E., Hon, G., Chandonia, J. M. & Brenner, S. E. WebLogo: a sequence logo generator. Genome Res. 14, 1188–1190 (2004).
Article CAS PubMed PubMed Central Google Scholar
Robinson, D. R. & West, C. A. Biosynthesis of cyclic diterpenes in extracts from seedlings of Ricinus communis. I. Identification of diterpene hydrocarbons formed from mevalonate. Biochemistry 9, 70–79 (1970).
Article CAS PubMed Google Scholar
Chen, J. L., Moore, R. E. & Patterson, G. M. Structures of nostocyclophanes AD. J. Org. Chem. 56, 4360–4364 (1991).
Article CAS Google Scholar
Rabe, P. et al. Conformational analysis, thermal rearrangement, and EI–MS fragmentation mechanism of (1(10)E, 4E, 6S, 7R)‐germacradien‐6‐ol by ¹³C‐labeling experiments. Angew. Chem. Int. Ed. 54, 13448–13451 (2015).
Article CAS Google Scholar
Rinkel, J., Lauterbach, L. & Dickschat, J. S. A branched diterpene cascade: the mechanism of spinodiene synthase from Saccharopolyspora spinosa. Angew. Chem. Int. Ed. 58, 452–455 (2018).
Article CAS Google Scholar
Chen, M., Chou, W. K., Toyomasu, T., Cane, D. E. & Christianson, D. W. Structure and function of fusicoccadiene synthase, a hexameric bifunctional diterpene synthase. ACS Chem. Biol. 11, 889–899 (2016).
Article CAS PubMed PubMed Central Google Scholar
Tarshis, L., Yan, M., Poulter, C. D. & Sacchettini, J. C. Crystal structure of recombinant farnesyl diphosphate synthase at 2.6-Å resolution. Biochemistry 33, 10871–10877 (1994).
Article CAS PubMed Google Scholar
Faylo, J. L. et al. Structural insight on assembly-line catalysis in terpene biosynthesis. Nat. Commun. 12, 3487 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Migneault, I., Dartiguenave, C., Bertrand, M. J. & Waldron, K. C. Glutaraldehyde: behavior in aqueous solution, reaction with proteins, and application to enzyme crosslinking. Biotechniques 37, 790–802 (2004).
Article CAS PubMed Google Scholar
Bauler, P., Huber, G., Leyh, T. & McCammon, J. A. Channeling by proximity: the catalytic advantages of active site colocalization using Brownian dynamics. J. Phys. Chem. Lett. 1, 1332–1335 (2010).
Article CAS PubMed PubMed Central Google Scholar
Chen, M., Harris, G. G., Pemberton, T. A. & Christianson, D. W. Multi-domain terpenoid cyclase architecture and prospects for proximity in bifunctional catalysis. Curr. Opin. Struct. Biol. 41, 27–37 (2016).
Article CAS PubMed PubMed Central Google Scholar
Pemberton, T. A. et al. Exploring the influence of domain architecture on the catalytic function of diterpene synthases. Biochemistry 56, 2010–2023 (2017).
Article CAS PubMed Google Scholar
Jumper, J. et al. Highly accurate protein structure prediction with AlphaFold. Nature 596, 583–589 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Zhao, X. M. et al. Ethanol and methanol can improve huperzine A production from endophytic Colletotrichum gloeosporioides ES026. PLoS ONE 8, e61777 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Chen, Y. G. et al. A cryptic plant terpene cyclase producing unconventional 18‐ and 14‐membered macrocyclic C₂₅ and C₂₀ terpenoids with immunosuppressive activity. Angew. Chem. Int. Ed. 60, 25468–25476 (2021).
Article ADS CAS Google Scholar
Bian, G. et al. Metabolic engineering-based rapid characterization of a sesquiterpene cyclase and the skeletons of fusariumdiene and fusagramineol from Fusarium graminearum. Org. Lett. 20, 1626–1629 (2018).
Article CAS PubMed Google Scholar
Kumar, S., Stecher, G. & Tamura, K. MEGA7: Molecular Evolutionary Genetics Analysis version 7.0 for bigger datasets. Mol. Biol. Evol. 33, 1870–1874 (2016).
Article CAS PubMed PubMed Central Google Scholar
Huang, A. C. et al. Unearthing a sesterterpene biosynthetic repertoire in the Brassicaceae through genome mining reveals convergent evolution. Proc. Natl Acad. Sci. USA 114, 6005–6014 (2017).
Article CAS Google Scholar
Guerrero, F., Ciragan, A. & Iwaï, H. Tandem SUMO fusion vectors for improving soluble protein expression and purification. Protein Expr. Purif. 116, 42–49 (2015).
Article CAS PubMed Google Scholar
Kabsch, W. Xds. Acta Crystallogr. D 66, 125–132 (2010).
Article CAS PubMed PubMed Central Google Scholar
Evans, P. R. & Murshudov, G. N. How good are my data and what is the resolution? Acta Crystallogr. D 69, 1204–1214 (2013).
Article CAS PubMed PubMed Central Google Scholar
McCoy, A. J. et al. Phaser crystallographic software. J. Appl. Crystallogr. 40, 658–674 (2007).
Article CAS PubMed PubMed Central Google Scholar
Adams, P. D. et al. PHENIX: a comprehensive Python-based system for macromolecular structure solution. Acta Crystallogr. D 66, 213–221 (2010).
Article CAS PubMed PubMed Central Google Scholar
Emsley, P. & Cowtan, K. Coot: model-building tools for molecular graphics. Acta Crystallogr. D 60, 2126–2132 (2004).
Article PubMed CAS Google Scholar
Afonine, P. V. et al. Towards automated crystallographic structure refinement with phenix. refine. Acta Crystallogr. D 68, 352–367 (2012).
Article CAS PubMed PubMed Central Google Scholar
Schüttelkopf, A. W. & Van Aalten, D. M. PRODRG: a tool for high-throughput crystallography of protein–ligand complexes. Acta Crystallogr. D 60, 1355–1363 (2004).
Article PubMed CAS Google Scholar
Zhang, K. Gctf: real-time CTF determination and correction. J. Struct. Biol. 193, 1–12 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Zivanov, J. et al. New tools for automated high-resolution cryo-EM structure determination in RELION-3. eLife 7, e42166 (2018).
Article PubMed PubMed Central Google Scholar
Pettersen, E. F. et al. UCSF Chimera—a visualization system for exploratory research and analysis. J. Comput. Chem. 25, 1605–1612 (2004).
Article CAS PubMed Google Scholar
Trott, O. & Olson, A. J. AutoDock Vina: improving the speed and accuracy of docking with a new scoring function, efficient optimization, and multithreading. J. Comput. Chem. 31, 455–461 (2010).
CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

This work was funded by the National Key R&D Program of China (2018YFA0900400 and 2021YFC2102600), the National Natural Science Foundation of China (31670090 and 31800032) and the German Research Foundation (DFG; DI1536/7-2). The numerical calculations in this paper were carried out on the supercomputing system in the Supercomputing Center of Wuhan University. The synchrotron radiation experiments were performed at BL-1A of the Photon Factory (proposal number 2019G038) and X06SA of the SLS (proposal numbers 20191094 and 20191134). This work was also supported in part by the Platform Project for Supporting Drug Discovery and Life Science Research (Basis of Supporting Innovative Drug Discovery and Life Science Research) from AMED (JP21am0101071 (support number 1553)), Grants-in-Aid for Scientific Research from the Ministry of Education, Culture, Sports, Science and Technology, Japan (JSPS KAKENHI grants JP16H06443, JP19K15703, JP20H00490, JP20KK0173, JP21K18246), the New Energy and Industrial Technology Development Organization (NEDO, grant JPNP20011), the PRESTO program from Japan Science and Technology Agency (JPMJPR20DA) and AMED (grant JP21ak0101164). H.T. is a recipient of the JSPS Postdoctoral Fellowship for Foreign Researchers (P18404).

Author information

These authors contributed equally: Hui Tao, Lukas Lauterbach, Guangkai Bian, Rong Chen, Anwei Hou, Takahiro Mori

Authors and Affiliations

Graduate School of Pharmaceutical Sciences, The University of Tokyo, Tokyo, Japan
Hui Tao, Takahiro Mori & Ikuro Abe
Kekulé-Institut für Organische Chemie und Biochemie, Rheinische Friedrich-Wilhelms-Universität Bonn, Bonn, Germany
Lukas Lauterbach, Anwei Hou & Jeroen S. Dickschat
Department of Urology, Zhongnan Hospital of Wuhan University, School of Pharmaceutical Sciences, Wuhan University, Wuhan, China
Guangkai Bian & Tiangang Liu
Department of Urology, Zhongnan Hospital of Wuhan University, Wuhan, China
Rong Chen, Xinghuan Wang & Tiangang Liu
Collaborative Research Institute for Innovative Microbiology, The University of Tokyo, Tokyo, Japan
Takahiro Mori & Ikuro Abe
PRESTO, Japan Science and Technology Agency, Saitama, Japan
Takahiro Mori
Key Laboratory of Combinatorial Biosynthesis and Drug Discovery, Ministry of Education and School of Pharmaceutical Sciences, Wuhan University, Wuhan, China
Shu Cheng, Ben Hu, Li Lu, Xin Mu, Min Li, Zixin Deng & Tiangang Liu
Structural Biology Research Center, Institute of Materials Structure Science, High Energy Accelerator Research Organization (KEK), Tsukuba, Japan
Naruhiko Adachi, Masato Kawasaki, Toshio Moriya & Toshiya Senda

Authors

Hui Tao
View author publications
You can also search for this author in PubMed Google Scholar
Lukas Lauterbach
View author publications
You can also search for this author in PubMed Google Scholar
Guangkai Bian
View author publications
You can also search for this author in PubMed Google Scholar
Rong Chen
View author publications
You can also search for this author in PubMed Google Scholar
Anwei Hou
View author publications
You can also search for this author in PubMed Google Scholar
Takahiro Mori
View author publications
You can also search for this author in PubMed Google Scholar
Shu Cheng
View author publications
You can also search for this author in PubMed Google Scholar
Ben Hu
View author publications
You can also search for this author in PubMed Google Scholar
Li Lu
View author publications
You can also search for this author in PubMed Google Scholar
Xin Mu
View author publications
You can also search for this author in PubMed Google Scholar
Min Li
View author publications
You can also search for this author in PubMed Google Scholar
Naruhiko Adachi
View author publications
You can also search for this author in PubMed Google Scholar
Masato Kawasaki
View author publications
You can also search for this author in PubMed Google Scholar
Toshio Moriya
View author publications
You can also search for this author in PubMed Google Scholar
Toshiya Senda
View author publications
You can also search for this author in PubMed Google Scholar
Xinghuan Wang
View author publications
You can also search for this author in PubMed Google Scholar
Zixin Deng
View author publications
You can also search for this author in PubMed Google Scholar
Ikuro Abe
View author publications
You can also search for this author in PubMed Google Scholar
Jeroen S. Dickschat
View author publications
You can also search for this author in PubMed Google Scholar
Tiangang Liu
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

G.B. initiated the project and found the TrTS activity of TvTS and MpMS. R.C. and M.L. performed in vitro and in vivo analysis of TvTS and MpMS. S.C. verified the function of CgCS. X.M. constructed the plasmids and carried out fermentation. L. Lu and B.H. performed AlphaFold2 analysis and function prediction. Z.D. and X.W. supervised biological experiments. H.T. performed in vitro analysis and mutagenesis experiments. H.T. and T. Mori performed crystallization experiments. H.T., T. Mori, N.A., M.K., T. Moriya and T.S. performed the cryo-EM analysis. A.H. synthesized HexPP and 2,3-dihydro-HexPP. A.H. and L. Lauterbach performed in vitro characterization of and isotopic labelling experiments with TvTS-PT, TvTS-TC and MpMS. H.T., T. Mori, I.A., L. Lauterbach, A.H., J.S.D., G.B., R.C. and T.L. analysed the data and wrote the manuscript. I.A., J.S.D. and T.L. designed and supervised the study.

Corresponding authors

Correspondence to Ikuro Abe, Jeroen S. Dickschat or Tiangang Liu.

Ethics declarations

Competing interests

T.L. has filed a patent application relating to the function of TvTS, MpMS and CgCS and the resulting structures talaropentaene and colleterpenol. The other authors declare no competing interests.

Peer review

Peer review information

Nature thanks the anonymous reviewers for their contribution to the peer review of this work. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data figures and tables

Extended Data Fig. 1 The structure of cyclase domain of TvTS.

a, the apo-form of TvTS-TC (green). The DDXXD, NSE, and RY motifs (red box) are conserved, and the active site forming regions, including aspartic-rich metal binding DDXXD motif, the region (V234-V243) after the NSE motif, and the A156-C183 region are disordered; b, PaFS-TC in complex with neridronate (cyan cartoon with magenta sticks, PDB: 5ER8³⁵, in comparison to TvTS-TS: rmsd values of 1.6 Å for Cα-atoms, 47% amino acid sequence identity); c, partially closed conformation of TvTS-TC. The disordered regions in the apo structure, especially the DDXXD motif and the active site loop D173-D182 (shown in salmon), appear clearly structured after soaking with 2,3-dihydro-HexPP. The docking model of 2,3-dihydro-HexPP was constructed based on the observed density and two possible conformers are shown by yellow and purple sticks.

Extended Data Fig. 2 Structure based mutagenesis studies with TvTS.

a, Active site cavity of TvTS-TC. The docking model of 2,3-dihydro-HexPP was constructed based on the observed density and two possible conformers were shown as yellow and purple sticks. b, In vitro activities of wild-type TvTS and its variants for production of 1. Full-length TvTS and its variants were expressed with a maltose binding protein (MBP) fused at the N-terminus. Peak integrals for the ion chromatogram of m/z 408 were used for quantification. Wild-type production was set to 100%, bars and error bars show mean and s.d. from three biological replicates, respectively.

Extended Data Fig. 3 Active site of prenyltransferase domain of MpMS.

a, noncrosslinked MpMS-PT; b, crosslinked MpMS-PT; c, crystal structure of PaFS-PT complexed with cobalt ions and pamidronate (PDB ID: 5ERO); d, superimposed crosslinked MpMS-PT (cyan) and PaFS-PT (wheat) crystal structure; e, active site cleft of MpMS-PT; f, active site cavity of PaFS-PT.

Extended Data Fig. 4 Close-up view of the interface between PT and TC domains.

a, The residues on α5 of TC domain (green) interact with the residues on α2 (419-429) and C-terminal α14 (686-695) of the PT domain (cyan); b, close-up view of the interface between the PT domain and fitted model of TC domain. Distances indicated by grey dashed lines are given in Å.

Extended Data Fig. 5 The active site of MpMS-PT (cyan) and TC (green) domains face to each other.

a, top view; b, side view.

Extended Data Fig. 6 Mutagenesis study of MpMS.

Ion chromatograms (m/z = 408) of wildtype MpMS (black) and its enzyme variants V206F (blue) and A207W (magenta).

Extended Data Fig. 7 Product detection of triterpenes produced by chimeric class I TrTS in S. cerevisiae.

Engineered S. cerevisiae containing i) PTTC027, ii) PTTC035, iii) PTTC044, iv) PTTC58, v) PTTC59, vi) PTTC60, vii) PTTC62, viii) PTTC74, ix) PTTC114, x) Cgl13855 overexpressed under the control of GAL10 promoter; xi) blank, engineered efficient terpene precursor providing chassis S. cerevisiae YZL141. Red asterisks indicated the presence of compound 3, white asterisks indicate the similar peak produced by PTTC074.

Extended Data Table 1 Data collection and refinement statistics

Full size table

Extended Data Table 2 Data collection and refinement statistics of cryo-EM data

Full size table

Supplementary information

Supplementary Information

This file contains Supplementary Methods, Supplementary Figs. 1–59, Supplementary Tables 1–9, Supplementary Schemes 1 and 2, the synthetic procedure for compounds S1–S10, the synthetic procedure for compounds S11–S15, the amino acid sequences for AlphaFold2 prediction and supplementary references.

Reporting Summary

Peer Review File

Supplementary Tables

This file contains Supplementary Tables 10–111.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Tao, H., Lauterbach, L., Bian, G. et al. Discovery of non-squalene triterpenes. Nature 606, 414–419 (2022). https://doi.org/10.1038/s41586-022-04773-3

Download citation

Received: 06 January 2022
Accepted: 19 April 2022
Published: 01 June 2022
Issue Date: 09 June 2022
DOI: https://doi.org/10.1038/s41586-022-04773-3

This article is cited by

The structural basis of pyridoxal-5′-phosphate-dependent β-NAD-alkylating enzymes
- Takayoshi Awakawa
- Takahiro Mori
- Ikuro Abe
Nature Catalysis (2024)
Discovery of fungal onoceroid triterpenoids through domainless enzyme-targeted global genome mining
- Jia Tang
- Yudai Matsuda
Nature Communications (2024)
The microbial biosynthesis of noncanonical terpenoids
- Mengyu Ma
- Mingkai Li
- Tianyue An
Applied Microbiology and Biotechnology (2024)
Recent Advances in Microbial Production of Terpenoids from Biomass-derived Feedstocks
- Yujin Cao
- Mo Xian
Chemical Research in Chinese Universities (2024)
Biotransformation of triterpenoid ganoderic acids from exogenous diterpene dihydrotanshinone I in the cultures of Ganoderma sessile
- Xinwei Wang
- Haibo Wu
- Kun Feng
Microbial Cell Factories (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.