Engineering the stambomycin modular polyketide synthase yields 37-membered mini-stambomycins

Su, Li; Hôtel, Laurence; Paris, Cédric; Chepkirui, Clara; Brachmann, Alexander O.; Piel, Jörn; Jacob, Christophe; Aigle, Bertrand; Weissman, Kira J.

doi:10.1038/s41467-022-27955-z

Download PDF

Article
Open access
Published: 26 January 2022

Engineering the stambomycin modular polyketide synthase yields 37-membered mini-stambomycins

Nature Communications volume 13, Article number: 515 (2022) Cite this article

5882 Accesses
7 Citations
14 Altmetric
Metrics details

Subjects

An Author Correction to this article was published on 06 October 2022

This article has been updated

Abstract

The modular organization of the type I polyketide synthases (PKSs) would seem propitious for rational engineering of desirable analogous. However, despite decades of efforts, such experiments remain largely inefficient. Here, we combine multiple, state-of-the-art approaches to reprogram the stambomycin PKS by deleting seven internal modules. One system produces the target 37-membered mini-stambomycin metabolites − a reduction in chain length of 14 carbons relative to the 51-membered parental compounds − but also substantial quantities of shunt metabolites. Our data also support an unprecedented off-loading mechanism of such stalled intermediates involving the C-terminal thioesterase domain of the PKS. The mini-stambomycin yields are reduced relative to wild type, likely reflecting the poor tolerance of the modules downstream of the modified interfaces to the non-native substrates. Overall, we identify factors contributing to the productivity of engineered whole assembly lines, but our findings also highlight the need for further research to increase production titers.

Insights into azalomycin F assembly-line contribute to evolution-guided polyketide synthase engineering and identification of intermodular recognition

Article Open access 04 February 2023

Pass-back chain extension expands multimodular assembly line biosynthesis

Article 21 October 2019

Enzymology of assembly line synthesis by modular polyketide synthases

Article 13 March 2023

Introduction

For almost 30 years, efforts have been made to leverage the modular genetic architecture of the type I polyketide synthases (PKSs) to generate novel derivatives, typically by modifying individual catalytic domains. Despite enormous progress in establishing domain structure–function relationships^1,2, such genetic manipulation remains inefficient³. Initial sight into factors potentially contributing to low product yields was provided by cryo-electron microscopy analysis of a model PKS module at multiple stages of its catalytic cycle^4,5. This work revealed that interdomain contacts are critical for establishing the various functional states of the module, and that transitions between such states rely on evolving interfaces between the domains, as well as the intervening ‘linker’ regions. This view of modular function was recently reinforced by cryo-EM/crystallographic analysis of additional modules sourced from two PKS systems^6,7. In short, PKS modules appear to be highly integrated units, thus explaining why exchange of catalytic domains for heterologous counterparts is often detrimental⁸. Collectively, these observations motivate future approaches in which modules or multi-modular subunits are employed as the basic building blocks for engineering the assembly lines^{9,10,11,12,13,14,15}. Such strategies could be exploited to engineer chimeras between multiple PKS systems, or to generate internally truncated forms of single PKSs, providing access to structurally-simplified analogues (minimal pharmacophores¹⁶) for biological evaluation.

Nevertheless, using modules requires a clear definition of their domain composition. Classically, modules encompass the three invariable domains required for monomer selection and chain extension (ketosynthase (KS), acyl transferase (AT) and acyl carrier protein (ACP)), as well as any intervening β-keto processing activities (e.g., ketoreductase (KR), dehydratase (DH), and enoyl reductase (ER)) (Fig. 1a), and thus have functional meaning. However, a domain set potentially more relevant to genetic engineering was recently suggested by the finding that KS domains in certain PKSs co-evolve with the tailoring domains located upstream in the assembly lines^17,18. To avoid confusion with alternative module definitions, we suggest that the term eXchange Unit (XU) that is used in the nonribosomal peptide synthetase (NRPS) field¹⁹ be adopted for this set of domains. Accordingly, a PKS XU begins with the modifying domains and the associated AT, and terminates with the KS that is assigned to the downstream functional module (Fig. 2). Even before these alternative domain sets were identified, engineering efforts revealed that maintaining the key ACP_n/KS_n+1 interface can, in certain cases, be critical for the function of a hybrid PKS^9,12,20.

**Fig. 1: Stambomycin polyketide synthase and structures of stambomycin derivatives produced by *S. ambofaciens* ATCC23877.**

**Fig. 2: A schematic representation of classical and alternative module definitions.**

Recently, we engineered hybrid PKSs based on both of these domain sets, by covalently tethering heterologous modules to a common donor module within a bimodular mini-PKS^21,22. Overall, our data demonstrated that the use of both classical module boundaries and XUs led to functional hybrid PKSs, and which domain sets worked best depended on the source module²². Indeed, regardless of which extremities are employed, module exchange results in non-native interdomain interactions (ACP_n/KS_n+1 or KS_n+1/ACP_n+1), and in the case of classical module boundaries, potential incompatibilities in terms of KS substrate specificity (Fig. 2)—both of which have been shown to reduce activity via detailed studies in vitro^23,24,25.

An alternative to the covalent fusion approach is to create alternative junctions between modules located on distinct subunits³. In this case, chain transfer not only depends on the employed modules, but also on the presence of compatible protein–protein interaction motifs called docking domains (DDs)²⁶ situated at the extreme C- and N-termini of the subunits (Figs. 1 and 2). In native PKSs, matched pairs of such DDs form specific complexes at intersubunit interfaces, enforcing a strict subunit ordering within the systems. To date, multiple mini-PKSs have been engineered based on non-native module combinations and suitable DDs. However, as much of this effort centered on the erythromycin (DEBS) and related macrolide PKSs, and both classical modules^11,27,28 and XUs^13,20 functioned in these contexts, the applicability of these findings to other PKS systems is not clear. Furthermore, the limited work that has been carried out on intact assembly lines^{14,29,30,31,32} has focused on engineering hybrid systems.

Here, we leverage subunit-based engineered to substantially shorten the PKS responsible for biosynthesis of the anti-cancer stambomycins 1 in Streptomyces ambofaciens ATCC23877³³. Using both the classical module definitions and PKS XUs, we generate 37-membered derivatives of the normally 51-membered macrolides (Fig. 1b), albeit in moderate yields. The identified derivatives also clarify the relative timing of the two cytochrome P450-catalyzed hydroxylations at C-28 and C-50 (Fig. 1b). Furthermore, we report an example of intersubunit crosstalk resulting in thioesterase (TE)-mediated release of shunt metabolites. Taken together with recent work by others¹², our data establish guidelines for module/subunit-based truncation of whole PKS systems, as well as targets for future study towards further boosting product titers.

Results

Design of engineering experiments based on classical modular boundaries

The stambomycin PKS comprises 25 modules distributed among 9 polypeptides (Pks1–9)³³ (Fig. 1a) (Note: throughout the text, the stambomycin genes have been numbered in accordance with ref. ³³). Module 12 of Pks4 notably houses a broad-specificity AT domain which gives rise to the six characterized stambomycin family members (A–F), which differ in the alkyl functionality at the C-26 position^33,34. To access abridged derivatives using the classical module boundaries, we reasoned that we could engineer intersubunit interfaces by suitable manipulation of docking domains. Encouragingly, the extreme C- and N-termini of all subunits (with the exception of the N-terminus of Pks1 and the C-terminus of Pks9) contain sequences with convincing homology to previously identified DDs^26,35 (the C-terminal DDs are referred to hereafter as ^CDDs and their partner N-terminal DDs as ^NDDs). By bioinformatics analysis, we were able to confidently assign the DDs acting at 6 of the 8 interfaces to the type 1a class²⁶, and the remaining two sets of DDs as type 1b³⁵ (Supplementary Fig. 1). In both cases, docking occurs between an α-helical ^CDD and a coiled-coil formed by the ^NDD, with specificity achieved via strategically placed charge:charge interactions at the complex interface (Supplementary Fig. 1)^26,35.

Among the type 1a junctions, there were notably two sets which appeared compatible in terms of the translocated substrate: Pks 3/4 + 7/8 and Pks 4/5 + 8/9 (Supplementary Fig. 2). Specifically, the functional groups at the critical α- and β-positions^17,36 of the transferred chains are identical at these junctions, and correspondingly, the downstream KSs show similarities across several sequence motifs previously correlated with substrate specificity^17,24,37 (Supplementary Fig. 2). Targeting such interfaces thus allowed us, at least in principle, to overcome the functional block to the engineered systems represented by poor recognition of the incoming substrate by the directly downstream KS domain²⁵. Ultimately, we aimed to create an interface between Pks subunits 4 and 9 for two principal reasons. First, as mentioned earlier, Pks4 is at the origin of the structural variation between the stambomycin family members, and thus we anticipated that maintaining the subunit within the hybrid system would give rise to a corresponding series of truncated analogues, providing important evidence for their identities. Second, it was genetically more practical to modify the second set of interfaces due to splitting of the genes encoding the PKS subunits between two loci (Fig. 1 and Supplementary Fig. 4).

To establish the Pks4/Pks9 junction, we initially modified the ^CDD of Pks4 (^CDD₄) to match that of Pks8 (the natural partner of the ^NDD of Pks9 (^NDD₉)), either by site-directed mutagenesis (SDM) of residues previously identified as key mediators of interaction specificity (construct ^CDD₄ SDM; Supplementary Fig. 3 and Supplementary Table 1)²⁶, or by exchange of the complete ^CDD docking α-helix of ^CDD₄ for that of ^CDD₈ (construct ^CDD₄ helix swap; Supplementary Fig. 3 and Supplementary Table 1)³⁸. Modifying the ^CDD₄ specificity code to match that of ^CDD₈ required mutation of 3 residues, while for the ^CDD₄ helix swap, the terminal 16 amino acids of ^CDD₄ were exchanged for the corresponding 15 residues of ^CDD₈ (Supplementary Fig. 3 and Supplementary Table 1). The genetic alterations were carried out in two distinct PKS contexts: (i) in the presence of the intervening subunits 5–8, which allowed for the possibility of competitive interactions between modified Pks4 and both Pks5 and Pks9; and (ii) removing the intervening subunits 5–8, thus eliminating competition for binding of Pks4 by Pks5, and of Pks9 by Pks8 (Supplementary Fig. 3). We further generated a mutant in which Pks subunits 5–8 were deleted but no modification was made to ^CDD₄, in order to judge the intrinsic capacity of Pks4 and Pks9 to interact. Furthermore, genetic engineering was carried out in parallel by both PCR-targeting³⁹ and CRISPR-Cas9⁴⁰ (Supplementary Figs. 5 and 6), in order to directly compare the efficacy of these two approaches, as well as evaluate the effect of the short scar sequence remaining in the chromosome following PCR-targeting.

The ^CDD₄ SDM and ^CDD₄ helix swap sequences were introduced in parallel into the S. ambofaciens genome. As previous work has shown that production from the stambomycin biosynthetic gene cluster requires activation by constitutive overexpression of a pathway-specific LAL (Large ATP-binding regulators of the LuxR family) regulator³³, we additionally introduced the LAL overexpression plasmid (pOE484) into each of the mutants, using the empty parental plasmid (pIB139⁴¹) as a control. In total, this strategy resulted in 20 targeted strains harboring interface mutants (where K7N refers to PCR-targeting and CPN to CRISPR-Cas9 engineering): K7N1/pIB139, K7N1/OE484, K7N2/pIB139, K7N2/OE484, K7N3/pIB139, K7N3/OE484, K7N4/pIB139, K7N4/OE484, K7N5/pIB139, K7N5/OE484, K7N6/pIB139, K7N6/OE484, CPN1/pIB139, CPN1/OE484, CPN2/pIB139, CPN2/OE484, CPN4/pIB139, CPN4/OE484, CPN5/pIB139, CPN5/OE484 (Table 1, Supplementary Data 1–3; despite extensive efforts the CPN3 mutant strain was not obtained). The principal difference between the K7N and CPN series of constructs is the presence of a 33 bp scar sequence between the modified pks4 and pks9 genes (Supplementary Fig. 4). Construct K7N6 was assembled specifically to test the effect of this region, without any further modification to ^CDD₄ and the intervening pks5–pks8 genes.

Table 1 Summary of various strains generated, as well as the stambomycin derivatives detected.

Full size table

With the exception of K7N3, CPN4, and CPN5, extracts of the engineered mutant strains harboring pOE484 were analyzed by high performance liquid chromatography heated electrospray ionization high-resolution mass spectrometry (HPLC-ESI-HRMS) on a Dionex UItiMate 3000 HPLC coupled to a Q Exactive^TM Hybrid Quadrupole-Orbitrap^TM Mass Spectrometer, and compared to extracts of the control strain containing pIB139⁴¹ as well as the wild-type S. ambofaciens, using SIEVE 2.0 screening software. All extracts were subsequently analyzed on a Thermo Scientific Orbitrap LTQXL and/or an Orbitrap ID-X Tribrid Mass Spectrometer, and the data inspected manually. Metabolites not present in the control strains are listed in Table 1 and Supplementary Table 2.

It must be noted that the low yields of the target mini-stambomycins (vide infra) precluded their purification, and consequently full structure elucidation by NMR and use as quantification standards. However, we were able to obtain convincing evidence for their identities based on exact masses obtained from high-resolution mass spectra, and detailed comparative analysis of MS and MS² data with those acquired on a series of shorter derivatives which were produced in substantially higher amounts (as detailed in the respective Supplementary figures). Further support for the identity of multiple metabolites was afforded by genetic engineering controls. In the absence of authentic standards for the stambomycin derivatives, we evaluated the use of two surrogates for quantification: the parental stambomycins 1A/B³³ (Supplementary Fig. 7–9), and linear, 50-deoxy derivatives of stambomycins 2A/B (Supplementary Figs. 10 and 11). The latter compounds were purified from a previously described strain in which the C-50 hydroxylase SamR0479 (Fig. 1a) had been inactivated⁴². This analysis notably revealed that the detection sensitivity towards the 50-deoxystambomycins 2 whether using MS or UV, was dramatically lower than for the parental stambomycins 1 (Supplementary Fig. 11). In the case of the MS analysis, we can clearly attribute this difference to the presence in 1 of β-D-mycaminose which contains an easily protonatable nitrogen, as it is absent in 2. Indeed, analysis of erythromycin A 3 which contains an alternative amino sugar, β-D-desosamine, showed it to be detected with similar sensitivity to 1 (Supplementary Fig. 12). Thus, overall, to permit an estimation of yield ranges for the engineered metabolites, we generated a standard curve based on stambomycins 1A/B for which we could detect a 25,000-fold range of concentrations (0.00001–0.25 mg mL⁻¹). Using this curve directly then provided the lower yield limit for the derivatives, while introduction of a correction factor (×206) based on the 50-deoxystambomycins 2, furnished the upper yield limit. Importantly, the maximum yields calculated directly from a limited calibration curve based on 2, did not differ substantially from those determined using the correction factor (Supplementary Table 3).

The first result is that the K7N6/OE484 mutant yielded a similar metabolic profile to S. ambofaciens wt (22 ± 3 mg L⁻¹ of stambomycins 1 (Supplementary Table 4), 73% relative yield), showing that the scar sequence impacted stambomycin production, but not dramatically (Fig. 3). By contrast, no stambomycins were observed, as anticipated, in all constructs in which Pks5–Pks8 had been removed (K7N1−3; CPN1, 2) (Fig. 3). Stambomycins 1 were present, however, in strains K7N4 and CPN4 harboring ^CDD₄ site-directed mutations and in the ^CDD₄ helix swap strain CPN5, all of which still contained Pks5–Pks8, albeit at reduced amounts relative to the wild-type (18, 23, and 14% of wt, respectively) (Fig. 3 and Supplementary Table 4). (Surprisingly, the metabolic profile of K7N5 reproducibly differed from that of CPN5, as no stambomycin-related metabolites were detected from K7N5 (Fig. 3)). These data suggested that while the mutations introduced into ^CDD₄ negatively impacted the interaction with ^NDD₅, they were not sufficient to disrupt natural chain transfer between Pks4 and Pks5. Thus, DD engineering to alter partner choice should be accompanied by removal of competing intersubunit interactions.

**Fig. 3: Analysis of metabolites derived from PKS engineering based on the classical module definition.**

We did not find any evidence in the DD engineering experiments for any of the target 37-membered metabolites (Supplementary Figs. 3 and 13). However, all strains in which stambomycin production was abolished (Table 1) exhibited four peaks in common (Fig. 3b and Supplementary Fig. 13) (peaks potentially corresponding to additional derivatives were observed, but none were shared between multiple strains). The determined exact masses and MS/MS analysis (as exemplified by strain CPN2/OE484, Fig. 3b) correspond to truncated derivatives of stambomycins A/B and C/D respectively, following premature release from modules 12 and 13 of Pks4 (compounds 4–7, Fig. 3d and Supplementary Figs. 13–15; ca. 8-fold greater yield of the module 13 products (Supplementary Table 5)). Further support for the identity of these shunt compounds was obtained by grafting the chain-terminating (type I) thioesterase (TE) domain from the C-terminal end of Pks9 to the C-terminus of Pks4 in order to force chain release at this stage. Indeed, identical compounds were produced, but at 17-fold increased yield relative to CPN2/OE484, consistent with active off-loading of the chains (Fig. 3c, Supplementary Figs. 16–18, and Supplementary Table 5).

Based on their exact masses, both sets of shunt metabolites were hydroxylated on a single carbon, while none were found to bear the β-mycaminose of the mature stambomycins, consistent with the absence of the tetrahydropyran moiety to which it is normally tethered. To determine the location of the hydroxylation and therefore the hydroxylase responsible, we inactivated in mutant CPN2/OE484 the genes samR0478 and samR0479 encoding respectively, the stambomycin C-28 and C-50 cytochrome P450 hydroxylases (Fig. 1a)⁴². While extracts of CPN2/OE484/Δ478 were unchanged relative to CPN2/OE484 (i.e., the hydroxyl group was still present), the CPN2/OE484/Δ479 mutant exhibited four peaks with masses and fragmentation patterns corresponding to the deoxy shunt products (Fig. 3, Supplementary Figs. 19–21 (compounds 8–11) and Supplementary Table 5). Taken together, these data show that the unusual online modification catalyzed by SamR0479⁴², which is necessary for macrocyclization, occurs prior to chain extension by Pks5. While SamR0478 has also been speculated to act during chain assembly⁴², hydroxylation evidently occurs downstream of Pks4, at least. The intriguing substrate structural and/or protein–protein recognition features controlling the timing of hydroxylation by these P450 enzymes remain to be elucidated.

Role of TE domains in release of the shunt metabolites

We attributed the observed shunt metabolites to the lack of productive chain translocation between Pks4 and Pks9, causing intermediates to accumulate on ACPs 12 and 13. To evaluate whether these were released by spontaneous hydrolysis or enzymatically, we further investigated the role of the Pks9 TEI⁴² in chain release, as well as that of SamR0485, a proof-reading type II TE⁴³ located in the cluster. Both TEs were disabled by site-directed mutagenesis of the active site serines (Ser to Ala) (Supplementary Fig. 17).

Interestingly, inactivation of both the type I and type II TEs reduced the yields of shunt products 4–7 relative to the parental strain CPN2/OE484 (by 66% and 27%, respectively; average of duplicate experiments) (Supplementary Fig. 18 and Supplementary Table 5). These data clearly show that premature release of the chains is catalyzed, at least in part, by both TEs in the cluster, although spontaneous liberation also occurs. While type II TEs typically interact with acyl-ACPs in trans to release blocked chains⁴³, the effect of the Pks9 TEI is less readily explained. One possibility is that the productive docking interaction between Pks4 and Pks9 allows Pks9 to adopt an alternative conformation from which the TE can off-load intermediates bound to Pks4 ACP₁₂ and ACP₁₃ (Supplementary Fig. 18).

Although this mechanism is reminiscent of that used by the pikromycin PKS to generate both 12- and 14-membered rings⁴⁴, the pikromycin TEI is separated from its alternative ACP target by a single module, while Pks9 TEI is located five or four modules downstream from ACPs 12 and 13 in the engineered system, which would seem to necessitate substantial intersubunit acrobatics. Alternatively, or in addition, such remote off-loading may involve interactions between distinct assembly lines within the context of a PKS megacomplex, as described for the bacillaene system of Bacillus subtilis⁴⁵.

Understanding the docking domain engineering via studies in vitro with recombinant domains

To better understand the results of the DD engineering, we studied in vitro the wild-type DD pairs (^CDD₄/^NDD₅ and ^CDD₈/^NDD₉), as well as binding between the modified versions of ^CDD₄ and wild-type ^NDD₉. Design of suitable expression constructs in E. coli (Supplementary Table 1 and Supplementary Data 1 and 2) was based on bioinformatics analysis of the C-terminal ends of Pks4 and Pks8, and the N-termini of Pks5 and Pks9, and secondary structure analysis using PSIPRED⁴⁶ (Supplementary Fig. 22). Overall, we expressed and purified the following proteins in recombinant form from E. coli: ^CDD₄ wt, ^CDD₄ SDM, ^CDD₄ helix swap, ^NDD₅, and ^CDD₈ (Supplementary Figs. 22 and 23, Supplementary Data 3). As ^NDD₉ proved insoluble when expressed in E. coli, two versions with alternative start sites were obtained as synthetic peptides (Met and Val; Supplementary Fig. 22, Supplementary Table 1). Analysis of the individual ^CDDs by circular dichroism (CD) confirmed their expected high α-helical content (^CDD₄ wt (100 μM): 58%; ^CDD₈ wt (100 μM): 49%), and showed no evident effect of the introduced mutations on secondary structure (Supplementary Fig. 24). All of the constructs were further confirmed to be homodimeric by size exclusion chromatography multi-angle light scattering (SEC-MALS) (Supplementary Fig. 25).

The two ^NDDs also exhibited α-helical character, though less pronounced than the ^CDDs (^NDD₅ (100 μM): 27%; ^NDD₉ Met (100 μM): 21%; ^NDD₉ Val (100 μM): 25%), and were monomeric by SEC-MALS (Supplementary Fig. 25). The latter result was surprising as type 1a ^NDDs classically form a homodimeric coiled-coil domain (Fig. 1, Supplementary Fig. 1), but we recently identified functional, monomeric type 1 ^NDDs⁴⁷. Indeed, we detected binding between the native pairs by isothermal titration calorimetry (ITC), with affinities in the range of those determined previously for matched pairs of DDs^35,47,48,49 (^CDD₄ + ^NDD₅, K_d = 14.5 ± 0.9 μM; ^CDD₈ + ^NDD₉ Met, K_d = 33 ± 2 μM; ^CDD₈ + ^NDD₉ Val, K_d = 22 ± 1 μM) (Supplementary Fig. 26). Thus, while stable homodimerization of the ^NDDs may depend on the presence of a downstream homodimeric KS domain, their monomeric character did not preclude interaction with their ^CDD partners. Based on the higher affinity of the interaction, we could identify the ^NDD₉ Val as the physiologically relevant construct. The observed binding stoichiometry (1 homodimeric ^CDD:2 monomeric ^NDDs), is consistent with the known structure of a type 1a complex in which two monomers of each DD are present (Fig. 1, Supplementary Fig. 1)²⁶. As expected, no nonspecific interaction was detected between native ^CDD₄ and ^NDD₉, explaining the lack of productive communication between subunits Pks4 and Pks9 when the intervening multienzymes are deleted (strain K7N3) (Fig. 3a).

Analysis by ITC of binding between ^CDD₄ SDM or ^CDD₄ helix swap and ^NDD₅ revealed the complete absence of interaction (Supplementary Fig. 26), and therefore that the introduced modifications were sufficient to disrupt communication between the native pair. Thus, the continued production of stambomycins 1 by K7N4, CPN4, and K7N5 harboring Pks5–Pks8 must be due to additional contacts between Pks4 and Pks5 beyond the docking domains, likely including the compatible ACP₁₃/KS₁₄ interface¹⁵. On the other hand, no interaction was detected between ^CDD₄ SDM and ^NDD₉, showing that this limited number of mutations was inadequate to induce productive contacts. This result is fully in accord with the absence of the expected mini-stambomycin products from these strains (K7N1/CPN1, Fig. 3a). By contrast, the ^CDD₄ helix swap exhibited essentially the same binding to ^NDD₉ Val as ^CDD₈ (K_d = 21.0 ± 0.3 μM), demonstrating that exchange of just this helix is sufficient to redirect docking specificity³⁸. Thus, inefficient docking is not at the origin of the failure of the ^CDD₄ helix swaps to yield chain-extended products in vivo (strains K7N2/CPN2, Fig. 3a). We could therefore conclude at this stage that the problem arose from the non-native interface generated between ACP₁₃ and KS₂₁, poor acceptance by KS₂₁ of the incoming substrate during chain transfer and/or chain extension, and/or low activity towards the modified intermediate of domains and modules acting downstream.

Attempted optimization of the stambomycin DD mutants

We aimed next to improve the engineered Pks4/Pks9 intersubunit interface in strain CPN2 (^CDD₄ helix swap + deletion of Pks5–8) by targeting helix αI of ACP₁₃, as the first 10 residues of this helix have been implicated previously in governing the interaction with the downstream KS domain at hybrid junctions⁵⁰. Notably, multiple sequence alignment of all ACPs in the stambomycin PKS located at intersubunit junctions revealed a unique sequence for each ACP in the helix αI region. This observation is consistent with a recognition code for the KS partner, and the idea that mismatching these contacts might hamper productive chain transfer (Supplementary Fig. 27). Indeed, as mentioned previously, even when docking is interrupted, contacts between ACP₁₃ and KS₁₄ are apparently sufficient to enable chain translocation between Pks4 and Pks5 (Fig. 3a). In addition, an analogous strategy of optimizing the ACP_n/KS_n+1 chain transfer interface was shown recently to substantially improve interaction between an ACP (JamC) derived from the jamaicamide B biosynthetic pathway, and the first chain extension module of the lipomycin PKS (LipPKS1)⁵¹.

In our case, the first six residues of ACP₁₃ helix αI were modified using CRISPR-Cas9, so that the full 10-residue recognition sequence matched that of ACP₂₀, the natural partner of KS₂₁ (EADQRR → PSERRQ) (Supplementary Figs. 27 and 28). Analysis of extracts of the resulting strain CPN2/OE484/ACP₁₃ SDM by LC-ESI-HRMS revealed at best small amounts (maximum yield of 0.1 mg L⁻¹) of target cyclic mini-stambomycins A/B (13), lacking the hydroxyl group introduced by SamR0478 (Fig. 4, Supplementary Fig. 29 and Supplementary Table 6). Thus, while this experiment finally yielded evidence for successful chain transfer between Pks4 and Pks9 followed by subsequent chain extension by Pks9 and TE-catalyzed release, the overall efficiency of the system remained low. Interestingly, however, the titers of the four shunt metabolites 4–7 were as much as 48-fold higher from the ACP₁₃ helix swap mutant than from CPN2/OE484. Evidently, improved interactions between ACP₁₃ and KS₂₀ facilitated release of the stalled intermediates from ACPs 12 and 13, presumably via remote action by the TEI domain.

**Fig. 4: Engineering of functional mini-stambomycin PKSs.**

Engineering mini-stambomycins by maintaining the native ACP₁₃/KS₁₄ junction

Cumulatively, the results obtained with the docking domain engineering identified KS₂₁ as one potential bottleneck in the engineered PKS. Our parallel strategy based on PKS XUs (Fig. 2) allowed us to directly test this idea. Specifically, we investigated the effects of preserving the native ^CDD₄/^NDD₅ pair and either the majority of KS₁₄, or a little more than half of the domain, resulting in a KS₁₄/KS₂₁ hybrid. For this, we used two different splice sites in KS₁₄ (Supplementary Fig. 30): (i) at the end of the domain in a highly conserved region (GTNAHV) exploited recently to efficiently swap downstream AT domains⁵²; and, (ii) at a site corresponding to a recombination hot spot identified during induced evolution of the rapamycin (RAPS) PKS⁵³, yielding the KS₁₄/KS₂₁ chimera (Fig. 4 and Supplementary Fig. 31). Both of these modifications were introduced into S. ambofaciens using CRISPR-Cas9, while simultaneously removing Pks5–Pks8, giving, respectively, after co-transformation with pOE484 and the control plasmid pIB139, strains ATCC/OE484/hy59_S1, ATCC/pIB139/hy59_S1, ATCC/OE484/hy59_S2, and ATCC/pIB139/hy59_S2.

Analysis of culture extracts relative to the controls revealed the presence in both ATCC/OE484/hy59_S1 and ATCC/OE484/hy59_S2, of a series of 37-membered metabolites (Fig. 4). The obtained comprehensive MS/MS data were consistent with the desired mini-stambomycins either as their free acids or in cyclic form (metabolites 12–14, Fig. 4, Supplementary Figs. 32–35 and Supplementary Note 1). Signals corresponding to the A/B and C/D derivatives of all metabolites were detected, providing important evidence for their identities, as well as both the C-14 hydroxylated 14 and non-hydroxylated 13 forms of the cyclic mini-stambomycins (C-14 corresponds to C-28 in the parental compounds (Fig. 1)). For detailed justification of the structure assignments of 13 and 14, see the Supplementary Note 1. It is not surprising that the corresponding E and F forms were not detected, as their yields even from the wild-type are much lower than the A–D derivatives (Fig. 3a). Critically, we obtained additional support for the identities of 12–14 by inactivation of samR0479 (which introduces the hydroxyl used for macrocyclization), which resulted in exclusive production of linear deoxy mini-stambomycins 15 (Supplementary Figs. 36–38 and Supplementary Table 7). The observation of non-hydroxylated 13 shows notably that internal hydroxylation by SamR0478 is not an absolute prerequisite for TE-catalysed macrolactonization, and argues that hydroxylation of the mini-stambomycins only takes place on the macrocyclic compound. Although compounds 13 and 14 likely incorporate the tetrahydropyran moiety of the parental stambomycins 1, which undergoes glycosylation, derivatives bearing β-D-mycaminose were not observed, presumably due to poor recognition of the overall modified macrocycle by glycosyl transferase SamR0481³³.

The combined, estimated maximum yields of the target compounds were reduced relative to the wild-type stambomycins by some 8-fold, and variable between fermentations. Notably, however, metabolites 12–14 were obtained at approximately three-fold higher titer from ATCC/OE484/hy59_S2 incorporating the hybrid KS₁₄/KS₂₁ (0.76, 2.0, and 0.74 mg L⁻¹, respectively, (3.5 mg L⁻¹ total)) than from ATCC/OE484/hy59_S1 containing the full KS₁₄ swap (0.27, 0.66, and 0.20 mg L⁻¹, respectively (1.1 mg L⁻¹ total)) (Fig. 4, Supplementary Fig. 32 and Supplementary Table 6). As observed previously, the strains also produced substantial quantities of the shunt products 4–7, while inactivation of samR0479 led correspondingly to the deoxy versions of these compounds 8–11 (Supplementary Figs. 32 and 36). The yields of the shunts were ca. 80-fold higher than those of the corresponding mini-stambomycins, with the highest titers observed in the strain incorporating the hybrid KS₁₄/KS₂₁. The amount of shunt metabolites was also ~123-fold higher than from strain CPN2/OE484 (which incorporates an ACP₁₃-^CDD₄ swap/^NDD₉-KS₂₁ interface) (Figs. 3b and 4, Supplementary Table 6). Thus, contrary to expectation, although using the KS as a fusion site improved communication between Pks4 and Pks9, it also substantially boosted TEI-mediated off-loading of stalled upstream intermediates.

In principle, such stalling could result from a slow rate of chain extension in the now hybrid acceptor module (for example, in the full KS swap construct, KS₁₄ and ACP₂₁ are completely mismatched for chain extension). To evaluate this idea, we modified ACP₂₁ within ATCC/OE484/hy59_S1 incorporating the full-length KS₁₄, targeting a sequence region previously identified as mediating intramodular communication between the KS and ACP during chain extension (Supplementary Fig. 39)^23,50. Specifically, we exchanged loop 1 and the initial portion of helix αII of ACP₂₁ for the corresponding sequence of ACP₁₄, using CRISPR-Cas9 (Supplementary Fig. 39). As we anticipated that creation of this substantially hybrid ACP might engender structural perturbation, we also engineered a minimal mutant of ACP₂₁ in which only one of the two most critical residues in the recognition motif was mutated to the corresponding amino acid in ACP₁₄ (G₁₄₉₉ of Pks9 → D; the second residue, R, of the motif is already common to the two ACPs) (Supplementary Fig. 39). Analysis of the loop/helix αII swap by HPLC-MS showed that all mini-stambomycin production had been abolished (Supplementary Fig. 40 and Supplementary Table 6), consistent with the anticipated disruption to ACP₁₄ structure. Production by the ACP site-directed mutant was not any better than by the full KS swap construct (Fig. 4, Supplementary Fig. 40 and Supplementary Table 6), as only metabolite 13 remained detectable.

In principle, the hybrid KS₁₄/KS₂₁ domain may have worked better than KS₁₄ for chain extension due to improved interaction with ACP₂₁, with stalling displaced to later modules. If this were the case, we might expect to see accumulation in the medium of shunt metabolites corresponding to the intermediate generated by module 21. Indeed, in the case of strain hy59_S2 (chimeric KS₁₄/KS₂₁) but not hy59_S1 (KS₁₄), we detected masses consistent with the A/B and C/D forms of intermediate 16 generated by module 21, at yields comparable to those of the final mini-stambomycins (Fig. 4, Supplementary Fig. 41 and Supplementary Table 6)). Correspondingly, 17, the C30-deoxy analogue of 16, was detected in the SamR0479 mutant (Supplementary Figs. 37 and 38 and Supplementary Table 7). The same metabolite 16 was identified from the ACP₂₁ G →D mutant (Fig. 4 and Supplementary Fig. 40 and Supplementary Table 6), consistent with interrupted chain transfer to KS₂₂. Taken together, these data confirm module 22 as a blockage point in the engineered systems.

Relative efficacy of PKS engineering using PCR-targeting and CRISPR-Cas9

As multiple of our core constructs were generated by both PCR-targeting and CRISPR-Cas9, we were able to directly compare the efficiency of the two techniques (Fig. 3 and Supplementary Figs. 5 and 6). Globally, our results confirm that both approaches can be employed to introduce large-scale modifications to PKS biosynthetic genes (i.e., deletions of single or multi-gene regions)^40,54,55,56. We have also demonstrated that CRISPR-Cas9 can be leveraged to specifically modify modular PKS domains⁵⁷. Of the two methods, CRISPR-Cas9 was the more rapid, as the corresponding constructs were engineered in approximately half of the time. In addition, while CRISPR-Cas9 allowed for direct modification of the host genome, PCR-targeting relied on the availability of suitable cosmids housing the target genes, and resulted in a 33 bp attB-like scar sequence in the genome (Supplementary Fig. 4)⁵⁸. In addition to hampering iterative use of the approach, the scar apparently provoked a moderate reduction in stambomycin yields in mutant K7N6 compared to the wild-type, an effect also noted upon comparison of several analogous mutant strains (e.g., K7N4 vs. CPN4, Fig. 3). Nonetheless, we did encounter certain difficulties with use of CRISPR-Cas9 (i.e., failure to obtain construct CPN3, occasional reversions to wild-type, etc.), observations motivating ongoing efforts in other laboratories to further enhance the suitability of CRISPR-Cas9 for editing PKS pathways^{57,59,60,61,62,63,64}.

Discussion

In this work, we have utilized an approach based on the state-of-the-art in PKS engineering to modify the stambomycin PKS (Fig. 5). Specifically, we aimed to remove the four PKS subunits between Pks4 and Pks9 in the assembly line which together house seven chain extension modules, to generate a series of 37-membered mini-stambomycins. While in principle such a change might have been possible by directly fusing Pks4 and Pks9 via a suitable intermodular linker, this approach would have resulted in a heptamodular subunit whose size is far in excess of the tetramodular multienzymes present in the system. We have also demonstrated recently the low efficacy of this strategy when the module downstream of the linker is N-terminal in its native subunit context, as with module 21 of Pks9²².

**Fig. 5: Summary of the engineering strategies applied in this work to the stambomycin PKS.**

As an initial approach (Fig. 5), we modified ^CDD₄ to render it compatible with ^NDD₉. The aim in this case was to induce productive communication between Pks4 and Pks9, while leaving all modular units intact. This modified PKS relied for function on both a non-native chain transfer interface (ACP₁₃/KS₂₁), and the intrinsic tolerance of the downstream KS/modules to the incoming substrate. We were optimistic this experiment might work given the structural similarities between the native substrates of KS₁₄ and KS₂₁ at least directly adjacent to the acyl terminus, as well as the fact that the stambomycin PKS generates a small family of metabolites, and therefore must exhibit some intrinsic tolerance to structural variation. Although we showed in vitro with recombinant DD pairs that a docking helix-swapped mutant of ^CDD₄ communicated effectively with ^NDD₉, chain transfer across the engineered interface did not occur in vivo, as evidenced by the accumulation of multiple shunt products. While our attempt to render the ACP₁₃/KS₂₁ junction more native by site-directed mutagenesis did result in certain target metabolites, the most significant effect was to increase the yields of the truncated chains.

Having narrowed down the biosynthetic block to events occurring downstream of the engineered junction, we next carried out interface engineering based on PKS XUs, leveraging fusion points within the KS domain (Fig. 5). In this case, sites were selected to either maintain essentially the whole of KS₁₄, or to create a hybrid KS₁₄/KS₂₁ domain. This strategy at once preserved key elements of the ACP₁₃/KS₁₄ chain transfer junction, and in the case of the almost full-length KS₁₄, ensured that the domain had the appropriate substrate specificity for the incoming chain. Interestingly, the construct incorporating the chimeric KS functioned best, producing the desired mini-stambomycins in both linear and macrocyclic forms. These data thus identify this location in the middle of the KS⁵³ as a potentially general fusion site, perhaps because it maintains key contacts with the two partner ACP domains (ACP₁₃ and ACP₂₁, in this case). Intriguingly, an analogous site within the condensation (C) domains forms the basis for the XUCs (eXchange Unit Condensation domain) of NRPS systems, which are now used routinely to generate productive hybrids⁶⁵.

Our results also showcase the intrinsically high tolerance of the Pks9 TEI domain towards shorter substrates. Indeed, the TEI domain participates in off-loading shunt metabolites from the upstream subunit, an activity which interferes with passage of the chain to subsequent modules. Unfortunately, our attempts to boost yields of the mini-stambomycins by engineering the condensation interface between KS₁₄ and ACP₂₁ were unsuccessful, both when the full ACP₂₁ recognition loop/helix αII region was swapped for that of ACP₁₄, and when a single site-directed mutation was made at a putatively critical position (Fig. 5). This result is surprising, as both of these modifications were reported in vitro to improve chain extension carried out by mismatched KS and ACP domains sourced from the erythromycin PKS (DEBS)⁵⁰. Apparently, the introduced changes were not sufficient to ensure effective communication between the KS₁₄ and ACP₂₁ domains or were in fact deleterious to function, and/or any benefit was masked by the poor tolerance of the downstream modules to the modified intermediates.

To fully judge the efficacy of this work, it is instructive to compare it to the other two examples in the literature in which full biosynthetic systems have been re-engineered to remove multiple internal modules¹². In the first, recently-reported case, the neoaureothin (Nor) hexamodular PKS was morphed into the evolutionarily-related aureothin (Aur) tetramodular PKS by removing the second bimodular subunit, NorA′. As in our work, the authors initially attempted to engineer an interaction between the monomodular subunits NorA and NorB flanking NorA′ using compatible docking domains, by exchanging the type 1b ^NDD of NorB for the type 1a ^NDD of NorA′ (the natural partner of NorA). When the target metabolite was not obtained, they relocated the fusion site to the KS-AT linker downstream of the conserved KS region in NorB, thereby maintaining the native NorA ACP-^CDD/^NDD-KS NorA′ junction. Ultimately, several linker variants had to be evaluated before a functional sequence was identified, in part by serendipity—indeed it is 1 residue longer than the native linker. Overall, the yields of the targeted chain-shortened metabolites dropped ~18-fold compared to the parental neoaureothin (to ca. 2.5 mg L⁻¹), a similar penalty as engendered by our engineering strategy. Presumably, the useful titers obtained in this experiment reflect the intrinsic amenability of the Nor PKS to conversion into an Aur PKS, as the Nor PKS likely evolved from an Aur PKS by subunit insertion¹². Nevertheless, the newly created NorA/NorB interface was also only partially functional, as product corresponding to the intermediate generated by iterative action of the upstream subunit NorA was still obtained.

The second relevant study concerns the accelerated evolution (AE) of the RAPS PKS, based on spontaneous induced homologous recombination between its component modules⁵³. As mentioned earlier, several of the resulting systems incorporated intermodular fusion sites essentially at the mid-point of the respective KS domains, and so can be compared to our best performing construct hy59_S2. Notably, yields from the hybrid RAPS PKSs from which either 3 or 6 modules were removed, were reduced by a maximum of 3-fold relative to that of the parental compound. We propose two explanations for the higher functionality of these systems relative to hy59_S2. First of all, in every case, the module downstream of the newly formed junctions in the contracted RAPS PKSs was internal to its respective subunit (unlike module 21 of Pks9), consistent with the idea that such modules boast intrinsically broader substrate specificity²². Secondly, the KS domains of the RAPS PKS exhibit unusually high mutual sequence identity (85–90%)⁶⁶. This strong similarity means, in effect, that the same KS domain acts on a large variety of substrates of differing length and functionality, contributing to the tolerance of all modules downstream of the fusion site. In contrast, the KSs in the stambomycin PKS have substantially lower sequence identity (62–80%), and thus KSs 22–24 in hy59_S2 likely represent a specificity barrier to efficient transfer and extension of the modified intermediates.

Taken together, this set of results shows that contracting PKS assembly lines represents a viable approach to accessing truncated polyketide derivatives of variable length, including macrocycles. Whether such systems are generated rationally or using an AE process, efficiency will likely be boosted by employing (i) PKSs whose modules (and in particular KS domains) exhibit a substantial degree of mutual sequence identity and thus intrinsically high substrate tolerance, or which can be adapted by mutagenesis to broaden their specificity³⁷; and, (ii) creating junctions with downstream modules which are situated at internal positions within their subunits. The data also reinforce the idea that in cases where communication at modified interfaces occurs via noncovalent protein–protein interactions, PKS XUs which include at least a portion of the KS downstream from the docking domains, should be used to boost efficiency^12,13,20. Finally, our work has identified an increase in TEI-mediated proof-reading provoked by such interface engineering. Elucidating the mechanism underlying this unexpected intersubunit release activity, and thus how to effectively suppress it, should be a profitable avenue for further boosting product titers.

Methods

Bioinformatics analysis

To underpin the interface engineering strategy, the extremities of all the stambomycin PKS subunits were analyzed to identify the boundaries of the most C-terminal and N-terminal functional domains (ACP and KS, respectively), and thus the regions potentially containing docking domains (DDs). The resulting sequences were compared by multiple sequence alignment using Clustal Omega (https://www.ebi.ac.uk/Tools/msa/clustalo)⁶⁷ to bona fide and putative DD sequences from multiple DD classes, including those present at the DEBS 2/DEBS 3 interface (type 1a, PDB ID:1PZQ [https://doi.org/10.2210/pdb1PZQ/pdb], 1PZR [https://doi.org/10.2210/pdb1PZR/pdb]²⁶), and the PikAIII/PikAIV junction (type 1b, PDB ID: 3F5H [https://doi.org/10.2210/pdb3F5H/pdb]³⁵), to allow for type classification. To identify suitable boundaries for DD heterologous expression in E. coli, the secondary structure of the putative DD regions was predicted using PSIPRED 4.0 (http://bioinf.cs.ucl.ac.uk/psipred/)⁴⁶. Analysis for potential specificity-conferring residues in the stambomycin PKS ketosynthase (KS) domains was carried out by multiple sequence alignment against model KS domains^17,24,37, using Clustal Omega⁶⁷.

General methods

All reagents and chemicals were obtained from Sigma–Aldrich, except the following: BD (tryptone, yeast extract, TSB powder), Thermo Fisher Scientific (Tris), VWR (glycerol, NaCl, NaNO₃), ADM, France (NutriSoy flour), and New England Biolabs (T4 DNA ligase, restriction enzymes). Oligonucleotide primers and two additional synthetic DNA fragments for CPN4 and CPN5 constructs were synthesized by Sigma–Aldrich (Supplementary Data 1). The docking domains ^NDD₉ Val and ^NDD₉ Met (Supplementary Table 1) were obtained as synthetic peptides from GeneCust. DNA sequencing of PCR products was performed by Sigma–Aldrich and Eurofins.

PCR reactions were performed with Taq DNA polymerase (Thermo Fisher Scientific), or Phusion High-Fidelity DNA polymerase (Thermo Fisher Scientific) when higher fidelity was required. Isolation of DNA fragments from agarose gel, purification of PCR products and extraction of plasmids were carried out using the NucleoSpin® Gel and PCR Clean-up or NucleoSpin® Plasmid DNA kits (Macherey Nagel, Hoerdt, France).

Strains and media

E. coli BL21 strains were obtained from Novagen. Unless otherwise specified, all E. coli strains were cultured in LB medium (yeast extract 10 g, tryptone 5 g, NaCl 10 g, distilled water up to 1 L, pH 7.0)⁶⁸ or on LB agar plates (LB medium supplemented with 20 g L⁻¹ agar) at 37 °C. Streptomyces ambofaciens ATCC23877 and the derived mutants were grown in TSB (TSB powder 30 g (tryptone 17 g, soy 3 g, NaCl 5 g, K₂HPO₄ 2.5 g, glucose 2.5 g), distilled water up to 1 L, pH 7.3) or on TSA plates (TSB medium supplemented with 20 g L⁻¹ agar), and sporulated on SFM⁶⁹ agar plates (NutriSoy flour 20 g, D-mannitol 20 g, agar 20 g, tap water up to 1 L) at 30 °C. All strains were maintained in 20% (v/v) glycerol in 2 mL Eppendorf tubes and stored at −80 °C.

For fermentation of S. ambofaciens ATCC23877 and its mutants, spores were streaked on TSA with appropriate antibiotics and after incubation 48 h at 30 °C, a loop of mycelium was used to inoculate 7 ml of MP5 medium (yeast extract 7 g, NaCl 5 g, NaNO₃ 1 g, glycerol 36 mL, MOPS 20.9 g, distilled water up to 1 L, pH 7.4) supplemented with selective antibiotics and sterile glass beads, followed by incubation at 30 °C and 200 rpm for 24–48 h. Finally, the seed culture was centrifuged and resuspended into 2 mL fresh MP5 before being inoculated into 50 mL MP5 medium in a 250 mL Erlenmeyer flask, and cultivated at 200 rpm and 30 °C for 4 days.

PCR-targeting-based genetic engineering

The following protocol applies to mutant ATCC/OE484/K7N1, but the same overall procedure was used to construct mutants K7N2/OE483, K7N2/pIB139, K7N3/OE483, K7N3/pIB139, K7N4/OE483, K7N4/pIB139, K7N5/OE483, K7N5/pIB139, K7N6/OE483, and K7N6/pIB139. To render the BAC BAA9ZA8 proficient for selection following conjugation, its chloramphenicol resistance gene was replaced using a PCR-targeting approach, by a kanamycin resistance gene cassette sourced from pIJ776³⁹, resulting in BAC1 (Supplementary Data 2). The cassette attL + aac(3)IV + oriT + attR was amplified from the plasmid pSPM88T⁷⁰ using primers 9996 and 9997 (Supplementary Data 1), affording PCR amplicon PCR-K7N1. The PCR fragment was then electro-transformed into BW25113/pKD20/BAC1⁷¹, giving rise to mutant BAC1_K7N1 (in which the C-terminus of pks4 was modified and the genes pks5–pks8, were deleted)³⁹. The BAC1_K7N1 was then introduced into E. coli ET12567/pUZ8002⁷² and then transferred to S. ambofaciens wild-type via intergeneric conjugation. The resulting exconjugants (ATCC/K7N1_aac(3)IV + oriT) were selected for their apramycin resistance and kanamycin sensitivity (i.e., a phenotype consistent with successful double cross-over). The correct mutations were confirmed by PCR and sequencing. Subsequently, the disruption cassette was excised using the excisionase and integrase of pSAM2 encoded by pOSK111⁷⁰, leaving a 33 bp scar sequence (mutant ATCC/K7N1). Successful removal of the cassette was verified by PCR and DNA sequencing. Finally, the LAL regulator overexpression plasmid pOE484³³ or the parental vector pIB139⁴¹ was introduced into the strain giving rise to mutants K7N1/OE484 and K7N1/pIB139, respectively. An analogous PCR-targeting approach was also employed to inactivate samR0478 and samR0479 using appropriate BACs⁴² (Supplementary Data 2).

CRISPR-Cas9-mediated genetic engineering

Plasmids pCRISPomyces-2 (and associated cloning and screen protocols)⁴⁰ used for construction of all mutants except ATCC/hy59_S1 and ATCC/hy59_S2 and pCRISPR-Cas9⁵⁵ were used for CRISPR-Cas9-based genome editing. The two systems differ in the way in which Cas9 is expressed; in the case of pCRISPomyces-2, the nuclease is expressed constitutively, while in the pCRISPR-Cas9 system, its expression is under inductive control by thiostrepton (Tsr). The crRNA sequence was selected to match the DNA segment which contains NGG on its 3′ end (N is any nucleotide, and the NGG corresponds to the protospacer-adjacent motif (PAM)). The annealed crRNA fragment and two homologous arms (HAL and HAR, flanking the target region) were sequentially inserted into the delivery plasmid pCRISPomyces-2 using the restriction sites BbsI and XbaI, respectively, to afford the specific recombinant plasmid for each mutant (Supplementary Fig. 6). Correspondingly, an sgRNA cassette (tracrRNA + crRNA) and two homologous arms were inserted into the plasmid pCRISPR-Cas9 using sites NcoI, SnaBI, and StuI, respectively (Supplementary Fig. 31). In addition, the crRNA was designed to be located within the region to be deleted (Supplementary Fig. 6) to avoid Cas9-catalyzed cleavage occurring in the genome of the resulting mutant. In the case of site-directed mutants, additional DNA fragments containing the targeted mutations were inserted between the two homologous arms. In addition, the DNA sequence with the fragments identical to the crRNA was modified, so as to avoid subsequent Cas9-catalyzed cleavage of the obtained mutants (Supplementary Figs. 6, 28 and 39).

Overexpression and purification of docking domains

The wild-type docking domains (^CDD₄, ^NDD₅, ^CDD₈, ^NDD₉ Val, and ^NDD₉ Met) and mutant docking domains (^CDD₄ SDM, ^CDD₄ helix swap) were amplified from genomic DNA of S. ambofaciens wild-type and the relevant mutants, using forward and reverse primers incorporating BamHI and HindIII restriction sites, respectively (Supplementary Data 1). The PCR amplicons were digested using FD BamHI and FD HindIII, and then ligated into the equivalent sites of vector pBG102 (Center for Structural Biology, Vanderbilt University). In the case of all ^CDDs which lacked aromatic residues, a tyrosine residue (codon TAT incorporated in the forward primer, Supplementary Table 1) was added at the N-terminal ends (so as not to interfere with docking with the ^NDD partner) to allow efficient monitoring by UV-Vis during the purification, as well as reliable measurement of protein concentration necessary for binding studies by ITC.

The resulting constructs pBG102-^NDD₅, pBG102-^CDD₈, and pBG102-^NDD₉ were used to transform E. coli BL21 (DE3). For ^CDD₄ and its mutants, these were transformed into Rosetta^TM 2(DE3), as these constructs contain 8 codons rarely used in E. coli. Positive transformants were selected on LB agar supplemented with kanamycin (50 μg mL⁻¹) (25 μg mL⁻¹ chloramphenicol was also added for expression in Rosetta^TM 2(DE3)). A single colony was transferred to LB (10 mL) supplemented with antibiotics, and the culture grown at 37 °C and 200 rpm overnight. The 1 mL overnight culture was used to inoculate LB media (1 L) supplemented with appropriate antibiotics, and then incubated at 37 °C and 200 rpm to an optical density of 0.8, at which point protein synthesis was induced by the addition of IPTG (final concentration 0.1 mM). After incubation at 18 °C and 200 rpm for 18 h, cells were collected by centrifugation at 8000 × g for 30 min, resuspended in 40 mL protein purification buffer A (50 mM Tris-HCl, 400 mM NaCl, 10 mM imidazole, pH 8.0), and lysed by sonication. Following centrifugation at 20,000 × g and filtration using a 0.45 μm membrane, the soluble cell lysates were loaded onto 2 × 5 mL HisTrap HP (GE Healthcare) columns (two 5 mL columns in series) equilibrated in buffer A, and purified by preparative protein purification chromatography using an ÄKTA Avant system. The following program was applied: sample loading, 1 mL min⁻¹; washing, 2 mL min⁻¹, 10 column volumes of buffer A; elution, 2 mL min⁻¹, 5 column volumes of buffer B (50 mM Tris-HCl, 400 mM NaCl, 250 mM imidazole, pH 8.0); elution, 2 mL min⁻¹, 2 column volumes of buffer C (50 mM Tris-HCl, 400 mM NaCl, 500 mM imidazole, pH 8.0).

All His₆-SUMO-tagged proteins were collected (fractions containing the protein of interest were selected based on the UV chromatography and SDS-PAGE gel), and transferred into dialysis bag containing His₆-tagged human rhinovirus 3 C protease (H3C) (1–2 μM). The dialysis bag was then placed into a container filled with buffer D (50 mM Tris-HCl, 400 mM NaCl, pH 8.0), and the cleavage allowed to proceed at 4 °C overnight. The resulting proteins, which incorporated a non-native N-terminal GPGS sequence, were then separated from the remaining His₆-tagged SUMO and His₆-tagged human rhinovirus 3 C protease by reloading onto the 2 × 5 mL HisTrap HP columns pre-equilibrated in buffer A. Purification was then carried out with the following program: sample loading, 1 mL min⁻¹; washing, 2 mL min⁻¹, 4 column volumes of buffer A; elution, 2 mL min⁻¹, 2 column volumes of buffer B; elution, 2 mL min⁻¹, 2 column volumes of buffer C. The untagged docking domains passed through the column during the washing step, and were collected and concentrated to 5–7 mL using an Amicon Ultra 3000 MWCO centrifuge filter (Millipore Corp).

Subsequently, the concentrated docking domains were loaded onto a size exclusion chromatography column (Superdex 75 26/60 column, GE Healthcare) equilibrated in buffer GF (20 mM HEPES, 100 mM NaCl, 0.5 mM TCEP, pH 7.5). Following a concentration step, the purity of the purified proteins was verified by SDS-PAGE, and their concentrations were determined by NanoDrop (Thermo Scientific) with extinction coefficients calculated using the ExPASy ProtParam tool (https://web.expasy.org/protparam/)⁷³.

Isothermal titration calorimetry measurements

Isothermal titration calorimetry (ITC) measurements were performed at 20 °C in buffer GF using a MicroCal ITC200 (Malvern Instruments) (A2F Plateforme ASIA: Approches fonctionnelles et Structurales des InterActions cellulaires). A 300 μL aliquot of ^NDD₅ at 70 μM was placed in the calorimeter cell and titrated with 700 μM of the ^CDD₄s (^CDD₄ wild-type, ^CDD₄ SDM, and ^CDD₄ helix swap) in the syringe. In the case of the binding experiments between ^NDD₉ Met and ^CDD₈, the ^CDD₈ (700 μM) was added to the ^NDD₉ Met (80 μM in the cell), while for the binding between ^NDD₉ Val and the ^CDDs (^CDD₈, ^CDD₄ wild-type, ^CDD₄ SDM, and ^CDD₄ helix swap), the ^CDDs (700 μM) were added to ^NDD₉ Val in the cell (120 μM). The ITC experiments were then carried out as follows initial waiting time 120 s, initial injection of 0.5 μL over 1 s followed by 19 serial injections of 2 μL over 4 s, separated by an interval of 120 s. For each experiment, the reference power was set to 5 μcal⁻¹, stirring speed to 750 rpm, and the high feedback mode was selected. Two independent titrations were performed for each combination of DDs. The heat of reaction per injection (μcal s⁻¹) was determined by integration of the peak areas using the Origin 7.0 (OriginLab) software, assuming a one-site binding model (consistent with the solved structures of the types of DDs^26,35), yielding the best-fit values for the heat of binding (ΔH), the stoichiometry of binding (N), and the dissociation constant (K_d). The heats of dilution of the DDs were determined by injecting them into the cell containing buffer only, and these were subtracted from the corresponding binding data prior to curve fitting.

In some cases, when a plateau (binding saturation) was not reached at the final titration step, and the problem could not be solved by increasing the concentration of DD in syringe, we initially placed ^CDD/^NDD complex in the ITC cell (at the concentration of the two partners reached in the previous titration), filled the syringe with additional DD, and performed a second titration experiment. This procedure was then repeated until binding saturation was reached. To fit the data, the MicroCal Concat ITC software version 1.00 was used to combine two ITC data files together. The critical parameter dimensionless constant (C-value) was then calculated as follows:

$${{{{{\rm{C}}}}}}={{{{{{\rm{NK}}}}}}}_{{{{{{\rm{a}}}}}}}{[{{{{{\rm{M}}}}}}]}_{{{{{{\rm{T}}}}}}}$$

(1)

where K_a is the binding constant, [M]_T is the total macromolecular concentration in the cell, and N is the stoichiometry of interaction. A reliable ITC binding isotherm is evidenced by ITC data with C-values > 1 (the optimal range is 5 < C < 500)⁷⁴, as was the case for all of our measurements.

Circular dichroism measurements

Circular dichroism (CD) spectra were recorded on a Chirascan CD (Applied Photophysics, United Kingdom) (IBS-Lor UMS 2008 Plateforme de Biophysique et Biologie Structurale) at 0.5 nm intervals in the wavelength range of 180–260 nm at 20 °C, using a temperature-controlled chamber. A 0.01 cm quartz cuvette containing 30 μL of docking domain at 100 μM, a 0.1 cm cuvette with 200 μL of sample at 10 μM, and a 1 cm cuvette containing 1.5 mL of sample at 1 μM, were used for all the measurements. All measurements were performed at least in triplicate, and sample spectra were corrected for buffer background by subtracting the average spectrum of buffer alone. The CD spectra were deconvoluted using the deconvolution software CDNN2.1⁷⁵ to estimate the secondary structure present in the docking domains.

SEC-MALS analysis of docking domains

The oligomeric state of all the docking domains was determined by size exclusion chromatography multi-angle light scattering (SEC-MALS) on the A2F Plateforme ASIA. For this, SEC was first carried out on a Superdex75 10/300 column (GE Healthcare) at 20 °C using a flow rate of 0.5 mL min⁻¹ in HEPES buffer (20 mM HEPES, 100 mM NaCl, 0.5 mM TCEP, pH 7.5) using an ÄKTA-Purifier FPLC (GE Healthcare). Multi-angle light scattering (MALS) was measured using a MiniDAWN TREOS II (Wyatt Technology), while refractometry was monitored using an Optilab T-rEX (Wyatt Technology). Data processing was carried out with the manufacturer-supplied software (ASTRA 6.1, Wyatt Technology) to determine the protein oligomerization state.

LC-ESI-HRMS analysis of fermentation metabolites and purified docking domains

The fermentation broth of Streptomyces was centrifuged at 4000 × g for 10 min. The stambomycins and their derivatives were then extracted from the mycelia, by first resuspending the cells in 40 mL distilled water, followed by centrifugation (4000 × g, 10 min, repeated 3×) to remove water-soluble components³³. After decanting the water, the cell pellets were weighed and extracted with methanol by shaking at 150 rpm for 2 h at room temperature. Thereafter, the methanol extracts were filtered to remove the cell debris, followed by rotary evaporation to dryness. The obtained extracts were then dissolved in methanol, whose volume was determined according to the initial weight of the mycelia (70 μL methanol to 1 g of initial cell pellet). The resulting mycelial crude extracts were then passed through a 0.4 μm syringe filter and analyzed in positive electrospray mode (ESI⁺) by HPLC-HRMS at the Université de Lorraine on either a Thermo Scientific Orbitrap LTQXL or an Orbitrap ID-X Tribrid Mass Spectrometer) (Plateau d’Analyse Structurale et Métabolomique (PASM) SF4242 EFABA) using an Alltima™ C18 column (2.1 × 150 mm, 5 μm particle size). Separation was carried out with Milli-Q water containing 0.1% formic acid (A) and acetonitrile containing 0.1% formic acid (B) using the following elution profile: 0–48 min, linear gradient 5–95% solvent B; 48–54 min, constant 95% solvent B; 54–60 min, constant 5% solvent B. Mass spectrometry operating parameters were: spray voltage, 5 kV; source gases were set respectively for sheath gas, auxiliary gas and sweep gas at 30, 10, and 10 arbitrary units min⁻¹; capillary temperature, 275 °C; capillary voltage, 4 V; tube lens, split lens and front lens voltages 155, −28, and −6 V, respectively. Due to the much lower sensitivity of the Orbitrap LTQXL relative to the Orbitrap ID-X Tribrid as evidenced by comparative analysis of identical samples on the two instruments, we introduced a 10× correction factor to the yields determined using the Orbitrap LTQXL (Supplementary Tables 4–7).

The purified docking domains in buffer GF were diluted with Milli-Q water to a concentration of 50 μM and injected onto an Alltima™ C18 column (2.1 × 150 mm, 5 μm particle size). Analysis was carried out with Milli-Q water containing 0.1% TFA (A) and acetonitrile containing 0.1% TFA (B), using the elution profile: 0–15 min, constant 10% solvent B; 15–20 min, linear gradient of 10% solvent B to 95%; 20–25 min, constant 10% solvent B. Mass spectrometry operating parameters were set as above.

Metabolite profiling of engineered strains

Comparative analysis of fermentation extracts of all strains containing pOE484 except K7N3, CPN4 and CPN5, relative to control mutant containing empty plasmid pIB139, was conducted at ETH Zurich on a Dionex Ultimate 3000 HPLC system coupled to a Thermo Scientific^TM Q Exactive^TM Hybrid Quadrupole-Orbitrap mass spectrometer. MS-settings: spray voltage 3.5 kV; capillary temperature 320 °C; sheath gas (52.50), auxiliary gas (13.75), sweep gas (2.75); probe heater 437.50 °C; S-Lens RF (50), positive mode, resolution 70.000; AGC target 1e6, microscans 1, maximum IT 75 ms, scan range 200–1800 m/z. Chromatographic separation was obtained using a Phenomenex Kinetex 2.6 µm XB-C18 150 × 4.6 mm column with solvents (A, H₂O + 0.1% formic acid) and (B, MeCN + 0.1% formic acid) and the following gradient: flow rate 0.7 mL min⁻¹, 20% B for 2 min, 20–98% B over 18 min, 98% B for 5 min, 98–20% B in 0.5 min and 20% B for 4 min. Metabolic differences within the obtained data (Supplementary Table 8) were identified using SIEVE 2.0 screening software (Thermo Fischer Scientific), applying the default settings for component extraction of small molecules, except that of the base peak minimum intensity, which was set to 5000000.

Quantification of metabolites

Yields of the native stambomycins were rigorously evaluated by generating a calibration curve using a previously-purified mixture of stambomycins 1A/B³³, over the concentration range of 0.00001–0.25 mg mL⁻¹. This approach yielded a linear correlation between the quantity of metabolite and the respective peak area in the extracted ion chromatogram (EIC) (the areas of the peaks corresponding to the parental ions [M + H]⁺ and [M + 2H]²⁺ were used systematically) (Supplementary Fig. 9 and Supplementary Table 2).

To allow for estimating yields of the shorter, engineered metabolites that notably lacked the β-D-mycaminose of 1, we purified 50-deoxystambomycins 2A/B from a previously described strain of S. ambofaciens in which the C-50 hydroxylase had been inactivated (S. ambofaciens ATCC/OE484/Δ479)⁴². For this, crude extracts of the strain were fractionated using preparative reverse-phase HPLC (Agilent 1260 Infinity system, equipped with a Phenomenex Luna 5 μm C18 column (21.2 × 250 mm)). Deionized water (Milli-Q, Millipore) + 0.05% TFA (solvent A) and acetonitrile + 0.05% TFA (solvent B) were used as the mobile phase. Purification was achieved using an elution gradient of 5–100% solvent B over 50 min, followed by 100% solvent B for 10 min. Product elution was monitored by UV-Vis at 210, 254, and 280 nm. Fractions containing the 50-deoxystambomycins 2 were further purified by semi-preparative reverse-phase HPLC (Agilent 1260 Infinity system, equipped with a Phenomenex Kinetex 5 μm C18 column (10 × 250 mm)). 50-Deoxystambomycins 2A/B (0.85 mg) were purified using an elution gradient of 40–60% solvent B for 30 min, followed by a gradient shift from 60–100% over 5 min, and finally isocratic 100% solvent B over 5 min. The same procedure also yielded purified 50-deoxystambomycins 2C/D (0.3 mg) and 2F (0.3 mg). Compound identities were confirmed by HPLC-MS analysis using a Thermo Scientific^TM Q Exactive^TM Hybrid Quadrupole-Orbitrap mass spectrometer (ETH Zurich), as described earlier. The final masses of the purified 50-deoxystambomycins 2A/B were determined to two decimal points of accuracy using a Mettler Toledo Excellence XS204 analytical balance.

Analysis of the 50-deoxystambomycins 2 at the same concentration range as 1 (Supplementary Fig. 11) revealed a substantially lower detection efficiency under our conditions (Orbitrap ID-X Tribrid Mass Spectrometer at the Université de Lorraine). Direct comparative analysis in triplicate of identical concentrations (0.25 mg mL⁻¹) of wild-type stambomycins 1A/B and 50-deoxystambomycins 2A/B with detection by MS using the most abundant singly- and doubly-charged ions ([M + H]⁺ and [M + 2H]²⁺ for the stambomycins 1A/B and [M−3H₂O−CO₂ + H]⁺ and [M−3H₂O−CO₂ + 2H]²⁺ for the 50-deoxystambomycins 2A/B), and by UV-Vis at 254 and 238 nm, confirmed that the sensitivity towards 1 was 206-fold higher by MS and ca. 30-fold higher by UV-Vis. We further showed that the presence of an alternative amino sugar, β-D-desosamine, in a model macrolide correlated with improved MS sensitivity, by analysis of commercial erythromycin A 3 at a range of concentrations (0.00005–0.05 mg mL⁻¹).

As the standard curve generated from the parental stambomycins 1A/B spanned a 25,000-fold range of concentrations and was therefore more reliable, we used it to convert peak areas measured for all stambomycin derivatives (based on the [M + H]⁺ and [M + Na]⁺ ions for metabolites 4−11, and [M + H]⁺ for 12−17 (as no [M + Na]⁺ peaks were present)) into yields, and then introduced a correction factor of 206 derived from the 50-deoxystambomycins 2A/B in order to obtain overall yield range estimates for the metabolites (Supplementary Tables 4–7). Direct use of a limited calibration curve produced from the 50-deoxystambomycins 2A/B gave similar results (Supplementary Table 3).

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

All data supporting the findings of this study are available within the manuscript. The docking domain structures used in the DD analysis are available in the PDB under the following accession codes: 1PZQ, 1PZR (type 1a), and 3F5H (type 1b). The raw HPLC-MS data have been deposited in the data repository DOREL (Données de la Recherche Lorraines) [https://doi.org/10.12763/PEYXHP]. Source data are provided with this paper.

Change history

06 October 2022
A Correction to this paper has been published: https://doi.org/10.1038/s41467-022-33524-1

References

Weissman, K. J. The structural biology of biosynthetic megaenzymes. Nat. Chem. Biol. 11, 660–670 (2015).
Article CAS PubMed Google Scholar
Keatinge-Clay, A. T. The uncommon enzymology of cis-acyltransferase assembly lines. Chem. Rev. 117, 5334–5366 (2017).
Article CAS PubMed PubMed Central Google Scholar
Weissman, K. J. Genetic engineering of modular PKSs: from combinatorial biosynthesis to synthetic biology. Nat. Prod. Rep. 33, 203–230 (2016).
Article CAS PubMed Google Scholar
Dutta, S. et al. Structure of a modular polyketide synthase. Nature 510, 512–517 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Whicher, J. R. et al. Structural rearrangements of a polyketide synthase module during its catalytic cycle. Nature 510, 560–564 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Bagde, S. R., Mathews, I. I., Fromme, J. C. & Kim, C.-Y. Modular polyketide synthase contains two reaction chambers that operate asynchronously. Science 374, 723–729 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Cogan, D. P. et al. Mapping the catalytic conformations of an assembly-line polyketide synthase module. Science 374, 729–734 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Hans, M., Hornung, A., Dziarnowski, A., Cane, D. E. & Khosla, C. Mechanistic analysis of acyl transferase domain exchange in polyketide synthase modules. J. Am. Chem. Soc. 125, 5366–5374 (2003).
Article CAS PubMed Google Scholar
Ranganathan, A. et al. Knowledge-based design of bimodular and trimodular polyketide synthases based on domain and module swaps: a route to simple statin analogues. Chem. Biol. 6, 731–741 (1999).
Article CAS PubMed Google Scholar
McDaniel, R., Kao, C. M., Hwang, S. J. & Khosla, C. Engineered intermodular and intramodular polyketide synthase fusions. Chem. Biol. 4, 667–674 (1997).
Article CAS PubMed Google Scholar
Menzella, H. G. et al. Combinatorial polyketide biosynthesis by de novo design and rearrangement of modular polyketide synthase genes. Nat. Biotechnol. 23, 1171–1176 (2005).
Article CAS PubMed Google Scholar
Peng, H., Ishida, K., Sugimoto, Y., Jenke-Kodama, H. & Hertweck, C. Emulating evolutionary processes to morph aureothin-type modular polyketide synthases and associated oxygenases. Nat. Commun. 10, 3918 (2019).
Article ADS PubMed PubMed Central Google Scholar
Miyazawa, T., Hirsch, M., Zhang, Z. & Keatinge-Clay, A. T. An in vitro platform for engineering and harnessing modular polyketide synthases. Nat. Commun. 11, 80 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Gokhale, R. S., Tsuji, S. Y., Cane, D. E. & Khosla, C. Dissecting and exploiting intermodular communication in polyketide synthases. Science 284, 482–485 (1999).
Article ADS CAS PubMed Google Scholar
Wu, N., Tsuji, S. Y., Cane, D. E. & Khosla, C. Assessing the balance between protein-protein interactions and enzyme-substrate interactions in the channeling of intermediates between polyketide synthase modules. J. Am. Chem. Soc. 123, 6465–6474 (2001).
Article CAS PubMed Google Scholar
Wang, S., Dong, G. & Sheng, C. Structural simplification of natural products. Chem. Rev. 119, 4180–4220 (2019).
Article CAS PubMed Google Scholar
Zhang, L. et al. Characterization of giant modular PKSs provides insight into genetic mechanism for structural diversification of aminopolyol polyketides. Angew. Chem. Int. Ed. Engl. 56, 1740–1745 (2017).
Article CAS PubMed Google Scholar
Keatinge-Clay, A. T. Polyketide synthase modules redefined. Angew. Chem. Int. Ed. Engl. 56, 4658–4660 (2017).
Article CAS PubMed PubMed Central Google Scholar
Bozhüyük, K. A. J. et al. De novo design and engineering of non-ribosomal peptide synthetases. Nat. Chem. 10, 275–281 (2018).
Article PubMed Google Scholar
Chandran, S. S., Menzella, H. G., Carney, J. R. & Santi, D. V. Activating hybrid modular interfaces in synthetic polyketide synthases by cassette replacement of ketosynthase domains. Chem. Biol. 13, 469–474 (2006).
Article CAS PubMed Google Scholar
Cortés, J. et al. Repositioning of a domain in a modular polyketide synthase to promote specific chain cleavage. Science 268, 1487–1489 (1995).
Article ADS PubMed Google Scholar
Massicard, J.-M., Soligot, C., Weissman, K. J. & Jacob, C. Manipulating polyketide stereochemistry by exchange of polyketide synthase modules. Chem. Commun. 56, 12749–12752 (2020).
Article CAS Google Scholar
Kapur, S., Chen, A. Y., Cane, D. E. & Khosla, C. Molecular recognition between ketosynthase and acyl carrier protein domains of the 6-deoxyerythronolide B synthase. Proc. Natl Acad. Sci. USA 107, 22066–22071 (2010).
Article ADS CAS PubMed PubMed Central Google Scholar
Klaus, M. et al. Protein-protein interactions, not substrate recognition, dominate the turnover of chimeric assembly line polyketide synthases. J. Biol. Chem. 291, 16404–16415 (2016).
Article CAS PubMed PubMed Central Google Scholar
Klaus, M., Buyachuihan, L. & Grininger, M. The ketosynthase domain constrains the design of polyketide synthases. ACS Chem. Biol. 15, 2422–2432 (2020).
Article CAS PubMed Google Scholar
Broadhurst, R. W., Nietlispach, D., Wheatcroft, M. P., Leadlay, P. F. & Weissman, K. J. The structure of docking domains in modular polyketide synthases. Chem. Biol. 10, 723–731 (2003).
Article CAS PubMed Google Scholar
Kim, B. S. et al. An unexpected interaction between the modular polyketide synthases, erythromycin DEBS1 and pikromycin PikAIV, leads to efficient triketide lactone synthesis. Biochemistry 41, 10827–10833 (2002).
Article CAS PubMed Google Scholar
Watanabe, K., Wang, C. C. C., Boddy, C. N., Cane, D. E. & Khosla, C. Understanding substrate specificity of polyketide synthase modules by generating hybrid multimodular synthases. J. Biol. Chem. 278, 42020–42026 (2003).
Article CAS PubMed Google Scholar
Tang, L., Fu, H. & McDaniel, R. Formation of functional heterologous complexes using subunits from the picromycin, erythromycin and oleandomycin polyketide synthases. Chem. Biol. 7, 77–84 (2000).
Article CAS PubMed Google Scholar
Traitcheva, N., Jenke-Kodama, H., He, J., Dittmann, E. & Hertweck, C. Non-colinear polyketide biosynthesis in the aureothin and neoaureothin pathways: an evolutionary perspective. ChemBioChem 8, 1841–1849 (2007).
Article CAS PubMed Google Scholar
Huang, J. et al. Gene replacement for the generation of designed novel avermectin derivatives with enhanced acaricidal and nematicidal activities. Appl. Environ. Microbiol. 81, 5326–5334 (2015).
Article ADS CAS PubMed PubMed Central Google Scholar
Reeves, C. D. et al. Production of hybrid 16-membered macrolides by expressing combinations of polyketide synthase genes in engineered Streptomyces fradiae hosts. Chem. Biol. 11, 1465–1472 (2004).
Article CAS PubMed Google Scholar
Laureti, L. et al. Identification of a bioactive 51-membered macrolide complex by activation of a silent polyketide synthase in Streptomyces ambofaciens. Proc. Natl Acad. Sci. USA 108, 6258–6263 (2011).
Article ADS CAS PubMed PubMed Central Google Scholar
Aigle, B., Challis, G., Laureti, L., Song, L. & Leblond, P. Stambomycin and derivatives, their production and their use as drugs. Patent WO2011009938A2 (2011).
Buchholz, T. J. et al. Structural basis for binding specificity between subclasses of modular polyketide synthase docking domains. ACS Chem. Biol. 4, 41–52 (2009).
Article CAS PubMed PubMed Central Google Scholar
Nguyen, T. et al. Exploiting the mosaic structure of trans-acyltransferase polyketide synthases for natural product discovery and pathway dissection. Nat. Biotechnol. 26, 225–233 (2008).
Article CAS PubMed Google Scholar
Murphy, A. C., Hong, H., Vance, S., Broadhurst, R. W. & Leadlay, P. F. Broadening substrate specificity of a chain-extending ketosynthase through a single active-site mutation. Chem. Commun. 52, 8373–8376 (2016).
Article CAS Google Scholar
Weissman, K. J. The structural basis for docking in modular polyketide biosynthesis. ChemBioChem 7, 485–494 (2006).
Article CAS PubMed Google Scholar
Gust, B., Challis, G. L., Fowler, K., Kieser, T. & Chater, K. F. PCR-targeted Streptomyces gene replacement identifies a protein domain needed for biosynthesis of the sesquiterpene soil odor geosmin. Proc. Natl Acad. Sci. USA 100, 1541–1546 (2003).
Article ADS CAS PubMed PubMed Central Google Scholar
Cobb, R. E., Wang, Y. & Zhao, H. High-efficiency multiplex genome editing of Streptomyces species using an engineered CRISPR/Cas system. ACS Synth. Biol. 4, 723–728 (2015).
Article CAS PubMed Google Scholar
Wilkinson, C. J. et al. Increasing the efficiency of heterologous promoters in actinomycetes. J. Mol. Microbiol. Biotechnol. 4, 417–426 (2002).
CAS PubMed Google Scholar
Song, L. et al. Cytochrome P450-mediated hydroxylation is required for polyketide macrolactonization in stambomycin biosynthesis. J. Antibiot. 67, 71–76 (2014).
Article CAS Google Scholar
Heathcote, M. L., Staunton, J. & Leadlay, P. F. Role of type II thioesterases: evidence for removal of short acyl chains produced by aberrant decarboxylation of chain extender units. Chem. Biol. 8, 207–220 (2001).
Article CAS PubMed Google Scholar
Kittendorf, J. D., Beck, B. J., Buchholz, T. J., Seufert, W. & Sherman, D. H. Interrogating the molecular basis for multiple macrolactone ring formation by the pikromycin polyketide synthase. Chem. Biol. 14, 944–954 (2007).
Article CAS PubMed PubMed Central Google Scholar
Straight, P. D., Fischbach, M. A., Walsh, C. T., Rudner, D. Z. & Kolter, R. A singular enzymatic megacomplex from Bacillus subtilis. Proc. Natl Acad. Sci. USA 104, 305–310 (2007).
Article ADS CAS PubMed Google Scholar
Buchan, D. W. A. & Jones, D. T. The PSIPRED protein analysis workbench: 20 years on. Nucleic Acids Res. 47, W402–W407 (2019).
Article CAS PubMed PubMed Central Google Scholar
Risser, F. et al. Towards improved understanding of intersubunit interactions in modular polyketide biosynthesis: docking in the enacyloxin IIa polyketide synthase. J. Struct. Biol. 212, 107581 (2020).
Article CAS PubMed Google Scholar
Whicher, J. R. et al. Cyanobacterial polyketide synthase docking domains: a tool for engineering natural product biosynthesis. Chem. Biol. 20, 1340–1351 (2013).
Article CAS PubMed Google Scholar
Dorival, J. et al. Characterization of intersubunit communication in the virginiamycin trans-acyl transferase polyketide synthase. J. Am. Chem. Soc. 138, 4155–4167 (2016).
Article CAS PubMed Google Scholar
Kapur, S. et al. Reprogramming a module of the 6-deoxyerythronolide B synthase for iterative chain elongation. Proc. Natl Acad. Sci. USA 109, 4110–4115 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Porterfield, W. B., Poenateetai, N. & Zhang, W. Engineered biosynthesis of alkyne-tagged polyketides by type I PKSs. iScience 23, 100938 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Yuzawa, S. et al. Comprehensive in vitro analysis of acyltransferase domain exchanges in modular polyketide synthases and its application for short-chain ketone production. ACS Synth. Biol. 6, 139–147 (2017).
Article CAS PubMed Google Scholar
Wlodek, A. et al. Diversity oriented biosynthesis via accelerated evolution of modular gene clusters. Nat. Commun. 8, 1206 (2017).
Article ADS PubMed PubMed Central Google Scholar
Huang, H., Zheng, G., Jiang, W., Hu, H. & Lu, Y. One-step high-efficiency CRISPR/Cas9-mediated genome editing in Streptomyces. Acta Biochim. Biophys. Sin. 47, 231–243 (2015).
Article CAS PubMed Google Scholar
Tong, Y., Charusanti, P., Zhang, L., Weber, T. & Lee, S. Y. CRISPR-Cas9 based engineering of actinomycetal genomes. ACS Synth. Biol. 4, 1020–1029 (2015).
Article CAS PubMed Google Scholar
Zeng, H. et al. Highly efficient editing of the actinorhodin polyketide chain length factor gene in Streptomyces coelicolor M145 using CRISPR/Cas9-CodA(sm) combined system. Appl. Microbiol. Biotechnol. 99, 10575–10585 (2015).
Article CAS PubMed Google Scholar
Zhang, J. J. & Moore, B. S. Site-directed mutagenesis of large biosynthetic gene clusters via oligonucleotide recombineering and CRISPR/Cas9 targeting. ACS Synth. Biol. 9, 1917–1922 (2020).
Article CAS PubMed PubMed Central Google Scholar
Myronovskyi, M., Rosenkränzer, B. & Luzhetskyy, A. Iterative marker excision system. Appl. Microbiol. Biotechnol. 98, 4557–4570 (2014).
Article CAS PubMed Google Scholar
Mo, J. et al. Efficient editing DNA regions with high sequence identity in actinomycetal genomes by a CRISPR-Cas9 system. Synth. Syst. Biotechnol. 4, 86–91 (2019).
Article PubMed PubMed Central Google Scholar
Tong, Y. et al. Highly efficient DSB-free base editing for streptomycetes with CRISPR-BEST. Proc. Natl Acad. Sci. USA 116, 20366–20375 (2019).
Article ADS CAS PubMed PubMed Central Google Scholar
Zhao, Y., Li, G., Chen, Y. & Lu, Y. Challenges and advances in genome editing technologies in Streptomyces. Biomolecules 10, 734 (2020).
Alberti, F. & Corre, C. Editing streptomycete genomes in the CRISPR/Cas9 age. Nat. Prod. Rep. 36, 1237–1248 (2019).
Article CAS PubMed Google Scholar
Wang, Q. et al. Dual-function chromogenic screening-based CRISPR/Cas9 genome editing system for actinomycetes. Appl. Microbiol. Biotechnol. 104, 225–239 (2020).
Article CAS PubMed Google Scholar
Ye, S., Enghiad, B., Zhao, H. & Takano, E. Fine-tuning the regulation of Cas9 expression levels for efficient CRISPR-Cas9 mediated recombination in Streptomyces. J. Ind. Microbiol. Biotechnol. 47, 413–423 (2020).
Article CAS PubMed Google Scholar
Bozhüyük, K. A. J. et al. Modification and de novo design of non-ribosomal peptide synthetases using specific assembly points within condensation domains. Nat. Chem. 11, 653–661 (2019).
Article PubMed Google Scholar
Aparicio, J. F. et al. Organization of the biosynthetic gene cluster for rapamycin in Streptomyces hygroscopicus: analysis of the enzymatic domains in the modular polyketide synthase. Gene 169, 9–16 (1996).
Article CAS PubMed Google Scholar
Sievers, F. et al. Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega. Mol. Syst. Biol. 7, 539 (2011).
Article PubMed PubMed Central Google Scholar
Sambrook, J., Fritsch, E. F. & Maniatis, T. Molecular cloning: a laboratory manual. (Cold Spring Harbor Laboratory Press, 1989).
Kieser, T., Bibb, M. J., Buttner, M. J., Chater, K. F. & Hopwood, D. A. Practical Streptomyces Genetics. (John Innes Foundation, 2000).
Raynal, A., Karray, F., Tuphile, K., Darbon-Rongère, E. & Pernodet, J.-L. Excisable cassettes: new tools for functional analysis of Streptomyces genomes. Appl. Environ. Microbiol. 72, 4839–4844 (2006).
Article ADS CAS PubMed PubMed Central Google Scholar
Datsenko, K. A. & Wanner, B. L. One-step inactivation of chromosomal genes in Escherichia coli K-12 using PCR products. Proc. Natl Acad. Sci. USA 97, 6640–6645 (2000).
Article ADS CAS PubMed PubMed Central Google Scholar
MacNeil, D. J. et al. Analysis of Streptomyces avermitilis genes required for avermectin biosynthesis utilizing a novel integration vector. Gene 111, 61–68 (1992).
Article CAS PubMed Google Scholar
Gasteiger, E. et al. in The Proteomics Protocols Handbook (ed. Walker, J. M.) 571–607 (Humana Press, 2005).
Wiseman, T., Williston, S., Brandts, J. F. & Lin, L. N. Rapid measurement of binding constants and heats of binding using a new titration calorimeter. Anal. Biochem. 179, 131–137 (1989).
Article CAS PubMed Google Scholar
Böhm, G., Muhr, R. & Jaenicke, R. Quantitative analysis of protein far UV circular dichroism spectra by neural networks. Protein Eng. 5, 191–195 (1992).
Article PubMed Google Scholar
Keatinge-Clay, A. T. A tylosin ketoreductase reveals how chirality is determined in polyketides. Chem. Biol. 14, 898–908 (2007).
Article CAS PubMed Google Scholar
Lim, J. et al. Synthesis of the C1-C27 fragment of stambomycin D validates modular polyketide synthase-based stereochemical assignments. Org. Lett. 23, 7439–7444 (2021).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We acknowledge financial support from the Agence Nationale de la Recherche (grant number ANR-16-CE92-0006-01, PKS STRUCTURE to K.J.W.), the Université de Lorraine, the Centre National de la Recherche Scientifique (CNRS), and the IMPACT Biomolecules project of the Lorraine Université d’Excellence (Investissements d’avenir—ANR 15-004 to L.S., C.J., B.A., and K.J.W.). J.P. acknowledges funding by the European Research Council (ERC) under the European Union’s Horizon 2020 Research and Innovation Program (grant agreement No. 742739). S. Collin and B. Chagot are thanked for help with the analysis of the DDs in vitro, A. Kriznik for assistance with the biophysical experiments, J. Grosjean for help with the HPLC-MS analysis, S. Rousselot for contributing to development of the genetic engineering strategy, and J.-M. Girardet and T. Dhalleine of the Functional and Structural Approaches to Cellular InterActions platform (ASIA, University of Lorraine-INRAE; https://a2f.univ-lorraine.fr/en/asia-2/), for help with the ITC and SEC-MALS, respectively. We are also grateful for access to the Orbitrap ID-X Tribrid system of the Structural and Metabolomics Analyses Platform for the LC-MS/MS analyses (PASM, SF4242, Université de Lorraine, EFABA, Vandoeuvre‐lès‐Nancy, France).

Author information

Li Su
Present address: Max-Planck-Institute for Terrestrial Microbiology, Department of Natural Products in Organismic Interactions, 35043, Marburg, Germany

Authors and Affiliations

Université de Lorraine, CNRS, IMoPA, F-54000, Nancy, France
Li Su, Christophe Jacob & Kira J. Weissman
Université de Lorraine, INRAE, DynAMic, F-54000, Nancy, France
Li Su, Laurence Hôtel & Bertrand Aigle
Université de Lorraine, LIBio, F-54000, Nancy, France
Cédric Paris
Institute of Microbiology, Eidgenössische Technische Hochschule (ETH) Zurich, 8093, Zurich, Switzerland
Clara Chepkirui, Alexander O. Brachmann & Jörn Piel

Authors

Li Su
View author publications
You can also search for this author in PubMed Google Scholar
Laurence Hôtel
View author publications
You can also search for this author in PubMed Google Scholar
Cédric Paris
View author publications
You can also search for this author in PubMed Google Scholar
Clara Chepkirui
View author publications
You can also search for this author in PubMed Google Scholar
Alexander O. Brachmann
View author publications
You can also search for this author in PubMed Google Scholar
Jörn Piel
View author publications
You can also search for this author in PubMed Google Scholar
Christophe Jacob
View author publications
You can also search for this author in PubMed Google Scholar
Bertrand Aigle
View author publications
You can also search for this author in PubMed Google Scholar
Kira J. Weissman
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

L.S. constructed plasmids, generated and fermented recombinant strains, analyzed and interpreted HPLC-MS data, expressed and purified recombinant docking domains, carried out biophysical analysis of DD interactions, and generated all of the manuscript figures. L.H. and C.J. constructed certain plasmids, mutant BACs and/or mutant strains. C.P. performed LC-ESI-HRMS analyses. A.B. and J.P. carried out HPLC-MS analyses and metabolic profiling. C.C. purified the 50-deoxystambomycins. B.A., C.J., and K.J.W. designed the research and supervised the project. K.J.W. performed in silico analyses and wrote the manuscript, with contributions from all authors.

Corresponding authors

Correspondence to Christophe Jacob, Bertrand Aigle or Kira J. Weissman.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Ryan Seipke and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Description of Additional Supplementary Files

Supplementary Data 1

Supplementary Data 2

Supplementary Data 3

Reporting Summary

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Su, L., Hôtel, L., Paris, C. et al. Engineering the stambomycin modular polyketide synthase yields 37-membered mini-stambomycins. Nat Commun 13, 515 (2022). https://doi.org/10.1038/s41467-022-27955-z

Download citation

Received: 08 February 2021
Accepted: 21 December 2021
Published: 26 January 2022
DOI: https://doi.org/10.1038/s41467-022-27955-z

This article is cited by

Insights into azalomycin F assembly-line contribute to evolution-guided polyketide synthase engineering and identification of intermodular recognition
- Guifa Zhai
- Yan Zhu
- Yuhui Sun
Nature Communications (2023)
Enzymology of assembly line synthesis by modular polyketide synthases
- Martin Grininger
Nature Chemical Biology (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.