Introduction

Clavulanic acid (CA) is a potent inhibitor of a wide range of β-lactamases from pathogenic organisms.1, 2 It is used in combination with β-lactam antibiotics against those organisms that are known or suspected to be β-lactamase producers, to protect activity of the β-lactam antibiotics and prevent infections. The mechanism of β-lactamase inhibition is because of the structural similarity of CA with the natural substrate of the β-lactamase enzyme, that is, β-lactam antibiotic, which results in CA binding to the serine-hydroxyl group in the active site of the enzyme and causing irreversible enzyme inhibition.3, 4 CA is active against group 2 class A or D β-lactamases, including both penicillinases and cephalosporinases.5 The combination therapy of CA+β-lactam antibiotic has been very successful in preventing infections due to Gram-positive (Staphylococcus sp.) and Gram-negative (Klebsiella sp. Hemophilus sp. Proteus, Shigella, Pseudomonas.) β-lactamase-producing pathogens. Augmentin, a combination of CA and the β-lactam antibiotic, amoxicillin, is a very successful combination therapy with reported global sales of >$2.1 billion (Glaxo SmithKline).6, 7

CA belongs to the clavam group of β-lactams, characterized by the bicyclic β-lactam/oxazolidine ring system, in which its β-lactamase inhibitory property is due to the characteristic 3R, 5R stereochemistry of the strained bicyclic nucleus structure (Figure 1). This β-lactamase inhibitor was discovered in the fermentation broth of Streptomyces clavuligerus in a screening program aimed at identifying novel β-lactamase inhibitors.8 It is notable that S. clavuligerus was previously isolated for its ability to produce the β-lactam antibiotic, cephamycin C and related β-lactam antibiotics (Figure 1), and subsequently it was found that it also produces CA. The β-lactam antibiotics penicillin N and cephalosporin occur as intermediates in the production of cephamycin C. Production of both β-lactam antibiotics and a β-lactamase inhibitor by one organism, and as described later, the contiguous chromosomal arrangement of the genes responsible for the production of these two classes of compounds, and their coregulation, makes S. clavuligerus a unique case, in which both the antibiotic and a molecule to protect the antibiotic from degradation seem to have coevolved. Thus, the idea and development of Augmentin as a combination therapy seems to have been inspired by the mother nature. Subsequently, several other Streptomyces sp., including S. jumonjinensis and S. katsurahamanus, have been shown to produce both cephamycin C and CA.9 Among these, S. clavuligerus has been studied the most as a model system, and is also used for producing CA commercially.

Figure 1
figure 1

β-Lactam metabolites produced by S. clavuligerus.

Apart from cephamycin C and CA, S. clavuligerus also produces non-CA clavam type of compounds all of which contain the antipodal 3S, 5S stereochemistry in their β-lactam ring system, as compared with the 3R, 5R stereochemistry of CA, which is required for the β-lactamase inhibitory property10 (Figure 1). Therefore, the 3S, 5S clavams do not possess any β-lactamase inhibitory property, but do possess activities ranging from antibacterial and antifungal in nature. Owing to structural similarity between CA and 3S, 5S clavams, there are shared mechanisms involved in their biosynthesis.11, 12 However, this review will focus primarily on the molecular aspects of CA biogenesis in S. clavuligerus, with special emphasis on the strategies used for strain development. The biosynthesis of 3S, 5S clavams and relevant genetics have been reviewed elsewhere.13, 14

Biogenesis

The knowledge on CA biosynthesis has been obtained from several lines of investigation including classical approaches of feeding cells with labeled precursors and tracing the incorporation of the label in the CA molecule,10, 15, 16, 17 isolating or constructing non-producing mutants13, 18, 19 and characterizing the accumulated intermediates, and taking molecular approaches including, sequence and transcriptional analysis of the biosynthetic and regulatory genes.13 The biosynthetic pathway for CA is presented in Figure 2, and the description of enzymes and genes involved in the pathway and their functions is summarized in Table 1. The CA biosynthetic gene cluster is located immediately downstream from the cephamycin C gene cluster,20, 21 and stretches from carboxyethylarginine synthase ceaS 2 to gcaS;13 (Figure 3). Beyond ceaS2 on one end of the CA cluster is pcbR, a penicillin-binding protein involved in β-lactam resistance and part of the cephamycin C gene cluster.22 Beyond gcaS on the other end are orf-18 and orf-19, encoding another set of penicillin-binding proteins, which are not involved in CA biosynthesis. These are followed by orf-20, orf-21 and orf-22, encoding potential regulatory genes, which seem to have some effects on CA production. However, these genes also show pleiotropic effects, so their role in CA biosynthesis remains uncertain13, 23, 24 (Figure 3).

Figure 2
figure 2

Clavulanic acid (CA) biosynthetic pathway. gap, glyceraldehyde-3-phosphate dehydrogenase; glk, glycerate kinase.

Table 1 Genes involved in clavulanic acid (CA) pathway and their role and function
Figure 3
figure 3

Clavulanic acid (CA) biosynthetic gene cluster and its regulation (a hypothetical scheme based on published work and references cited in this article).

The ‘early’ steps

The CA pathway begins with the condensation of the 3-carbon glyceraldehyde 3-phosphate with the 5-carbon L-arginine to form N2-(2-carboxyethyl) arginine, catalyzed by carboxyethylarginine synthase (CeaS2).25 In the next reaction, N2-(2-carboxyethyl) arginine is cyclized to form a monocyclic β-lactam compound, deoxyguanidinoproclavaminate, catalyzed by β-lactam synthase (Bls2).26, 27, 28 Deoxyguanidinoproclavaminate is then hydroxylated to form guanidinoproclavaminate, catalyzed by clavaminate synthase (Cas2).29, 30, 31 Guanidinoproclavaminic acid undergoes a hydrolysis reaction catalyzed by proclavaminate amidino-hydrolase (Pah2) to form proclavaminate.21, 32 The next two consecutive reactions are again catalyzed by Cas2. In the first reaction, proclavaminate is cyclized to form the first bicyclic ring structure, dihydroclavaminic acid, which then undergoes subsequent desaturation to produce clavaminic acid.33 The enzymology involved in the ‘early’ reactions leading to the formation of clavaminic acid is well characterized and reviewed elsewhere.13 The genes encoding the ‘early’ steps, ceasS2, bls, pah and cas2, are coexpressed and coregulated. The gene immediately downstream from cas-2 is oat2, which codes for an ornithine acetyltransferase-like activity.18, 34 On the basis of mutational analysis and similarity of its transcriptional regulation to that of ‘early’ genes, it is considered a part of the ‘early’ gene cluster. However, oat2 is transcribed separately.34, 35 The oat2 mutant showed decreased CA production, however, this phenotype could be reversed by addition of arginine. Therefore, Oat2 is thought to contribute toward building arginine pools required for CA biosynthesis.

It is striking to note that in S. clavuligerus the entire ‘early’ gene cluster is duplicated. This knowledge came from several independent studies. The initial enzymology and gene cloning had demonstrated the existence of duplicate sets (paralogs) of Cas genes, cas1 and cas2, located on two different regions of the chromosome.30 cas2 is now assigned to the CA gene cluster downstream from the cephamycin C gene cluster along with other ‘early’ CA biosynthetic genes, whereas cas1 resides elsewhere on the chromosome along with 3S 5S clavam-specific genes.36, 37 Separately, it was discovered that knock-out mutations in each of the five ‘early’ CA biosynthetic genes, but not in the ‘late’ genes, led to a ‘leaky’ phenotype; that is, the mutants in each of the five genes, ceasS2, bls2, pah2, cas2 and oat2, were completely blocked in CA production when cultures were grown in a synthetic medium. However, the same mutants were still able to produce CA in a complex soy-based medium.18 This suggested the presence of a second set of the ‘early’ genes, which was expressed in the soy-based medium but not in the synthetic medium, and which functionally complemented the mutations in the ‘early’ genes, in the soy-based medium. Supporting this idea, transcriptional analysis confirmed that cas2 was expressed in both the synthetic medium and the soy-based medium, whereas its paralog, cas1, was expressed in the soy medium, but not the synthetic medium.18, 38 Eventually, the second set of the paralogous ‘early’ genes was located, characterized, and their role in the biosynthesis of 3S, 5S clavams established. Of these, ceasS1, bls1, pah1 and oat1 are clustered with 3S-, 5S-specific clavam genes (the paralog gene cluster), and located on the giant linear plasmid pSCL4, whereas cas1 is colocated with yet another set of clavam biosynthetic genes (clavam biosynthetic gene cluster) and is present on the chromosome, well separated from the cephamycin C-CA gene cluster by >1 Mb.36, 37, 39, 40, 41 The gene organization of the paralog gene cluster and the clavam biosynthetic gene cluster responsible for the production of 3S, 5S clavams has been recently reviewed elsewhere.13

In summary, this organism has two sets of the ‘early’ genes leading to the formation of clavaminic acid, in three locations on the chromosome. One complete set is clustered with the ‘late’ genes responsible for CA biosynthesis; that is, the cephamycin C-CA gene cluster, whereas the second set of genes is located within two separate clusters responsible for the 3S, 5S clavam biosynthesis.

The ‘late’ steps

Clavaminic acid arising from the ‘early’ pathway is the branch point intermediate that can either be converted to CA, or to the 3S, 5S clavams, by respective ‘late’ reactions. The ‘late’ steps involved in the formation of 3S, 5S clavams have been discussed elsewhere.13

The ‘late’ steps involved in the conversion of clavaminic acid to CA proceed via clavaldehyde as an intermediate, and entail two key structural modifications. The first modification is the inversion of the stereochemistry from 3S, 5S in clavaminic acid to 3R, 5R in clavaldehyde and CA; that is, a double ring enantiomerization reaction, which gives these later compounds their characteristic β-lactamase-inhibitory property. The second modification is the conversion of the amino group present on the side chain of clavaminic acid to a hydroxyl group, via an oxidative deamination reaction. However, reactions involved in the two modifications are not clearly understood. The DNA sequencing of the putative ‘late’ genes and characterization of the corresponding knock-out mutants have provided some insights into the nature of potential ‘late’ steps of the pathway.

The first dedicated ‘late’ reaction is the conversion of clavaminic acid to N-glycylclavaminic acid, catalyzed by glycylclavaminic acid synthase (gcaS; orf-17).42, 43 It is predicted that the double ring enantiomerization and oxidative deamination reactions then follow, leading to the formation of clavaldehyde, which possesses the 3R, 5R stereochemistry essential for the β-lactamase activity. However, the details of these reactions and how N-glycylclavaminic acid is converted to clavaldehyde remain elusive. The last reaction entails reduction of clavaldehyde to CA, catalyzed by NADPH-dependent clavaldehyde dehydrogenase (cad).18, 44, 45

Aside from gcaS and cad, there are at least six additional genes that seem to be essential for CA biosynthesis, as mutations in these genes lead to loss of CA production. The exact role of these genes in the pathway is not fully understood, although sequence similarities and analysis of knock-out mutants have provided some clues into their function.

oppA1 (orf-7) and oppA2 (orf-15) encode similar proteins with sequence similarity to periplasmic oligopeptide-binding proteins.18, 27, 46, 47 Mutants in each of these genes are unable to produce CA, suggesting that these genes are essential for CA production, but also that these genes cannot complement each other.47 However, the exact role of the putative peptide transport function in CA production is unclear. Perhaps these two gene products are responsible for transporting arginine/peptide substrates or regulatory signaling peptides involved in CA biosynthesis.47

claR (orf-8) encodes a transcriptional activator-containing sequences with significant similarity to the helix-turn-helix motif of the LysR family of transcriptional regulators, and positively regulates the ‘late’ genes oppA1, cad and cyp.44, 48 A claR mutant was blocked for CA production but accumulated clavaminic acid. In the same mutant, transcripts for ‘early genes’ ceasS1, bls1, pah1, cas2 and oat1 remained unaffected, whereas the transcripts corresponding to the ‘late’ genes oppA1, cad and cyp were absent.44, 48 These data are consistent with the role of claR as a regulator of ‘late’ steps specific to the conversion of clavaminic acid to CA.

cyp (orf-10) encodes cytochrome P450 (Cyp) hydroxylase whereas fd (orf-11) encodes a ferredoxin, an electron transport protein, which could provide functionality to the Cyp protein.18, 46, 49 A mutation in cyp leads to loss of CA formation, whereas a mutation in fd results in partial loss, suggesting that another electron transfer protein from the organism may be able to partner with Cyp in the absence of the Fd. The cyp–fd combination of genes is a potential candidate for carrying out the oxidative deamination and/or double ring enantiomerization reactions present between clavaminic acid and CA, although the in vitro evidence is yet to be established.18, 46, 49

orf-12, which has some similarity to class A β-lactamases, is essential for CA production based on mutational analysis.42, 49 Orf-12 encodes a protein with some similarity to β-lactamases and possesses the highly conserved SDN motif important for the catalytic activity of β-lactamases. However, it lacks the other conserved motifs, STFK, EPELN and KTG. The role of orf-12 in CA biosynthesis is not clear as yet.

orf-13 encodes an amino acid metabolite efflux pump protein and may be involved in the transport of CA and pathway intermediates from inside of the cell to the outside.42, 46 Efflux pumps are commonly associated with antibiotic gene clusters as a part of the resistance mechanism to protect the organism from excess build-up of potentially toxic metabolites. The mutation in orf-13 decreases CA production significantly.42, 46

orf-14 shows similarities to GCN5-acetyltransferases, and mutants in this gene are negatively affected in CA production. The acetylation could have a role in the generation of N-acetylated clavaminic acid intermediates, which have been detected earlier in uncharacterized mutants of S. clavuligerus.42, 46, 50

orf-16 shows no significant similarities to any proteins in the database. However, a mutation in orf-16 leads to a complete loss of CA formation, demonstrating the essential nature of this gene for CA production.42, 46 Interestingly, orf-16 showed accumulation of N-acetylglycylclavaminic acid and N-glycylclavaminic acid metabolites. The accumulation of the modified clavaminic acid metabolites in orf-16 mutants, and the putative acetylation role of orf-14, suggests that clavaminic acid is first converted to glycylclavaminic acid catalyzed by gcaS, then acetylated by orf-14 gene product to form N-acetylglycylclavaminic acid, and subsequently converted to N-acetylclavaminic acid by orf-16 gene product. However, more experimental data need to be generated to confirm this hypothesis.

Regulation

Most antibiotic biosynthetic gene clusters in Streptomyces are regulated in a hierarchical manner.51 At the first level, the biosynthetic genes are typically controlled by pathway-specific transcriptional regulators encoded within the gene clusters. The expression of the transcriptional regulators in turn is tightly controlled by global regulatory mechanisms, which tie the production of the antibiotic to the physiological state of the cell and morphological differentiation (formation of aerial mycelium and sporulation). The evidence suggests that CA biosynthesis in S. clavuligerus is also regulated in this hierarchical manner52 (Figure 3).

At the bottom of the hierarchy is claR, which, as discussed earlier, is a LysR type of transcriptional activator located within the CA gene cluster and positively regulates only the ‘late’ biosynthetic genes involved in the conversion of clavaminic acid to CA (see above).

At the next level is ccaR, which is located within the adjacent cephamycin C gene cluster, and encodes a Streptomyces antibiotic regulatory protein family, which shows similarity to the OmpR family of regulators but lacks any evidence of a helix-turn-helix DNA-binding motif.53, 54, 55 Knock-out mutation of ccaR abolishes both cephamycin C and CA production, which is restored by introducing the wild-type ccaR back into the mutants, suggesting that CcaR positively regulates both cephamycin C and CA production. Transcriptional analysis suggests that CcaR regulates CA production directly, by regulating the expression of the ‘early’ ceaS2-bls-pah-cas2 polycistronic transcript, and indirectly, by regulating expression of claR, which in turn controls expression of the ‘late’ genes in the CA pathway. Heptameric target sequences upstream of ceaS2 and claR, where CcaR is thought to bind and exert transcriptional control, have been identified via gel-shift and DNA footprinting analysis.55 These sequences are similar to target sequences where the Streptomyces antibiotic regulatory protein family of transcriptional regulators binds and regulates gene expression in other Streptomyces. CcaR also binds to the sequences upstream from ccaR and autoregulates its own expression, indicating the presence of an added level of control to fine tune the production of the two β-lactams in this organism.53, 54, 55

The next level of hierarchy involves several levels of global regulatory mechanisms, which allows the organism to trigger the onset of antibiotic production and morphological differentiation in response to different environmental and metabolic signals.51 Such signals include changes in growth rate, nutritional stress and environmental imbalances. These mechanisms can operate at the level of transcription, translation or post-translation of key effector genes/gene products responsible for further cascading the signal to regulate the expression of the target genes. In some cases, small MW compounds, such as γ-butyrolactones, are known to mediate such effects. Although the data from several laboratories suggests that the CA gene cluster is regulated by such global mechanisms, there are some notable differences as compared with similar mechanisms operating in other Streptomycetes. The following section covers three global regulatory mechanisms, which seem to be operating in S. clavuligerus, including those involving (a) bld genes, (b) production of autoregulators and (c) stringent response.

In Streptomyces, the bld gene-based regulon links the expression of antibiotic gene clusters to morphological differentiation, and common genes and mechanisms have been identified that regulate both antibiotic production and morphological differentiation. One such gene is bldA that encodes a leucyl-transfer RNA that recognizes UUA codons, and is involved in the translation of genes containing TTA codons. The TTA-encoding genes in Streptomyces are rare and present only in secondary metabolism regulatory genes, or genes associated with the regulation of sporulation and aerial mycelium formation. ccaR, which controls both cephamycin C and CA gene clusters, contains the rare TTA codon, which initially suggested the presence of this mode of regulation.54, 56 However, although bldA mutants of S. clavuligerus show the absence of aerial mycelium formation, as expected of this phenotype, the same mutants are not blocked for production of either CA or cephamycin C. This suggests that there must be an alternative mechanism for translation of the ccaR TTA codons, or mistranslation mechanism in the absence of bldA transfer RNA, and also suggests that CA and cephamycin C gene clusters are independent of the bldA-mediated regulation, at least under the conditions tested.56 This is an exception to the bldA-mediated control of the TTA-encoding target genes normally seen in other Streptomyces systems.

Another mode of regulation that commonly controls antibiotic production in Streptomyces involve sigma factors and sigma factor-related mechanisms, like those involving anti-sigma factors and anti-anti-sigma factors.51 Transcriptional regulation by sigma factor-related mechanisms is common in Bacillus sp. for controlling sporulation and other stress-associated responses. The presence of more than 60 sigma factors present in the S. coelicolor genome suggests that sigma factors-based regulation is also present in this organism.57

bldG encodes a putative anti-anti-sigma factor, which is thought to regulate those genes involved in morphological differentiation that are regulated by sigma factors. In S. clavuligerus, bldG mutants were blocked in morphological differentiation and production of both CA and cephamycin C, indicating the presence of bldG-mediated regulation in this organism.58, 59 Transcriptional analysis revealed that in the bldG mutant, ccaR transcription was abolished. Thus, it appears that the bldG in S. clavuligerus controls CA and cephamycin C production by regulating transcription of ccaR, perhaps mediated by other as yet unidentified proteins in the regulatory cascade.58, 59 Similarly, orf-21 located downstream of gcaS within the CA gene cluster codes for a putative sigma factor, which appears to have a role in CA production. However, its effect on CA production was variable across different laboratories, and seems to be linked closely with growth conditions.13, 24, 60

The two-component sensor kinase-response regulator system is another mechanism present in Streptomyces to regulate antibiotic production and morphological differentiation in response to the physiology of the organism. There are more than 80 sensor kinases and more than 60 response regulators present in the S. coelicolor genome.57 The two genes downstream from the CA gene cluster, orf-22 and orf-23, show similarity to sensor kinases and response regulators, respectively, from other Streptomyces.24, 61 Although the orf-22 mutation had little effect on CA production, orf-23 mutant showed severe reduction in CA levels, decreased levels of claR transcripts, and defects in growth and sporulation. Conversely, overexpression of orf-23 caused overproduction of spores and CA. This phenotype suggests that orf-23 may act as a response regulator to control growth, sporulation and antibiotic production in this organism in response to environmental signals.

The γ-butyrolactone (autoregulatory)-mediated regulation of morphological differentiation and antibiotic production is prevalent in Streptomyces.51 In this model of regulation, the autoregulators bind to their cognate-binding proteins, called as autoregulator-binding proteins, which in turn, bind to regulatory sequences (ARE) upstream of the target genes; that is, genes directly or indirectly involved in differentiation and antibiotic production to regulate gene expression at the transcriptional level.62, 63 The evidence so far suggests that the autoregulator-binding protein ARE-mediated regulation is also present in S. clavuligerus, and controls CA and cephamycin C production via ccaR.64, 65 Although small molecule autoregulators have been identified in this organism,66 the exact small molecule involved in this particular interaction has not yet been identified. ScaR (also called the Brp; γ-butyrolactone receptor protein), an autoregulator-binding protein, was identified in this organism,64 along with the corresponding ARE target sequences upstream from ccaR.64, 67 The binding of ScaR to ccaR ARE sequences has been demonstrated in vitro, and scaR mutants overproduce both CA and cephamycin C significantly. This suggests that ScaR protein interacts with its cognate small molecule (as yet unidentified), and then binds to the ccaR ARE sequences, and represses the ccaR transcription.

Although Brp (ScaR) is postulated to control ccaR directly, it also controls ccaR and claR expression indirectly, via the AdpA pathway.68 AdpA is a transcriptional regulator that shows similarity to the AraC/XylS family of transcriptional activators, contains two helix-turn-helix motifs and a UUA leucine codon in the coding region. AdpA has been characterized in S. griseus and S. coelicolor where it regulates sporulation, morphogenesis and antibiotic production. In S. clavuligerus, adpA expression is regulated negatively by ScaR (Brp), and thus, in brp-mutants, adpA expression is elevated. The adpA mutants of S. clavuligerus show a significant decrease in CA production, and also decreased transcripts levels of ccaR, claR, the ‘early’ and other biosynthetic genes. These data suggest that Brp negatively regulates AdpA, which, in turn, positively regulates ccaR and thus CA production. Furthermore, as adpA contains a UUA leucine codon and is not expressed in bldA mutant of S. clavuligerus, adpA provides linkage between bldA regulation and CA production.

Apart from ScaR, AreB is another protein that has been shown to bind to ccaR ARE sequences in gel-shift DNA-binding experiments.69 AreB belongs to the IclR family of regulatory proteins, and the corresponding gene areB is located adjacent to the leuCD operon, in opposite transcriptional orientation. The areB autoregulates its own transcription and also regulates transcription of leuCD operon. The areB mutant displays weak leucine auxotrophy, and a derepression effect on CA production, suggesting this mechanism could provide a link between primary and secondary metabolism.69 However, the derepression effect of the areB mutation on CA production and ccaR transcript levels was modest as compared with similar effects due to scaR mutation. Interestingly, the binding of AreB to ccaR ARE sequences requires a small molecule, as purified protein does not show the binding activity. The identity of this small molecule autoregulator remains to be established.

The stringent response, which is a well-studied phenomenon in E. coli, is also involved in regulating secondary metabolism in Streptomyces including CA biosynthesis in S. clavuligerus. This mechanism involves production of unusual phosphorylated nucleotide (p)ppGpp, mediated by two proteins, ppGpp synthetase (RelA) and (p)ppGpp synthetase/hydrolase (SpoT).51 These phosphorylated nucleotides have a regulatory role via binding to the β-subunit of RNAP, and regulating expression of ribosomal genes, and resulting protein synthesis, in response to amino-acid starvation and nutritional downshift conditions. In S. coelicolor, the corresponding RelA/SpoT homologs, RelA/RshA, have been identified and shown to control antibiotic biosynthesis, at least in certain physiological conditions.51 Similarly, in S. clavuligerus, a relA mutant was significantly blocked in its ability to produce both CA and cephamycin C, and the mutant was also impaired in aerial mycelium formation and sporulation; however, a less drastic effect was observed for rshA mutants.70, 71 On the basis of this evidence, it is likely that in S. clavuligerus, the stringent response to amino-acid starvation induces production of the phosphorylated nucleotides, mediated by RelA/RshA, and these nucleotides could positively influence CA production by mechanisms that need to be further elucidated.

Strain development

The strain development strategies to deliver microbial products at higher yields and lower costs have evolved from the traditional random mutation and selection approach to knowledge-based rational approaches. The latter relies upon understanding key biosynthetic and regulatory mechanisms affecting the yields, and engineering strains to eliminate the potential bottlenecks in a targeted manner. There are successful examples of the application of both these approaches for improving CA-producing S. clavuligerus strains.

Owing to the proprietary nature of the industrial screening programs, not much published literature is available, discussing specific screening strategies employed for developing high CA-producing strains. However, given that the titers of wild-type ATCC strain are reportedly in the range of 25–120 mg l−1 (depending on the media and growth conditions, which varies from laboratory to laboratory), and a recent report of titers in an industrial strain approaching 3 g l−1,6, 72 it suggests that the industrial strain improvement programs have enabled orders of magnitude improvement in CA strain productivity.

More recently, the knowledge gained on the biogenesis of CA and the availability of genetic tools in S. clavuligerus to overexpress genes, and/or to eliminate competing reactions by making directed modifications has allowed construction of improved strains in a rational manner. These efforts can be broadly classified into three approaches: (a) increasing the flow of precursors into the pathway, (b) increasing the gene dosage of key biosynthetic and/or regulatory genes and (c) eliminating competing reactions that divert the flow of pathway intermediates into non-CA products, or competing reactions that affect the overall energetics and carbon flow into the CA pathway.

As discussed earlier, arginine and glyceraldehyde-3-phosphate (G3P) are the two essential precursors of the CA pathway (Figure 2). Earlier reports that supplementation of glycerol leads to an increase in CA yields, suggested that G3P could be a limiting factor. Once glycerol is metabolized into G3P, it can either enter the glycolytic reactions and then the Krebs cycle or it can be recruited into either gluconeogenesis to produce glucose, or it can combine with arginine to begin CA biosynthesis. Thus, a gene encoding glyceraldehyde-3-phosphate dehydrogenase (gap), which is involved in channeling G3P into the glycolytic pathway via formation of 1,3, diphosphoglyceric acid and pyruvate (Figure 3), was identified, and the disrupted gap-1 mutant showed substantial increase (80–110%) in CA yields over the wild type, suggesting that in the gap-1 mutant more G3P pools were now available for CA biosynthesis.73 Further increases of >200% were obtained when arginine was fed to the gap-1 mutants, which suggests that when increased pool of G3P was available, arginine became limiting for CA biosynthesis.73 Efficient glycerol utilization to provide sufficient G3P pools also determines the CA yields. Increased gene dosage of glp operon consisting of glycerol transporter, glycerol kinase, and glycerol-3-phosphate dehydrogenase increased the CA yields further by >4.5-fold, and 7.5-fold upon supplementation with glycerol.74

Increasing gene dosage of key CA biosynthetic and/or regulatory genes has allowed construction of improved strains in a rational manner resulting in productivity increases in the range anywhere between 2- and >20-fold.23 Introduction of additional gene copies by using integrative vectors or by use of multi-copy plasmids has given promising results. The integration approach seems more practical; considering it provides higher strain stability and obviates the use of antibiotics for plasmid selections. Increasing gene dosage of the ‘early’ pathway genes, ceaS2, blsS2, cas2 and pah2, either individually or as an operon, and ‘late’ pathway genes cyp, fd, orf-12 and orf-14, resulted in increased production levels.23, 75, 76, 77 Similarly, increasing gene dosage of the two pathway-specific regulators ccaR and claR resulted in increased strain productivity. In one case, increasing gene dosage of cas2 along with ccaR reportedly gave more than a 20-fold increase in CA titer. In another strategy, ccaR and claR were overexpressed in a gap-1 mutant, giving significant increase in titers, which were further enhanced upon feeding the strain with glycerol and ornithine, presumably leading to higher levels of the two precursors, G3P and arginine, being available for CA biosynthesis.78 Thus, combining appropriate mutations with increased dosage of genes determining rate-limiting reactions in a logical manner, coupled with appropriate feeding strategies is critical for enhancing CA titers.

Elimination of competing pathways, which draw upon common pathway intermediates, or influence the overall energetics and primary metabolic pools available for CA biosynthesis, is another strategy that has been successfully applied to construct high-yielding strains.79 When cvm1, a gene encoding an enzyme specifically involved in the conversion of the branch point intermediate clavaminic acid to 3S, 5S clavams, was inactivated it resulted in increased CA production, presumably resulting from increased channeling of clavaminic acid to CA.79 Interestingly, although the cephamycin C pathway draws upon α-aminoadipate, L-cysteine and L-valine as precursors and is biochemically distinct from CA pathway, its elimination from the strain resulted in increased CA titers. Disruption of lat gene, encoding lysine ɛ-aminotransferase that converts lysine to α-aminoadipate, resulted in loss of cephamycin C production as expected, but also resulted in approximately 200–250% increase in CA production as compared with the wild type.79 Both cvm1 and lat mutations were subsequently introduced into a high titer commercial strain, which resulted in a consistent 10% increase in CA titers over the parental strain.79 Apart from increased CA productivity, elimination of structurally similar 3S,5S clavams, which are relatively toxic metabolites, is highly desirable from a regulatory standpoint, and this also improved the efficiency of the downstream CA purification processes. Notably, the improved strain carrying the cvm1 and lat mutations showed increased titers in 1 l and pilot 1000 l fermentors, thus demonstrating that this increase in productivity is scalable.79 Given that the CA titer of the commercial strain is already several orders of magnitude higher than the wild-type titers, a 10% increase is quite significant. Thus, knowledge-based targeted manipulations can be applied to strains derived using conventional mutation and selection approaches, to further enhance strain productivity by eliminating competing pathways and structurally similar molecules, and leading to a more efficient overall process. The mechanism by which elimination of the cephamycin C pathway leads to an increase in CA remains to be established. However, eliminating unrelated secondary metabolic pathways from the strain, and making general cellular resources available for the pathway of interest, could be explored further as a general approach to improve strain productivity.

The recent availability of genome sequences, the ability to construct genome scale in silico models to predict biochemical pathways and metabolic flux (flux-based models), and ability to refine such models with comparative ‘omics’ data (genomics, proteomics, transcriptomics and metabolomics) from strains with different phenotypes has opened up exciting possibilities to improve strains using newer approaches. An industrial high-producing S. clavuligerus strain obtained by iterative random mutagenesis and selection and producing CA at levels orders of magnitude higher than the wild type was subjected to transcriptome and genome-wide flux balance analysis to understand the changes that had occurred in this strain, which could explain the high-producing phenotype. Interestingly, the majority of the changes observed in the industrial strain were those that were also predicted and shown to be present in the recent wild-type derived strains constructed via directed knowledge-based strategies.80 For example, the industrial strain showed general overexpression of CA biosynthetic and regulatory genes, including ccaR, claR and adpA, whereas overexpression of the same genes in the wild-type strain led to enhanced CA production. Likewise, the changes in the expression levels of the glycerol metabolic genes in the industrial strain mimicked the phenotype associated with the gap-1 deletion mutation constructed in the wild-type background using rational approaches. These overlaps in the changes between strains obtained using random mutagenesis and screening and directed knowledge-based improvements have provided valuable insights into the key target genes and pathways responsible for overall strain productivity.

Future perspective

CA (and Augmentin) continues to have a critical role in anti-infective therapy. Newer strategies are continually needed to improve Streptomyces strains for producing CA cost-effectively and to make this therapy affordable globally. Random mutagenesis and selection has had a critical role in developing strains for the commercial production of CA. This approach strictly depends on creating a pool of diverse mutants as randomly as possible, and identifying the best performing mutants from this pool, using appropriate screening or selection strategies. On the other hand, the knowledge-based targeted strain engineering approach can be used to construct strains in a very specific manner, by adding or deleting genes, or manipulating gene expression governing the desired phenotype, accordingly. The S. clavuligerus genome sequence is now available,81 and the cost of sequencing strains is decreasing rapidly. The ability to create genome-wide metabolic models (for example, a virtual cell) based on the genome sequence and the biochemical architecture of the strain, and the ability to feed the ‘omics’-based data to build and validate such models, will further enhance our knowledge about CA biosynthesis and its regulation, and allow identification of additional targets for strain improvement in a more efficient manner.80, 82 However, the knowledge-based targeted approach is still somewhat empirical because overproduction of metabolites is a complex phenotypic trait and involves a multitude of genes and pathways, and a desired phenotype cannot always be guaranteed by making defined changes one at a time. Perhaps it is more pertinent to have a combination of random mutagenesis and selection and knowledge-based approaches as a strategy for strain improvement. A recent example of this approach is the use of ccaR promoter-reporter-based screen to identify S. clavuligerus strains, which have higher levels of ccaR expression and higher CA production.83 Additionally, protoplast fusion-based genome shuffling techniques can speed up the generation of diverse mutant libraries for phenotypic screening and selections, and increase the probability of isolating the desired mutants.84, 85, 86 The key will be to design appropriate screening or selection strategies to identify the desired phenotypes. Subsequently, identification and rationalization of the critical mutations that lead to the desired phenotype in such mutants via genomics-based models, and then reverse engineering these mutations back into a suitable robust strain, which is more suitable for a commercial process, would be an attractive way forward.

Dedication

This article is dedicated to the memory of Professor Leo Charles Vining, who made significant contributions to the natural products chemistry and biosynthesis of secondary metabolites. As a former graduate student in Leo’s laboratory at Dalhousie University, I have fond memories of our beloved Professor and mentor. Leo began his career as a traditional natural product chemist during the 1950s, a period that also belonged to the so called ‘golden era of antibiotics’. Being in the prime of his scientific youth during this period, and engaged in making significant contributions in the area of natural product biosynthesis and regulation of antibiotic production, was perhaps very timely for Leo’s career. Leo’s passion included deciphering the various biosynthetic schemes, utilized by different bacteria, fungi and actinomycetes, for the production of diverse secondary metabolites, and understanding biochemical and physiological switches regulating the production of these metabolites. During the latter part of his career, he employed molecular biological tools toward understanding the molecular basis of antibiotic biosynthesis in actinomycetes. His career included several appointments, both in government institutes and academia, where he published 283 scientific articles and 2 books. Leo received several awards including the prestigious Charles Thom Award from the Society of Industrial Microbiology (SIM) and a Fellowship of the Royal Society of Canada for his outstanding contributions. Leo was a very smart and quick thinking scientist, an effective teacher and a very compassionate mentor. He had a very effective style of communication, which made him a very inspiring leader. What was amazing about Leo was that despite scaling so many heights in his career, he was very modest and down to earth in his approaches. He was always besides us, during good times celebrating exciting results, or during challenging times sharing his insights and expert opinions to create a path forward. Not surprising, his unassuming style and demeanor won him popularity and respect among students and peers, and his laboratory was one of the most sought after places to work in the field of Streptomyces, and represented a vibrant group of graduate students and post-doctoral fellows. Leo, you will be missed!!!