Introduction

A201A is a complex aminoacylnucleoside antibiotic that has recently been totally synthetized chemically,1 but that was reported much earlier to be produced naturally by Saccharothrix mutabilis subsp. capreolus (formerly Streptomyces capreolus) NRRL3817.2 Curiously, the same nucleoside antibiotic has been also reported to be isolated from crude extracts of a deep-sea bacterium named Marinactinospora thermotolerans SCSIO 006523 A201A is very active against Gram-positive aerobic and anaerobic bacteria, as well as most Gram-negative anaerobic species. However, it is much less toxic to aerobic Gram-negative bacteria, some fungi and mammals.4 Its chemical structure includes a moiety derived from D-rhamnose, the N6, N6-dimethyl-3′-amino-3′-deoxyadenosyl (aminonucleoside) moiety of puromycin, an α-methyl-p-coumaric acid (a polyketide) and an unsaturated furanose moiety closely related to similar structures found in hygromycin A (Figure 1).5 The structural basis for both Hygromycin A and A201A antibiotics binding and inhibition to ribosomes have been recently showed by Polikanov et al.6 The similarities of A201A structure with puromycin and hygromycin A antibiotics strongly suggest that certain enzymes, and hence the corresponding genes in the A201A biosynthetic pathway, may be related to their counterparts in the puromycin and hygromycin A biosynthetic pathways.7, 8, 9, 10, 11 Homologs of the ata open reading frames (ORFs) have indeed been found with for at least 14 ORFs of the hygromycin A biosynthetic cluster in S. hygroscopicus9 and as we showed previously a set of five consecutive genes involved in the biosynthesis of the aminonucleoside moiety of A201A12 and their deduced products (AtaP3, AtaP4, AtaP5, AtaP7 and AtaP10) are similar to their counterparts from the pur cluster of S. alboniger (Pur3, Pur4, Pur5, Pur7 and Pur10, respectively), genes implicated in the biosynthesis of the aminonucleoside moiety of puromycin. Not surprisingly, ataP4, ataP5 and ataP10 can complement corresponding mutations in the pur cluster in S. alboniger.12 Here, we describe the cloning, heterologous expression and organization of the A201A biosynthetic ata gene cluster from S. mutabilis subsp. capreolus NRRL3817.

Figure 1
figure 1

Structures of puromycin (a), hygromycin (b) and A201A (c).

Experimental Procedures

Strains, plasmids and culture conditions

The bacterial strains used in this study are listed in Table 1. Saccharothrix mutabilis subsp. capreolus NRRL3817 that produces A201A, Streptomyces lividans 66–1326, Streptomyces lividans 66-TK21 and Streptomyces colelicolor M145 have all been described previously (Table 1). The E. coli strains DH5α, BW25113(pIJ790) and ET12567(pUZ8002) were employed for DNA amplification, gene disruption by PCR-targeting13 and Streptomyces conjugation, respectively. The pPACS2 Streptomyces-E. coli shuttle plasmid, which incorporates the FC31 attP-int system and confers resistance to thiostrepton, could be stably maintained in Streptomyces by specific integration at the attB site of the genome. The plasmid pIJ773, containing the apramycin resistance gene aac(3)IV, was used to obtain the disruption cassette by PCR-amplification. pEM4neo, a plasmid derived from pEM414 in which the bla gene (860 bp) is replaced with neo (774 bp) from SuperCos115 by λ-Red-mediated recombination, was used for gene expression in Streptomyces.

Table 1 Strains and plasmids used in this study

E. coli strains were grown in liquid or agar LB medium. S. mutabilis subsp. capreolus was grown in NE or MEY medium,12 whereas S. lividans and the other different Streptomyces strains generated in this study were grown in either solid SFM13 or liquid YEME medium.15 Modified S medium (1% starch, 0.4% Bacto-peptone, 0.4% yeast extract, 50 mg l−1 KH2PO4, 200 mg l−1 K2HPO4, 2 mM MgSO4) was employed to study A201A production. When required, antibiotics were added at the following concentrations: 100 μg ml−1 ampicillin; 10 and 25 μg ml−1 thiostrepton for liquid and solid media, respectively; 50 μg ml−1 kanamycin; 25 μg ml−1 chloramphenicol; 50 μg ml−1 apramycin. A201A was kindly provided by Dr H.A. Kirst (Eli Lilly Research Laboratories). The antimicrobial activity of the Streptomyces strains was determined by the agar disc method using Micrococcus luteus as the test organism as indicated previously.16

Nucleic acid methodology

DNA manipulations, preparation of competent cells, and storage and transformation of Streptomyces and E. coli were performed as previously described.15, 17 To prepare a gene library from S. mutabilis subsp. capreolus NRRL3817, total DNA was obtained and partially digested with Sau3AI as indicated previously.12 The resulting 40–80 kb DNA fragments were collected from sucrose gradients and then ligated into the BamHI-digested pPACS2 plasmid.18 The library was screened by colony hybridization on nitrocellulose Hybond-N membranes (Amersham) using fragments from the ends of the cosmid inserts of pCAR13 and pCAR23 as DNA probes described,16, 19 isolating plasmid pIES. This plasmid contained an insert of ~60 kb that included the A201A biosynthetic (ata) gene cluster from S. mutabilis subsp. capreolus. Appropriate restriction fragments from the aforementioned cosmids were subcloned into the pBS plasmid and sequenced. For heterologous expression of the ata cluster in Streptomyces, the pIES plasmid was introduced into S. lividans 66–1326, S. lividans 66-TK21 and S. coelicolor (Table 1) using standard integrative methods. Thus, S. lividans 66–1326 (pIES) (named S. lividans ata), S. lividans 66-TK21 (pIES) and S. coelicolor M145 (pIES) were generated (Table 1).

To define the ata cluster, orf1, ata 11 and merR in-frame deletions were generated in pIES via λ-Red-mediated recombination.20 Disruption cassettes (1.4 kb) containing the aac(3)IV apramycin resistance gene were generated by PCR amplification using the vector pIJ773 and specific primers. These primers contained 39 nt regions that match the sequence adjacent to the gene to be inactivated and 19–20 nt regions matching the ends of the disruption cassette. The primers 5′-orf1/3′-orf1, 5′-ata11/3′-ata11 and 5′-merR/3′merR (Table 2) were then used to generate the orf1 (1774ATG/TGA890), ata11 (30325GTG/TGA31035) and merR (32153GTG/TGA31773) disruption cassettes, respectively. Using this procedure, the plasmids pIESΔorf1, pIESΔatt11 and pIESΔmerR were generated in E. coli BW25113, and subsequently introduced by conjugation into S. lividans 66–1326. Site-specific integration of apramycin-resistant clones into the genome was checked by PCR. For ata11-mutant complementation, ata11 was amplified with the High fidelity expand PCR system (Roche, Germany) using the primers 5′-ata11ext and 3′-ata11ext (Table 2). To facilitate subcloning, the EcoRI and XbaI restriction sites were included in 5′-ata11ext and 3′-ata11ext, respectively. The PCR product was inserted into EcoRI/XbaI linearized pEM4neo, generating pATA11, which was introduced into the S. lividans 66–1326(pIESΔatt11) strain by protoplast transformation.

Table 2 Oligonucleotides used in this work

A201A determination

A201A or A201A-like material was obtained from culture filtrates (20 ml−1), the pH adjusted to pH 8.5 and the material extracted with chloroform in a rotary shaker as described elsewhere.21 Organic fractions were evaporated to dryness and dissolved in ethanol (50 μl; 70% (v/v)). A201A was identified and quantified by HPLC with a quaternary pump (Delta 600, Waters). Chromatography was performed through a reverse phase of a C18 Nova-Pack column (4 μm, 3.9 × 300 mm) with the solvents A (0.1 M ethylacetate, pH 6) and B (0.1% trifluoroacetic acid in acetonitrile). A and B, conditioned with helium, were used as the mobile phases at a flow of 0.5 ml min−1 for a total analysis time of 60 min. A linear gradient of 0–100% eluent B was applied and the column temperature was kept constant at 25 ºC. A Waters 2487 dual absorbance detector was used at 254 and 280 nm and the data obtained were analyzed using the Waters Empower Software (Milford, MA, USA). The retention time for commercial A201A was used as a reference.

To purify A201A, the filtrate from a culture (200 ml) of S. lividans 66–1326 (pIES) was collected and extracted with chloroform as described above. The extract was evaporated to dryness in a vacuum, dissolved in 0.1 ml of EtOH 70% (v/v) and developed by preparative TLC on 2 mm Silica Gel60 F254 (Merck, Damstadt, Germany) using ethylacetate/ethanol (6:1, v/v) as the solvent. The plates were examined under UV light (254 nm) and the band migrating as A201A was removed and extracted twice with chloroform (1 ml). This TLC purification process was repeated twice and the final extract was dried under a vacuum and then dissolved in 0.05 ml EtOH 70% (v/v). Samples (25 μl) of this solution were subjected to matrix-assisted laser desorption ionization-time of flight on a Bruker ReflexIII spectrometer (SIDI, Universidad Autónoma Madrid, Spain) and a 2,5 dihydroxybenzoic acid matrix was used. NaI was added to the preparation to improve ionization. The mass spectrometry calculation for C37H50N6O14+Na was 825.5.

In addition, A201A was identified using the Ard2 phosphotransferase enzymatic assay described previously.19

For A201A-resistance assays, suspensions of Streptomyces spores were streaked on agar plates containing different antibiotic concentrations (0–1000 μg ml−1). Growth was scored after a 5-day incubation at 30 ºC.

Computer analysis

Currently available methods were employed to analyze the nucleotide and amino acid sequences.22, 23, 24 The sequence analyzed here was submitted to the EMBL database (accession number X84374). The amino acid sequences were analyzed using BLAST against the Swiss-Prot/TrEMBL protein database (http://www.expasy.org/tools/blast/).

Results and Discussion

Identification and expression of the full ata cluster in heterologous hosts

In actinomycetes, antibiotic biosynthetic gene clusters are generally situated in a single stretch of DNA that includes the genes for self-resistance, export, enzymatic activities and usually, but not always, regulation.25 A profound examination of antibiotic biosynthetic gene clusters requires the use of a wide variety of recombinant DNA techniques, including the genetic modification of relevant strains that produce them, a requisite that can often not be fulfilled. A reasonable approach to overcome this problem is to express the antibiotic gene clusters in a heterologous host amenable to gene manipulation, like Streptomyces lividans or S. coelicolor.26, 27, 28 In the case of A201A, many years ago we successfully identified two resistance determinants, ard116 and ard2,19 as well as seven ORFs involved in the biosynthesis of the antibiotic.12 Moreover, we next isolated a plasmid named pIES from a S. mutabilis subsp. capreolus gene library, where we sequenced the full putative ata gene cluster responsible for A201A antibiotic biosynthesis (GeneBank acc. number X84374, unpublished data). After many unsuccessful attempts to introduce exogenous DNA into the producer strain S. mutabilis subsp. capreolus by a variety of strategies, we introduced this putative ata gene cluster containing plasmid pIES into S. lividans 66–1326 to test whether its heterologous expression was possible. The S. lividans ata strain was resistant to 500 μg ml−1 of A201A on agar plates, and as indicated by a disc diffusion test on agar plates for biological activity, was in addition able to inhibit Micrococcus luteus growth, an A201A-sensitive organism (data not shown). These data indicated that A201A producing S. lividans ata strain could well be used as a bona fide tool to study the S. mutabilis subsp. capreolus biosynthetic A201A cluster.

To detect the heterologous production of A201A in S. lividans ata we first use an in vitro phosphorylation assay with Ard2, the ATP/GTP-binding/phosphorylating protein (an A201A-inactivating activity) encoded by the ard2 gene from ata.19 Accordingly, chloroform extracts from culture filtrates from S. lividans ata and those from S. mutabilis subsp. capreolus, S. lividans 66–1326 and S. lividans 66–1326 (pPACS2; Table 1), as well as standard A201A, were used as substrates for Ard2 assays in the presence of [γ-32P]ATP. Cell-free extracts from S. lividans ata and S. lividans 66–1326 (pPACS2) were used as a source of the Ard2 enzyme and as a negative control, respectively. Reaction products were developed by TLC and radioactive spots were detected by exposure to X-ray films, where a radioactive spot corresponding to P-A201A served as positive control. Only the presumed A201A-containing samples (S. mutabilis subsp. capreolus and S. lividans ata) appeared to incorporate [γ-32P]ATP into a P-A201A spot, suggesting that this antibiotic was expressed heterologously in S. lividans ata (data not shown). To confirm the production of A201A in the heterologous host, S. lividans ata was cultured and the chloroform extracts of culture filtrates were examined by HPLC (Figure 2a). Two compounds were identified in the HPLC profiles of the presumed A201A-containing samples. Their retention times corresponded to those of A201A and a putative A201A molecule lacking the disaccharide moiety (probably that considered as hydrolysis product 1 by Kirst et al.5). The identity of these substances was confirmed by ESI-MS (Figure 2b), yielding the characteristic molecular ion (m/e for [M+H]+) of 803 for A201A and 455 for that lacking the disaccharide (Figure 2c) that correspond to their molecular formulae (C37H50N6O14 and C22H26N6O5, respectively).

Figure 2
figure 2

HPLC and MS analyses of the A201A produced in S. lividans ata. (a) HPLC chromatograms of chloroform extracts from culture filtrates of several Streptomyces strains. (b) Samples were developed by TLC, extracted with chloroform and subjected to matrix-assisted laser desorption ionization-time of flight. (c) Chemical structure of compound corresponding to 1: A201A and 2: the putative A201A lacking the disaccharide moiety.

Taken together these results indicate that the ~34 kb-long S. mutabilis subsp. capreolus DNA fragment in S. lividans ata strain is sufficient for the heterologous production of A201A as well as to confer resistance to it, strongly suggesting that it corresponds to the full ata biosynthetic cluster.

Time course of A201A production in S. mutabilis subsp. capreolus and heterologous hosts

Cultures of S. mutabilis subsp. capreolus and S. lividans ata were grown to compare the levels of antibiotic production over relevant time courses (Figure 3). In both cases, A201A production appeared to be strictly controlled, since it commenced during late-exponential growth to reach maximum values at the late stationary phase, close to 96 and 144 h, respectively. Interesting, A201A biosynthesis commences much earlier in S. lividans than in S. mutabilis subsp. capreolus. In addition, it appears that the production by the recombinant organism was approximately twofold lower than that in the wild-type strain. A similar result was found when the puromycin biosynthetic (pur) gene cluster was cloned.7, 29 The fact that antibiotic production starts in the late logarithmic phase indicates that expression of the ata cluster could be under the control of some regulatory mechanism. The ata cluster included in the pIES plasmid was also introduced in S. lividans 66-TK21 and S. coelicolor (Table 1). Similar profiles of A201A production were observed in these organisms derivatives containing the integrated ata sequence (data not shown).

Figure 3
figure 3

Time courses of A201A production by S. mutabilis subsp. capreolus and S. lividans ata. S. mutabilis subsp. capreolus (squares) and S. lividans ata (triangles) were grown at 30 ºC. At the indicated times, samples were taken to determine the OD (broken lines) and then filtered. Chloroform extracts from the filtrates were used to quantify A201A by HPLC (solid lines).

Targeted gene disruption to identify the boundaries of the ata gene cluster

A201A production in both S. lividans and S. colelicolor indicates that the S. mutabilis subsp. capreolus DNA fragment inserted in their genomes contains the genes for the enzymes required and sufficient for the antibiotic synthesis. An examination of the deduced amino acid sequences from this 34 kb DNA stretch suggested that the ends of the putative ata cluster included ard1 and ata11 (Figure 4a and Table 3). These genes encode a putative ABC transporter that confers resistance to A201A (ard1)16 and a putative O-methyltransferase (ata11), respectively. To determine whether this was the case, a variety of deletions of the 34 kb sequence from S. lividans ata were subjected to relevant phenotypic assays (see Experimental Section and Figure 4a). At the right-hand end, a merR deletion did not alter the production of A201A in the heterologous host, whereas destruction of ata11 completely abolished it. Moreover, the ata11 gene from pATA11 complemented a specific A201A non-producing Δata11 mutant of S. lividans ata (Table 1). Despite the fact that in S. lividans the regulation of the ata cluster might differ from that in S. mutabilis subs. capreolus, or even that an homologous to merR in the host genome could functionally complement this transcriptional regulator absence, our data indicated that ata11 will be the last gene on the right-hand end of the ata gene cluster. With regards the left-hand end, while a deletion of orf1 did not alter A201A production, all attempts to delete ard1 were by now unsuccessful. This could be explained by the determinant function of this A201A-resistance gene, since its disappearance would cause lethality. Because such genes are usually members of antibiotic biosynthetic clusters,28 these findings indicate that ard1 is the terminal gene at the left-hand end of ata. Finally, it is noteworthy that a region close to 750 nucleotides with no apparent coding capacity flanks the 3′ region of the coding sequence of these two terminal genes, which contrasts with the compact organization of the DNA coding regions of streptomycetes.

Figure 4
figure 4

A201A biosynthetic gene cluster. (a) Scheme of the 32 open reading frame (ORFs) sequenced from S. mutabilis subsp. capreolus including the A201A (ata) gene cluster. Boxes indicates the direction of transcription of each predicted (ORFs). The horizontal arrows indicate the putative transcription units. Deleted genes are marked by open triangles. The ORFs with homology to putative genes in the hygromycin A biosynthetic gene cluster from S. hygroscopicus NRRL2388 or to puromycin biosynthetic genes from S. alboniger are hatched and in dark boxes, respectively. Note that trx1 and orf1 were previously named orfB and orfA, respectively.12 ORFs from trx1 to ata-PKS1 and from ata2 to ata5 were sequenced previously. Ev, EcoRV; E, EcoRI; X, XhoI; B, BglII. (b) Puromycin biosynthetic cluster from S. alboniger (left) and the region in the hygromycin A biosynthetic gene cluster from S. hygroscopicus NRRL2388 showing homology to A201A genes (right) are shown for comparison. Dotted lines indicate those sets of 2 or more genes where orientation and organization is conserved between clusters.

Table 3 Open reading frames of the ata cluster and their proposed functions.

Bioinformatics analysis of ata biosynthetic cluster

The ata biosynthetic cluster bioinformatics analysis from ard1 to ata11 has revealed the presence of 28 putative ORFs (Figure 4a and Table 3), including ard1 and ata11, most of which can be aligned with homologous sequences in GeneBank using BLAST P programs. The proposed functions of these ata gene products based on their homologies are listed in Table 3, together with the corresponding homologs from puromycin and hygromycin A antibiotics biosynthetic pathways from S. hygroscopicus and S. alboniger, respectively (see also Figure 4b). On the other hand, the organization of the A201A biosynthetic cluster is mostly conserved between S. mutabilis subs. capreolus and the other A201A producer M. thermotolerans, with the exception of the resistance gene ard1, which is placed between ata14 and ata15 in M. thermotolerans and the ataP7-P10-P4-P5 ORFs, which are in the opposite orientation in M. thermotolerans.3

Genes related to the biosynthesis of the polyketide moiety

We suggested previously that the central A201A α-methyl-p-coumaric acid moiety was derived from the condensation of methyl-malonyl CoA with p-hydroxybenzoyl CoA, which must be mediated by a type II polyketide synthase (PKS).30 The minimal PKS involves a ketoacyl synthase activity (KS), a chain length factor and an acyl carrier protein (ACP).30 Analysis of the ata cluster sequence indicated the presence of nine putative genes that might be involved in the biosynthesis of the polyketide moiety (Ata-PKS1-AtaPKS4, Ata18, Ata19, Ata2, Ata4 and Ata7). Ata–PKS1 (436 residues) aligned with PKS acyltransferases and includes the substrate-binding motif 196GHSMG200.31 Likewise, Ata-PKS2 (108 residues) and Ata19 (95 residues) showed similarity to ACP and they contain the serine 4′-phosphopantetheine-binding sites 47DX6GXDSX2LX7E68 and 35DX6GXDSX2LX7E56, respectively.32 Ata-PKS3 (388 residues) and Ata-PKS4 (265 residues) showed an overall similarity with KS, and their N terminal domains contain most of the structures involved in dimer formation. In contrast, Ata-PKS4 lacks the highly conserved cysteine residue in the proposed active site 120G-M135 required to link acyl groups.33 However, the Ata-PKS3 5G-V23 sequence includes a cysteine, 16C and could act as the protein active site. The presence or absence of key cysteine residue is a distinctive characteristic of chain length factor and KS activities.34 Ata18 (486 residues) showed similarity to AMP-dependent synthetase and ligases, including the CoA ligases that are implicated in fatty acid and polyketide biosynthesis.35 These proteins share an AMP-binding motif that includes a Ser/Gly/Thr-rich domain followed by a conserved Pro-Lys,36 which is also presents in Ata18 (142SGTQGRPK149). Ata7 (219 residues) aligned with several bacterial chorismate pyruvate-lyases (p-hydroxybenzoic acid synthases), including that of Mycobacterium tuberculosis (42% similarity over a 160-amino acid overlap). The chorismate pathway is only present in bacteria, fungi and plants, and it provides a wealth of compounds with diverse biological functions, including aromatic amino acids, folate cofactors, pigments, mycobactins and glycosylated p-hydroxybenzoic acid methyl esters. The chorismate pyruvate-lyase from M. tuberculosis is responsible for the direct conversion of chorismate to p-hydroxybenzoate, and it has been identified as the sole enzymatic source of p-hydroxybenzoic acid in this organism.37

In this context, eight of the nine gene products putatively required for the biosynthesis of A201A α-methyl-p-coumaric acid moiety (disregarding Ata7) showed clear homology to the same number of genes identified in S. hygroscopicus NRRL2388 from the hygromycin A biosynthetic gene cluster (Hyg9-15 and Hyg22).9 Hygromycin A and A201A share a similar polyketide structural moiety and therefore, it seems plausible that some analogous enzymes must be required for the biosynthesis of these two antibiotics. However, the organization of the homologous genes in both these clusters is not completely coincident (Figure 4b). This contrasts with the organization of the genes in the ata cluster that are involved in the biosynthesis of the aminonucleoside moiety when compared with those of the pur cluster from S. alboniger29 (Figure 4b).

It has been proposed that the biosynthetic pathway of the 3,4-dihydroxy α-methyl-p-coumaric acid moiety in hygromycin A commences with 4-hydroxybenzoic (or 3,4-dihydroxybenzoic) acid, which is derived from chorismate via the putative chorismate lyase Hyg4.9 Although there was little similarity between Hyg4 and Ata7, it seems likely that both putative proteins could be responsible for the biosynthesis of 4-hydroxybenzoic acid. Presumably, this acid must be activated by conversion to a thioester, and Ata18 (homologous to Hyg12) could catalyze this reaction based on its homology to AMP-dependent synthetases and CoA ligases (Figure 5). Ata-PKS2 and Ata19 (which aligned with Hyg9 and Hyg13, respectively) are homologous to ACP and either of them could a priori be required to produce methyl-malonyl ACP for condensation with the activated hydroxybenzoyl thioester (Figure 5). The putative acyltransferase Ata-PKS1 (as proposed due to its similarity to with Hyg22) may affect the methyl-malonyl CoA-methyl-malonyl ACP interconversion and the proposed KS, Ata-PKS3 (homologous to Hyg10), might be required for the decarboxylative condensation with methyl-malonyl ACP. In addition, Ata-PKS4 (homologous to Hyg11) could be involved in KS dimer formation and/or, be related to chain length factor, thereby controlling the shutdown in chain elongation (Figure 5). Once the polyketide backbone is formed, different enzymes act to modify the final structure, including ketoreductases, oxygenases, dehydratases or aromatases. Accordingly, Ata4 (a putative 3-ketoacyl ACP-reductase homologous to Hyg15) and Ata2 (a 3-hydroxylacyl ACP dehydratase homologous to Hyg14) could be responsible for additional changes in the polyketide moiety (double bonding; Figure 5).

Figure 5
figure 5

Proposed working hypothesis of A201A polyketide moiety biosynthesis. Putative S. mutabilis subsp. capreolus enzymes involved in the biosynthesis of the A201A antibiotic polyketide moiety are shown as gray boxes. See text for details.

Genes related with the biosynthesis of the disaccharide moiety

The chemical structure of A201A includes two sugar moieties, an unsaturated hexofuranose associated with the α-methyl-p-coumaric acid moiety and a 3,4-dimethyl-d-rhamnose. Analysis of the ata cluster seemed to indicate that at least six gene products might be involved in disaccharide biosynthesis (Ata12, Ata13, Ata16, Ata17, Ata5 and Ata10). Ata13 (374 residues) aligned with bacterial glycosyltransferases and mannosyltransferases that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptors, forming glycosidic bonds. Prokaryotic mannosyltransferases generally use GDP-mannose as the sugar donor in lipopolysaccharide and polysaccharide biosynthesis. These proteins are larger than 350 amino acids and while they do not display an overall similarity, they do share the EXFGX4E motif in the C-terminal half of the protein that is involved in the catalytic activity of the enzyme.38, 39 This motif is also found in sucrose phosphate synthase, which also shows some homology to Ata13 (Q0I5Z9; 30% identity; 42% similarity). However, although Ata13 contains the 264EAFGLSILE272 sequence, it also contains the 78P, 84H, 127S and 186K residues that are highly conserved in mannosyltransferases but that are not present in sucrose phosphate synthases.38 Ata16 (392 residues) and Ata5 (425 residues) were homologous to some bacterial and plant transglucosylases and glycosyltransferases, respectively. Interestingly, these two putative proteins aligned with Hyg20 and Hyg16 from the hygromycin A biosynthetic gene cluster,9 and this antibiotic also contains a pyranose–furanose moiety closely related to those found in A201A. Ata17 (384 residues) displayed similarity to NAD-dependent epimerase/dehydratases that contain a N-terminal NAD(P) binding motif40 (56GAGGFIG62 in Ata17), including epimerase-ketoreductase, epimerase-dehydratase and dehydrogenase, which utilize nucleotide-sugar substrates in a variety of chemical reactions. Finally, Ata10 (496 residues) showed overall homology to glucose-methanol-choline oxidoreductases that act on CH–OH groups of different donors (glucose, methanol and choline),41 and which share a FAD–ADP-binding domain in the N-terminal portion of the protein (11GAGSAG16 in Ata10).

The formation of activated-sugar precursors by nucleotidyltransferases must be a crucial branch point in secondary carbohydrate metabolic pathways from primary metabolism. However, there did not appear to be any genes involved in the formation of activated sugars within the ata cluster (Table 3). Similar results were recently found in the moenomycin A and the butirosin gene clusters from Streptomyces ghanaensis42 and Bacillus circulans,43 respectively. Biosynthesis of these carbohydrate-containing antibiotics requires a sugar-nucleotide building block directly derived from primary metabolism. In this context, either GDP-d-rhamnose or GDP-l-fucose is the 6-deoxyhexose produced from GDP-d-mannose. Biosynthesis of GDP-l-fucose is mediated by mannose 4,6-dehydratase, 3,5 epimerase and 4-ketoreductase activity in both eukaryotic and prokaryotic organism.44, 45 Accordingly, Ata12 (mannose 4,6-dehydratase) and Ata17 (epimerase/dehydratase) may be involved in the synthesis of GDP-d-mannose. Furthermore, the unsaturated hexofuranose moiety could be synthesized from GDP-d-rhamnose through Ata10 (oxidoreductase), and Ata13 (mannosyltransferase) could link it to that d-rhamnose. If so, Ata16 (transglucosylase) might well be involved in the glycosyl ring biosynthetic pathway and Ata5 (glycosyltransferase) could be responsible for the formation of the glycosidic link between the hexofuranose and the polyketide moieties.

Genes encoding methyltransferases

Besides AtaP5, a putative SAM-dependent methyltransferase that could dimethylate N6 of the A201A aminonucleoside moiety, three other putative methyltransferases are also found in the ata cluster: Ata6 (350 residues), Ata8 (276 residues) and Ata11 (236 residues). As Ata11 exhibited some sequence similarity to Hyg6 of the hygromycin A gene cluster (38% identity and 50% similarity over a 140-amino acid overlap), it would presumably be related with p-coumaric acid methylation which occurs in both these antibiotic structures. In addition, and given that the two glycoside residues in A201A are O-methylated, Ata 6 and Ata8 may be the enzymes responsible for these methylations. Nevertheless, despite these sequence homologies, further molecular and biochemical analysis is necessary to determine the role of these putative proteins in the A201A biosynthetic pathway and the specific order of events that take place.

Genes encoding resistance, regulatory or proteins of unknown function

Antibiotic-producing actinomycetes, usually posses more than one mechanism that confer resistance to the relevant antibiotic. Besides ard1 and ard2, the fragment sequenced from S. mutabilis subsp. capreolus NRRL3817 contained a gene whose product could be a priori involved in providing resistance to A201A, Ata9 (488 residues). This is a putative transmembrane proteins, which aligns with permeases of the major facilitator superfamily (MSF transporters) and thus, they may be involved in the efflux of A201A.

The regulation of ata cluster expression remains unclear since the cluster appears to lack dedicated regulatory genes. A putative regulatory protein MerR (125 residues) with similarity to DNA binding proteins of the MerR family of transcriptional regulators (Table 3) was present in the sequence analyzed, but removal of this putative protein did not affect either negatively or positively the A201A production (see further and Table 1). Nevertheless a misfunction of the regulation mechanism in the heterologous organism cannot be fully excluded. In addition, a higher level of regulation by the regulatory bldA gene is unlikely given that none of the genes analyzed contain the rare TTA leucine codons.46 This is not surprising considering that <0.2% of genes in the S. coelicolor genome contain TTA codons.47 A number of pleiotropic regulatory genes have been isolated from S. coelicolor that affect multiple antibiotic pathways but that are not pathway-associated.48 The isolation and characterization of these genes may reveal novel regulatory mechanisms in the biosynthetic pathways of secondary metabolites within this important group of bacteria.

Finally, the role of two genes of the cluster in A201A biosynthesis could not be defined, since the putative proteins they encode did not clearly align with any protein sequence in databases. Still, some similarities were encountered. Ata14 (334 residues) includes the nucleotide (FAD, NAD(P)) binding site 236GPGFDG24,49 and not surprisingly, its corresponding homolog in M. thermotolerans A201A biosynthetic cluster (MtdJ) has been to function as a oxidoreductase.3 For Ata15 (456 residues), the sequence from 137V to R250 aligns with bacterial glucokinases (Q2RRF9; 30% identity and 45% similarity). This 113 amino acid stretch includes the highly conserved 206GTGL209 and residues 165N and 166D that might be associated with glucose binding (all of which can also be found in its corresponding homolog in M. thermotolerans cluster MtdK3), as well as the aspartic acid functioning as the general base in the enzyme catalytic mechanism.50 However, Ata15 did not display overall similarity to glucokinases and neither were other highly conserved residues in these enzymes present, such as those associated with ATP-binding.

In conclusion, here we have studied the gene cluster involved in the biosynthesis and resistance of A201A from S. capreolus, expressed it in various heterologous systems, and the predictions drawn from the bioinformatics analysis were largely consistent with the expected enzymology. A201A is an interesting compound, which could be viewed as a natural ‘hybrid molecule’, with moieties similar to those found in other antibiotics such as puromycin and hygromycin A. Besides a moiety derived from d-rhamnose, its chemical structure includes the aminonucleoside moiety of puromycin, as well as a polyketide and an unsaturated furanose moiety closely related to similar structures found in hygromycin A. Both our group and the Reynolds group have done a remarkable work in understanding the puromycin and hygromycin A biosynthetic pathway steps and some of these are likely to occur identically for A201A biosynthesis in S. mutabilis. In this regard, the aminonucleoside moiety of A201A synthesis is likely to be initiated from ATP by the ATAP10 oxidoreductase and followed sequentially by five enzymatic steps catalyzed by ATAP4, ATAP7 and ATAP5/ATAP3, as described for puromycin.12 Likewise, we have placed at least eight enzymatic activities homologous to hygromycin A ones in those steps leading to the A201A polyketide moiety biosynthesis, all of which needs to be confirmed with further studies. The remaining enzymes sharing homology with hygromycin A ones are likely to be involved in the synthesis and incorporation of the disaccharide moieties, methylation of the antibiotic and resistance mechanisms and will require to be studied further too. Thus, the biosynthetic pathway of A201A consists in a combination of an interesting variety of known (from puromycin and hygromycin A biosynthesis) and novel enzymatic steps for the synthesis and modification of each moiety together with those that correctly link them together. It will be interesting to see how our knowledge of these complex antibiotic biosynthetic pathways is expanded with new studies and hopefully completed in the next years.

Characterization of all the genes involved in A201A biosynthesis will be useful to detect homologous genes from other actinomycetes, to identify biosynthetic clusters for other antibiotics that include nucleoside, polyketide and/or sugar moieties, and also to produce novel hybrid antibiotics. Future combinatorial biosynthetic strategies, and cross-complementation between selected segments from ata, pur and hyg clusters, could potentially provide a tool for the generation of new compounds with different biotechnological applications.