Introduction

The genus Amycolatopsis was first described in 1986 along with Amycolata to accommodate nocardioform actinomycetes, forming new branches in the evolutionary tree of Pseudonocardiaceae [1]. Before this, Amycolatopsis species were categorised as Streptomyces and then Nocardia. To date, 96 verified Amycolatopsis species have been acknowledged in the List of Prokaryotic names with Standing in Nomenclature (LPSN) database [2], making it the 28th most well-described bacterial genus. A thorough review of the published Amycolatopsis secondary metabolites by Song et al. [3] revealed that between 1989 and 2020, 159 chemical entities had been isolated from 8 known and 18 unidentified Amycolatopsis species. Only 8 compounds, from the pargamicins and valgamicins families, possessed a simple cyclic peptide structure [4,5,6]. Since 2020, another 2 cyclic peptides have been identified, amycolatomycins A and B [7].

Amycolatopsis sp. MST-135876 was isolated from soil collected from the riverbank of El Pont de Suert, 230 km east of Pamplona, Spain. Preliminary HPLC of a crude extract of the strain identified a metabolite profile unreported in Amycolatopsis. On passage, the strain lost its vitality and productivity, which were restored on mono-spore selection. The variant 3 subculture of the original isolate yielded a stable strain with high secondary metabolite productivity. Analysis of the metabolite profile revealed several compounds with UV–vis spectra commonly associated with the amino acid tryptophan (λmax 192, 228, 284 nm) in addition to several others with UV–vis spectra comparable to the previously reported amycolatopsins [8]. This report describes the metabolic restoration and cultivation of MST-135876, and the isolation, spectroscopic characterisation, and biological profiling of three new cyclic hexapeptides, suertides A–C (13) (Fig. 1).

Fig. 1
figure 1

The molecular structures of suertides A–C (13) isolated from Amycolatopsis sp. MST-135876v3

Experimental section

Mono-spore culture preparation

Culture MST-135876 was used to inoculate one ISP2 agar plate, which was then incubated in a dark temperature-controlled room at 28 °C. On day 3, individual colonies (10) were picked from the plate using a sterile toothpick, and individually streaked onto separate ISP2 daughter plates. The daughter plates were incubated at 28 °C for 7 days, at which point they were subsampled (1 g) and extracted in MeOH (2 ml) for 1 h and analysed by HPLC to determine their metabolite profiles. The residual culture from each daughter plate was preserved at −80 °C under glycerol (40%).

Cultivation and extraction (1)

MST-135876v3 was cultured on ISP2 agar plates for 7 days at 28 °C. A spore suspension (H2O; 100 ml) was used to inoculate 260 × Modified Bennett’s (MS) agar plates (2.1 kg), which were incubated for 10 days at 28 °C, before being bagged and frozen at −20 °C for three days. The frozen plates were thawed at 28 °C for 14 h over a sieve, resulting in an aqueous extract (2000 ml) and dehydrated agar plates. The dehydrated agar plates were extracted in 3:1 CHCl3/MeOH (1 × 2300 ml; 1 × 2000 ml) and acetone (1 × 1800 ml). The organic solvent extractions were filtered, combined and reduced in vacuo to an aqueous slurry (400 ml) before being partitioned against ethyl acetate (2 × 1000 ml). After in vacuo reduction, the crude ethyl acetate extract (4.5 g) was dissolved in CHCl3 (150 ml), applied to a silica gel column (80 g) and eluted with a stepwise gradient of 0–100% MeOH in CHCl3, collecting a total of 16 fractions (500 ml). The compounds characterised by tryptophan-like UV spectra eluted in fractions A13 − A15. Fraction A15 (507.7 mg) was dissolved in MeOH (4 ml) and purified using preparative HPLC (Zorbax-SB C18, 250 × 21.5 mm, isocratic 36.25% MeCN, 36.25% MeOH, and 27.5% H2O containing 0.1% TFA, 20 ml min−1) to yield 1 (tR = 8.02 min, 3.1 mg).

Cultivation and extraction (2 and 3)

MST-135876v3 was cultured on ISP2 agar plates for 7 days at 28 °C. A spore suspension (H2O; 100 ml) was used to inoculate 250 × MS + 2% KBr agar plates (2.2 kg), which were incubated for 10 days at 28 °C, before being bagged and frozen at −20 °C for 2 days. The frozen plates were thawed at 28 °C for 24 h over a sieve, resulting in an aqueous extract (1800 ml) and dehydrated agar plates. The agar plates were extracted in 3:1 CHCl3/MeOH (3 × 2000 ml) and the organic solvent extractions were filtered, combined and reduced in vacuo to an aqueous slurry (400 ml) before being partitioned against ethyl acetate (2 × 1000 ml). The peptides were observed in both the aqueous and organic phases of the separation. The ethyl acetate extract was reduced in vacuo, dissolved in MeOH (150 ml) and partitioned against hexanes (2 × 300 ml). The MeOH fraction was reduced in vacuo to a residue. The MeOH from the hexanes-methanol partition and the aqueous fraction of the ethyl acetate-water partition were dissolved in 3:1 CHCl3 and MeOH (300 ml), applied to a silica gel column (80 g) and eluted with a stepwise gradient of 0–100% MeOH in CHCl3, collecting a total of 12 fractions (500 ml). The UV–vis spectra of the peptides were observed in fractions C10 and C11. Fraction C11 was dissolved in MeOH (4 ml) and fractionated by preparative HPLC (C18-enhanced polar selectivity, 250 × 22 mm, isocratic 36.25% MeCN, 36.25% MeOH, and 27.5% H2O containing 0.1% TFA, 20 ml min−1) to yield 2 (tR = 7.65 min, 4.0 mg) and 3 (tR = 13.63 min, 8.9 mg).

Marfey’s analysis

To determine the absolute configuration of the amino acids, compound 1 (0.50 mg) was dissolved in H2O (100 μl) and HCl (5 M; 50 µl) was added. The solution was heated at 60 °C for 24 h, then dried under nitrogen prior to the addition of NaHCO3 solution (10 M; 50 µl) and Marfey’s reagent in acetone (1% w/v; 20 μl). The reaction mixture was heated at 60 °C for 2 h and the reaction neutralised with HCl (10 M; 25 µl) before analysis on LC-MS. Marfey’s conjugate standards for d- and l-valine, isoleucine, allo-isoleucine, serine, tryptophan, 5-chlorotryptophan, 6-chlorotryptophan, and glutamic acid were prepared and compared with the amino acids hydrolysed from 1.

Amino acid sequence determination

LC-MS was used to support the structure of 1 as determined by NMR. The peptide was first linearised according to a previous method, slightly modified [9]. Briefly, an aliquot (20 µl) of a solution of the peptide in MeOH (250 µg ml−1) was diluted to 100 µl with water and HCl (1.2 M; 100 µl) was added. The resulting solution was heated at 90 °C for 40 min to partially hydrolyse the peptide. The reaction was cooled on ice and quenched with an equimolar quantity of NaOH. The sample was dried in a vacuum centrifuge (Labconco) and redissolved in an aqueous solution of 5% MeCN / 0.1% formic acid (20 µl). LC-MS analysis was conducted on a Q Exactive Focus instrument (Thermo Fisher Scientific). Raw data returned from the laboratory were analysed with the software package Xcalibur Qual Browser 3.0.63 (Thermo Fisher Scientific) and manually sequenced de novo (Supplementary Fig. S27). The Orbitrap LC-MS data were acquired by the Thermo Fisher Proof of Concept Laboratory at Edith Cowan University, Perth, Australia.

Results and discussion

The secondary metabolite distribution of MST-135876 was highly media-dependent, with 1 produced on only one of fifteen media, MS liquid medium. During these experiments, it was noticed that the vitality and metabolic productivity of the strain were unstable and diminished with iterative passage. A limited dilution spread of the original strain on agar identified 80% of the strains as non-producers and the productivity was stabilised by selection of a stable mono-spore, variant 3 (v3). MST-135876v3 was used in all subsequent cultivations, with optimal production of 1 observed on day 7 of a culture on MS agar. Supplementing MS agar with 2% KBr suppressed the production of 1, while triggering the production of two novel non-polar analogues. Both the MS and MS + 2% KBr cultures were processed separately and extracted with acetone, the crude extract then partitioned between ethyl acetate and water followed by removal of the fats by hexane/methanol partition, provided an enriched fraction that was further fractionated by reversed phase C18 preparative HPLC to yield pure 13 (Supplementary Fig. S1).

16S rRNA gene sequence analysis indicated that the strain MST-135876v3 has 99.57% similarity to Amycolatopsis xuchangensis str. CFH S0322 [10]. The MST-135876v3 16S data also showed strong similarity to Amycolatopsis magusensis str. KT2025 (98.72%) [11], Amycolatopsis albispora str. WP1 (97.60%) [12], Amycolatopsis jiguanensis str. CFHS01580 (97.04%) [10], and Amycolatopsis xylanica str. CPCC 202699 (96.74%) [13]. A total of ten type strain Amycolatopsis species showed over 96% similarity to the 16S data for MST-135876v3 (Supplementary Table S1), hence the microbial species was tentatively identified as an Amycolatopsis species. The 16S sequence was submitted to GenBank under accession number OK487575.

HRESI(+)MS analysis of 1 indicated a molecular formula C41H50Cl2N8O9 ([M + Na]+ m/z 891.2968, Δmmu −0.2). A distinctive isotopic pattern [M + H]+ m/z 869/871/873 with 9:6:1 relative intensities was observed (Supplementary Fig. S20), which is characteristic of dichlorinated compounds. The 13C NMR spectrum of 1 revealed 41 distinct peaks, while the 1H NMR peak integration suggested 50 protons, in agreement with the calculated chemical formula (Table 1, Supplementary Fig. S3). 1H – 13C HSQC, 1H – 13C HMBC, 1H – 1H COSY, and 1H – 1H ROESY NMR data were used to elucidate the molecular structure (Supplementary Figs. S4S7). COSY correlations defined a spin system extending from Val-NH (δH 8.34) to Val-H-γ1/γ2 (δH 0.81/0.82), suggesting the presence of valine in the molecule (Fig. 2). An HMBC correlation from Val-H-α to δC 170.4 identified the Val carbonyl carbon. HMBC correlations from Ile-H-α (δH 4.00) to the Ile-carbonyl carbon (δC 172.2), Ile-C-β (δC 34.9), and Ile-C-γ1/γ2 (δC 25.0/14.6), and from Ile-H-δ (δH 0.73) to Ile-C-β and Ile-C-γ1, indicated the presence of isoleucine. COSY correlations from Ile-NH (δH 8.21) to Ile-H-α completed assignment of the amino acid. ROESY correlations between Ile-NH and Val-H-α suggested the two amino acids were adjacent. A second ROESY correlation between Ile-NH and 6-Cl-Trp-H-α (δH 4.72) was also observed. The 6-Cl-Trp-H-α proton showed HMBC correlations to the carbonyl carbon (δC 170.0), the β position (δC 28.6), and C-3 (δC 109.9). Further HMBC correlations were observed from 6-Cl-Trp-H-1 (δH 11.00) to C-2 (δC 124.6), C-3, C-3a (δC 126.5), and C-7a (δC 136.4). HMBC correlations from H-4 (δH 7.56) to C-7a and C-6 (δC 125.5), from H-5 (δH 6.98) to C-3a and C-7 (δC 110.7), and from H-7 (δH 7.33) to C-3a and C-5 (δC 118.5) suggested a 6-substituted tryptophan. This hypothesis was supported by strong reciprocal COSY correlations between H-4 and H-5. Accounting for the calculated formula that suggested 2 chlorine atoms, the amino acid was taken to be 6-Cl-Trp. The COSY and ROESY data from 6-Cl-Trp-NH (δH 7.37) to the H-α position were used to define the remaining constituents of the amino acid. The fourth amino acid in the sequence was identified from a diagnostic ROESY correlation between 5-Cl-Trp-NH (δH 8.66) and 6-Cl-Trp-NH. The 5-Cl-Trp-NH proton showed HMBC correlations to the C-α (δC 54.2) and C-β (δC 26.9) positions. COSY correlations were also observed between H-α (δH 4.26) and diastereotopic H-βa/b (δH 3.20/2.90). HMBC correlations from H-βa/b to C-2 (δC 125.3), C-3 (δC 110.6), and C-3a (δC 128.3), and from H-2 (δH 7.20) to C-3, C-3a and C-7a (δC 134.5), indicated the presence of a second tryptophan residue in the molecule. Diagnostic HMBC correlations from H-4 (δH 7.57) to C-6 (δC 120.8) and C-7a, in addition to the correlations from H-6 (δH 7.04) to C-4 (δC 117.4) and C-7a, and from H-7 (δH 7.33) to C-3a and non-protonated C-5 (δC 123.1) suggested that the second tryptophan was 5-chloro- substituted. This was further evidenced by the strong reciprocal correlations between H-6 and H-7 in the COSY spectrum. An HMBC correlation from H-α position to a carbonyl carbon (δC 170.8) completed assignment of the amino acid. A ROESY correlation between 5-Cl-Trp-NH and Ser-H-α (δH 4.12) was used as a starting point for the fifth amino acid. HMBC correlations from the Ser-H-α to the carbonyl carbon (δC 170.3) and Ser-C-β (δC 60.3) defined the extent of the amino acid. This was supported by COSY correlations between Ser-H-α and Ser-H-β (δH 3.27) and Ser-NH (δH 8.19). The 1H and 13C chemical shifts of the β position are typical of those found in serine, although the presence of an OH group was not observed in the 1H NMR spectrum of 1. As such, this amino acid was tentatively characterised as serine. ROESY correlations from Ser-NH to Glu-H-α (δH 4.40), Glu-H-βb (δH 1.79), and Glu-H-γ (δH 2.18) were observed, indicating the start of the sixth amino acid. The connectivity of these residues was confirmed by HMBC correlations from Glu-H-α to a carbonyl carbon (δC 170.3), C-β (δC 28.5) and C-γ (δC 29.7). An HMBC correlation from Glu-H-γ to a second carbonyl carbon (δC 173.9) and the presence of a downfield chemical shift associated with an acidic hydroxy group (δH 12.10), confirmed the identity of the final amino acid as glutamic acid. A COSY correlation between Glu-H-α and Glu-NH (δH 7.28) was used to identify the final amide group. ROESY correlations between Glu-NH (δH 7.28) and Val-NH, Val-H-α, and Val-H-γ1/2 completed the cyclic hexapeptide core. De novo sequencing by LC-MS/MS confirmed the amino acid sequence determined from the NMR data (Supplementary Fig. S32). A Marfey’s analysis was undertaken to determine the absolute configurations of the amino acids present in 1 [14]. Using LC-MS to compare the retention times of the Marfey’s-conjugated amino acids liberated from acid-catalysed hydrolysis of 1 to those of amino acid standards (Supplementary Figs. S26S31), we identified the presence of d-Ser, 5-Cl-d-Trp, 6-Cl-d-Trp, l-Ile, d-Val, and d-Glu. Taken together, the absolute configuration of 1 was confirmed, as depicted in Fig. 1.

Table 1 1H (600 MHz) and 13C (150 MHz) NMR data for suertides A–C (13) in DMSO-d6
Fig. 2
figure 2

Key 2D NMR correlations for suertides A–C (13)

HRESI(+)MS analysis of 2 revealed a molecular formula C41H51BrN8O9 ([M + Na]+ m/z 901.2855, Δmmu 0.0). A distinctive isotopic pattern m/z 901/903 with 1:1 relative intensity (Supplementary Fig. S21), suggested the incorporation of one bromine atom into the cyclic peptide. The NMR data for 2 were very similar to those for 1, with the only significant difference being the presence of an additional aromatic methine proton (H-5; δH 6.98) on the tryptophan residue adjacent to serine, suggesting the presence of non-halogenated tryptophan. This was supported by COSY correlations between H-4 (δH 7.53) and H-5, between H-5 and H-6 (δH 7.04), and between H-6 and H-7 (δH 7.31) (Fig. 2). As the remaining inter- and intra-amino acid correlations were consistent with the previously described non-substituted tryptophan amino acids, this indicated that the compound was a monobromo-derivative of 1. A chemical shift comparison between the 6-substituted tryptophan residues of 1 and 2 revealed an upfield shift in C-6 from δC 125.5 (1) to δC 113.6 (2), consistent with the additional shielding provided by bromine compared to chlorine. A downfield shift was observed for the atoms ortho to the halogenation site, with position 5 changing from δH 6.98/δC 118.5 (1) to δH 7.11/δC 121.0 (2), at position 7, the chemical shifts changed from δH 7.33/δC 110.7 (1) to δH 7.48/δC 113.8 (2), and there was no notable change at the 4-position. An HMBC correlation from H-5 (δH 7.11) to C-7, as well as the reciprocal COSY correlations between H-4 and H-5, indicated that this amino acid was 6-Br-Trp. Given their close biosynthetic relationship, that the absolute configuration of 2 was tentatively assigned to be the same as 1. Taken together, the HRESI(+)MS and NMR data confirmed the structure of 2 to be cyclo(d-Ser, d-Trp, 6-Br-d-Trp, l-Ile, d-Val, d-Glu), as shown in Fig. 1.

HRESI(−)MS analysis of 3 suggested a molecular formula C41H50Br2N8O9 ([M − H] m/z 955.1995, Δmmu 0.0). The spectrum contained a distinctive isotopic pattern m/z 955/957/959 with the relative intensity of 1:2:1 (Supplementary Fig. S22), indicative of a dibrominated compound. A comparison of the 1D and 2D NMR data obtained for 3 with those obtained for 1 revealed that the compounds were nearly identical. In the 6-substituted tryptophan, there was a characteristic upfield shift of C-6 from δC 125.5 (1) to δC 113.6 (3), suggesting substitution by bromine rather than chlorine. A downfield shift of the atoms ortho to the bromine-substituted carbon was observed, with position 5 changing from δH 6.98 / δC 118.5 (1) to δH 7.10 / δC 121.0 (3), and position 7 changing from δH 7.33 / δC 110.7 (1) to δH 7.48 / δC 113.5 (3). The key correlations from H-5 to C-3a (δC 126.7) and C-7 were observed, as were reciprocal COSY correlations between H-4 (δH 7.51) and H-5. Similarly, in the 5-substituted tryptophan, there was a change from δC 123.1 (1) to 111.0 (3) at the 5-position. Downfield shifts were also noted in the positions ortho to the brominated carbon, with C-4 changing from δH 7.57 / δC 117.4 (1) to δH 7.71 / δC 120.4 (3), and C-6 changing from δH 7.04 / δC 120.8 (1) to δH 7.15 / δC 123.3 (3). The characteristic HMBC correlations from H-6 to C-4 and C-7a (δC 134.7), in addition to the COSY correlations between H-6 and H-7 (δH 7.29), were used to confirm the amino acid was 5-bromo-substituted. Taken together, the HRESI(−)MS and NMR data confirmed that the structure of 3 was cyclo(d-Ser, 5-Br-d-Trp, 6-Br-d-Trp, l-Ile, d-Val, d-Glu), as shown in Fig. 1.

Biological activity

The suertides were evaluated for in vitro biological activity in antibacterial, antifungal, antiprotozoal, herbicidal, and antitumour bioassays and found to be selective antibacterial compounds. All compounds showed strong biological activity against Bacillus subtilis (ATCC 6633) (MIC 1.6 μg ml−1, Table 2), while differences in antibacterial activity were observed against Staphylococcus aureus (ATCC 25923), with the presence of brominated tryptophan residues resulting in reduced activity (3.1, 6.3, and 25.0 μg ml−1, for 1, 2, and 3, respectively). The compounds were also tested against methicillin-resistant S. aureus (MRSA, ATCC 33592), revealing increased potency of 1 (1.6 μg ml−1), comparable activity for 2 (6.3 μg ml−1) and no activity for 3. The reported compounds showed no activity up to 100 µg ml−1 against the Gram-negative bacterium Escherichia coli (ATCC 25922), the fungus Candida albicans (ATCC 10231), a mouse myeloma cell line (NS-1), a human fibroblast cell line (NFF), the protozoan Tritrichomonas foetus (strain KV-1) or the monocotyledonous plant Eragrostis tef (teff).

Table 2 In vitro bioassay data for compounds 13

Actinobacteria-derived antibacterial chlorinated peptides are a large group of small molecules that continues to grow. However, until recently, few compounds were associated with the genus Amycolatopsis [3]. The suertides are a family of antibacterial cyclic hexapeptides that contain more than two-thirds of the constitutive amino acids in the rarer non-proteogenic d-configuration and represent the second example in of two adjacent Trp moieties within a cyclic peptide from Amycolatopsis. The amycolatomycins, recently isolated by the Stadler laboratory, contain the same core amino acid units as suertide A, but with differing primary sequence, stereochemical configuration, and the presence of a distinct 2,6-dichloro-l-Trp residue [7]. To date, all the non-thiazolyl cyclic peptides isolated from Amycolatopsis have shown varying degrees of activity against MRSA [4,5,6] except for amycolatomycin A, which showed only weak antibacterial activity against B. subtilis (33.4 μg ml−1) [7]. More distantly related ditryptophan-containing metabolites include the cyclic heptapeptide argyrin A, from the myxobacterium Archangium gephyra [15], the chlorotryptophan-containing cyclic heptadepsipeptide krisynomycin, from Streptomyces canus [16], the cyclic nonapeptide propeptin, from Microbispora sp. [17], and the cyclic octadepsipeptides telomycins, which contain adjacent tryptophanyl and dihydrotryptophanyl moieties within the macrocycle [18]. Alone among these cyclic peptides, the suertides are the sole examples featuring two adjacent d-Trp moieties.

In conclusion, three new halogenated cyclo-hexapeptides were isolated from a putative Amycolatopsis sp. collected from a riverbank in Spain. All three compounds display antibacterial activity, with two displaying strong activity against MRSA and no cytotoxicity against mammalian cell lines up to 100 µg ml−1. Ultimately, this study demonstrated the ongoing utility of novel, soil derived actinobacteria in the quest for chemical novelty in drug discovery.

Physical characterisation

Suertide A (1): white powder; \([a]\frac{{20}}{D}\)  + 25 (c 0.03, MeOH); UV (MeCN) λmax (log ε) 200 (5.06); 230 (4.86); 288 (4.02) nm; HRMS m/z 891.2968; calcd. for C41H50Cl2N8O9Na+ [M + Na]+, 891.2968.

Suertide B (2): white powder; \([a]\frac{{20}}{D}\)  + 9 (c 0.02, MeOH); UV (MeCN) λmax (log ε) 200 (5.06); 230 (4.86); 288 (4.02) nm; IR (ATR) νmax 3271, 2961, 2922, 2359, 1643, 1542, 1456, 744, 660 cm–1; HRMS m/z 901.2855; calcd. for C41H51BrN8O9Na+ [M + Na]+, 901.2855.

Suertide C (3): white powder; \([a]\frac{{20}}{D}\)  + 23 (c 0.09, MeOH); UV (MeCN) λmax (log ε) 200 (5.06); 230 (4.86); 288 (4.02) nm; IR (ATR) νmax 3285, 2961, 2925, 2358, 1626, 1536, 1225, 795, 686 cm–1; HRMS m/z 955.1995; calcd. for C41H49Br2N8O9 [M − H], 955.1995.