Abstract
Microbial natural products form the basis of most of the antibiotics used in the clinic. The vast majority has not yet been discovered, among others because the hidden chemical space is obscured by previously identified (and typically abundant) antibiotics in culture extracts. Efficient dereplication is therefore key to the discovery of our future medicines. Here we present an analytical platform for the efficient identification and prioritization of low abundance bioactive compounds at nanoliter scale, called nanoRAPIDS. NanoRAPIDS encompasses analytical scale separation and nanofractionation of natural extracts, followed by the bioassay of interest, automated mass spectrometry identification, and Global Natural Products Social molecular networking (GNPS) for dereplication. As little as 10 μL crude extract is fractionated into 384 fractions. First, bioactive congeners of iturins and surfactins were identified in Bacillus, based on their bioactivity. Subsequently, bioactive molecules were identified in an extensive network of angucyclines elicited by catechol in cultures of Streptomyces sp. This allowed the discovery of a highly unusual N-acetylcysteine conjugate of saquayamycin, despite low production levels in an otherwise abundant molecular family. These data underline the utility and broad application of the technology for the prioritization of minor bioactive compounds in complex extracts.
Similar content being viewed by others
Introduction
The increasing number of antibiotic resistant bacteria combined with the decreasing number of new antibiotics discovered is becoming a serious threat to public health. The traditional source of antibiotics is natural products produced by microorganisms, whereby some 70% are produced by Actinobacteria1,2. The canonical antibiotic discovery approach, known as the “Waksman platform”3, involves high-throughput screening of Actinobacteria for antimicrobial activity, but while this approach was highly successful in the golden era of antibiotic discovery, nowadays that approach fails to identify truly novel classes of compounds4. Still, we have likely only exploited a fraction of the chemical space of microbial natural products5. Indeed, the molecules that are currently described6 likely only represent 3% of the extant chemical space of natural products7. The challenge lies in the development of technologies to access the undiscovered chemical space of natural products. Genome sequencing revealed that Streptomyces genomes each contain up to 60 or even more biosynthetic gene clusters (BGCs) that specify natural products8. Bioinformatic tools such as antiSMASH9 allow visualizing these BGCs and predicting the classes of molecules they produce. However, many of these BGCs are expressed at a very low level under routine laboratory conditions, and hence their cognate products cannot be elucidated. Actinobacteria are usually grown in isolation and in laboratory media that do not accurately mimic the complexity of their natural habitat. The control of BGCs is most likely tightly tied to the competitive ecological conditions in which antibiotic production evolved10,11.
Several methods have been developed to boost the production of hitherto hidden natural products, including ribosome engineering, co-cultivation, OSMAC (One strain-many compounds), promoter replacement, and high-throughput elicitor screening (reviewed in refs. 12,13). Metabolites are identified by liquid chromatography coupled to mass spectrometry (LC–MS) and NMR spectroscopy. Statistical models such as Pearson correlation, partial least squares (PLS), discriminant analysis (PCA-DA, PLS-DA, OPLS-DA), and hierarchical cluster analysis (HCA) are then used to link metabolite fingerprints to bioactivity data14. Identifying the bioactive compounds of interest is often time-consuming and expensive, while efficient dereplication is essential to avoid wasting time and resources on the rediscovery of known compounds. Global Natural Products Social (GNPS) molecular networking developed in the Dorrestein laboratory15 is an important addition to the toolbox. Although these approaches provide lists of highlighted features, the information obtained is not immediately linked to bioactivity. Several analytical modules involving different bioassays and detection technologies can be linked to allow simultaneous bioactivity evaluation and identification of compounds present in small amounts (analytical scale) in complex compound mixtures. Such approaches include the at-line high-resolution nanofractionation, which could complement previously mentioned strategies and provide a more comprehensive insight into compounds responsible for the activity16,17.
One of the major challenges is the lack of truly novel chemical scaffolds, which may serve as the basis for the antibiotics of tomorrow. We discovered lugdunomycin that is produced by the soil-derived Streptomyces sp. QL3718. Lugdunomycin is a highly rearranged member of the angucyclines, the largest family of type II polyketides19,20, and has a radically different chemical scaffold as compared to other polyketides. Its discovery underlines that novel chemical space may still lie hidden even in highly explored chemical families.
To allow more efficient mining of microbial extracts for novel chemical space, we developed an analytical platform for the rapid and efficient identification and dereplication of bioactive metabolites based on nanofractionation (nanoRAPIDS). NanoRAPIDS combines at-line high-resolution nanofractionation21 coupled to LC–MS/MS and featured-based molecular networking (FBMN)22 in a single analysis. The implementation of MZmine23 in our platform allows for automatic peak selection and correlation of the bioactivity with the appropriate mass while the same workflow performs FBMN. This allows dereplication and analog searches in a semi-quantitative fashion. The nanoRAPIDS workflow was validated by identifying a suite of iturins and surfactins produced by Bacillus sp. 90A-23 based on their bioactivity against Escherichia coli, Bacillus subtilis, and/or Aspergillus niger in nanoliter fractions. We then applied the technology to prioritize bioactive angucyclines produced by Streptomyces sp. MBT84 following challenge by catechol. This afforded the isolation of a hitherto unknown N-acetylcysteine conjugate of saquayamycin N.
Results and discussion
Development of the nanoRAPIDS platform
The challenge we sought to overcome is that bioactivity-guided fractionation is typically frustrated by highly abundant known antibiotics, which obscures bioactive compounds that are produced at low levels. This is particularly true for screening of small culture volumes, such as those obtained through HT cultivation. Therefore, we developed an antibiotic discovery platform based on nanofractionation16, mass spectrometry, and bioactivity assays. This set-up was combined with automated data pre-processing in MZmine 2.5323 and feature-based molecular networking22. The pipeline is designated nanoRAPIDS, for a Reliable Analytical Platform for Identification and Dereplication of Specialized metabolites based on nanofractionation. The schematic overview of the proposed workflow is shown in Fig. 1.
The nanoRAPIDS pipeline makes use of an at-line nanofractionation set-up21. First, extracts are separated by liquid chromatography (LC) and after the post-column split, the largest part of the flow is fractionated using high-resolution nanofractionation, while a small part is subjected to mass spectrometry (MS and MS/MS), allowing the parallel data collection of all fractions. Next, the individual fractions are tested against target microorganisms such as Gram-positive, Gram-negative bacteria, and/or fungi with a resazurin reduction assay. Since the at-line nanofractionation is performed at a 6 s resolution, the retention time of eluting compounds from the LC separation is retained in the bioactivity chromatograms. After at-line nanofractionation, the LC-MS/MS spectral data are automatically processed with MZmine23, which gives the information on the m/z values of the bioactive fraction(s) constituents and their MS/MS spectra. Pre-processed MS2 data are subsequently uploaded onto the GNPS web-platform15 for feature-based molecular networking22. Molecular networking is then performed to visualize all ions of molecules detected and fragmented during an MS experiment and identify the chemical relationships between them in a semi-quantitative fashion. Additionally, GNPS automatically performs a spectral library search for known molecules in the molecular networks if their MS/MS spectra are available in public MS/MS spectral libraries. The final step of the analysis consists of mapping of the bioactive compounds within the obtained network in Cytoscape24, based on their m/z and retention time (Fig. 1b).
Validation of nanoRAPIDS by screening Bacillus sp. 90A-23 extracts
Bacillus sp. 90A-23 produces several bioactive compounds with broad spectrum activity, many of which are well characterized, including iturins and surfactins, which both consist of families of molecules. To validate the platform, crude extracts were subjected to analysis using nanoRAPIDS (Fig. 2). Automated feature detection and correlation guided by high-resolution bioactivity assays identified eight mass features with m/z 1029.5404 at 16.32 min, m/z 1043.5591 at 18.05 min, m/z 1057.5642 at 19.22 min, m/z 1071.5880 at 20.64 min, m/z 1071.5880 at 20.95 min, m/z 1008.6698 at 27.91 min, m/z 1022.6782 at 28.84 min and m/z 1036.6921 at 29.47 min (Table 1). It is important to note that some compounds may remain undetected due to the limitation of the ionization method from the MS system.
The filtered peak list generated in MZmine was exported for GNPS molecular networking. The network consisted of 852 nodes, which were clustered in 92 spectral families and 134 individual nodes. The bioactive compounds with m/z 1029.5404 at 16.32, m/z 1043.5591 at 18.05 min, m/z 1057.5642 at 19.22 min, m/z 1071.5880 at 20.64 min, m/z 1071.5880 at 20.95 min were found in a 19-node cluster (Fig. 2c). Search each MS/MS spectrum against the GNPS spectral libraries resulted in the annotation of m/z 1043.5591, m/z 1057.5642, m/z 1071.5880 as iturin A2, A3/4/5 and A6/7, respectively. Annotation of these compounds was supported by a direct comparison of the MS/MS spectra with the standards of iturins from GNPS by mirror plot using mass software25 (Figs. S2–S4). The compound with m/z 1029.5404 at 16.32 min was annotated as iturin A1 using the Antibase database26 and manual comparison of MS/MS spectra with those in the literature27. These amphiphilic compounds are characterized by a peptide ring of seven amino acid residues, closed by a β-amino fatty acid (Fig. 3). Iturins are a family of lipopeptides produced by various Bacilli species28,29,30. The 14 Da difference between molecules with m/z 1029.5404, m/z 1043.5591, m/z 1057.5642, and m/z 1071.5880 is explained by variation in length of the fatty acid chain, containing 13, 14, 15 or 16 carbons, respectively (Fig. S5).
The major peaks in the bioactivity chromatogram at 18.05 and 19.22 min corresponded to the most intense compounds, which were dereplicated as iturin A2 (m/z 1043.5591) and iturin A3/A4 (m/z 1057.5642). Moreover, 12 more mass spectral features were also identified as iturins based on analog searches. The bioactive features with m/z 1008.6698 at 27.91 min, m/z 1022.6782 at 28.84 min and m/z 1036.6921 at 29.47 min were detected within the surfactin spectral family (Fig. 2c).
The molecule with m/z 1008.6698 was annotated as surfactin Leu/Ile7 C13, that with m/z 1022.6782 as surfactin Leu/Ile7 C14 and the compound with m/z 1036.6921 as surfactin Leu/Ile7 C1531 (Figs. S6–S9). An additional 23 other spectral features also belonged to the same spectral family.
In the initial screen, Bacillus sp. 90A-23 showed bioactivity against both bacteria and fungi. NanoRAPIDS allowed us to readily establish which detected individual mass features contributed to the bioactive nanofraction. The analysis revealed that the initially detected antifungal and antibacterial activity of the crude extract was mostly caused by iturins and surfactins (Fig. 2 and Table 1).
Identification of a N-acetylcysteine conjugate in a complex network of angucyclines in Streptomyces sp. MBT84
Streptomycetes typically produce a large variety of angucyclines, which are the largest family of type II polyketides, many of which with significant bioactivity as antibiotics and/or anticancer agents20. We, therefore, identified this molecular family as an ideal challenge for nanoRAPIDS, seeking to identify minor molecules with bioactivity without prior pre-purification. We recently showed that there is still a large chemical space that remains to be elucidated18,32. Streptomyces sp. MBT84 produces a particularly large number of angucyclines and derivatives, many of which are produced specifically when catechol is added as an elicitor to the culture media32. To identify the nature of the metabolites produced by Streptomyces sp. MBT84, in response to catechol, the strain was streaked on MM agar plates, and the metabolites were then extracted from the spent agar after 5 days of growth using ethyl acetate (EtOAc). The crude extracts of the catechol-grown cultures showed increased bioactivity against B. subtilis 168, as compared to those grown under control conditions32. To identify which mass features correlated to the increased bioactivity of Streptomyces sp. MBT84, crude extracts were screened with the nanoRAPIDS platform (Fig. 4).
Following nanofractionation, the chromatogram showing the bioactivity against B. subtilis 168 showed 14 peaks with negative maxima (Fig. 4a). Automated peak detection combined with antimicrobial activity assays allowed the selection of the individual bioactive features (Fig. 4b), which were mapped onto the enhanced feature-based molecular network. Combining the output from molecular networking, MS2LDA, and in silico annotation, the MolNetEnhancer33 workflow predicted an angucycline molecular family (Fig. 4c). The bioactive angucyclines were annotated by manual comparison of the molecular formula and monoisotopic mass using the Dictionary of Natural Products (Table 2). Notably, all selected features were more abundant in the extracts from cultures supplemented with catechol.
Multiple mass features were detected in the bioactive nanofractions, mostly due to in-source fragmentation (Table 2). For example, for peak 1, three mass features were selected, namely, m/z 601.228, m/z 487.1601, and m/z 469.1515, that correspond to the mass of protonated aquayamycin derivative, in-source sugar loss and subsequent water loss. Two of the 14 bioactivity peaks, namely 3 and 6, were caused by co-elution of two different compounds. To our knowledge, the compounds with m/z 758.2166 and m/z 615.2071, responsible for the bioactivity of peak 3, have not been previously described. Interestingly, the mass feature with m/z 758.2166 did not cluster with the angucycline spectral family and was classified as an aminocyclitol glycoside based on the enhanced molecular network annotation (Fig. 4c).
Large-scale fermentation was performed to elucidate the structure of this compound. Streptomyces sp. MBT84 was grown confluently on 12 × 12 mm MM agar plates with 50 µM catechol, and the metabolites were extracted with EtOAc. Following chromatographic separation, 1 (red amorphous powder) and 2 (yellow amorphous powder) were isolated. Compound 1 had a molecular formula of C36H39NO15S, with 18 degrees of unsaturation based on high resolution-electrospray ionization-mass spectrometry (HRESIMS) analysis. Detailed NMR analysis showed that 1 is a derivative of saquayamycin B1 (Figs. S10–S13 in Supplementary data file 1)34. This derivative was named saquayamycin N. The derivatization is at C-5, which showed in the NMR a signal for a tetrasubstituted carbon (δc 162.6) instead of a methine group in saquayamycin B1. This was established through the HMBC correlations from H2-4 and H-6, which is now a singlet proton (δH 6.65, s), to C-4a and C-5 (Fig. S14 in Supplementary data file 1). An N-acetylcysteine moiety was identified as the remaining part of 1 based on the observed remaining NMR signals and correlations, together with the established molecular formula. The N-acetylcysteine was connected to C-5 of the saquayamycin B core through the sulfur atom based on the HMBC and NOESY correlations observed from the protons of the methylene group in N-acetylcysteine to C-5 and H-6, respectively (Figs. S14 and S15 in Supplementary data file 1). The absolute configuration of compound 1 could not be determined due to the limited amount of material purified. The configuration is most likely 3R, 4aS, 12bS, as was consistently observed for all saquayamycins identified so far34,35. For the N-acetylcysteine moiety, an l-cysteine would be expected. The molecular formula of compound 2 was established to be C25H26O10, with 13 degrees of unsaturation, based on HRESIMS analysis. 1D and 2D NMR analysis (Figs. S16–S22 in Supplementary data file 1) revealed that 2 is the previously known fridamycin A36.
The structures of saquayamycin N (1) and fridamycin A (2) are shown in Fig. 5. Compounds 1 and 2 showed moderate activity against B. subtilis 168, with a MIC of 125 µg/mL, indicating low bioactivity. The observed bioactivity for 1 and 2 aligned with the data obtained in the resazurin reduction assay and corroborated the contribution of saquayamycin N and fridamycin A to the bioactivity observed in peaks 2 and 3, respectively (Fig. 4). The resazurin reduction assay in the nanofractionation setup only has a short incubation time, and this is radically different from—and more sensitive than—the MIC assays which are based on overnight cultures. To validate the contribution of saquayamycin N and fridamycin A to the activity detected in the crude extracts, the purified compounds were pipetted in a 384-well plate, and the resazurin reduction assay was performed as previously described. The selected concentrations were based on the MIC values, namely 125 (MIC value), 12.5, and 1.25 µg/mL concentrations (sub-MIC values). The MIC and sub-MIC concentration of 12.5 µg/mL resulted in peaks with significant negative maxima for both saquayamycin N and fridamycin A (Fig. S23).
The potential of these two isolated molecules should be addressed by further investigation of their anticancer activity, as the compounds from the angucycline family are well-known for their anticancer properties37.
NanoRAPIDS is a pipeline based on at-line nanofractionation technology, which enables the detection of individual mass features that contribute to the bioactivity of nanofractions. If multiple fractions with bioactivity are identified, GNPS will allow their linkage if, indeed they would belong to the same (known or unknown) family. Thus, the nanoRAPIDS platform has the potential to go beyond the discovery of known specialized metabolites and facilitates the identification of new natural product families. Nevertheless, the discovery of novel bioactive natural products remains a substantial challenge in the field of natural product discovery.
Conclusions
The nanoRAPIDS platform is an efficient and cost-effective technology based on nanofractionation, which allows the identification of low-abundance molecules, and in particular antibiotics, in small culture extracts against a background of abundantly present known antibiotics. The powerful combination of analytical LC–MS/MS, bioassays, and open bioinformatic tools, including MZmine, GNPS Web-platform, and Cytoscape, creates an attractive pipeline for natural product discovery. Only minimal amounts of the crude extract are required for LC–MS/MS injection, allowing a significant reduction of fermentation volumes. The analysis of a single sample using the nanoRAPIDS platform requires approximately 24 h. Therefore, multiple extracts obtained from a range of culturing conditions can be rapidly tested for bioactive compounds and dereplicated, which is particularly powerful when combined with elicitation approaches in antibiotic discovery12,13.
Proof of concept for the technology was obtained by identification of the bioactive members of the iturin and surfactin molecular families in extracts of Bacillus sp. 90A-23, and then successfully applied to discover the angucycline saquayamycin N with a unique cysteine moiety in extracts of Streptomyces sp. MBT84. The method is versatile and can be readily applied to screen nanoscale-level extracts or biological mixtures against a range of indicator strains.
Methods
Bacterial strains and culturing conditions
Bacillus sp. 90A-23 was obtained from the Auburn strain collection and had previously been isolated from the peanut rhizosphere in Tallassee, AL, USA. The strain was fermented in 100 mL Erlenmeyer flasks with 30 mL of tryptic soy broth (TSB) at 30 °C at 220 rpm for 3 days. Streptomyces sp. MBT8432 was obtained from the Leiden University strain collection and had been isolated from soil samples collected from the Qinling mountains (Shanxi province, China38). Streptomyces sp. MBT84 was grown confluently on minimal media (MM) agar plates with and without 100 µM catechol for 5 days at 30 °C. Escherichia coli strain AS19-RlmA-39, Bacillus subtilis 16840, and Aspergillus niger N40241 were used as indicator strains for bioactivity assays. The bacterial strains were grown on Luria-Bertani (LB) agar at 37 °C for 24 h, A. niger was propagated on complete medium (CM) agar at 25 °C for 48 h.
Metabolite extraction
Following fermentation of Bacillus sp. 90A-23, 1.5 g of Diaion® HP-20 (Resindion, Mitsubishi) was added to the cultures and stirred overnight at 4 °C. The resin was collected by filtration, washed with distilled water, resuspended in 30 mL methanol, and maintained at 4 °C for 4 h. The resulting methanol fraction was collected and concentrated under reduced pressure to a final concentration of 15 mg/mL. Streptomyces sp. MBT84 was grown on MM agar plates for 5 days, after which the agar was cut into small pieces and soaked overnight in ethyl acetate (EtOAc)32. The solvent was evaporated at room temperature, and the dry extracts were dissolved in MeOH to a final concentration of 4 mg/mL.
Liquid chromatography, at-line nanofractionation and mass spectrometry
Liquid chromatography separation, subsequent at-line nanofractionation, and parallel mass spectrometry analysis were performed in an automated fashion. For LC-MS analyses, 10 μL were injected into Waters Acquity UPLC system equipped with XBridge Peptide BEH C18 column (5 µm, 300 Å, 4.6 × 100 mm). The flow rate used was 0.6 mL/min. Solvent A was 0.1% formic acid in water; solvent B was 0.1% formic acid in ACN. The gradient started with 100% A and was from the start linearly increased to 50% B over 20 min, and then linearly increased to 90% B over 5 min, then held at 90% for 5 min, and then increased to 100% B over 1 min and held at 100% B for 10 min. After the column, the flow was split at a 1:9 ratio. The smaller fraction was sent to the Waters Acquity photodiode array detector followed by a Thermo Instruments MS system (LTQ Orbitrap XL, Bremen, Germany) equipped with an electrospray ionization source (ESI). As for the MS, the following ESI parameters were used42: capillary voltage 5 V, spray voltage 3.5 kV, capillary temperature 300 °C, auxiliary gas flow rate 10 arbitrary units, and sheath gas flow rate 50 arbitrary units. Full MS spectra were acquired with the Orbitrap in positive mode at a mass range of 100–2000 m/z and FT resolution of 30,000. Data-dependent MS2 spectra were acquired in the ion trap for the three most intense ions using collision-induced dissociation (CID). The larger fraction was sent to the chip-based nano-electrospray ionization source/fractionation robot (NanoMate Triversa, Advion BioSciences). The nanofractions were collected every six seconds onto black 384-well plates (Greiner Bio One, Alphen aan den Rijn, The Netherlands) in a serpentine pattern. Each sample was collected into 350 wells of the 384-well plate. After the nanofractionation, the plates were dried under vacuum. In order to determine the delay between the MS trace and the bioassay trace, nalidixic acid was dissolved in distilled water to a final concentration of 0.5 mg/mL, and 1 μL was fractionated (Fig. S1).
Resazurin reduction assay
The antibacterial and antifungal activities of the nanofractionated Bacillus sp. 90A-23 extract was determined in a resazurin reduction assay. Indicator strains B. subtilis 168 and E. coli ASD19 were cultured in liquid LB medium at 37 °C to exponential phase and diluted in LB to the final OD600 of 0.005. Resazurin solution was then added to the cells to a final concentration of 15 μg/mL, and 25 μL of this mixture was directly pipetted into each well of 384-well plates containing dried nanofractions. Controls were pipetted in the last column of the plate: positive control (cells with resazurin and ampicillin (0.2 mg/mL)), growth control (cells and resazurin), blank (LB with resazurin), and sterility control (only LB). Directly after this, the plates were spun down for 10 s at 1500 rpm and subsequently incubated without the lids in the plastic bags in a humidified incubator at 37 °C for 2–3 h21.
Spores of A. niger N402 were harvested from CM agar plates, pregerminated in complete medium for 5 h at 30 °C, and diluted in CM to the final titer of 1 × 105 spores/mL. Each well of 384-well plate with dried nanofraction was filled with 25 µL of A. niger spore solution. Amphotericin B (0.02 mg/mL) was added to the A. niger spores in the positive control wells. Subsequently, the plates were centrifuged and incubated at 30 °C overnight, as explained above. Finally, resazurin was added to each well to a final concentration of 60 μg/mL and incubated for 45 min at 37 °C. After the incubation, the fluorescence was measured as a single point measurement using a Victor3 Plate Reader (PerkinElmer, Inc., Waltham, MA, USA) at 550 nm excitation and 590 nm emission wavelengths. The fluorescent readout was normalized by dividing each value measured with the median of all the values obtained in a single measurement. Subsequently, the bioactivity chromatograms were plotted in GraphPad Prism 8 software (La Jolla, CA, USA) in a graph showing the normalized response of each nanofraction versus the time at which each nanofraction was collected.
Alignment of chromatograms
Alignment of the different chromatographic plots was conducted through a semi-automated process. Initially, the delay introduced by the tubing setup was determined by the injection of 0.5 μg of nalidixic acid in the nanoRAPIDS set-up, and E. coli ASD 19 was used as the indicator strain (Figure S1). Once the delay was quantified in 0.3 min, and the bioactivity chromatogram was adjusted based on this determined delay. Subsequently, both the mass spectrometry (MS) chromatogram, utilizing automatic peak detection, and the bioactivity chromatogram were standardized to the same size. Given the identical time function, the chromatograms were then aligned to the center, ensuring accurate synchronization of the two datasets. This semi-automated alignment procedure aimed to enhance precision in data interpretation and visualization, facilitating meaningful comparisons between the MS and bioactivity chromatograms.
Feature-based GNPS networking
A molecular network was created with the feature-based molecular networking workflow22 using Global Natural Product Social molecular networking (GNPS)15. Prior to feature-based GNPS networking, the MS files in mzXML format were imported into MZmine 2.5323 for data processing. Mass ion peaks were detected using the centroid algorithm with a noise level set to 1.0 × 104 for MS scans and 0 for MS2 scans. Afterwards, chromatograms were built for the detected masses with a minimum group size of 10, m/z tolerance of 0.001 m/z, group intensity threshold of 1.0 × 104, and a minimum height of 5.0 × 104. Chromatogram deconvolution was then performed using a local minimum search algorithm (search minimum in RT range 0.1 min, chromatographic threshold 90%, minimum relative height 1%, minimum absolute height 1.0 × 104, minimum ratio of peak top/edge two and peak duration range 0.05–3 min). A feature with its MS2 scans was paired using a 0.05 m/z range and retention time range of 1 min. In the generated peak lists, isotopes were identified using isotopic peaks grouper (m/z tolerance 0.001 m/z and retention time tolerance 0.1 min). Duplicate peaks were filtered using the single feature and old average mode by deleting the duplicate rows with m/z tolerance of 0.001 m/z in an RT range of 0.05 min and with an m/z tolerance of 1 m/z in an RT range of 0.05 min, respectively. The resulting peak list was filtered with a rows filter to keep only the peaks with MS2 scans. The filtered peak list was exported to the feature quantification table and the MS2 spectral summary file for feature-based molecular networking. On GNPS, the precursor ion mass tolerance was set to 0.02 Da and the MS/MS fragment ion tolerance to 0.9 Da. A molecular network was then created where edges were filtered to have a cosine score above 0.7 and more than three matched peaks. All matches kept between network spectra and library spectra were required to have a score above 0.7 and at least 3 matched peaks. To enhance chemical structural information within the molecular network, information from in silico structure annotations from GNPS Library Search, Network Annotation Propogation43, Dereplicator44 were incorporated into the network using the GNPS MolNetEnhancer workflow33. Chemical class annotations were performed using the ClassyFire chemical ontology45. The molecular networks were visualized using Cytoscape24 software. The molecular networking job from the extract of Bacillus sp. 90A-23 can be publicly accessed at https://gnps.ucsd.edu/ProteoSAFe/status.jsp?task=1ba65a809b17413694522c27b0d952df. The molecular networking job from the extract of Streptomyces sp. MBT84 can be publicly accessed at https://gnps.ucsd.edu/ProteoSAFe/status.jsp?task=15e986d4356f41b4962baab398c571fa. LC–MS/MS data were deposited in the MassIVE Public GNPS data set (MSV000094019).
Large-scale fermentation, extraction, and fractionation
Large-scale fermentation of Streptomyces sp. MBT84, extraction, and fractionation were performed as previously described32. Briefly, Streptomyces sp. MBT84 was grown on solid MM supplemented with 50 µM catechol, 1% glycerol, and 0.5% mannitol at 30 °C for five days. Agar plates were extracted with EtOAc, and 1.6 g of crude extract was obtained. This extract was adsorbed onto 1.6 g silica gel (pore size 60 Å, 70–230 mesh, Sigma Aldrich), and loaded on a silica column, followed by gradient elution using mixtures of n-hexane, EtOAc, and MeOH. The fractions that eluted with 50% MeOH: 50% EtOAc were combined with the fraction that eluted with 100% MeOH and reconstituted in 50% ACN: 50% H2O. The resulting fraction was subjected to a SunFire C18 column (10 μm, 100 Å, 19 × 150 mm) and eluted with H2O (solvent A) and ACN (solvent B) with 0.1% formic acid in a gradient of solvent B 30–70% in 30 min, at a flow rate of 15 mL/min. Fractions A–C were collected every 10 min. The fraction A was further purified on a semi-preparative SunFire C18 column (5 μm, 100 Å, 10 × 250 mm), run at 3 mL/min, and eluted using a H2O and ACN with 0.1% formic acid gradient of 47–50% in 20 min, to yield compound 1 (0.8 mg) and compound 2 (2.6 mg). NMR data were acquired on Bruker Ascend 850 NMR spectrometer (850 MHz) and interpreted using the MestreNova V.14 software.
Saquayamycin N (1): red amorphous powder; 1H and 13C NMR data, see Table S1; HRESIMS (positive mode) m/z 758.2103 [M + H]+ (calcd. for C36H40NO15S, 758.21187).
Fridamycin A (2): yellow amorphous powder; 1H and 13C NMR data, see Table S2; HRESIMS (positive mode) m/z 487.1606 [M + H]+ (calcd. for C25H27O10, 487.1604).
MIC tests
The minimum inhibitory concentration (MIC) was determined by the broth microdilution method using the British Standard BS EN ISO 20776–1:200646,47,48,49. A stock solution of compounds 1 and 2 was made by dissolving them in MeOH to a concentration of 1 mg/mL. B. subtilis 168 was grown from an overnight culture until an OD600 of 0.3 in LB and diluted until a concentration of bacteria of 1 × 106 CFU/mL in fresh broth. Two-fold serial dilutions of compounds 1 and 2 were tested in the concentration range of 250–0.12 μg/mL in triplicates. Growth control wells containing 100 μL of 5 × 105 CFU/mL were included without test compounds. After overnight incubation at 37 °C, inhibition was defined as no visible growth compared to the growth observed in the control wells.
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Data availability
All datasets generated and/or analyzed during the current study are publicly available. The metabolomics dataset is available in the GNPS repository. The data from the extract of Bacillus sp. 90A-23 can be publicly accessed at https://gnps.ucsd.edu/ProteoSAFe/status.jsp?task=1ba65a809b17413694522c27b0d952df. The molecular networking job from the extract of Streptomyces sp. MBT84 can be accessed at https://gnps.ucsd.edu/ProteoSAFe/status.jsp?task=15e986d4356f41b4962baab398c571fa. LC–MS/MS data and metadata were deposited in the MassIVE Public GNPS data set (MSV000094019). Copies of the NMR spectra of the metabolites identified in this study are provided in the Supplementary Data 1 file, and the raw data are available upon request.
References
Barka, E. A. et al. Taxonomy, physiology, and natural products of the Actinobacteria. Microbiol. Mol. Biol. Rev. 80, 1–43 (2016).
Bérdy, J. Bioactive microbial metabolites. J. Antibiot. 58, 1–26 (2005).
Wright, G. D. Something old, something new: revisiting natural products in antibiotic drug discovery. Can. J. Microbiol. 60, 147–154 (2014).
Baltz, R. H. Renaissance in antibacterial discovery from actinomycetes. Curr. Opin. Pharmacol. 8, 557–563 (2008).
Genilloud, O. The re-emerging role of microbial natural products in antibiotic discovery. Antonie Van Leeuwenhoek 106, 173–188 (2014).
van Santen, J. A. et al. The natural products atlas: an open access knowledge base for microbial natural products discovery. ACS Cent. Sci. 5, 1824–1833 (2019).
Gavriilidou, A. et al. Compendium of specialized metabolite biosynthetic diversity encoded in bacterial genomes. Nat. Microbiol. 7, 726–735 (2022).
Nett, M., Ikeda, H. & Moore, B. S. Genomic basis for natural product biosynthetic diversity in the actinomycetes. Nat. Prod. Rep. 26, 1362–1384 (2009).
Blin, K. et al. antiSMASH 6.0: improving cluster detection and comparison capabilities. Nucleic Acids Res. 49, W29–W35 (2021).
Chevrette, M. G. et al. Evolutionary dynamics of natural product biosynthesis in bacteria. Nat. Prod. Rep. 37, 566–599 (2020).
Cornforth, D. M. & Foster, K. R. Competition sensing: the social side of bacterial stress responses. Nat. Rev. Microbiol. 11, 285–293 (2013).
van Bergeijk, D. A., Terlouw, B. R., Medema, M. H. & van Wezel, G. P. Ecology and genomics of Actinobacteria: new concepts for natural product discovery. Nat. Rev. Microbiol. 18, 546–558 (2020).
Yoon, V. & Nodwell, J. R. Activating secondary metabolism with stress and chemicals. J. Ind. Microbiol. Biotechnol. 41, 415–424 (2014).
Ory, L. et al. Targeting bioactive compounds in natural extracts—development of a comprehensive workflow combining chemical and biological data. Anal. Chim. Acta 1070, 29–42 (2019).
Wang, M. et al. Sharing and community curation of mass spectrometry data with Global Natural Products Social Molecular Networking. Nat. Biotechnol. 34, 828–837 (2016).
Mladic, M. et al. At-line nanofractionation with parallel mass spectrometry and bioactivity assessment for the rapid screening of thrombin and factor Xa inhibitors in snake venoms. Toxicon 110, 79–89 (2016).
Zietek, B. M. et al. Nanofractionation platform with parallel mass spectrometry for identification of CYP1A2 inhibitors in metabolic mixtures. SLAS Discov. 23, 283–293 (2017).
Wu, C. et al. Lugdunomycin, an angucycline-derived molecule with unprecedented chemical architecture. Angew. Chem. Int. Ed. Engl. 58, 2809–2814 (2019).
Fotso, S., Mahmud, T., Zabriskie, T. M., Santosa, D. A. & Proteau, P. J. Rearranged and unrearranged angucyclinones from Indonesian Streptomyces spp. J. Antibiot. 61, 449–456 (2008).
Kharel, M. K. et al. Angucyclines: biosynthesis, mode-of-action, new natural products, and synthesis. Nat. Prod. Rep. 29, 264–325 (2012).
Mladic, M. et al. Detection and identification of antibacterial proteins in snake venoms using at-line nanofractionation coupled to LC–MS. Toxicon 155, 66–74 (2018).
Nothias, L.-F. et al. Feature-based molecular networking in the GNPS analysis environment. Nat. Methods 17, 905–908 (2020).
Pluskal, T., Castillo, S., Villar-Briones, A. & Oresic, M. MZmine 2: modular framework for processing, visualizing, and analyzing mass spectrometry-based molecular profile data. BMC Bioinform. 11, 395 (2010).
Shannon, P. et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 13, 2498–2504 (2003).
Strohalm, M., Kavan, D., Novak, P., Volny, M. & Havlicek, V. mMass 3: a cross-platform software environment for precise analysis of mass spectrometric data. Anal. Chem. 82, 4648–4651 (2010).
Laatsch, H. AntiBase 2014: the natural compound identifier, Vol. 313 (Wiley-Vch, Weinheim, Germany, 2011).
Pathak, K. V. & Keharia, H. Identification of surfactins and iturins produced by potent fungal antagonist, Bacillus subtilis K1 isolated from aerial roots of banyan (Ficus benghalensis) tree using mass spectrometry. 3 Biotech 4, 283–295 (2014).
Sood, S. et al. Paenilarvins: Iturin family lipopeptides from the honey bee pathogen Paenibacillus larvae. Chembiochem 15, 1947–1955 (2014).
Chung, Y. R., Kim, C. H., Hwang, I. & Chun, J. Paenibacillus koreensis sp. nov., a new species that produces an iturin-like antifungal compound. Int. J. Syst. Evol. Microbiol. 50, 1495–1500 (2000).
Hertlein, G. et al. Biological role of paenilarvins, iturin-like lipopeptide secondary metabolites produced by the honey bee pathogen Paenibacillus larvae. PLoS ONE 11, e0164656 (2016).
Ma, Y. et al. Identification of lipopeptides in Bacillus megaterium by two-step ultrafiltration and LC-ESI-MS/MS. AMB Express 6, 79 (2016).
Van Bergeijk, D. A. et al. The ubiquitous catechol moiety elicits siderophore and angucycline production in Streptomyces. Commun. Chem. 5, 14 (2022).
Ernst, M. et al. MolNetEnhancer: enhanced molecular networks by integrating metabolome mining and annotation tools. Metabolites 9, https://doi.org/10.3390/metabo9070144 (2019).
Uchida, T. et al. Saquayamycins, new aquayamycin-group antibiotics. J. Antibiot. 38, 1171–1181 (1985).
Shaaban, K. A., Ahmed, T. A., Leggas, M. & Rohr, J. Saquayamycins G–K, cytotoxic angucyclines from Streptomyces sp. including two analogues bearing the aminosugar rednose. J. Nat. Prod. 75, 1383–1392 (2012).
Krohn, K. & Baltus, W. Synthesis of rac- and ent-fridamycin E. Tetrahedron 44, 49–54 (1988).
Korynevska, A. et al. Mechanisms underlying the anticancer activities of the angucycline landomycin E. Biochem. Pharm. 74, 1713–1726 (2007).
Zhu, H. et al. Eliciting antibiotics active against the ESKAPE pathogens in a collection of actinomycetes isolated from mountain soils. Microbiology 160, 1714–1725 (2014).
Liu, M. & Douthwaite, S. Resistance to the macrolide antibiotic tylosin is conferred by single methylations at 23S rRNA nucleotides G748 and A2058 acting in synergy. Proc. Natl Acad. Sci. USA 99, 14658–14663 (2002).
Barbe, V. et al. From a consortium sequence to a unified sequence: the Bacillus subtilis 168 reference genome a decade later. Microbiology 155, 1758–1775 (2009).
Wu, C. et al. Expanding the chemical space for natural products by Aspergillus–Streptomyces co-cultivation and biotransformation. Sci. Rep. 5, 10868 (2015).
Machushynets, N. V. et al. Discovery of actinomycin L, a new member of the actinomycin family of antibiotics. Sci. Rep. 12, 2813 (2022).
da Silva, R. R. et al. Propagating annotations of molecular networks using in silico fragmentation. PLoS Comput. Biol. 14, 1006089 (2018).
Mohimani, H. et al. Dereplication of peptidic natural products through database search of mass spectra. Nat. Chem. Biol. 13, 30–37 (2017).
Djoumbou Feunang, Y. et al. ClassyFire: automated chemical classification with a comprehensive, computable taxonomy. J. Cheminform. 8, 61 (2016).
Chemother, A., Abubacker, M. N., Ramanathan, R. & Senthil, K. T. In vitro antifungal activity of Cassia alata Linn flower extract. Nat. Prod. Radiance 7, 6–9 (2008).
Uiterweerd, M. T., Santiago, I. N., van der Heul, H., van Wezel, G. P. & Minnaard, A. J. Iso‐maleimycin, a constitutional isomer of maleimycin, from Streptomyces sp. QL37. Eur. J. Org. Chem. 2020, 5145–5152 (2020).
International Organization for Standardization. British Standard BS EN ISO 20776–1:2006.7 https://www.iso.org/standard/41630.html (2006).
Andrews, J. M. Determination of minimum inhibitory concentrations. J. Antimicrob. Chemother. 48(Suppl. S1), 5–16 (2001).
Ye, Y. F. et al. Identification of antifungal substance (Iturin A2) produced by Bacillus subtilis B47 and its effect on southern corn leaf blight. J. Integr. Agric. 11, 90–99 (2012).
Caldeira, A. T., Santos Arteiro, J. M., Coelho, A. V. & Roseiro, J. C. Combined use of LC–ESI–MS and antifungal tests for rapid identification of bioactive lipopeptides produced by Bacillus amyloliquefaciens CCMI 1051. Process Biochem. 46, 1738–1746 (2011).
Nanjundan, J., Ramasamy, R., Uthandi, S. & Ponnusamy, M. Antimicrobial activity and spectroscopic characterization of surfactin class of lipopeptides from Bacillus amyloliquefaciens SR1. Micro. Pathog. 128, 374–380 (2019).
Rohr, J. et al. Investigations on the biosynthesis of the angucycline group antibiotics aquayamycin and the urdamycins A and B. Results from the structural analysis of novel blocked mutant products. J. Org. Chem. 58, 2547–2551 (1993).
Sezaki, M., Hara, T., Ayukawa, S., Takeuchi, T. & Okami, Y. Studies on a new antibiotic pigment, aquayamycin. J. Antibiot. 21, 91–97 (1968).
Akhter, N. et al. Stress-driven discovery of new angucycline-type antibiotics from a marine Streptomyces pratensis NA-ZhouS1. Mar. Drugs 16, https://doi.org/10.3390/md16090331 (2018).
Lai, Z. et al. Grincamycins I-K, cytotoxic angucycline glycosides derived from marine-derived actinomycete Streptomyces lusitanus SCSIO LR32. Planta Med. 84, 201–207 (2018).
Maskey, R. P., Helmke, E. & Laatsch, H. Himalomycin A and B: isolation and structure elucidation of new fridamycin type antibiotics from a marine Streptomyces isolate. J. Antibiot. 56, 942–949 (2003).
Zheng, C., Xie, T., He, H. & Gao, S. Asymmetric total synthesis of PD-116740. Org. Lett. 23, 469–473 (2021).
Erb, A., Krauth, C., Luzhetskyy, A. & Bechthold, A. Differences in the substrate specificity of glycosyltransferases involved in landomycins A and E biosynthesis. Appl. Microbiol. Biotechnol. 83, 1067–1076 (2009).
Rohr, J. & Zeeck, A. Metabolic products of microorganisms. 240. Urdamycins, new angucycline antibiotics from Streptomyces fradiae. II. Structural studies of urdamycins B to F. J. Antibiot. 40, 459–467 (1987).
Henkel, T., Rohr, J., Beale, J. M. & Schwenen, L. Landomycins, new angucycline antibiotics from Streptomyces sp. I. Structural studies on landomycins A–D. J. Antibiot. 43, 492–503 (1990).
Shaaban, K. A., Srinivasan, S., Kumar, R., Damodaran, C. & Rohr, J. Landomycins P-W, cytotoxic angucyclines from Streptomyces cyanogenus S-136. J. Nat. Prod. 74, 2–11 (2011).
Rix, U. et al. The oxidative ring cleavage in jadomycin biosynthesis: a multistep oxygenation cascade in a biosynthetic black box. Chembiochem 6, 838–845 (2005).
Acknowledgements
We are grateful to Mark Liles for providing Bacillus sp. 90A-23. The work was supported by NACTAR grant 14439 from the Netherlands Organization for Scientific Research (NWO) to G.P.v.W.
Author information
Authors and Affiliations
Contributions
Conceptualization, G.P.v.W.; Methodology, I.N.S. and N.V.M.; Data collection and analysis, I.N.S. and N.V.M.; Investigation, D.A.v.B. and M.M.; NMR analysis, S.S.E.; Writing—original draft, I.N.S., N.V.M. and G.P.v.W.; Writing—review & editing, I.N.S., N.V.M., M.M., D.v.B., S.S.E., T.H. and G.P.v.W.; Visualization, I.N.S. and N.V.M.; Supervision, T.H. and G.P.v.W.; Funding acquisition, G.P.v.W.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Communications Chemistry thanks Louis-Félix Nothias for his contribution to the peer review of this work.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Nuñez Santiago, I., Machushynets, N.V., Mladic, M. et al. nanoRAPIDS as an analytical pipeline for the discovery of novel bioactive metabolites in complex culture extracts at the nanoscale. Commun Chem 7, 71 (2024). https://doi.org/10.1038/s42004-024-01153-y
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s42004-024-01153-y
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.