Since the discovery of streptomycin from Streptomyces griseus, many additional antibiotics have been identified from cultures of the genus Streptomyces.1, 2 Consequently, the likelihood of discovering novel secondary metabolites from Streptomyces members has dwindled over time and the focus of screening has moved to less studied genera such as non-Streptomyces called rare actinomycetes. For example, members of genera within the family Streptosporangiaceae, such as Microbispora, Nonomuraea, Planobispora and Streptosporangium, have been sources for many novel compounds.3 The genus Microtetraspora, belonging to the family Streptosporangiaceae, was once comprised of 21 species, but 16 of them were reclassified into other genera such as Nonomuraea, Thermopolyspora and Actinomadura (Supplementary Figure S1).4, 5, 6, 7 Consequently, the genus Microtetraspora now contains only five valid species, Microtetraspora fusca, M. glauca, M. malaysiensis, M. niveoalba and M. tyrrhenii. However, it was later suggested that M. tyrrhenii, based on its morphological characteristics, should be transferred to the genus Nonomuraea, but nomenclatural change for the species has not been proposed because the type strain, ATCC 53931, is no longer available at the American Type Culture Collection and there is not alternative collection.8 Although several bioactive compounds were reportedly discovered from Microtetraspora strains, the producing strains have either been reclassified into other genera or are not valid Microtetraspora species.9, 10, 11 Therefore, no bioactive compound has been discovered from any valid species in the genus Microtetraspora, and so it remains uncertain whether Microtetraspora members are useful sources of novel bioactive compounds.

Recent genomic studies of actinomycetes revealed that each actinomycete genome encodes various biosynthetic gene clusters for secondary metabolites and about half to three quarters of these clusters are associated with nonribosomal peptide synthetase (NRPS) and polyketide synthase (PKS) pathways. This suggests that nonribosomal peptides and polyketide compounds are the major secondary metabolites of actinomycetes.12 Nonribosomal peptides and polyketide compounds often show pharmaceutically useful bioactivities, and many have been developed into various drugs including antibiotics, anticancer agents and immunosuppressants. Therefore, actinomycete strains are often surveyed for the presence, abundance and novelty of NRPS and PKS genes to evaluate these strains as potential secondary metabolite producers.13, 14, 15 In the present study, we sequenced the whole genomes of all type strains from the genus Microtetraspora available for public use because no Microtetraspora genome sequences have been registered in public databases. Next, we searched for NRPS, type-I PKS (PKS-I) and type-II PKS (PKS-II) gene clusters in the genome sequences, predicted their metabolites and generated information on the novelty and diversity of the NRPS and PKS pathways to assess the potential of members of the genus as secondary metabolite producers.

Genomic DNAs of M. glauca NBRC 14761T, M. fusca NBRC 13915T, M. malaysiensis NBRC 100735T and M. niveoalba NBRC 102641T were prepared directly from liquid-dried cells in ampoules provided from the NBRC (Biological Resource Center, National Institute of Technology and Evaluation, Chiba, Japan) culture collection, using a Qiagen EZ1 tissue kit and an EZ1 Advanced instrument (Qiagen, Germantown, MD, USA). The prepared DNA was sequenced using paired-end sequencing with a MiSeq (Illumina, San Diego, CA, USA). The average sequence redundancy for the four draft genomes ranged from 56.0 to 66.8. The sequence reads were assembled using Newbler v2.6 (454 Life Sciences, Branford, CT, USA) and subsequently finished using GenoFinisher.16 Coding regions in the draft genome sequences were predicted by Prodigal v2.6.17 NRPS and PKS gene clusters were determined as previously reported.13, 14 NRPS and PKS-I genes containing only a single domain were excluded from the present analysis, because we considered them atypical in the assembly line theory and focused on multi-domain genes. A BLASTP search was performed using the NCBI Protein BLAST program against the non-redundant protein sequence database. We considered genes of distinct strains to be orthologous when their closest homologs in the BLASTP search were the same, and also when their domain organizations were identical or almost the same. AntiSMASH18 was used to predict substrates for adenylation (A) domains. Average nucleotide identity according to BLAST (ANIb) was calculated using the JSpecies program.19

Phylogenetic positions of the strains studied here, along with strains previously misidentified as Microtetraspora, are shown in Supplementary Figure S1. During this study, the draft genome sequence of M. glauca NRRL B-3735T was published (accession no. JOFO00000000.1). However, it is questionable whether the sequences are from M. glauca NRRL B-3735T, because the 16S rRNA gene sequence in JOFO01000118 showed <90% similarity to the sequence of the M. glauca type strain (D85490) and, instead, showed high (99%) similarity to Streptomyces strains. Therefore, in the present study, we did not analyze the NRPS and PKS gene clusters of JOFO01000000 and focused on only our data. Genome sizes of the four Microtetraspora type strains ranged from 8.5 to 9.3 Mb (Table 1), which places them in the middle of the range seen for Streptomyces strains (5.0–11.9 Mb) and strains in the family Streptosporangiaceae (5.5–13 Mb). We calculated ANIb values among the four strains using their genome sequences in order to confirm that each strain is an independent species, because these species show high 16S rRNA gene sequence similarities (98.02–99.86%) (Supplementary Table S1). Since the ANIb values (86.5–93.4%) were below the threshold (95–96%) corresponding to DNA relatedness value of 70% recommended as the cutoff point for the assignment of bacterial strains to the same species,19, 20 each strain is definitely independent species. The four strains each possessed 6–10 gene clusters for NRPS, PKS/NRPS hybrid, PKS-I and PKS-II pathways. The numbers of gene clusters were smaller (by about one-third to one-half) than the numbers found in Streptomyces genomes.12, 14, 21 The number of each of the four types of gene clusters found in each strain is listed in Table 1. Unlike in the published genome sequences of typical actinomycetes, no PKS-I gene cluster was present in M. glauca NBRC 14761T or M. malaysiensis NBRC 100735T, and only one PKS-I gene cluster was present in M. fusca NBRC 13915T. This suggests that PKS-I pathways are not abundant in the genus Microtetraspora, as compared with other genera of actinomycetes. Table 2 shows details of all clusters found in each genome. Orthologous genes and gene clusters are aligned in the same row of the table. These orthologous genes had the same domain organization and, therefore, their gene clusters should synthesize the same products (shown in the ‘Predicted product’ column of Table 2). Among the 18 gene clusters (nrps-1 to -11, pks/nrps-1 to -3, pks-1 to -3, pksII) identified from the four strains, three (nrps-1, nrps-2, pks/nrps-1) were conserved in all strains, five (nrps-3 to -6, pks-1) were shared by two or three strains, and ten (nrps-7 to -11, pks/nrps-2 and -3, pks-2 and -3, pksII) were strain-specific.

Table 1 Genome sequencing and numbers of NRPS and PKS gene clusters in Microtetraspora strains
Table 2 Open reading frames (ORFs) encoding modular NRPSs and PKSs in NRPS and PKS gene clusters of Microtetraspora strains

Gene clusters conserved in all four strains. The data in Table 2 suggested that three predicted products (nrps-1, nrps-2 and pks/nrps-1) are common among the four type strains. Based on their module numbers and A domain substrates, nrps-1 and -2 were predicted to synthesize a tetrapeptide containing glycine (Gly), lysine (Lys) and aspartic acid (Asp) residues, and a dipeptide containing one Asp residue, respectively. Pks/nrps-1 is a PKS/NRPS hybrid gene encoding five NRPS modules and one PKS module, suggesting a product composed of five amino acid residues and one polyketide unit. Because the NRPS and PKS-I genes in the pks/nrps-1 cluster showed high sequence homologies (70–88% identity) with those of Streptosporangium roseum, and the domain organization of Microtetraspora strains and S. roseum is the same, their products are likely to be similar.

Gene clusters shared among two to three strains. Four NRPS gene clusters (nrps-3 to -6) and one PKS gene cluster (pks-1) were shared by two to three strains. Nrps-3, nrps-4 and nrps-5 were predicted to synthesize a tetrapeptide, a tripeptide containing two alanine (Ala) residues and a dipeptide containing one cysteine (Cys) residue, respectively. Since the NRPS genes in the nrps-4 cluster showed high sequence homologies (85–88% identity) with those of S. roseum, and the domain organizations in Microtetraspora strains and S. roseum are equivalent, these strains will produce similar tripeptide compounds. We were not able to predict the peptide compound product of nrps-6 because the gene cluster encoded only a single module. Pks-1 also encoded only a single module. The PKS-I resembled that of Microbispora rosea, but the product of M. rosea has not been identified.

Strain-specific gene clusters. M. glauca NBRC 14761T, M. fusca NBRC 13915T, M. malaysiensis NBRC 100735T and M. niveoalba NBRC 15239T had one, three, three and three strain-specific NRPS and/or PKS gene clusters, respectively. M. glauca NBRC 14761T possessed one strain-specific NRPS gene cluster, nrps-7. This cluster comprised four modules encoded by two NRPS genes and one of the modules was predicted to code for Asp, suggesting that the product is a tetrapeptide containing an Asp. Because this cluster is homologous to that of S. roseum, M. glauca NBRC 14761T likely produces a tetrapeptide similar to that of S. roseum. M. fusca NBRC 13915T possessed one specific NRPS gene cluster, nrps-8; one specific PKS/NRPS gene cluster, pks/nrps-2; and one specific PKS-II gene cluster, pksII. Though nrps-8 was not completely sequenced, this cluster encoded at least nine modules and the predicted product was a peptide composed of nine or more amino acid residues including serine (Ser), threonine (Thr) and asparagine (Asn). Pks/nrps-2 also may not have been completely sequenced, but this cluster encoded at least one PKS-I module and three NRPS modules, suggesting the product contains at least three amino acid residues and a polyketide unit. PksII was predicted to synthesis a xantholipin-like polyketide, because its PKSs show 63–78% sequence identities with synthases of xantholipin and related compound.22, 23 M. malaysiensis NBRC 100735T possessed two specific NRPS gene clusters, nrps-9 and nrps-10 and one specific PKS/NRPS gene cluster, pks/nrps-3. The products of nrps-9 and nrps-10 were predicted to be a tripeptide composed of one phenylalanine (Phe) and two Cys residues, and a tripeptide containing valine (Val), respectively. Pks/nrps-3 encoded seven NRPS modules and one PKS module, suggesting that its product is a polyketide-nonribosomal peptide hybrid compound composed of seven amino acid residues and one polyketide unit. Substrate prediction for the A domains suggested that this hybrid compound contains glutamate (Glu), Ser and Gly residues. M. niveoalba NBRC 15239T possessed one specific NRPS gene cluster, nrps-11, and two specific PKS-I gene clusters, pks-2 and pks-3. Nrps-11 encoded 11 NRPS modules, one of which had a CoA-ligase domain instead of an A domain. Substrate prediction for the ten A domains indicated a product composed of a starter molecule and ten amino acid residues such as Gly, Thr, tryptophan (Trp), Asp, Val and Ser. Pks-2 was not completely sequenced, but encoded at least 22 modules, and 12 of its putative 14 PKS-Is showed sequence homologies (56–78% similarity) with those of Streptomyces ambofaciens ATCC 23877. Because these closest homologs are reportedly involved in the synthesis of stambomycin, a 51-membered macrolide,24 we predict that pks-2 will synthesize a stambomycin-like large polyketide compound. Pks-3 was also not completely sequenced but encoded at least seven PKS modules. We predict that its product is a polyketide compound including a C14 or longer polyketide chain.

Ayuso-Sacido et al.25 searched for PKS-I genes in actinomycete strains by PCR using degenerate primers and could not detect any PKS-I genes in type strains of M. fusca and M. niveoalba. However, our present study indicates that these two strains harbor PKS-I genes, suggesting the usefulness of genome analysis to survey for PKS-I genes in rare actinomycetes. This is the first report on Microtetraspora genome sequences, revealing novel and diverse NRPS and PKS pathways whose products have yet to be identified. Among the 18 types of NRPS and PKS gene clusters found in the four Microtetraspora type strains, seven (nrps-1, nrps-2, nrps-4, nrps-7, nrps-10, pks/nrps-1 and pks-1) are similar to those present in strains belonging to the genera Streptosporangium and Microbispora, both phylogenetically close to the genus Microtetraspora. This suggests that these products are not specific to Microtetraspora. In contrast, the remaining 11 types of gene clusters do not show high sequence similarities to those registered in GenBank/EMBL/DDBJ databases, suggesting they are specific to Microtetraspora. Therefore, these products are potentially novel. We suggest that Microtetraspora strains might be an untapped, rich source of secondary metabolites and should be studied further for the discovery of novel bioactive compounds.