Nocardia species are the causative agents of nocardiosis in humans and are known to produce a variety of bioactive compounds with antimicrobial, antitumor and immunosuppressive activities. Nocardia brasiliensis is often reported to produce various bioactive compounds such as indole alkaloid, terpenoid, peptide and aromatic compounds.1 Other species, including N. asteroides and N. farcinica, are not widely reported to be producers of bioactive compounds, and their products are generally siderophores.2, 3 Reports regarding the production of polyketide compounds, such as macrolides and polyethers, synthesized by modular type-I polyketide synthases (PKS-Is), are limited to a few species such as N. brasiliensis and N. terpenica,4, 5, 6, 7 although some polyketide-producing Nocardia strains are yet to be identified at the species level.8 The recent N. farcinica IFM 10152 genome sequencing project revealed the presence of only six PKS-I genes, suggesting that PKS-I genes are less abundant in N. farcinica compared with Streptomyces spp.9, 10, 11 Nfa7640 (nbtB) and nfa7650 (nbtC) are PKS-I genes in a mixed PKS/non-ribosomal peptide synthetase (NRPS) gene cluster involved in the synthesis of siderophores named nocobactins;2 however, metabolites of the other four PKS-I genes, nfa1890, nfa30250, nfa43240 and nfa55930, are still unclear, although nfa30250 is predicted to be mycocerosic acid synthase. Because none of these four genes organizes multimodular PKS-I gene clusters, their metabolites are not likely to be structurally complex and the strain will not produce typical polyketide compounds, such as macrolides and polyethers. This may be one reason why type-I polyketide compounds are not frequently produced by Nocardia. The biosynthetic potential of actinomycetes is often approximated through the analysis of genes involved in synthesis of secondary metabolites, such as PKS-Is, type-II PKSs and NRPSs.12, 13, 14, 15 However, the diversity and novelty of these genes in Nocardia species other than N. farcinica have not been studied. There is no information regarding the relationship between taxonomy and gene distribution. In our study, we focused on PKS-I genes and investigated their relationship with Nocardia taxonomy. This allowed us to assess their ability to produce various type-I polyketide compounds.

The strains evaluated included 24 Nocardia strains belonging to 22 species (Figure 1). Genomic DNA was prepared using a DNeasy Blood & Tissue Kit (QIAGEN Sciences, Germantown, MD, USA). Ketosynthase (KS) domain DNA regions in PKS-I genes were amplified as reported previously.15 The polymerase chain reaction (PCR) products were cloned using a TOPO TA Cloning Kit for Sequencing (Invitrogen, Carlsbad, CA, USA) and transformed into ECOS competent Escherichia coli DH5α (Nippon Gene, Toyama, Japan). Approximately 10 recombinant E. coli clones were picked for each PCR sample and the inserted DNA segments were amplified directly from the cells by colony PCR with M13 forward (5′-GTAAAACGACGGCCAG-3′) and M13 reverse (5′-CAGGAAACAGCTATGAC-3′) primers. The amplified inserts were purified using the Agencourt AMPure (Beckman Coulter Genomics, Danvers, MA, USA). DNA sequencing of the inserts was performed with a BigDye Terminator v3.1 Cycle Sequencing Kit (Applied Biosystems Japan Ltd., Tokyo, Japan), M13 primers and a 3130xl Genetic Analyzer (Applied Biosystems Japan Ltd.). The obtained sequences were queried with the BLASTX software program (http://blast.ncbi.nlm.nih.gov/Blast.cgi). The deduced amino-acid sequences were aligned, and a phylogenetic tree was constructed using Mega416 and the neighbor-joining method.

Figure 1
figure 1

A phylogenetic tree of the test strains based on almost complete 16S rDNA sequences. Bootstrap values above 50% from 1000 re-samplings are shown at the nodes. Strains shaded in gray possess universal PKS-I genes. The origins of the isolates are shown in parentheses. The origins of N. nova, N. asteroides and N. brasiliensis were not described by Tsukamura,21 Blanchard22 or Pinoy.23 aNumbers of non-redundant KS sequences obtained. The sequences shaded in Figure 2 and those in the others were counted. -, not detected. bVulcanized natural rubber pipe rings suspended in partially treated water from the River Thames.17

PKS-I genes were successfully amplified from all test strains. Because PKS-Is are large multimodular enzymes, and even a single strain generally possesses several PKS-I genes in its genome, the amplicon from each strain contained several types of KS sequences. We cloned the PCR products and determined 225 DNA sequences (approximately 1100 bp each) from the 24 strains. After removing duplicate sequences, 107 non-redundant sequences were subjected to a BLAST search. All sequences showed low similarities (<78%) to PKS-I genes reported to be involved in the synthesis of known polyketide compounds and/or revealed by whole genome sequencing projects, although 50 sequences among them showed over 80% identity to Nocardia PKS-I genes whose products are yet to be identified (Supplementary Table 1). To determine the diversity of the 107 sequences, the deduced amino-acid sequences were compared using a phylogenetic tree (Figure 2). Strains identified as the same species, such as N. nova NBRC 15556T and N. nova NBRC 103080 or N. takedensis NBRC 100417T and N. takedensis NBRC 100418, contained almost identical PKS-I gene pairs. N. fluminea, N. salmonicida, N. soli and N. cummidelens, known as the N. salmonicida clade members,17 shared some common PKS-I genes. Likewise, phylogenetically close species such as N. cerradoensis and ‘N. violaceofusca’ also shared some common PKS-I genes. The 65 sequences from 14 strains, listed in Figure 1, could be classified into two groups as shown in Figure 2. The 65 sequences showed high sequence similarities (average 87% identity) to already published Nocardia KS sequences, suggesting that they are widely distributed among Nocardia spp. As different strains shared the KS sequences clustered in the same phylogenetic clade, it is suggested that they also shared the same type of PKS-I gene cluster involved in the synthesis of analogous metabolites.18 Therefore, we speculate that the polyketide compounds corresponding to the genes grouped in each clade shown in Figure 2 are analogous metabolites. To analyze the relationship between taxonomy and distribution of these PKS-I genes, a phylogenetic tree based on the 16S rDNA sequences was constructed (Figure 1). The PKS genes grouped in Figure 2 were widely observed in the species belonging to certain clades. In contrast, strains lacking such universal PKS-I genes often possessed fairly diverse KS sequences. Among the 42 sequences that were not common, 36 were found in the strains lacking universal PKS genes (Figure 1). The KS sequences derived from N. brasiliensis, N. terpenica and N. takedensis were not observed in other species (Figure 2), suggesting that their PKS-I genes were unique. The species harboring such unique PKS genes were mainly located near N. brasiliensis and N. salmonicida in the 16S rDNA phylogenetic tree. N. terpenica and N. jiangxiensis were phylogenetically distant from these; however, the bootstrap values were less than 50% and the topology was unstable. To interpret the relationship between PKS-I content and the taxonomy of the strains more precisely, reliable and high-resolution methods such as multilocus sequence typing should be carried out.

Figure 2
figure 2

A phylogenetic tree of the KS domain (approximately 366 amino acids) sequences in PKS-I genes of Nocardia species. Bootstrap values above 50% from 1000 re-samplings are shown at the nodes. PKS-I genes (gray) were universal in the 13 species highlighted in Figure 1. PKS-I genes in N. farcinica IFM 10152 were used as an outgroup because they are in a mixed PKS/NRPS gene cluster or in non-multimodular clusters whose KS domains are phylogenetically distinct.19, 20 *Indicates different strains identified as the same species that shared almost identical genes. **Indicates that phylogenetically close species shared similar genes.

Recent genetic analyses of biosynthesis pathways for secondary metabolites have demonstrated their crucial roles in the discovery of novel natural products. Ayuso-Sacido and Genilloud13 examined the distribution of PKS-I genes in Nocardia strains using a PCR assay, and found that 69% (11/16) of the strains were positive for the presence of PKS-I genes. This was comparable to that in Streptomyces (79%, 26/33); however, sequence diversity in Nocardia has not been thoroughly examined.12 We demonstrated that all the Nocardia strains tested in this study contained PKS-I genes and the diversity varied among taxonomic species. The 13 species possessed mainly universal PKS-I genes. Although the KS sequences were grouped into two clades, as shown by the phylogenetic analysis, the relationship between the two groups is still unclear from this study. We are sequencing the full-length PKS gene cluster(s) in some strains to clarify this relationship. In contrast, the universal PKS-I genes were not detected in the other nine species shown in Figure 1. The sequences of KS domains in these nine species were different (<70% identity) from the universal sequences and each other. Therefore, these species will be useful in screening sources for discovering novel types of polyketide compounds. As mentioned previously, N. brasiliensis and N. terpenica are known to be a useful source of various bioactive compounds. In this study, we revealed that some species, including N. brasiliensis and N. terpenica, had unique and diverse PKS-I genes. In contrast, reports of type-I polyketide products from other species are limited.2, 8, 9 This might be due to the poor PKS-I diversity in some species, in which universal PKS-I genes are dominant. It would be valuable to extensively analyze PKS-I genes in Nocardia strains and determine any relationship with their taxonomy.

Nucleotide accession numbers

Accession numbers of the DNA sequences in Supplementary Table 1 are AB606044−AB606150.