Comparative genomics study reveals Red Sea Bacillus with characteristics associated with potential microbial cell factories (MCFs)

Othoum, G.; Prigent, S.; Derouiche, A.; Shi, L.; Bokhari, A.; Alamoudi, S.; Bougouffa, S.; Gao, X.; Hoehndorf, R.; Arold, S. T.; Gojobori, T.; Hirt, H.; Lafi, F. F.; Nielsen, J.; Bajic, V. B.; Mijakovic, I.; Essack, M.

doi:10.1038/s41598-019-55726-2

Download PDF

Article
Open access
Published: 17 December 2019

Comparative genomics study reveals Red Sea Bacillus with characteristics associated with potential microbial cell factories (MCFs)

Scientific Reports volume 9, Article number: 19254 (2019) Cite this article

2112 Accesses
7 Citations
30 Altmetric
Metrics details

Subjects

Abstract

Recent advancements in the use of microbial cells for scalable production of industrial enzymes encourage exploring new environments for efficient microbial cell factories (MCFs). Here, through a comparison study, ten newly sequenced Bacillus species, isolated from the Rabigh Harbor Lagoon on the Red Sea shoreline, were evaluated for their potential use as MCFs. Phylogenetic analysis of 40 representative genomes with phylogenetic relevance, including the ten Red Sea species, showed that the Red Sea species come from several colonization events and are not the result of a single colonization followed by speciation. Moreover, clustering reactions in reconstruct metabolic networks of these Bacillus species revealed that three metabolic clades do not fit the phylogenetic tree, a sign of convergent evolution of the metabolism of these species in response to special environmental adaptation. We further showed Red Sea strains Bacillus paralicheniformis (Bac48) and B. halosaccharovorans (Bac94) had twice as much secreted proteins than the model strain B. subtilis 168. Also, Bac94 was enriched with genes associated with the Tat and Sec protein secretion system and Bac48 has a hybrid PKS/NRPS cluster that is part of a horizontally transferred genomic region. These properties collectively hint towards the potential use of Red Sea Bacillus as efficient protein secreting microbial hosts, and that this characteristic of these strains may be a consequence of the unique ecological features of the isolation environment.

Cultivation and functional characterization of 79 planctomycetes uncovers their unique biology

Article 18 November 2019

Phylogenomics, phenotypic, and functional traits of five novel (Earth-derived) bacterial species isolated from the International Space Station and their prevalence in metagenomes

Article Open access 06 November 2023

Three marine species of the genus Fulvivirga, rich sources of carbohydrate-active enzymes degrading alginate, chitin, laminarin, starch, and xylan

Article Open access 18 April 2023

Introduction

Bacillus species are ubiquitous Gram-positive bacteria known for their ability to survive in a wide variety of environments, including marine environments such as seawater¹, tidal flat^2,3,4, and sediments^5,6,7,8,9,10; soil environments such as rhizospheres^{11,12,13,14,15}; human gut samples^16,17,18; as well as food samples such as dairy products^19,20 and fermented soybeans¹⁹. For several Bacillus strains from such diverse environments, in silico methods^21,22 and in vitro experiments^{23,24,25,26,27,28,29,30} have shown strong biosynthetic and superb protein secreting capabilities. This observation motivated using certain Bacillus as industrial producers for an array of pharmacologically and industrially relevant compounds including biosurfactants^31,32, antimicrobials^33,34,35, hydrolysis and deproteinization enzymes^36,37,38, and livestock probiotics. The premise that species from different environments have unique biosynthetic capabilities is supported by a study that surveyed secondary metabolism gene clusters (SMGCs) in Bacillus genomes and identified classes of SMGCs (lipopeptides and polyketides) that are only present in specific Bacillus species²². As an example, mining the genomes of the rhizosphere-dwelling B. amyloliquefaciens has revealed that it produces a number of antimicrobial and antitumor agents^{39,40,41,42,43,44,45,46}. Additionally, the analysis of the phenotypic and genomic properties of B. thuringiensis resulted in the identification of cry genes and other gene clusters encoding for proteins with toxic phenotypes. This discovery promoted the use of B. thuringiensis for insecticidal applications^{23,27,47,48,49,50,51,52,53} which are now representing more than 70% of all Bacillus-based commercially used biocontrol agents⁵⁴.

The genus also harbors a number of strains with extensively characterized metabolism, most notably the model organism B. subtilis 168. This detailed knowledge enabled the use of metabolic and genetic modification methods to transform these strains into efficient microbial cell factories (MCFs)⁵⁵. Additionally, advancements in next-generation sequencing technologies along with the development of computational genome-annotation tools have facilitated mining whole genomes for functional genes and gene clusters that account for unique metabolic and biosynthetic capabilities^{56,57,58,59,60,61}. These genome-mining approaches have become important tools in the quest for new proteins and metabolites^62,63, and in the increasingly successful efforts to use microbes as cost-efficient agents for industrial production^64,65,66. In addition to genome mining, another way of identifying functional differences between taxonomically close strains is through reconstructed genome-scale metabolic networks. This approach has been facilitated by the establishment of large biochemical pathway databases that provide a comprehensive view of the metabolic functions that differ between organisms⁶⁷. For instance, functional genome-scale metabolic models and flux balance analysis successfully identified metabolic functions related to the virulent phenotype of Staphylococcus aureus strains in more than 300 different growth-supporting environments⁴².

Our metagenomic-based research initiative reported a rich repertoire of nonribosomal peptide synthetase (NRPS) and polyketide synthase (PKS) sequences (often associated with the synthesis of antimicrobial compounds) derived from strains belonging to the phylum Firmicutes in Red Sea-associated mangrove samples⁶⁸. The ecological uniqueness of this environment, especially its high salinity and temperature, marks its microbial genomic repertoire as putatively attractive for the discovery of unique metabolic and biosynthetic capabilities. Here, we are ranking ten recently sequenced Bacillus strains isolated from microbial mat, mangrove mud or barren soil samples taken from Rabigh Harbor Lagoon on the Red Sea shoreline, for their use as MCF platforms for protein and/or metabolite production. We focus on comparing over- and under- represented metabolic reactions in a dataset containing 32 Bacillus genomes (22 reference Bacillus genomes and the ten Red Sea strain genomes). We also predict and catalog secondary metabolic gene clusters and analyze their homology and co-localization patterns. Finally, in order to further assess the biotechnological potential of these strains, we compared their capacity for protein secretion and sporulation, ranking them against levels detected in the reference strain Bacillus subtilis 168. Our results suggest that specific modules of secondary metabolism have evolved in the Red Sea Bacillus due to environmental adaptation, and that several of the isolated strains represent promising platforms for development of MCFs.

Results and Discussion

Bacillus species present in the Red Sea originate from several colonization events

Sequencing the genomes of the ten Red Sea isolates yielded an average of 128,522 reads with a mean length of 9,527 base pairs (bp) and average coverage of 272 (lowest 112 and highest 344). All assemblies are single circular chromosomes without plasmids (except for B. foraminis (Bac44), V. halodenitrificans (Bac324), Virgibacillus sp. (Bac332) and B. amyloliquefaciens (Bac57)), with, on average, genome size of 4,536 Kb, 4,483 predicted open reading frames (ORFs), and 29.6 rRNAs (Table 1). Assembly statistics of the raw reads are presented in Supplementary Table S1.

Table 1 Summary of sequencing features of the ten Red Sea genomes.

Full size table

To assess the phylogenetic positions of the ten newly sequenced Red Sea species, we ran a whole-genome phylogenetic analysis of those ten species, along with 22 reference strains from⁶⁹, and eight Bacillus that were found to be closely related to the Red Sea species based on 16S phylogenetic analysis. In total, there are 40 genomes including two outgroups considered in the analysis. The whole-genome phylogeny tree, built using 188 single-copy genes, confirmed that most of the Red Sea species are phylogenetically located within Bacillus species (Fig. 1).

Furthermore, since species that underwent single colonization events exhibit consistent patterns in the clades derived from both the phylogenetic tree and their metabolic networks, we further reconciled this phylogenetic tree with the metabolic clades obtained from metabolic reconstructions (see section entitled Convergent evolution of metabolic networks in Bacillus species). This allowed us to hypothesis that since the metabolic clades show different enrichment patterns of distinguishable metabolic pathways, the Bacillus species present in the Red Sea have likely gone through several colonization events.

This analysis further showed some species are closely related to other well-studied Bacillus species. For example, B. vallismortis (Bac111) is closely related to B. subtilis 168, whereas B. paralicheniformis (Bac84) and B. paralicheniformis (Bac48) are closely related to B. licheniformis ATCC 14580. Interestingly, B. subtilis 168 and B. licheniformis ATCC 14580 both have a soil habitat, while the newly sequenced species live in the sea environment. The close relationship between their genomes suggests that some aspects of the metabolism of the newly sequenced species have occurred that enable them to colonize this specific environment. Thus, the B. paralicheniformis (Bac84) and B. paralicheniformis (Bac48) strains were also used in a separate study focused on the differences in the closely-related licheniformis and paralicheniformis strains based on predicted biosynthetic capabilities⁷⁰.

Several Red Sea Bacillus species exhibit an above average percentage of their genomes’ genes overlapping with genomic islands

The ten genomes collectively have a total of 2,737,911 bp of DNA sequence overlapping with predicted genomic islands (GIs) regions, with an average length of 273,791 bp per genome (minimum length of 172,668 bp, and a maximum length of 434,257 bp). These GIs are important indicators regarding the discovery of new products as they have been shown to harbor traits that allow strains to adapt to the ecological niche and are enriched with genes associated with SMGCs^71,72. Therefore, we further calculated and ranked the percentage of the genomes’ genes that are located within predicted GIs for our ten Red Sea strains along with 18 publicly-available Bacillus species (Supplementary Table S2). Using the number of DNA sequence base pairs in GIs relative the size of each genome, we found eight of the ten Red Sea strains ranked in the third quartile (5.05–6.69%) and fourth quartile (6.89–19.48%). Also, considering the average of the ‘percentage of the genomes’ genes that are located within predicted GIs’ for all 28 strains is 5.6%, we find five of the Red Sea isolates to have an above average percentage of their genomes overlapping with GIs.

Next, we further investigated what functional properties these GIs provide to the Red Sea strains. The analysis showed that in total 292 genes that are part of GIs overlap with identified biosynthetic genes. Interestingly, four of the Red Sea isolates have horizontally transferred genes falling in NRPS clusters and hybrid PKS/NRPS clusters (Table 2). Specifically, modular genes that are structurally critical for the assembly-line machineries of hybrid PKS/NRPS clusters are part of horizontally transferred genomic regions, indicating the important role these putative GIs play in the synthesis of bioactive compounds. For instance, the NRPS/PKS cluster in V. dokdonensis (Bac330) has several structural genes that are predicted to be horizontally transferred, including genes encoding for methyltransferases, peptide synthases, malonyl CoA-acyl carrier protein transacylases and polyketide synthases. All the isolates (i.e., B. paralicheniformis (Bac48), B. amyloliquefaciens (Bac57), V. dokdonensis (Bac330), and Virgibacillus sp. (Bac332)) that have modular clusters overlapping with a predicted GI are from mangrove mud samples and not from microbial mat ones. None of the overlapping clusters, except for the cluster for the synthesis of Bacillaene in B. amyloliquefaciens (Bac57), has any assigned product using available databases of known SMGCs. Eight of the Red Sea isolates (B. valismortis (Bac111), B. paralicheniformis (Bac84), B. amyloliquefaciens (Bac57), B. paralicheniformis (Bac48), B. marisflavi (Bac144), V. halodenitrificans (Bac324), Virgibacillus sp. (Bac332), B. foraminis Bac44) have 13 genes (gene IDs: NP_390764.1, NP_390616.1, NP_388001.1, NP_389359.1, NP_390160.1, NP_389983.1, NP_391547.1, NP_391794.1, NP_388483.2, NP_389974.1, NP_388373.1, NP_388370.1, NP_389162.1) previously reported to be differentially expressed under salt stress conditions during spore outgrowth⁷³. This does not mean the other two strains do not have mechanism or SMGCs that allow them to thrive in a salt stressed environment. Specifically, Bac330 is placed into a clade that is statistically enriched with reactions for the biosynthesis of ectoine, a known osmoprotectant. Also, Bac94 has the highest number of unique BGCs, that are not shared with any species, most of which are not assigned to known products and may be contributing to the strains’ salt tolerance. Collectively, these results show that the identified horizontally transferred regions with functionally relevant genes, such as the ones involved in salt tolerance and bioactive compounds, are putative indicators of the environment-specific increased fitness of these Red Sea isolates.

Table 2 Predicted genomic islands and prophage regions in the Red Sea genomes showing their overlap with biosynthetic genes

Full size table

Several Red Sea Bacillus strains exhibit potential as protein-secreting cell factories

Bacillus species are extensively used for production of industrial enzymes. In order to qualify for such applications, a candidate strain would ideally be non-sporulating and have efficient protein secretion machinery. We provide a preliminary bioinformatics analysis of the potential sporulation of the species as sporulation is a complex developmental process, involving several hundred genes^69,74. The caveat here is that the presence of a high number of sporulation genes merely indicates that the species has sporulation potential, it does not necessarily correlate to the number of spores, as expression of these genes are not taken into account. Thus, we further experimentally evaluated the Red Sea species along these lines, by comparing them to the standard laboratory strain B. subtilis 168. Three of the species (B. foraminis Bac44, B. marisflavi (Bac144), and V. halodenitrificans (Bac324)) did not grow on standard laboratory media, thus Fig. 2A (and Fig. 2B) show only results for seven Red Sea strains.

Figure 2A show no correlation between the number of genes involved in sporulation and the number of spores produced by the different species. However, it does show the higher the number of sporulation genes present in a species, the higher the chance that key sporulation genes are not missing, as is evident in the Virgibacillus strains, and sporulation potential is more likely. That is, Virgibacillus strains Bac332 and Bac330 exhibited the least number of sporulation genes, and were completely incapable of sporulating, under the conditions tested. This could be related to the fact that in both species, despite the presence of many sporulation genes, the gene spo0B is missing. This gene is a key element in the phosphorelay regulating sporulation initiation⁷⁵. Its absence has already been reported to arrest sporulation in phase 0⁷⁵. No clear explanation has been found concerning the extremely low level of sporulation of B. paralicheniformis (Bac84) under the conditions tested. Further studies will be needed to explain this phenomenon. Globally, it is interesting that none of the Red Sea species sporulate more than B. subtilis 168 under the conditions used.

Protein secretion in Bacillus species is mainly controlled by two systems: the twin arginine translocation (Tat) and general secretion (Sec) systems, which transport folded and unfolded proteins, respectively, across the membrane. In B. subtilis, both systems involve only a small number of genes: tatAd, tatAy, tatAc, tatCd and tatCy; and secA, secY, secE, secG and secDF. Comparing the genes present in the Red Sea species and the actual protein secretion (Fig. 2B), no clear correlation has been found between the presence of genes from the Tat and Sec systems, and the actual protein secretion capabilities. It is nevertheless worth noting that two of the species from the Red Sea strains secrete twice as much protein as B. subtilis 168 when they are grown in the LB medium. One of these species, B. halosaccharovorans (Bac94) possesses complete Tat and Sec systems. These results serve as positive features for the selection of B. halosaccharovorans (Bac94) and B. paralicheniformis (Bac48) as MCFs optimized for enzyme production, as well as for further study protein of secretion pathways.

Convergent evolution of metabolic networks in Bacillus species

To better characterize the metabolism of the 32-studied species, and to discover eventual metabolic features specific to the ten Red Sea species, we reconstructed all their metabolic networks. These metabolic networks contained from 671 to 1,398 reactions, with an average of 1,238 reactions (median: 1,291 reactions) and contained from 1,050 to 1,897 different metabolites involved in these reactions, with an average of 1,667 metabolites (median: 1,715 metabolites). It is interesting to note that the three networks containing the most metabolic reactions correspond to the Red Sea species. Globally, the Red Sea species possess on average 57 more reactions than the other species (1,277 vs. 1,220 reactions) and on average 75 more metabolites (1,719 vs. 1,644 metabolites). The reactions are inferred based on metabolic genes. We used between 477 and 1,042 genes per species, 903 on average (median: 941).

These numbers could be compared to the most complete existing metabolic model of B. subtilis 168, iBsu1103⁷⁶. This metabolic model contains 1,437 reactions associated with 1,103 genes. For comparison, our reconstruction of the metabolic network of B. subtilis 168 contains 1,366 reactions associated with 1,006 genes. The numbers are similar and the small difference could be attributed to the absence of gap-filling and the fact that, in our case, the manual curation effort was less intense compared to the one performed for iBsu1103.

To find signatures of metabolism in Bacillus species, we compared the reactions present in the different metabolic networks. Clustering the genomes, represented in Fig. 3, has been made based on the presence/absence of reactions. Based on this clustering, several “metabolic clades” appeared clearly. We decided to divide the 32 species into seven different metabolic clades based on the species dendrogram obtained. Based solely on the dendrogram, the clades 5 and 6 may have been merged, but we decided to split them to understand the meaning of the group of reactions predominantly present in the clade 6. B. coahuilensis m4–4 was not included in the analysis because of the low quality of its genome, and hence the expected low quality of its metabolic network. B. pumilus SAFR-032 was also not included in this analysis since it appeared to be too distant to all the other clades and the subsequent statistical analysis would not be relevant.

We first compared the obtained dendrogram and the previous phylogenetic analysis. Some of the metabolic clades are conserved compared to the phylogenetic clustering (clades 1, 4, 5 and 6). On the other hand, metabolic clades 2, 3 and 7 do not correspond to a phylogenetic clustering. This could be the sign of a convergent evolution of the metabolism of these species, which could have evolved to adapt to special environmental conditions. To facilitate the composition of the metabolic networks, a binomial test was performed on each reaction to find if a given reaction is statistically over-represented or under-represented in a given clade (Supplementary Dataset S1). The obtained reactions have been grouped in pathways as defined in the MetaCyc database.

From this analysis, several hypotheses can be drawn about the metabolism of the analyzed species. For example, the metabolic clade 4 possesses statistically more reactions involved into ectoine biosynthesis. Ectoine is already known for their osmoprotectant power^77,78, suggesting that Virgibacillus strains Bac332, Bac330, Bac324, sp. SK37 and Oceanobacillus iheyensis HTE831 are highly tolerant against osmotic stress. The same species have more reactions involved in glucuronoarabinoxylan degradation compared to others, which can be linked to utilization of plant cell wall polysaccharides. Metabolic clades 5 and 6 seems also prone to degradation of plant cell walls with an over-representation of reactions involved into rhamnogalacturonan type I degradation and D-galacturonate degradation. This assumption is validated with the presence of B. licheniformis, B. subtilis and B. amyloliquefaciens in those clades, that are all known to be associated with plant and plant material in nature^39,79. These results are also consistent with a previous study⁶⁹ where the evolutionary and functional relationships between twenty complete and draft Bacillus genomes are compared and found that most of the metabolic variation between the strains was stemming from genes related to functions necessary for adapting to the environments from which they were isolated.

In metabolic clades 2, 3, and 7, the most represented metabolic pathways were carbon metabolism, amino acid metabolism, and nucleotide metabolism. In clade 2 we also find the dimethylsulfide (DMS) degradation pathway that is a significant contributor of sulfur. Degradation of dimethylsulfoniopropanoate (DMSP) osmolytes in marine, estuarine, and salt marsh systems are the primary source of DMS. DMSP was shown to be degraded by several bacterial types from salt marsh sediments^80,81. Also, metabolic clades 2 and 7 possess a few antibiotic biosynthesis pathways e.g., Kanamycin biosynthesis in clade 2, and Paromomycin and Mithramycin biosynthesis in clade 7⁸². There are also some interesting reactions involved in detoxification, such as furfural degradation in clade 2⁸³, and a reaction related to mercury detoxification pathways in clade 3⁸⁴. These pathways may be providing a selective advantage for these Bacillus species to thrive in the Red Sea environment.

Moreover, we also investigated if any reactions that are commonly shared by the 10 Bacillus species are associated with adaptation to the local environment. We found no reactions shared by all ten Red Sea Bacillus species while being absent from other Bacillus species. Figure 3 depicts this result as well, where the 10 Bacillus species do not cluster together based on their reactions. If we choose a looser threshold to select reactions that are enriched in at least 75% of the networks of Red Sea species and are absent in at least 90% of the other networks, we filter down the number of reactions to 17 (Supplementary Table S3). The low number of reactions hinders the inference of possible functional pathways these reactions might be part of. Manual inspection of the functions of these reactions revealed that one of these reactions gives relevance to pectate lyase, which might be related to the prevalence of mangrove trees in the proximate vicinity of the isolation site. However, we would like to note that the exclusive presence of these reactions in most of the Red species, is not necessarily a discriminative feature of the uniqueness of these species. It is indeed the collective analysis of the distribution of reactions and corresponding pathways, such as we did in Fig. 3, that succeeds in drawing the lines between all the analyzed Bacillus species in general, including the Red Sea Bacillus isolates.

The potential secondary metabolism of the Red Sea strains suggests the production of novel active molecules

We noted a big diversity in the number of SMGCs predicted in different Bacillus species, ranging from 6 to 23 gene clusters per species (Fig. 4). On average there are 13 gene clusters per species (the median is 15). For example, Virgibacillus strains Bac330 and Bac332 are very close from a phylogenetic and metabolic point of view, but the first has almost twice the number of gene clusters compared to the second. The species containing the most SMGCs is the Red Sea strain B. amyloliquefaciens (Bac57) which has ten more gene clusters than the average.

To compare more precisely the secondary metabolism of different species, we looked for gene clusters that are shared between several species. An annotation of known gene clusters has also been performed. In total, 417 gene clusters are present in the 32-studied species. Since the same gene cluster can be present in different species, we can distinguish in total 200 different gene clusters: 55 are shared between at least two species, while 145 are unique for a given species. 21 of those 200 gene clusters are known, while 179 are still unknown, highlighting a huge potential of new discoveries in future. In particular, 54 unknown gene clusters are only present in the genomes of the Red Sea species (Supplementary Fig. S1).

Since complete genomes are available for most of the studied species (29 out of 32), we next studied the localization of gene clusters in different species. Studying the localization of the SMGCs in the genomes could serve as a proxy to study the evolution of these gene clusters in the Bacillus species. Specifically, the colocalization of the same SMGCs on random genomic positions, as these SMGCs are reported to be putative indicators of horizontal gene transfer or chromosomal rearrangement in close species^85,86. We found that most of the gene clusters shared by at least 2 species co-localize in their genomes, implying that these gene clusters appeared in one of their common ancestors (Fig. 5). Among the 54 clusters of gene clusters shared by at least two species having their complete genomes, 38 are strictly co-localized, four have genes that co-localize in at least two species but are also placed somewhere else in other genomes, and 12 are randomly appearing in all the genomes. Interestingly, while the SMGCs present in clusters No. 49 and 51 perfectly co-localize in the genomes of B. amyloliquefaciens DSM7 and B. amyloliquefaciens (Bac57), the SMGCs present in clusters No. 48 and No. 50, also shared by both species, are not co-localized in the genome with respect to their normalized positions on the chromosome. Cluster No. 37, assigned to the production of the antibiotic Bacillaene is present and co-localizes in the genomes of B. amyloliquefaciens (Bac57), B. amyloliquefaciens DSM7 and B. subtilis subsp subtilis str.168, while it is not present in the genome of B. vallismortis (Bac111) suggesting a loss of this cluster in that species during evolution. Conversely, the SMGC No. 21 is present in B. marisflavi (Bac144) and B. pumilus SAFR 32 and not co-localized, suggesting an independent apparition of this gene cluster in these species. In the same manner, cluster No. 3 is present at the same position in genomes from the B. cereus group suggesting an apparition in the common ancestor of these species, while this cluster is also present in B. clausii KSM K16 at another position suggesting an independent apparition in this species. The exact same mechanism seems to exist in the B. subtilis group and B. foraminis (Bac44), with the SMGC present in the cluster No. 2. The raw results of this analysis are presented in Supplementary Table S4.

Concerning the Red Sea species, they possess in total 95 different SMGCs. Among them, 38 are shared by at least one other species, while 57 are unique. We have been able to assign a product to ten of the shared gene cluster (29% of them) and five of the unique ones (9%). It is worth noting that only three SMGCs present in more than one species are shared solely among Red Sea species. Among the 15 SMGCs from the Red Sea species, we can see an enrichment of SMGCs involved in the production of antibiotics and lantibiotics, representing 66% of the identified SMGCs. The three identified SMGCs shared by the biggest number of species, including Red Sea Species, concern protections against environmental changes. For example, we can identify the genes involved in production of Bacillibactin, a siderophore⁸⁷, teichuronic acid, a part of the cell wall produced in low-phosphate conditions⁸⁸, or ectoine, which serves as protection against osmotic stress.

Among the identified SMGCs, those that are less spread among the studied species mostly correspond to antibiotics, lantibiotics and antifungals. We can, for example, cite the identification of gene clusters involved in production of an antibiotic bacitracin, a class II lantibiotic pseudomycoicidin, or an antifungal fengycin⁸⁹ shared by four out of the ten Red Sea species.

This highlights once again a potential of new secondary metabolites discoveries, especially among molecules that would be produced by gene clusters uniquely present in these species. For example, it is worth noting that the species B. foraminis (Bac44), B. halosaccharovorans (Bac94) and B. marisflavi (Bac144), while possessing between 12 and 15 SMGCs, share only two of these gene clusters with other species. Experimental metabolite profiling is needed to address these hypotheses and truly check the different species capacity for secondary metabolite synthesis.

Concluding Remarks

Identifying environment-specific phenotypes in newly isolated species is one way to hunt for potentially efficient MCFs. Here, through comparative analysis, we first established the existence of notable indicative patterns of convergent evolution in the genomes of 10 sequenced Red Sea strains and other representative Bacillus species. Further experimental evaluation of key properties for optimal microbial platforms revealed that some of the Red Sea strains are efficient protein secretors surpassing the model strain B. subtilis 168. From a bioinformatics perspective, we also evaluated if the presence of a complete set of genes represents a more positively-correlated phenotype compared to those without, given that the presence of these genes is requirements for their potential activation and expression within organisms. The bioinformatic analysis confirmed the genomic enrichment of the genomes with secretion system-related and sporulation-related genes, do not necessarily add more evidence to the organism’s potential protein secretion capacity. By implementing a ranking system that leverages the available completely sequenced genomes, we evaluated the production potential of the isolated strains, taking into account critical driving features, such as their overall metabolic similarity, the distribution of biosynthetic genes and the prevalence of horizontally transferred genes of biosynthetic relevance. The comparative analysis primarily pointed towards B. paralicheniformis (Bac48) and B. halosaccharovorans (Bac94) as the top candidates for further probing and optimization for the development of efficient MCFs.

Material and Methods

Sampling, isolation and purification of bacterial strains

The sampling, isolation and purification of Red Sea mangrove strains were previously described in⁹⁰. Strains were isolated from samples collected from the Rabigh Harbor Lagoon by the Red Sea in Saudi Arabia (39°0′35.762′′E, 22°45′5.582′′ N). Bac44, Bac48, Bac57, Bac324, Bac330 and Bac332 were isolated from samples taken from mangrove mud, while the other samples were isolated from microbial mat (Bac84, Bac94, and Bac111) and barren soil (Bac144) in close proximity.

DNA extraction, sequencing, assembly and annotation

Sigma Gen Elute Bacterial Genomic DNA Kit (Sigma, USA) was used to extract Genomic DNA following the manufacturer’s protocol and the PowerClean Pro Clean-Up Kit (MO BIO, USA) was utilized as another purification step. DNA purification was assessed by overnight gel electrophoresis and NanoDrop (Thermo Fisher Scientific, USA). DNA was quantified using Qubit 2.0 (Life Technologies, Germany). The whole genome sequencing of the ten Red Sea strains was done on the PacBio RS II sequencing platform (Pacific Biosciences, USA) in the Core Laboratory sequencing facility at King Abdullah University of Science and Technology (KAUST). P6-C4 chemistry was used to sequence the large-insert libraries in single-molecule real-time (SMRT) sequencing cells. The assembly of the genomes was carried out using PacBio’s SMRT Analysis pipeline v2.3.0 setting the GenomeSize parameter to 6 Mb. All other parameters were set to default. All assembled genomes were examined for indications of circularization using Gepard⁹¹, then circularized, if applicable, using minimus2⁹². To polish the assembly, SMRT Analysis Resequencing protocol was used in multiple rounds until convergence. The Automatic Annotation of Microbial Genomes pipeline (AAMG)⁹³ was used to annotate the genomes. Finally, genomic islands (GIs) and prophages were predicted using IslandViewer v4⁹⁴ and PHASTER⁹⁵, respectively.

Phylogenetic placement and clustering

The phylogeny analysis was performed based on single-copy genes shared amongst 40 representative strains including the 10 Red Sea ones. The single-copy genes were aligned using MUSCLE v3.8.31⁹⁶ and the alignments were concatenated using FASconCAT-G v1.02⁹⁷. ProtTest3 v.3.4.2⁹⁸ was then used to predict the amino-acid replacement model. Finally a maximum likelihood tree was built using PhyML v3.1.⁹⁹ under the LG + I + G (LG) model as recommended by ProtTest3, with 100 bootstrap replicates.

Identification of protein secretion and sporulation genes

To identify protein secretion and sporulation genes, homologous genes to those present in B. subtilis 168 were identified using BLASTp and the bidirectional-best blast hits (BBH) method. Genes related to sporulation and protein secretion functions were retrieved from SubtiList (http://genolist.pasteur.fr/SubtiList/)^100,101. A gene was considered to be present in the genome if it has a bidirectional hit with its homologue in B. subtilis. The minimum e-value was set to be 10⁻⁵. All top hits had an e-value of 10⁻²⁰ or less. If a gene is absent in a genome, it was given a coverage value of 0.

Metabolic network reconstruction

The metabolic networks were reconstructed using the MetaCyc¹⁰² database. For every reaction in MetaCyc, the sequences of the enzymes catalyzing the reactions were retrieved. In cases where, only one enzyme sequence was available for a given reaction, a blast alignment¹⁰³ was run between this sequence and the predicted proteome of a given species (bit-score threshold of 100). If, however, multiple enzyme sequences were available for a given reaction, an HMM model characterizing those sequences was reconstructed using HMMer¹⁰⁴. This model was then searched against the predicted proteome using the same bit-score threshold of 100.

Metabolic clades were identified based on a manual inspection of the dendrogram associated with the heatmap describing the presence/absence of reactions. To find over- and under-represented reactions per metabolic clade in the heat map, a binomial test was run for each reaction, comparing the number of positive occurrences between the considered clade and the other species. A threshold of 10⁻⁵ was chosen to consider a reaction as over- of under-represented in a given clade.

Secondary metabolic gene clusters prediction and analysis

Predictions of secondary metabolic gene clusters was performed on the public web version of antiSMASH 4.0 (http://antismash.secondarymetabolites.org)¹⁰⁵. The ClusterFinder algorithm was used to predict the borders of biosynthetic gene clusters. A manual curation was performed to define more precisely the boundaries of the gene clusters based on similarities with known gene clusters present in the MIBiG database. Some of the predicted gene clusters have been split in two different gene clusters when sufficient information was available to make such a decision. The secondary metabolite product of a given gene cluster was considered as identified if this gene cluster showed at least 60% of similarities with one characterized gene cluster present in the MIBiG database. MultiGeneBlast¹⁰⁶ was used to infer similarities between all predicted gene clusters in the Bacillus species. Two gene clusters were considered as similar if they show at least 60% of similarities with each other. To identify co-localization of gene clusters, a normalized position was computed for each SMGC based on the length of the chromosome and the position of the middle of the SMGC. This was only computed for the species possessing a complete chromosomal sequence.

Bacterial strains and media

All strains were cultured in Luria-bertani (LB) broth at 37 °C in shaking flask. Minimal media M9¹⁰⁷ was used for secretion essays and the Difco Sporulation Medium (DSM)¹⁰⁸ for sporulation essays. When relevant, kanamycin (25 µg/ml) and erythromycin (1 µg/ml) were added for the Bacillus strains genetic transformation.

Sporulation assay

The overnight LB cultures of the Bacillus strains at 37 °C were diluted to OD600 0.1 in preheated liquid DSM medium¹⁰⁸ and cultured under shaking conditions (200 rpm) at 37 °C to OD600 of two (around 24 hours of culture). 500 µl of the culture was heated at 80 °C for 10 minutes. Serial dilutions from the heated samples were made in 0.9% NaCl (from 10–1 to10–7) and 100 µl of the material from dilution with 10–3 to10–7, plated in LB and incubated for 36 h at 37 °C. After incubation the colonies were counted on plates and used as the number corresponding to heat-resistant spores. Three biological replicates were performed, providing similar results. One representative biological replicate for each Bacillus strain is reported with standard deviation of variables from three technical replicates.

Protein secretion assay

The tested Bacillus strains were grown overnight in culture using LB at 37 °C and 200 rpm shaking speed. 20 ml of minimal media (M9), which was supplemented with 0.4% glucose¹⁰⁷, were inoculated to OD600 of 0.1. At the late exponential phase of growth, 3 Mm of the protease inhibitor Phenylmethylsulfonyl fluoride (PMSF) was added to the cultures to prevent proteolytic digestion and then the cell was grown for 24 h to OD600 of 2. The cultures were centrifuged at (8000 × g) for ten minutes and the supernatants were filtered with 0.2 nm nitrocellulose to eliminate the rest of cells. To precipitate the secreted proteins, 50% (w/v) of ammonium sulphate was added to the supernatants and kept on ice for 1 hour. The mixtures were then centrifuged at 25000 × g for 20 minutes. The precipitated proteins were resuspended in PBS buffers at pH 6.8. The proteins concertation was measured using NanoDrop, and compared with samples of M9 media without bacteria treated with the same procedure for culture and protein precipitation. One representative biological replicate is reported from three technical replicates.

Data availability

All data used in this study have been included in this article and its additional files.

References

Gordon, R. E., Haynes, W. & Pang, C. H.-N. The genus bacillus. US Department of Agriculture handbook, 109–126 (1973).
Ivanova, E. P. et al. Characterization of Bacillus strains of marine origin. Int Microbiol 2, 267–271 (1999).
CAS PubMed Google Scholar
Yoon, J. H., Kim, I. G., Kang, K. H., Oh, T. K. & Park, Y. H. Bacillus marisflavi sp. nov. and Bacillus aquimaris sp. nov., isolated from sea water of a tidal flat of the Yellow Sea in Korea. Int J Syst Evol Microbiol 53, 1297–1303, https://doi.org/10.1099/ijs.0.02365-0 (2003).
Article CAS PubMed Google Scholar
Yoon, J. H. & Oh, T. K. Bacillus litoralis sp. nov., isolated from a tidal flat of the Yellow Sea in Korea. Int J Syst Evol Microbiol 55, 1945–1948, https://doi.org/10.1099/ijs.0.63332-0 (2005).
Article CAS PubMed Google Scholar
Dastager, S. G. et al. Bacillus encimensis sp. nov. isolated from marine sediment. Int J Syst Evol Microbiol 65, 1421–1425, https://doi.org/10.1099/ijs.0.000114 (2015).
Article CAS PubMed Google Scholar
Jiang, Z. et al. Bacillus tianshenii sp. nov., isolated from a marine sediment sample. Int J Syst Evol Microbiol 64, 1998–2002, https://doi.org/10.1099/ijs.0.062224-0 (2014).
Article CAS PubMed Google Scholar
Mawlankar, R., Thorat, M. N., Krishnamurthi, S. & Dastager, S. G. Bacillus cellulasensis sp. nov., isolated from marine sediment. Arch Microbiol 198, 83–89, https://doi.org/10.1007/s00203-015-1155-7 (2016).
Article CAS PubMed Google Scholar
Singh, N. K. et al. Bacillus aequororis sp. nov., isolated from marine sediment. Curr Microbiol 69, 758–762, https://doi.org/10.1007/s00284-014-0654-0 (2014).
Article CAS PubMed Google Scholar
Zhuang, W. Q., Zhuang, W. Q., Maszenan, A. M. & Tay, S. T. Bacillus naphthovorans sp. nov. from oil-contaminated tropical marine sediments and its role in naphthalene biodegradation. Appl Microbiol Biotechnol 58, 547–553 (2002).
Article CAS PubMed Google Scholar
Jia, Z., Jin, W., Huang, Y. & Song, S. Complete Genome Sequence of Bacillus subtilis J-5, a Potential Biocontrol Agent. Genome Announc 5, https://doi.org/10.1128/genomeA.00275-17 (2017).
Pandey, A. & Palni, L. M. Bacillus species: the dominant bacteria of the rhizosphere of established tea bushes. Microbiol Res 152, 359–365, https://doi.org/10.1016/s0944-5013(97)80052-3 (1997).
Article CAS PubMed Google Scholar
Lozano, G. L., Bravo, J. I. & Handelsman, J. Draft Genome Sequence of Pseudomonas koreensis CI12, a Bacillus cereus “Hitchhiker” from the Soybean Rhizosphere. Genome Announc 5, https://doi.org/10.1128/genomeA.00570-17 (2017).
Wang, Y. et al. Complete Genome Sequence of Bacillus paralicheniformis MDJK30, a Plant Growth-Promoting Rhizobacterium with Antifungal Activity. Genome Announc 5, https://doi.org/10.1128/genomeA.00577-17 (2017).
Ma, J. et al. Complete Genome Sequence of Bacillus subtilis GQJK2, a Plant Growth-Promoting Rhizobacterium with Antifungal Activity. Genome Announc 5, https://doi.org/10.1128/genomeA.00467-17 (2017).
Perez-Flores, P. et al. Bacillus methylotrophicus M4-96 isolated from maize (Zea mays) rhizoplane increases growth and auxin content in Arabidopsis thaliana via emission of volatiles. Protoplasma, https://doi.org/10.1007/s00709-017-1109-9 (2017).
Article PubMed Google Scholar
Tidjani Alou, M. et al. Bacillus niameyensis sp. nov., a new bacterial species isolated from human gut. New Microbes and New Infections 8, 61–69, https://doi.org/10.1016/j.nmni.2015.09.011 (2015).
Article CAS PubMed PubMed Central Google Scholar
Tidjiani Alou, M. et al. Bacillus rubiinfantis sp. nov. strain mt2(T), a new bacterial species isolated from human gut. New Microbes New Infect 8, 51–60, https://doi.org/10.1016/j.nmni.2015.09.008 (2015).
Article CAS PubMed PubMed Central Google Scholar
Alou, M. T., Fournier, P. E. & Raoult, D. “Bacillus mediterraneensis”, a new bacterial species isolated from human gut microbiota. New Microbes New Infect 12, 86–87, https://doi.org/10.1016/j.nmni.2016.05.006 (2016).
Article CAS PubMed PubMed Central Google Scholar
Zhang, Y. Z. et al. Bacillus endoradicis sp. nov., an endophytic bacterium isolated from soybean root. Int J Syst Evol Microbiol 62, 359–363, https://doi.org/10.1099/ijs.0.028936-0 (2012).
Article CAS PubMed Google Scholar
Christiansson, A., Naidu, A. S., Nilsson, I., Wadström, T. & Pettersson, H. E. Toxin production by Bacillus cereus dairy isolates in milk at low temperatures. Applied and Environmental Microbiology 55, 2595–2600 (1989).
CAS PubMed PubMed Central Google Scholar
Zhao, X. & Kuipers, O. P. Identification and classification of known and putative antimicrobial compounds produced by a wide variety of Bacillales species. BMC Genomics 17, 882, https://doi.org/10.1186/s12864-016-3224-y (2016).
Article CAS PubMed PubMed Central Google Scholar
Aleti, G., Sessitsch, A. & Brader, G. Genome mining: Prediction of lipopeptides and polyketides from Bacillus and related Firmicutes. Computational and Structural Biotechnology Journal 13, 192–203, https://doi.org/10.1016/j.csbj.2015.03.003 (2015).
Article CAS PubMed PubMed Central Google Scholar
Izumi Willcoxon, M. et al. A high-throughput, in-vitro assay for Bacillus thuringiensis insecticidal proteins. J Biotechnol 217, 72–81, https://doi.org/10.1016/j.jbiotec.2015.10.021 (2016).
Article CAS PubMed Google Scholar
Tjalsma, H. et al. Proteomics of protein secretion by Bacillus subtilis: separating the “secrets” of the secretome. Microbiol Mol Biol Rev 68, 207–233, https://doi.org/10.1128/mmbr.68.2.207-233.2004 (2004).
Article CAS PubMed PubMed Central Google Scholar
Tachibana, K. et al. Secretion of Bacillus subtilis alpha-amylase in the periplasmic space of Escherichia coli. J Gen Microbiol 133, 1775–1782, https://doi.org/10.1099/00221287-133-7-1775 (1987).
Article CAS PubMed Google Scholar
Maresso, A. W., Garufi, G. & Schneewind, O. Bacillus anthracis secretes proteins that mediate heme acquisition from hemoglobin. PLoS Pathog 4, e1000132, https://doi.org/10.1371/journal.ppat.1000132 (2008).
Article CAS PubMed PubMed Central Google Scholar
Donovan, W. P. et al. Discovery and characterization of Sip1A: A novel secreted protein from Bacillus thuringiensis with activity against coleopteran larvae. Appl Microbiol Biotechnol 72, 713–719, https://doi.org/10.1007/s00253-006-0332-7 (2006).
Article CAS PubMed Google Scholar
Song, Y. et al. High-Efficiency Secretion of beta-Mannanase in Bacillus subtilis through Protein Synthesis and Secretion Optimization. J Agric Food Chem 65, 2540–2548, https://doi.org/10.1021/acs.jafc.6b05528 (2017).
Article CAS PubMed Google Scholar
Simonen, M. & Palva, I. Protein secretion in Bacillus species. Microbiological Reviews 57, 109–137 (1993).
CAS PubMed PubMed Central Google Scholar
Deb, P., Talukdar, S. A., Mohsina, K., Sarker, P. K. & Sayem, S. M. A. Production and partial characterization of extracellular amylase enzyme from Bacillus amyloliquefaciens P-001. SpringerPlus 2, 154, https://doi.org/10.1186/2193-1801-2-154 (2013).
Article CAS PubMed PubMed Central Google Scholar
Dhanarajan, G. et al. Biosurfactant-biopolymer driven microbial enhanced oil recovery (MEOR) and its optimization by an ANN-GA hybrid technique. J Biotechnol 256, 46–56, https://doi.org/10.1016/j.jbiotec.2017.05.007 (2017).
Article CAS PubMed Google Scholar
Grangemard, I., Wallach, J., Maget-Dana, R. & Peypoux, F. Lichenysin: a more efficient cation chelator than surfactin. Appl Biochem Biotechnol 90, 199–210 (2001).
Article CAS PubMed Google Scholar
Lin, S. et al. Asymmetric Total Synthesis of Ieodomycin B. Mar Drugs 15, https://doi.org/10.3390/md15010017 (2017).
Article PubMed Central Google Scholar
Ma, Y. et al. Identification of lipopeptides in Bacillus megaterium by two-step ultrafiltration and LC-ESI-MS/MS. AMB Express 6, 79, https://doi.org/10.1186/s13568-016-0252-6 (2016).
Article CAS PubMed PubMed Central Google Scholar
Gowrishankar, S. et al. Cyclic dipeptide cyclo(l-leucyl-l-prolyl) from marine Bacillus amyloliquefaciens mitigates biofilm formation and virulence in Listeria monocytogenes. Pathog Dis 74, ftw017, https://doi.org/10.1093/femspd/ftw017 (2016).
Article CAS PubMed Google Scholar
Chatterjee, J. et al. Production and characterization of thermostable alkaline protease of Bacillus subtilis (ATCC 6633) from optimized solid-state fermentation. Biotechnol Appl Biochem 62, 709–718, https://doi.org/10.1002/bab.1309 (2015).
Article CAS PubMed Google Scholar
Rajkumar, R., Kothilmozhian, J. & Ramasamy, R. Production and characterization of a novel protease from Bacillus sp. RRM1 under solid state fermentation. J Microbiol Biotechnol 21, 627–636 (2011).
PubMed Google Scholar
Maruthiah, T., Somanath, B., Immanuel, G. & Palavesam, A. Deproteinization potential and antioxidant property of haloalkalophilic organic solvent tolerant protease from marine Bacillus sp. APCMST-RS3 using marine shell wastes. Biotechnology Reports 8, 124–132, https://doi.org/10.1016/j.btre.2015.10.009 (2015).
Article PubMed PubMed Central Google Scholar
Arguelles-Arias, A. et al. Bacillus amyloliquefaciens GA1 as a source of potent antibiotics and other secondary metabolites for biocontrol of plant pathogens. Microb Cell Fact 8, 63, https://doi.org/10.1186/1475-2859-8-63 (2009).
Article CAS PubMed PubMed Central Google Scholar
Schneider, K. et al. Macrolactin is the polyketide biosynthesis product of the pks2 cluster of Bacillus amyloliquefaciens FZB42. J Nat Prod 70, 1417–1423, https://doi.org/10.1021/np070070k (2007).
Article CAS PubMed Google Scholar
Chen, X. H. et al. Structural and functional characterization of three polyketide synthase gene clusters in Bacillus amyloliquefaciens FZB 42. J Bacteriol 188, 4024–4036, https://doi.org/10.1128/jb.00052-06 (2006).
Article CAS PubMed PubMed Central Google Scholar
Bosi, E. et al. Comparative genome-scale modelling of Staphylococcus aureus strains identifies strain-specific metabolic capabilities linked to pathogenicity. Proc Natl Acad Sci USA 113, E3801–3809, https://doi.org/10.1073/pnas.1523199113 (2016).
Article CAS PubMed PubMed Central Google Scholar
Sutyak, K. E., Wirawan, R. E., Aroutcheva, A. A. & Chikindas, M. L. Isolation of the Bacillus subtilis antimicrobial peptide subtilosin from the dairy product-derived Bacillus amyloliquefaciens. J Appl Microbiol 104, 1067–1074, https://doi.org/10.1111/j.1365-2672.2007.03626.x (2008).
Article CAS PubMed Google Scholar
Chen, X. H. et al. Genome analysis of Bacillus amyloliquefaciens FZB42 reveals its potential for biocontrol of plant pathogens. J Biotechnol 140, 27–37, https://doi.org/10.1016/j.jbiotec.2008.10.011 (2009).
Article CAS PubMed Google Scholar
Chen, X. H. et al. Comparative analysis of the complete genome sequence of the plant growth-promoting bacterium Bacillus amyloliquefaciens FZB42. Nat Biotechnol 25, 1007–1014, https://doi.org/10.1038/nbt1325 (2007).
Article CAS PubMed Google Scholar
Chen, X. H. et al. Difficidin and bacilysin produced by plant-associated Bacillus amyloliquefaciens are efficient in controlling fire blight disease. J Biotechnol 140, 38–44, https://doi.org/10.1016/j.jbiotec.2008.10.015 (2009).
Article CAS PubMed Google Scholar
Bravo, A., Gill, S. S. & Soberon, M. Mode of action of Bacillus thuringiensis Cry and Cyt toxins and their potential for insect control. Toxicon 49, 423–435, https://doi.org/10.1016/j.toxicon.2006.11.022 (2007).
Article CAS PubMed Google Scholar
Hao, Z. et al. Extraction of antibiotic zwittermicin A from Bacillus thuringiensis by macroporous resin and silica gel column chromatography. Biotechnol Appl Biochem 62, 369–374, https://doi.org/10.1002/bab.1277 (2015).
Article CAS PubMed Google Scholar
Hofte, H. & Whiteley, H. R. Insecticidal crystal proteins of Bacillus thuringiensis. Microbiol Rev 53, 242–255 (1989).
CAS PubMed PubMed Central Google Scholar
Palma, L., Munoz, D., Berry, C., Murillo, J. & Caballero, P. Bacillus thuringiensis toxins: an overview of their biocidal activity. Toxins (Basel) 6, 3296–3325, https://doi.org/10.3390/toxins6123296 (2014).
Article CAS Google Scholar
Schnepf, E. et al. Bacillus thuringiensis and its pesticidal crystal proteins. Microbiol Mol Biol Rev 62, 775–806 (1998).
CAS PubMed PubMed Central Google Scholar
van Frankenhuyzen, K. Insecticidal activity of Bacillus thuringiensis crystal proteins. J Invertebr Pathol 101, 1–16, https://doi.org/10.1016/j.jip.2009.02.009 (2009).
Article CAS PubMed Google Scholar
Roh, J. Y., Choi, J. Y., Li, M. S., Jin, B. R. & Je, Y. H. Bacillus thuringiensis as a specific, safe, and effective tool for insect pest control. J Microbiol Biotechnol 17, 547–559 (2007).
CAS PubMed Google Scholar
Mondol, M. A., Shin, H. J. & Islam, M. T. Diversity of secondary metabolites from marine Bacillus species: chemistry and biological activity. Mar Drugs 11, 2846–2872, https://doi.org/10.3390/md11082846 (2013).
Article CAS PubMed PubMed Central Google Scholar
Dong, H. & Zhang, D. Current development in genetic engineering strategies of Bacillus species. Microbial Cell Factories 13, 63–63, https://doi.org/10.1186/1475-2859-13-63 (2014).
Article CAS PubMed PubMed Central Google Scholar
Penesyan, A., Marshall-Jones, Z., Holmstrom, C., Kjelleberg, S. & Egan, S. Antimicrobial activity observed among cultured marine epiphytic bacteria reflects their potential as a source of new drugs. FEMS Microbiol Ecol 69, 113–124, https://doi.org/10.1111/j.1574-6941.2009.00688.x (2009).
Article CAS PubMed Google Scholar
Yang, Y., Wang, J., Liao, J., Xie, S. & Huang, Y. Abundance and diversity of soil petroleum hydrocarbon-degrading microbial communities in oil exploring areas. Appl Microbiol Biotechnol 99, 1935–1946, https://doi.org/10.1007/s00253-014-6074-z (2015).
Article CAS PubMed Google Scholar
Xue, Y. et al. Methylotrophic yeast Pichia pastoris as a chassis organism for polyketide synthesis via the full citrinin biosynthetic pathway. J Biotechnol 242, 64–72, https://doi.org/10.1016/j.jbiotec.2016.11.031 (2017).
Article CAS PubMed Google Scholar
Holmes, N. A. et al. Genome Analysis of Two Pseudonocardia Phylotypes Associated with Acromyrmex Leafcutter Ants Reveals Their Biosynthetic Potential. Front Microbiol 7, 2073, https://doi.org/10.3389/fmicb.2016.02073 (2016).
Article PubMed PubMed Central Google Scholar
Cao, J., Jiang, M., Li, P. & Chu, Z. Genome-wide identification and evolutionary analyses of the PP2C gene family with their expression profiling in response to multiple stresses in Brachypodium distachyon. BMC Genomics 17, 175, https://doi.org/10.1186/s12864-016-2526-4 (2016).
Article CAS PubMed PubMed Central Google Scholar
Horn, H., Hentschel, U. & Abdelmohsen, U. R. Mining Genomes of Three Marine Sponge-Associated Actinobacterial Isolates for Secondary Metabolism. Genome Announc 3, https://doi.org/10.1128/genomeA.01106-15 (2015).
Wohlleben, W., Mast, Y., Stegmann, E. & Ziemert, N. Antibiotic drug discovery. Microb Biotechnol 9, 541–548, https://doi.org/10.1111/1751-7915.12388 (2016).
Article PubMed PubMed Central Google Scholar
Penesyan, A., Gillings, M. & Paulsen, I. T. Antibiotic discovery: combatting bacterial resistance in cells and in biofilm communities. Molecules 20, 5286–5298, https://doi.org/10.3390/molecules20045286 (2015).
Article CAS PubMed PubMed Central Google Scholar
Erickson, B., Nelson & Winters, P. Perspective on opportunities in industrial biotechnology in renewable chemicals. Biotechnol J 7, 176–185, https://doi.org/10.1002/biot.201100069 (2012).
Article CAS PubMed PubMed Central Google Scholar
Berry, D. A. Engineering organisms for industrial fuel production. Bioeng Bugs 1, 303–308, https://doi.org/10.4161/bbug.1.5.12941 (2010).
Article PubMed PubMed Central Google Scholar
Chen, G. Q. New challenges and opportunities for industrial biotechnology. Microb Cell Fact 11, 111, https://doi.org/10.1186/1475-2859-11-111 (2012).
Article CAS PubMed PubMed Central Google Scholar
Bowerman, N., Tintle, N., Dejongh, M. & Best, A. A. Identification and Analysis of Bacterial Genomic Metabolic Signatures. Pac Symp Biocomput 22, 3–14 (2016).
Google Scholar
Al-Amoudi, S. et al. Metagenomics as a preliminary screen for antimicrobial bioprospecting. Gene 594, 248–258, https://doi.org/10.1016/j.gene.2016.09.021 (2016).
Article CAS PubMed Google Scholar
Alcaraz, L. D. et al. Understanding the evolutionary relationships and major traits of Bacillus through comparative genomics. BMC Genomics 11, 332, https://doi.org/10.1186/1471-2164-11-332 (2010).
Article MathSciNet CAS PubMed PubMed Central Google Scholar
Othoum, G. et al. In silico exploration of Red Sea Bacillus genomes for natural product biosynthetic gene clusters. BMC Genomics 19, 382, https://doi.org/10.1186/s12864-018-4796-5 (2018).
Article CAS PubMed PubMed Central Google Scholar
Penn, K. et al. Genomic islands link secondary metabolism to functional adaptation in marine Actinobacteria. ISME J 3, 1193–1203, https://doi.org/10.1038/ismej.2009.58 (2009).
Article CAS PubMed Google Scholar
Dobrindt, U., Hochhut, B., Hentschel, U. & Hacker, J. Genomic islands in pathogenic and environmental microorganisms. Nat Rev Microbiol 2, 414–424, https://doi.org/10.1038/nrmicro884 (2004).
Article CAS PubMed Google Scholar
Nagler, K. et al. Identification of Differentially Expressed Genes during Bacillus subtilis Spore Outgrowth in High-Salinity Environments Using RNA Sequencing. Front Microbiol 7, 1564, https://doi.org/10.3389/fmicb.2016.01564 (2016).
Article PubMed PubMed Central Google Scholar
Errington, J. Bacillus subtilis sporulation: regulation of gene expression and control of morphogenesis. Microbiological reviews 57, 1–33 (1993).
CAS PubMed PubMed Central Google Scholar
Burbulys, D., Trach, K. A. & Hoch, J. A. Initiation of sporulation in B. subtilis is controlled by a multicomponent phosphorelay. Cell 64, 545–552 (1991).
Article CAS PubMed Google Scholar
Henry, C. S., Zinner, J. F., Cohoon, M. P. & Stevens, R. L. i Bsu1103: a new genome-scale metabolic model of Bacillus subtilis based on SEED annotations. Genome biology 10, R69 (2009).
Article PubMed PubMed Central Google Scholar
Jebbar, M., von Blohn, C. & Bremer, E. Ectoine functions as an osmoprotectant in Bacillus subtilis and is accumulated via the ABC-transport system OpuC. FEMS microbiology letters 154, 325–330 (1997).
Article CAS Google Scholar
Kuhlmann, A. U., Hoffmann, T., Bursy, J., Jebbar, M. & Bremer, E. Ectoine and Hydroxyectoine as Protectants against Osmotic and Cold Stress: Uptake through the SigB-Controlled Betaine-Choline- Carnitine Transporter-Type Carrier EctT from Virgibacillus pantothenticus. Journal of Bacteriology 193, 4699–4708, https://doi.org/10.1128/JB.05270-11 (2011).
Article CAS PubMed PubMed Central Google Scholar
Bouizgarne, B. In Bacteria in agrobiology: disease management 15–47 (Springer, 2013).
Kiene, R. P. & Bates, T. S. Biological removal of dimethyl sulphide from sea water. Nature 345, 702–705 (1990).
Article ADS CAS Google Scholar
De Zwart, J., Sluis, J. & Kuenen, J. G. Competition for Dimethyl Sulfide and Hydrogen Sulfide by Methylophaga sulfidovorans and Thiobacillus thioparus T5 in Continuous Cultures. Appl Environ Microbiol 63, 3318–3322 (1997).
PubMed PubMed Central Google Scholar
Abdelfattah, M. S. & Rohr, J. Premithramycinone G, an early shunt product of the mithramycin biosynthetic pathway accumulated upon inactivation of oxygenase MtmOII. Angewandte Chemie International Edition 45, 5685–5689 (2006).
Article CAS PubMed Google Scholar
Koopman, F., Wierckx, N., de Winde, J. H. & Ruijssenaars, H. J. Identification and characterization of the furfural and 5-(hydroxymethyl)furfural degradation pathways of Cupriavidus basilensis HMF14. Proc Natl Acad Sci USA 107, 4919–4924, https://doi.org/10.1073/pnas.0913039107 (2010).
Article ADS PubMed PubMed Central Google Scholar
Schiering, N. et al. Structure of the detoxification catalyst mercuric ion reductase from Bacillus sp. strain RC607. Nature 352, 168–172, https://doi.org/10.1038/352168a0 (1991).
Article ADS CAS PubMed Google Scholar
Adamek, M. et al. Comparative genomics reveals phylogenetic distribution patterns of secondary metabolites in Amycolatopsis species. BMC Genomics 19, 426, https://doi.org/10.1186/s12864-018-4809-4 (2018).
Article CAS PubMed PubMed Central Google Scholar
Du, Y. et al. Comparative genomic analysis of Bacillus paralicheniformis MDJK30 with its closely related species reveals an evolutionary relationship between B. paralicheniformis and B. licheniformis. BMC Genomics 20, 283, https://doi.org/10.1186/s12864-019-5646-9 (2019).
Article PubMed PubMed Central Google Scholar
May, J. J., Wendrich, T. M. & Marahiel, M. A. The dhb operon of bacillus subtilisEncodes the biosynthetic template for the catecholic siderophore 2, 3-dihydroxybenzoate-glycine-threonine trimeric ester bacillibactin. Journal of Biological Chemistry 276, 7209–7217 (2001).
Article CAS PubMed Google Scholar
Lang, W., Glassey, K. & Archibald, A. Influence of phosphate supply on teichoic acid and teichuronic acid content of Bacillus subtilis cell walls. Journal of bacteriology 151, 367–375 (1982).
CAS PubMed PubMed Central Google Scholar
Vanittanakom, N., Loeffler, W., Koch, U. & Jung, G. Fengycin–a novel antifungal lipopeptide antibiotic produced by Bacillus subtilis F-29-3. J Antibiot (Tokyo) 39, 888–901 (1986).
Article CAS Google Scholar
Al-Amoudi, S. et al. Bioprospecting Red Sea Coastal Ecosystems for Culturable Microorganisms and Their Antimicrobial Potential. Mar Drugs 14, https://doi.org/10.3390/md14090165 (2016).
Article PubMed Central Google Scholar
Krumsiek, J., Arnold, R. & Rattei, T. Gepard: a rapid and sensitive tool for creating dotplots on genome scale. Bioinformatics 23, 1026–1028, https://doi.org/10.1093/bioinformatics/btm039 (2007).
Article CAS PubMed Google Scholar
Sommer, D. D., Delcher, A. L., Salzberg, S. L. & Pop, M. Minimus: a fast, lightweight genome assembler. BMC Bioinformatics 8, 64, https://doi.org/10.1186/1471-2105-8-64 (2007).
Article CAS PubMed PubMed Central Google Scholar
Alam, I. et al. INDIGO–INtegrated Data Warehouse of MIcrobial GenOmes with examples from the red sea extremophiles. PloS one 8, e82210 (2013).
Article ADS PubMed PubMed Central Google Scholar
Bertelli, C. et al. IslandViewer 4: expanded prediction of genomic islands for larger-scale datasets. Nucleic acids research 45(W30–W35%@), 0305–1048 (2017).
Google Scholar
Arndt, D. et al. PHASTER: a better, faster version of the PHAST phage search tool. Nucleic Acids Res 44, W16–21, https://doi.org/10.1093/nar/gkw387 (2016).
Article CAS PubMed PubMed Central Google Scholar
Edgar, R. C. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic acids research 32, 1792–1797 (2004).
Article CAS PubMed PubMed Central Google Scholar
Kück, P. & Longo, G. C. FASconCAT-G: extensive functions for multiple sequence alignment preparations concerning phylogenetic studies. Frontiers in zoology 11, 81 (2014).
Article PubMed PubMed Central Google Scholar
Darriba, D., Taboada, G. L., Doallo, R. & Posada, D. ProtTest 3: fast selection of best-fit models of protein evolution. Bioinformatics 27, 1164–1165 (2011).
Article CAS PubMed Google Scholar
Guindon, S., Dufayard, J., Hordijk, W., Lefort, V. & Gascuel, O. In Infection Genetics and Evolution. 384–385 (Elsevier Science BV PO BOX 211, 1000 AE Amsterdam, Netherlands).
Moszer, I., Glaser, P. & Danchin, A. SubtiList: a relational database for the Bacillus subtilis genome. Microbiology 141, 261–268 (1995).
Article CAS PubMed Google Scholar
Moszer, I., Jones, L. M., Moreira, S., Fabry, C. & Danchin, A. SubtiList: the reference database for the Bacillus subtilis genome. Nucleic acids research 30, 62–65 (2002).
Article CAS PubMed PubMed Central Google Scholar
Caspi, R. et al. The MetaCyc database of metabolic pathways and enzymes and the BioCyc collection of Pathway/Genome Databases. Nucleic Acids Research 42, D459–D471, https://doi.org/10.1093/nar/gkt1103 (2014).
Article CAS PubMed Google Scholar
Altschul, S. F., Gish, W., Miller, W., Myers, E. W. & Lipman, D. J. Basic local alignment search tool. Journal of molecular biology 215, 403–410 (1990).
Article CAS PubMed Google Scholar
Finn, R. D., Clements, J. & Eddy, S. R. HMMER web server: interactive sequence similarity searching. Nucleic Acids Research 39, W29–W37, https://doi.org/10.1093/nar/gkr367 (2011).
Article CAS PubMed PubMed Central Google Scholar
Blin, K. et al. antiSMASH 4.0-improvements in chemistry prediction and gene cluster boundary identification. Nucleic Acids Res, https://doi.org/10.1093/nar/gkx319 (2017).
Article CAS PubMed PubMed Central Google Scholar
Medema, M. H., Takano, E. & Breitling, R. Detecting Sequence Homology at the Gene Cluster Level with MultiGeneBlast. Molecular Biology and Evolution 30, 1218–1223, https://doi.org/10.1093/molbev/mst025 (2013).
Article CAS PubMed PubMed Central Google Scholar
Harwood, C. & Cutting, S. Vol. 548 (Wiley Chichester, United Kingdom, 1990).
Schaeffer, P., Millet, J. & Aubert, J.-P. Catabolic repression of bacterial sporulation. Proceedings of the National Academy of Sciences 54, 704–711 (1965).
Article ADS CAS Google Scholar

Download references

Acknowledgements

The authors wish to acknowledge the experimental support from the King Abdullah University of Science and Technology (KAUST) Bioscience Core Laboratory. The research reported in this publication was supported by King Abdullah University of Science and Technology (KAUST) through the Awards Nos. FCC/1/1976-02-01, FCC/1/1976-17-01, FCC/1/1976-03-01, FCC/1/1976-20-01, FCC/1/1976-16-01, FCS/1/2911-01-01, BAS/1/1606-01-01, URF/1/1976-06-01, BAS/1/1624-01-01, BAS/1/1659-01-01, BAS/1/1059-01-01 from the Office of Sponsored Research (OSR).

Author information

These authors contributed equally: G. Othoum and S. Prigent.

Authors and Affiliations

Computational Bioscience Research Center (CBRC), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Kingdom of Saudi Arabia
G. Othoum, S. Bougouffa, X. Gao, R. Hoehndorf, S. T. Arold, T. Gojobori, F. F. Lafi, V. B. Bajic & M. Essack
Department of Biology and Biological Engineering, Division of Systems & Synthetic Biology, Chalmers University of Technology, Kemivägen 10, 41296, Gothenburg, Sweden
S. Prigent, A. Derouiche, L. Shi, J. Nielsen & I. Mijakovic
Biological and Environmental Sciences and Engineering Division (BESE), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Kingdom of Saudi Arabia
A. Bokhari, T. Gojobori & H. Hirt
Department of Biology, Science and Arts College, King Abdulaziz University, Rabigh, 21589, Kingdom of Saudi Arabia
S. Alamoudi
College of Natural and Health Sciences, Zayed University, 144534, Abu-Dhabi, United Arab Emirates
F. F. Lafi
Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, 2800, Lyngby, Denmark
J. Nielsen & I. Mijakovic
Science for Life Laboratory, Royal Institute of Technology, Solna, Sweden
J. Nielsen

Authors

G. Othoum
View author publications
You can also search for this author in PubMed Google Scholar
S. Prigent
View author publications
You can also search for this author in PubMed Google Scholar
A. Derouiche
View author publications
You can also search for this author in PubMed Google Scholar
L. Shi
View author publications
You can also search for this author in PubMed Google Scholar
A. Bokhari
View author publications
You can also search for this author in PubMed Google Scholar
S. Alamoudi
View author publications
You can also search for this author in PubMed Google Scholar
S. Bougouffa
View author publications
You can also search for this author in PubMed Google Scholar
X. Gao
View author publications
You can also search for this author in PubMed Google Scholar
R. Hoehndorf
View author publications
You can also search for this author in PubMed Google Scholar
S. T. Arold
View author publications
You can also search for this author in PubMed Google Scholar
T. Gojobori
View author publications
You can also search for this author in PubMed Google Scholar
H. Hirt
View author publications
You can also search for this author in PubMed Google Scholar
F. F. Lafi
View author publications
You can also search for this author in PubMed Google Scholar
J. Nielsen
View author publications
You can also search for this author in PubMed Google Scholar
V. B. Bajic
View author publications
You can also search for this author in PubMed Google Scholar
I. Mijakovic
View author publications
You can also search for this author in PubMed Google Scholar
M. Essack
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

The study was conceived and designed by G.O., S.P., J.N., I.M., V.B.B. and M.E. Data was generated by S.P., G.O., S.B., A.D., L.S., A.B., S.A., H.H. and F.F.L. Data analysis was performed by G.O., S.P., A.D. and M.E. Discussion of analyzed data was provided by X.G., R.H., S.T.A., T.G. and I.M. The manuscript was written by S.P., G.O., J.N., I.M., V.B.B. and M.E.

Corresponding authors

Correspondence to I. Mijakovic or M. Essack.

Ethics declarations

Competing interests

V.B.B. is an editorial board member of the Scientific Reports journal. All other authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Figures and Tables

Supplementary Dataset S1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Othoum, G., Prigent, S., Derouiche, A. et al. Comparative genomics study reveals Red Sea Bacillus with characteristics associated with potential microbial cell factories (MCFs). Sci Rep 9, 19254 (2019). https://doi.org/10.1038/s41598-019-55726-2

Download citation

Received: 08 March 2019
Accepted: 27 November 2019
Published: 17 December 2019
DOI: https://doi.org/10.1038/s41598-019-55726-2

This article is cited by

Genome mining, antimicrobial and plant growth-promoting potentials of halotolerant Bacillus paralicheniformis ES-1 isolated from salt mine
- Sajid Iqbal
- Muhammad Qasim
- Hussnain Ahmed Janjua
Molecular Genetics and Genomics (2023)
Hemicellulosic biomass conversion by Moroccan hot spring Bacillus paralicheniformis CCMM B940 evidenced by glycoside hydrolase activities and whole genome sequencing
- Soufiane Maski
- Serigne Inssa Ngom
- Christel Béra-Maillet
3 Biotech (2021)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.