Abstract
Bradyrhizobium diazoefficiens USDA110 is one of the most effective nitrogen-fixing symbionts of soybeans. Here we carried out a large-scale transposon insertion sequencing (Tn-seq) analysis of strain Bd110spc4, which is derived from USDA110, with the goal of increasing available resources for identifying genes crucial for the survival of this plant symbiont under diverse conditions. We prepared two transposon (Tn) insertion libraries of Bd110spc4 with 155,042 unique Tn insertions when the libraries were combined, which is an average of one insertion every 58.7 bp of the reference USDA110 genome. Application of bioinformatic filtering steps to remove genes too small to be expected to have Tn insertions, resulted in a list of genes that were classified as putatively essential. Comparison of this gene set with genes putatively essential for the growth of the closely related alpha-proteobacterium, Rhodopseudomonas palustris, revealed a small set of five genes that may be collectively essential for closely related members of the family Bradyrhizobiaceae. This group includes bacteria with diverse lifestyles ranging from plant symbionts to animal-associated species to free-living species.
Similar content being viewed by others
Introduction
Bradyrhizobium diazoefficiens, formerly named Bradyrhizobium japonicum1, is an alpha-proteobacterium that infects soybeans (Glycine max) to form nitrogen-fixing root nodules that provide useable nitrogen to the host plant. With its outstanding nitrogen-fixing abilities2,3,4,5,6, B. diazoefficiens USDA110 has been established as a model organism in studies of the soybean-bacterial symbiosis. Although many genes important or required for symbiosis have been identified, many fewer genes have been identified that are important for the fitness of USDA110 or its derivatives in other environmental settings, such as drought-stressed soils or in soils near or distant from soybean roots.
A now standard method to identify bacterial genes that are important for viability under a particular condition is transposon (Tn) sequencing (Tn-seq)7,8,9,10,11. Tn-seq is based on the generation of a transposon insertion library of a bacterium of interest. Hundreds of thousands of Tn insertions can be identified simultaneously by high-throughput sequencing and those genes which receive few or no insertions are classified as putatively essential under the conditions being interrogated12. To provide a community resource that we or others can use to identify genes important for fitness, we generated a transposon insertion library of Bd110spc4, mapped the sites of Tn insertions, and applied bioinformatic analysis to estimate a set of putative essential genes required for aerobic growth on yeast extract- arabinose-gluconate (AG) medium. Bd110spc4 is a spontaneous spectinomycin resistant mutant of B. diazoefficiens USDA11013.
This baseline data set can be used in conjunction with the Bd110spc4 transposon insertion library that we generated to identify genes that are needed for Bd110spc4 growth or survival in natural conditions or in a specific condition in the laboratory. We found immediate utility for the list of Bd110spc4 putative essential genes when we used multiple BLAST comparisons to identify a small set of five genes that we hypothesize to be collectively or conditionally essential for a subset of related species in the family Bradyrhizobiaceae with diverse lifestyles but not for other alpha-proteobacteria.
Results
Transposon library construction
Two Bd110spc4 libraries were generated by delivering suicide plasmid pLG10711 from Escherichia coli strain SM10 \(\mathrm{\lambda }\)pir to Bd110spc4. pLG107 carries the Tn5-based transposon T24 (ISlacZ-prhaBo/FRT-kan). Individual conjugative mating mixtures were plated on yeast extract-arabinose-gluconate (AG) agar supplemented with kanamycin (Km) to select for Tn insertions and chloramphenicol to select against the E. coli donor strain, since Bd110spc4 is naturally resistant to chloramphenicol. Two mutant libraries were prepared, and the Tn insertions were mapped by Illumina sequencing to the USDA110 genome (BA000040)6 as described in Materials and Methods.
Identification and analysis of putative essential genes
We pooled data from five sequenced samples from two Tn libraries (MP1 and MP2) that we prepared, leading to the identification of 155,042 unique Tn insertions, 129,275 of which are in predicted genes and 25,767 of which are located in intergenic regions (Table 1) (Tables S1–3). We carried out a series of computational steps to generate normalized read values for each of the 8317 annotated protein-encoding genes (Table S4). First, the number of reads were normalized to a total of 107 reads per sample (see Table S1–3 in the supplemental material). Second, since we did not consider Tn insertions in the first 5% or the last 10% of each gene, the total number of reads per gene were normalized based on 85% of the gene length, resulting in a 5–90 RpK value. Log2 [5-90RpK normalized] values were then plotted against numbers of genes (See Figure S2 in Supplemental Material). To identify genes that were required for growth under the conditions of the Tn mutant library construction, we conducted a Gaussian distribution analysis to find genes with no or very low RpK values compared to the rest of the genes. For this we used a cut-off of 99.9% confidence interval based on log25-90 RpK (Figure S2). Putatively essential genes were defined as those with RpK values below this cut-off for each of the five samples analyzed (Table S4). The predicted essential genes were further filtered to remove genes too small to be expected statistically (P-value < 0.005) to have an insertion based on the density of Tn insertions that we observed. This resulted in the removal of genes with a length < 310 bp from consideration as essential genes.
From this analysis, we noticed that one region of the USDA110 genome of 9.1 Mb had almost no Tn insertions. Shortly after this, a report was published showing that Bd110spc4 has a deletion of genes blr0001-bsr0078 and blr8173-bsl8317 encompassing a contiguous region of about 202 kb that includes 223 genes as well as a few other changes, resulting in a genome size of approximately 8.9 Mb14. Bd110spc4, which has been in laboratory culture since 198313, has maintained its nitrogen-fixing abilities and is not defective in symbiosis14. In estimating the number of putatively essential genes, we used the USDA110 genome sequence and annotation, minus the 223 genes that have been deleted from the Bd110spc4 genome (Fig. 1), resulting in an estimate of 622 putative essential genes (Table S5). This is in the same range as the number of putative essential genes inventoried for other bacteria in other studies8, 11, 15,16,17,18,19.
The set of putative essential genes that we identified has a similar enrichment of functions to those reported for essential gene sets from other bacteria. Of the 622 protein-encoding genes identified, 511 genes were assigned a Clusters of Orthologous Groups of proteins (COG) identifier (Fig. 2) (Table S6)20. Relative to the numbers of genes in each category in the USDA110 genome, there was an enrichment of putative essential genes in categories D (Cell cycle control), F (Nucleotide transport and metabolism), H (Coenzyme transport and metabolism), M (Cell wall, membrane and envelope biogenesis), U (Intracellular trafficking, secretion, and vesicular transport) and over a three-fold enrichment in category J (Translation, ribosomal structure and biogenesis).
Our data set has caveats
Having said this, we also know that the Tn library that we generated does not reflect the number of Bd110spc4 genes that are truly essential with complete accuracy. Recent studies of other bacteria have generated datasets of hundreds of thousands of Tn insertions and this is accompanied by an increase in accuracy. For reasons described in the methods section, the generation of the Bd110spc4 Tn libraries was laborious, and the total number of Tn insertions present in the Tn libraries that were generated were lower than we would have liked. However, in view of the fact that no other Tn-seq analysis of any Bradyrhizobium strain appears to have yet been reported, we decided to move forward with our libraries of 155,042 unique insertions. Our complete analysis can be found in the accompanying Supplementary Tables. In the following two paragraphs we describe caveats that are evident from our analysis of the Tn insertion data.
One is that our list of 622 putative essential genes, includes genes that we know not to be required for aerobic growth of any bacterium on a complex medium such as AG. These include some nitrogen fixation genes, hydrogenase genes, nodulation genes, conjugal transfer genes, rchl type 3 secretion genes, cbbS, encoding the small subunit of rubisco, flagella genes, and the regulator gene regR. One reason why some non-essential genes appear on putative essential gene lists is transposon Tn5 and its derivatives do not insert completely randomly into DNA. This is clear from looking at Tables S1 and S2, where one can see that Tn insertion points across genes are not evenly distributed and that the Tn can insert into the exact same site with high frequency. In general, Tn5 inserts into regions of high GC content much more efficiently compared to more AT rich regions21. In this regard it is interesting to note that most of the many putatively essential genes that we know to not to be needed for growth on AG medium, are located in the large 680-Kb symbiosis island that is located at genome coordinates 1681–2362 Kb (Table S5). The symbiosis island has a considerably lower GC content than the rest of the genome (Fig. 1). In all likelihood, the symbiosis island encompasses 123 false positive essential genes (shaded in yellow in Table S5).
A second caveat is that some genes that we expected to be essential were not identified as essential in Bd110spc4. Examples are RNA polymerase genes and some ribosomal proteins genes. In instances such as these, we found it important to look at the Tn insertion data carefully. Most of ribosomal protein genes identified as non-essential have Tn insertions located only at the beginning and/or the end of the gene, suggesting that only a segment of the gene is essential, but our library is not dense enough to allow highly accurate identification of the entire essential genome. We found that using a less stringent cut-off for essentiality revealed additional genes that may be important for cell viability (Table S4). For example, genes bll5409 and bll5410 encoding the beta and beta prime subunits of RNA-polymerase each met the criteria for essentiality in four of the five Tn library samples sequenced and are scored as reduced fitness genes in the fifth Tn library sample. Another example is succinate dehydrogenase, a four-subunit enzyme that is part of the tricarboxylic acid cycle. The four genes encoding this enzyme are essential in R. palustris but not in E. coli and one of them (blr0515) is putatively essential in Bd110spc4. Inspection of the data in Table S4 shows that the full set of succinate dehydrogenase genes is essential in three of the five Tn library samples analyzed. From this we conclude that succinate dehydrogenase probably is necessary for viability of Bd110spc4 under the conditions in which we constructed the library.
Genes that may be collectively essential for a subgroup of the Bradyrhizobiaceae
We were interested to see that five genes that scored as necessary for growth of both Bd110spc4 and R. palustris CGA009 do not have homologs in E. coli or the alpha-proteobacterium Caulobacter crescentus11. A Blastp analysis showed that the top hits to these five Bd110spc4 genes were to homologues present in species of Afipia, Nitrobacter, Tardiphaga, Oligotropha and Rhodopseudomonas (Table 2). These genera form a subgroup within the family Bradyrhizobiaceae22. We were interested to see that four of the five genes that we hypothesize to be essential for members of the subgroup of Bradyrhizobiaceae based on Tn-seq, encode hopanoid biosynthesis enzymes. Hopanoids have some of the properties of sterols. They comprise a small percentage of membrane lipid and confer rigidity to phospholipid bilayers and to the lipopolysaccharide layer of outer membranes23, 24. The basic cyclic triterpenoid hopanoid structure is derived from squalene by the squalene-hopene cyclase (encoded by sqhC) in an oxygen-independent reaction. Three genes, hpnC, hpnD and hpnE catalyze the biosynthesis of squalene. Squalene synthase, encoded by hpnC, is considered to be the first committed step in hopanoid biosynthesis25, 26. In past work investigators attempted but failed to isolate a sqhC mutant of B. diazoefficiens, which supports our data presented here showing hopanoid synthesis is required for growth27. A sqhC deletion mutant of a different species, Bradyrhizobium BTAi1, has been described that is defective in growth and sensitive to a variety of stresses24.
Discussion
In our (MP2) library, which is available to the community as a shared resource, a total of 7347 genes were found to harbor at least one insertion located in the central 5 to 90% of gene length out of the 8094 genes (8317 genes minus the 223 genes deleted). This corresponds to insertions in 90.7% of genes. This Tn library and the accompanying files that document the sites of insertions of T24 should be useful to investigators in several different contexts. One would be to use the Tn library to determine genes important for success in the soybean rhizosphere or in root colonization prior to root nodule formation. A study like this was recently carried out with Rhizobium leguminosarum bv. viciae28. Also, since it is quite time consuming to construct B. diazoefficiens mutants, inspection of Table S4 can provide confidence that the gene to be disrupted is not essential and that it should be possible to obtain the mutant in question. In addition, a particular gene may be deemed non-essential, but close inspection of the domain structures of the gene may reveal that some domains are likely to be essential. Experimental validation of essential gene candidates by manually deleting genes and checking their phenotypes is always important29, 30.
From this analysis we identified a set of five genes that we hypothesize may be essential or conditionally essential for the viability of closely related members of the family Bradyrhizobiaceae. The five genes include four genes for synthesis of hopanoid lipids. The Bradyrhizobiaceae encompass physiologically disparate bacteria including the nitrate-oxidizing genus Nitrobacter, the CO, CO2 and H2 -oxidizing species Oligotropha carboxidovorans, the metabolically versatile phototrophic genus Rhodopseudomonas, and the animal-associated bacterium Afipia felix, in addition to Bradyrhizobium. Despite their diverse physiologies, it seems likely that, unlike other bacteria, most of which do not make hopanoids, this subgroup pf Bradyrhizobiaceae rely on membrane hopanoids for full viability in diverse conditions (Table 2).
Materials and Methods
Bacterial strains and growth
This study was carried out with B. diazoefficiens 110spc4, a spontaneous spectinomycin resistant mutant13. This strain was a gift from Hauke Hennecke. Bd110spc4 was grown at 30 °C in arabinose-gluconate (AG) medium31.
Transposon library construction
Transposon mutagenesis was carried out by conjugation of Bd110spc4 with E. coli SM10λpir (pLG107)11. Cultures of donor E. coli cells [SM10λpir (pLG107)] in mid-exponential growth (OD600 0.2–0.5) were mixed with Bd110spc4 cells (OD600 0.4–0.8). Cultures (200 ml volumes) were combined in a one-to-one ratio, harvested by centrifugation, suspended in 10 ml phosphate buffer (0.1 M, pH 7) and then filtered onto 10 nitrocellulose discs (0.45 µM pore diameter, Millipore). The filters were placed mating site up on AG plates with no antibiotics. After an overnight incubation, cells from the ten nitrocellulose discs were resuspended in a 1:1 mixture of 0.1 M phosphate buffer and 50% glycerol and stored at -80 °C. The mating suspension was then spread onto 24- by 24 cm QTrays (Molecular Devices) of AG medium supplemented with spectinomycin (100 µg/ml), kanamycin (300 µg/ml) and chloramphenicol (60 µg/ml). A total of 12 matings were carried out. After 10 to 15 days of growth, colonies from each mating were harvested and resuspended in 9 ml of a 1:1 mixture of 0.1 M phosphate buffer and 50% glycerol. One aliquot from each mating was mixed with the others to form master pool 1 (MP1), which was then aliquoted and frozen. To obtain samples for Tn-seq analysis, we thawed two library aliquots, which were treated independently to generate two technical replicates (MP1-A1 and MP1-B1). These two independent samples from MP1 were subjected to high-throughput sequencing. The read values of each technical replicate were normalized to 107 reads to allow comparison between the samples. The total number of reads per insertion location were counted, the genomic locations of the Tn24 insertions were annotated, and read counts per gene were generated for the central 5 to 90% of a gene divided by the full length of the gene or the length of the 5–90% portion of the gene and normalized to reads per kilobase (RpK, or 5–90 RpK, see Table S4). In addition, reads mapping to multiple locations in the genome were counted for each gene. Finally, a combined list of all insertion locations across the two replicates was generated. (Table S1). Technical replicates showed a good correlation of read counts (Figure S1). The columns “MP1-A1 reads” and “MP1- B1 reads” in Table S1 indicate the numbers of multiple and uniquely mapped reads, whereas “MP1-A1 Multi” and “MP1- B1 Multi “columns include the numbers of reads that mapping to multiple locations in the genome. We used the same approach to analyze the MP2 samples (Table S2).
Using our custom bioinformatics pipeline described in Pechter et al.11, a total of 68,971 unique insertion location sites were identified and supported by at least two reads. However, 93 to 95% of the reads failed to align to the chromosome of Bd110spc4. Instead, most of the reads aligned to the pLG107 plasmid suggesting that the plasmid was retained in Bd110spc4 after conjugation either as a replicating vector or integrated into its chromosome. To measure the proportion of plasmids relative to inserted transposons, we performed real time PCR using primer sets specific for the transposon and specific for the plasmid backbone on DNA extracted from (i) the MP1 master pool, (ii) the MP1 pool grown in AG plus Km and Spc liquid culture (about 7 generations) and (iii) the MP1 pool re-plated on AG plates supplemented with Km and Spc (real-time PCR conditions and primers are available on request). The values obtained from the real time PCR were analyzed as a ratio of transposon-specific DNA to plasmid backbone-specific DNA. We obtained a ratio of about 1 with DNA extracted from the MP1 pool, confirming the presence of pLG107 in the MP1 cells; about 10 with the liquid grown MP1 cells, and about 150 with the re-plated MP1 pool, suggesting that most of the plasmid was cured from the cells when the MP1 cells were re-plated. Therefore, an aliquot of master pool 1 was thawed and plated onto 24- by 24 cm QTrays (Molecular Devices) of AG medium supplemented with Spc (100 µg/ml) and Km (100 µg/ml). After 15 to 20 days about 360,000 individual colonies were collected in a 1:1 mixture of 0.1 M phosphate buffer and 50% glycerol to create master pool 2 (MP2). The prolonged period of growth was necessary to allow sufficient cell replications (about 30 generations) for the suicide plasmid pLG107 to be lost from cells and for growth of slow growing transconjugants. Master pool 2 was aliquoted (96 tubes of 1 ml aliquots) and frozen at −80 °C. Three aliquots of MP2 (MP2-M1, MP2-H1 and MP2-H2) were subjected to high-throughput sequencing and the genomic locations of the Tn24 insertions were mapped (Table S2). Technical replicates showed a good correlation of read counts (Figure S2). Sequencing of MP2 indicated that about 70% of the reads mapped to the B. diazoefficiens genome. The unique transposon insertion sites MP1 and MP2 are listed in Table S3.
Tn-seq Illumina sequencing
Bd110spc4 Tn library DNA (MP1 and MP2) was prepared for Illumina sequencing using the TdT method, with some modifications19, 32, 33. Genomic DNA was extracted using the Qiagen DNeasy kit, eluted in water and quantified using the Qubit kit (Life Technologies). Total genomic DNA used for shearing step for each sample was 2.5 µg with a final volume in each Covaris tube at 130 µl (Covaris E210, 10% duty cycle, intensity of 5, 100 cycles per burst, 200-s duration). Sheared samples were end-repaired using the NEBNext End repair reagents (NEB) following manufacturer instructions. End-repaired DNA was purified (MinElute PCR Cleanup kit) and eluted in EB. For each sample, 1 µg of purified end-repaired DNA was mixed with a 2 µl solution containing 0.5 mM ddCTP (Affymetrix) and 9.5 mM dCTP, 4 µl of 5X TdT reaction buffer, 1 µl of terminal deoxynucleotidyl transferase (Promega) in a final volume of 20 µl. Each sample was incubated 1 h at 37 °C then 20 min at 75 °C and purified using Performa DTR gel filtration columns (Edge Biosystems). To amplify fragments containing the transposon a first qPCR was performed with 5 µl of purified C-tailed DNA as template and primers olj376 (GTGACTGGAGTTCAGACGTGTGCTCTTCCGATCTGGGGGGGGGGGGGGGG) and T24-88 (GATCCCCCTAGGGCGCGCCGAAG) at 250 nM each using SsoFast EvaGreen supermix (Biorad) in 50-µl reactions. The following real time PCR program was used: initial denaturation at 95 °C for 2 min 30 s, and then 24 cycles of 95 °C for 25 s, 59 °C for 25 s, 72 °C for 30 s. A second qPCR was then performed using the same reagents but with 1 µl of the previous PCR product and primers T23_SLXA_PAIR_AMPF_3.
(AATGATACGGCGACCACCGAGATCTACACTAGAGAATAGGAACTTCGGAATAGGAACTTCTTAGATGTGTATAAGAG) and TdT_Index_1_ATCACG.
(CAAGCAGAAGACGGCATACGAGATCGTGATGTGACTGGAGTTCAGACGTGTGCTCTTCCGATCT). The same real time PCR program was used except that 14 cycles were performed. No sample was multiplexed. DNA was purified with the MinElute PCR purification kit (Qiagen) and eluted in 12 µl EB. Samples were quantified using fluorometry (Qubit) and samples quality was checked using the TapeStation system (Agilent). Samples were sequenced with 36-bp single end reads on an Illumina Hi-seq instrument (Genomic Resources at Fred Hutchinson Cancer Research Center) for MP1-A1, MP1-B1, MP2-H1 and MP2-H2 samples and on an Illumina Mi-seq instrument (from the Manoil laboratory, University of Washington) for MP2-M1 sample, using sequencing primer T23_SEQ_G.
(AATAGGAACTTCGGAATAGGAACTTCTTAGATGTGTATAAGAGACAG). Tn-seq sequencing data were analyzed according to the pipeline and protocols described previously11.
Determination of putative essential genes
To determine the set of putative essential genes, the log2 RpK values for Tn insertions in each gene, determined as described above, were plotted and subjected to Gaussian distribution analysis. Essential candidates were selected with log2 5–90 RpK values falling outside a 99.9% confidence interval (P < 0.001 by two-tailed test). The essential candidates were constrained by overlapping the essential candidates of five sequencing runs for the two master pools (MP-1 and MP-2); MP1-A1, MP1-B1, MP2-M1, MP2-H1 and MP2-H2. In addition, genes, particularly short genes, may escape transposition insertion by chance. The probability of escaping insertion can be calculated using the Poisson formula; P(0) = e-(l) = e(length/58.7) where l is the expected value of insertions calculated by the length of the gene divided by the average spacing, 58.7 bp . Based on this, genes of < 310 bp were expected to have no Tn insertions (P-value < 0.005). Thus, these genes were removed from consideration as essential.
Bioinformatic resources
DEG (database of essential genes, http://tubic.tju.edu.cn/deg/) was used to blast the essential homologues, WebMGA (http://weizhong-lab.ucsd.edu/metagenomic-analysis/) for Cluster of orthologous groups (COG) analysis, CGView Server (http://stothard.afns.ualberta.ca/cgview_server/), for the visualization of putatively essential genes in the context of the B. diazoefficiens 110spc4 chromosome.
Statistical analyses
Statistical analyses were performed using Excel and R (http://www.R-project.org/).
Datasets
All data generated or analyzed during this study are included in this published article and its Supplementary Information files.
References
Delamuta, J. R. M. et al. Polyphasic evidence supporting the reclassification of Bradyrhizobium japonicum group Ia strains as Bradyrhizobium diazoefficiens sp. nov. Int. J. Syst. Evol. Microbiol. 63, 3342–3351 (2013).
Kuykendall, L. & Elkan, G. Rhizobium japonicum derivatives differing in nitrogen-fixing efficiency and carbohydrate utilization. Appl. Environ. Microbiol. 32, 511–519 (1976).
Schubert, K. R., Jennings, N. T. & Evans, H. J. Hydrogen reactions of nodulated leguminous plants II. Effects on dry matter accumulation and nitrogen fixation. Plant Physiol. 61, 398–401 (1978).
Itakura, M. et al. Genomic comparison of Bradyrhizobium japonicum strains with different symbiotic nitrogen-fixing capabilities and other Bradyrhizobiaceae members. ISME J. 3, 326 (2009).
Mathis, J. N., McMillin, D. E., Champion, R. A. & Hunt, P. G. Genetic variation in two cultures of Bradyrhizobium japonicum 110 differing in their ability to impart drought tolerance to soybean. Curr. Microbiol. 35, 363–366 (1997).
Kaneko, T. et al. Complete genomic sequence of nitrogen-fixing symbiotic bacterium Bradyrhizobium japonicum USDA110. DNA Res. 9, 189–197 (2002).
Gawronski, J. D., Wong, S. M., Giannoukos, G., Ward, D. V. & Akerley, B. J. Tracking insertion mutants within libraries by deep sequencing and a genome-wide screen for Haemophilus genes required in the lung. Proc. Natl. Acad. Sci. 106, 16422–16427 (2009).
Langridge, G. C. et al. Simultaneous assay of every Salmonella Typhi gene using one million transposon mutants. Genome Res. 19, 2308–2316 (2009).
van Opijnen, T., Bodi, K. L. & Camilli, A. Tn-seq: high-throughput parallel sequencing for fitness and genetic interaction studies in microorganisms. Nat. Methods 6, 767–772. https://doi.org/10.1038/nmeth.1377 (2009).
Goodman, A. L., Wu, M. & Gordon, J. I. Identifying microbial fitness determinants by insertion sequencing using genome-wide transposon mutant libraries. Nat. Protoc. 6, 1969–1980 (2011).
Pechter, K. B., Gallagher, L., Pyles, H., Manoil, C. S. & Harwood, C. S. Essential genome of the metabolically versatile alphaproteobacterium Rhodopseudomonas palustris. J. Bacteriol. 198, 867–876. https://doi.org/10.1128/JB.00771-15 (2016).
Chao, M. C., Abel, S., Davis, B. M. & Waldor, M. K. The design and analysis of transposon insertion sequencing experiments. Nat. Rev. Microbiol. 14, 119–128 (2016).
Regensburger, B. & Hennecke, H. RNA polymerase from Rhizobium japonicum. Arch. Microbiol. 135, 103–366 (1983).
Fernández, N. et al. An integrated systems approach unveils new aspects of microxia-mediated regulation in Bradyrhizobium diazoefficiens. Front. Microbiol. 10, 924 (2019).
Dembek, M. et al. High-throughput analysis of gene essentiality and sporulation in Clostridium difficile. MBio 6, e02383-e12314 (2015).
Gislason, A. S., Turner, K., Domaratzki, M. & Cardona, S. T. Comparative analysis of the Burkholderia cenocepacia K56–2 essential genome reveals cell envelope functions that are uniquely required for survival in species of the genus Burkholderia. Microbial Genom. 3, 1 (2017).
Yang, H. et al. Genome-scale metabolic network validation of Shewanella oneidensis using transposon insertion frequency analysis. PLoS Comput. Biol. 10, e1003848 (2014).
Christen, B. et al. The essential genome of a bacterium. Mol. Syst. Biol. 7, 528 (2011).
Klein, B. A. et al. Identification of essential genes of the periodontal pathogen Porphyromonas gingivalis. BMC Genom. 13, 578 (2012).
Wu, S., Zhu, Z., Fu, L., Niu, B. & Li, W. WebMGA: a customizable web server for fast metagenomic sequence analysis. BMC Genom. 12, 1 (2011).
Green, B. et al. Insertion site preference of Mu, Tn5, and Tn7 transposons. Mob. DNA 3, 3. https://doi.org/10.1186/1759-8753-3-3 (2012).
Martinez-Romero, E. & Ormeño-Orrillo, E. A genomotaxonomy view of the Bradyrhizobium genus. Front. Microbiol. 10, 1334 (2019).
Belin, B. J. et al. Hopanoid lipids: from membranes to plant–bacteria interactions. Nat. Rev. Microbiol. 16, 304 (2018).
Silipo, A. et al. Covalently linked hopanoid-lipid A improves outer-membrane resistance of a Bradyrhizobium symbiont of legumes. Nat. Commun. 5, 1–11 (2014).
Furubayashi, M., Li, L., Katabami, A., Saito, K. & Umeno, D. Construction of carotenoid biosynthetic pathways using squalene synthase. FEBS Lett. 588, 436–442 (2014).
Perzl, M. et al. Cloning of conserved genes from Zymomonas mobilis and Bradyrhizobium japonicum that function in the biosynthesis of hopanoid lipids. Biochimica et Biophysica (BBA) Acta Lipids and Lipid Metabolism 1393, 108–118 (1998).
Kulkarni, G. et al. Specific hopanoid classes differentially affect free-living and symbiotic states of Bradyrhizobium diazoefficiens. MBio 6, e01251-e11215 (2015).
Wheatley, R. M. et al. Lifestyle adaptations of Rhizobium from rhizosphere to symbiosis. Proc Natl Acad Sci U SA. 117, 23823–23834 (2020).
Rubin, B. E. et al. The essential gene set of a photosynthetic organism. Proc. Natl. Acad. Sci. 112, E6634–E6643 (2015).
Lee, S. A. et al. General and condition-specific essential functions of Pseudomonas aeruginosa. Proc. Natl. Acad. Sci. 112, 5189–5194 (2015).
Sadowsky, M. J., Tully, R. E., Cregan, P. B. & Keyser, H. H. Genetic diversity in Bradyrhizobium japonicum serogroup 123 and its relation to genotype-specific nodulation of soybean. Appl. Environ. Microbiol. 53, 2624–2630 (1987).
Gallagher, L. A. et al. Resources for genetic and genomic analysis of emerging pathogen Acinetobacter baumannii. J. Bacteriol. 197, 2027–2035 (2015).
Lazinski, D. W. & Camilli, A. Homopolymer tail-mediated ligation PCR: a streamlined and highly efficient method for DNA cloning and library construction. Biotechniques 54, 25–34 (2013).
Acknowledgements
We thank all members of our laboratory for their help with many aspects of this work. We express our gratitude to Deng Yizhen (South China Agricultural University) for helpful discussions. We thank Gael Panis (University of Geneva, Switzerland) for the suggestion that a chromosomal deletion might have occurred in the B. diazoefficiens genome.
Funding
This study was supported by the National Natural Science Foundation of China (31771341) and by Guangdong Province Science and Technology Innovation Strategy Special Fund (Grant no: 2018B020206001). Support to the laboratory of CSH for this project was from the Agricultural and Food Research Initiative Competitive Grants Program 2010-65108-20536 from the US Department of Agriculture and also from the U.S. Department of Energy (DOE), Office of Biological and Environmental Research (BER), as part of BER’s Secure Biosystems Design Science Focus Area "Persistence Control of Engineered Functions in Complex Soil Microbiomes".
Author information
Authors and Affiliations
Contributions
C.B., W.D. and J.M. carried out the experiments, C.B., W.D., K.P. and C.S.H. analyzed the data. C.B. and C.S.H. wrote the manuscript. All authors reviewed the manuscript.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Baraquet, C., Dai, W., Mendiola, J. et al. Transposon sequencing analysis of Bradyrhizobium diazoefficiens 110spc4. Sci Rep 11, 13211 (2021). https://doi.org/10.1038/s41598-021-92534-z
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-021-92534-z
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.