Introduction

Diverse microorganisms live as endophytes and epiphytes of plants in nature. However, many questions regarding the driving forces and ecological rules underlying these relationships remain unanswered (Saito et al., 2007; Hardoim et al., 2008). During their evolution, legumes developed nodulation and mycorrhization systems to attain mutual symbiosis with rhizobia and mycorrhizae (Harrison, 2005; Stacey et al., 2006). Although plants are known to control the degrees of nodulation and mycorrhization of roots by rhizobia and mycorrhizae, respectively, through autoregulation systems (Carroll et al., 1985; Meixner et al., 2007), the effects of the autoregulation systems on the other plant-associated microorganisms in the phytosphere remain unclear (Parniske, 2000). Recent studies have shown that the wild-type and symbiosis-defective mutants of legumes such as Medicago truncatula and Glycine max have different microbial community structures, and certain microbes associate preferentially with nodulated or mycorrhizal root systems (Offre et al., 2007; Ikeda et al., 2008a). Furthermore, transcriptional analyses revealed that both nodulation and mycorrhization systemically induced a series of defense genes in a host plant and consequently increased its disease resistance (Liu et al., 2007; Kinkema and Gresshoff, 2008). These examples indicate that genetic alteration in the nodulation or mycorrhization signaling pathways can in turn alter the accompanying plant microflora, aside from rhizobia and mycorrhizae.

The autoregulatory mechanism in plants occurs through long-distance signaling between shoot and root (Oka-Kira and Kawaguchi, 2006), and a heavy supply of nitrate to the roots of leguminous plants inhibits nodulation through autoregulation (Carroll et al., 1985; Okamoto et al., 2009). Thus, there is a possibility that nodulation phenotype and host nitrogen status may affect the structure of the microbial community in shoots, as observed earlier in roots (Ikeda et al., 2008a). However, the culture-independent analysis of soybean stem-associated microbial communities failed because of serious interference by plant DNA in the PCR amplification (Ikeda et al., 2008a). To overcome this problem, a method was recently developed for enriching the bacterial cells collected from soybean stems (Ikeda et al., 2009). The enrichment method allows the construction of a clone library of the bacterial 16S ribosomal RNA (rRNA) genes, which provides better resolution for examining the microbial community structures compared with the limited analyses in previous studies based mainly on fingerprinting methods (Offre et al., 2007; Ikeda et al., 2008a).

In this study, the effects of nodulation phenotypes and nitrogen application levels were examined on the stem-associated bacterial community including both epiphytes and endophytes in field-grown soybeans. Clone libraries of the 16S rRNA genes were constructed in conjunction with the method of enriching bacterial cells from soybean stems, and the bacterial diversities in different nodulation phenotypes were evaluated based on 16S rRNA gene sequences.

Materials and methods

Plant materials and field experimental design

The plants included cultivar Enrei (wild-type nodulating cultivar, Nod+), lines En 1314 and En 1282 (non-nodulating mutants derived from Enrei, Nod; Francisco and Akao, 1993) and lines En 6500 and Sakukei 4 (hypernodulating mutants derived from Enrei, Nod++; Akao and Kouchi, 1989; Matsunami et al., 2004). Mutations in En 1314 and En 1282 soybeans were found in the NFR1 gene, which is known to be responsible for a LysM-type receptor kinase for Nod factor recognition (Radutoiu et al., 2003 Masaki Hayashi (National Institute of Agrobiological Sciences, Tsukuba, Japan), personal communication). A mutation in En 6500 was found in the GmNARK (NTS1) gene (Nishimura et al., 2002), which mediates systemic autoregulation of nodulation (Searle et al., 2003), and Sakukei 4 is a descendant with a hypernodulating phenotype that was derived from En 6500 (Matsunami et al., 2004).

The seeds were planted on 10 June 2008 in the experimental field at Tohoku University (Kashimadai, Miyagi, Japan), which has been cultivated with a rotation of rice (paddy field condition) and soybeans (upland field condition) every year since 1997. The field soil was classified as gray lowland soil (pH(H2O), 5.9; pH(KCl), 4.3; total carbon content, 1.21%; total nitrogen content, 0.11%; Truog phosphorus content, 69 mg P2O5 kg–1). Soybeans of each nodulation genotype were grown in two neighboring fields (standard nitrogen (SN) and heavy nitrogen (HN) fields). The SN field was dressed with SN fertilization (15 kg N ha–1) as a basal fertilization, and the HN field was also dressed four times with HN fertilization (N1 to N4 in Figure 1; each application 150 kg N ha–1 as urea, total 600 kg N ha–1) in addition to the basal fertilization.

Figure 1
figure 1

Time table and design of the field experiment. Basal fertilization was supplied for both standard and heavy nitrogen (HN) fields with 15 kg N ha–1 of a commercial fertilizer. The HN field was also supplied with nitrogen fertilization as urea four times (N1 to N4; 150 kg N ha–1 for each fertilization). GE1 to GE4 represent the days on which soybean growth indexes (Supplementary Figure 1) were measured. GS1 and GS2 indicate growth stages 1 and 2 for measuring dry weights and nitrogen contents of soybean tissues (Supplementary Figure 2). The sampling for the DNA extraction was carried out at GS2.

Growth evaluation and sampling

To define the environmental factors that are relevant to changes in bacterial community structures, the plant growth stage was monitored by measuring vegetative and reproductive indexes, plant dry weights, and nitrogen contents in plant tissues (GE1 to GE4 in Figure 1) and in field soils. Plant growth stages were evaluated according to the criteria of Fehr et al. (1971) four times throughout the growing period (GE1 to GE4 in Figure 1). Dry weights of shoots, roots, and nodules as well as the number of nodules were measured twice during the growing period (GS1 and GS2 in Figure 1). Total nitrogen of soybean shoots at GS1 was analyzed by the Kjeldahl method. Two plants were processed as a composite sample for measuring the N content of soybean shoots. The nitrogen content in soils under the SN and HN fertilization conditions was also measured three times before the additional fertilization of the HN field (N2, N3 and GS2 in Figure 1). Soil was air-dried and sieved through a 2-mm mesh. Dried soil was extracted by shaking with 2 M KCl solution for 30 min and then filtered through a filter paper (No. 2, Advantec, Tokyo, Japan). Contents of NH4+ and NO3 in the filtrates were determined by titrimetry after steam distillation (UDK127, Actac, Tokyo, Japan).

The aerial parts of soybean plants were harvested on 3 September 2008 (GS2 in Figure 1) and immediately transported on ice to the laboratory. Leaves were removed manually. The stems were serially washed well with tap and distilled water and stored at –80 °C until they were used. Four plants were processed as a composite sample for the DNA extraction. Three composite samples were analyzed for each genotype as a triplicate.

Clone library construction and sequencing

Stems (approximately 100 g) of a composite sample were homogenized without surface sterilization to prepare the stem-associated bacterial cells containing both epiphytes and endopytes, and the bacterial cells were extracted and purified using an enrichment method (Ikeda et al., 2009). Total bacterial DNA was extracted from each bacterial cell fraction by the DNA extraction method described by Ikeda et al. (2008b). The final DNA samples were resuspended in 100 μl of sterilized water. The quality and quantity of DNA were assessed spectrophotometrically by calculating absorbance at a wavelength of 260 nm (A260) and the A260/A230 and A260/A280 ratios. The triplicate DNA samples were prepared from composite samples for each soybean genotype under the SN and HN fertilization conditions.

PCR clone libraries for 16S rRNA genes were constructed as follows. Briefly, 50 ng of total bacterial DNA was used as a template in a final reaction volume of 60 μl, including 90 pmol of each primer and 9 U of Ex Taq DNA polymerase (Takara Bio, Otsu, Japan). The universal primers 27F (5′-AGAGTTTGATCMTGGCTCAG-3′) and 1525R (5′-AAGGAGGTGWTCCARCC-3′) were used (Lane, 1991). Cycling conditions were as follows: initial denaturation for 2 min at 94 °C; then 25 cycles consisting of 30 s at 94 °C, 30 s at 55 °C and 2 min at 72 °C; and a final extension for 10 min at 72 °C. Three PCR products derived from triplicate DNA samples were combined into a sample, and the PCR products were resolved by 1% agarose gel electrophoresis in 1 × TAE buffer. PCR products of the predicted size (approximately 1400 bp) were extracted from the gels using a QIAquick Gel Extraction Kit (Qiagen, Tokyo, Japan) and ligated into the plasmid vector pGEM-T Easy (Promega Japan, Tokyo, Japan) at 16 °C overnight using a DNA ligation kit, Mighty Mix (Takara Bio). ElectroTen-Blue Electroporation-Competent Cells (Stratagene, La Jolla, CA, USA) were then electroporated with the ligated DNA using a Gene Pulser (Bio-Rad Laboratories, Tokyo, Japan). After the transformants were cultured overnight at 37 °C on Luria-Bertani agar plates containing ampicillin (50 μg ml–1), 384 colonies were randomly selected from each library for sequencing. Sequencing analysis was conducted using a Type 3730 × l DNA Analyzer (Applied Biosystems, Foster City, CA, USA) using a BigDye Terminator Cycle Sequencing Reaction Kit (Applied Biosystems). Template DNAs were prepared using an Illustra TempliPhi DNA Amplification Kit (GE Healthcare, Uppsala, Sweden). A partial sequence of the 16S rRNA gene was obtained using the 27F primer. Sequences were processed with Phred (Ewing and Green, 1998; Ewing et al., 1998) at a cutoff value of 0.001 for eliminating low-quality regions. Then the 5′ region of the 16S rRNA gene (corresponding to bp 109–665 of the Escherichia coli 16S rRNA gene) was used for the sequence analyses.

Additional cloning and sequencing was carried out for three samples (Enrei of SN, Enrei of HN and En 1282 of HN) because of heavy contamination of chloroplast DNA. Libraries of these samples were screened by PCR with a soybean chloroplast-specific primer set, SCS-F (5′-AGGCGGTGGAAACTACCAAG-3′), and SCS-R (5′-CAGCGCCTAGTATCCATCGT-3′), designed in this study to eliminate clones derived from the chloroplast 16S rRNA gene. After the negative screening by PCR with the soybean chloroplast-specific primer set, 192, 96, and 96 clones were additionally subjected to sequencing for Enrei of SN, Enrei of HN and En 1282 of HN, respectively, as potentially nonbacterial clones.

Sequence analysis

The sequences were analyzed for orientation and detection of non-16S rDNA sequences using OrientationChecker (Ashelford et al., 2006). The presence of chimeras was assessed using MALLARD (Ashelford et al., 2006). A sequence identified at the 99.9% and 99% cutoffs or suspected at the 99.9% cutoff as a chimeric sequence was discarded. The remaining sequences were aligned using CLUSTAL W (Thompson et al., 1994). On the basis of the alignment, a distance matrix was constructed using the DNADIST program from PHYLIP ver. 3.66 (http://evolution.genetics.washington.edu/phylip.html) with the default parameters. The resulting matrices were run in DOTUR (Schloss and Handelsman, 2005) to generate diversity indexes. The default DOTUR settings were used with threshold values of 97% sequence identity for the definition of operational taxonomic units (OTUs). Library coverage was calculated with the non-parametric estimator C (Good, 1953), as described by Kemp and Aller (2004). The reciprocal of Simpson's index (1/D) was used as a measure of diversity to evaluate the level of dominance in a community (Zhou et al., 2002). The number of shared OTUs between libraries was calculated using SONS (Schloss and Handelsman, 2006).

UniFrac (Lozupone and Knight, 2005) was applied to examine the differences between clone libraries. A tree file generated by CLUSTAL W (Thompson et al., 1994) and an environment file, which links a file to a library, were uploaded to UniFrac (Lozupone and Knight, 2005). Principal coordinate analysis (PCoA) was performed by using UniFrac with the abundance-weighted option (Lozupone and Knight, 2005).

Phylogenetic analysis

The phylogenetic composition of the sequences in each library was evaluated using the Classifier program of RDP-II release 10 (Wang et al., 2007), with confidence levels of 80%. BLASTN (Altschul et al., 1990) was also used to classify the clones and to identify the closest relatives in the GenBank database.

For the phylogenetic analysis, sequences were aligned using the CLUSTAL W program (Thompson et al., 1994). The neighbor-joining method was used for building the trees (Saitou and Nei, 1987). The PHYLIP format tree output was obtained using the bootstrapping procedure (Felsenstein, 1985); 1000 bootstrap trials were used. The trees were constructed using TreeView software (Page, 1996).

Accession numbers of nucleotide sequences

Nucleotide sequences of 16S rRNA genes for the clone libraries have been deposited in the DDBJ database under accession numbers shown in Table 1.

Table 1 Accession numbers of sequences deposited in the DDBJ

Results

Soybean growth

The soybean stem-associated bacteria including endophytes and epiphytes at the reproductive stage (GS2 in Figure 1) were subjected to clone library analyses. The growth indexes revealed no difference in the stages of the reproductive growth phase among the soybean genotypes or nitrogen treatments throughout the growth period, although slight variations were observed for the vegetative growth stages among the soybean genotypes in the early phase of growth (Supplementary Figure 1). At the sampling time, the reproductive growth indexes were R6 (pod-maturing stage; Fehr et al., 1971) for all samples in the SN and HN fields, and the vegetative growth indexes ranged approximately from 12 to 15 for most samples. Thus, we concluded that the effects of soybean growth stage on the community analysis would not be significant among the genotypes examined under the SN or HN field conditions.

Statistics of clone libraries

The statistics of the clone libraries are summarized in Table 2. The library coverage ranged from 87.8 to 94.9%, showing a relatively high coverage for all libraries. Under the SN fertilization condition, Nod+ soybeans (Enrei) showed the highest number of OTUs and slightly higher diversity for all indexes compared with those of mutants, whereas no clear effect of nodulation phenotype was observed for the number of OTUs and diversity indexes under the HN fertilization condition.

Table 2 Statistical analysis of 16S rRNA gene clone libraries derived from soybean stems

Phylogenetic diversities of stem-associated bacteria with different nodulation phenotypes under standard nitrogen fertilization condition

The assessment of phylogenetic compositions using RDP Classifier revealed that proteobacteria was the most dominant taxon (88.7–96.5%) in all libraries at the phylum level under the SN fertilization condition (Figures 2a). In proteobacteria, the relative abundances of Alphaproteobacteria and Gammaproteobacteria varied according to either nodulation phenotype, whereas the abundance of Betaproteobacteria was relatively constant in all libraries constructed under these environmental conditions (Figure 2a). The ratio of Alpha/Gammaproteobacteria in Nod and Nod++ soybeans (4.6–6.1) was higher than that in Nod+ soybeans (2.9; Figure 2a).

Figure 2
figure 2

The phylogenetic compositions of 16S ribosomal RNA (rRNA) gene clone libraries of soybean shoot-associated bacteria in different nodulation phenotypes under standard and heavy nitrogen (HN) fertilization conditions. (a, b) Phylogenetic compositions at the phylum level; (c, d) relative abundance of three genera of Rhizobiales; (e, f) relative abundance of three orders of Gammaproteobacteria. Genotype stand for the name of plant materials: Enrei (e), En 1282 (12), En 1314 (13), En 6500 (65), Sakukei 4 (S4). Nod stands for the nodulation phenotype: wild-type (+), non-nodulated (−) and hypernodulated (++).

Further analyses of phylogenetic compositions at lower taxonomic levels revealed that three genera (Aurantimonas, Methylobacterium and Rhizobium) in the order Rhizobiales for Alphaproteobacteria (Figure 2c) and three orders (Pseudomonadales, Xanthomonadales and Enterobacteriales) for Gammaproteobacteria (Figure 2e) were mainly responsible for the community shifts of Alpha/Gammaproteobacteria under the SN fertilization condition.

The results of PCoA showed two tight clusters for Nod++ and Nod soybeans distinctly separated from Nod+ soybeans, suggesting that the community structures of stem-associated bacteria were mainly dependent on nodulation phenotype under SN fertilization (Figure 3). In addition, the overall structures of the microbial communities in Nod++ and Nod soybeans were more similar to each other than to that of Nod+ soybeans under the SN fertilization condition (Figure 3).

Figure 3
figure 3

Principal-coordinate analysis for the 16S ribosomal RNA (rRNA) gene clone libraries of soybean stem-associated bacteria in different nodulation phenotypes. The ordination was constructed using Unifrac distances weighted by the relative abundances. Each number represents the phylogenetic composition of soybean stem-associated bacteria in different soybean genotypes under standard and heavy nitrogen (HN) conditions. Numbers 1 to 5 and 6 to 10 were sampled under standard and HN conditions, respectively. Soybean genotypes: 1 and 6: wild-type (Enrei, Nod+); 2 and 7: non-nodulated (En 1282, Nod); 3 and 8: non-nodulated (En 1314, Nod); 4 and 9: hypernodulated (En 6500, Nod++); 5 and 10: hypernodulated (Sakukei 4, Nod++).

Clustering analysis of 16S rRNA sequences allowed the identification of the OTUs responsible for the population shifts of Alpha- and Gammaproteobacteria at the species level. Two OTUs (AP6 and AP30) noticeably increased the ratio of the Alphaproteobacterial population in mutants of both Nod and Nod++ soybeans compared with Nod+ soybeans under the SN fertilization condition (Figures 4 and 5). The representative clones for these two OTUs were identical to Aurantimonas ureilytica (DQ883810) and Methylobacterium chloromethanicum (CP001298), respectively.

Figure 4
figure 4

Heat map for the relative abundance of operational taxonomic units (OTUs) for the 16S ribosomal RNA (rRNA) gene clone libraries of soybean stem-associated bacteria under standard and heavy nitrogen (HN) fertilization conditions. Soybean genotypes are as follows: E: Enrei; 12: En 1282; 13: En 1314; 65: En 6500; and S4: Sakukei 4. Nod stands for the nodulation phenotype: wild-type (+), non-nodulated (−) and hypernodulated (++). The relative abundance of each OTU in each library is indicated by different colors: sky blue (<1%), orange (1–3%) and red (>3%), with a background of dark blue (0%). The relative abundance for the OTUs described in the main text is shown at the right side.

Figure 5
figure 5

Phylogenetic distribution of operational taxonomic units (OTUs) for Alphaproteobacteria of the clone libraries of 16S ribosomal RNA (rRNA) genes for soybean stem-associated bacteria. The dendrogram (left) indicates the phylogenetic relationships among the representative sequences of OTUs (defined by 97% identity). The table indicates the relative abundance of clones belonging to each OTU in each library and the results of a BLAST search using the representative sequences. The soybean genotypes are as follows: E: Enrei; 12: En 1282; 13: En 1314; 65: En 6500; and S4: Sakukei 4. Nod stands for the nodulation phenotype: wild-type (+), non-nodulated (−) and hypernodulated (++).

In contrast, the relative abundance of OTU GP39 in Enterobacteriales was lower in mutants of both Nod and Nod++ soybeans compared with that in Nod+ soybeans (Figures 4 and 6). The representative clone of OTU 39 was classified as Enterobacter sp. and was identical to an isolate (AY822505) that was derived from a plant-associated bacterium, and its closest relative in known species was Klebsiella oxytoca (EU982878) with 99% identity (Figure 6).

Figure 6
figure 6

Phylogenetic distribution of operational taxonomic units (OTUs) for Gammaproteobacteria of the clone libraries of 16S ribosomal RNA (rRNA) genes for soybean stem-associated bacteria. The dendrogram (left) indicates the phylogenetic relationships among the representative sequences of OTUs (defined by 97% identity). The table indicates the relative abundance of clones belonging to each OTU in each library and the results of a BLAST search using the representative sequences. The soybean genotypes are as follows: E: Enrei; 12: En 1282; 13: En 1314; 65: En 6500; and S4: Sakukei 4. Nod stands for the nodulation phenotype: wild-type (+), non-nodulated (−) and hypernodulated (++).

Comparisons of phylogenetic diversity of stem-associated bacteria between standard and heavy nitrogen fertilization conditions

The HN fertilization condition was designed to compare the community structures of shoot-associated bacteria among different nodulation phenotypes by eliminating the bias of nitrogen status of host plants and to suppress the nodulation. When HN fertilization was applied to the soybeans, the effect of nodulation phenotype on the bacterial communities was obscured (Figure 2b). However, the ratio of Alpha/Gammaproteobacteria was drastically decreased for all nodulation phenotypes under HN fertilization (ranging from 0.8 to 2.1) compared with those under SN fertilization (2.9–6.1; Figures 2a and b). Further analyses of phylogenetic compositions at lower taxonomic levels revealed that the same taxonomic groups that were responding to the community shifts between the Nod+ soybeans and the mutants (Nod and Nod++ soybeans) under SN fertilization were also responding to the effects of HN fertilization. Thus, three genera (Aurantimonas, Methylobacterium and Rhizobium) in the order of Rhizobiales for Alphaproteobacteria (Figure 2d) and three orders (Pseudomonadales, Xanthomonadales and Enterobacteriales) for Gammaproteobacteria (Figure 2f) were responding to the population shifts of Alpha- and Gammaproteobacteria in all libraries under the HN fertilization condition.

The results of PCoA clearly showed that HN fertilization caused drastic shifts of community structures of shoot-associated bacteria for all nodulation phenotypes from those found under SN fertilization (Figure 3). PC1 (49.4%) explained the effects of HN fertilization on the community structures. In addition, the community structures among the soybean genotypes under HN fertilization were far more dissimilar to each other than those under SN fertilization.

Clustering analysis of 16S rRNA sequences identified the same three OTUs (AP6, AP30 and GP39) that were responding to the population shifts of Alpha- and Gammaproteobacteria between the Nod+ soybeans and the mutants (Nod and Nod++ soybeans) under SN fertilization were also responding markedly to the effects of HN fertilization for all soybean genotypes (Figures 4, 5 and 6). In addition, OTUs GP7, GP37 and GP38 were identified as responsible for the increase of the relative abundance of Gammaproteobacteria in all soybean genotypes under HN fertilization (Figures 4 and 6). The representative clone of OTU GP7 was classified as Lysobacter sp. and its closest relative among known species was Lysobacter koreensis (AB166878) with 96% identity (Figure 6). The representative clone of OTU GP37 was identical to several Enterobacter spp., including Enterobacter aerogenes (FJ796202) and Enterobacter asburiae (AB004744). The representative clone of OTU GP38 was identical to several species of Enterobacteriaceae, including Pantoea agglomerans (FJ603033), Enterobacter ludwigii (FJ462703), Enterobacter cloacae (FJ194527), Enterobacter asburiae (EU301774) and Citrobacter freundii (DQ133536). OTUs GP37, GP38 and GP39 were shown to be closely related to each other, as indicated by clustering in a branch of the Enterobacteriales (Figures 4 and 6).

Discussion

The phylogenetic analyses based on 16S rRNA gene sequences revealed the striking effects of both nodulation phenotype and HN fertilization on bacterial communities in soybean stems (Figures 2 and 3). The population shifts of Alpha- and Gammaproteobacteria under SN fertilization indicated that the nodulation phenotypes affected the bacterial community structures not only in roots (Ikeda et al., 2008a), in which nodule formation occurs, but also systemically in shoots.

Surprisingly, further phylogenetic analyses at lower taxonomic levels and the results of PCoA revealed that soybeans of two very different nodulation phenotypes, Nod and Nod++, hosted highly similar microbial community structures under the SN fertilization condition, even at the species level (Figures 3 and 4). A possible explanation for this may be the absence of mycorrhization in Nod soybeans under the present experimental conditions (data not shown). The rotation of the experimental field between upland and paddy conditions could reduce the population of mycorrhizal fungi. In the absence of mycorrhization, Nod soybeans do not need to activate autoregulation systems, whereas the autoregulation systems of Nod++ soybeans were genetically impaired. Thus, both Nod and Nod++ soybeans may be considered as hosts with no activated autoregulation in this study. Therefore, a bacterial population related to rhizobia may be passively affected by a part of the autoregulation systems for regulating rhizobia. In fact, the three genera (Aurantimonas, Methylobacterium and Rhizobium) in Rhizobiales found in the present study are all closely related to Bradyrhizobium spp., endosymbionts of soybeans (Gupta and Mok, 2007). Another surprising finding was the high similarity observed for the population shifts of Alpha- and Gammaproteobacteria between two different environmental factors: the effects of genotypes under SN fertilization and the effects of the nitrogen fertilization under the HN fertilization condition. Thus, the same taxonomic groups that were found in the population shifts of Alpha- and Gammaproteobacteria among different nodulation phenotypes under the SN fertilization condition were also responsible for the drastic population shifts of Alpha- and Gammaproteobacteria between the SN and HN fertilization conditions, regardless of nodulation phenotype. These results suggest that exogenous or endogenous nitrogen levels could be an important signal for regulating a subpopulation of proteobacteria by activating a plant signaling pathway, such as autoregulation systems in legume–rhizobia symbiosis.

The clustering analysis of 16S rRNA sequences (Figure 4) identified OTUs responsible for the population shifts of Alpha- and Gammaproteobacteria at the species level. The type strain of A. ureilytica was originally reported as an isolate from air dust (Weon et al., 2007), but the closely related isolates (AB291857 and AB291900) have been identified on the surface and inside of rice leaves (Mano et al., 2007). The genus Aurantimonas has been recently established (Denner et al., 2003), and the biology of this genus in relation to the plant symbiosis is largely unknown, and needs to conduct further study. Methylobacterium chloromethanicum is currently considered as a synonym of Methylobacterium extorquens (Kato et al., 2005). Several Methylobacterium spp., including M. extorquens, are well-known plant-associated bacteria (Corpe, 1985) as both an epiphyte and an endophyte, and the scavenging monocarbon wastes such as methanol in plant tissues is speculated for the putative role of this bacterial group (Corpe and Basil, 1982). Therefore, the population shifts of Methylobacterium sp. could be considered as an indication for the change of physiological status in plants. In fact, it has been reported that both the nodulation and the nitrate sensing in plant activate the autoregulation system that causes the drastic shift of internal hormone balance in plants (Caba et al., 2000; Okamoto et al., 2009). The representative clones for these OTUs showed high similarity to known species. Therefore, it would be feasible to isolate bacteria corresponding to these OTUs and to examine the interactions of these bacteria with soybeans by an inoculation test to clarify the speculations described above. In particular, it would be interesting to analyze the interaction between the Aurantimonas sp. representing OTU AP6 and soybeans because of its high sensitivity to nodulation genotype and nitrogen fertilization level.

On the basis of a culture-dependent method, Gammaproteobacteria were reported as the most dominant group of stem-associated bacteria in soybeans (up to 94% and 100% for epiphyte and endophyte isolates, respectively), whereas the Alphaproteobacteria isolated from shoots were shown to be far less abundant (5.9% for epiphytes; Kuklinsky-Sobral et al., 2004). Okubo et al. (2009) also reported similar results of the strong dominance of Gammaproteobacteria were obtained when culturing endophytic bacteria in soybean stems. In contrast, based on the culture-independent method used in this study, Alphaproteobacteria were the most dominant group in soybean shoots under SN fertilization. These differences may simply reflect experimental biases between the culture-dependent and culture-independent methods used for analyzing bacterial communities. By considering the known diversity of plant-associated bacteria, however, the phylogenetic compositions strongly and unusually biased toward to the Gammaproteobacteria are not likely to simply reflect the nature of soybean-associated bacterial communities. As the relative abundance of Alphaproteobacteria was increased in nodulation mutants under the SN fertilization condition in the present study, a plant signaling pathway such as autoregulation systems in the legume–rhizobia symbiosis may be responsible for the dominance of Gammaproteobacteria in the culture of soybean-associated bacteria.

Exogenous nitrate suppresses the infection of rhizobia and inhibits nodule formation (Carroll et al., 1985; Barbulova et al., 2007; Okamoto et al., 2009). Recently, Okamoto et al. (2009) suggested that a nitrate-induced CLE gene in Lotus japonica is involved in autoregulation systems through the HAR1 gene, which is a homolog of GmNARK in soybeans.

In this study, the HN treatment was initially performed to eliminate the effects of nitrogen deficiency in Nod soybeans on the microbial community and to inhibit nodulation in wild-type soybeans (Nod+). The HN fertilization increased nitrate contents in the soil (7–10 mmol NO3 per kg dry soil), which were four- to sixfold higher than those in the SN field soil (1.2–2.6 mmol NO3 per kg dry soil) throughout the soybean cultivation period (Supplementary Table 1). Plant dry weights and nitrogen contents showed no significant difference between nodulated Enrei and the Nod mutant under the HN field conditions (Supplementary Figures 2a–d, Figure 2i). Under the HN fertilization conditions, nodule dry weight and nodule number of wild-type soybean Enrei were also markedly inhibited.

These results suggest that HN fertilization gave rise to nitrate-induced nodulation inhibition through the soybean autoregulation system. Thus, a GmNARK-mediated signaling pathway may regulate a subpopulation of Alphaproteobacteria, which may be closely related to rhizobia, beyond the regulation of nodule formation by Bradyrhizobium japonicum in soybeans. As a consequence, this may have resulted in the low ratio of Alpha/Gammaproteobacteria in wild-type soybeans.

It is also interesting that population shifts were observed for Nod++ soybeans under both SN and the HN fertilization conditions (Figures 2 and 4). As Nod++ soybeans do not harbor the functional GmNARK gene (Nishimura et al., 2002), the GmNARK-mediated nitrate sensing systems could not regulate nodule formation (Okamoto et al., 2009). Nevertheless, the effects of HN fertilization on Nod++ soybeans occurred systemically, and the mode of the community shifts was very similar to those for other nodulation phenotypes (Nod+ and Nod soybeans). Therefore, the GmNARK gene is not essential for the population shifts of Alpha- and Gammaproteobacteria responding to HN fertilization. These results also may imply that presence of GmNARK-independent nitrogen-sensing system(s) affects the shoot-associated bacterial community in soybeans. However, there is a possibility that the Gammaproteobacteria population may be more adaptive than Alphaproteobacteria in soybeans under the influence of autoregulation and the effects of HN fertilization.

In conclusion, this study showed that the community structure of stem-associated bacteria was affected by nodulation phenotype and nitrogen fertilization level. Both nodulation phenotypes and nitrogen levels caused a similar directional change in the stem-associated bacterial communities, with a population shift of Alpha- and Gammaproteobacteria. Further analyses identified several OTUs in Alpha- and Gammaproteobacteria responding to the population shifts. The relative abundances of Aurantimonas and Methylobacterium sp. were especially sensitive to the nodulation phenotype and nitrogen level, being decreased in the wild-type nodulation phenotype and under HN fertilization. These results imply that the nitrogen signaling pathway in plants may be related to the symbiotic regulation systems between legumes and rhizobia and could have a significant influence on the diversity of plant-associated microbes.

In this study, the clone library analyses were conducted on the whole community of stem-associated bacteria including both the epiphytic and endophytic bacteria rather than on the each of epiphytic and endophytic bacterial communities. Therefore, the important issues for the tissue localization of bacteria responding to nodulation phenotypes and soil N levels remain unanswered, and should be addressed in the following study. Quantitative analyses of the population shifts of Alpha- and Gammaproteobacteria and the isolation of bacteria responsible for this population shift should also be conducted, which would facilitate our understanding of how plants regulate and shape the structures of microbial communities.