Introduction

Soybean (Glycine max (L.) Merr.) is one of the major global crops that also has a unique role in sustainable agriculture, due to its ability to fix atmospheric nitrogen through symbiotic interactions with rhizobia in the soil (Keyser and Li, 1992). There are two major groups of rhizobia that inhabit the root nodules of soybean: the slow-growing Bradyrhizobium and the fast-growing Sinorhizobium (Keyser and Li, 1992). In USA, Brazil and Argentina, slow-growing rhizobia, particularly the species Bradyrhizobium japonicum and Bradyrhizobium elkanii, are used as inoculants on soybean fields (Hungria et al., 1998). On the other hand, the soils in China are dominated by fast-growing rhizobia such as Sinorhizobium (Ensifer) fredii (Tian et al., 2012).

After rhizobial cells have encountered their compatible soybean partners, a series of developmental events in the infected roots occur, which lead to the formation of nodules, a special organ where biological nitrogen fixation (BNF) takes place (Stougaard, 2000). The primary metabolic product of BNF is ammonia. In soybean, ammonia is subsequently converted into ureides for long-range transport. Ureides will then provide organic nitrogen resources for the building of nitrogen-containing biomolecules (Shelp and Ireland, 1985).

During the domestication of soybean, the size and weight of seeds have been enhanced by human selection (Zhou et al., 2015). There may have been multiple ways to achieve these results. To support the increase in dry matter in the seed, which is a major sink of metabolites, corresponding increases in the carbon and nitrogen supplies are essential. Although an increased photosynthetic capacity could have provided more carbon (Richards, 2000), we hypothesize that the capacity of BNF was also enhanced during the domestication of soybean.

Several factors contribute to the capacity of BNF, including the development of nodules and the accumulation of ureides. Previous studies identified a non-nodulating and a super-nodulating mutant (Carroll et al., 1985, 1986). The soybean mutant that cannot form nodules does not carry out BNF (Mathews et al., 1992). On the other hand, super-nodulating mutants do not exhibit enhanced BNF and sometimes even exhibit a suppressed growth (Takahashi et al., 1995). One possible reason is that the efficiencies of carbon and nitrogen metabolisms may not be able to catch up with the excessive nodule numbers. Genetic studies have been carried out to identify the quantitative trait loci (QTLs) associated with BNF in soybean. For example, there are QTLs identified on chromosomes 6, 9, 13 and 19 that are associated with the ureides content (the QTL on chromosome 19 accounting for up to 31% of the variance) (Hwang et al., 2013). One QTL for the specific nodule weight (total nodule dry weight/number of nodules) was identified on chromosome 20 (Santos et al., 2013) and one for total nodule weight was identified on chromosome 18 (Hwang et al., 2014).

Most of the previous genetic studies on soybean BNF have employed only cultivated soybeans as the sources of genetic materials (Hwang et al., 2013; Santos et al., 2013). Therefore, the results may not reflect the process of domestication. An investigation using wild germplasms which exhibit a much higher genetic diversity may fill this gap. The idea of using wild germplasms to study BNF has been under-explored despite some earlier discussions (Zahran, 2001; Provorov and Tikhonovich, 2003).

Advances in genetics and genomics of crop plants and their wild relatives have expanded our understanding of complex traits, leading to the discovery of new genes. Hence, we and other groups have characterized and compared the genomic variations of wild and cultivated soybeans by next-generation sequencing, contributing to our knowledge of the diversity and population structure of soybeans (Lam et al., 2010; Li et al., 2014; Zhou et al., 2015). To identify QTLs controlling important traits of soybean, we also constructed a recombinant inbred (RI) population from one wild and one cultivated soybean accession, and used this RI population to clone a salt tolerance causal gene from a salt tolerance major QTL (Qi et al., 2014). In this work, we made further use of this RI population to identify novel QTLs for total nodule fresh weight and ureides contents. The possible involvement of these QTLs in the domestication process is discussed.

Materials and Methods

Plant material and bacterial strains

Thirty-one previously re-sequenced soybean germplasms (17 wild and 14 cultivated without pedigree relationship) (Lam et al., 2010), and seventeen additional cultivated soybeans from various origins were used in this study (Supplementary Table S1). The non-nodulating soybean nod139 (Carroll et al., 1986) was used as the negative control for BNF. Ninety-six RI lines from a bi-parental population were adopted for QTL analyses (Qi et al., 2014). Seeds were disinfected with chlorine gas for 16 h and then germinated in plastic trays with moistened sterile vermiculite for 3 days until the radicles reached a length of 3–5 cm. These pre-germinated seeds were transferred to 210 ml plastic cups (one plant per cup) containing moistened sterile vermiculite (with 50 ml sterile water). An additional 50 ml of B and D nutrient solution (Broughton and Dilworth, 1971) without any nitrogen source was added to each cup. Immediately after transferring each plant to a cup, these pre-germinated seeds were inoculated with 1 ml of bacterial culture per plant. Two soybean nodulating rhizobia Bradyrhizobium diazoefficiens sp. nov. USDA110 (originally classified as Bradyrhizobium japonicum group Ia) (Delamuta et al., 2013) and S. (E.) fredii CCBAU45436 (Tian et al., 2012) were cultured in yeast extract mannitol medium at 28 °C with constant agitation (180 r.p.m.) for 5 days and 24 h, respectively, until they reached a density of 3 × 109 cells ml−1.

Morphological and biological nitrogen fixation traits

Soybean plants were grown under controlled environmental conditions: 25 °C and a 16:8 light:dark photoperiod. Plants at V3 stage (4 weeks post inoculation) with second trifoliate leaves fully expanded were removed from the cups. The complete root system was washed with deionized water (Elix Water Purification System, Millipore, Darmstadt, Germany). Nodules were removed from each plant for nodule number and fresh weight quantification. Roots and shoots were separated and the fresh weight of each part was recorded for each plant. Each whole plant was dried inside a paper envelope in the oven at 80 °C for 24 h, and the plant’s total dry weight was recorded. Eight to nine plants were evaluated on two independent replicates, which involved a total of approximately 3000 plants. The replicates have been performed under completely randomized design.

The dried plant material was ground to a fine powder using a sample mill (Cyclotec 1093, Foss, Hillerød, Denmark). For ureides extraction, 50 mg of fine powder per plant was boiled for 20 min in 25 mm potassium phosphate buffer (pH 6). Samples were centrifuged at 10 000 g for 15 min, and the clear supernatant containing ureides was used for the colorimetric determination following a published method (Vogels and Van Der Drift, 1970). In brief, three 100-μl aliquots of each sample were collected in three separate tubes for the measurement of endogenous glyoxylate, allantoic acid-derived glyoxylate and allantoin-derived glyoxylate. The content of allantoic acid and allantoin was converted into glyoxylate by acid and alkaline/acid hydrolysis, respectively (Vogels and Van Der Drift, 1970). Finally, the total glyoxylate content was measured by a colorimetric reaction where the glyoxylate is converted into glycolic acid-phenylhydrazone and then oxidized by ferricyanide in the presence of concentrated acid and phenylhydrazine to give a pink color. Absorbance at 520 nm by samples was determined using Synergy H1 Multi-Mode Reader (BioTek Instruments, Inc., Winooski, VT, USA) on 96-well plates, based on a standard curve with known concentrations of glyoxylate. By subtracting the amount of endogenous glyoxylate from the total amount of glyoxylate in each sample subjected to alkaline/acid or acid hydrolysis, the concentration of allantoin or allantoic acid could then be calculated. To check the accuracy of the ureides determination method, non-nodulating soybean plants nod139 were grown alongside the experimental plants as a negative control.

For total nitrogen content, 5–10 mg of dried and milled samples was weighed in tin capsules using an ultra-precision PerkinElmer AD-4 autobalance (PerkinElmer, Waltham, MA, USA). Nitrogen content was then determined by mass spectrometry using the stable isotope facility service of the University of California, Davis (USA).

Nodule ultrastructure

The nodule ultrastructures of wild and cultivated soybeans were examined by high-pressure freezing (Studer et al., 1992) of nodules at 28 days post inoculation using the High Pressure Freezer System Leica EM PACT2 (Leica Microsystems, Buffalo Grove, IL, USA) followed by observation using a transmission electron microscope Hitachi H-7650 (Hitachi High Technologies America, Inc., Schaumburg, IL, USA).

RI population and QTL analyses

The RI population obtained by crossing the cultivated Glycine max (C08) with the wild-type Glycine soja (W05) was adopted from a previous study (Qi et al., 2014). In our previous research (Qi et al., 2014), a core panel of 96 RI lines was re-sequenced at ~1 × depth to generate high-quality single-nucleotide polymorphism (SNP) information. An array of high-quality SNPs was grouped into bin marker. There are 2757 bin markers distributed on the 20 chromosomes. A bin map of 2992.0 cM in the Kosambi mapping function was generated using the bin markers. QTL analyses were performed using QTLCartographer (http://statgen.ncsu.edu/qtlcart/). In brief, QTLs were identified using interval mapping and composite interval mapping with a 10-cM scanning window and a 0.5-cM walking step. Log-of-odds (LOD) cutoff was determined with 1000 times permutation with P<0.05. The QTL boundary was identified with a 1.5 drop of LOD score peak. Two replicates were performed with nine plants per line per replicate.

Genomic analyses of the QTL regions

In our previous study (Zhao et al., 2015), sequencing reads data from 47 cultivated soybean accessions and 25 wild soybean accessions were retrieved from 4 studies (Kim et al., 2010; Lam et al., 2010; Li et al., 2013; Chung et al., 2014), of which 17 wild and 14 cultivated soybean accessions from Lam et al. (2010) were also used in this study. After mapping to the Williams 82 genome version Gmax_189, SNPs were called for each soybean accession. Pairwise divergence (θπ) within the wild soybean population or the cultivated soybean population and the divergence index (Fst) between these two populations were calculated based on this SNP information by using a Perl script with a sliding window size of 20Kb and a step window size of 2Kb. If all the soybean accessions in the wild population or the cultivated population have the same genotype at an SNP locus, then this SNP was designated as a fixed SNP in the corresponding population. In this study, fixed SNPs retrieved from Zhao et al. (2015) were further filtered to remove low-confidence data. To be specific, fixed SNP loci with a missing genotype in more than 7 soybean accessions or with a minor allele frequency lower than 3 were removed.

Statistical analyses

Statistical analyses were performed using IBM SPSS Statistics for Windows, Version 22.0 (IBM Corp, NY, USA). All the parameters analyzed on this work have passed the Shapiro–Wilk test for normal data distribution except in the case of ureides content. Given the non-normal distribution, non-parametric tests were used for the statistical analyses involving ureides contents.

Results

A strong correlation between nitrogen accumulation and nodule development in soybean

A pilot experiment using a non-nodulating soybean mutant was performed (Carroll et al., 1986). The non-nodulating plants contained only a trace amount of ureides which were the by-products of purine degradation, and these plants clearly exhibited symptoms of nitrogen deficiency (Supplementary Figure S1). Therefore, the much higher levels of ureides detected in the nodulating plants can be regarded as the specific product of nitrogen fixation under our experimental set-up.

Several physiological parameters related to nitrogen fixation were measured on a per-plant basis for the 17 wild and 31 cultivated soybean accessions: total plant fresh weight, total root fresh weight, total plant dry weight, total nodule fresh weight, nodule number, total nitrogen and total ureides (Supplementary Table S2). Our results indicated that in general, there are significant correlations between the development of nodules (that is, total nodule fresh weight and nodule number) and general plant growth (that is, total plant fresh and dry weights and total root fresh weight), and the accumulation of ureides and total nitrogen (Table 1).

Table 1 Correlations between indicators of symbiotic performance on wild and cultivated soybean populations upon inoculations with S. (E.) fredii CCBAU45436 or B. diazoefficiens sp. nov. USDA110

As expected, total nitrogen per plant was highly correlated with total fresh weight, total dry weight and root fresh weight since nitrogen supports plant growth and development. This was independent of the soybean accessions or types of rhizobia used (Table 1). However, the correlation between total ureides vs total fresh weight, total dry weight and root fresh weight was not statistically significant under some conditions, for example, when wild soybeans were inoculated with CCBAU45436 or when cultivated soybeans were inoculated with USDA110 (Table 1).

Nodule number and total nodule fresh weight are two measurable parameters reflecting the extent of nodule development. Total nodule fresh weight was closely correlated with both total nitrogen and total ureides, independent of soybean accessions and types of rhizobia used (Table 1). On the other hand, nodule number also correlated with both total nitrogen and total ureides, except when wild soybeans were inoculated with CCBAU45436 (Table 1).

Cultivated soybeans exhibited higher nitrogen fixation capacity than wild soybeans

In general, when inoculated with the same rhizobium, cultivated soybeans as a whole out-performed wild soybeans in most parameters tested, including total fresh weight, total root fresh weight, total dry weight, total nodule fresh weight, nodule number, total nitrogen and total ureides accumulation (Figure 1). This observation indicates that domestication and human selections might have led to improved overall growth in soybean that could be a result of enhanced nitrogen fixation.

Figure 1
figure 1

Overall symbiotic performance of cultivated and wild soybeans. Thirty-one cultivated and seventeen wild soybean accessions were inoculated with either B. diazoefficiens sp. nov. USDA110 or S. (E.) fredii CCBAU45436. Average data per plant for cultivated and wild soybean populations were compared: (a) total fresh weight, (b) total root fresh weight, (c) total dry weight, (d) total nodule fresh weight, (e) nodule number per plant, (f) total nitrogen and (g) total ureides. Solid bars: cultivated soybean; open bars: wild soybean; green bars: plants inoculated with USDA110; blue bars: plants inoculated with CCBAU45436. For total fresh weight, total root fresh weight, total dry weight, total nodule fresh weight, nodule number and total nitrogen per plant, different letters indicate a significant difference between groups by one-way ANOVA followed by post hoc Tukey test (P<0.05). For total ureides, different letters indicate a significant difference between groups using Mann–Whitney non-parametric test between pairs of independent samples (P<0.05 with Bonferroni correction). Each bar represents the average value of 31 cultivated soybean or 17 wild soybean accessions. Error bar: ±s.e.m.

In the Americas, it is recommended to use slow-growing rhizobia as the inoculants since they are believed to be more efficient in nitrogen fixation than fast-growing rhizobia. Considering the 31 cultivated soybean accessions that we have studied, we do not observe a significant difference in total fresh weight, root fresh weight or total dry weight when inoculated with either USDA110 or CCBAU45436 (Figures 1a–c). However, on average, cultivated soybeans inoculated with USDA110 exhibited higher total nodule fresh weight and nodule number, and accumulated more total nitrogen and ureides, when compared with those inoculated with CCBAU45436 (Figures 1d–g).

Despite a seemingly better overall nitrogen fixation performance when USDA110 was used (Figure 1), 17 of the 24 cultivated soybean accessions originating from China exhibited equal performance in terms of total nitrogen per plant when inoculated with either USDA110 or CCBAU45436 (Supplementary Table S2). This suggests that the fast-growing rhizobia can also be used as inoculants when suitable soybean partners are identified.

For wild soybeans, on average, inoculation with USDA110 gave higher nodule number and total ureides, but no statistically significant differences were found for total fresh weight, total dry weight, total root fresh weight, total nodule fresh weight and total nitrogen (Figure 1). When the measurements were compared at the level of individual lines, the results were even more diverse and the advantage of using USDA110 was not clear (Supplementary Table S2; Figure 2). Therefore, the wild soybeans are of very different genetic backgrounds that may respond differently toward USDA110 and CCBAU45436. During domestication and subsequent human selection processes, some cultivated lines giving better performance toward slow-growing rhizobia may have been selected, especially in the Americas.

Figure 2
figure 2

Performance of individual soybean lines on nitrogen fixation-related traits. Thirty-one cultivated and seventeen wild soybean accessions were inoculated with either B. diazoefficiens sp. nov. USDA110 or S. (E.) fredii CCBAU45436. Average data per plant for individual lines were plotted in the following order: (a) total fresh weight, (b) total root fresh weight, (c) total dry weight, (d) total nodule fresh weight, (e) nodule number, (f) total nitrogen and (g) total ureides. Each bar represents the average value of 4–16 plants. Error bar: ±s.e.m.

In symbiosis, while the bacteroids contribute to the nitrogenase activity, the host supplies the carbon to fuel nitrogen fixation. The efficiency of nitrogen accumulation is dependent on the optimized balance between nitrogen and carbon metabolisms.

To further explain the differences between wild and cultivated soybeans in the capacity of nitrogen fixation, we examined the ultrastructure of nodules. Since data are already available for cultivated soybeans in the literature (Cohen et al., 1986; Studer et al., 1992), we only confirmed the previous findings with one cultivated accession C08 (Figures 3a–d). In the same experiment, we examined the ultrastructure of nodules in four different wild soybean accessions inoculated with USDA110 or CCBAU45436 (Figures 3e–h and Supplementary Figure S2).

Figure 3
figure 3

Electron microscopic images of the nodules of cultivated and wild soybeans inoculated with either B. diazoefficiens sp. nov. USDA110 or S. (E.) fredii CCBAU45436. (a, c) Cultivated soybean C08 inoculated with USDA110. (b, d) C08 inoculated with CCBAU45436. (e, g) Wild soybean W05 inoculated with USDA110. (f, h) W05 inoculated with CCBAU45436. Samples were collected and fixed 28 days post inoculation. B, Bacteroid; S, Symbiosome.

The ultrastructure of nodules from C08 was similar to those reported previously for both types of rhizobia (Figures 3a–d). Interestingly, there was an exceptionally higher accumulation of poly-β-hydroxybutyrate bodies (PHBs) in bacteroids (Figures 3e–h; Supplementary Figure S2) in the nodules of all wild soybean accessions, compared with cultivated ones.

Mapping QTLs of biological nitrogen fixation using an RI population

Our physiological analyses above suggested that cultivated soybeans may have accumulated important genetic determinants that enable the plants to perform more efficient nitrogen fixation. To explore the possible genetic determinants involved, we performed QTL analyses on an RI population originated from a cross between a wild (W05) and a cultivated (C08) soybean accession. In this study, we showed that, compared with W05, C08 exhibited a higher total fresh weight, total root fresh weight, total dry weight, total nodule fresh weight, total nitrogen and total ureides under the same experimental settings described above (Supplementary Table S2). As inoculation with USDA110 exhibited more consistent differences in these parameters, we used this rhizobium strain for our subsequent population studies.

We quantified the phenotypes of the 96 core RI lines (containing two parental lines), including total plant fresh weight, total root fresh weight, total nodule fresh weight and total ureides. The two parents of this population showed significant differences in these parameters (Supplementary Figure S3), consistent with the results in germplasm studies. The dependent variables exhibited normal distributions except for total ureides (Supplementary Figure S4), which showed an exponential distribution. To normalize the data distribution for total ureides, the data were transformed by taking the natural logarithm (Ln) of the original data. QTL analyses using both the transformed and untransformed data gave similar results (see below).

Using composite interval mapping, we identified several QTLs (Figure 4; Table 2). Figure 4 shows the peaks of the LOD score curve for each QTL. Table 2 shows the LOD cutoff, physical length and genetic distances for each characterized QTL. Only QTLs that could be identified in two independent experiments were further analyzed. The span of each QTL was reported by combining the genomic regions of the two replicates.

Figure 4
figure 4

LOD score distributions of nitrogen fixation-related traits across 20 soybean chromosomes. Peak LOD scores higher than the LOD cutoff values calculated by permutation test (1000 times, P<0.05) were indicated. '-' indicates the absence of a statistically significant LOD score in one replicate. R1 (red line): biological replicate 1; R2 (blue line): biological replicate 2. Gm1: soybean chromosome 1; Gm2: soybean chromosome 2, and so on.

Table 2 QTLs identified for biological nitrogen fixation-related traits

In the QTL study, we identified one high-confidence QTL for root fresh weight on chromosome 8 (explaining 10.4% of the total variance) and two high-confidence QTLs for total plant fresh weight on chromosome 7 (explaining 12.8% of the total variance) and chromosome 18 (explaining 13.1% of the total variance), respectively.

The focus of this research is to identify new loci that are related to nitrogen fixation. Here, we found one high-confidence QTL for total ureides and two high-confidence QTLs for total nodule fresh weight.

The novel QTL for total ureides identified in this study is located on chromosome 17 (explaining 11.6% of the total variance), which is different from the previously identified QTLs that are located on chromosomes 6, 9, 13 and 19 (Hwang et al., 2013).

We also identified one novel QTL for total nodule fresh weight on chromosome 12, and a second QTL on chromosome 18 which is located in close proximity to a total nodule fresh weight QTL reported previously (Hwang et al., 2014), explaining 7.92 and 19.83% of the total variance, respectively.

The total nodule fresh weight QTL on chromosome 18 overlaps with a major QTL for total plant fresh weight. High total nodule fresh weight may indicate higher nitrogen fixation capacity that results in an increase in total plant fresh weight. Conversely, plants with higher total plant fresh weight could support better nodule development.

To provide more insights into the QTLs identified in this work, we investigated the genomic regions reported for these QTLs (Figure 5). We compared the nucleotide diversity (θπ), the divergence index (Fst) and the distribution of fixed SNP between cultivated (47 accessions) and wild (25 accessions) soybeans. Such comparisons could point to the possible involvement of specific genomic regions within these QTLs during domestication.

Figure 5
figure 5

Plots of divergence index (Fst), nucleotide diversity (θπ) and fixed SNPs within the QTL regions. (a) QTL region of root fresh weight on chromosome 8. (b) QTL region of total plant fresh weight and total nodule fresh weight on chromosome 18. Gray boxes highlighted regions with high nucleotide diversity and large number of fixed nucleotides. (c) QTL region of total nodule fresh weight on chromosome 12. (d) QTL region of ureides content on chromosome 17. Gray box highlighted region with above-average Fst values, high nucleotide diversity and large number of fixed nucleotides. Black dots: Fst values between wild and cultivated soybeans. Purple line: average Fst value of the chromosome. Red dots and blue dots indicate the θπ of cultivated soybeans and wild soybeans, respectively. Red lines and blue lines indicate the number of fixed SNPs of cultivated soybeans and wild soybeans, respectively. Black lines: average number of fixed SNPs in a chromosome of cultivated soybeans.

The root fresh weight QTL on chromosome 8 identified in this study overlaps with previously reported QTLs for domestication (Brensha et al., 2012) and root architecture (Manavalan et al., 2015). Consistent with the involvement of this QTL in soybean domestication, we found that the θπ value of the wild soybean population is much higher than that of the cultivated soybean population, over an extended genomic region in this QTL (Figure 5a). Consistent with this observation, the Fst values (black dots) in the QTL region are generally higher than the average Fst value of the entire chromosome 8 (purple line) (Figure 5a).

On the other hand, when we examined the QTL for total nodule fresh weight on chromosome 18 (which overlaps with a QTL for total plant fresh weight), the θπ values of the cultivated and wild soybean populations were very much alike and the Fst values in this genomic region were very low (Figure 5b), indicating that this QTL may not have a strong relationship with domestication.

Another QTL for total nodule fresh weight is located on chromosome 12. Some regions within this QTL exhibited higher θπ values in the wild soybean population than in the cultivated soybean population (Figure 5c). Interestingly, these regions also have higher frequencies of fixed SNPs (Figure 5c).

Similarly, when examining the genomic region of the QTL for total ureides located on chromosome 17, the region between 14.75 and 16.00 Mb exhibited higher θπ values in the wild soybean population than in the cultivated population, which were accompanied with higher Fst values (Figure 5d). Higher frequencies of fixed SNPs were also observed in these regions in cultivated soybeans (Figure 5d).

Together, these results suggest that improvements in total nodule fresh weight and ureides accumulation are associated with the domestication process.

Discussion

In this study, we used a combination of soybean germplasms (both cultivated and wild) and a bi-parental RI population to test the hypothesis that improvement in nitrogen fixation capacity was the result of domestication and human selection. It should be noted that parameters such as initial seed weight, plant growth rate and photosynthetic efficiency of each accession could affect the physiological measurements. Hence, we have used a large number of accessions to reduce the statistical errors due to these differences.

The first piece of supporting evidence comes from a comprehensive physiological analysis of 17 wild and 31 cultivated soybean accessions that were inoculated with two different types of rhizobia. Irrespective of the rhizobial strains used, we demonstrated a general trend that the nitrogen fixation capacity of cultivated soybeans out-performed wild soybeans (Figures 1 and 2).

In connection to this, we also observed a prominent accumulation of PHBs in wild soybeans (Figure 3), suggesting that there may be an imbalance in carbon and nitrogen metabolisms in wild soybeans that could affect the capacity of nitrogen fixation. PHB accumulation has been associated with a release of TCA on bacteroids when high amounts of dicarboxylates are fuelling the nitrogen fixation (Trainer and Charles, 2006). Likewise, it has been suggested that this accumulation could confer adaptive advantages to rhizobia over other soil bacteria (Trainer and Charles, 2006). Although the role of PHB accumulation during symbiosis is not fully understood, our results show that the accumulation by the same rhizobium type could be highly conditioned by the host background. The strong accumulation of PHBs in wild soybean suggests that the process of nitrogen fixation in wild soybeans is not limited by the carbon supply. On the other hand, the C/N balance in wild soybeans might not be optimized for nitrogen fixation.

The performances of a slow-growing Bradyrhizobium USDA110 and a fast-growing Sinorhizobium CCBAU45436 were compared. We found that when wild soybeans were inoculated, the response to USDA110 and CCBAU45436 was very diverse and without a general pattern (Figure 2; Table 1). For cultivated soybeans, while it is expected that the performance of USDA110 would be better than CCBAU45436 when soybean cultivars from America were used (DuTeau et al., 1986; Cregan and Keyser, 1988), the advantage of USDA110 is less prominent when Chinese soybean cultivars were employed (Supplementary Table S2). This result suggests that the advantage of using the slow-growing Bradyrhizobium may be due to the initial germplasm introduction and the subsequent breeding of soybean cultivars in the Americas. On the other hand, as a large proportion of fields under soybean cultivation in China is of high pH, a condition to which Sinorhizobium is better adapted, selections might have taken place so that some Chinese cultivars could use Sinorhizobium efficiently.

The advantages of the slow-glowing Bradyrhizobium over the fast-growing Sinorhizobium were demonstrated in both cultivated and wild soybeans only in reference to total ureides and nodule number (Figure 2). However, higher ureides accumulation due to the inoculation of USDA110 could not be translated into higher total nitrogen or total fresh weight (Figure 2). One possible explanation is that the re-assimilation of ammonia stored in the form of ureides could be a key limiting step as it is also very energy consuming (Todd et al., 2006). Another parameter in which USDA110 showed consistently better performance than CCBAU45436 was nodule number. However, the study of hyper-nodulating mutant indicated that an increase in nodule number may not always translate into higher grain yield (Song et al., 1995).

To search for genomic clues to support the association between nitrogen fixation capacity and domestication, we employed an RI population to identify new QTLs for total ureides and total nodule fresh weight (Figure 4; Table 2). Only USDA110 was used as the inoculant. Since the genetic population was composed of descendants from a cross between a wild and a cultivated soybean (Qi et al., 2014), the new QTLs may contain genomic information related to domestication. Ureides are metabolic intermediates of nitrogen fixation in soybean and are commonly used as parameters to measure nitrogen fixation (Herridge and Peoples, 1990). The importance of total nodule fresh weight as an indicator of symbiotic performance has been discussed in previous reports (Döbereiner, 1966; Hungria and Bohrer, 2000).

The QTLs mapped in this study account for ~10–25% of the genetic variance of the traits in question. It is not uncommon that some QTLs can only explain a certain level (<20%) of genetic variations (Kearsey and Farquhar, 1998). For traits that are controlled by multiple loci, each locus may only account for a small variance.

As an attempt to assess domestication in the genome, we employed several parameters, including θπ, Fst and fixed SNPs. These indicators were used previously to trace the genomic footprints of domestication and human selection (Zhao et al., 2015). Within the total nodule fresh weight QTL on chromosome 12 and the total ureides QTL on chromosome 17, we identified regions exhibiting high values of θπ and Fst, and high frequencies of fixed SNPs (Figure 5). These pieces of evidence, although not in themselves a strong proof, do suggest that these genomic regions might have undergone substantial selection during domestication. However, low Fst values and high θπ are observed in both wild and cultivated soybeans within the total plant weight and total nodule fresh weight QTLs on chromosome 18 (Figure 5b), suggesting that this region may not have been affected by domestication. On the other hand, this QTL region could be important for subsequent soybean improvements through the breeding of cultivars. It is possible that the differences only occur among some soybean accessions, instead of representing a general difference between wild and cultivated soybeans. In this scenario, there may not be a strong indication based on Fst or θπ values. This implied that unlike region on chromosome 12, QTL region on chromosome 18 may have escaped from human selection.

Identification of the causal genes in the novel QTLs related to nitrogen fixation capacity identified in this work (one QTL for total ureides and two QTLs for total nodule fresh weight) will require further functional studies. Nonetheless, when taking a quick look at these QTLs, some genes of potential interest could be identified. For example, in the total ureides QTL on chromosome 17, a gene (Glyma17g15190) encoding 5'-phosphoribosylglycinamide formyltransferase, a key enzyme of the de novo purine biosynthesis pathway, was found. The purine biosynthetic pathway provides the precursors for the biosynthesis of ureides (Schubert, 1986). Moreover, a gene (Glyma17g19660) encoding a nicotinate phosphoribosyltransferase was also found. This enzyme is required for pyridine nucleotide recycling (Hashida et al., 2009), an essential process to maintain the high energy demand in nodules (Schubert, 1986).

In the total nodule fresh weight QTL on chromosome 12, a gene encoding succinate dehydrogenase 3-2 (Glyma12g10590) was identified. Succinate dehydrogenase catalyzes the oxidation of succinate to fumarate, which is later converted into malate by fumarase and serves as the carbon source for the symbiotic bacteroid.

In the total nodule fresh weight QTL on chromosome 18, a sucrose phosphate synthase gene (SPS: Glyma18g12890) was also found. SPS catalyzes the formation of sucrose-6-phosphate from UDP-glucose and fructose-6-phosphate, and this enzyme has a putative role in nodule carbon partitioning to different metabolic pathways (Aleman et al., 2010).

In summary, by making use of wild germplasms and a unique RI population, we provided physiological and genomic data to support the notion that an improvement in nitrogen fixation capacity formed part of the domestication process of soybean. In addition, we identified new QTLs for total ureides and total nodule fresh weight, and genes related to nitrogen fixation could be identified therein.

DATA ARCHIVING

Data archiving Genotypes and phenotype measurements for 96 RILs available from the Dryad Digital Repository: http://dx.doi.org/10.5061/dryad.vp6fs.