Abstract
Seedling vigour (SV) is important for direct seeding rice (Oryza sativa L.), especially in a paddy-direct seeding system, but the genetic mechanisms behind the related traits remain largely unknown. Here, we used 744 germplasms, having at least two subsets, for the detection of quantitative trait loci (QTLs) affecting the SV-related traits tiller number, plant height, and aboveground dry weight at three sampling stages, 27, 34, and 41 d after sowing. A joint map based on GAPIT and mrMLM produced a satisfying balance between type I and II errors. In total, 42 QTL regions, containing 18 (42.9%) previously reported overlapping QTL regions and 24 new ones, responsible for SV were detected throughout the genome. Four QTL regions, qSV1a, qSV3e, qSV4c, and qSV7c, were delimited and harboured quantitative trait nucleotides that are responsible for SV-related traits. Favourable haplotype mining for the candidate genes within these four regions, as well as the early SV gene OsGA20ox1, was performed, and the favourable haplotypes were presented with donors from the 3,000 Rice Genome Project. This work provides new information and materials for the future molecular breeding of direct seeding rice, especially in paddy-direct seeding cultivation systems.
Similar content being viewed by others
Introduction
With economic and social development, cultivating systems for rice (Oryza sativa L.) production have changed. At present, direct seeding rice (DSR) has been widely adopted by farmers and is just as important as the traditional transplanting rice (TTR) system, which is now performed with the assistance of mechanical tools in China. In other parts of Asia, the adoption of DSR in place of TTR has also increased because of its labour and energy savings1. DSR can significantly reduce water consumption and labour requirements, while increasing system productivity and resource use efficiency2. Moreover, it can reduce greenhouse gas emissions during rice cultivation3.
In practice, there are at least two types of cultivating systems for DSR, paddy direct seeding (PDS) and upland direct seeding (UDS), also known as aerobic direct seeding. In UDS, the lack of a ‘head start’ and the absence of a standing water layer to suppress weeds make it highly vulnerable to weeds, causing severe yield losses compared with other rice ecosystems4. Thus, seedling vigour (SV), especially early SV (ESV), which refers to the SV for seedlings less than 28 d old, is imperative for crop stand establishment and weed competitiveness4,5. In PDS, however, the control of weeds is much easier owing to the standing water layer. Thus, the late SV (LSV), 28 d after sowing (DAS) is more important for the population type and final grain yield, especially in PDS. This is similar to the rice in the TTR cultivating system. Nevertheless, the growth of rice is different for seedlings under TTR and PDS conditions owing to the transplanting recovery procedure.
Substantial genetic variation in SV exists within the rice gene pool6,7,8. A lot of research has focused on SV, especially its molecular mapping. Many QTLs for SV in rice have been identified in bi-parental-derived populations by QTL analyses9,10,11,12,13,14,15. Dynamic and conditional QTL mapping have been used for the detection of SV-related QTLs16,17. However, most of the QTLs detected in these studies were located in relatively large regions and favourable allele/haplotype mining was not performed. The mining of the germplasm diversity for important SV-related QTLs is still limited.
Recently, natural populations, with advantages over the bi-parental populations, have been used for identifying QTLs of complex traits18,19. QTLs and favourable haplotypes affecting mesocotyl length were identified in a set of 621 rice accessions by GWAS, and the combinations of superior haplotypes of OsML1 and OsML2 increase mesocotyl length by up to 4 cm20. Using general and mixed linear model approaches, 16 and 10 SSRs, respectively, were identified as significantly associated with ESV traits in 96 rice lines selected from a set of 629 rice accessions21. The 3,000 (3 K) Rice Genome Project has produced a highly diverse pool carrying favourable alleles for different traits22,23. Determining important loci with less background interference has been a challenge for GWAS. Additionally, widely used methods based on single marker analyses accumulate errors with the application of multiple tests. Thus, a multiple test correction, such a Bonferroni correction, is commonly required. A multi-locus random-single nucleotide polymorphism (SNP)-effect mixed linear model package, named mrMLM, was recently developed to overcome this shortcoming24.
Here, we adopted a traditional GAPIT method combined with mrMLM to dissect the SV variations in a sub-panel of 744 accessions from the 3 K rice genome. The aims of this study were to (1) identify QTL regions for SV at different DAS in diverse rice germplasms using GWAS, (2) to fine-map and mine favourable alleles for candidate genes in some important QTL regions, and (3) to compare the mapping efficacies and respective characteristics of the genomic association and prediction integrated tool (GAPIT) and mrMLM methods. This work could offer useful information for the molecular breeding of DSR, as well as being a reference for the joint use of GAPIT and mrMLM.
Methods
Plant materials and field experiments
A set of 744 germplasms was used in this work. Its members were randomly selected from the sequenced accessions of the 3 K Rice Genome Project25. The field work was conducted at the experimental station in Shenzhen (22.6°N, 114.4°E), Guangdong Province during the late season of 2017. Approximately 120 germinated seeds for each accession were directly sown with 20 cm between rows, 15 cm between individuals and two seeds per hill in the paddy plot. At the one-leaf stage, 60 similar seedlings for each accession, with a single seedling in each hill, were maintained. The extra seedlings were removed. The tests were arranged in randomised plots with three replications. Field management was carried out according to the local practice. Specifically, ~113 kg ha−1 of carbamide was applied at the 2.5-leaf stage to supply nitrogen. Butachlor was applied at ~1.8 kg ha−1 for weed control during paddy preparation before seeding, and it was applied again after the 3-leaf stage.
The first sampling (stage A) was carried out at 27 DAS, and the second (stage B) and third (stage C) samplings were carried out at 34 and 41 DAS, respectively. From each accession, 10 uniform individuals, except for those on the boarders, were evaluated for SV traits. The SV traits, tiller number (TN), plant height (PH), and aboveground dry weight (DW), were measured for each individual.
Genotyping by sequencing and SNP extraction
The 744 accession panel was re-sequenced, with an average depth of more than 10×25. The cleaned reads were then mapped to the reference genome of ‘Nipponbare’ (IRGSP1.0), and ~14 M of high-quality SNPs were identified25. From these SNPs, a set of 2.9 M SNPs related to potential protein-coding areas was carefully selected. To build a SNP set for association studies, a subset of 27,921 SNPs was selected from the 2.9 M SNPs by choosing one SNP per 100 counts, as described in our previous GWAS mapping work23.
QTL analysis, comparative mapping and haplotype analysis
Basic statistical analyses for SV traits were conducted with SAS26. For the graphing and plotting, both Excel and R scripts were adopted. The basic scenario of a compressed mixed linear model27 implemented in GAPIT28 was adopted for the association analysis between QTL-flanking markers and SV traits for the 744 sequenced accessions. The parameters for GAPIT were set in accordance with our previous report23. A relatively stringent threshold was adopted to identify significant correlations between the SNPs and SV traits with a −LOG10(P) value of 6.0. To minimise to the possibility of type II errors in QTL detection29, a relatively loose threshold of 3.0 was adopted for the QTL regions having supporting evidence from different traits or previous reports. The allelic effects were estimated by setting the Major.allele.zero = TRUE in GAPIT to identify the donors of favourable alleles and their effects on SV traits. We used the mrMLM package24 to confirm and complement the mapping results from GAPIT.
Comparative mapping was carried out against a reference sequence map, the GRAMENE annotation sequence map30, to compare the QTLs detected in this study with previously reported QTLs or genes known to be associated with SV-related traits in rice.
Favourable haplotypes for candidate genes were investigated jointly with the aid of Perl and R scripts as described in our previous reports22,23, with minor modifications. The procedure included the following six major steps: (1) Determine the sub-regions defined by quantitative trait nucleotides (QTNs) with supporting evidence from both GAPIT and mrMLM; (2) Fill in the sub-regions with more SNPs from the original 2.9 M sets23, and then carry out GWASs using mrMLM within these sub-regions; (3) Screen the key QTNs with evidences from different SV-related traits; (4) Search for candidate genes harbouring these key QTNs for SV traits; (5) Carry out haplotype analyses for these candidate genes for SV-related traits to confirm the GWAS mapping results; and 6) Determine the top donors carrying favourable haplotypes.
Results
Performances of SV-related traits in the sequenced accessions
As shown in Fig. 1a–c, the three measured major SV-related traits, TN, PH, and DW, were different at the three sampling stages (27, 34, and 41 DAS). For the three sampling stages, the TN had ranges of 2.4–34.2, 4–36.6, and 4.6–37.8, respectively, with mean values of 13.1, 15.6, and 16.0, respectively. For the three sampling stages, the PH had ranges of 30.3–71.8 cm, 38.2–105.2 cm, and 45.2–118.8 cm, respectively, with mean values of 49.8, 64.2, and 76.4 cm, respectively. For the three sampling stages, the DW had ranges of 0.39–4.92 g, 1.47–10.34 g, and 3.34–18.38 g, respectively, with mean values of 2.20 g, 5.37 g, and 9.77 g, respectively. The ranges of the phenotypic distributions became wider at each progressive sampling stage. One unique trait was DW, for which the distribution range increased significantly from sampling stages A to C (Fig. 1c). However, the distribution range of TN and PH almost remained almost unchanged from sampling stages A to C. The pattern change for PH based on the sampling stages was between those of the previous two traits.
For each trait, all of the values from the different sampling stages were highly significantly correlated except for the correlations between PH and TN, and all of the correlations were in positive direction (Fig. 2). Especially, DW and TN were highly correlated having correlation coefficients greater than 0.60 at the first two sampling stages (27 and 34 DAS), but only ranged by 0.42–0.55 at the third sampling stage (41 DAS). However, the correlations between DW and PH (ranged by 0.35–0.59) were not as significant as the others.
Using the 27 K SNPs, sample clustering and a PCA were also performed. The PCA results for the 744 panel are shown in Fig. 3a, and the kinship between the 744 accessions is presented in Fig. 3b. Based on the PCA and the kinship analysis, the 744 accessions were divided into at least two large groups. Within each group, there were at least one or two subgroups. This was consistent with the two sub-species groupings of the rice accessions.
Linkage disequilibrium (LD) decay values were calculated throughout the genome, and the average values are shown in Supplementary Fig. 1. On average, LD blocks in this rice population of 744 germplasms extend up to ~50 kb.
Identification of QTL regions controlling SV-related traits at three sampling stages
A total of 42 QTL regions (Table 1) were detected by GWAS mapping of the three SV-related traits at three sampling stages in 744 sequenced rice accessions. Among them, 28 QTL regions were responsible for TN. Among these, 15, 11, and 15 QTL regions were detected by GAPIT and 10, 7, and 12 QTL regions were detected by mrMLM at 27, 34, and 41 DAS, respectively. The averaged −LOG10(P) values for SV-related QTL regions affecting TN at 27, 34, and 41 DAS were 3.5, 3.5, and 3.7, respectively, with ranges of 3.0–4.4, 3.0–6.0, and 3.0–5.7, respectively, as determined by GAPIT/mrMLM), and 3.6, 3.6, and 3.7, respectively, with ranges of 3.1–4.7, 3.0–6.0, and 3.0–5.7, respectively, as determined by mrMLM. Three QTLs (qSV2c, qSV5a, and qSV9b) were active at all three sampling stages.
A total of 35 SV-related QTL regions were responsible for PH. Among these, at 27, 34, and 41 DAS, 20, 26 and 21, respectively, were detected by GAPIT and 12, 18, and 11 respectively were detected by mrMLM. Eleven QTL regions (qSV1c, qSV1d, qSV2c, qSV4a, qSV5c, qSV6a, qSV6b, qSV7a, qSV7c, qSV8c, and qSV10b) were active throughout all three sampling stages as assessed by GAPIT. Four of them (qSV2c, qSV4a, qSV5c, and qSV7c) were confirmed by mrMLM. The averaged −LOG10(P) values for SV-related QTL regions affecting PH at 27, 34, and 41 DAS were 4.2, 3.9, and 3.9, respectively, according to GAPIT, with ranges of 3.0–6.7, 3.0–6.0, and 3.0–6.0, respectively, and 4.1, 3.6, and 3.9, respectively, according to mrMLM, with ranges of 3.0–6.9, 3.0–5.2, and 3.0–5.5, respectively.
Of the 31 SV-related QTL regions responsible for DW, 18, 12, and 8, as assessed by GAPIT, and 10, 8, and 7, as assessed by mrMLM, were detected at 27, 34, and 41 DAS, respectively. Only qSV2c was active throughout all three sampling stages according to GAPIT and it was confirmed by mrMLM at 27 and 34 DAS. The averaged −LOG10(P) values for SV-related QTL regions affecting DW at 27, 34, and 41 DAS, were 4.2, 4.0, and 3.9, respectively, with ranges of 3.0–6.7, 3.0–6.0, and 3.0–6.0, respectively, GAPIT, and 4.1, 3.6, and 3.9, respectively, with ranges of 3.0–6.9, 3.0–5.2, and 3.0–5.5, respectively, according to mrMLM.
Approximately 11.5% of the SV-related QTL regions were responsible for TN throughout all three sampling stages (27, 34, and 41 DAS). When we analysed PH with a higher heritability, the percentage of stable QTL regions reached 33.3%; however, the DW value dramatically decreased to 3.3%
Mapping results from GAPIT and mrMLM
As shown in Table 2, for the 28 SV-related QTL regions responsible for TN, GAPIT detected 26 (15, 11, and 15 for the three sampling stages, respectively), while mrMLM detected 14 (10, 7, and 12 for the three sampling stages, respectively). In a comparison between the two methods, 12 QTL regions overlapped (9, 7, and 11 for the three sampling stages, respectively). For the 33 QTL regions responsible for PH, GAPIT detected all of them, with 20, 26, and 21 for the three sampling stages, respectively, while mrMLM detected 27, with 12, 18, and 11 for the three sampling stages, respectively. The overlapping QTL regions between the two methods at the three sampling stages were 12, 18, and 11, respectively. For the 31 SV-related QTL regions responsible for DW, GAPIT detected 30, with 18, 12, and 8 for 27, 34, and 41 DAS, respectively, while mrMLM detected 23, with 10, 8, and 7 for the three sampling stages, respectively. The overlapping QTL regions between the two methods were 10, 8, and 6 at the three sampling stages, respectively.
As shown in Supplementary Figs 2–4, Manhattan plots with similar patterns were obtained for both GAPIT and mrMLM. However, using the threshold settings, relatively fewer QTL regions with clearer backgrounds were obtained by mrMLM.
Haplotype analysis of candidate genes for QTL regions
Based on the above association mapping results, four QTL regions, qSV1a, qSV3e, qSV4c, and qSV7c, were selected for fine-mapping using more SNPs. The region harbouring qSV3e was split into two sub-regions, which contained candidate Os03g0799700, with multi-evidenced QTNs, and Os03g0856700, with a single SNP but large −LOG10(P) value of 13.0 based on the ‘Nipponbare’ reference genome (Supplementary Table). Similarly, the other three QTL regions were delimited and harboured multi-evidenced QTNs after a sub-region analysis. Candidate genes containing significant SNPs or exactly located SNPs having phenotypic haplotype effects, as well as their donors, were analysed and are shown in Table 3 and Figs 4–8. Os01g0166800, OS03g0799700 and Os03g0856700 (OsGA20ox1), Os04g0683600, and Os07g0600400 were determined to be candidate genes for qSV1a, qSV3e, qSV4c, and qSV7c, respectively, owing to significant differences in related traits among their different haplotypes. As shown in Fig. 4, haplotype 2 of Os01g0166800 appeared to be elite, affecting both TN and DW without affecting PH, while haplotype 5 affected all three traits. Haplotypes 1 and 6 of OS03g0799700 were recommended (Fig. 5), affecting TN and DW without significantly increasing PH, while haplotype 3 influenced all three traits. Two haplotypes of Os03g0856700 (OsGA20ox1) appeared in the 744 germplasms (Fig. 6). Phenotypic differences among the three trait values (TN, PH, and DW) between the two haplotypes were highly significant at all three sampling stages, except for marginal differences in PH at 34 and 41 DAS (Fig. 6b). Haplotype 2 behaved better than haplotype 1 for Os03g0856700. For qSV4c, haplotype 1 was better than haplotype 2, while for qSV7c, haplotypes 1 and 3 were better than haplotype 2. The favourable haplotypes of qSV4c and qSV7c affected all the three SV-related traits (Figs 6 and 7).
Discussion
Comparison of identified SV-related QTL regions with reported genes/QTLs
Among the 42 SV-related QTL regions detected in this research, 18 (42.9%) were consistent with those of previous reports. Of these, 10 (qSV1d, qSV2a, qSV2c, qSV3e, qSV4b, qSV5c, qSV6b, qSV7c, qSV8d, and qSV11a) harboured more than one reported locus.
Among these QTLs, qSV2c was the most supported, and it was mapped with four reported loci, qGR2 for germination rate31, qSEV-2-2 for PH and plant dry weight16, qCSH2 for seedling height, and qSDW2 for SV-related traits under cold stress32. Another two QTL regions (qSV1d and qSV3e) were mapped with three reported loci. qSV1d was mapped with qSSL1b for seedling shoot length33, qRL-1 for root length34, and qLA-1 for leaf area35, while qSV3e was close to qPHS3-2/OsGA20ox1 for seedling height36, qSEV-3-4 for both PH and plant dry weight16, and qSV-3-2 for shoot length and germination rate13. Seven QTL regions (qSV2a, qSV4b, qSV5c, qSV6b, qSV7c, qSV8d, and qSV11a) were each mapped with two reported loci. The qSV2a was mapped with qFML2-1 for mesocotyl length20 and qRL-2 for root length35; qSV4b was mapped with qSDW4.2 for shoot dry weight37 and qGP-4 for germination percentage38; qSV5c was mapped with qFV-5-2 for seedling height15 and qSEV-5-2 for both PH and plant dry weight16, although these two were detected in different tests of the same population. qSV6b was close to qDW-6 for seedling dry weight34 and qLDW-6 for leaf biomass35; qSV7c was mapped with qSDW5-1 for shoot dry weight11 and qSL-7 for shoot length35; qSV8d was mapped with qSV-8-2 for germination rate, shoot length, and root length13 and with qPR-8 for biomass portioning to roots35, while qSV11a was mapped with qFML11-1 for mesocotyl length20 and with qLN-11 for leaf number35. QTL regions for the SV-related traits that were identified in different mapping populations and diverse environments could be beneficial for the marker-assisted selection-based development of varieties with high levels of SV.
Dynamic expression of SV-related QTL regions and their pleiotropy for SV-related traits
Improving SV has long been a focus of rice breeders. In particular, ESV (less than 28 DAS) is important for DSR under aerobic conditions in which they compete with weeds. In LSV, the tillering ability and biomass are the bases for the final yield components.
However, LSV is also based on the ESV. Thus, these two types of SV are highly associated. More than one third (42.9%) of the SV-related QTL regions detected in this work were mapped with reported ESV loci. However, four QTLs (qSV1d, qSV2c, qSV3e, and qSV4b) were not only mapped with more than one reported loci, but were also detected by both GAPIT and mrMLM, and all of them were responsible for PH. PH is an agronomic trait with a relatively high level of heritability. Thus, more QTL regions were responsible for PH (33 and 27 by GAPIT and mrMLM, respectively) compared with TN (26 and 23, respectively) and DW (30 and 23, respectively). In total, 11 of the loci at the QTL regions stably detected at all three sampling stages were responsible for PH. However, the number of stably detected QTL regions decreased to three and one for TN and DW traits, respectively.
One locus on chromosome 2 (qSV2c) affected all the three traits (TN, PH, and DW) throughout all three sampling stages (27, 34, and 41 DAS). This locus is also consistent with the three previously reported loci (qGR231, qCSH2, and qSDW2)32. This stably expressed locus could be very useful and will be analysed further. However, the −LOG10(P) values for qSV2c were not very high. They ranged from 3.0 to 3.4, with a mean value of 3.2, for TN, from 3.8 to 4.7, with a mean value of 4.4, for PH, and from 3.0 to 3.3, with a mean value of 3.1, for DW.
The number of SV-related QTL regions expressed under different sampling stages varied according to the studied traits. For example, there were totally 28 loci responsible for TN, 33 loci for PH, and 31 for DW. The numbers of TN-related QTL regions for the three different sampling stages varied by 16, 11, and 16, respectively, while the numbers of PH-related QTL regions varied by 20, 26, and 21, respectively for the three stages. However, the numbers of DW-related QTL regions decreased by 18, 12, and 9, respectively. A dynamic pattern was determined for the SV-related QTL regions. In total, 33.3% of PH-related QTL regions were detected in all three sampling stages, and 36.4% were responsible for PH at two stages. Only 10.7% of TN-related and 3.2% of DW-related QTL regions were detected in all three stages, and 32.1% of the former and 19.4% of the latter were detected at two stages.
Joint mapping of QTLs using GAPIT and mrMLM
GAPIT is based on a single dimension of genome scanning. We used the built-in maximum-likelihood method for the GWAS mapping of SV-related traits, while mrMLM was used because of its multi-locus nature. Thus, for trait variations controlled by multiple loci, the estimations of the loci effects by mrMLM should have greater confidence levels24.
Using a common setting, in comparison with GAPIT, mrMLM detected relatively fewer numbers of loci with a similar statistical power, indicated by −LOG10(P) values, in most cases. Additionally, as shown in Supplementary Figs 2–4, the threshold setting influenced QTL mapping. mrMLM focused on loci with high confidence levels and produced results with clearer backgrounds than GAPIT. This would be adequate for most molecular breeding or gene cloning applications. However, to determine the whole picture of a trait, a traditional package, such as GAPIT, should also be used. Conversely, as shown in this report, QTL regions can be located with GAPIT and then, supporting evidence obtained from mrMLM.
For the estimated allelic effects, of the 84 cases detected both by GAPIT and mrMLM, only 25 (29.8%) cases showed consistent effect directions. According to previous reports, with simulated data, a multi-loci method (mrMLM) is more powerful and more accurate in QTN effect estimation than single-locus methods24. All of these QTL regions were responsible for TN and PH rather than DW. Further experiments are required to clarify this inconsistency.
Additionally, as an example, we adopted SV mapping results from GAPIT with a suitable threshold setting to minimise possible type II errors. With the aid of mrMLM, the SV mapping results should have greater confidence levels. Thus, combining GAPIT and mrMLM is an option for our future GWAS mapping.
Candidate gene identification in important QTL regions
In our 744 accessions, the average LD decayed significantly to 0.3 at a physical distance of ~50 kb (Supplementary Fig. 1). This is much less than the previously reported LD distance of 75 kb for Xian/indica39, not to mention the 150-kb LD distance for tropical Geng/japonica, and the >500-kb LD distance for temperate Geng/japonica. Thus, in this population, especially for the specific QTL regions, a sub-regional analysis with higher density markers may offer more useful information.
Based on the sub-region analysis using more SNPs and the haplotype analysis, five candidate genes, Os01g0166800, OS03g0799700 and Os03g0856700 (OsGA20ox1), Os04g0683600, and Os07g0600400 were inferred for qSV1a, qSV3e, qSV4c, and qSV7c, respectively, using multi-evidenced QTNs. The candidate gene Os01g0166800 for qSV1a is an E2F transcription factors target gene. Os03g0799700 for qSV3e is a GTP1/OBG subdomain-containing protein. Both E2F- and GTP-related pathways are highly conserved in higher eukaryotes, and are involved in multiple basic functions, including cell cycle, DNA replication, and germination40,41. Os03g0856700, or OsGA20ox1, for qSV3e is a key gene responsible for seedling height, which is especially associated with ESV36, while later it is associated with spikelet number per panicle42. Thus, it is most likely the candidate gene for qSV3e. Os04g0683600 for qSV4c is similar to the H0306F12.6 protein, which is a putative LRR receptor-like serine/threonine-protein kinase in maize but its function remains unclear in Oryza43. Os07g0600400 for qSV7c codes a WD40/YVTN repeat-like domain-containing protein, which may function in the processes of signal transduction and stress adaptation44. The candidate genes for the four QTL regions need to be validated using transgenic or gene editing approaches in the future.
Implications in the development of DSR cultivars
Both ESV and LSV are important for cultivars used in DSR cultivating systems. The ESV is highly correlated to the ability of rice seedlings to compete with weeds, especially under the aerobic condition (as in UDS)45. In PDS, weed control is much easier than in UDS. The plant type, which develops during the period of LSV, including the TN, PH, and DW contribute to the final grain yield. Most alleles at important stably detected QTL regions and/or in regions with multi-evidenced QTNs, such as qSV1a, qSV3e, qSV4c, and qSV7c, increase the TN, PH, and DW (Table 1). Further validation of favourable alleles at the above four stable QTL regions detected throughout the three sampling stages will be performed.
A rice cultivar with strong SV-related traits is desirable for direct seeding. However, SV-related traits have not been selected for in crop improvement programs focused on conventional breeding owing to their complex nature and quantitative inheritances. Molecular markers are effective in increasing selection efficiency, particularly for quantitative traits that are simply inherited. In this study, except for qSV2c’s roles in all three SV-related traits, there were 2 QTL regions (qSV5a and qSV9b) responsible for TN only, and 10 QTL regions (qSV1c, qSV1d, qSV4a, qSV5c, qSV6a, qSV6b, qSV7a, qSV7c, qSV8c, and qSV10b) responsible for PH only. A simple increase in PH would result in problems in the final maturation stage and may cause lodging, which is another key trait for DSR. A balance between TN and PH is favourable for increased yields. Thus, an appropriate combination of TN and PN for both a high SV at the seedling stage and a high yield potential at harvest could be achieved by pyramiding the QTLs underlying the two traits, such as qSV5a and qSV9b for TN and qSV1c and qSV4afor PH, after QTL validation using a marker-assisted selection approach.
With a high density of SNP markers, we identified 11 favourable haplotypes for four loci. These haplotypes, defined by key SNPs, can be transformed into breeder-friendly markers in the future. In particular, haplotype 2 of qSV1a, and haplotype 1, 4, and 6 of qSV3e offer more opportunities for the development of cultivars with greater biomasses, resulting from more tillers rather than taller plants. PDR cultivars with this kind of SV would not affect lodging resistance and would have relatively wider adaptations. In addition, elite donors for these favourable haplotypes were also identified (Table 3). For example, IRIS_313-12135 can be used as the donor for haplotype 2 of qSV1a, and IRIS_313-12033, IRIS_313-11863, and IRIS_313-11807 can be used for haplotype 6 of qSV3e. IRIS_313-11453 can be used as a donor for haplotype 1 of qSV3e, while CX92 can be the donor for haplotype 4 of qSV3e. For OsGA20ox1, IRIS_313-10973 could be a good donor for the elite haplotype 2.
Conclusions
Tremendous variations for three SV-related traits, TN, PH, and DW, existed in the studied rice germplasms. Using a GWAS of GAPIT and mrMLM, 42 QTL regions, including 18 overlapping previously reported QTL regions and 24 new ones, for the SV-related traits were identified. Five candidate genes were inferred by fine-mapping using more SNPs and a haplotype analysis, including the known gene OsGA20ox1 for qSV3e, which controls ESV and spikelet number per panicle at the maturing stage. Our results indicated that combining GAPIT and mrMLM is an option for GWAS mapping. Favourable alleles at stably expressed QTLs, such as qSV1a, qSV3e, qSV4c, and qSV7c, were mined, and their corresponding accessions were also identified. The results provide useful germplasms and genetic information for the future improvement of SV in rice.
References
Bhushan, L. et al. Saving of water and labor in a rice-wheat system with no-tillage and direct seeding technologies. Agronomy Journal 99, 1288–1296, https://doi.org/10.2134/agronj2006.0227 (2007).
Liu, H. Y. et al. Dry seeded rice as an alternative to transplanted-flooded rice in Central China. Agronomy for Sustainable Development, 285–294 (2015).
Jiang, Q. W. et al. Response of first flood irrigation timing after rice dry-direct-seeding: Productivity and greenhouse gas emissions in Central China. Agricultural Water Management 177, 241–247, https://doi.org/10.1016/j.agwat.2016.08.006 (2016).
Zhao, D. L., Bastiaans, L., Atlin, G. N. & Spiertz, J. H. J. Interaction of genotype × management on vegetative growth and weed suppression of aerobic rice. Field Crops Research 100, 327–340, https://doi.org/10.1016/j.fcr.2006.08.007 (2007).
Richards, R. A. Defining selection criteria to improve yield under drought. Plant Growth Regulation 20, 157–166 (1996).
Mgonja, M. A., Ladeinde, T. A. O. & Akenova, M. E. Genetic analysis of mesocotyl length and its relationship with other agronomic characters in rice (Oryza saliva L). Euphytica 72, 189–195 (1993).
Redoña, E. D. & Mackill, D. J. Genetic variation for seedling vigor traits in rice. Crop Science 36, 285–290 (1996).
Namuco, O. S., Cairns, J. & Johnson, D. E. Investigating early vigour in upland rice (Oryza sativa L.): Part I. Seedling growth and grain yield in competition with weeds. Vol. 113 (2009).
Onishi, K. et al. A QTL cluster for plant architecture and its ecological significance in Asian wild rice. Breeding Science 57, 7–16, https://doi.org/10.1270/jsbbs.57.7 (2007).
Redoña, E. D. & Mackill, D. J. Mapping quantitative trait loci for seedling vigor in rice using RFLPs. Theoretical and Applied Genetics 92, 395–402 (1996).
Cui, K. et al. Molecular dissection of seedling-vigor and associated physiological traits in rice. Theoretical and Applied Genetics 105, 745–753, https://doi.org/10.1007/s00122-002-0908-2 (2002).
Xu, C. G. et al. Comparison of quantitative trait loci controlling seedling characteristics at two seedling stages using rice recombinant inbred lines. TAG. Theoretical and applied genetics. Theoretische und angewandte Genetik 109, 640–647, https://doi.org/10.1007/s00122-004-1671-3 (2004).
Zhang, Z. H., Qu, X. S., Wan, S., Chen, L. H. & Zhu, Y. G. Comparison of QTL controlling seedling vigour under different temperature conditions using recombinant inbred lines in rice (Oryza sativa). Annals of botany 95, 423–429, https://doi.org/10.1093/aob/mci039 (2005).
Zhang, Z. H., Yu, S. B., Yu, T., Huang, Z. & Zhu, Y. G. Mapping quantitative trait loci (QTLs) for seedling-vigor using recombinant inbred lines of rice (Oryza sativa L.). Field Crops Research 91, 161–170, https://doi.org/10.1016/j.fcr.2004.06.004 (2005).
Zhou, L. et al. Quantitative trait loci for seedling vigor in rice under field conditions. Field Crops Research 100, 294–301, https://doi.org/10.1016/j.fcr.2006.08.003 (2007).
Lu, X. L. et al. Genetic dissection of seedling and early vigor in a recombinant inbred line population of rice. Plant Science 172, 212–220, https://doi.org/10.1016/j.plantsci.2006.08.012 (2007).
Xie, L. X. et al. Identification and fine mapping of quantitative trait loci for seed vigor in germination and seedling establishment in rice. Journal of Integrative Plant Biology 56, 749–759, https://doi.org/10.1111/jipb.12190 (2014).
Agrama, H. A. & Eizenga, G. C. Molecular diversity and genome-wide linkage disequilibrium patterns in a worldwide collection of Oryza sativa and its wild relatives. Euphytica 160, 339–355 (2008).
Zhao, K. et al. Genome-wide association mapping reveals a rich genetic architecture of complex traits in Oryza sativa. Nature communications 2, 467, https://doi.org/10.1038/ncomms1467 (2011).
Zhao, Y. et al. Genetic architecture and candidate genes for deep-sowing tolerance in rice revealed by non-syn GWAS. Frontiers in Plant Science 9, https://doi.org/10.3389/fpls.2018.00332 (2018).
Anandan, A., Anumalla, M., Pradhan, S. K. & Ali, J. Population structure, diversity and trait association analysis in rice (Oryza sativa L.) germplasm for early seedling vigor (ESV) using trait linked SSR markers. Plos One 11, e0152406, https://doi.org/10.1371/journal.pone.0152406 (2016).
Zhang, G. M. et al. Joint exploration of favorable haplotypes for mineral concentrations in milled grains of rice (Oryza sativa L.). Frontiers in Plant Science 9, https://doi.org/10.3389/fpls.2018.00447 (2018).
Zhang, Q. et al. Joint mapping and allele mining of the rolled leaf trait in rice (Oryza sativa L.). Plos One 11, e0158246, https://doi.org/10.1371/journal.pone.0158246 (2016).
Wang, S. B. et al. Improving power and accuracy of genome-wide association studies via a multi-locus mixed linear model methodology. Scientific reports 6, 19444, https://doi.org/10.1038/srep19444 (2016).
The-3K-rice-genomes-project. The 3,000 rice genomes project. GigaScience 3, 7 (2014).
Inc., S. A. S. I. SAS/STAT 9.1 user’s guide. (SAS Institute Inc., 2004).
Zhang, Z. W. et al. Mixed linear model approach adapted for genome-wide association studies. Nat Genet 42, 355–360, http://www.nature.com/ng/journal/v42/n4/suppinfo/ng.546_S1.html (2010).
Lipka, A. E. et al. GAPIT: genome association and prediction integrated tool. Bioinformatics 28, 2397–2399, https://doi.org/10.1093/bioinformatics/bts444 (2012).
Li, Z. K. In Rice Genetics IV Vol. IV (eds Khush, G. S., Brar, D. S. & Hardy, B.) 153–172 (Science Publishers, Inc., and International Rice Research Institute, 2001).
Gramene. Gramene database, http://www.gramene.org/ (2018).
Diwan, J., Channbyregowda, M., Shenoy, V., Salimath, P. & Bhat, R. Molecular mapping of early vigour related QTLs in rice. Research. Journal of Biology 1, 24–30 (2013).
Han, L. Z. et al. Identification of quantitative trait loci for cold response of seedling vigor traits in rice. Journal of Genetics and Genomics 34, 239–246, https://doi.org/10.1016/S1673-8527(07)60025-3 (2007).
Zhang, A. P. et al. Genetic analysis for rice seedling vigor and fine mapping of a major QTL qSSL1b for seedling shoot length. Breeding Science 67, 307–315, https://doi.org/10.1270/jsbbs.16195 (2017).
Wang, Y., Guo, Y. & Hong, D. L. Mining of elite alleles for seed vigor traits in two populations of japonica rice (Oryza sativa L.) in Taihu Lake region. Acta Agronomica Sinica 36, 754–763, https://doi.org/10.1016/S1875-2780(09)60048-1 (2010).
Cairns, J. E. et al. Investigating early vigour in upland rice (Oryza sativa L.): Part II. Identification of QTLs controlling early vigour under greenhouse and field conditions. Field Crops Research 113, 207–217, https://doi.org/10.1016/j.fcr.2009.05.007 (2009).
Abe, A. et al. OsGA20ox1, a candidate gene for a major QTL controlling seedling vigor in rice. TAG. Theoretical and applied genetics. Theoretische und angewandte Genetik 125, 647–657, https://doi.org/10.1007/s00122-012-1857-z (2012).
Cheng, X. X. et al. Dynamic quantitative trait loci analysis of seed reserve utilization during three germination stages in rice. Plos One 8, e80002, https://doi.org/10.1371/journal.pone.0080002 (2013).
Wang, Z. F., Wang, J. F., Bao, Y. M., Wang, F. H. & Zhang, H. S. Quantitative trait loci analysis for rice seed vigor during the germination stage. Journal of Zhejiang University Science B 11, 958–964, https://doi.org/10.1631/jzus.B1000238 (2010).
Mather, K. A. et al. The extent of linkage disequilibrium in rice (Oryza sativa L.). Genetics 177, 2223–2232, https://doi.org/10.1534/genetics.107.079616 (2007).
Vandepoele, K. et al. Genome-wide identification of potential plant E2F target genes. Plant Physiology 139, 316–328, https://doi.org/10.1104/pp.105.066290 (2005).
Leipe, D. D., Wolf, Y. I., Koonin, E. V. & Aravind, L. Classification and evolution of P-loop GTPases and related ATPases. J Mol Biol 317, 41–72, https://doi.org/10.1006/jmbi.2001.5378 (2002).
Wu, Y. et al. The QTL GNP1 encodes GA20ox1, which increases grain number and yield by increasing cytokinin activity in rice panicle meristems. Plos Genetics 12, e1006386, https://doi.org/10.1371/journal.pgen.1006386 (2016).
Feng, Q. et al. Sequence and analysis of rice chromosome 4. Nature 420, 316–320, https://doi.org/10.1038/nature01183 (2002).
Gaudet, R., Savage, J. R., McLaughlin, J. N., Willardson, B. M. & Sigler, P. B. A molecular mechanism for the phosphorylation-dependent regulation of heterotrimeric G proteins by phosducin. Mol Cell 3, 649–660, https://doi.org/10.1016/s1097-2765(00)80358-5 (1999).
Mahender, A., Anandan, A. & Pradhan, S. K. Early seedling vigour, an imperative trait for direct-seeded rice: an overview on physio-morphological parameters and molecular markers. Planta 241, 1027–1050, https://doi.org/10.1007/s00425-015-2273-9 (2015).
Acknowledgements
We appreciate the supports from the National Key R&D Program of China (2016YFD0100301, 2016YFD0101801, 2017YFD0100100), the Agricultural Science and Technology Innovation Program and the Cooperation and Innovation Mission (CAAS-ZDXT201800), the Research Project of Agricultural Public Welfare Industry in China (201403002-7), Shenzhen Peacock Plan, the National Key Technology Support Program (2015BAD02B01), Natural Science Foundation of China (31471429), and the “Green Super Rice” project from Bill & Melinda Gates’ Foundation (OPPGD1393). We thank Lesley Benyon, PhD, from Liwen Bianji, Edanz Group China (www.liwenbianji.cn/ac), for editing the English text of a draft of this manuscript.
Author information
Authors and Affiliations
Contributions
Conceived and designed the experiments: J.L.X., T.Q.Z. and Z.K.L. Performed the experiments: K.C., Q.Z., Z.X.L. and Y.J.J. Analysed the data: C.C.W., T.Q.Z. and L.Y.Z. Wrote the paper: J.L.X. and T.Q.Z.
Corresponding authors
Ethics declarations
Competing Interests
The authors declare no competing interests.
Additional information
Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Chen, K., Zhang, Q., Wang, CC. et al. Genetic dissection of seedling vigour in a diverse panel from the 3,000 Rice (Oryza sativa L.) Genome Project. Sci Rep 9, 4804 (2019). https://doi.org/10.1038/s41598-019-41217-x
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-019-41217-x
This article is cited by
-
The Function of SD1 on Shoot Length and its Pyramiding Effect on Shoot Length and Plant Height in Rice (Oryza sativa L.)
Rice (2024)
-
Linkage mapping and quantitative trait loci detection for seedling vigor and grain size in advanced backcross introgression lines from wild accessions of Oryza nivara
Genetic Resources and Crop Evolution (2024)
-
Progress in the study of functional genes related to direct seeding of rice
Molecular Breeding (2023)
-
Genome-wide association mapping combined with gene-based haplotype analysis identify a novel gene for shoot length in rice (Oryza sativa L.)
Theoretical and Applied Genetics (2023)
-
Identification of superior haplotypes in a diverse natural population for breeding desirable plant height in soybean
Theoretical and Applied Genetics (2022)
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.