Seed protein content and its relationships with agronomic traits in pigeonpea is controlled by both main and epistatic effects QTLs

Obala, Jimmy; Saxena, Rachit K.; Singh, Vikas K.; Kale, Sandip M.; Garg, Vanika; Kumar, C. V. Sameer; Saxena, K. B.; Tongoona, Pangirayi; Sibiya, Julia; Varshney, Rajeev K.

doi:10.1038/s41598-019-56903-z

Download PDF

Article
Open access
Published: 14 January 2020

Seed protein content and its relationships with agronomic traits in pigeonpea is controlled by both main and epistatic effects QTLs

Jimmy Obala^1,2,
Rachit K. Saxena¹,
Vikas K. Singh¹,
Sandip M. Kale¹,
Vanika Garg¹,
C. V. Sameer Kumar³,
K. B. Saxena^nAff4,
Pangirayi Tongoona²,
Julia Sibiya² &
…
Rajeev K. Varshney¹

Scientific Reports volume 10, Article number: 214 (2020) Cite this article

2608 Accesses
15 Citations
9 Altmetric
Metrics details

Subjects

Abstract

The genetic architecture of seed protein content (SPC) and its relationships to agronomic traits in pigeonpea is poorly understood. Accordingly, five F₂ populations segregating for SPC and four agronomic traits (seed weight (SW), seed yield (SY), growth habit (GH) and days to first flowering (DFF)) were phenotyped and genotyped using genotyping-by-sequencing approach. Five high-density population-specific genetic maps were constructed with an average inter-marker distance of 1.6 to 3.5 cM, and subsequently, integrated into a consensus map with average marker spacing of 1.6 cM. Based on analysis of phenotyping data and genotyping data, 192 main effect QTLs (M-QTLs) with phenotypic variation explained (PVE) of 0.7 to 91.3% were detected for the five traits across the five populations. Major effect (PVE ≥ 10%) M-QTLs included 14 M-QTLs for SPC, 16 M-QTLs for SW, 17 M-QTLs for SY, 19 M-QTLs for GH and 24 M-QTLs for DFF. Also, 573 epistatic QTLs (E-QTLs) were detected with PVE ranging from 6.3 to 99.4% across traits and populations. Colocalization of M-QTLs and E-QTLs explained the genetic basis of the significant (P < 0.05) correlations of SPC with SW, SY, DFF and GH. The nature of genetic architecture of SPC and its relationship with agronomic traits suggest that genomics-assisted breeding targeting genome-wide variations would be effective for the simultaneous improvement of SPC and other important traits.

Meta-analysis of QTL reveals the genetic control of yield-related traits and seed protein content in pea

Article Open access 28 September 2020

Anthony Klein, Hervé Houtin, … Judith Burstin

Genetic Analysis of High Protein Content in ‘AC Proteus’ Related Soybean Populations Using SSR, SNP, DArT and DArTseq Markers

Article Open access 23 December 2019

Bahram Samanfar, Elroy R. Cober, … Stephen J. Molnar

Genome-wide association study uncovers genomic regions associated with grain iron, zinc and protein content in pearl millet

Article Open access 10 November 2020

Mahesh Pujar, S. Gangaprasad, … Himabindu Kudapa

Introduction

Protein deficiency affects the health of millions of children and their mothers, but protein-rich plant foods may offer solutions particularly in areas of the world where intake of animal protein is low¹. One such crop is pigeonpea (Cajanus cajan (L.) Millsp), which serves as an important source of dietary protein to over one billion people globally². It is widely cultivated in the tropics and semi-arid tropics of Asia and Africa. Pigeonpea maintains better yields than other legume crops under environmental extremes such as heat, drought and low soil fertility conditions^3,4. These attributes position pigeonpea as the preffered crop for the resources-poor farmers in marginal environments⁵. However, increasing seed protein content (SPC) of pigeonpea is, therefore, an important contribution towards alleviating malnutrition among the poor. Improvement of SPC requires an understanding of its genetic architecture and how it relates to traits of agronomic importance.

Few studies have been reported on genetic control of SPC in pigeonpea with results suggesting quantitative inheritance^6,7. However, the classical quantitative genetics approaches used in the reported studies are limited in power and resolution to dissect the genetic architecture of a quantitative trait like SPC. Similarly, information is limited on the genetic basis of the often positive or negative or no relationships of SPC with agronomic traits such as seed yield, seed weight, days to flowering, and growth habit in the crop^7,8. Determining the genetic basis of trait correlations in pigeonpea is essential in designing breeding strategies that aim at improving and stabilizing SPC while maintaining yield and other desirable agronomic attributes. The available genomics, transcriptomics and proteomics resources in pigeonpea coupled with advances in high-throughput genotyping technologies provide opportunity to dissect the genetic architecture of several quantitative traits in the crop^{2,9,10,11,12,13,14,15}. However, genetic architecture of SPC in pigeonpea and the basis of its relationships with other traits of importance has remained untouched by the genomic revolution in the crop. A common genomics approach to understand the genetic architecture of quantitative traits involves whole genome scans to find quantitative trait loci (QTLs)¹⁶.

Through QTL analysis, genetic parameters such as number of loci, effect types and sizes and epistasis, which constitute the genetic architecture underlying quantitative phenotypic variation can be estimated¹⁷. The parameters, however, are commonly population specific¹⁸. As a result, QTLs identified in a given population may not necessarily be found in another population¹⁹. Thus, an account of the genetic architecture of a trait based on a single population likely describes only a small proportion of all the loci, their effects, and potential interactions that contribute to the intraspecific phenotypic variation for a trait^17,20. Similarly, QTL analysis involving multiple traits allows for the decomposition of genetic bases of within trait variations as well as correlations among traits in terms of the signs and magnitudes of QTL effects²¹. To this end, the use of two or more segregating mapping populations with two or more measured traits in a single study have become common^20,22,23. Regardless of the number of segregating populations, QTL analysis is preceded by the development of appropriate mapping populations and anchoring of markers on a genetic map.

In view of the above, the present study reports on the first attempt to dissect the genetic architecture of SPC in pigeonpea in a manner that incorporates an investigation of the genetic basis of its correlations with other important agronomic traits. To achieve this, five segregating F₂ populations were phenotyped for SPC and agronomic traits including seed yield (SY), 100-seed weight (SW), days to first flowering (DFF) and growth habit (GH). These populations were genotyped using genotyping-by-sequencing (GBS) approach. Five SNPs-based population-specific, and a consensus genetic maps were constructed. Analysis of both main effect QTLs (M-QTL_S) and epistatic QTLs (E-QTLs) revealed the genetic architecture of SPC variation and the basis of its correlations with the measured agronomic traits.

Results

Phenotypic variation in SPC and agronomic traits in five mapping populations

The mean value of SPC in low protein containing parents ranged from 19.3 to 21.5% and from 22.3 to 24.6% among high protein containing parents (Table 1). The lowest SPC difference (0.8%) was observed between crossing parents of Pop2 and highest (3.1%) was in crossing parents of Pop5. In the case of F₂s, SPC ranged from 5.8% in Pop5 to 10.3% (Pop3) while mean SPC ranged from 19.44 ± 1.28% in Pop4 to 23.06 ± 1.08% (Pop5). Similar statistics for DFF, SW and SY have been presented (Table 1). Shapiro-Wilk test showed that distributions for SPC in Pop1, Pop3 and Pop5 were not significantly (P > 0.05) different from a Gaussian distribution while Pop2 and Pop4 differed significantly (P ≤ 0.05) from a normal distribution. Such non-Gaussian distributions were also noted for most of the other traits such as DFF in Pop1, Pop2, Pop3 and Pop4; SW (Pop2 and Pop5) and SY in all five populations.

Table 1 Population size, mean, variance, skewness, kurtosis, minimum and maximum values, and w-test for seed protein content (SPC), days to first flowering (DFF), 100-seed weight (SW) and seed yield (SY) in five F₂ mapping populations of pigeonpea.

Full size table

Phenotypic correlation among traits

Correlations among traits are presented in Table 2. Correlations were negative between SPC and DFF in all mapping populations but significant (P ≤ 0.05) in only two populations (Pop1 and Pop3). Similarly, correlations between SPC and SY were negative and significant in all populations except in Pop4. In contrast, significant positive correlations were noted between SPC and GH in three of the five populations. While correlations between SPC and SW were positive in all populations except Pop3, although only significant in two populations (Pop1 and Pop2). Correlations between agronomic traits were generally negative and significant for DFF × GH and SY × GH, positive and nonsignificant for SW × GH and DFF × SY, and negative and nonsignificant for SY × SW.

Table 2 Correlation coefficient among traits in five F₂ mapping populations of pigeonpea.

Full size table

Sequence data and SNP discovery

In total, 403.66 million reads (40.77 Gb), 343.26 million reads (34.76 Gb), 339.25 million reads (33.89 Gb), 284.77 million reads (28.76 Gb) and 298.56 million reads (30.15 Gb) of clean GBS reads were generated using HiSeq. 2500 platform from parents and 178 F₂s (Pop1), 175 F₂s (Pop2), 157 F₂s (Pop3), 137 F₂s (Pop4) and 179 F₂s (Pop5), respectively (Table 3; Suplementary Table S1). It is important to mention that sequence data generated and SNPs identified in Pop5 have been taken from Saxena et al.¹². The reads from individual progenies ranged from 0.79 to 5.82 million reads in Pop1, 0.49 to 9.52 million reads in Pop2, 0.73 to 6.84 million reads in Pop3, 0.84 to 8.19 million reads in Pop4, and 0.41 to 5.26 million reads in Pop5. Also, a total of 1.13 (ICP 11605) and 2.60 (ICP 14209) million reads of Pop1 parents, 3.00 (ICP 8863) and 7.59 (ICP 11605) million reads of Pop2 parents, 2.96 (HPL 24) and 3.31 (ICP 11605) million reads of Pop3 parents, 2.56 (ICP 8863) million reads of Pop4 parent, and 5.37 (ICP 5529) and 1.61 (ICP 11605) million reads of Pop5 parents, were generated. The final number of good quality SNPs produced were 15,728 in Pop1, 7,494 in Pop2, 12,030 in Pop3, 11,526 in Pop4 and 12,654 in Pop5¹² (Table 4).

Table 3 Number of reads and data size in gigabytes (Gb) generated in five F₂ mapping populations of pigeonpea. P₁, parent 1_; P₂, parent 2; Pop1, ICP 11605 (P₁) × ICP 14209 (P₂); Pop2, ICP 8863 (P₁) × ICP 11605 (P₂); Pop3, HPL 24 (P₁) × ICP 11605 (P₂); Pop4, ICP 8863 (P₁) × ICPL 87119 (P₂); Pop5, ICP 5529 (P₁) × ICP 11605 (P₂); n, F₂ population size; ^†Information obtained from Saxena et al.¹².

Full size table

Table 4 Features of individual genetic maps from five F₂ mapping populations of pigeonpea Pop1: ICP 11605 × ICP 14209, Pop2: ICP 8863 × ICP 11605, Pop3: HPL 24 × ICP 11605, Pop4: ICP8863 × ICPL 87119, Pop5: ICP 5529 × ICP 11605, ^†Information obtained from Saxena et al.¹².

Full size table

Population specific genetic maps

From a total of 15,728, 7,494, 12,030, 11,526 and 12,662 SNPs identified, 3,607, 1,419 2,901, 3,941, and 2,935 SNPs in Pop1, Pop2, Pop3, Pop4 and Pop5 (genetic map information for Pop5 has been taken from Saxena et al.¹²), respectively, which segregated in 1:2:1 F₂ genotypic ratio at a χ² cutoff P ≥ 10⁻⁹ were retained for genetic mapping (Table 4). Owing to high distortion from the expected F₂ segregation ratio, SNPs segregating in a 1:2:1 ratio at P > 0.05 were used as anchor markers for initial genetic map construction. As a result, a total of 82, 90, 94, 29 and 140 SNPs in Pop1, Pop2, Pop3, Pop4 and Pop5, respectively could be mapped in the base or anchor genetic maps. A further 580, 273, 513, 967 and 647 SNPs, which segregated in 1:2:1 ratio at P < 0.05 ≥ 10⁻⁹ could be added to the base map resulting in 662, 363, 607, 996 and 787 SNPs mapped, with map lengths of 1419.1 cM, 1327.6 cM, 1546.8 cM, 1599.8 cM and 1454.0 cM in Pop1, Pop2, Pop3, Pop4 and Pop5, respectively. The average marker spacing in cM among the five genetic maps were 2.1 (Pop1), 3.5 (Pop2), 2.3 (Pop3), 1.6 (Pop4) and 1.8 (Pop5). The number of gaps larger than 10.0 cM ranged from 13 in Pop1 to 33 (Pop2). The largest gaps on the genetic maps ranged from 22.3 cM in Pop1 to 40 cM in Pop2 (Table 4, Supplementary Figs. S1–S5).

Consensus genetic map

All markers used in the construction of the consensus map in the present study were SNPs. As a result, there was no discrepancy in marker names among the individual maps. Segregation data for 3,400 markers from five mapping populations were used to integrate the multiple genetic maps into a consensus map (Table 5). Among the markers, 2,386 were distinctive to particular mapping populations, 617 were common between two, 227 among three and 170 among four mapping populations. The common markers were used as anchor points for integration of the individual genetic maps. Most of the linkage groups for the population-specific genetic maps were integrated into the consensus genetic map. All common markers together led to the production of a consensus genetic map comprising 984 loci on 11 CcLGs covering a map distance of 1,609.5 cM with an average inter-marker distance of 1.6 cM (Supplementary Fig. S7).

Table 5 Number of common markers among five F₂ mapping populations of pigeonpea. Pop1: ICP 11605 × ICP 14209, Pop2: ICP 8863 × ICP 11605, Pop3: HPL 24 × ICP 11605, Pop4: ICP8863 × ICPL 87119, Pop5: ICP 5529 × ICP 11605. n, number of markers.

Full size table

Collinearity between component and consensus genetic maps

All genetic maps were, to a large extent, collinear with the consensus map (Table 6; Figs. 1 and 2; Supplementary Fig. S6). However, component CcLGs from Pop1 (CcLG02, CcLG07, CcLG08 and CcLG11), Pop2 (CcLG07), Pop3 (CcLG08 and CcLG11) and Pop5 (CcLG05, CcLG07and CcLG09) showed a reversal of marker order between component genetic map and consensus map as revealed by the negative correlation coefficients (“r”; Table 6). Similarly, CcLGs from Pop4 that contributed any markers to the consensus map displayed poor collinearity with the consensus map. Finally, genome-wide, there were 13 gaps larger than 10 cM (one each on CcLG02 and CcLG11, two each on CcLG05, CcLG09 and CcLG10, and three each on CcLG03 and CcLG07). Such gaps have been thought to result from recombination hotspots or regions that are identical-by-descent and thus lack of polymorphisms²⁴.

Table 6 Summary of a pigeonpea consensus genetic map constructed from five component genetic maps.

Full size table

Main effect QTLs for SPC and agronomic traits and their colocalization

Phenotyping data together with SNP genotyping data were used for QTL analysis in all five F₂ populations using CIM and ICIM. Based on the phenotypic variance explained (PVE), identified M-QTLs were classified as major (≥10% PVE) and minor (<10% PVE). For each F₂ population, details on M-QTLs identified have been explained below.

Seed protein content

A total of 48 M-QTLs were detected for SPC across the five mapping populations (Table 7, Supplementary Table S2). Six of the M-QTLs were detected by both CIM and ICIM with two in Pop2 (CcLG03 and CcLG11) and Pop4 (CcLG02 and CcLG06), and one M-QTL each in Pop3 (CcLG02) and Pop5 (CcLG02). There were 13 major and 35 minor M-QTLs across the five populations. The PVE by each of the major M-QTLs ranged from 10.0% (Pop1, Pop3) to 23.5% (Pop3) while that of the minor M-QTLs ranged from 0.7% (Pop2) to 9.5% (Pop3). There were three major M-QTLs each in Pop1 and Pop5 and two each in Pop2 and Pop3. Eight of the major M-QTLs (three each in Pop1 and Pop5, and one each in Pop2 and Pop4) and 18 of the minor M-QTLs (two in Pop1, six each in Pop3 and Pop5, and four in Pop4) showed negative additive effects. The remaining five major M-QTLs (one in Pop2, two each in Pop3 and Pop4) and 17 minor M-QTLs (one in Pop1, four in Pop2, five each in Pop3 and Pop4, and two in Pop5) showed positive additive effects.

Table 7 Summary of main effect QTLs detected by composite interval mapping (CIM) and inclusive composite interval mapping (ICIM) for seed protein content (SPC), 100-seed weight (SW), seed yield (SY), days to first flowering (DFF) and growth habit (GH) in five F₂ mapping populations of pigeonpea. Pop1: ICP 11605 × ICP 14209, Pop2: ICP 8863 × ICP 11605, Pop3: HPL 24 × ICP 11605, Pop4: ICP 8863 × ICPL 87119, Pop5: ICP 5529 × ICP 11605, PVE: phenotypic variation explained by a QTL. Number in parenthesis represents numbers of major M-QTLs (PVE% ≥ 10.0%). ^†Number of QTLs detected by either CIM or ICIM. ^‡Number of unique QTLs detected by either or both CIM and ICIM. PVE: phenotypic variation explained by a QTL. LOD: Logarithm of odds.

Full size table

100-seed weight

Thirty M-QTLs were detected for SW across the five mapping populations (Table 7, Supplementary Table S2). Five of the M-QTLs were detected by both CIM and ICIM with one each in Pop2 (CcLG01), Pop4 (CcLG03), and Pop5 (CcLG01), and two in Pop3 (CcLG01 and CcLG08. There were 16 major and 14 minor M-QTLs across the five populations. The PVE by each of the major M-QTLs ranged from 10.1% (Pop4) to 46.6% (Pop3) while that of the minor M-QTLs ranged from 3.6% (Pop1) to 9.4% (Pop4). There were three major M-QTLs in Pop1, one each in Pop2 and Pop5, four in Pop3 and seven in Pop4. Six of the major M-QTLs (four in Pop3 and two in Pop4), and six of the minor M-QTLs (four in Pop1, and one each in Pop3 and Pop5) showed negative additive effects. The remaining 10 major M-QTLs (three in Pop2, one each in Pop1 and Pop5, and five in Pop4) and eight minor M-QTLs (one in Pop1, two each in Pop2 and Pop4, and three in Pop5) showed positive additive effects.

Seed yield

A total of 40 M-QTLs were detected for SY across the five mapping populations (Table 7, Supplementary Table S2). Seven of the M-QTLs were detected by both CIM and ICIM with one in Pop1 on CcLG03, two in Pop4 (CcLG03, CcLG011), three in Pop3 (CcLG02, CcLG04, CcLG11) and one in Pop4 (CcLG11). There were 17 major and 23 minor M-QTLs across the five populations. The PVE by each of the major M-QTLs ranged from 10.2% (Pop1) to 53.0% (Pop4) while that of the minor M-QTLs ranged from 1.7% (Pop2) to 9.8% (Pop4). There were two major M-QTLs each in Pop1 and Pop2, five in Pop3, seven in Pop4 and one in Pop5. Six of the major M-QTLs (one each in Pop1, Pop2, Pop3 and Pop5, and two in Pop4) and 15 of the minor M-QTLs (three each in Pop1, Pop3 and Pop4, four in Pop2, and two in Pop5) showed negative additive effects. The remaining 11 major M-QTLs (one each in Pop1 and Pop2, four in Pop3 and five in Pop4) and eight minor M-QTLs (one in Pop1, two each in Pop2 and Pop5, and three in Pop3 showed positive additive effects.

Growth habit

Twenty eight M-QTLs were detected for GH across the four populations in which there was segregation for the trait (Table 7, Supplementary Table S2). Six of the M-QTLs were detected by both CIM and ICIM all on CcLG03 with two in Pop1, and one each in Pop2, Pop3 and Pop5. There were 19 major and nine minor M-QTLs across the populations. The PVE by each of the major M-QTLs ranged from 10.9% (Pop1) to 91.3% (Pop1) while that of the minor M-QTLs ranged from 3.4% (Pop5) to 6.5% (Pop3). There were three major M-QTLs in Pop1, four in Pop2, three in Pop3, and nine in Pop5. Four of the major M-QTLs (three in Pop3 and one in Pop5) and five of the minor M-QTLs (one each in Pop1, Pop2 and Pop5, and two in Pop3) showed negative additive effects to indeterminate GH. The remaining 15 major M-QTLs (three in Pop1, four in Pop2 and eight in Pop5) and four minor M-QTLs (three in Pop3 and one in Pop5) showed positive additive effects to indeterminate GH.

Days to first flowering

In total, 47 M-QTLs were detected for DFF across the five populations (Table 7, Supplementary Table S2). Eleven of the M-QTLs were detected by both CIM and ICIM with one in Pop1 on CcLG03, two in Pop2 (CcLG03, CcLG11), five in Pop4 (CcLG01, CcLG06, CcLG08, CcLG11) and two in Pop5 (CcLG03). There were 24 major and 23 minor M-QTLs across the populations. The PVE by each of the major M-QTLs ranged from 10.9% (Pop4) to 47.6% (Pop5) while that of the minor M-QTLs ranged from 2.1% (Pop4) to 9.8% (Pop4). There were two major M-QTLs in Pop1, four in Pop2, five in Pop3, and 11 in Pop4. Seventeen of the major M-QTLs (two each in Pop1 and Pop5, three in Pop2, one in Pop3, and nine in Pop4), and eight of the minor M-QTLs (two in Pop1, one each in Pop2, Pop3 and Pop5; and three in Pop4) showed negative additive effects to delayed DFF. The remaining seven major M-QTLs (one in Pop1, four in Pop3 and two in Pop5) and 15 minor M-QTLs (two each in Pop1 and Pop2, three in Pop3, seven in Pop4 and one in Pop5) showed positive additive effects to delayed DFF.

QTL colocalization and correlations among traits

One M-QTL region in Pop1 for which SPC and DFF shared one of the flanking SNPs (S3_18226407) on CcLG03 showed negative and positive additive effects on SPC and DFF, respectively (Supplementary Table S2; Fig. S1), indicating it contributed to the observed negative correlation between the two traits (Table 2). Another M-QTL region region flanked by SNPs S3_14813065 and S3_14778845 also in Pop1 on CcLG03 had positive and negative additive effects on GH and DFF, respectively, and likely contributed to the observed negative correlation between the two traits.

Similarly, M-QTL flanked by SNPs S3_22234078 and S3_19578263 on CcLG03 in Pop2 showed pleiotropic effect on SPC, SY, GH and DFF (Supplementary Table S2, Supplementary Fig. S2). The M-QTL region displayed positive additive effects on SPC and GH, but negative additive effects on DFF and SY, possibly explaining the high positive correlation between SPC and GH, and negative correlation of SPC with DFF and SY, respectively. In the same Pop2, SPC shared an M-QTL with SW on CcLG01 in a region fanked by SNPs S1_15372966 and S1_9033631 with positive additive effect on both traits thus possibly contributing to the positive correlation between the two traits (Table 2). Another region in the same Pop2 on CcLG11 flanked by SNPs S11_21940736 and S11_18137395 conditioned SPC and SY having positive and negative additive effects, respectively, and likely resulted to the negative correlation between the two traits (Table 2).

In Pop3, three M-QTL regions flanked by SNPs S3_28538775 and S3_22913898, S3_18154848 and S3_17193829, and S3_18154875 and S3_14813065 all on CcLG03 affected SPC, GH and DFF with additive effects being negative for SPC and GH, and positive in two and negative in one of the M-QTLs for DFF (Supplementary Tables S2, Supplementary Fig. S3) possibly contributing to the positive correlation between SPC and GH, and negative correlation between SPC and DFF, and between GH and DFF (Table 2). Another M-QTL region in Pop3 on CcLG04 flanked by SNPs S4_3592410 and S4_2761907 had negative and positive additive effects on SY and GH, respectively, and likely contributed to the negative correlations between the two traits. A third M-QTL region in the same Pop3 flanked by SNPs S2_2989918 and S2_2144739 on CcLG02 conditioned both SPC and SY having positive and negative additive effects, respectively, and likely contributed to the negative though none significant correlation between the traits.

Similarly, a QTL region flanked by SNPs S1_1145802 and S1_11242012 on CcLG01 in Pop4 conditioned both SY and DFF with positive and negative additive effects, respectively (Supplementary Table S2, Fig. S4), and likely contributed to the negative though non-significant correlation between the traits (Table 2). Additionally, a QTL on CcLG02, flanked by SNPs S2_11771536 and S2_10960200 conditioned SW and DFF with positive and negative additive effects, respectively, and likely contributed to the negative correlation between the two traits. A major M-QTL on CcLG10 flanked by SNPs S10_15140940 and S10_632618 influenced both SW and SY with positive additive effects on both traits, indicating it contributed to the positive correlation between the traits. There were two tight linkages (0.1 cM distance), one between M-QTLs for SPC and DFF, and another between SPC and SY both on CcLG11.

Neither CIM nor ICIM detected any overlap or tight linkage of M-QTLs in Pop5 between any of the measured traits (Supplementary Table S2, Fig. S5) although significant correlations were detected between SPC and SY, GH and DFF, SW and DFF, SY and DFF, and SY and GH (Table 2).

Consensus genetic and main effect QTLs across populations

Forty-one, 26, 27, 28 and 31 out of a total of 48, 30 40, 28 and 47 M-QTLs for SPC, SW, SY, GH and DFF, respectively, from the five mapping populations could be projected onto the consensus genetic map. Twenty-four (60%) of the projected SPC M-QTLs could be placed into six consensus QTL regions (Supplementary Fig. S7). The consensus SPC QTLs contained M-QTLs from two populations (Consensus-PROT-QTL 1, Consensus-PROT-QTL 2 and Consensus-PROT-QTL 5), three populations (Consensus-PROT-QTL 3) and four populations (Consensus-PROT-QTL 4 and Consensus-PROT-QTL 6). Out of the 26 M-QTLs for SW projected onto the consensus genetic map, only 13 could be collapsed into four consensus QTL regions, namely Consensus-SW-QTL 1 with QTLs from two populations on CcLG01 and Consensus-SW-QTL 2 with QTLs from three populations on CcLG01, Consensus-SW-QTL 3 on CcLG06 with QTLs from 2 populations and Consensus-SW-QTL 4 on CcLG08 with QTLs from 3 populations. For SY, only four out of 27 M-QTLs projected onto the consensus genetic map could be put into two consensus regions. Consensus-SY-QTL 1 and Consensus-SY-QTL 2 on CcLG03 and CcLG11 consisted of M-QTLs from two populations each. For DFF 26 M-QTLs could be put in to four consensus QTL regions with QTLs from two populations in Consensus-DFF- QTL 1 on CcLG02, four populations (Consensus-DFF-QTL 2) on CcLG03 and three populations (Consensus-DFF-QTL 3 and Consensus-DFF-QTL 4) on CcLG11. In the case of GH, 20 M-QTLs from four populations could be put in to one consensus QTL region (Consensus-GH-QTL 1) on CcLG03.

Ten QTL clusters could be recognised (Supplementary Fig. S7). QTL-Cluster 1 on CcLG01 and QTL-Cluster 6 on CcLG04 each harboured one minor M-QTL for each of SPC and SW. QTL-Cluster 2 on CcLG02, QTL-Cluster 9 on CcLG11 and QTL-Cluster 10 on CcLG11 each haboured M-QTLs for all measured traits. Two M-QTLs each for SPC, SW and SY in QTL-Cluster 3 and QTL-Cluster 5 both on CcLG02 contained M-QTLs for SPC, SY and DFF. QTL-Cluster 4 on CcLG03 harboured M-QTLs for SPC, SY, GH and DFF, while QTL-Cluster 7 on CcLG06 contained M-QTLs for SW and SY only. QTL-Cluster 8 on CcLG09 contained M-QTLs for SPC and GH only. There were two M-QTLs for SPC, one each for SW, GH and DFF, and three for SY in QTL-Cluster 2. Of the M-QTLs in QTL-Cluster 2, there were two for each of SPC, SW and SY, and one for DFF with PVE ≥ 10.0%. In the case of QTL-Cluster 3, two M-QTLs for SPC and one for SY were major, while in QTL-Cluster 4 two, 15, 12 and one M-QTLs for SPC, GH, DFF, and SY, respectively, showed large effects (>10.0%). In contrast, QTL-Cluster 5 harboured only one major M-QTL for SY colocalising with minor M-QTLs (<10.0%) for other traits. QTL-clusters 6, 7 and 8 harboured only minor M-QTLs, while QTL-Cluster 9 contained two major M-QTLs for each of SPC and SY, three for each of DFF and SW, and one for GH. QTL-Cluster 10 was made up of one major M-QTL for each of SPC, GH, SW and SY, and two for DFF.

Epistatic QTLs

To gain more insight into the complexity of the genetic control of SPC and its relationship with other traits, epistatic QTLs (E-QTLs) were mapped in each of the five F₂ populations using QTL Icimapping software v4.0 (http://www.isbreeding.net/software/?type=detail&id=14) (Table 8; Supplementary Table S3). Pop2 had the highest number of E-QTLs (173) while Pop4 had the lowest number (52) across traits. Among traits, SPC had the lowest number of E-QTLs ranging from two in Pop3 to 11 in Pop1 while GH had the highest number ranging from 40 in Pop1 to 56 in Pop2 (Table 8). The E-QTLs were detected on all CcLGs in each population. Overall, E-QTLs made large contributions to the phenotypic variations of the measured traits ranging from 6.3% for DFF in Pop1 to 99.4% for GH in Pop2 (Table 8). In the case of SPC as the core trait in this study, E-QTLs accounted for 12.8 to 31.2% (Pop1), 55.0 to 69.8% (Pop2), 19.3 to 21.2% (Pop3), 9.8 to 30.5% (Pop4) and 9.5 to 21.2% (Pop5) of the within-population SPC variations (Table 8; Supplementary Table S3). For the agronomic traits, there were five to 26 E-QTLs with PVE of 6.3 to 38.4% for DFF, 40 to 56 E-QTLs (PVE = 10.4 to 99.4%) for GH, eight to 63 (PVE = 11.5 to 41.8%) for SW, 12 to 39 (PVE = 10.6 to 38.5%) for SY. No common within-trait E-QTL pairs were detected in all of the five populations, however, eight SNPs were each found to flank at least one member of an E-QTL pair for GH and SW in two to three populations, while four SNPs each flanking at least one member of E-QTL pair for SY were detected in two populations. E-QTLs for SPC and DFF were highly population-specific without any commonly shared markers among populations.

Table 8 Summary of epistatic QTLs detected for seed protein content (SPC), 100-seed weight (SW), seed yield (SY), days to first flowering (DFF) and growth habit (GH) in five F₂ mapping populations of pigeonpea. Pop1: ICP 11605 × ICP 14209, Pop2: ICP 8863 × ICP 11605, Pop3: HPL 24 × ICP 11605, Pop4: ICP 8863 × ICPL 87119, Pop5: ICP 5529 × ICP 11605. E-QTLs: epistatic QTLs, PVE: phenotypic variation explained. Number in parenthesis represents numbers of major E-QTLs (PVE% ≥ 10.0%), PVE: phenotypic variation explained by a QTL, LOD: Logarithm of odds.

Full size table

E-QTLs shared among traits within and across populations

The number of E-QTL pairs shared between SPC and the agronomic traits were variable depending on the population (Fig. 3). In Pop1, SPC shared E-QTLs with SW, SY and GH. In Pop2, SPC shared E-QTLs with SW, SY, DFF and GH, while in Pop3, SPC shared E-QTL markers with SW and GH. In Pop4, SPC shared two E-QTLs with SY, and one E-QTL with DFF. In Pop5, two E-QTLs for SPC were shared with SW, and one with SY. The five populations also had some E-QTL pairs in common but with varying effects on measured traits. For example, SNP S5_4199522 on CcLG05 flanked several members of E-QTL pairs affecting SY in Pop3 and Pop5, GH in Pop2, Pop3 and Pop5, DFF in Pop5, and SW and SPC in Pop2. The E-QTL pairs with at least one member flanked by this SNP explained 15.1% (GH in Pop2) to 64.8% (SPC in Pop2) of the observed phenotypic variation. Another SNP, S7_14683829, flanked a member of E-QTL pairs to influence GH in Pop1, Pop2, Pop3 and Pop5, SW in Pop3, SPC and SY. Similarly, a number of E-QTLs having SNP S7_14683829 as one of the flanking markers on CcLG07 influenced GH in Pop1 and Pop2, SW in Pop3 and all five measured traits in Pop5. Interestingly, an Indel marker (s3-20698771) derived from CcTFL1 on CcLG03 which co-segregates with the Dt1 locus¹² flanked three epistatically acting QTLs to influence GH with PVEs of 73.8, 74.3 and 69.4% in Pop5. A 2-phosphoglycerate kinase (2PGK) gene-derived non-synonymous SNP (nsSNP, s4-496463²⁵) together with SNP S4_1710877 flanked a QTL on CcLG04 which interacted with other QTLs on CcLG07, CcLG08 and CcLG11 to influence GH (17.3 and 18.2%) and SW (19.5 and 19.5%) in Pop5. The Indel marker and the nsSNP also separately flanked a member of a pair of two other E-QTLs to influence GH with a PVE as high as 73.8% in the same population.

Discussion

To detect the QTLs conditioning SPC and its relationship with agronomic traits, we used parental lines with only moderate contrast in SPC (0.8 to 3.5%) between any pair parents of a cross. Wide segregation among the F₂ progenies of a cross beyond what is expected from parental values was observed in the F₂ populations, indicating transgressive segregation, a phenomenon commonly observed for SPC in other legumes such as soybean^26,27 and pea^28,29. Strong quantitative variations with transgression for days to flowering, SY and SW were also observed in the present study, consistent with reports of earlier studies in segregating populations of pigeonpea¹⁰.

Given that a 5-cM SNP spacing is considered sufficiently dense for optimized QTL detection power³⁰, the SNP marker spacing in each of the five populations in the present study provides adequate power to detect a QTL. Marker segregation distortion was observed in all the five populations with similar proportion of markers showing deviation from expectation. Segregation distortion could have resulted from various factors such as residual heterozygosity, gametic or zygotic selections and genotyping errors³¹. It is a common phenomenon observed in both intra- and inter-specific crosses and has been reported in several crops including pigeonpea^9,12,13 and chickpea³². Although distorted markers have generally been discarded in earlier studies, evidence indicates that distorted markers can be potentially helpful in the detection of QTLs³³. It has also been noted that discarding distorted markers could possibly remove substantial amounts of information and reduce genome coverage³⁴. Thus, in the present study distorted markers segregating in 1:2:1 Mendelian ratio with χ² cutoff P ≥ 10⁻⁹ were retained for genetic map construction. By integrating the five component genetic maps into a consensus genetic map, conserved marker orders were observed among the five genetic maps that could be attributed to use of relatively similar population size (137 to 179), same type of mapping populations (all F₂s) and same type of marker system (GBS-derived SNPs)⁹. The constructed genetic maps were then used for QTL analysis to map genomic regions associated with SPC and four agronomic traits.

In pigeonpea, QTLs have been mapped for plant type and earliness including days to flowering and growth habit^10,12,35 and disease resistance¹³. However, such studies have lacked for SPC and it is only till recently that we developed gene-derived sequence-based markers using whole genome resequencing of pigeonpea parental lines²⁵. Genomic regions associated with SPC and correlated traits offer opportunity to develop varieties with enhanced SPC and stable yield using genomics-assisted breeding approaches. In this context, analyses of QTLs for SPC and agronomic characters (SW, SY, DFF and GH) were conducted based on five populations. To ensure reliability of detected QTLs the present investigation used two methods, CIM and ICIM. ICIM also facilitated the detection of E-QTLs. Although both methods detected comparable number of M-QTLs across traits in the studied populations, CIM detected slightly more M-QTLs than ICIM, which agrees with results of an earlier study³⁶. Three or more M-QTLs were detected by both programs while ICIM detected two or more E-QTLs for each trait in each of the studied populations. The involvement of several M- and E-QTLs for each of the measured traits explained observed variations in the traits and indicate quantitative inheritance. Colocalized M-QTLs as well as E-QTLs explained trait correlations in each of the populations studied.

The detection of two to three major and several modifier/minor effect M-QTLs for SPC spread on nearly all linkage groups of pigeonpea in each of the five studied populations is in agreement with results obtained in soybean^26,27. The M-QTLs for SPC were highly population-specific although three CcLGs contained at least one major M-QTL in two to three of the five populations. The three CcLGs (CcLG02, CcLG03 and CcLG11) also contained M-QTLs with the highest PVE than M-QTLs on the other CcLGs suggesting their relative importance in harbouring genomic regions governing SPC in the pigeonpea. The localisation of the major M-QTLs on CcLG02 in three of the five populations in the present study could be supported by the detection on the same chromosome of some genes known for their functional role in seed storage protein accumulation such as NADH-GOGAT²⁵, or for their location in the vicinity of QTL regions associated with variability of SPC in plants³⁷. By projecting the population specific M-QTLs to the consensus genetic map, six consensus genomic regions, each comprising M-QTLs for SPC from two to four populations were generated. Such consensus regions may be targeted for further investigation in future studies. Across the five mapping populations, SPC increasing alleles were contributed by both the low and high trait parents. The majority of the trait increasing alleles from the low trait parent were minor except in Pop2 and Pop4 for which the respective low SPC parent contributed one major M-QTL each. Whereas it has been concluded that SPC in pigeonpea is conditioned by recessive oligo-genes³⁸, it is apparent from our results that the trait is polygenic with a combination of gene actions conditioning its variation in the crop. This observation is in agreement with earlier conclusions that SPC is conditioned by both additive and non-additive genes in pigeonpea³⁹. Predominance of non-additive types of gene action in the present study is also in agreement with earlier observations in pigeonpea⁴⁰ and other legumes^41,42,43.

The detection of at least two genomic regions for DFF in each population is in agreement with reports of previous studies in pigeonpea⁴⁴, but contrasts with the results of Kumawat et al.¹⁰ who reported only one major M-QTL for the trait. The detection of the majority of M-QTLs for DFF on CcLG03 in four of the five mapping populations is consistent with an earlier detection of a well-known flowering time gene CcTFL1 on the same CcLG03^12,45. Most of the M-QTLs for DFF and GH tended to colocalize to the same genomic regions, flanked by the same SNP loci, which is in agreement with observations that genes conditioning flowering time have pleiotropic effect on GH^35,45,46. The pleiotropic M-QTLs largely acted recessively in conditioning GH, consistent with observations in other crops⁴⁷, but the same loci acted either additively, dominantly, partially dominantly or overdominantly on DFF which agrees with earlier reports on genetics of early flowering^39,48,49,50.

The detection of at least two and up to seven major and several minor M-QTLs to condition SW in the crop indicates quantitative inheritance for the trait, consistent with findings in other legume crops^{28,51,52,53,54}. A genomic region on CcLG01 consistently showed highest PVE in three of the five mapping populations suggesting a common major genomic loci segregating in a wide range of genetic backgrounds, similar to reports in soybean⁵¹. With the exception of one major M-QTL on CcLG01, all other M-QTLs for SW showed population specificity. The diversity of QTL gene action observed for SW in this study mirrors earlier reports where recessiveness, dominance and overdominance have been reported to condition SW in plants^55,56,57.

QTLs for SY (on plant basis) have been mapped in other crops^58,59, but the present study is the first in pigeonpea. The detection of highly population-specific minor and major effect M-QTLs for SY on nearly all CcLGs points to a complex genetic architecture of the trait. Detection of large effect population-specific M-QTLs on CcLG10 in three of the five populations, and minor and major M-QTLs on CcLG01, CcLG02, CcLG03 and CcLG11 in three to four populations suggests the relative importance of the chromosomes in hosting genomic regions associated with the trait.

The pervasiveness of population-specific M-QTLs for SPC, SY, SW and to a lesser extent for DFF could be attributed to effects of population size or marker coverage²⁰. However, this is unlikely because population-specific M-QTLs of relatively minor effects ranging from 0.7% to 8.6% across traits were mapped in the five populations. Rather, it is possible that a QTL detected in a certain cross may not be detected in another cross because the parents of the second cross carry identical alleles at the same locus^17,20.

E-QTLs were detected that explained additional phenotypic variation for SPC and the other traits. Effects of E-QTLs have been reported in other legume crops such as soybean for SPC^26,27,60 and SW^60,61. Similarly, E-QTLs for SW, SY, flowering time and GH have been reported in common bean⁶². The large number of E-QTLs for SPC and for the agronomic traits identified in present study indicates that QTLs with minor effects or no effect interact with each other to influence expression of the traits. Such scenarios have been reported in other crop plants^60,62. Uniquely, Pop2 displayed the highest contribution of E-QTL effects on phenotypic variation of all measured traits in the present study. It is likely that the relatively low marker density in Pop2 contributed to the high PVE of the E-QTLs in this population. Across populations, the number of E-QTLs detected also varied by trait. The pattern of contributions of M-QTLs vs E-QTLs to phenotypic variation for the studied traits seemed to be highly genetic background-dependent as has been frequently reported in other crops^62,63,64,65^.

In an earlier study, we reported a number of putative candidate gene-based nsSNP for SPC, some of which significantly cosegregated with the trait in an F₂ validation population, ICP 5529 × ICP 11605²⁵. Similarly, Saxena et al.¹² developed a CcFTL1 gene-based Indel marker whose cosegregation with the determinate GH locus Dt1 was validated in the same F₂ population. The same F₂ ICP 5529 × ICP 11605 population is one of the populations used in the present study. Interestingly, a number of the gene-derived markers also flanked M-QTLs and/or E-QTLs with significant effect on the agronomic traits. For example, the detection of a major M-QTL for GH (qGH-icim-4.1; PVE = 13.1%) on CcLG04 with one of the flanking markers being a 2PGK gene-derived nsSNP likely indicates the role of the gene on GH in the crop. A further evidence for the influence of the 2PGK gene on GH is that qGH-icim-4.1 epistatically interacted with another major M-QTL for GH (qGH-icim-3.2; PVE = 61.6%) on CcLG03 resulting in an E-QTL with a much higher PVE of 73.8% than that of the individual M-QTLs. One of the flanking markers to qGH-icim-3.2 is the CcTFL1 gene-derived Indel marker¹². The involvement of the 2PGK gene-derived nsSNP in another epistatically acting QTL to influence SW also agrees with colocalization of 2PGK gene with a QTL for SW in pea²⁸. One more gene of interest from our results is Sucrose synthase (Sus6) from which a derived nsSNP flanked an epistatically acting QTL on CcLG01 to influence SW with the resultant E-QTL having a major effect (11.5%). Sucrose synthase has for long been known to play a major role in SW in several crop plants as mentioned in Turner et al.⁶⁶. The role of Sus6 as one of the possible determinants of SW in our study is further indicated by location of the derived nsSNP in the vicinity of Consensus-SW-QTL 1 comprising a major SW M-QTL from Pop2 and a minor SW M-QTL from Pop3 on CcLG01 of the consensus genetic map. The same nsSNP was only 3.1 cM away from a major M-QTL on CcLG01 in Pop5.

In this study, two lines of evidence revealed the associations between SPC and the other plant traits, and that the nature of the associations is genetic background-dependent. First, the phenotypic correlation analysis showed that SPC associates positively with GH and SW and negatively with DFF and SY. The pattern of correlation of SPC with SW is consistent with results of earlier studies which showed that the two traits associate either positively or negatively and sometimes non-significantly depending on genetic material used⁶. In the case of SPC with DFF, negative though small and none significant relationships have been reported in pigeonpea^8,67. The negative and relatively weak correlation between SPC and SY in the present study is consistent and within the range previously reported in pigeonpea^8,67,68, and soybean^69,70. No relationship between SPC and GH has been reported in pigeonpea before. However, the indeterminate and determinate GH in soybean have been reported to be associated with high and low SPC, respectively⁷¹. Significant correlation of SPC with morphological and growth-related traits have also been reported in pea²⁸.

Second, colocalization of M-QTLs and shared E-QTLs for SPC with that of the other traits were found that possibly explains trait correlations. For instance, the colocalization of M-QTLs for SPC with M-QTLs for DFF with opposite allelic effects could explain the negative correlations between SPC and DFF in Pop1, Pop2, Pop3 and Pop4 though the correlations were non-significant in Pop2 and Pop4. Similarly, the colocalization of M-QTLs for SPC and M-QTLs for GH with allelic effects in the same direction in Pop1, Pop2 and Pop3 explains positive correlation between the two traits. Likewise, correlation of SPC with SW in Pop2 could be explained by the overlapping M-QTLs on CcLG02 with allelic effects in the same direction. While the negative correlation of SPC with SY could be attributed to opposing effect of colocalized M-QTLs for the two traits such as in Pop2.

However, not all correlations of SPC with agronomic traits could be explained by colocalization of M-QTLs, for instance, GH and SY showed relatively strong correlation with SPC in Pop4 but no M-QTL overlaps were present. Therefore, presence of E-QTLs shared between SPC and the agronomic traits were searched that could explain correlations that are not explained by the M-QTLs. The phenomenon where one E-QTL affects expression of more than one trait have been termed ‘epistatic pleiotropy’⁷². In this regard, the majority of epistatic pleiotropy involving SPC and other traits in the present study are the type in which the effects of a given pleiotropic locus are dependent upon the alleles present at the other loci⁷³. For example, in Pop1 a QTL on CcLG01 flanked by markers S1_4757043 and S1_1575466, affected (i) SPC when it interacted with other QTLs on CcLG07 and CcLG08, (ii) SW when it interacted with QTLs on CcLG02 and CcLG06, and (iii) SY when it interacted with a QTL on CcLG03.

Similarly, a single epistatically pleiotropic QTL (EP-QTL) on CcLG01 (S1_887236 and S1_3399209) in Pop3 influenced the expression of SPC, SW and GH when it interacted with other QTLs on CcLG02 and CcLG03 and possibly contributed to the significant covariation between SPC and SW, and SPC and GH. Such EP-QTLs involving SPC were widespread among populations, and in some cases provided the only explanation to phenotypic correlation between SPC and the other traits. For instance, the significant correlation between SPC and SY in Pop4 in the absence of overlaps in their M-QTLs could be explained by EP-QTL on CcLG07 flanked by markers S7_14683829 and S7_14588865. The same EP-QTL also influenced expression of SW and DFF although the two traits show weak and non-significant correlation with SPC. In Pop5, three EP-QTLs were detected, two of which influenced SPC and SY, and one influenced SPC and SW even though no significant relationships of SPC with SW and SY were found. Tuberosa et al.⁷⁴ noted that the occurrence of QTL colocalization for multiple traits that possibly share a common morpho-physiological basis, or that are reasonably associated on a cause-effect basis, should lower the chance of declaring false positives in the regions where QTLs overlap.

In conclusion, two to three major M-QTLs in the presence of several modifier/minor effect QTLs, and with additive and non-additive QTL gene action types including epistasis, control the expression of SPC in the present study. Overlaps of main effect and E-QTLs explain the correlations between SPC and agronomic traits. Projection of M-QTLs for SPC and agronomic traits onto the consensus map revealed common genomic regions governing SPC and its relationship with agronomic traits across different genetic backgrounds. Among the genomic regions, QTL Cluster 5 (CcLG03), QTL Cluster 10 (CcLG11) and QTL Cluster 9 (CcLG11), in order of increasing importance, harboured M-QTLs for two or more traits and therefore may be targeted for the simultaneous improvement of the associated characters. More trait-specific regions such as Consensus-PROT-QTL 1 and Consensus-PROT-QTL 2 (CcLG02), Consensus-PROT-QTL 4 (CcLG04), Consensus-PROT-QTL 6 (CcLG11), Consensus-SW-QTL 1 and Consensus-SW-QTL 2 (CcLG01), Consensus-DFF-QTL 1 (CcLG03) and Consensus-GH-QTL 1 (CcLG03) as well as the more population-specific SY M-QTLs with large PVEs on CcLG03 (Pop1 and Pop2), CcLG04 (Pop3), CcLG05 (Pop4), CcLG10 (Pop3 and Pop4) and CcLG11 (Pop2) could also be targeted for the improvement of the traits. The genomic regions identified in the present study would pave the path for early generation screening of large segregating populations or screening of germplasm resources and haplotype based breeding for identification of plants/genotypes carrying favourable alleles/haplotypes and minimizing the negative correlation effect of other traits on SPC. By this way high yielding lines with higher SPC could be developed with less resources and time. However, the large contribution of epistasis to the variation and correlation among the traits and the presence of a large number of population-specific M-QTLs for each of the traits, suggests that breeding approaches that target genome wide variations such as genomic selection⁷⁵ would be an alternative in achieving larger genetic gains for both SPC and yield in a shorter period. Further, the validation of the results in additional germplasm and under diverse environmental conditions may be necessary to determine the stability of the QTLs identified as well as facilitate detection of other loci.

Methods

Crossing parents and seed protein content

Six pigeonpea genotypes that included ICP 11605, ICP 8863, ICP 14209, HPL 24, ICP 5529 and ICPL 87119 were used in the present study. ICP 8863 was selected from landrace ICP 7626 (P-15-3-3) and it is widely cultivated in India. It is high yielding with 100-seed weight of ~9.5 g and matures in 150–160 days. It is resistant to fusarium wilt (FW) but susceptible to sterility mosaic (SM) virus⁷⁶. ICP 8863 has moderate SPC of ~22.0%. ICP 11605 (ICPL 151) was selected from the cross ICP 6997 × Prabhat. It is a determinate cultivar, yielding ~1.03 t/ha with 100-seed weight of 10 g and matures in 120–130 days⁷⁷ and has a low SPC of ~20.9%. ICP 14209 is a landrace variety with moderate SPC (23.0%). ICPL 87119 was developed from the cross ICP 1-6-W3–Wl × C 11 and it is widely adapted and cultivated in India. It matures in 160–180 days, is high yielding and has resistance to FW and SM⁷⁸. It is low in SPC (~19.3%). HPL 24 is an advanced breeding line derived from the cross of cultivar C. cajan cv Baigani × C. scarabaeoides previously reported to have ~30% SPC⁶. It is indeterminate and of medium maturity duration. ICP 5529 with pedigree P-4864-1, originated from India. It is indeterminate with medium maturity duration and with SPC indicated to be 27%.

Mapping populations, field experiments and phenotyping

In order to develop the mapping populations (F₂), five crosses were made: ICP 11605 × ICP 14209, ICP 8863 × ICP 11605, HPL 24 × ICP 11605, ICP 8863 × ICPL 87119 and ICP 5529 × ICP 11605. For brevity, the populations are hereafter referred to as Pop1, Pop2, Pop3, Pop4 and Pop5, respectively. One F₁ plant was selfed to generate F₂ seeds in each of the five populations. For trait evaluation, the parents and 350 to 400 F₂ seeds were sown under field conditions to ensure an adequate number of plants. Sowing was done in 4 m long rows spaced 75 cm apart and 30 cm plant to plant distance within a row. Plot sizes were two rows for each of the two parents and 25 to 28 rows in the F₂. All cultural practices were carried out. At maturity individual pods from individual plants were carefully hand-harvested leaving out plants at the beginning and at the end of each row and those at the field borders to avoid border effects. Sun drying was done for one week before threshing and another one week after threshing to ensure uniform reduction in seed moisture content. Seed protein content was measured as described in Obala et al.^25,67. Besides SPC, data were also recorded for SW in grams, SY in grams per plant, DFF, and GH scored as determinate or indeterminate.

DNA isolation and genotyping

Total genomic DNA (gDNA) from 188 F₂ plants and the parents from each of the five mapping populations were isolated and genotyped-by-sequencing as described in Saxena et al.^12,13. Briefly, the sequence reads obtained from the Illumina HiSeq. 2500 platform were used for SNP identification and genotyping using GBS analysis pipeline implemented in TASSEL v4.020 (TASSEL-GBS)⁷⁹. Firstly, the reads were sorted, separated according to the sample barcodes and trimmed to first 64 bases starting from the enzyme cut site. Reads containing ‘N’ within the first 64 bases and reads with >50% of low-quality base pairs (Phred <5%) were discarded. The filtered, high-quality reads from each sample were aligned to the pigeonpea draft genome sequence (C. cajan v1.0)² using Bowtie 2 sequence alignment software. The alignment file was processed through TASSEL-GBS pipeline for SNP calling and genotyping. The quality of SNPs called in each F₂ individual was compared with the SNPs identified in parental lines. The parental line SNPs were obtained from whole-genome resequencing (WGRS) data⁸⁰. SNPs having confident parental calls were considered for further analysis. SNPs and F₂ individuals having more than 30% and 70% missing data, respectively, were filtered out. The quality SNP data was used for construction of genetic maps and QTL analysis.

Construction of population-specific genetic maps

Four of the five population-specific genetic maps were constructed in the present study while the remaining one population-specific map was constructed under a separate project¹². The construction of all five population-specific genetic maps followed the same procedure as described in Saxena et al.^12,13.

Construction of consensus genetic map

Genotyping data from the five F₂ genetic maps were used to develop a consensus genetic map using JoinMap v4.1 following the procedure described by Bohra et al.⁹. To assess the level of correspondence in the order of markers between consensus and component genetic maps, correlation coefficients (r) were calculated from marker positions in consensus and individual genetic maps and their significance were tested. To further visualize the extent of correlation between consensus and component maps, scatter plots were generated between each of the consensus linkage group and corresponding component linkage group from all the populations. A comparative mapping programme CMap v1.01⁸¹ was used to align all developed genetic maps together to visually assess the congruency of marker orders.

QTL mapping

Composite interval mapping (CIM) implemented in Windows QTL Cartographer v2.5⁸² and inclusive composite interval mapping (ICIM) implemented in QTL Icimapping v4.0⁸³ were used to detect main effect QTLs (M-QTLs) while epistatic QTLs (E-QTLs) were detected using ICIM. The advantage of both CIM and ICIM is that they are regression-based and are therefore robust against non-Gaussian trait distribution⁸⁴. For CIM, the Standard Model 6, walk speed of 1.0 cM, and forward-backward stepwise regression for setting number of marker cofactors for background control were used to identify M-QTLs. To leave out signals within 10.0 cM distance on either side of the flanking markers or QTL test site, a window size of 10 cM was used. Thresholds for declaring QTLs were determined by 1000 permutations at significance of 0.05.

In using ICIM to detect M-QTLs, marker selection was performed just once using stepwise regression and considering all marker information simultaneously⁸⁵. Phenotypic values were then adjusted by all markers retained in the regression equation, except the two markers flanking the current mapping interval. Permutation tests were conducted using SPC in the five F₂ mapping populations to determine the criteria for model selection in the first step of ICIM. For all five F₂ populations, the probability of a marker moving into the model corresponding to the overall type I error α = 0.05 was approximately 10⁻⁵. The probability of a marker moving out of the model was set at twice the probability of a marker moving into the model. The LOD threshold to declare the existence of a QTL was calculated by permutation tests as well. However, because of the always conservative nature of thresholds retained from permutation tests⁸⁶, a default LOD threshold of 2.5 was used to report QTLs and determine common (consensus) QTLs across populations.

Furthermore, where M-QTL identified by CIM was also detected by ICIM, the region was considered as one QTL. Similarly, where an M-QTL for a given trait identified by either CIM or ICIM colocalize with M-QTL(s) of other traits detected by either of the two methods, the region was treated as a region of co-localisation. Type of gene action for each M-QTL was derived from the dominance coefficient (h) defined as the ratio between the observed QTL dominance effect (d) and absolute value of QTL additive effects (|a|)⁸⁷. We used the absolute value of additive effects because the sign of a QTL effect only shows which parent contributed the favorable allele but not the true direction of the specific additive effect⁸⁷. The h was then arbitrarily categorized as under-dominant or recessive (h < 0), additive (h = 0–0.20), partially dominant (h = 0.21–0.80), dominant (h = 0.81–1.20) and over-dominant (h > 1.20)⁸⁸.

For E-QTL mapping, all possible pairs of scanning positions were tested by ICIM, since digenic interactions may be detected regardless of whether the two interacting QTLs have significant additive effects or not⁸⁵. The probability of a marker moving into the model was set at 10⁻⁶ while the probability of a marker moving out of the model was set at twice the probability of a marker moving into the model⁸⁵. The default QTL-Icimapping LOD threshold of 5.0 was used to declare the existence of E-QTLs.

Common or consensus QTLs across five F₂ populations

Due to differences in the individual genetic maps, it was difficult to directly find common QTLs across the five populations on the basis of the QTL or marker position in each genetic map. Therefore, QTLs obtained in each of the five individual populations were projected onto the consensus map by using either QTL peak- or flanking-marker positions indicated in the individual population genetic map using a procedure adopted from Schweizer and Stein⁸⁹ as follows. If only peak-marker positions from the individual map were available, the QTL region was assumed by default to extend 5 cM left and right of the peak-marker position, resulting in a confidence interval of 10 cM. If only one flanking marker could be projected onto the consensus map, a QTL interval of 10 cM extension left or right from the left or right flanking marker, respectively, was assumed by default. If neither peak nor flanking markers were included in the consensus map, nearby tightly linked markers (maximum of 5 cM from the peak or flanking markers) were searched on the consensus map. If no replacement markers could be identified within this distance, the QTL was excluded from the analysis. Based on these projections, two types of common QTLs were defined. Firstly, a ‘Consensus QTL’ was defined as any region of the consensus genetic map with overlapping M-QTL intervals for a particular trait from more than one population. Secondly, a region of consensus genetic map at which M-QTL interval for one trait overlaps with that of one or more of the other traits was considered a ‘QTL Cluster’

QTL nomenclature

For individual populations, a specific identifier was assigned to each QTL, whereby “q” stands for QTL, followed by a set of upper case letters indicating the trait, followed by linkage group (CcLG) name, then a hyphen, method of QTL detection, and lastly, the QTL number on that CcLG in ascending order. For example, the designation “qPROT-cim-3.1” stands for “QTL for SPC” detected using CIM on LG “CcLG03” and it is the first QTL for SPC on that CcLG. For QTLs projected onto the consensus genetic map, a prefix is added to the QTL name indicating the source population. For example “Pop1qPROT-cim-3.1” indicates a QTL for SPC from Pop1.

References

Li, L. et al. QQS orphan gene regulates carbon and nitrogen partitioning across species via NF-YC interactions. Proceed. Natl. Acad. Sci. USA 112, 14734–14739 (2015).
Article ADS CAS Google Scholar
Varshney, R. K. et al. Draft genome sequence of pigeonpea (Cajanus cajan), an orphan legume crop of resource-poor farmers. Nat. Biotechnol. 30, 83–89 (2012).
Article CAS Google Scholar
Rao, P., Birthal, P. S., Bhagavatula, S. & Bantilan, M. C. S. Chickpea and pigeonpea economies in Asia: facts, trends and outlook. (International Crops Research Institute for the Semi-Arid Tropics, 2010).
Akibode, C. S. & Maredia M. Global and regional trends in production, trade and consumption of food legume crops. (Report submitted to the Standing Panel on Impact Assessment (SPIA) of the CGIAR Science Council, 2011).
Khoury, C. K. et al. Crop wild relatives of pigeonpea (Cajanus cajan (L.) Millsp.) distributions, ex situ conservation status, and potential genetic resources for abiotic stress tolerance. Biol. Conserv. 184, 259–270 (2015).
Article Google Scholar
Saxena, K. B., Kumar, R.V. & Rao, P.V. Pigeonpea nutrition and its improvement in quality improvement in field crops (ed. Basra, A. S. & Randhawa, I. S.) 227–260 (Food Products Press, 2002).
Vaghela, K. O., Desai, R. T., Nizama, J. R., Patel, J. D. & Sharma, V. Combining ability analysis in pigeonpea (Cajanus cajan (L.) Millsp.) Legume Res. 32, 274–277 (2009).
Google Scholar
Rekha, R., Prasanthi, L., Sekhar, M. R. & Priya, M. S. Studies on selection indices in pigeonpea (Cajanus cajan (L.) Millsp). Int. J. Appl. Biol. Pharm. Technol. 4, 291–294 (2013).
Google Scholar
Bohra, A. et al. An intraspecific consensus genetic map of pigeonpea (Cajanus cajan (L) Millspaugh) derived from six mapping populations. Theor. Appl. Genet. 125, 1325–1338 (2012).
Article PubMed PubMed Central Google Scholar
Kumawat, G. et al. Molecular mapping of QTLs for plant type and earliness traits in pigeonpea (Cajanus cajan L. Millsp.). BMC Genetics 13, 84 (2012).
Article PubMed PubMed Central Google Scholar
Singh, V. K. et al. Next-generation sequencing for identification of candidate genes for fusarium wilt and sterility mosaic disease in pigeonpea (Cajanus cajan). Plant Biotechnol. J. 14, 1183–1194 (2015).
Article PubMed PubMed Central CAS Google Scholar
Saxena, R. K. et al. Characterization and mapping of Dt1 locus which co-segregates with CcTFL1 for growth habit in pigeonpea. Theor. Appl. Genet. 130, 1773–1784 (2017).
Article CAS PubMed PubMed Central Google Scholar
Saxena, R. K. et al. Construction of genotyping-by-sequencing based high-density genetic maps and QTL mapping for fusarium wilt resistance in pigeonpea. Sci. Rep. 7, 1911 (2017).
Article ADS PubMed PubMed Central CAS Google Scholar
Pazhamala, L. T. et al. Gene expression atlas of pigeonpea and its application to gain insights into gene associated with pollen fertility in seed formation. J. Exp. Bot. 68, 2037–2054 (2017).
Article CAS PubMed PubMed Central Google Scholar
Krishnan, H. B., Natarajan, S. S., Oehrle, N. W., Garrett, W. M. & Darwish, O. Proteomic analysis of pigeonpea (Cajanus cajan) seeds reveals the accumulation of numerous stress-related proteins. J. Agric. Food Chem. 65, 4572–4581 (2017).
Article CAS PubMed Google Scholar
Abiola, O. et al. The identification of quantitative trait loci: a community’s view. Nature Rev. Genet. 4, 911–916 (2003).
PubMed Google Scholar
Simon, M. et al. Quantitative trait loci mapping in five new large recombinant inbred line populations of Arabidopsis thaliana genotyped with consensus single-nucleotide polymorphism markers. Genetics 178, 2253–2264 (2008).
Article CAS PubMed PubMed Central Google Scholar
Lynch, M. & Walsh, B. Genetics and analysis of quantitative traits. (Sinauer Associates, 1998).
Cui, Y., Zhang, F., Xu, J., Li, Z. & Xu, S. Mapping quantitative trait loci in selected breeding populations: A segregation distortion approach. Heredity (Edinb.) 115, 538–546 (2014).
Article Google Scholar
Symonds, V. V. et al. Mapping quantitative trait loci in multiple populations of Arabidopsis thaliana identifies natural allelic variation for trichome density. Genetics 168, 1649–1658 (2005).
Article CAS Google Scholar
Gao, W. et al. Multi-trait QTL analysis for agronomic and quality characters of Agaricus bisporus (button mushrooms). AMB Expr. 6, 67 (2016).
Article Google Scholar
Varshney, R. K. Genetic dissection of drought tolerance in chickpea (Cicer arietinum L.). Theor. Appl. Genet. 127, 445462 (2014).
Google Scholar
Argyris, J. M. QTL analyses in multiple populations employed for the fine mapping and identification of candidate genes at a locus affecting sugar accumulation in melon (Cucumis melo L.). Front. Plant Sci. 8, 1679 (2017).
Article PubMed PubMed Central Google Scholar
Cassava, I. & Map, G. High-resolution linkage map and chromosome-scale genome assembly for cassava (Manihot esculenta Crantz) from 10 populations. G3 (Bethesda) 5, 133–144 (2015).
Article CAS Google Scholar
Obala, J. et al. Development of sequence-based markers for seed protein content in pigeonpea. Mol. Genet. Genomics 294, 57–68 (2019).
Article CAS PubMed Google Scholar
Zhang, Y. H. et al. Marker-assisted breeding for transgressive seed protein content in soybean [Glycine max (L.) Merr.]. Theor. Appl. Genet. 128, 1061–1072 (2015).
Article ADS CAS PubMed Google Scholar
Soybase. 2016. The soybean breeder’s toolbox genetic map information, https://www.soybase.org/search/index.php?qtl=Prot (2016).
Burstin, J. et al. Developmental genes have pleiotropic effects on plant morphology and source capacity, eventually impacting on seed protein content and productivity in pea. Plant Physiol. 144, 768–781 (2007).
Article CAS PubMed PubMed Central Google Scholar
Krajewski, P. et al. QTL for yield components and protein content: a multi-environment study of two pea (Pisum sativum L.) populations. Euphytica 183, 323–336 (2012).
Article CAS Google Scholar
Stange, M., Utz, H. F., Schrag, T. A., Melchinger, A. E. & Würschum, T. High-density genotyping: an overkill for QTL mapping? Lessons learned from a case study in maize and simulations. Theor. Appl. Genet. 126, 2563–2574 (2013).
Article CAS PubMed Google Scholar
Liang, S. X., Zhen, S. X. & Zhen, Z. T. Segregation distortion and its effect on genetic mapping in plants. Chin. J. Agric. Biotechnol. 3, 163–169 (2006).
Article Google Scholar
Gaur, R. et al. Advancing the STMS genomic resources for defining new locations on the intraspecific genetic linkage map of chickpea (Cicer arietinum L.). BMC Genomics 12, 117 (2011).
Article CAS PubMed PubMed Central Google Scholar
Xu, S. Quantitative trait locus mapping can benefit from segregation distortion. Genetics 180, 2201–2208 (2008).
Article PubMed PubMed Central Google Scholar
Luo, L., Zhang, Y.-M. & Xu, S. A quantitative genetics model for viability selection. Heredity 94, 347–355 (2005).
Article CAS PubMed Google Scholar
Mir, R. R. et al. Candidate gene analysis for determinacy in pigeonpea (Cajanus spp.). Theor. Appl. Genet. 127, 2663–2678 (2014).
Article CAS PubMed PubMed Central Google Scholar
Ding, G. et al. Identification and multiple comparisons of QTL and epistatic interaction conferring high yield under boron and phosphorus deprivation in Brassica napus. Euphytica 198, 337–351 (2014).
Article CAS Google Scholar
Lestari, P., Van, K., Lee, J., Kang, Y. J. & Lee, S.-H. Gene divergence of homeologous regions associated with a major seed protein content QTL in soybean. Front. Plant Sci. 4, 176 (2013).
Article PubMed PubMed Central Google Scholar
Saxena, K. B. & Sawargaonkar, S. L. Genetic enhancement of seed proteins in pigeonpea – methodologies, accomplishments and opportunities. Int. J. Sci. Res. 4, 254–258 (2015).
Google Scholar
Saxena, K. B. & Sharma, D. Pigeonpea genetics. In The Pigeonpea. (ed. Nene, Y. L., Hall, S. D. & Sheila, V. K.) 137–158 (CAB International, 1990).
Vaghela, K. O., Desai, R. T., Nizama, J. R., Patel, J. D. & Sharma, V. Combining ability analysis in pigeonpea (Cajanus cajan (L.) Millsp.). Legume Res. 32, 274–277 (2009).
Google Scholar
Tchiagam, J.-B. N., Bell, J. M., Nassourou, A. M., Njintang, N. Y. & Youmbi, E. Genetic analysis of seed proteins contents in cowpea (Vigna unguiculata L. Walp.). Afr. J. Biotechnol. 10, 3077–3086 (2011).
Article CAS Google Scholar
Tiwari, D. S., Singh, V. & Shukla, P. S. Combining ability in mungbean (Vigna radiata (L.) Wilczek). Indian. J. Genet. 53, 395–398 (1993).
Google Scholar
Mebrahtu, T. & Mohamed, A. A seven-parental diallel analysis of nutritional composition of common beans. Plant Food. Hum. Nutr. 58, 1–11 (2003).
Article Google Scholar
Craufurd, P. Q., Soko, H. S., Jones, J. K. & Summerfield, R. J. Inheritance of duration from sowing to first flowering in pigeonpea. Euphytica 119, 323–333 (2001).
Article Google Scholar
Melzer, S. et al. Flowering-time genes modulate meristem determinacy and growth form in Arabidopsis thaliana. Nature Genet. 40, 1489–1492 (2008).
Article CAS PubMed Google Scholar
Repinski, S. L., Kwak, M. & Gepts, P. The common bean growth habit gene PvTFL1y is a functional homolog of Arabidopsis TFL1. Theor. Appl. Genet. 124, 1539–1547 (2012).
Article CAS PubMed Google Scholar
Kaur, H. & Banga, S. S. Discovery and mapping of Brassica juncea Sdt1 gene associated with determinate plant growth habit. Theor. Appl. Genet. 128, 235–245 (2015).
Article CAS PubMed Google Scholar
Gupta, S. C., Saxena, K. B. & Sharma, D. Inheritance of days to flower and seed size in pigeonpea. Proceedings of the International Workshop on Pigeonpeas (International Crops Research Institute for the Semi-Arid Tropics, 1981).
Bonato, E. R. & Vello, N. A. E6, a dominant gene conditioning early flowering and maturity in soybeans. Genet. Mol. Biol. 22, 229–23 (1999).
Article Google Scholar
Adhikari, K., Buirchell, B., Yan, G. & Sweetingham, M. Two complementary dominant genes control flowering time in albus lupin (Lupinus albus L.). Plant Breed. 130, 496–499 (2011).
Article CAS Google Scholar
Han, Y. et al. QTL analysis of soybean seed weight across multi-genetic backgrounds and environments. Theor. Appl. Genet. 125, 671–683 (2012).
Article CAS PubMed Google Scholar
Wu, D. et al. Identification of quantitative trait loci underlying soybean (Glycine max (L.) Merr.) seed weight including main, epistatic and QTL × environment effects in different regions of Northeast China. Plant Breed. 137, 194–202 (2018).
Article CAS Google Scholar
Park, S. O. et al. Mapping of QTL for seed size and shape traits in common bean. J. Am. Soc. Hort. Sci. 125, 466–475 (2000).
Article CAS Google Scholar
Alam, A. K. M. M., Somta, P., Muktadir, M. A. & Srinives, P. Quantitative trait loci associated with seed weight in mungbean (Vigna radiata (L.) Wilczek). Kasetsart J. (Nat. Sci.) 48, 197–204 (2014).
Google Scholar
Irzykowska, L. & Wolko, B. Interval mapping of QTLs controlling yield-related traits and seed protein content in Pisum sativum. J. Appl. Genet. 45, 297–306 (2004).
PubMed Google Scholar
Abbo, S., Ladizinsky, G. & Weeden, N. F. Genetic analysis and linkage study of seed weight in lentil. Euphytica 58, 259–266 (1991).
Article Google Scholar
King, A. J. et al. Identification of QTL markers contributing to plant growth, oil yield and fatty acid composition in the oilseed crop Jatropha curcas L. Biotechnol. Biofuels 8, 1–17 (2015).
Article CAS Google Scholar
Su, C. et al. High-density linkage map construction and mapping of yield trait QTLs in maize (Zea mays) using the genotyping-by-sequencing (GBS) technology. Front. Plant Sci. 8, 706 (2017).
Article PubMed PubMed Central Google Scholar
Jing, Y. et al. Identification of the genomic region underlying seed weight per plant in soybean (Glycine max L. Merr.) via high-throughput single-nucleotide polymorphisms and a genome-wide association study. Front. Plant Sci. 9, 1392 (2018).
Article PubMed PubMed Central Google Scholar
Zhang, Y. et al. Construction of a high-density genetic map and mapping of QTLs for soybean (Glycine max) agronomic and seed quality traits by specific length amplified fragment sequencing. BMC Genomics 19, 641 (2018).
Article PubMed PubMed Central CAS Google Scholar
Xin, D. et al. QTL location and epistatic effect analysis of 100-seed weight using wild soybean (Glycine soja Sieb. & Zucc.) chromosome segment substitution lines. PLoS One 11, e0149380 (2016).
Article PubMed PubMed Central CAS Google Scholar
González, A. M. et al. Major contribution of flowering time and vegetative growth to plant production in common bean as deduced from a comparative genetic mapping. Front. Plant Sci. 7, 1940 (2016).
PubMed PubMed Central Google Scholar
Liao, C. Y., Wu, P., Hu, B. & Yi, K. K. Effects of genetic background and environment on QTLs and epistasis for rice (Oryza sativa L.) panicle number. Theor. Appl. Genet. 103, 104–111 (2001).
Article CAS Google Scholar
Cheng, L. et al. Identification of salt-tolerant QTLs with strong genetic background effect using two sets of reciprocal introgression lines in rice. Genome 55, 45–55 (2012).
Article CAS PubMed Google Scholar
Wang, Y., Arenas, C. D., Stoebel, D. M. & Cooper, T. F. Genetic background affects epistatic interactions between two beneficial mutations. Biol. Lett. 9, 20120328 (2012).
Article PubMed Google Scholar
Turner, N. C. et al. Seed size is associated with sucrose synthase activity in developing cotyledons of chickpea. Crop Sci. 49, 621–627 (2009).
Article Google Scholar
Obala, J. et al. Genetic variation and relationships of total seed protein content with some agronomic traits in pigeonpea (Cajanus cajan (L.) Millsp.). Aus. J. Crop Sci. 12, 1859–1865 (2018).
Article CAS Google Scholar
Dahiya, B. S., Brar, J. S. & Bhullar, B. S. Inheritance of protein content and its correlation with grain yield in pigeonpea (Cajanus cajan (L.) MilIsp. Plant Food. Hum. Nutr. 27, 327–334 (1977).
Article CAS Google Scholar
Assefa, Y. et al. Spatial characterization of soybean yield and quality (amino acids, oil, and protein) for United States. Sci. Rep. 8, 14653 (2018).
Article ADS CAS PubMed PubMed Central Google Scholar
Mello Filho, O. L. et al. Seed yield and seed quality of soybean selected for high protein content. Pesq. Agropec. Bras. 39, 445–450 (2004).
Article Google Scholar
Ouatara, S. & Weaver, B. Effect of growth habit on yield and agronomic characteristics of late-planted soybean. Crop Sci. 34, 870–873 (1994).
Article Google Scholar
Wolf, J. B., Leamy, L. J., Routman, E. J. & Cheverud, J. M. Epistatic pleiotropy and the genetic architecture of covariation within early and late-developing skull trait complexes in mice. Genetics 171, 683–694 (2005).
Article CAS PubMed PubMed Central Google Scholar
Cheverud, J. M. et al. Pleiotropic effects on mandibular morphology II: differential epistasis and genetic variation in morphological integration. J. Exp. Zoolog. 302, 424–435 (2004).
Article CAS Google Scholar
Tuberosa, R. et al. Mapping QTLs regulating morpho-physiological traits and yield: case studies, shortcomings and perspectives in drought-stressed maize. Ann. Bot. 89, 941–963 (2002).
Article CAS PubMed PubMed Central Google Scholar
Michel, S. et al. Simultaneous selection for grain yield and protein content in genomics-assisted wheat breeding. Crop Sci. 132, 1745–1760 (2019).
CAS Google Scholar
ICRISAT. Pigeonpea variety ICP 8863. (International Crops Research Institute for the Semi-Arid Tropics, Plant Material Description No. 44, 1993).
ICRISAT. Pigeonpea variety ICPL 151. International Crops Research Institute for the Semi-Arid Tropics, Plant Material Description No. 41, 1993).
ICRISAT. Pigeonpea variety ICPL 87119. International Crops Research Institute for the Semi-Arid Tropics, Plant Material Description No. 43, 1993).
Glaubitz, J. C. et al. TASSEL-GBS: a high capacity genotyping-by-sequencing analysis pipeline. PLoS One 9, e90346 (2014).
Article ADS PubMed PubMed Central CAS Google Scholar
Kumar, V., Khan, A. W., Saxena, R. K., Garg, V. & Varshney, R. K. First-generation HapMap in Cajanus spp. reveals untapped variations in parental lines of mapping populations. Plant Biotechnol. J. 14, 1673–81 (2016).
Article CAS PubMed PubMed Central Google Scholar
Youens-Clark, K., Faga, B., Yap, I. V., Stein, L. & Ware, D. CMap 1.01: a comparative mapping application for the Internet. Bioinformatics 25, 3040–3042 (2009).
Article CAS PubMed PubMed Central Google Scholar
Wang, S, Basten, C. J. & Zeng, Z.-B. Windows QTL cartographer 2.5. Department of statistics, North Carolina State University, Raleigh, North Carolina (2012).
Wang, J., Li, H., Zhang, L. & Meng, L. Users’ manual of QTL IciMapping. The quantitative genetics group, Institute of Crop Science, Chinese Academy of Agricultural Sciences (CAAS), Beijing 100081, China, and Genetic Resources Program, International Maize and Wheat Improvement Center (CIMMYT), Apdo, Mexico (2015).
Rebai, A. Comparison of methods of regression interval mapping in QTL analysis with non-normal traits. Genet. Res. 65, 68–74 (1997).
Google Scholar
Li, H. et al. A high density GBS map of bread wheat and its application for dissecting complex disease resistance traits. BMC Genomics 16, 216 (2015).
Article PubMed PubMed Central CAS Google Scholar
Anderson, M. J. & Ter-Braak, C. J. F. Permutation tests for multi-factorial analysis of variance. J. Stat. Comput. Simul 73, 85–113 (2003).
Article MathSciNet MATH Google Scholar
Sun, X. & Mumm, R. H. Method to represent the distribution of QTL additive and dominance effects associated with quantitative traits in computer simulation. BMC Bioinformatics 17, 73 (2016).
Article PubMed PubMed Central CAS Google Scholar
Vallejo, R. L. et al. Genetic mapping of quantitative trait loci affecting susceptibility tomarek’s disease virus induced tumors in F2 intercross chickens. Genetics 148, 349–360 (1998).
Article CAS PubMed PubMed Central Google Scholar
Schweizer, P. & Stein, N. Large-scale data integration reveals colocalization of gene functional groups with meta-QTL for multiple disease resistance in barley. Mol. Plant-Microbe Interact 24, 1492–1501 (2011).
Article CAS PubMed Google Scholar

Download references

Acknowledgements

Authors are thankful to the United States Agency for International Development (USAID) and Department of Agriculture Cooperation & Farmers Welfare, Ministry of Agriculture & Farmers Welfare, Government of India; for funding support. We are thankful to Vinay Kumar, Suryanayana Vechalapu and Meriga Sudhakar for the great technical assistance offered. This work has been undertaken as part of the CGIAR Research Program on Grain Legumes and Dryland Cereals (GLDC). ICRISAT is a member of CGIAR Consortium.

Author information

K. B. Saxena
Present address: Former Principal Scientist (ICRISAT); 17, NMC Housing, Al Ain, Abu Dhabi, UAE

Authors and Affiliations

International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Hyderabad, 502324, India
Jimmy Obala, Rachit K. Saxena, Vikas K. Singh, Sandip M. Kale, Vanika Garg & Rajeev K. Varshney
University of KwaZulu-Natal, African Center for Crop Improvement, Scottsville, 3209, Pietermaritzburg, South Africa
Jimmy Obala, Pangirayi Tongoona & Julia Sibiya
Professor Jayashankar Telangana State Agricultural University, Rajendranagar, Hyderabad, 500030, Telangana, India
C. V. Sameer Kumar

Authors

Jimmy Obala
View author publications
You can also search for this author in PubMed Google Scholar
Rachit K. Saxena
View author publications
You can also search for this author in PubMed Google Scholar
Vikas K. Singh
View author publications
You can also search for this author in PubMed Google Scholar
Sandip M. Kale
View author publications
You can also search for this author in PubMed Google Scholar
Vanika Garg
View author publications
You can also search for this author in PubMed Google Scholar
C. V. Sameer Kumar
View author publications
You can also search for this author in PubMed Google Scholar
K. B. Saxena
View author publications
You can also search for this author in PubMed Google Scholar
Pangirayi Tongoona
View author publications
You can also search for this author in PubMed Google Scholar
Julia Sibiya
View author publications
You can also search for this author in PubMed Google Scholar
Rajeev K. Varshney
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

R.K.V., R.K.S. and J.O. designed the experiment; J.O., R.K.S., V.G. and S.M.K. performed marker analysis of the F₂ populations. J.O., R.K.S., K.B.S. and C.V.S.K. performed field trials; J.O. collected and analysed phenotypic data from the field trials. J.O., R.K.S., R.K.V. and V.K.S. performed QTL analysis; J.O. and R.K.S. wrote the manuscript with inputs from R.K.V., K.B.S., C.V.S.K., V.K.S., P.T. and J.S. All authors reviewed and approved the submission.

Corresponding author

Correspondence to Rajeev K. Varshney.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Obala, J., Saxena, R.K., Singh, V.K. et al. Seed protein content and its relationships with agronomic traits in pigeonpea is controlled by both main and epistatic effects QTLs. Sci Rep 10, 214 (2020). https://doi.org/10.1038/s41598-019-56903-z

Download citation

Received: 30 August 2019
Accepted: 10 December 2019
Published: 14 January 2020
DOI: https://doi.org/10.1038/s41598-019-56903-z

This article is cited by

A genomic toolkit for winged bean Psophocarpus tetragonolobus
- Wai Kuan Ho
- Alberto Stefano Tanzi
- Sean Mayes
Nature Communications (2024)
QTL-seq for the identification of candidate genes for days to flowering and leaf shape in pigeonpea
- Vikas Singh
- Pallavi Sinha
- Rajeev K. Varshney
Heredity (2022)
Multi-omics strategies and prospects to enhance seed quality and nutritional traits in pigeonpea
- Nisha Singh
- Vandna Rai
- Nagendra Kumar Singh
The Nucleus (2020)
Mapping QTL for important seed traits in an interspecific F2 population of pigeonpea
- Abhishek Bohra
- Rintu Jha
- Narendra P. Singh
3 Biotech (2020)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Phenotypic variation in SPC and agronomic traits in five mapping populations

Phenotypic correlation among traits

Sequence data and SNP discovery

Population specific genetic maps

Consensus genetic map

Collinearity between component and consensus genetic maps

Main effect QTLs for SPC and agronomic traits and their colocalization

Seed protein content

100-seed weight

Seed yield

Growth habit

Days to first flowering

QTL colocalization and correlations among traits

Consensus genetic and main effect QTLs across populations

Epistatic QTLs

E-QTLs shared among traits within and across populations

Discussion

Methods

Crossing parents and seed protein content

Mapping populations, field experiments and phenotyping

DNA isolation and genotyping

Construction of population-specific genetic maps

Construction of consensus genetic map

QTL mapping

Common or consensus QTLs across five F2 populations

QTL nomenclature

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links

Common or consensus QTLs across five F₂ populations