Genome-wide association study uncovers genomic regions associated with grain iron, zinc and protein content in pearl millet

Pujar, Mahesh; Gangaprasad, S.; Govindaraj, Mahalingam; Gangurde, Sunil S.; Kanatti, A.; Kudapa, Himabindu

doi:10.1038/s41598-020-76230-y

Download PDF

Article
Open access
Published: 10 November 2020

Genome-wide association study uncovers genomic regions associated with grain iron, zinc and protein content in pearl millet

Mahesh Pujar^1,2,
S. Gangaprasad²,
Mahalingam Govindaraj¹,
Sunil S. Gangurde¹,
A. Kanatti¹ &
…
Himabindu Kudapa¹

Scientific Reports volume 10, Article number: 19473 (2020) Cite this article

4004 Accesses
32 Citations
3 Altmetric
Metrics details

Subjects

Abstract

Pearl millet hybrids biofortified with iron (Fe) and zinc (Zn) promise to be part of a long-term strategy to combat micronutrient malnutrition in the arid and semi-arid tropical (SAT) regions of the world. Biofortification through molecular breeding is the way forward to achieving a rapid trait-based breeding strategy. This genome-wide association study (GWAS) was conducted to identify significant marker-trait associations (MTAs) for Fe, Zn, and protein content (PC) for enhanced biofortification breeding. A diverse panel of 281 advanced inbred lines was evaluated for Fe, Zn, and PC over two seasons. Phenotypic evaluation revealed high variability (Fe: 32–120 mg kg⁻¹, Zn: 19–87 mg kg⁻¹, PC: 8–16%), heritability (h_bs² ≥ 90%) and significantly positive correlation among Fe, Zn and PC (P = 0.01), implying concurrent improvement. Based on the Diversity Arrays Technology (DArT) seq assay, 58,719 highly informative SNPs were filtered for association mapping. Population structure analysis showed six major genetic groups (K = 6). A total of 78 MTAs were identified, of which 18 were associated with Fe, 43 with Zn, and 17 with PC. Four SNPs viz., Pgl04_64673688, Pgl05_135500493, Pgl05_144482656, and Pgl07_101483782 located on chromosomes Pgl04 (1), Pgl05 (2) and Pgl07 (1), respectively were co-segregated for Fe and Zn. Promising genes, ‘Late embryogenesis abundant protein’, ‘Myb domain’, ‘pentatricopeptide repeat’, and ‘iron ion binding’ coded by 8 SNPs were identified. The SNPs/genes identified in the present study presents prospects for genomics assisted biofortification breeding in pearl millet.

Genetic dissection of grain iron and zinc, and thousand kernel weight in wheat (Triticum aestivum L.) using genome-wide association study

Article Open access 20 July 2022

Discovery and validation of candidate genes for grain iron and zinc metabolism in pearl millet [Pennisetum glaucum (L.) R. Br.]

Article Open access 06 October 2020

Genome-wide association study identifies loci and candidate genes for grain micronutrients and quality traits in wheat (Triticum aestivum L.)

Article Open access 29 April 2022

Introduction

Pearl millet is a climate-resilient crop that accounts for two-thirds of the global millet production. The crop covers more than 31 million hectares worldwide and is grown in more than 30 countries in the arid and semi-arid tropical as well as subtropical regions of Asia, Africa, and Latin America. In Asia, India is the largest producer of pearl millet, where it is grown on 9 million hectares with a production of 8.3 million tons¹. In the African region, West and Central Africa has the largest area under the crop—15 million hectares—and has an annual production of 14.1 million tons. Pearl millet is a diploid (2n = 14) cross-pollinating crop (> 80%) with a genome size of ~ 1.79 GB². Its domestication occurred in regions with low fertility soils, heat, and drought, making it naturally adapted to face the challenges associated with climate change.

Pearl millet grains are naturally nutritious and contain high fiber (1.2 g/100 g) and low starch. They are the richest source of grain Fe and Zn compared to other cereals³. Iron and zinc are two important micronutrients that play a vital role in human health. Iron is required for psychomotor development, maintenance of physical activity and work capacity, and resistance to infection⁴, whereas zinc is required for the growth and maintenance of the human immune system; hence it aids in both the prevention of and recovery from various diseases⁵. Apart from Fe and Zn, pearl millet is also rich in grain protein content (8–19%) that is almost at par with that in wheat (11.6 vs 11.8 g/100 g) and considerably higher than that in rice (6.8 g/100 g), sorghum (10.4 g/100 g) and maize (4.7 g/100 g)⁶. High-quality proteins are essential for the physical and mental well-being of humans, especially children^7,8.

Diets deficient in Fe and Zn (micronutrient malnutrition) or protein alone or in combination lead to malnutrition which is also known as ‘hidden hunger’. It has been estimated that over 2 billion people across the world suffer from micronutrient deficiencies in developing countries like Africa and India⁹. Anaemia is alarmingly high, especially among pregnant women (40%) and children (42%) below 5 years¹⁰. In addition to this, cereal proteins deficient in essential amino acids such as methionine, lysine, and tryptophan are a matter of concern in developing countries¹¹. Kwashiorkor, oedema, and marasmus are some of the severe forms of protein deficiency¹². To combat hidden hunger, biofortification, wherein grain micronutrients along with grain protein contents are genetically enhanced through either conventional or molecular breeding, is gaining popularity. Genomics-assisted breeding holds potential for the rapid improvement of varieties using diagnostic markers^13,14.

The wide variability for grain Fe and Zn content in pearl millet unveils the great prospect of developing biofortified pearl millet varieties and hybrids. The International Crops Research Institute for the Semi-Arid Tropics (ICRISAT) has been working towards developing biofortified hybrids and has successfully delivered high-Fe pearl millet varieties and hybrids with high yield potential in India and Africa¹⁵. Biofortification using conventional breeding is time-consuming and incurs a high cost in terms of screening hybrid parental lines for micronutrients and protein in every generation. Hence, it is important to develop a cost-effective strategy to improve nutritional traits in pearl millet breeding programs. Furthermore, Fe and Zn are complex traits governed by additive genes and are affected by G × E interactions. Nutritional traits are very complex and governed by a group of genes. It is a challenge to track the genomic regions/genes that are either directly or indirectly responsible for Fe and Zn loading in the grains. Genome analysis tools provide access to thousands of genomic polymorphisms, considerably broadening the ability to monitor and effectively utilize genetic diversity¹⁶. Quantitative trait loci (QTL) mapping based on linkage analysis provides the high power of QTL detection of a trait of interest; it has a very low mapping resolution because of the few recombination events that it takes into consideration which would ultimately lead to long linkage blocks¹⁷.

Advances in high throughput genotyping technologies such as genotyping-by-sequencing (GBS)¹⁸, DArT¹⁹, and GWAS have enabled the use of these powerful approaches in dissecting quantitative traits²⁰. GWAS is a robust approach that has been successfully applied in the past to identify genomic regions controlling grain/kernel Fe and Zn contents in maize²¹, rice²², and wheat²³. GWAS has been successfully applied in wheat and maize to identify grain PC. The availability of the draft genome of pearl millet² provides the advantage of single nucleotide polymorphism (SNP) and candidate gene discovery. Single nucleotide polymorphism markers are desirable for GWAS, genomic selection, and QTL mapping²⁴. GWAS exploits millions of SNPs generated across the whole genome through GBS, whole-genome re-sequencing (WGRS), DArT, and DArT seq using a diverse group of germplasm lines. GWAS is very effective in pearl millet due to faster LD-decay². The discovery of SNP markers and their validation will help in developing diagnostic markers that can be deployed to develop biofortified pearl millet varieties/hybrids with elevated Fe and Zn content. This study aims to evaluate genetic variability for grain Fe, and Zn and PC among GWAS panel to discover the genomic regions associated with Fe, Zn, and PC in order to develop diagnostic markers for use in the pearl millet biofortification breeding program.

Results

Variability for Fe, Zn, and PC

The analysis of variance recorded significant (P < 0.01) mean squares for Fe, Zn, and PC among the inbred lines. Descriptive statistics revealed the presence of significant variability (Fig. 1) with high heritability (> 90% h_bs²) for three traits studied among 281 GWAS panel of pearl millet (Table 1). The Fe content in grains among inbred lines varied from 32 to 120 mg kg⁻¹ with an average of 74 mg kg⁻¹ (SEm = 2.72). The Zn content in grains varied from 19 to 87 mg kg⁻¹ with an average of 46 mg kg⁻¹ (SEm = 1.39), whereas the PC varied from 8 to 16% with an average of 11% (SEm = 3.06). Among the 281 inbred lines evaluated, 19%, 15%, and 14% of inbred lines belonging to the seed parents whereas, 24%, 18%, and 20% of inbreds belonging to restorer parents recorded higher Fe, and Zn, and PC, respectively in comparison with the overall trial mean. Furthermore, significant (P < 0.01) G × E interaction was recorded for all three traits. Pearson’s correlation coefficient revealed high significant (r = 0.77, P < 0.01) positive association between Fe and Zn, whereas PC recorded significant but moderate positive association with Fe (r = 0.38, P < 0.01) and Zn (r = 0.44, P < 0.01) (Supplementary Fig. S1).

Table 1 Estimates of mean, variance, range and heritability for pooled analysis of phenotypic evaluation of 281 inbred lines across 2017 rainy and 2018 summer, ICRISAT, Patancheru. CV, coefficient of variation; SEm, Standard error of mean; * and **, F-values significant at 0.05, 0.01 probability level.

Full size table

Genome-wide marker profiling

A total of 87,748 DArT seq markers were generated from the 281 GWAS panel representing restorer parents (R-lines), seed parents (B-lines), germplasm progenies and population progenies. The DArT seq markers were subjected to filtering and data quality check. All the SNP loci with > 30% missing data and rare SNPs with < 10% minor allele frequencies (MAF) were filtered and a total of 58,719 high-quality SNPs (derived from the DArT seq platform) were considered for further analysis (Fig. 2).

Population structure and linkage disequilibrium

Dissection of the population structure of the association panel using SNP markers revealed a total of six (K = 6) genetic groups at the corresponding least cross validation error (CV error) of 0.659 (Fig. 3A). Among the six subgroups, group VI (orange) was the largest that consisted of 53 inbreds, followed by group I (blue) with 51, group III (red) with 50, group V (yellow) with 48, group IV (green) with 47 and the group II (purple) with 32 inbred lines (Fig. 3B).

The linkage disequilibrium (LD) between each pair of SNPs across each chromosome was evaluated by the squared Pearson correlation coefficient (R²). A set of 58,719 SNPs with identified physical positions were used for LD analysis (Fig. 4). The pairwise LD across each chromosome showed that the LD (R²) ranged from 0 to 1 with the average LD across the genome being 0.116. Furthermore, chromosome-wise average LD varied in the order of 0.151 > 0.138 > 0.129 > 0.118 > 0.107 > 0.087 > 0.081 for chromosomes Pgl03, Pgl07, Pgl04, Pgl06, Pgl02, Pgl01, and Pgl05, respectively. The LD for 18,80,476 pairwise combinations obtained from 58,719 marker loci across the genome showed that 57% of SNP pairs showed < 0.01 R², whereas 37% of SNP pairs showed 0.01–0.05 R², and only 6% of SNP pairs showed 0.06–0.1 R². Linkage disequilibrium-decay (LDD) across seven chromosomes was determined using the entire set of 58,719 DArT seq markers. The LDD was plotted as LD (R²) between the adjacent pair of markers on the Y-axis against the distance in base pairs (bp) on the X-axis (Fig. 5). The R² threshold level was set to 0.2 and observed rapid LDD across the pearl millet genome with an average LDD of 2.9 kb (2900 bp). Among the seven chromosomes, the shortest LDD was observed in chromosome 1 with 0.2 kb (200 bp, R² = 0.2) and the longest LDD was observed in chromosome 6 with 9 kb (9000 bp, R² = 0.2).

Genome-wide association study

A genome-wide association mapping was performed using 58,719 high-quality SNPs with less than 30% missing data having a call rate of more than 0.7. These SNPs covered around 301 Mb of pearl millet genome and were distributed across the seven chromosomes of pearl millet with a minimum of 6534 SNPs on chromosome 7 to a maximum of 10,942 SNPs on chromosome 2. SNP genotyping data of 58, 719 SNPs along with information on population structure and kinship matrix were used for genome-wide association analysis against Fe, Zn, and PC in grains for the pooled data across the 2017 rainy season and 2018 summer season. Among two models used for GWAS, the general linear model (GLM) considering only population structure (Q) showed high genomic inflation (Fig. 6), whereas the mixed linear model (MLM) which considers both population structure and family relatedness (K) showed low genomic inflation and thus helped overcome the number of false-positive associations for Fe, Zn, and PC. Therefore, significant marker-trait associations (MTAs) finalized based only on MLM are presented here. The threshold level of ‘P’ value was set to 3.0, above which the SNPs are said to be significantly associated. A total of 78 MTAs were identified based on their ‘P’ values. Of the 78 MTAs identified across the three traits, 16 MTAs were identified on chromosome 5 followed by 14 MTAs each on chromosome 4 and chromosome 7; 13 MTAs on chromosome 1; 10 MTAs on chromosome 2; and 3 MTAs on chromosome 3 (Supplementary Table S4 for trait-wise and chromosome-wise MTAs).

Genomic regions identified for grain Fe and Zn content

A total of 61 highly significant MTAs for grain micronutrients were identified. Of the 61 MTAs, 18 were identified for Fe (Table 2; Fig. 7) with ‘P’ values ranging from 1.79 × 10^–5 to 9.83 × 10^–4 which explained 5.07 to 8.23% of phenotypic variation (PVE). The 18 markers that were identified for Fe were distributed across chromosome Pgl01 (1), Pgl02 (4), Pgl04 (7), Pgl05 (3), Pgl06 (2), and Pgl07 (1). No SNPs were found associated with chromosome Pgl03. Pgl05_135500493 was identified with the highest phenotypic variation of 8.23% for Fe with a ‘P’ value of 1.79 × 10^–5.

Table 2 Marker trait associations (MTAs) or SNPs identified for the iron (Fe), zinc (Zn) and protein content (PC) using mixed linear model (MLM) with annotations of corresponding gene.

Full size table

However, a total of 43 significantly associated markers were identified for Zn with ‘P’ values ranging from 2.24 × 10^–5 to 9.78 × 10^–4. Furthermore, the phenotypic variation explained by these SNPs ranged from 5.09 to 8.00% for Zn. These 43 markers identified were distributed across chromosomes Pgl01 (5), Pgl02 (1), Pg103 (3), Pgl04 (6), Pgl05 (12), Pg106 (5) and Pgl07 (11), respectively. Pgl07_101483782 for Zn was identified with the highest phenotypic variation of 8.00% with a ‘P’ value of 2.24 × 10^–5. A total of four SNPs (Pgl04_64673688, Pgl05_135500493, Pgl05_144482656 and Pgl07_101483782) located on three different chromosomes (4, 5 and 7) were found common among grain Fe and Zn contents (Supplementary Table S2).

Grain protein content (PC)

A total 17 MTAs were identified for PC with ‘P’ values ranging from 3.46 × 10^–4 to 9.39 × 10^–4, which explained 5.11 to 5.68% of the phenotypic variation. The 17 markers that were identified for PC were distributed across chromosomes Pgl01 (7), Pgl02 (5), Pgl04 (1), Pgl05 (1), Pgl06 (1), and Pgl07 (2). No SNPs were found associated with chromosome Pgl03. Pgl06_71295563 was identified with the phenotypic variation of ~ 6% for PC with a ‘P’ value of 3.46 × 10^–4.

Candidate genes associated with grain Fe, Zn, and PC

Pearl millet genome sequencing reported a total of 69,398 genes and unraveled the involvement of several genes in the control of both agronomically and nutritionally important traits. The physical positions of each SNP marker from the present study were compared against the pearl millet genome sequence to determine the function of the gene underlying the respective SNP. A total of 18 SNPs associated with Fe were found linked (Table 2 and Supplementary Table S3) to different genes viz., Like-Sm ribonucleoprotein (LSM) domain, late embryogenesis abundant protein, zinc finger, ankyrin repeat, leucine-rich repeat, pentatricopeptide repeat, oligopeptide transferase, and basic leucine zipper which were found to play a significant role in plant metabolism, including iron homeostasis. Similarly, the SNPs associated with the genes viz., protein kinase, Myb transcription factor, glycosyl transferase, chalcone/stilbene synthase, heat shock protein (HSP70), peptidase, copper domain, male sterility, etc., were found to be unique to Zn while protein binding, lipid binding, protein kinase activity, and iron ion binding genes were found associated with SNPs identified for PC.

Discussion

Developing biofortified hybrids in pearl millet requires high Fe and Zn content in both the parents since it’s governed by additive gene²⁵. It is highly feasible to develop biofortified inbred lines through inbreeding which accumulates more of additive variances in subsequent generations. The strong epigenetic influence on these traits expression and sample contamination during handling of breeding materials is a challenge for biofortification in pearl millet^26,27. The process of identifying molecular markers, preferably SNPs tightly linked to genomic regions of Fe, Zn and PC, will enhance the efficiency of biofortification using genomics assisted breeding. Recently, several genomic regions controlling the inheritance of Fe and Zn have been identified through QTL mapping²⁸ using DArT and SSR markers and also through LD-based association mapping²⁹ by SSR markers in pearl millet. Though SSRs are preferred markers, their resolution is relatively low¹⁷. None of the previous studies have reached the gene level; therefore, the present study aimed to dissect the genetic nature of Fe, Zn and PC in pearl millet using GWAS by exploiting the DArT seq markers to discover the genomic regions and candidate genes influencing Fe, Zn and PC.

Grain Fe and Zn content are strongly influenced by the available Fe and Zn content in the soil. The available soil Fe and Zn content in our experimental field was above the critical levels (2.6 to 4.5 mg kg^–1 Fe and 0.6 to 1.0 mg kg^–1 Zn) required for normal growth and development^30,31. Three to fourfold significant variations for Fe (32–120 mg kg⁻¹), Zn (19–87 mg kg⁻¹) and twofold variation for PC (8–16%) in 281 elite inbred lines prospects the breeding feasibility (Supplementary Table S5). Similar variability for Fe/Zn has been reported among germplasm³², breeding lines¹⁵, and commercial cultivars³³. High genetic variance for Fe/Zn indicates the least influence of G × E. Population structure along with shared co-ancestry coefficients between individuals of subdivisions of a population were estimated using ADMIXTURE 1.23⁷³. A total of six genetic groups were formed among 281 inbred lines with some admixtures indicating common allelic combinations in the genomic background of few genotypes. The availability of six subgroups and wide phenotypic variation observed for Fe, Zn, and PC indicated that the present GWAS panel is best suited for genome-wide association study to dissect the genetic basis of high Fe, Zn accumulation, and PC in pearl millet.

LD is the non-random association of alleles at two or more loci and acts as a critical genetic force in determining population structure^34,35. The LD of a population is the result of evolutionary changes in a population that would help in mapping quantitative traits such as Fe, Zn and PC more precisely while it also gives insights into the joint evolution of the linked sets of genes. The pattern of LD across the genome ultimately decides the success of association studies^36,37. In the present study, the average pairwise LD (R²) across the genome decreased rapidly against the increasing distance (bp). Rapid LDD has been reported in earlier studies in pearl millet^2,38. Chromosomes Pgl01, Pgl02, Pgl03, Pgl05, and Pgl07 showed relatively more rapid LDD (~ 0.64 kb) compared to Pgl04 and Pgl06, suggesting that a larger number of markers are required for chromosomes Pgl01, Pgl02, Pgl03, Pgl05, and Pgl07 for GWAS. The gene-rich genomic region tends to have a higher rate of recombination. Thus the LDD would be higher in such genomic regions, requiring a higher marker density for LD analysis in such regions. Of 18,80,476 pairwise LD analysis, 57% of the SNP pairs showed an LD of less than 0.01 (R² = 1%), indicating that the LD in the current GWAS panel is relatively low. This could probably be because pearl millet is a highly cross-pollinated (> 80%) species, wherein some portion of the genome is bound to have heterozygosity (not every locus is heterozygous) as genetic load by the inbreeding process³⁹. The low LD is also due to frequent recombination and higher inbreeding depression by virtue of being a cross-pollinated crop. The low value of LD in turn gives the high resolution of mapping but requires a large number of markers⁴⁰.

While performing GWAS, care should be taken to avoid false associations arising from false positives (Type I error). In the present study, two extensively used statistical models, GLM⁴¹ and MLM^42,43, were used for the MTA. The MLM model is more efficient and superior in reducing false positive associations by correcting for both population structure (Q) and kinship matrix (K) which can be further visualized through Quantile–Quantile (Q–Q) plots to show low genomic inflation for MLM compared to GLM (Fig. 6). However, sometimes MLM tends to overcompensate for both population structure and kinship, which could lead to false negatives, type II errors^44,45. This means that the identification of some MTAs depends on the model used⁴⁶. Therefore, the present study used both GLM and MLM and found that > 70% of SNPs from MLM were common in GLM, with some additional markers that were absent in GLM. Therefore, the results obtained from the MLM model are presented. None of the MTAs met the Bonferroni criteria because of the utilization of 0.058 Million markers generated through the GBS method. The Bonferroni correction would be too stringent to use as not all the markers are independent⁴⁷ and may lead to false negatives^48,49.

Among the significantly associated SNPs for Fe, marker Pgl05_135500493 on chromosome Pgl05 explained the highest phenotypic variation (8.23%). For Zn, markers Pgl07_101483782, Pgl07_101483780, and Pgl07_147179490 exhibited more than 7.5% of phenotypic variation. However, the SNPs identified for PC explained the relatively lower phenotypic variation, wherein the highest phenotypic variation was explained by the SNP Pgl07_71295563 on chromosome Pgl07 (~ 6%). Interestingly, there were four SNPs discovered to be common for both Fe and Zn content on chromosomes Pgl04, Pgl05, and Pgl07 that cumulatively explain about 27.12% and 25.32% of phenotypic variation for Fe and Zn, respectively. The co-localization of both Fe and Zn and highly significant positive correlation between them further suggested some common genes and pathways involved in Fe and Zn homeostasis in plants i.e., from root absorption to till deposition in grains. A common set of markers for Fe and Zn has been reported in pearl millet²⁹ on LG 3, LG 5, and LG 7. QTLs responsible for Fe and Zn have been co-mapped on LG 1 and LG 7²⁸; these probably indicate that chromosome Pgl05 and Pgl07 are likely to control Fe and Zn transport and accumulation in pearl millet. Though no common MTAs were identified for PC with Fe and Zn, the positive significant correlation of PC with both Fe and Zn suggested that the selection for high Fe/Zn expected to increase PC as an associated trait.

To know the conformity of the identified MTAs in this study, they were compared to previous genetic mapping studies for Fe and Zn in pearl millet. SNPs were identified for Fe and Zn in this study were concomitant to reported studies in pearl millet (Table 3). For instance, Anuradha et al.²⁹ reported that Fe was highly influenced by the genes on chromosomes Pgl05 and Pgl07, whereas Kumar et al.²⁸ identified genomic regions for Zn on chromosomes Pgl01 and Pgl04 in pearl millet. Zn content was also influenced by the SNPs on chromosome Pgl03, Pgl04, Pgl05, Pgl06 and Pgl07. Similar results were reported earlier by Anuradha et al.²⁹ while Kumar et al.²⁸ reported genomic regions on LG 1, 4, 5, and 7. This evidence suggests that the SNPs identified on chromosomes were consistent with the previously reported markers which might have a significant role to play in the expression of Fe and Zn content. This calls for fine mapping of these genomic regions that would ultimately provide candidate SNPs for use in marker-assisted breeding to improve grain Fe and Zn. Apart from pearl millet, genomic regions were also discovered for grain Fe and Zn content in other millets and cereals such as rice²², foxtail millet⁵⁰, maize²¹, wheat²³, through genome-wide association mapping. Genetic mapping studies have discovered genomic regions for grain Fe and Zn content in sorghum⁵¹, maize⁵², and wheat⁵³. Hence different genomic regions in this study can be introgressed for trait improvement in pearl millet based on the targeted environment, depending on common MTAs. This is the first report on the discovery of genomic regions using GWAS for PC in pearl millet. The findings will generate research interest to further investigate the regulation of grain PC in pearl millet. A total of 17 MTAs were identified on six chromosomes (Pgl01, Pgl02, Pgl04, Pgl05, Pgl06 and Pgl07) of pearl millet, among which Pgl06_71295563 showed the highest phenotypic variation of 5.86% with a ‘P’ value of 3.46 × 10⁴. Similar genomic regions have been reported for PC in previous studies in maize⁵⁴, rice^55,56, and wheat⁵⁷.

Table 3 QTLs reported from earlier studies for iron (Fe), zinc (Zn) in pearl millet and co-localized associated marker trait associations (MTAs) identified the same genomic region in present study.

Full size table

Gene annotation was performed by comparing the sequence reads of significantly associated SNPs at their respective physical positions against the reference genome of pearl millet. The genes identified in the present study and their functional roles in Fe and Zn metabolism in plants reported through previous studies are presented in Table 4. There were several genes identified, among which very few were involved in Fe transportation, accumulation, and homeostasis. The SNP Pgl07_147858723 corresponding to glutathione S-transferase plays a significant role in iron starvation in roots. In the roots of hexaploid wheat, a significant temporal increase in glutathione S-transferase was observed at both transcriptional and enzymatic activity levels, which established the foundation for designing breeding strategies to improve Fe nutrition in pearl millet. The SNP Pgl02_69256531 and Pgl06_231796045 were found in the region of the MYB-domain. Palmer et al.⁵⁸ observed that the MYB-domain plays a significant role in plant survival under Fe deficiency conditions, and is the most highly induced transcription factor which acted early in the Fe deficiency regulatory cascade to drive gene expression of NAS4. Shen et al.⁵⁹ isolated MYB gene MxMYB1 from Malus xiaojinensis. The expression of MxMYB1 was up-regulated by Fe starvation in the roots but not in the leaves, signifying that MxMYB1 likely to play more in iron absorption from soil to roots and not likely from root to leaves. The SNP Pgl04_190105720 corresponding to the Zinc finger plays a crucial role in preventing toxic ion damage and hence performs an important role in maintaining cellular osmotic adjustment and enzyme activities, leading to significantly improved salt stress tolerance⁶⁰.

Table 4 List of trait wise marker trait associations (MTAs) annotated in the present study and their respective role in iron (Fe) metabolism reported earlier in other crops.

Full size table

The significant phenotypic variability observed in the association panel coupled with high marker density across all chromosomes provided a strong case for whole-genome association mapping of the three (Fe, Zn, and PC) important nutritional traits in pearl millet. This GWAS study which identified marker-trait associations for Fe, Zn, and PC using the genotyping-by-sequencing platform presents greater prospects for utilization and traits mainstreaming. Rapid LDD observed in the current GWAS panel indicates that the SNPs identified through genome-wide association mapping are more reliable and complement previously reported QTLs in pearl millet. Pgl05_135500493 and Pgl05_144482656 SNPs for Fe; Pgl07_101483782, Pgl07_101483780 and Pgl07_147179490 SNPs for Zn, and Pgl06_71295563 SNPs for PC were found promising. Significant phenotypic correlations between Fe and Zn support simultaneous selection and improvement. This linkage and the identified co-localized MTA suggest there is a common physiological pathway. These MTAs help to move towards fine mapping and discovering a set of diagnostic markers to screen segregating population (F₂/F₃s) in order to avoid expensive phenotyping and G × E effects in future. Eight MTAs that were identified for Fe and Zn were found to be involved in Fe mobilization. Thus, the promising MTAs identified in the present study merit further validation in different genetic backgrounds of breeding lines and populations. Eleven inbred lines had ≥ 80 mg kg⁻¹ of Fe, > 60 mg kg⁻¹ Zn, and > 13% of PC that meet global targets and will serve as trait sources in elite backgrounds. Such lines will be easily converted into CMS (maintainers) to make hybrids with high-Fe/Zn/PC restorers for fast-track product development. The inbred panel studied that is part of hybrid parents at ICRISAT. This will enhance the introgression of these traits to develop high-yielding hybrids through marker-assisted back-crossing (MABC) in India where hybrid cultivars are dominant, while in Sub-Saharan Africa where open-pollinated varieties (OPVs) are predominant, it will be done through marker-assisted recurrent selection (MARS) and marker-assisted population improvement (MAPI).

Materials and methods

Plant material

The GWAS panel comprised of 281 inbred lines developed at ICRISAT, Hyderabad, India, differing in grain Fe and Zn as well as agronomic traits such as flowering, plant height, tillering, panicle size, 1000-grain weight, and grain yield. The inbred lines included 112 restorer parents (R-lines), 110 seed parents (B-lines), 32 advanced progenies derived from breeding population/composites, and 27 direct derivatives of germplasm accessions (Supplementary Table S1).

Field trials and agronomic practices

The trials were planted in alpha lattice experimental design with three replications in two contrasting environments, rainy season 2017 and summer season 2018 at ICRISAT, Hyderabad (17.53° N; 78.27°E). Each replication comprised of 20 incomplete blocks with 10 entries in each block, and every entry planted in two rows of 2 m length. Sowing was done by tractor-mounted 4-cone planter (7100 US model) with a spacing of 75 cm between rows during the rainy season 2017 and 60 cm in the summer season 2018. Overplanted plots were thinned 15 days after sowing to single plants spaced 15 cm apart within each row. A basal dose of 100 kg ha⁻¹ of diammonium phosphate (18% N and 46% P) was applied at the time of field preparation and 100 kg ha⁻¹ of urea (46% N) was applied as top dressing within 2 to 4 days of thinning. The trial was irrigated at 7–10 days intervals during the summer season 2018 and as required during the rainy season 2017 to avoid moisture stress. All the recommended agronomic practices were followed for good and healthy crop growth. Observations were recorded for five random plants per plot in each replication for Fe, Zn, and PC.

Estimation of grain iron and zinc content

For grain sampling, open-pollinated main panicles from five representative plants per plot were harvested at physiological maturity (85–90 days after planting). These panicles were stored separately in a cloth bag and sundried for 10 to 15 days, and then hand threshed to produce clean grain samples for micronutrient analysis (Fe and Zn). Utmost care was taken to avoid contact with iron equipment while threshing and handling of threshed samples. Grain Fe and Zn content were analyzed using Inductively Coupled Plasma Optical Emission Spectrometry (ICP-OES) at Flinders University, Australia, following the method described by Wheal et al.⁶¹. Grain samples were finely ground and oven-dried at 60 °C for 48 h before analyzing them for Fe and Zn. A ground sample of 0.2 g was transferred into 25 ml polyprophelene PPT tubes with 2.0 ml of concentrated nitric acid (HNO₃) and 0.5 ml of 30% hydrogen peroxide (H₂0₂). These samples were wetted and predigested overnight at room temperature. Samples were placed in the digestion block and heated at 80 °C for 1 h, followed by digestion at 120 °C for 2 h. After digestion, each sample digest was turned into 25 ml using distilled water. The digests were filtered using Whatman no.1 filter paper and the filtrate was used to estimate Fe and Zn content using ICP-OES.

Estimation of grain protein content

Grain protein content was analyzed using Near-Infrared Spectroscopy (NIRS) at ICRISAT. The quantified grain protein⁶² content was measured in percentage. The grain samples collected were cleaned thoroughly and about two to three grams of whole grain samples were poured in a small cup. The cup was then placed in the NIRS machine and the sample was run for a minute. The readings were then noted.

Estimation of Fe and Zn content from the soil

The soil samples collected from the top 30 cm layer in the field were analyzed for extractable Fe and Zn content by Atomic Absorption Spectroscopy (AAS)⁶³. The mean soil Fe and Zn content extractable with Diethylene Triamine Pentaacetic Acid (DTPA) were 3.8 mg kg⁻¹ and 2.0 mg kg⁻¹ during the rainy season 2017 and 5.0 mg kg⁻¹ and 1.6 mg kg⁻¹ during the summer season 2018, respectively.

DNA extraction and genotyping using DArT seq

Genomic DNA was isolated from tender leaf tissues of 30 day-old seedlings⁶⁴. The quality and quantity of the extracted DNA were checked on 0.8% agarose gel using gel electrophoresis at 80 V using λ-DNA standards. The DNA was subsequently diluted to a volume 50 μl of concentration 50 ng/μl. The samples were then sent to the Diversity Arrays Technology (DArT) Pty Ltd, Australia⁶⁵ for genotyping using DArT markers. The DArT seq assay, an efficient genotyping-by-sequencing platform was employed in the present study. In brief, the DNA samples were digested and ligated primarily with two different adaptors accompanying to overhang by two different restriction enzymes⁶⁶. The Illumina flow cell attachment sequence, sequencing primer sequence, and varying length barcode regions were included while designing the PstI-compatible adapter. The PstI-MseI fragments were amplified for 30 Polymerase Chain Reaction (PCR) cycles using the following reaction conditions: 94 °C for 1 min, followed by 29 cycles of 94 °C for 20 s (s), ramp 2.4 °C/s to 58 °C, 58 °C for 30 s, ramp 2.4 °C/s to 72 °C, 72 °C for 45 s. Amplicons were held at 72 °C for 7 min and then at 10 °C. All PCR amplicons from the 96-well were multiplexed in equimolar amount and kept to c-Bot (Illumina) bridge PCR after that sequenced on Illumina Hiseq2000. Single lane sequencing was followed for all the amplicons; the single read sequencing was run for 77 cycles. All the generated sequences from each lane were subjected to proprietary DArT analytical pipelines. Poor-quality sequences were filtered out from the FASTQ files in the primary pipeline. In the barcode region, more stringent selection criteria (≥ Phred pass score of 30) were employed in comparison with the rest of the sequence. The sequence assignments are authenticated to specific samples. In marker aligning, about 2,000,000 identified sequences per barcode/sample were used. Finally, identical sequences were broken into FASTQ call files. In the secondary proprietary pipeline of DArT P/L, the FASTQ call files were used to detect presence/absence markers (PAM) through SNP calling algorithms (DArTsoftseq)^67,68.

SNP filtering and quality control

Whole-genome genotyping data of 87,748 DArT seq markers on 281 pearl millet inbreds was generated using DArT genotyping platform. DArT seq SNP-derived markers were further filtered to remove SNPs of low quality with > 30% missing data and rare SNPs with < 10% MAF using TASSEL v 5.3.1 (Trait Analysis by Association Evolution and Linkage).

Phenotypic data analysis

The analyses of variance was performed over the rainy season 2017 and summer season 2018 using generalized linear model procedures following a random-effects model^69,70 in SAS University Edition (SAS/STAT, SAS Institute Inc, NC, USA)⁷¹. Heritability⁷² was determined using the following formula:

$${\text{H }} = \frac{{\upsigma _{{\text{g}}}^{2} }}{{\left( {\upsigma _{{\text{g}}}^{2} + \frac{{\upsigma _{{{\text{gs}}}}^{2} }}{{\text{s}}} + \frac{{\upsigma _{{\text{e}}}^{2} }}{{{\text{rs}}}}} \right)}}$$

where ${\upsigma }_{{\text{g}}}^{2}$ is the genotypic variance, ${\upsigma }_{{{\text{gs}}}}^{2}$ is the genotype × season interaction variance, and ${\upsigma }_{\mathrm{e}}^{2}$ is the residual variance; ‘r’ is the number of replications, and ‘s’ is the number of seasons. Mean and coefficient of variation (CV) were also determined using the standard procedure implemented in the SAS University Edition. Pearson’s correlation coefficients among the traits were calculated using the PROC CORR procedure in R version 3.5.1 (R Project for Statistical Computing, (https://www.r-project.org). The standard error of the mean (SEm) was determined in a simple excel program using the following formula:

$${\text{SEm}} = \sqrt {\frac{{{\text{MSS}}}}{{\text{n}}}}$$

where ‘MSS’ is the Mean sum of square and ‘n’ is the number of samples.

Population structure, kinship and genome-wide linkage disequilibrium

Population structure was determined using ADMIXTURE 1.23 software⁷³. The number of genetic clusters (K) was predefined as 1 to 10 to explore the population structure of the tested accessions. This analysis provided maximum likelihood estimates of the proportion of each sample derived from each of the K populations. The optimum K value was selected based on the graph plotted using the respective K value from 1 to10 against cross-validation error (CV-error). The optimal number of sub-population (K) was determined with the lowest cross-validation error. Genetic relatedness or K matrix was generated from TASSEL V 5.3.1.⁷⁴. LD was quantified as adjacent pairwise R² values (the squared allele frequency correlations among alleles at two adjacent SNP markers)⁷⁵ and was estimated for 58,719 SNPs in TASSEL V 5.3.1.

Genome-wide association analysis

Marker trait association was performed using two different models, GLM and MLM, as given below⁷⁶:

$$\begin{aligned} & {\text{y }} = {\text{ X}}_{{\text{a}}} + {\text{ Q}}_{{\text{b}}} + {\text{ e}}\quad \quad {\text{GLM and}} \\ & {\text{y }} = {\text{ X}}_{{\text{a}}} + {\text{ Q}}_{{\text{b}}} + {\text{ Z}}_{{\text{u}}} + {\text{ e}}\quad \quad {\text{MLM}} \\ \end{aligned}$$

where, ‘y’ is phenotype vector, ‘a’ is a marker vector with fixed effects, ‘b’ is a vector with fixed effects, ‘u’ is a vector with random effects (kinship matrix), ‘e’ is a residuals vector, X denotes the accessions/genotypes at the marker, ‘Q’ is the Q-matrix, the result of ADMIXTURE software, and ‘Z’ is an identity matrix.

The GLM principally considers the population structure (Q) while MLM considers both Q and Kinship (K). Further, among the different options available within MLM, the widely adapted approach called ‘optimum levels of compression in combination with P3D’ for variance component estimation was used for association analysis. For the MLM analysis, marker-based kinship matrix (K) obtained using TASSEL was used along with the Q matrix generated through ADMIXTURE to correct for both family and population structure and the phenotypic variation explained (R²) by the marker is reported^74,77. Quantile–Quantile (Q-Q) plots were developed by plotting observed negative Log₁₀ ‘P’ values against expected negative Log₁₀ ‘P’ values for all the available SNPs in R package CMplot⁷⁸. A deviation from ‘P’ values at the initial stage may display the existing population stratification. Manhattan plots were used to visualize chromosome-wise SNPs obtained through the marker-trait association study performed across the genome. -Log₁₀ of the ‘P’ value for each SNP was plotted against seven chromosomes for the respective trait. Based on the SNP distribution, the threshold for significance of associations between SNPs and traits was fixed at [− log 10 (p) < 10⁻⁰³] which gave the optimum number of reliable SNPs. SNP density plots, Q-Q plots, and Manhattan plots were generated using R package CMplot v 3.4.0⁷⁸.

The corresponding genes of associated SNPs or marker-trait associations were identified by using the physical positions of SNPs in gene annotations available in the pearl millet reference genome sequence²; and thus the functions of the respective SNPs were determined.

Candidate genes discovery

The candidate genes corresponding to the significantly associated SNPs were identified using the pearl millet genome² sequence annotations. The SNP subsiding start and end positions of a gene or exons were explored for candidate genes based on their biological function annotation related to the trait of interest (Supplementary Fig. S2). It is possible to obtain multiple SNPs on a gene segment which are referred to as haplotypes⁷⁹.

References

ICRISAT at https://exploreit.icrisat.org/profile/Pearl%20Millet/178 (2020).
Varshney, R. K. et al. Pearl millet genome sequence provides a resource to improve agronomic traits in arid environments. Nat. Biotechnol. 35, 969–976 (2017).
Article CAS Google Scholar
Tako, E. et al. Higher iron pearl millet (Pennisetum glaucum L.) provides more absorbable iron that is limited by increased polyphenolic content. Nutr. J. 14, 11. https://doi.org/10.1186/1475-2891-14-11 (2015).
Article CAS Google Scholar
Stoltzfus, R. J. Iron-deficiency anemia: reexamining the nature and magnitude of the public health problem. Summary: implications for research and programs. J. Nutr. 131, 697S-700S (2001).
Article CAS Google Scholar
Black, R. E. Zinc deficiency, infectious disease and mortality in the developing world. J. Nutr. 133, 1485S-S1489 (2003).
Article CAS Google Scholar
NIN. Nutritive value of Indian Foods, (Eds. Gopalan and Deosthale), National Institute of Nutrition, Hyderabad (2003).
Heine, W., Radke, M. & Wutzke, K. D. The significance of tryptophan in human nutrition. Amino Acids 9, 191–205 (1995).
Article CAS Google Scholar
Tomé, D. & Bos, C. Lysine requirement through the human life cycle. J. Nutr. 137, 1642S-1645S (2007).
Article Google Scholar
WHO. The State of Food Security and Nutrition in the World 2019. Safeguarding against economic slowdowns and downturns. Rome, FAO. License: CC BY-NC-SA 3.0 IGO (2019).
WHO. Anaemia, https://www.who.int/health-topics/anaemia#tab=tab_1 (2020).
Nirgude, M. et al. Development and molecular characterization of genic molecular markers for grain protein and calcium content in finger millet (Eleusine coracana (L.) Gaertn.). Mol. Biol. Rep. 41, 1189–1200. https://doi.org/10.1007/s11033-013-2825-7 (2014).
Article CAS Google Scholar
Wu, G. Dietary protein intake and human health. Food Funct. 7, 1251–1265 (2016).
Article CAS Google Scholar
Taunk, J. et al. Molecular breeding of ameliorating commercial pearl millet hybrid for downy mildew resistance. J. Genet. 97, 1241–1251 (2018).
Article CAS Google Scholar
Serraj, R. C. Recent advances in marker-assisted selection for drought tolerance in pearl millet. Plant Prod. Sci. 8, 334–337. https://doi.org/10.1626/pps.8.334 (2005).
Article Google Scholar
Govindaraj, M. et al. Breeding biofortified pearl millet varieties and hybrids to enhance millet markets for human nutrition. Agriculture 9, 106. https://doi.org/10.3390/agriculture9050106 (2019).
Article CAS Google Scholar
Glaszmann, J. C., Kilian, B., Upadhyaya, H. D. & Varshney, R. K. Accessing genetic diversity for crop improvement. Curr. Opin. Plant Biol. 13, 167–173 (2010).
Article CAS Google Scholar
Mauricio, R. Mapping quantitative trait loci in plants: uses and caveats for evolutionary biology. Nat. Rev. Genet. 2, 371–381 (2001).
Article CAS Google Scholar
Elshire, R. J. et al. Simple genotyping-by-sequencing (GBS) approach for high diversity species. PLoS ONE 6, e19379.39 (2011).
Article CAS Google Scholar
Sánchez-Sevilla, J. F. et al. Diversity arrays technology (DArT) marker platforms for diversity analysis and linkage mapping in a complex crop, the octoploid cultivated strawberry (Fragaria × ananassa). PLoS ONE 10(12), e0144960. https://doi.org/10.1371/journal.pone.0144960 (2015).
Article CAS Google Scholar
Stich, B. & Melchinger, A. An introduction to association mapping in plants. CAB Rev. 5, 1–9 (2010).
Article Google Scholar
Hindu, V. et al. Identification and validation of genomic regions influencing kernel zinc and iron in maize. Theor. Appl. Genet. 131, 1443–1457 (2018).
Article CAS Google Scholar
Descalsota, G. I. L. et al. Genome-wide association mapping in a rice MAGIC plus population detects ATLs and genes useful for biofortification. Front. Plant Sci. 9, 1347 (2018).
Article Google Scholar
Alomari, D. Z. et al. Whole-genome association mapping and genomic prediction for iron concentration in wheat grains. Int. J. Mol. Sci. 20, 76 (2018).
Article CAS Google Scholar
Kumar, A. et al. Genetic variability and character association for grain iron and zinc contents in sorghum germplasm accessions and commercial cultivars. Eur. J. Plant Sci. 6(1), 66–70 (2012).
Google Scholar
Kanatti, A. et al. Grain iron and zinc density in pearl millet: combining ability, heterosis and association with grain yield and grain size. Springer Plus. 3, 763 (2014).
Article CAS Google Scholar
Sager, M. & Mittendorfer, J. Influence of milling or cutting procedures on trace element contents of plant samples. Int. J. Environ. Anal. Chem. 67, 59–71 (1997).
Article CAS Google Scholar
Jones, J. B. Plant analysis. In: Laboratory Guide for Conducting Soil Tests and Plant Analysis, 191–239 (CRC Press, Boca Raton, 2001).
Kumar, S. et al. Mapping grain iron and zinc content quantitative trait loci in an Iniadi-derived immortal population of pearl millet. Genes 9, 248 (2018).
Article CAS Google Scholar
Anuradha, N. et al. Deciphering genomic regions for high grain iron and zinc content using association mapping in pearl millet. Front. Plant Sci. 8, e00412. https://doi.org/10.3389/fpls.2017.00412 (2017).
Article Google Scholar
Tisdale, S. L., Nelson, W. L. & Beaton, J. B. Soil Fertility and Fertilizers 5th edn. (Macmillan Pub. Co, New York, 1990).
Google Scholar
Sahrawat, K. L. et al. Soil testing as a tool for on-farm fertility management: experience from the semi-arid zone of India. Commun. Soil Sci. Plant. 44, 1011–1032 (2013).
Article CAS Google Scholar
Rai, K. N. et al. Iniadi pearl millet germplasm as a valuable genetic resource for high grain iron and zinc densities. Plant Genet. Resour. 13, 75–82. https://doi.org/10.1017/s1479262114000665 (2014).
Article Google Scholar
Rai, K. N. et al. Grain iron and zinc densities in released and commercial cultivars of pearl millet (Pennisetum glaucum). Indian J. Agric. Sci. 86, 291–296 (2016).
CAS Google Scholar
Slatkin, M. Linkage disequilibrium—understanding the evolutionary past and mapping the medical future. Nat. Rev. Genet. 9, 477–485. https://doi.org/10.1038/nrg2361 (2008).
Article CAS Google Scholar
Weir, B. S. & Cockerham, C. C. Estimating F-statistics for the analysis of population structure. Evolution 38, 1358 (1984).
CAS Google Scholar
Kim, S. V. et al. Recombination and linkage disequilibrium in Arabidopsis thaliana. Nat. Genet. 39, 1151. https://doi.org/10.1038/ng2115 (2007).
Article CAS Google Scholar
Mather, K. A. et al. The extent of linkage disequilibrium in rice (Oryza sativa L.). Genet. 177, 2223–2232 (2007).
Article CAS Google Scholar
Serba, D. D. Genetic diversity, population structure, and linkage disequilibrium of pearl millet. Plant Genome https://doi.org/10.3835/plantgenome2018.11.0091 (2019).
Article Google Scholar
Nachimuthu, V. V. et al. Analysis of population structure and genetic diversity in rice germplasm using SSR markers: an initiative towards association mapping of agronomic traits in Oryza sativa. Rice. 8, 1. https://doi.org/10.1186/s12284-015-0062-5 (2015).
Article Google Scholar
Gupta, P. K., Rustgi, S. & Kulwal, P. L. Linkage disequilibrium and association studies in higher plants: present status and future prospects. Plant Mol. Biol. 57, 461–485 (2005).
Article CAS Google Scholar
Pritchard, J. K., Stephens, M., Rosenberg, N. A. & Donnelly, P. Association mapping in structured populations. Am. J. Hum. Genet. 37, 170–181 (2000).
Article Google Scholar
Yu, J. et al. A unified mixed-model method for association mapping that accounts for multiple levels of relatedness. Nat. Genet. 38, 203–208. https://doi.org/10.1038/ng1702 (2006).
Article CAS Google Scholar
Price, A. L., Zaitlen, N. A., Reich, D. & Patterson, N. New approaches to population stratification in genome-wide association studies. Nat. Rev. Genet. 11, 459–463. https://doi.org/10.1038/nrg2813 (2010).
Article CAS Google Scholar
Zhao, K. et al. An Arabidopsis example of association mapping in structured samples. PLoS Genet. https://doi.org/10.1371/journal.pgen.0030004 (2007).
Article Google Scholar
Zhao, K. et al. Genome-wide association mapping reveals a rich genetic architecture of complex traits in Oryza sativa. Nat. Commun. 2, 467. https://doi.org/10.1038/ncomms1467 (2011).
Article ADS CAS Google Scholar
Liu, N. et al. Genome-wide association study identifies candidate genes for starch content regulation in maize kernels. Front. Plant Sci. 7, 1046. https://doi.org/10.3389/fpls.2016.01046 (2016).
Article Google Scholar
Yang, Y. et al. Identification of quantitative trait loci responsible for rice grain protein content using chromosome segment substitution lines and fine mapping of qPC-1 in rice (Oryza sativa L.). Mol. Breed. 35, 130. https://doi.org/10.1007/s11032-015-0328-z (2015).
Article CAS Google Scholar
Gupta, P. K., Kulwal, P. L. & Jaiswal, V. Association mapping in crop plants: opportunities and challenges. Adv. Genet. 38, 109–147. https://doi.org/10.1016/B978-0-12-800271-1.00002-0 (2014).
Article CAS Google Scholar
Li, H. et al. Genome-wide association study dissects the genetic architecture of oil biosynthesis in maize kernels. Nat. Genet. 45, 43–72 (2013).
Article CAS Google Scholar
Jaiswala, V. et al. Genome-wide association study (GWAS) delineates genomic loci for ten nutritional elements in foxtail millet (Setaria italica L.). J. Cereal Sci. 85, 48–55 (2019).
Article CAS Google Scholar
Phuke, R. M. et al. Mapping QTLs association with grain iron and zinc in sorghum (Sorghum bicolor L. moench) Ph.D. Thesis. Professor Jayashankar Telangana State Agricultural University, Hyderabad, India (2015).
Jin, T. et al. The genetic architecture of zinc and iron content in maize grains as revealed by QTL mapping and meta-analysis. Breed Sci. 63, 317 (2013).
Article CAS Google Scholar
Crespo-herrera, L. A., Velu, G. & Singh, R. P. Quantitative trait loci mapping reveals pleiotropic effect for grain iron and zinc concentrations in wheat. Ann. Appl. Biol. 169, 27–35 (2016).
Article CAS Google Scholar
Wassom, J. J. et al. QTL associated with maize kernel oil, protein, and starch concentrations; kernel mass; and grain yield in Illinois high oil × b73 backcross-derived lines. Crop Sci. 48, 243–252. https://doi.org/10.2135/cropsci2007.04.0205 (2008).
Article Google Scholar
Pradhan, S. K. et al. Association mapping reveals multiple QTLs for grain protein content in rice useful for biofortification. Mol. Genet. Genomic Med. 294, 963–983 (2019).
Article CAS Google Scholar
Chattopadhyay, K. et al. Detection of stable QTLs for grain protein content in rice (Oryza sativa L.) employing high throughput phenotyping and genotyping platforms. Sci. Rep. 9, 3–196. https://doi.org/10.1038/s41598-019-39863-2 (2019).
Article CAS Google Scholar
Johnson, M. et al. Association mapping for 24 traits related to protein content, gluten strength, color, cooking, and milling quality using balanced and unbalanced data in durum wheat [Triticum turgidum L. var durum (Desf).]. Front. Genet. 10, 717. https://doi.org/10.3389/fgene.2019.00717 (2019).
Article CAS Google Scholar
Palmer, C. M., Hindt, M. N., Schmidt, H., Clemens, S. & Guerinot, M. L. MYB10 and MYB72 are required for growth under iron-limiting conditions. PLoS Genet. https://doi.org/10.1371/journal.pgen.1003953 (2013).
Article Google Scholar
Shen, J., Xu, X., Li, T., Cao, D. & Han, Z. An MYB transcription factor from Malus xiaojinensis has a potential role in iron nutrition. J. Integr. Plant Biol. 50, 1300–1306 (2008).
Article CAS Google Scholar
Zang, D. et al. An Arabidopsis zinc finger protein increases abiotic stress tolerance by regulating sodium and potassium homeostasis, reactive oxygen species scavenging and osmotic potential. Front. Plant Sci. 7, 1272. https://doi.org/10.3389/fpls.2016.01272 (2016).
Article Google Scholar
Wheal, M. S., Fowles, T. O. & Palmer, L. T. A cost-effective acid digestion method using closed polypropylene tubes for inductively coupled plasma optical emission spectrometry (ICP-OES) analysis of plant essential elements. Anal. Methods 3, 2854 (2011).
Article CAS Google Scholar
Igne, B. et al. Triticale moisture and protein content prediction by near-infrared spectroscopy (NIRS). Cereal Chem. 84, 328–330 (2007).
Article CAS Google Scholar
Lindsay, W. L. & Norvell, W. A. Development of a DTPA test for zinc, iron, manganese and copper. Soil Sci. Soc. Am. J. 42, 421–428. https://doi.org/10.2136/sssaj1978.03615995004200030009x (1978).
Article ADS CAS Google Scholar
Cuc, L. M. et al. Isolation and characterization of novel microsatellite markers and their application for diversity assessment in cultivated groundnut (Arachis hypogaea). BMC plant. biol. 8, 55 (2008).
Article CAS Google Scholar
DArT at https://www.diversityarrays.com/dart-map-sequences (2020).
Raman, H., Raman, R. & Kilian, A. A consensus map of rapeseed (Brassica napus L.) based on diversity array technology markers: applications in genetic dissection of qualitative and quantitative traits. BMC Genomic 14, 277 (2013).
Article CAS Google Scholar
Vishwakarma, M. K. et al. Identification of two major quantitative trait locus for fresh seed dormancy using the diversity arrays technology and diversity arrays technology-seq based genetic map in Spanish-type peanuts. Plant Breed. 135, 367–375 (2016).
Article CAS Google Scholar
Shasidhar, Y. et al. Molecular mapping of oil content and fatty acids using dense genetic maps in groundnut (Arachis hypogaea L.). Front. Plant Sci. 8, 794. https://doi.org/10.3389/fpls.2017.00794 (2017).
Article Google Scholar
Steel, R. D. G. & Torrie, J. H. Principles and Procedures of Statistics: A Biometrical Approach 2nd edn. (McGraw-Hill Inc, New York, 1980).
MATH Google Scholar
Hallauer, A. R. & Miranda, J. B. Quantitative Genetics in Maize Breeding 1st edn. (Iowa State University Press, Ames, 1981).
Google Scholar
SAS Institute Inc. SAS/STAT 9.2 user’s guide. SAS Institute Inc, Cary (2004).
Hanson, C. H., Robinsion, H. R. & Comstock, R. S. Biometrical studies of yield in segregating population of Korean lespedeza. Agron. J. 47, 314–318 (1956).
Google Scholar
Alexander, D. H., Novembre, J. & Lange, K. Fast model-based estimation of ancestry in unrelated individuals. Genome Res. 19, 1655–1664 (2009).
Article CAS Google Scholar
Bradbury, P. J. et al. TASSEL: software for association mapping of complex traits in diverse samples. Bioinformatics 23, 2633–2635. https://doi.org/10.1093/bioinformatics/btm308 (2007).
Article CAS Google Scholar
Hill, W. G. & Robertson, A. Linkage disequilibrium in finite populations. Theor. Appl. Genet. 38, 226–231 (1968).
Article CAS Google Scholar
Tadesse, W. et al. Genome-wide association mapping of yield and grain quality traits in winter wheat genotypes. PLoS ONE 10, e0141339. https://doi.org/10.1371/journal.pone.0141339 (2015).
Article CAS Google Scholar
Kulwal, P. et al. Association mapping for pre-harvest sprouting resistance in white winter wheat. Theor. Appl. Genet. 125, 793–805. https://doi.org/10.1007/s00122-012-1872-0 (2012).
Article CAS Google Scholar
Yin, L. https://CRAN.R-project.org/package=CMplot (2019).
Gangurde, S. S. et al. Nested-association mapping (NAM)-based genetic dissection uncovers candidate genes for seed and pod weights in peanut (Arachis hypogaea). Plant Biotech. J. 18, 1457–1471 (2020).
Article CAS Google Scholar
Kruger, C., Berkowitz, O., Stephan, U. W. & Hell, R. A. Mmetal-binding member of the late embryogenesis abundant protein family transports iron in the phloem of Ricinus communis L. J. Biol. Chem. 277(28), 25062–25069. https://doi.org/10.1074/jbc.M201896200 (2002).
Article CAS Google Scholar
Su, Y., Yang, Y. & Huang, Y. Loss of ppr3 , ppr4 , ppr6 , or ppr10 perturbs iron homeostasis and leads to apoptotic cell death in Schizosaccharomyces pombe. FEBS J. 284(2), 324–337. https://doi.org/10.1111/febs.13978 (2017).
Article CAS Google Scholar
Chen, Y. H., Wu, X. M., Ling, H. Q. & Yang, W. C. Transgenic expression of DwMYB2 impairs iron transport from root to shoot in Arabidopsis thaliana. Cell Res. 16, 830–840 (2006).
Article CAS Google Scholar

Download references

Acknowledgements

This research was supported by funding from the HarvestPlus Challenge Program of the CGIAR. It was carried out as part of the CGIAR Research Program (CRP) on Agriculture for Nutrition and Health.

Author information

Authors and Affiliations

International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Patancheru, Telangana, 502 324, India
Mahesh Pujar, Mahalingam Govindaraj, Sunil S. Gangurde, A. Kanatti & Himabindu Kudapa
University of Agricultural Sciences, Shivamogga, Karnataka, 577 225, India
Mahesh Pujar & S. Gangaprasad

Authors

Mahesh Pujar
View author publications
You can also search for this author in PubMed Google Scholar
S. Gangaprasad
View author publications
You can also search for this author in PubMed Google Scholar
Mahalingam Govindaraj
View author publications
You can also search for this author in PubMed Google Scholar
Sunil S. Gangurde
View author publications
You can also search for this author in PubMed Google Scholar
A. Kanatti
View author publications
You can also search for this author in PubMed Google Scholar
Himabindu Kudapa
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M.G. and G.P. conceptualized the study; M.G., G.P., and M.P. designed of the experiments; M.G. contribution of experimental materials; M.P., A.K. executed the field/lab experiments and data collection; M.P., S.S.G., and H.B. analyzed the data and prepared tables and figures; G.P., M.P., H.B., and M.G. interpreted the results. All the authors read and approved the final manuscript.

Corresponding author

Correspondence to Mahalingam Govindaraj.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Pujar, M., Gangaprasad, S., Govindaraj, M. et al. Genome-wide association study uncovers genomic regions associated with grain iron, zinc and protein content in pearl millet. Sci Rep 10, 19473 (2020). https://doi.org/10.1038/s41598-020-76230-y

Download citation

Received: 27 July 2020
Accepted: 22 October 2020
Published: 10 November 2020
DOI: https://doi.org/10.1038/s41598-020-76230-y

This article is cited by

Uncovering the genomic regions underlying grain iron and zinc content using genome-wide association mapping in finger millet
- Ajay Kumar Chandra
- Dinesh Pandey
- Anil Kumar
3 Biotech (2024)
Genome-wide association analyses of agronomic traits and Striga hermonthica resistance in pearl millet
- Armel Rouamba
- Hussein Shimelis
- Abhishek Rathore
Scientific Reports (2023)
The Promise of Millets in the Twenty-First Century: Emphasis on Breeding, Nutrition, Food Security and Sustainability
- Tirthankar Bandyopadhyay
- Roshan Kumar Singh
- Manoj Prasad
Journal of Soil Science and Plant Nutrition (2023)
Identification of quantitative trait loci (QTLs) and candidate genes of seed Iron and zinc content in soybean [Glycine max (L.) Merr.]
- Huan Wang
- Jia Jia
- Hai Nian
BMC Genomics (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Variability for Fe, Zn, and PC

Genome-wide marker profiling

Population structure and linkage disequilibrium

Genome-wide association study

Genomic regions identified for grain Fe and Zn content

Grain protein content (PC)

Candidate genes associated with grain Fe, Zn, and PC

Discussion

Materials and methods

Plant material

Field trials and agronomic practices

Estimation of grain iron and zinc content

Estimation of grain protein content

Estimation of Fe and Zn content from the soil

DNA extraction and genotyping using DArT seq

SNP filtering and quality control

Phenotypic data analysis

Population structure, kinship and genome-wide linkage disequilibrium

Genome-wide association analysis

Candidate genes discovery

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links