Genome-Wide Association Study of Major Agronomic Traits in Foxtail Millet (Setaria italica L.) Using ddRAD Sequencing

Jaiswal, Vandana; Gupta, Sarika; Gahlaut, Vijay; Muthamilarasan, Mehanathan; Bandyopadhyay, Tirthankar; Ramchiary, Nirala; Prasad, Manoj

doi:10.1038/s41598-019-41602-6

Download PDF

Article
Open access
Published: 22 March 2019

Genome-Wide Association Study of Major Agronomic Traits in Foxtail Millet (Setaria italica L.) Using ddRAD Sequencing

Vandana Jaiswal¹,
Sarika Gupta¹,
Vijay Gahlaut ORCID: orcid.org/0000-0003-4381-3573²,
Mehanathan Muthamilarasan^3,4,
Tirthankar Bandyopadhyay³,
Nirala Ramchiary¹ &
…
Manoj Prasad³

Scientific Reports volume 9, Article number: 5020 (2019) Cite this article

5610 Accesses
58 Citations
2 Altmetric
Metrics details

Subjects

Abstract

Foxtail millet (Setaria italica), the second largest cultivated millet crop after pearl millet, is utilized for food and forage globally. Further, it is also considered as a model crop for studying agronomic, nutritional and biofuel traits. In the present study, a genome-wide association study (GWAS) was performed for ten important agronomic traits in 142 foxtail millet core eco-geographically diverse genotypes using 10 K SNPs developed through GBS-ddRAD approach. Number of SNPs on individual chromosome ranged from 844 (chromosome 5) to 2153 (chromosome 8) with an average SNP frequency of 25.9 per Mb. The pairwise linkage disequilibrium (LD) estimated using the squared-allele frequency correlations was found to decay rapidly with the genetic distance of 177 Kb. However, for individual chromosome, LD decay distance ranged from 76 Kb (chromosome 6) to 357 Kb (chromosome 4). GWAS identified 81 MTAs (marker-trait associations) for ten traits across the genome. High confidence MTAs for three important agronomic traits including FLW (flag leaf width), GY (grain yield) and TGW (thousand-grain weight) were identified. Significant pyramiding effect of identified MTAs further supplemented its importance in breeding programs. Desirable alleles and superior genotypes identified in the present study may prove valuable for foxtail millet improvement through marker-assisted selection.

Detection of genomic regions associated with tiller number in Iranian bread wheat under different water regimes using genome-wide association study

Article Open access 20 August 2020

Mining genomic regions associated with agronomic and biochemical traits in quinoa through GWAS

Article Open access 22 April 2024

Pilot-scale genome-wide association mapping in diverse sorghum germplasms identified novel genetic loci linked to major agronomic, root and stomatal traits

Article Open access 08 December 2023

Introduction

Foxtail millet (Setaria italica) is a C₄ self-pollinated cereal crop known to be cultivated since 5000–6000 BC on the banks of Yellow River in China¹. The crop has major agronomic advantages in terms of being relatively cheap to cultivate, tolerant to biotic and abiotic stresses^1,2,3,4, efficient in water use⁵ and nutritionally rich^2,3,6,7. It is a major crop in the arid and semi-arid regions of Asia, sub-Saharan Africa and China⁸ and has increasingly emerged as one of the promising climate-resilient crops in the present decade⁹. Moreover, foxtail millet has a relatively smaller diploid genome of 510 Mb^10,11 and is considered as an ideal C₄ model system for genetic studies involving C₄ photosynthesis, agronomically important stress responses, bioenergy potential² among many others. Despite possessing such attractive agronomic traits, endeavours to understand, dissect and utilize the genetic diversity and generate mapping resources of the crop has been limited^12,13,14. Thus, a better understanding of the genetic basis influencing the variation in agronomic traits stands to significantly augment crop improvement strategies through conventional breeding or biotechnological approaches.

In foxtail millet, linkage-based QTL mapping has been conducted for several agronomic traits including yield, grain weight, flowering days, seed number, etc.^15,16,17. Linkage-based mapping suffers from poor mapping resolution, less allele mining due to the utilization of biparental population. It is also very tedious to develop mapping population. Alternatively, linkage disequilibrium (LD) based genome-wide association study (GWAS) has higher mapping resolution due to the utilization of historical recombination events available in a natural population¹⁸. GWAS has been widely conducted in each of the important crops and model plant systems (like wheat, rice, maize, Arabidopsis) for several traits including agronomic, quality, disease resistance, etc.^{19,20,21,22,23,24}. However, in foxtail millet, only a few studies are available on GWAS^25,26.

Advancements in technologies like next-generation sequencing (NGS), genotyping-by-sequencing (GBS) and SNP chips further improve the utility of GWAS through the development of high-density genotyping data. Further, to deal with false positives due to population structure and multiple testing in GWAS, statistical tools are being continually developed²⁷. Given this, high-quality SNPs distributed throughout the foxtail millet genome were mined using Double Digest Restriction Associated DNA (ddRAD) sequencing of 142genotypes. GWAS was performed for ten agronomic traits using 10367 SNPs through FarmCPU approach. Superior alleles and genotypes identified in the present study stand to significantly facilitate the improvement of foxtail millet as a viable and efficient climate resilient crop through marker-assisted selection and other useful crop improvement programmes in the future.

Results

Trait distribution and correlation

Descriptive statistics including minimum, maximum, mean and standard deviation revealed a wide range of variability for each of the ten traits across 142 genotypes and summarized in Supplementary Table 1. In summary, a wide range of variability was observed; for example, day of flowering (DOF) ranged from 36–65 with mean 51.4 and a standard deviation of 5.6. Similar variability was observed in other traits including plant height (PH; mean ± SD; 138.4 ± 19.6), tiller number (TN; mean ± SD; 4.3 ± 1.2), flag leaf length (FLL; mean ± SD; 31.4 ± 6.0), flag leaf width (FLW; mean ± SD; 1.8 ± 0.4), peduncle length (PedL; mean ± SD; 19.0 ± 4.4); panicle length (PanL; mean ± SD; 14.6 ± 3.7), tiller maturity (TM; mean ± SD; 87.8 ± 8.0), grain yield (GY; mean ± SD; 12.8 ± 7.8) and thousand grain weight (TGW; mean ± SD; 2.9 ± 0.6). Frequency distribution of each trait in the population was revealed through histograms (Supplementary Fig. 1). Pearson’s correlation analysis identified that out of 45 trait-pairs (using ten traits), 22 pairs were significantly correlated (Supplementary Table 2). Out of 22 correlations, 14 were positive, and eight were negative. A maximum positive correlation (r² = 0.47) was observed for FLL/FLW. However, TGW and GY were negatively correlated to the maximum extent (r² = −0.716).

Distribution of SNPs on foxtail millet chromosomes

GBS enabled the identification of ~30,000 SNPs. After filtration (removing markers with missing data <30% and minor allele frequency >5%), 12460 SNPs were selected for physical mapping on foxtail millet chromosomes. Out of 12460 SNPs, 10367 were mapped on nine major scaffolds of foxtail millet. These major scaffolds (1–9) were considered as nine chromosomes (1–9), respectively, hereafter. The mapped 10367 SNPs covered a total of 399.9 Mb of foxtail millet genome. Thus, the average SNP frequency in foxtail millet genome was observed as 25.9 SNPs/Mb. On individual chromosome, the number of SNPs ranged from 844 (chromosome 5) to 2153 (chromosome 8) (Table 1). Distribution of SNPs across the nine chromosomes is given in Fig. 1. Length of individual chromosome varied from 35.6 Mb (chromosome 7) to 58.9 Mb (chromosome 9). Maximum SNP density was observed on chromosome 8 (53.0 SNPs/Mb) and minimum on chromosome 9 (16.7 SNPs/Mb).

Table 1 Distribution of 10367 SNPs on nine foxtail millet chromosomes. A summary of SNP pairs showing significant linkage disequilibrium (LD) and LD decay distance on each of the nine chromosomes is also shown.

Full size table

Linkage Disequilibrium

LD and LD decay distance in all the nine foxtail millet chromosomes is summarized in Table 1. A maximum of 28790 SNP pairs on chromosome 8 showed significant (p < 0.05) LD; however, a minimum of 7461 SNP pairs crossed the significance level of LD on chromosome 6. The whole genome average maximum r² value found as 0.46 which dropped to its half (0.23) as distance 177 Kb; thus, considered as whole genome LD decay distance and above which LD decayed (Fig. 2). However, for individual chromosome, maximum LD decay distance was observed for chromosome 4 (357 Kb), followed by chromosome 5 (350 Kb), while chromosome 6 (76 Kb; Table 1, Supplementary Fig. 2) showed the minimum LD decay distance.

Genome-wide marker-trait associations

Altogether, 81 marker-trait associations (involving 79 SNPs) were identified for ten traits using FarmCPU (Table 2) with p-value < 0.001. Two SNPs (C9.37523364 and C7.19705515) were associated with two traits (FLW/TN and GY/TN) each, respectively. Above mentioned 81 MTAs were present on all the nine chromosomes (Fig. 3). Q-Q plots between observed and expected p-values of association revealed appropriate model fitting involving population structure and kinship (Fig. 3), although the power of test statistics was lower in some cases. For DOF, only one SNP (C2.27561819 on chromosome 2) was found to be associated. A maximum of 21 MTAs was identified for FLW involving all the chromosomes except chromosome 5. Most significant MTA for FLW was present on chromosome 9. For GY, 17 MTAs were identified on seven chromosomes (1–3 and 5–8), and most significant MTA was present on chromosome 3. Similarly, for TGW, 10 MTAs were identified on five chromosomes (2, 3, 6, 8 and 9). Seven MTAs identified each for three traits including FLL (chromosomes 2 and 5), PedL (chromosomes 2, 3, 5, 6, 7 and 9), and TM (chromosomes 3, 5–9). Total six, three and two MTAs were identified for TN, PH and PedL, respectively. Summary of above mentioned 81 MTAs along with chromosomal position, p-value, minor allele frequency and SNP effect is given in Table 2.

Table 2 List of significant MTAs along with contrasting alleles, chromosome, position, minor allele frequency (MAF) and SNP effect. 0.001 is considered as P-value cut off for significant MTAs. Earlier studies where QTLs are reported for same trait on same chromosomes are mentioned in last column.

Full size table

High confidence marker-trait association

To eliminate the false positive due to multiple testing, the MTAs were filtered following Bonferroni correction. Out of 81 MTAs, only seven MTAs could fulfill Bonferroni criteria, thus considered as high confidence MTAs (Table 3). These seven MTAs were associated with three traits including FLW (three), GY (two) and TGW (two). All the three MTAs associated with FLW were present on chromosome 9, viz., C9.37225457 (p-value 5.06 × 10⁻¹⁸), C9.37443288 (p-value 3.12 × 10⁻¹⁷) and C9.38068016 (p-value 3.12 × 10⁻¹⁷). Two MTAs associated with GY were present on chromosome 3 (C3.50114070, p-value 1.77 × 10⁻⁷) and chromosome 7 (C7.19705515, p-value 7.82 × 10⁻⁷). Similarly, two MTAs associated with TGW were found on chromosome 6 (C6.34654923, p-value 9.52 × 10⁻⁸) and chromosome 9 (C9.37011889, p-value 4.98 × 10⁻⁹). The above mentioned seven high confidence MTAs were further subjected to downstream analysis including the estimation of allele effect, identification of desirable allele and pyramiding effect of desirable alleles.

Table 3 List of Marker-trait associations (MTAs) fulfill the Bonferroni correction, with desirable allele and desirable allele effect. Desirable genotypes with desirable SNP alleles and phenotype may be used in foxtail breeding.

Full size table

Allele effect and identification of desirable allele

The allele effect was determined for each allele of SNPs involved in seven MTAs involving three traits that fulfilled the Bonferroni criteria (Table 3). For all the three traits including FLW, GY and TGW, positive selection is required; thus, SNP alleles with positive allele effect were considered as desirable. For FLW, C9.37225457-T, C9.37443288-A and C9.38068016-T were found desirable; for GY, C3.50114070-G and C7.19705515-C were desirable; and for TGW, C6.34654923-T and C9.37011889-A were desirable to increase the trait values. The phenotype, with a desirable and undesirable allele of associated SNPs, were further tested using ‘Kruskal–Wallis test,’ which revealed significant difference for each phenotype (with and without desirable SNP allele) for all seven SNPs.

Identification of putative candidate genes

Total 27 candidate genes were identified which were residing within 25 Kb regions (upstream and downstream) of seven high confidence MTAs for threes traits (FLW, GY and TGW), see Table 4, however, no associated SNPs were present within the gene. For three MTAs associated with FLW, total nine candidate genes were identified which found to encode AP2 domain, amino-oxononanoate synthase, ubiquitous protein, peptidases/proteases, pollen allergen1, arabinonate dehydratase/hydrolyase, etc. Similarly, ten candidate genes were identified in genomic regions harbouring two MTAs associated with GY; these genes encoded proteins involved in RBR family ring finger and IBR domain, translation initiation factor 3, pyridine nucleotide-disulfide oxidoreductase domain-containing protein 2, RNA recognition motif, LEA protein, glycosyltransferases, etc. For two MTAs associated with TGW, eight candidate genes were found involved in Sentrin/sumo-specific protease, DNA polymerase delta subunit 4, ATP-binding cassette transporter, etc. (Table 4).

Table 4 Putative candidate genes residing close vicinity (25 Kb either side) to associated SNPs.

Full size table

Pyramiding effect and desirable genotypes

The pyramiding effect of desirable alleles (including more than one SNP associated with the trait) was calculated for above mentioned three traits involving seven MTAs (that fulfilled Bonferroni criteria; Fig. 4). Analysis of pyramiding effect showed that an increase in a number of desirable alleles significantly affects the trait value. For instance, two SNPs associated with GY; genotypes with no desirable allele had a mean GY = 11.0 g, while the genotypes with one desirable allele had a mean GY = 18.4 g and genotypes with two desirable alleles had a mean GY = 33.6 g. The difference between mean GY with two, one and zero desirable alleles was highly significant (r² = 0.40; p ≤ 0.000). A similar trend was observed for TGW. Mean TGW values of genotypes with two (3.1 g), one (2.4 g) and without any desirable alleles (1.5 g) showed significant difference among them (r² = 0.41; p ≤ 0.000). For FLW, genotypes with three desirable alleles had significantly wider flag leaf (2.4 cm) than genotypes with no desirable allele (1.7 cm) and one desirable allele (1.5 cm). However, no significant difference was observed for FLW in genotypes with two desirable alleles and three desirable alleles.

Further, most desirable genotypes with a maximum number of the desirable allele for each of the above mentioned three traits (FLW, GY and TGW) were identified (Table 3). For FLW, two genotypes including F8 and F34 were identified to have desirable alleles of three associated SNPs, and also showed higher traits value (3.2 cm). For GY, one genotype (F14; 42.3 g) was identified to have a desirable allele of two associated SNPs. Similarly, for TGW, two genotypes (D85, D76) were identified to have higher TGW and desirable alleles for two SNPs.

Discussion

GWAS has always been a potential approach for genetic dissection of complex traits, and it has been successfully utilized in a number of crops including wheat, rice, pearl millet, maize and cotton^{22,28,29,30,31,32}. However, in foxtail millet, only a couple of studies are available where GWAS was utilized to identify genomic regions controlling traits of interest^25,26. Genetic diversity of panel is important prerequisites for GWAS and has been conducted in our earlier study²⁶ which suggested that the panel is diverse. Out of 142, 89 genotypes were collected from different parts of India including Andhra Pradesh, Bihar, Tamilnadu, Jammu & Kashmir, Karnataka, Maharashtra, Madhya Pradesh, Rajasthan, Uttara Khand and West Bengal; remaining 53 genotypes were exotic and belonged to nine different countries (for details see Gupta et al.²⁶). Further, population structure creates confounding in GWAS results. Thus it is important to conduct population structure analysis. Our earlier study²⁶ suggested that there were five subpopulations in the panel.

Further, size and trait diversity of population also affects the power of GWAS³³. In the present study, utilization of 142 genotypes which are sufficiently diverse suggested their suitability for GWAS. The size of the population used in the present study is slightly small but found to be comparable with GWAS conducted in other cereal crops including millets^34,35,36. It has been suggested that small population size may be inefficient for the identification of significant associations with minor effect³⁷. In the present study, we have identified significant MTAs even after implementing the most stringent Bonferroni correction. This suggested that the population size is good enough for GWAS; although, we agree that a larger population may lead to the identification of more MTAs. Descriptive statistics and frequency distribution (Supplementary Table 1; Supplementary Fig. 1) suggested that the panel had enough variability for each of the ten traits. Another important prerequisite of GWAS is the high-density marker^19,20,38,39. In the present study, GBS enabled the development of a high-density genotyping data with ~10 K SNPs to conduct GWAS for agronomic traits.

GWAS is based on LD which itself is affected by several factors including physical linkage, recombination, selection, genetic drift, etc.⁴⁰. Furthermore, self-pollinating crops show stronger LD than cross-pollinating crop^40,41,42. Higher recombination rate causes faster LD decay which ultimately results in higher mapping resolution. It is well understood that the recombination rate varies through genome^43,44, and thus, some genomic regions have been identified as recombination hot-spots, where recombination rate is higher and vice-versa⁴⁵. In foxtail millet, genome-wide LD is reported as 100 Kb²⁵. In the present study, we estimated the genome-wide LD decay as well as for each of the nine chromosomes individually. It was observed that the decay distance varied across different chromosomes (76 Kb on chromosome 6 to 357 Kb on chromosome 4); genome-wide decay distance was found to be 177 Kb, which is well at par with the earlier study²⁵. The rapid LD decay suggested that the population used in the present study is sufficiently diversified and suitable to conduct GWAS. Thus, the rapid LD decay and higher mapping resolution also made the study useful for cloning of QTLs.

A major limitation of GWAS is false positives that arise due to population structure. However, false positives may be reduced with the use of statistical models^46,47. Although correction for population structure plays a vital role in reducing false positives, overcorrection may lead to false negative results²². Therefore, we initially tested the model fitting for population structure correction. Q-Q plots of all the ten traits showed the proper distribution of observed p-values over expected, which suggested that the association model used in this study is the best fit and maximized the confidence of GWAS results. Further, false positives may also arise due to multiple testing, because in each test using single SNP there is at least 5% error and with an increase in the number of SNPs (i.e., the number of tests) overall experimental error increases. Several statistical tools are available for multiple testing correction²⁷. In the present study, we also applied corrections for multiple testing (e.g., Bonferroni correction) during GWAS. Here, we observed that out of 81 MTAs (associated with ten traits) only seven MTAs (associated with three traits) qualified Bonferroni correction. Although multiple testing corrections are important to reduce false positive, it becomes highly stringent due to the use of thousands of makers; and may lead to false negatives. Thus, FDR is also considered as a tradeoff and escaped detection of genuine MTAs in earlier studies^22,48,49. In our study, we also observed that even MTAs with a very significant p-value (10⁻⁵) could not qualify multiple testing correction criteria. Thus, all the 81 MTAs may not be false, and thus, need further validation; however, seven MTAs those fulfilled the multiple testing correction criteria had more confidence.

It is widely known that most of the traits are complex and are controlled by a large number of genes/QTLs^{15,17,19,20,50,51}. In foxtail millet, linkage-based QTL mapping has been conducted for a number of agronomic traits such as days to heading, peduncle length, grain weight biomass, spikelet, yield, etc.^15,16,17,52 using biparental mapping population. The identification of 81 MTAs in the present study adds to the existing knowledge of the genetic architecture of traits considered. Further, single locus analysis furnishes biased results since the background is not considered in this approach. Given this, we have conducted multi-locus analysis by considering the background genome as cofactor using recently developed FarmCPU approach⁵³. Interestingly, out of 81 MTAs, 60 were present on the same chromosome where QTL/s for the same traits were reported in earlier studies^17,25,42 (Table 2).

Seven high confidence MTAs were identified for three important agronomic traits including FLW, GY and TGW may prove useful in foxtail millet breeding program through marker-assisted selection after validation. For validation, linkage based interval mapping or joint linkage-LD mapping may be conducted using biparental mapping population or specialized populations (NAM, MAGIC). The above mentioned seven high confidence SNPs also found crucial in identifying important candidate genes underlying these traits. Candidate genes present close vicinity of associated SNPs identified in the present study may be validated in the future so that can be deployed in breeding. For validation, one can use candidate gene-based association mapping using large population, or functional characterization through RNAi, VIGS, etc. Identification of desirable alleles of these MTAs will enable their efficient utilization in crop improvement programs. Interestingly, the significant pyramiding effect of multiple MTAs for the single trait in our study suggested that the associated SNPs may combine to improve the trait substantially. For three traits (FLW, GY and TGW), three desirable genotypes were identified with a maximum number of desirable alleles. These genotypes may be used as a donor in foxtail millet breeding program. Intercrossing of these genotypes may combine desirable traits to develop improved high yielding foxtail genotype.

In foxtail millet, studies are available where GWAS has been conducted for agronomic traits using few SSRs²⁶ and million SNPs²⁵. The present study provides better resolution of trait mapping using 10 K SNPs as compared to an earlier study²⁶. Jia et al.²⁵ conducted GWAS for 47 agronomic traits using 916 accessions and 0.8 million SNPs developed through whole-genome sequencing. The present study, where GWAS was conducted for ten agronomic traits (nine were common with Jia et al.²⁵, TM was not studied in Jia et al.²⁵) with lesser SNPs and accessions, may be questioned for its novelty. There are two parameters which made present study novel- (i) utilization of different accessions with different genetic background and (ii) phenotyping in different environments. These two parameters enabled us to identify some novel genomic regions associated with agronomic traits (Table 2). For example, two high confidence SNPs, one each associated with GY (C6.2476369) and TGW (C9.37011889) were identified on chromosome 6 and chromosome 9, respectively, in the present study. However, Jia et al.²⁴ did not identify any SNPs associated with GY and TGW on chromosomes 6 and 9, respectively.

The present study explored the genetic architecture of ten agronomic traits using LD based GWAS exploiting historical recombination in a natural population. The population used in the present study showed a wide range of variability for traits studied. Further, ddRAD-seq provided high-density genotypic data which is a pre-requisite for GWAS. Our study led to the identification of 81 MTAs for agronomic traits including some novel MTAs and provided better insights into the genetic architecture of traits. Significant pyramiding effect of associated SNPs with the same trait suggested their potential utilization in foxtail breeding. The desirable alleles and genotypes identified in this study will be useful in crop improvement programmes.

Materials and Methods

Plant material and phenotyping

The phenotypic data of 142 foxtail millet genotypes previously reported by Gupta et al.²⁶ was used in the present study. Precisely, the genotypes were phenotyped for ten yield contributing agronomic traits for three consecutive years (2009–2011) at the research fields of National Institute of Plant Genome Research (NIPGR), New Delhi, India, in a randomized complete block design with three replications. Mean data over the years of each of the 10 traits were utilized during the present study. The traits included days to flowering (DOF), plant height (PH), tiller number (TN), flag leaf length (FLL), flag leaf width (FLW), peduncle length (PedL), panicle length (PanL), tiller maturity (TM), grain yield (GY), and 1000 grain weight (TGW). Descriptive statistics, frequency distribution and Pearson’s correlation coefficient between all possible trait-pairs were analyzed using SPSS v17.0 software.

ddRAD sequencing and SNP calling

DNA was extracted from one-month-old leaf samples of 142 genotypes using the CTAB method⁵⁴. The DNA samples were RNase treated to remove RNA contamination, and the quality and quantity were checked on 1% agarose gel and NanoDrop 1000 (Thermo Scientific). For genotyping, Double Digest Restriction Associated DNA (ddRAD) sequencing approach was used⁵⁵, and sequencing was done using Illumina Hiseq4000 (AgriGenome Labs Pvt Ltd, Hyderabad, India). Raw Fastq reads were demultiplexed allowing one mismatch to obtain reads for each sample. Data were filtered on the basis of RAD TAGs. Filtered reads were then subjected to 5′ and 3′ base trimming. Illumina 5′ and 3′ adapter sequences were also removed. Paired-end alignment was performed using Bowtie2 (version 2-2.2.9) program with default parameters to the reference genome (http://genome.jgi.doe.gov). The aligned samples and the reference genome sequence are used for variant calling using default settings of SAMtools version 0.1.18.

Linkage disequilibrium

LD (in terms of r²) analysis was performed for the whole genome as well as individually for each of the nine chromosomes using window size 50 with the help of software TASSEL v5.0. To estimate LD decay, non-linear regression curve was utilized⁵⁶, and LD decay distance was estimated as the physical distance between SNPs where average r² reduced to half of the maximum LD value.

Marker-trait associations

SNPs with <30% missing data and >5% minor allele frequency were utilized for GWAS. All the 142 genotypes used for GWAS were having <30% missing genotypic data. For the association test, a recently developed method called Fixed and random model Circulating Probability Unification (FarmCPU⁵³) was used. This method is highly efficient and also eliminates confounding issues arising due to population structure, kinship, multiple testing correction, etc. This method utilizes both Fixed Effect Model (FEM) and a Random Effect Model (REM), iteratively. REM estimated pseudo-quantitative trait nucleotides (QTNs) and FEM tested marker using pseudo QTNs as covariates. First, three components identified through principal component analysis (PCA) using TASSEL v5.0 were included as a covariate in the association test model. SNP with p-value < 0.001 declared as significant MTAs. Bonferroni-corrected p-value threshold was set as 0.01. To show the model fitting (accounting for population structure), quantile-quantile (Q-Q) plots were also analyzed. The Q-Q plot showed the distribution of observed and expected p-values (association test statistics). Desirably in case of appropriate model fitting, Q-Q plots should show a solid line (i.e., the distribution of observed p-value is similar to expected one) represented no biasness; and sharp curves at the end which represented a small number of true associations among thousands of unassociated SNPs. The extent of deviation of curve end from the diagonal is the measure of the power of test statistics.

Allele effect and pyramiding effect of desirable alleles; identification of desirable genotypes

Phenotypic effect (a_i) of each allele of SNPs (significantly associated with trait following Bonferroni correction) was estimated following Zhang et al.⁵⁷. Kruskal–Wallis test was performed to identify whether the alleles differ considerably for the associated traits. Subsequently, favourable alleles were identified for each of the trait considered according to the breeding objective. For traits with negative selection, a_i < 0 was considered as desirable allele; while, for a trait with positive selection, a_i > 0 was considered as a desirable allele.

The pyramiding effect was estimated in the case where more than two SNPs were found to be associated with the same trait (after Bonferroni correction). To determine the pyramiding effect, linear regression was performed using the number of desirable SNP alleles for traits (independent variable) and corresponding trait values of the genotypes that contained different numbers of desirable SNP alleles (dependent variable). Genotypes with a maximum number of desirable alleles and desirable phenotype were considered as a desirable genotype.

Identification of putative candidate genes

To identify putative candidate genes residing at the close vicinity of high confidence SNPs, the associated SNPs were mapped to the reference genome Setaria italica v2.2 (https://phytozome.jgi.doe.gov/pz/portal.html#!info?alias=Org_Sitalica). Transcripts present within 25 Kb regions from both sides of associated SNPs were fetched along with their description.

Data Availability

The datasets supporting the conclusions of this article are included within the article and its Additional files.

References

Li, Y. & Wu, S. Traditional maintenance and multiplication of foxtail millet (Setaria italica (L.) P. Beauv.) landraces in China. Euphytica 87, 33–38 (1996).
Article ADS Google Scholar
Muthamilarasan, M. & Prasad, M. Advances in Setaria genomics for genetic improvement of cereals and bioenergy grasses. Theor. Appl. Genet. 128, 1–14 (2015).
Article CAS Google Scholar
Muthamilarasan, M., Dhaka, A., Yadav, R. & Prasad, M. Exploration of millet models for developing nutrient rich graminaceous crops. Plant Sci. 242, 89–97 (2015).
Article Google Scholar
Dekker, J. The Foxtail (Setaria) Species-Group. Weed Sci. 51, 641–656 (2003).
Article CAS Google Scholar
Li, P. & Brutnell, T. P. Setaria viridis and Setaria italica, model genetic systems for the Panicoid grasses. J. Exp. Bot. 62, 3031–3037 (2011).
Article CAS Google Scholar
Liang, S., Yang, G. & Ma, Y. Chemical characteristics and fatty acid profile of Foxtail millet bran oil. J. Am. Oil Chem. Soc. 87, 63–67 (2010).
Article CAS Google Scholar
Amadou, I., Amza, T., Shi, Y.-H. & Le., G.-W. Chemical analysis and antioxidant properties of foxtail millet bran extracts. Songklanakarin. J. Sci. Technol. 33, 509–515 (2011).
CAS Google Scholar
Lata, C., Bhutty, S., Bahadur, R. P., Majee, M. & Prasad, M. Association of an SNP in a novel DREB2-like gene SiDREB2 with stress tolerance in foxtail millet [Setaria italica (L.)]. J. Exp. Bot. 62, 3387–3401 (2011).
Article CAS Google Scholar
Muthamilarasan, M., Singh, N.K. & Prasad, M. Multi-omics approaches for strategic improvement of stress tolerance in underutilized crop species: A climate change perspective. Adv. Genet, https://doi.org/10.1016/bs.adgen.2019.01.001 (2019).
Bennetzen, J. L. et al. Reference genome sequence of the model plant. Setaria. Nature Biotechnol. 30, 555 (2012).
Article CAS Google Scholar
Zhang, G. et al. Genome sequence of foxtail millet (Setaria italica) provides insights into grass evolution and biofuel potential. Nature Biotechnol. 30, 549–554 (2012).
Article CAS Google Scholar
Doust, A. N., Devos, K. M., Gadberry, M. D., Gale, M. D. & Kellogg, E. A. Genetic control of branching in foxtail millet. Proc. Natl. Acad. Sci. USA 101, 9045–9050 (2004).
Article ADS CAS Google Scholar
Doust, A. N., Devos, K. M., Gadberry, M. D., Gale, M. D. & Kellogg, E. A. The genetic basis for inflorescence variation between foxtail and green millet (poaceae). Genetics 169, 1659–1672 (2005).
Article CAS Google Scholar
Wang, C. et al. Population genetics of foxtail millet and its wild ancestor. BMC Genet. 11, 1–13 (2010).
Article MathSciNet Google Scholar
Sato, K., Mukainari, Y., Naito, K. & Fukunaga, K. Construction of a foxtail millet linkage map and mapping of spikelet-tipped bristles 1 (stb1) by using transposon display markers and simple sequence repeat markers with genome sequence information. Mol. Breed. 31, 675–684 (2013).
Article CAS Google Scholar
Yoshitsu, Y. et al. QTL-seq analysis identifies two genomic regions determining the heading date of foxtail millet, Setaria italica (L.) P. Beauv. Breed. Sci. 67, 518–527 (2017).
Article Google Scholar
Zhang, K. et al. Identification of QTLs for 14 agronomically important traits in Setaria italica based on SNPs generated from high-throughput sequencing. G3 (Bethesda) 7, 1587–1594 (2017).
Article CAS Google Scholar
Remington, D. L. et al. Structure of linkage disequilibrium and phenotypic associations in the maize genome. Proc. Natl. Acad. Sci. USA 98, 11479–11484 (2001).
Article ADS CAS Google Scholar
Atwell, S. et al. Genome-wide association study of 107 phenotypes in a common set of Arabidopsis thaliana inbred lines. Nature 465, 627–631 (2010).
Article ADS CAS Google Scholar
Huang, X. et al. Genome-wide association studies of 14 agronomic traits in rice landraces. Nat. Genet. 42, 961–969 (2010).
Article CAS Google Scholar
Jaiswal, V., Mir, R. R., Mohan, A., Balyan, H. S. & Gupta, P. K. Association mapping for pre-harvest sprouting tolerance in common wheat (Triticum aestivum L.). Euphytica 188, 89–102 (2012).
Article CAS Google Scholar
Jaiswal, V. et al. Genome wide single locus single trait., multi-locus and multi-trait association mapping for some important agronomic traits in common wheat (T. aestivum L.). Plos One 11, e0159343 (2016).
Article Google Scholar
Francisco, M. et al. Genome wide association mapping in Arabidopsis thaliana identifies novel genes involved in linking allyl glucosinolate to altered biomass and defense. Front. Plant Sci. 7, 1010 (2016).
PubMed PubMed Central Google Scholar
Gao, L., Turner, M. K., Chao, S., Kolmer, J. & Anderson, J. A. Genome wide association study of seedling and adult plant leaf rust resistance in elite spring wheat breeding lines. Plos One 11, e0148671 (2016).
Article Google Scholar
Jia, G. et al. A haplotype map of genomic variations and genome-wide association studies of agronomic traits in foxtail millet (Setaria italica). Nat. Genet. 45, 957–961 (2013).
Article CAS Google Scholar
Gupta, S., Kumari, K., Muthamilarasan, M., Parida, S. K. & Prasad, M. Population structure and association mapping of yield contributing agronomic traits in foxtail millet. Plant Cell Rep. 33, 881–893 (2014).
Article CAS Google Scholar
Gupta, P. K., Kulwal, P. L. & Jaiswal, V. Association mapping in crop plants, opportunities and challenges. Adv. Genet. 85, 109–148 (2014).
Article CAS Google Scholar
Sehgal, D. et al. Exploring potential of pearl millet germplasm association panel for association mapping of drought tolerance traits. PLoS One 10, e0122165 (2015).
Article Google Scholar
Ma, X. et al. Genome-wide association study for plant height and grain yield in rice under contrasting moisture regimes. Front. Plant Sci. 7, 1801 (2016).
PubMed PubMed Central Google Scholar
Yano, K. et al. Genome-wide association study using whole-genome sequencing rapidly identifies new genes influencing agronomic traits in rice. Nat. Genet. 48, 927 (2016).
Article CAS Google Scholar
Hu, G. et al. Genome-wide association study Identified multiple Genetic Loci on Chilling Resistance during Germination in Maize. Sci. Rep. 7, 1–11 (2017).
Article ADS Google Scholar
Sun, Z. et al. Genome-wide association study discovered genetic variation and candidate genes of fibre quality traits in Gossypium hirsutum L. Plant Biotechnol. J. 15, 982–996 (2017).
Article CAS Google Scholar
Flint-Garcia, S. A. et al. Maize association population: a high‐resolution platform for quantitative trait locus dissection. Plant J. 44, 1054–1064 (2005).
Article CAS Google Scholar
Breseghello, F. & Sorrells, M. E. Association mapping of kernel size and milling quality in wheat (Triticum aestivum L.) cultivars. Genetics 172, 1165–1177 (2006).
Article Google Scholar
Zhao, K. et al. Genome-wide association mapping reveals a rich genetic architecture of complex traits in Oryza sativa. Nat. Commun. 2, 467 (2011).
Article Google Scholar
Anuradha, N. et al. Deciphering genomic regions for high grain iron and zinc content using association mapping in pearl millet. Front. Plant Sci. 8, 412 (2017).
Article CAS Google Scholar
Zhu, C., Gore, M., Buckler, E. S. & Yu, J. Status and prospects of association mapping in plants. Plant Genome 1, 5–20 (2008).
Article CAS Google Scholar
Su, J. et al. Detection of favorable QTL alleles and candidate genes for lint percentage by GWAS in Chinese upland cotton. Front. Plant Sci. 7, 1576 (2016).
PubMed PubMed Central Google Scholar
Su, J. et al. Identification of favorable SNP alleles and candidate genes for traits related to early maturity via GWAS in upland cotton. BMC Genomics 17, 687 (2016).
Article Google Scholar
Gupta, P. K., Rustgi, S. & Kulwal, P. L. Linkage disequilibrium and association studies in higher plants: present status and future prospects. Plant Mol. Biol. 57, 461–485 (2005).
Article CAS Google Scholar
Gaut, B. S. & Long, A. D. The lowdown on linkage disequilibrium. Plant Cell 15, 1502–1506 (2003).
Article CAS Google Scholar
Nie, X. et al. Genome-wide SSR-based association mapping for fiber quality in nation-wide upland cotton inbreed cultivars in China. BMC Genomics 17, 352 (2016).
Article Google Scholar
Kong, A. et al. Fine scale recombination rate differences between sexes, populations and individuals. Nature 467, 1099–1103 (2010).
Article ADS CAS Google Scholar
Gion, J. M. et al. Genome-wide variation in recombination rate in Eucalyptus. BMC Genomics 17, 590 (2016).
Article Google Scholar
Petes, T. D. Meiotic recombination hot spots and cold spots. Nat. Rev. Genet. 2, 360–369 (2001).
Article CAS Google Scholar
Pritchard, J. K., Stephens, M. & Donnelly, P. Inference of population structure using multilocus genotype data. Genetics 155, 945–959 (2000).
CAS PubMed PubMed Central Google Scholar
Yu, J. et al. A unified mixed-model method for association mapping that accounts for multiple levels of relatedness. Nat. Genet. 38, 203–220 (2006).
Article CAS Google Scholar
Qian, H. R. & Huang, S. Comparison of false discovery rate methods in identifying genes with differential expression. Genomics 86, 495–503 (2005).
Article CAS Google Scholar
Kulwal, P. L. et al. Association mapping for pre-harvest sprouting resistance in white winter wheat. Theor. Appl. Genet. 125, 793–805 (2012).
Article CAS Google Scholar
Rong, J. et al. Meta-analysis of polyploid cotton QTL shows unequal contributions of sub genomes to a complex network of genes and gene clusters implicated in lint fiber development. Genetics 176, 2577–2588 (2007).
Article CAS Google Scholar
Kump, K. L. et al. Genome-wide association study of quantitative resistance to southern leaf blight in the maize nested association mapping population. Nat. Genet. 43, 163–168 (2012).
Article Google Scholar
Mauro-Herrera, M. et al. genetic control and comparative genomic analysis of flowering time in Setaria (Poaceae). G3 (Bethesda) 3, 283 (2013).
Article Google Scholar
Liu, X., Huang, M., Fan, B., Buckler, E. S. & Zhang, Z. Iterative usage of fixed and random effect models for powerful and efficient genome-wide association studies. PLOS Genet. 12, 1–24 (2016).
Google Scholar
Rogers, S. O. & Bendich, A. J. Extraction of DNA from milligram amounts of fresh, herbarium and mummified plant tissues. Plant Mol. Biol. 5, 69–76 (1985).
Article CAS Google Scholar
Peterson, B. K., Weber, J. N., Kay, E. H., Fisher, H. S. & Hoekstra, H. E. Double digest RADseq: An inexpensive method for de novo SNP discovery and genotyping in model and non-model species. Plos One 7, e37135 (2012).
Article ADS CAS Google Scholar
Hill, W. G. & Weir, B. S. Variances and covariance of squared linkaged is equilibria in finite populations. Theor. Popul. Biol. 33, 54–78 (1988).
Article CAS Google Scholar
Zhang, T. et al. Variations and transmission of QTL alleles for yield and fiber qualities in upland cotton cultivars developed in China. Plos One 8, e57220 (2013).
Article ADS CAS Google Scholar
Fang, X. et al. A high density genetic map and QTL for agronomic and yield traits in Foxtail millet [Setaria italica (L.) P. Beauv.]. BMC Genomics 17, 336 (2016).
Article Google Scholar
Wang, J. et al. A high-density genetic map and QTL analysis of agronomic traits in foxtail millet [Setaria italica (L.) P. Beauv.] using RAD-seq. Plos one 12, e0179717 (2017).
Article Google Scholar

Download references

Acknowledgements

The study was financially supported by Department of Biotechnology, Ministry of Science and Technology, Government of India [BT/HRD/NBA/37/01/2014(vii)]. V.J., V.G. and M.M. acknowledge the DST-INSPIRE Faculty Awards received from Department of Science and Technology, Ministry of Science and Technology, Government of India. The authors thank Dr. Swarup K Parida, Scientist, National Institute of Plant Genome Research, New Delhi for his expert comments in improving the manuscript. Ms. Annvi Dhaka of National Institute of Plant Genome Research, New Delhi is acknowledged for her assistance in the field work. The authors are thankful to DBT-eLibrary Consortium (DeLCON) for providing access to e-resources.

Author information

Authors and Affiliations

School of Life Science, Jawaharlal Nehru University, New Delhi, 110067, India
Vandana Jaiswal, Sarika Gupta & Nirala Ramchiary
Department of Plant Molecular Biology, University of Delhi South Campus, New Delhi, 110021, India
Vijay Gahlaut
National Institute of Plant Genome Research, New Delhi, 110067, India
Mehanathan Muthamilarasan, Tirthankar Bandyopadhyay & Manoj Prasad
ICAR-National Research Centre on Plant Biotechnology, LBS Centre, Pusa Campus, New Delhi, 110012, India
Mehanathan Muthamilarasan

Authors

Vandana Jaiswal
View author publications
You can also search for this author in PubMed Google Scholar
Sarika Gupta
View author publications
You can also search for this author in PubMed Google Scholar
Vijay Gahlaut
View author publications
You can also search for this author in PubMed Google Scholar
Mehanathan Muthamilarasan
View author publications
You can also search for this author in PubMed Google Scholar
Tirthankar Bandyopadhyay
View author publications
You can also search for this author in PubMed Google Scholar
Nirala Ramchiary
View author publications
You can also search for this author in PubMed Google Scholar
Manoj Prasad
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M.P. conceived and designed the experiments. V.J., S.G., V.G. and T.B. performed the experiments. V.J., V.G. and N.R. analyzed the results. V.J., V.G. and M.M. wrote the manuscript. M.P. approved the final version of the manuscript.

Corresponding author

Correspondence to Manoj Prasad.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Jaiswal, V., Gupta, S., Gahlaut, V. et al. Genome-Wide Association Study of Major Agronomic Traits in Foxtail Millet (Setaria italica L.) Using ddRAD Sequencing. Sci Rep 9, 5020 (2019). https://doi.org/10.1038/s41598-019-41602-6

Download citation

Received: 17 October 2018
Accepted: 05 March 2019
Published: 22 March 2019
DOI: https://doi.org/10.1038/s41598-019-41602-6

This article is cited by

New insights into QTNs and potential candidate genes governing rice yield via a multi-model genome-wide association study
- Supriya Sachdeva
- Rakesh Singh
- Gyanendra Pratap Singh
BMC Plant Biology (2024)
Multi-environment GWAS identifies genomic regions underlying grain nutrient traits in foxtail millet (Setaria italica)
- Vandana Jaiswal
- Tirthankar Bandyopadhyay
- Manoj Prasad
Plant Cell Reports (2024)
Efficient identification of QTL for agronomic traits in foxtail millet (Setaria italica) using RTM- and MLM-GWAS
- Keli Dai
- Xin Wang
- Xianmin Diao
Theoretical and Applied Genetics (2024)
The potentialities of omics resources for millet improvement
- Banshidhar
- Saurabh Pandey
- Satish Kumar Singh
Functional & Integrative Genomics (2023)
Recombinant inbred lines and next-generation sequencing enable rapid identification of candidate genes involved in morphological and agronomic traits in foxtail millet
- Kenji Fukunaga
- Akira Abe
- Makoto Kawase
Scientific Reports (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Trait distribution and correlation

Distribution of SNPs on foxtail millet chromosomes

Linkage Disequilibrium

Genome-wide marker-trait associations

High confidence marker-trait association

Allele effect and identification of desirable allele

Identification of putative candidate genes

Pyramiding effect and desirable genotypes

Discussion

Materials and Methods

Plant material and phenotyping

ddRAD sequencing and SNP calling

Linkage disequilibrium

Marker-trait associations

Allele effect and pyramiding effect of desirable alleles; identification of desirable genotypes

Identification of putative candidate genes

Data Availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing Interests

Additional information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links