Abstract
Genome-wide association study (GWAS) is effective in identifying favorable alleles for traits of interest with high mapping resolution in crop species. In this study, we conducted GWAS to explore quantitative trait loci (QTL) for eight fruit traits using 162 tomato accessions with diverse genetic backgrounds. The eight traits included fruit weight, fruit width, fruit height, fruit shape index, pericarp thickness, locule number, fruit firmness, and brix. Phenotypic variations of these traits in the tomato collection were evaluated with three replicates in field trials over three years. We filtered 34,550 confident SNPs from the 51 K Axiom® tomato array based on < 10% of missing data and > 5% of minor allele frequency for association analysis. The 162 tomato accessions were divided into seven clusters and their membership coefficients were used to account for population structure along with a kinship matrix. To identify marker-trait associations (MTAs), four phenotypic data sets representing each of three years and combined were independently analyzed in the multilocus mixed model (MLMM). A total of 30 significant MTAs was detected over data sets for eight fruit traits at P < 0.0005. The number of MTA per trait ranged from one (brix) to seven (fruit weight and fruit width). Two SNP markers on chromosomes 1 and 2 were significantly associated with multiple traits, suggesting pleiotropic effects of QTL. Furthermore, 16 of 30 MTAs suggest potential novel QTL for eight fruit traits. These results facilitate genetic dissection of tomato fruit traits and provide a useful resource to develop molecular tools for improving fruit traits via marker-assisted selection and genomic selection in tomato breeding programs.
Similar content being viewed by others
Introduction
Tomato (Solanum lycopersicum L.) is an economically important crop species in the Solanaceae family, which includes potato, pepper, and eggplant. It is cultivated worldwide and one of the most consumed vegetables. In 2018, the world production of tomato exceeded 182 million tons from 4.76 million ha1. With its economic value, large efforts have been made to improve horticultural traits and disease resistance in tomato breeding programs. Tomato has diverse genetic variations in fruit traits, such as shape, size, and weight. Therefore, QTL mapping has been extensively conducted using bi-parental populations for genetic dissection of fruit traits and several major genes were identified2,3,4,5,6,7. QTL detection in the structured populations derived from two parents has the disadvantage of low mapping resolution due to limited recombination events8,9.
As an effective mapping method for complex traits, genome-wide association study (GWAS) allows to identify the tight linkage between marker and QTL with dense genome coverage in the unstructured populations, such as collections of germplasm and breeding lines. These GWAS panels have higher recombination rates to increase mapping resolutions relative to bi-parental populations10. In addition, diverse alleles for a trait of interest can be explored in these populations11,12. The discovery of genome-wide single nucleotide polymorphisms (SNPs) has facilitated GWAS in crop species. As the most common type of sequence variation, SNPs are suitable for high-throughput genotyping with automation. Advances in next-generation sequencing (NGS) technology have led to an accumulation of SNPs. In tomato, a NGS-based transcriptome analysis of five cultivated varieties and one wild species generated 17 Gb of sequences and identified 62,576 non-redundant SNPs13. Of these, 8784 SNPs were used to develop the first high-throughput genotyping array14. Whole-genome sequencing of diverse tomato accessions also identified a large number of SNPs across 12 chromosomes15,16. Furthermore, a total of 51,912 SNPs was detected by resequencing 96 large-fruit commercial varieties with a mean depth of 1.9x and these SNPs were used to develop the Axiom tomato genotyping array17.
In addition to NGS-based SNP discovery, several statistical models have been developed to improve the accuracy and efficiency of GWAS10,18,19. With these advances, GWAS has been successfully conducted to explore allele variations for fruit quality and morphology in tomato. Several marker-trait associations (MTAs) were detected for phenolic compounds, ascorbic acid, β-carotene, trans-lycopene, and titratable acidity using the worldwide collection of 96 accessions representing landraces, vintage, and modern varieties20. GWAS in 163 tomato accessions identified a total of 44 candidate loci for 19 fruit metabolites, including amino acids, sucrose, malate, ascorbate, and citrate21. Two mapping populations were also used to investigate the genetic architecture of tocochromanol content in tomato fruit22. Genetic dissection of tomato flavor was also conducted and a large number of significant associations was found for flavor-related traits23,24,25. For fruit morphological traits, a number of favorable alleles was detected by GWAS in the tomato collections26,27. A recent study investigated genetic variations for six fruit traits in 192 tomato accessions and identified a total of 54 loci associated with these traits28. In addition, a germplasm collection of 163 accessions representing S. lycopersicum and S. pimpinellifolium was used to identify genomic regions associated with fruit, flower, and vegetative traits via GWAS29. This study revealed a total of 107 MTAs for eight quantitative traits, including fruit weight and locule number.
Although a number of loci associated with fruit traits was found in the previous studies, these loci are responsible for partial genetic variations of each trait in tomato. Therefore, the present study was conducted to explore novel QTL for eight fruit traits in a collection of 162 tomato accessions representing different genetic backgrounds from the previous GWAS panels. The eight fruit traits used in our study included fruit weight, fruit width, fruit height, fruit shape index, pericarp thickness, locule number, fruit firmness, and brix. GWAS with phenotypic data from field trials over three years identified a number of potential novel QTL along with previously known genes. These results will be a useful resource to develop breeder’s toolboxes for marker-assisted selection and genomic selection in tomato breeding programs.
Results
Genome-wide SNP identification
The 51,214 SNPs of the Axiom® tomato array were polymorphic in the 162 tomato accessions. Of these, 34,550 SNPs were filtered with missing data rate (< 10%) and minor allele frequency (> 5%). These confident SNPs were distributed over 12 chromosomes and covered a total of 751.75 Mb with a range of 45.52 Mb on chromosome 6–90.24 Mb on chromosome 1 (Table 1). The number of SNPs per chromosome ranged from 1292 (chromosome 7) to 5469 (chromosome 1). In addition, the average of marker intervals across all 12 chromosomes was 0.021 Mb, ranging from 0.013 Mb on chromosome 11 to 0.050 Mb on chromosome 7 (Table 1). The largest gap of 21.60 Mb was found on chromosome 9, while the maximum marker intervals were 1.05–15.71 Mb on the other chromosomes.
Phenotypic variations of fruit traits in the tomato collection
The 162 tomato accessions showed wide ranges of phenotypic variations for eight fruit traits, including fruit weight, fruit width, fruit height, fruit shape index, pericarp thickness, locule number, fruit firmness, and brix (Figs. 1 and S1). Fruit weight ranged from 9.66 to 315.80 g, and the means of each year were 68.15 g in 2018, 74.00 g in 2019, and 96.27 g in 2020. The Pearson correlation coefficients between the three years were 0.75–0.85 (Table 2). We found phenotypic variations of 18.08–85.06 mm for fruit width and 25.82–86.30 mm for fruit height. These traits also showed high levels of correlation over three years with coefficients of 0.78–0.86 for width and 0.78–0.83 for height. Fruit shape index showed the highest correlation ranging of 0.93–0.95 in 2018–2020 with the phenotypic variations of 0.46–2.27 with means of 1.05–1.12 (Fig. 1 and Table 2). These results indicate that the tomato collection represents diverse fruit sizes and shapes.
For the other traits, substantial phenotypic variations were observed for pericarp thickness and locule number with the correlation coefficients of 0.76–0.85 and 0.83–0.88 over three years (Fig. 1 and Table 2). Fruit firmness showed the means of 39.42 kgf/cm2 in 2018, 58.22 kgf/cm2 in 2019, and 62.05 kgf/cm2 in 2020. In addition, the 2018 phenotypic data of this trait showed a low correlation coefficient of 0.33 with each of the 2019 and 2020 data relative to the coefficient of 0.81 between 2019 and 2020 (Table 2). This is due to the difference between destructive (2018) and non-destructive (2019 and 2020) methods. Brix ranged from 3.10 to 9.09% over three years with the means of 5.18–5.51% and correlation coefficients of 0.45–0.72 (Fig. 1 and Table 2). The phenotypic data of three years for eight fruit traits were used for GWAS without normalization.
Identification of marker-trait associations for fruit traits
The 34,550 confident SNPs were used to infer a population structure in the 162 tomato accessions representing 29 small fruit (round, cylinder, and oval), 119 medium fruit (flat, cylinder, oval, and round), and 14 large fruit (flat) germplasm. The delta K method30 suggested that the best K (number of clusters) was seven in the model-based clustering analysis and the number of tomato accessions per cluster ranged from eight (cluster 7) to 46 (cluster 1) (Fig. 2 and Table S1). Cluster 1, which is the largest, consisted of 42 medium fruit accessions and four large fruit accessions. Of these medium fruit accessions, the oval shape was dominant (20 accessions) followed by 11 cylinder, seven flat, and four round accessions. Cluster 2 included 32 medium fruit accessions (18 flat, seven round, five oval, and two cylinder accessions). Six large fruit accessions were also found with these medium fruit accessions in this cluster. The other 44 medium fruit accessions were divided into clusters 3 (23 accessions), 4 (14 accessions), and 5 (seven accessions). In these clusters, we also found two small fruit accessions (cluster 3), one large fruit accession (cluster 4), and three large fruit accessions (cluster 5). Cluster 6 consisted of only 20 small fruit accessions (17 cylinder, two oval, and one round). Similarly, the dominant accession in cluster 7 was small fruit accessions (five round and two oval plum). This cluster also included a medium flat fruit accession (Fig. 2 and Table S1). In addition, the hierarchical clustering analysis based on Nei’s genetic distance also found that most of the accessions were grouped as shown in the seven clusters (Fig. S2). Considering geographic relations, 104 of 162 tomato accessions were collected from India and distributed into six clusters excluding cluster 7 (Table S1). Furthermore, we found no country-specific clusters in the other 58 accessions. Semi-determinant and determinant accessions were found together in the same cluster. Therefore, the population structure in the tomato collection is likely due to multiple factors, such as fruit size and pedigree.
GWAS using the 34,550 confident SNPs identified a total of 30 significant marker-trait associations (MTAs) for eight fruit traits at P < 0.0005 (Table 3, Fig. 3, and S3). These MTAs were repeatedly detected in at least two of four phenotypic data sets (each of three years and combined). Of these, we found that 16 MTAs were significant at P < 0.00005 that was determined as a genome-wide significance threshold based on 1677 SNPs, the effective number of independent markers31. For fruit weight, seven MTAs were found on chromosomes 1, 2, 4, 8, and 10 at P < 0.0005, and five of these MTAs also showed significance at P < 0.00005 (Table 3 and Fig. 3). The phenotypic variance explained (PVE) for two MTAs on chromosomes 1 and 8 ranged from 11.98 to 28.95%, while the PVE for other MTAs were < 10%. In addition, two MTAs were detected within several Mb distances on both chromosomes 1 (11.46 Mb) and 2 (8.88 Mb). We found significant MTAs for fruit shape-related traits, fruit width (seven MTAs), fruit height (three MTAs), and fruit shape index (four MTAs) at P < 0.0005 (Table 3 and Fig. 3). Two MTAs for fruit width on chromosomes 1 and 2 explained 18.66–25.79% of phenotypic variance. Of these, the SLA773077 marker on chromosome 1 showed significant associations with both fruit weight and width. Furthermore, the 2nd MTA on chromosome 2 was found at 9.23 Mb away from the 1st MTA and its marker (SLA773357) was significantly associated with both the fruit weight and width (Table 3 and Fig. 3). These MTAs on chromosome 2 were also detected at P < 0.00005. For fruit height, one of two MTAs on chromosome 4 showed significance at P < 0.000005 and its PVE ranged from 20.27 to 42.60%. Another MTA was found on chromosome 8, explaining 15.44% (2018) and 9.82% (2019) of phenotypic variance. Three of four MTAs for fruit shape index were detected on chromosomes 2, 4, and 12 at P < 0.00005 (Table 3 and Fig. 3). The MTA on chromosome 2 showed large effects (up to 31.46% in the combined data), while the other MTAs explained < 10% of phenotypic variance.
We detected four MTAs for pericarp thickness on chromosomes 2, 9, and 12 at P < 0.0005 (Table 3 and Fig. 3). The SLA769530 marker on chromosome 9 was also associated with this trait at P < 0.00005 in the 2018 data. Two MTAs on chromosome 2 were found at 35.89 and 50.74 Mb, respectively. The 1st MTA explained up to 29.22% of phenotypic variance in the combined data. In contrast, the other MTAs including the 2nd MTA on chromosome 2 showed small effects (< 5%). For locule number, four MTAs were detected on chromosomes 2, 3, 6, and 10 at P < 0.0005, and three of these (excluding one on chromosome 6) also showed significance at P < 0.00005 (Table 3 and Fig. 3). Two of these MTAs on chromosomes 2 and 3 explained from 10.47 to 15.38% of phenotypic variance, while the other MTAs explained < 5%. In addition, the SLA773357 marker on chromosome 2 was significantly associated with not only locule number but also the fruit weight and width (Table 3 and Fig. 3). Association analysis with fruit firmness identified three MTAs on chromosomes 2, 4, and 8 at P < 0.0005 (Table 3 and Fig. 3). The PVE for two MTAs on chromosome 4 and 8 ranged from 14.94 to 23.73%. The MTA on chromosome 2 was significantly found at P < 0.000005 in the 2020 data and its PVE was 16.35%. For brix, only one significant MTA was detected on chromosome 9 at P < 0.00005 (2018) and P < 0.0005 (combined), explaining 26.22 and 28.73% of phenotypic variance in these data (Table 3 and Fig. 3).
Discussion
In this study, a collection of 162 tomato accessions was used to identify favorable alleles associated with eight fruit traits using genome-wide SNPs. Their phenotypic variations of the traits were evaluated in field trials over three years. The observed large phenotypic variation of each trait in every year suggests that the tomato accessions originated from diverse genetic backgrounds. This genetic diversity provided an opportunity to explore novel QTL for improving the fruit traits in tomato breeding programs. Furthermore, six traits excluding fruit firmness and brix revealed high correlation coefficients (0.75–0.95) between the phenotypic data collected over three years. For fruit firmness, the 2018 phenotypic data showed the coefficient of 0.33 to each of the 2019 and 2020 data, while the coefficient between 2019 and 2020 was 0.81. We used a digital destructive penetrometer in 2018 and a non-destructive penetrometer in 2019 and 2020. Although both destructive and non-destructive penetrometers are commonly used to measure fruit firmness in crop species, our result demonstrates that these types of penetrometer can generate inconsistent measurements in tomato. Brix also showed relatively low correlation coefficients, especially when the phenotypic data in 2020 was compared with each of the 2018 and 2019 data. It is likely due to high precipitation and temperature in the 2020 growing season. In addition, this result is supported by lower heritability of brix (0.63) than other fruit traits, such as fruit weight (0.83) and locule number (0.85) reported in a previous study, indicating that this trait is more sensitive to environmental variations7,36.
For GWAS, we used the multilocus mixed model (MLMM) that effectively reduces false positives and false negatives18. A total of 30 significant MTAs was found with at least two phenotypic data sets for eight fruit traits at P < 0.0005. In addition, candidate genes for 10 MTAs were found using the tomato genome assembly SL4.0 and ITAG 4.0 (Table S2). Of the 30 MTAs, 14 likely represent previously known loci for fruit traits. There are three major genes for tomato fruit development on chromosome 2, including fw2.2 for fruit weight at 50.29 Mb37, ovate for fruit shape at 46.38 Mb38, and lc for locule number at 45.19 Mb4. Two MTAs were detected in the genomic regions of these major genes on chromosome 2. One of these MTAs at 44.07 Mb showed significant associations with three traits (fruit weight, fruit width, and locule number), while another MTA was significant for the fruit shape index. We also found two additional MTAs for the fruit weight (35.19 Mb) and fruit width (34.84 Mb) in the known QTL region on chromosome 228. The other three MTAs for fruit weight are likely to correspond to the known QTL on chromosomes 1, 4, and 1028,33,39. For the fruit shape index, two MTAs were found in the previously reported QTL regions on chromosomes 4 and 122,39. Furthermore, two MTAs for pericarp thickness were found 0.45 Mb and 0.70 Mb away from the lc gene4 and known QTL28 for fruit weight on chromosome 2, respectively. A QTL for this trait was previously reported in the vicinity of our MTA on chromosome 122. For brix, a single MTA was detected at 62.64 Mb on chromosome 9, located 3.85 Mb away from a known QTL7.
Interestingly, 16 MTAs were identified in genomic regions without previously known loci for eight fruit traits, suggesting discovery of novel QTL. For six of these MTAs, we found candidate genes that are related to fruit development. The GWAS panel used in this study consisted of determinate and semi-determinate tomato accessions that mostly originated from Southern and Western Asian countries. Therefore, this collection is more likely to represent different genetic backgrounds relative to the mapping populations of previous studies2,7,27,28,32,33,34,35. For example, we found that the fruit height ranged from 25.82 to 86.30 mm with a mean of 52.22 mm, while another population showed 17.46–111.47 mm with a mean of 44.69 mm28. Similarly, different phenotypic variations between populations were found for the fruit weight, fruit width, and pericarp thickness. In addition, the fruit shape index used in this study was measured differently from the fruit shape. The first was determined based on the ratio of maximum height and to maximum width using the Tomato Analyser software, while the second was based on 1–9 scales28. This distinction could lead to novel QTL identification for the fruit traits in our study.
Two SNP markers showed significant associations with multiple traits, suggesting that corresponding QTL have pleiotropic effects. The SLA773077 marker on chromosome 1 was associated with both the fruit weight and fruit width. The other marker (SLA773357) on chromosome 2 showed associations with the fruit weight, fruit width, and locule number. Since phenotypic correlations between the fruit traits of tomato have been reported in the present and previous studies28,40,41, identification of QTL with pleiotropic effects was expected. However, we found no QTL associated with both fruit height and shape index, even though the traits are correlated. In addition, a previous study found several QTL with pleiotropic effects between the fruit weight and fruit height28. This result may be due to a small number of MTAs for the fruit height and fruit shape index in our study.
Detection of year or environment specific MTAs commonly occurred in the association mapping studies of tomato fruit traits7,28,34. We also found that a few MTAs were detected in all three years. In addition, 14 of 30 MTAs explained < 10% of phenotypic variations for eight fruit traits. Therefore, these MTAs can represent small effect QTL that are easily affected by environmental variations. For marker-assisted selection, large effect QTL has been commonly used in crop breeding programs because this approach was cost-effective and rapid to improve traits of interest. In contrast, MAS has been unsuccessful for complex quantitative traits that are controlled by small effect QTL42,43. Genomic selection (GS) has emerged as an alternative to overcome the limitations of MAS for these traits and predicts the breeding values of individuals using a large number of genome-wide markers43,44. Recently, it was reported that the use of QTL-associated markers increased the prediction accuracy of GS in several crops45,46,47. In this aspect, the MTAs from our study will be useful for GS in tomato breeding programs.
In conclusion, we reported a total of 30 MTAs for eight fruit traits in a collection of 162 tomato accessions. Of these, 16 MTAs represent potential novel QTL for six fruit traits and in silico analysis found candidate genes in the genomic regions of six MTAs. The resulting SNP markers and candidate genes for these MTAs are a useful resource for further characterization of novel QTL via the fine mapping and gene editing approaches. These MTAs can also be used to investigate a GS method with greater prediction accuracy for fruit traits in tomato. Therefore, our results will benefit the tomato research community by providing an additional tool to breeders for elite cultivar development.
Materials and methods
Plant materials and genotyping
The 162 tomato accessions used in this study were derived from a private breeding program and originated from seven countries, including India, China, Turkey, and Israel (Table S1). This collection consisted of determinate and semi-determinate accessions with diverse morphological variations for fruit traits, representing 29 small fruit (< 25 g), 119 medium fruit (25–130 g), and 14 large fruit (> 130 g) tomatoes. For each accession, genomic DNA was isolated using fresh and young leaf tissues from 4-week-old seedlings according to a modified cetyl trimethyl ammonium bromide (CTAB) method48. The isolated DNA pellets were resuspended with T1/10E buffer (10 mM Tris-HCl pH 8.0, 0.1 mM EDTA). The quality and quantity of DNA were measured using the NanoDropTM One spectrophotometer (Thermo Fisher Scientific, Waltham, MA, USA). The final concentration of DNA was adjusted to 50 ng/μL for SNP array-based genotyping.
The collection of 162 tomato accessions was genotyped using the 51 K Axiom® tomato array containing 51,912 SNPs17. For this genotyping, 200 ng of genomic DNA from each sample was amplified and randomly fragmented into 25–125 bp using the Axiom® 2.0 reagent kit (Thermo Fisher Scientific, Waltham, MA, USA). The DNA fragments were hybridized to the array in the Affymetrix® GeneTitan system according to the manufacturer’s instructions. The hybridization signals in the form of CEL files were processed using the Affymetirx® Power Tools software package v1.18 for SNP calling. The high-quality SNPs were filtered based on < 10% of missing data, > 5% of minor allele frequency. For the resulting SNPs, missing data were imputed using BEAGLE v5 with default parameter settings49.
Phenotypic evaluation
We evaluated phenotypic variations of fruit weight, fruit width, fruit height, fruit shape index, pericarp thickness, locule number, fruit firmness, and brix over three years (2018–2020) of field trials in the 162 tomato accessions. Plants were first grown in a greenhouse, and 6-7-week-old seedlings were transplanted into plastic-covered fields (high-tunnel) with 30 cm spacing between plants. The field trials were conducted using a randomized complete block design with three replications per genotype and there were four plants per replication. For phenotypic evaluation, fully ripe fruits were harvested from the 2nd to 4th flowering clusters, and 4–10 fruits per replicate for each genotype were used. Image analysis was conducted using the Tomato Analyzer (TA) v4.0 software50 for fruit height, fruit width, fruit shape index, locule number, and pericarp thickness. For this analysis, we used ten fruits for small fruit accessions, six fruits for medium fruit accessions, and four fruits for large fruit accessions. Fruits were longitudinally and horizontally cut through the center, placed cut-side down on a scanner, and digitalized according to the user manual of TA37. For fruit weight, we used average values of five fruits per replicate. Brix was measured using a PAL-1 refractometer (ATAGO, WA, USA). For fruit firmness, we used the Digital Fruit Firmness Penetrometer (Agriculture Solutions, ME, USA) in 2018 and HPE II Fff (Bareiss, Oberdischingen, Germany) in 2019–2020. The phenotypic data collected in 2018–2020 were independently used for association analysis. An additional data set was also generated by combining those of all three years for seven traits excluding fruit firmness. The combined data set for fruit firmness was based on the two year data (2019 and 2020) due to the use of different penetrometer types. The outliers for the combined data were removed using the IQR method51 in R. This data set from multiple years was also used in further association analysis to confirm the result.
Population structure and association analysis
Population structure in the tomato collection was inferred using the STRUCTURE v.2.3.4 program52. The STRUCTURE model used in this study allows for admixture and correlated allele frequencies. To determine the best K (number of clusters), we performed 10 independent simulations for each 10 Ks (1–10) with a burn-in period of 10,000 iterations and a Markov Chain Monte Carlo (MCMC) run length of 10,000 iterations. After the 1st round of analysis, six Ks (4–9) were selected for further simulations with a burn-in period of 20,000 iterations and a MCMC run length of 100,000 iterations. The resulting log-likelihood estimates for the Ks were used to find the best K in the delta K method30. A population structure matrix (Q matrix) was then generated using the membership coefficients of 162 tomato accessions based on the best K. In addition, hierarchical clustering was conducted using the R packages. The Nei’s genetic distances53 were estimated between tomato accessions using the poppr package54 and then hierarchical clustering analysis was conducted with an unweighted pair group method with arithmetic mean (UPGMA).
To identify marker-trait associations (MTAs) for eight fruit traits, we performed association analysis using the multilocus mixed model (MLMM)18 implemented in genomic association and prediction integrated tool (GAPIT)55. Both Q and kinship matrices were used as covariates to reduce false-positive associations due to population structure and familial relatedness10. The kinship matrix was generated using the VanRaden algorithm56. Significant MTAs were first detected at P < 0.0005. We also used a genome-wide threshold (P < 0.00005) that was determined based on the effective number of independent markers, Me31. The Me was estimated using the Genetic Type I Error Calculator (GEC) software (http://pmglab.top/gec/#/) and a genome-wide threshold was calculated with the equation, 0.05/Me. The phenotypic variance explained (PVE) by a significant marker was estimated using the equation in R:
where SS is the sum of square and e is the residuals from the ANOVA fitted with a linear model incorporating the phenotypic data and all significant markers57. Candidate genes for MTAs detected in this study were investigated using the tomato reference genome assembly SL4.0 and the international tomato annotation group (ITAG) 4.0 at the Sol Genomics Network (https://solgenomics.net).
References
Food & Agriculture Organization of the United Nation. FAOSTAT statistical database. https://search.library.wisc.edu/catalog/999890171702121 (2020).
Celik, I., Gurbuz, N., Uncu, A. T., Frary, A. & Doganlar, S. Genome-wide SNP discovery and QTL mapping for fruit quality traits in inbred backcross lines (IBLs) of solanum pimpinellifolium using genotyping by sequencing. BMC Genomics. 18, 1 (2017).
Lippman, Z. & Tanksley, S. D. Dissecting the genetic pathway to extreme fruit size in tomato using a cross between the small-fruited wild species Lycopersicon pimpinellifolium and L. esculentum var. Giant Heirloom. Genetics. 158, 413–422 (2001).
Munos, S. et al. Increase in tomato locule number is controlled by two single-nucleotide polymorphisms located near WUSCHEL. Plant Physiol. 156, 2244–2254 (2011).
Ranc, N. et al. Genome-wide association mapping in tomato (Solanum lycopersicum) is possible using genome admixture of Solanum lycopersicum var cerasiforme. G3 (Bethesda). 2, 853–864 (2012).
Rodriguez, G. R., Kim, H. J. & van der Knaap, E. Mapping of two suppressors of OVATE (sov) loci in tomato. Heredity. 111, 256–264 (2013).
Xu, J. et al. Phenotypic diversity and association mapping for fruit quality traits in cultivated tomato and related species. Theor. Appl Genet. 126, 567–581 (2013).
Holland, J. B. Genetic architecture of complex traits in plants. Curr. Opin. Plant Biol. 10, 156–161 (2007).
Perez-de-Castro, A. M. et al. Application of genomic tools in plant breeding. Curr. Genomics. 13, 179–195 (2012).
Yu, J. & Buckler, E. S. Genetic association mapping and genome organization of maize. Curr. Opin. Biotechnol. 17, 155–160 (2006).
Gupta, P. K., Rustgi, S. & Kulwal, P. L. Linkage disequilibrium and association studies in higher plants: present status and future prospects. Plant Mol. Biol. 57, 461–485 (2005).
Zhu, C., Gore, M., Buckler, E. S. & Yu, J. Status and prospects of association mapping in plants. Plant Genome-Us. 1, 5–20 (2008).
Hamilton, J. P. et al. Single nucleotide polymorphism discovery in cultivated tomato via sequencing by synthesis. Plant Genome-Us. 5, 17–29 (2012).
Sim, S. C. et al. High-density SNP genotyping of tomato (Solanum lycopersicum L.) reveals patterns of genetic variation due to breeding. PLoS ONE. 7, e45520 (2012).
Consortium, T. G. S. et al. Exploring genetic variation in the tomato (Solanum section Lycopersicon) clade by whole‐genome sequencing. Plant J. 80, 136–148 (2014).
Liu, H. et al. An evaluation of genotyping by sequencing (GBS) to map the Breviaristatum-e (ari-e) locus in cultivated barley. BMC Genomics. 15, 104 (2014).
Yamamoto, E. et al. A simulation-based breeding design that uses whole-genome prediction in tomato. Sci. Rep. 6, 19454 (2016).
Segura, V. et al. An efficient multi-locus mixed-model approach for genome-wide association studies in structured populations. Nat. Genet. 44, 825–830 (2012).
Zhang, Z. et al. Mixed linear model approach adapted for genome-wide association studies. Nat. Genet. 42, 355–360 (2010).
Ruggieri, V. et al. An association mapping approach to identify favourable alleles for tomato fruit quality breeding. BMC Plant Biol. 14, 337 (2014).
Sauvage, C. et al. Genome-wide association in tomato reveals 44 candidate loci for fruit metabolic traits. Plant Physiol. 165, 1120–1132 (2014).
Burgos, E. et al. Validated MAGIC and GWAS population mapping reveals the link between vitamin E content and natural variation in chorismate metabolism in tomato. Plant J.: Cell Mol. Biol. 105, 907–923 (2021).
Tieman, D. et al. A chemical genetic roadmap to improved tomato flavor. Science. 355, 391–394 (2017).
Zhang, J. et al. Genome-wide association mapping for tomato volatiles positively contributing to tomato flavor. Front Plant Sci. 6, 1042 (2015).
Zhao, J. et al. Meta-analysis of genome-wide association studies provides insights into genetic control of tomato flavor. Nat. Commun. 10, 1534 (2019).
Lin, T. et al. Genomic analyses provide insights into the history of tomato breeding. Nat. Genet. 46, 1220–1226 (2014).
Sacco, A. et al. Exploring a tomato landraces collection for fruit-related traits by the aid of a high-throughput genomic platform. PLoS ONE. 10, e0137139 (2015).
Phan, N. T. et al. Identification of loci associated with fruit traits using genome-wide single nucleotide polymorphisms in a core collection of tomato (Solanum lycopersicum L.). Sci. Hortic.-Amst. 243, 567–574 (2019).
Mata-Nicolás, E. et al. Exploiting the diversity of tomato: the development of a phenotypically and genetically detailed germplasm collection. Hortic. Res. 7, 1–14 (2020).
Evanno, G., Regnaut, S. & Goudet, J. Detecting the number of clusters of individuals using the software STRUCTURE: a simulation study. Mol. Ecol. 14, 2611–2620 (2005).
Li, M. X., Yeung, J. M., Cherny, S. S. & Sham, P. C. Evaluating the effective numbers of independent tests and significant p-value thresholds in commercial genotyping arrays and public imputation reference datasets. Hum. Genet 131, 747–756 (2012).
Adhikari, P., McNellie, J. & Panthee, D. R. Detection of quantitative trait loci (QTL) associated with the fruit morphology of tomato. Genes (Basel) 11, 1117 (2020).
Brekke, T. D., Stroud, J. A., Shaw, D. S., Crawford, S. & Steele, K. A. QTL mapping in salad tomatoes. Euphytica 215, 115 (2019).
Liu, X., Huang, M., Fan, B., Buckler, E. S. & Zhang, Z. Iterative usage of fixed and random effect models for powerful and efficient genome-wide association studies. PLoS Genet 12, e1005767 (2016).
Safaei, M. et al. Four genetic loci control compact plant size with yellow pear-shaped fruit in ornamental tomato (Solanum lycopersicum L.). Plant Genome-Us 13, e20017 (2020).
Gautier, H. et al. How does tomato quality (sugar, acid, and nutritional quality) vary with ripening stage, temperature, and irradiance? J. Agric. Food Chem. 56, 1241–1250 (2008).
Frary, A. et al. fw2.2: a quantitative trait locus key to the evolution of tomato fruit size. Science 289, 85–88 (2000).
Liu, J., Van Eck, J., Cong, B. & Tanksley, S. D. A new class of regulatory genes underlying the cause of pear-shaped tomato fruit. Proc. Natl Acad. Sci. USA. 99, 13302–13306 (2002).
Liu, X., Geng, X., Zhang, H., Shen, H. & Yang, W. Association and genetic identification of loci for four fruit traits in tomato using InDel markers. Front Plant Sci. 8, 1269 (2017).
Grandillo, S., Ku, H. M. & Tanksley, S. D. Identifying the loci responsible for natural variation in fruit size and shape in tomato. Theor. Appl. Genet. 99, 978–987 (1999).
Hernández-Bautista, A. et al. Fruit size QTLs affect in a major proportion the yield in tomato. Chil. J. Agric. Res. 75, 402–409 (2015).
Bernardo, R. Bandwagons I, too, have known. Theor. Appl. Genet. 129, 2323–2332 (2016).
Heffner, E. L., Sorrells, M. E. & Jannink, J. L. Genomic selection for crop improvement. Crop Sci. 49, 1–12 (2009).
Crossa, J. et al. Genomic selection in plant breeding: methods, models, and perspectives. Trends Plant Sci. 22, 961–975 (2017).
Liabeuf, D., Sim, S. C. & Francis, D. M. Comparison of marker-based genomic estimated breeding values and phenotypic evaluation for selection of bacterial spot resistance in tomato. Phytopathology. 108, 392–401 (2018).
Spindel, J. E. et al. Genome-wide prediction models that incorporate de novo GWAS are a powerful new tool for tropical rice improvement. Heredity. 116, 395–408 (2016).
Zhang, H., Yin, L., Wang, M., Yuan, X. & Liu, X. Factors affecting the accuracy of genomic selection for agricultural economic traits in maize, cattle, and pig populations. Front Genet. 10, 189 (2019).
Kabelka, E., Franchino, B. & Francis, D. M. Two loci from Lycopersicon hirsutum LA407 confer resistance to strains of Clavibacter michiganensis subsp. michiganensis. Phytopathology. 92, 504–510 (2002).
Browning, B. L., Zhou, Y. & Browning, S. R. A One-Penny Imputed Genome from Next-Generation Reference Panels. Am. J. Hum. Genet. 103, 338–348 (2018).
Brewer, M. T. et al. Development of a controlled vocabulary and software application to analyze fruit shape variation in tomato and other plant species. Plant Physiol. 141, 15–25 (2006).
Rohlf, F. J. & Sokal, R. R. Biometry: the principles and practice of statistics in biological research. (Freeman New York, 1981).
Pritchard, J. K., Stephens, M. & Donnelly, P. Inference of population structure using multilocus genotype data. Genetics 155, 945–959 (2000).
Nei, M. Estimation of average heterozygosity and genetic distacne from a small number of individuals. Genetics 89, 583–590 (1978).
Kamvar, Z. N., Brooks, J. C. & Grunwald, N. J. Novel R tools for analysis of genome-wide population genetic data with emphasis on clonality. Front Genet 6, 208 (2015).
Lipka, A. E. et al. GAPIT: genome association and prediction integrated tool. Bioinformatics 28, 2397–2399 (2012).
VanRaden, P. M. Efficient methods to compute genomic predictions. J. Dairy Sci. 91, 4414–4423 (2008).
Knoch, D. et al. Strong temporal dynamics of QTL action on plant growth progression revealed through high‐throughput phenotyping in canola. Plant Biotechnol. J. 18, 68–82 (2020).
Acknowledgements
This work was carried out with the support of Cooperative Research Program for Agriculture Science and Technology Development (Project No. PJ0158982021) and the Next-Generation BioGreen21 Program (Project No. PJ0131142020), Rural Development Administration, Republic of Korea.
Author information
Authors and Affiliations
Contributions
S.S., J.A., and G.K. conceived and designed the project. M.K., T.N., and J.A. performed experiments and analyzed the data. M.K. and T.N. wrote the first draft of the manuscript and SS critically revised the manuscript. All authors reviewed and approved the submitted version of the manuscript.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare no competing interests.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Kim, M., Nguyen, T.T.P., Ahn, JH. et al. Genome-wide association study identifies QTL for eight fruit traits in cultivated tomato (Solanum lycopersicum L.). Hortic Res 8, 203 (2021). https://doi.org/10.1038/s41438-021-00638-4
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41438-021-00638-4
This article is cited by
-
Prediction accuracy of genomic estimated breeding values for fruit traits in cultivated tomato (Solanum lycopersicum L.)
BMC Plant Biology (2024)
-
Development of Tomato Fruit Stage Index (TFSI) to characterise different fruit growth stages of tomato using multivariate techniques
Journal of Food Measurement and Characterization (2024)
-
Genetic architecture of fresh-market tomato yield
BMC Plant Biology (2023)
-
Bayesian estimation of multi-allele QTLs for agricultural traits in tomato using recombinant inbred lines derived from two F1 hybrid cultivars
Euphytica (2023)