Abstract
Phalaenopsis spp. represent the most popular orchids worldwide. Both P. equestris and P. aphrodite are the two important breeding parents with the whole genome sequence available. However, marker–trait association is rarely used for floral traits in Phalaenopsis breeding. Here, we analyzed markers associated with aesthetic traits of Phalaenopsis orchids by using genome-wide association study (GWAS) with the F1 population P. Intermedia of 117 progenies derived from the cross between P. aphrodite and P. equestris. A total of 113,517 single nucleotide polymorphisms (SNPs) were identified in P. Intermedia by using genotyping-by-sequencing with the combination of two different restriction enzyme pairs, Hinp1 I/Hae III and Apek I/Hae III. The size-related traits from flowers were negatively related to the color-related traits. The 1191 SNPs from Hinp1 I/ Hae III and 23 simple sequence repeats were used to establish a high-density genetic map of 19 homolog groups for P. equestris. In addition, 10 quantitative trait loci were highly associated with four color-related traits on chromosomes 2, 5 and 9. According to the sequence within the linkage disequilibrium regions, 35 candidate genes were identified and related to anthocyanin biosynthesis. In conclusion, we performed marker-assisted gene identification of aesthetic traits with GWAS in Phalaenopsis orchids.
Similar content being viewed by others
Introduction
The Orchidaceae is one of the largest families in angiosperm, with 27,315 species. Phalaenopsis is the most popular orchid genus, with approximately 92 species as well as 35,129 hybrids recorded for the registration in the Royal Horticultural Society1. Both P. equestris and P. aphrodite are two model orchid plants used in academic studies and are major breeding parents in orchid nurseries. The whole-genome sequences of P. equestris2 and P. aphrodite3 have been published and are available in Orchidstra3 (http://orchidstra.abrc.sinica.edu.tw) and OrchidBase4 (http://orchidbase.itps.ncku.edu.tw) , respectively. The genetic information for P. equestris has been used in several functional genomics studies of flower morphogenesis, pigmentation patterning, floral fragrances, stress response, etc.5,6,7,8,9,10,11,12. However, marker-assisted gene isolation has rarely been used for functional characterization of agricultural traits for Phalaenopsis breeding.
Marker-assisted selection (MAS) involves using molecular markers to support desired phenotypic selections in crop development. Next-generation sequencing technologies aim to effectively identify single nucleotide polymorphism (SNP) markers from ultra-throughput sequences. The method has revolutionized plant genotyping in crops and plant breeding13,14. To broaden next-generation sequencing use to large–genome crops, genotyping-by-sequencing (GBS) has been established and used to sequence pooled samples that identify the molecular markers and for the genotyping15. So far, GBS has been effectively used in genome-wide association study (GWAS) because of its cost-effectiveness and as an ultimate MAS16. Other applications of GBS are equally important in plants, such as study of genetic diversity, genetic linkage investigation, molecular marker detection, and genomic selection in breeding programs14. By genotyping large-size populations, GBS is an outstanding method for plant breeding, even without the information on reference genome sequences; examples of plants investigated are rapeseed17, lettuce18, switchgrass13, soybean19, and maize20.
GWAS is a useful and powerful approach for identifying genetic variations that underlie many important and complex phenotypes, especially quantitative trait loci (QTL) controlled by multiple genes21. In cassava, a useful GBS pipeline has been established to discover SNPs both within and among the mapping population and varied African cassava varieties, which improved the MAS programs to increase the disease resistance ability and the nutrition concerns22. Recently, several studies focused on GWAS for aesthetic floral traits even though it is not a major field for most crops. These studies involved rose23, cultivated sunflower24, woody plant Prunus mume25, and chrysanthemum26 and identified SNP markers associated with flower color and floral shape.
For Phalaenopsis breeding, both P. equestris and P. aphrodite are popular parents; in total, 35,129 hybrids have been registered by the Royal Horticultural Society1. P. equestris has small flowers 2.5 ~ 3.8 cm in floral diameter and various flower colors, including red flowers with red lip, red flowers with orange lip, white flowers with white lip, white flowers with yellow lip, and light blue-purple flowers. P. aphrodite has white and medium-size flowers 6 cm in diameter; it is the major breeding parent to offer pure-white flowers. The cross between P. aphrodite and P. equestris resulted in the F1 progeny P. Intermedia, containing red, pink, to white flowers with small to medium sizes. Therefore, P. Intermedia provides a good population for investigating floral aesthetic traits with MAS and GWAS. In addition, high-quality and well-annotated genome sequences have been published for both parent materials2,3 and could be used as reference genomes for SNP calling. However, GWAS has not been well established in Phalaenopsis orchids.
In this study, by using the valuable P. Intermediate population, we first constructed a genetic map of P. equestris by using SNPs obtained from GBS and previously identified simple sequence repeat (SSR) markers. We revealed the relationships by combining the SNPs and floral aesthetic traits to identify QTL that contribute to 4 different color-related traits in Phalaenopsis. This is the first report of the genetic map and flower color-related QTL in Phalaenopsis orchids.
Results
Distribution of various agricultural traits
We recorded flower size and flower color-related phenotypes from the 117 F1 progenies of P. Intermedia. A total of 16 phenotypes were assessed, including 12 traits involved in flower size and 4 involved in color-related traits. The entire flower, sepal, petal and lip were measured for their width, length and area. The color-related traits included petal magenta area, petal magenta, lip magenta and lip yellow color. The flower size-related traits showed a normal distribution (Fig. 1a–l), but the flower color-related traits showed a complex distribution (Fig. 1–p). We assessed the relationship among all characteristics, and most of the flower size-related traits were positively correlated, with strong significant values, the strongest correlation (correlation coefficient = 0.96, p = 9.8584E−52) being between flower area and petal area (Supplementary Table S1, combination 1). In addition, most color-related traits were positively correlated with each other, with the highest correlation coefficient 0.89 (p = 5.6991E−32) between petal magenta and petal magenta area (Supplementary Table S1, combination 8). However, a low positive to negative correlation was found between size-related and color-related traits (Fig. 2; Supplementary Table S1).
Genetic map and distribution of SNPs in the Phalaenopsis genome
With GBS, the SNP numbers obtained by using different restriction enzyme combinations of Hinp1 I/Hae III and ApeK I/Hae III were 1,633 and 111,884 by Mi-seq and Hi-seq, respectively. A total of 1,191 SNPs from Hinp1 I along with 23 SSRs previously identified by Dr. Wen-Luan Wu’s lab27 were successfully used to construct a genetic map for the F1 population of P. Intermedia (Supplementary Table S2), which revealed 27 linkage groups (LGs) (Supplementary Table S3). With the assistance of 21,350 BAC-end sequences (BESs)27, the 27 LGs were further assembled into 19 homologous groups (HGs) (Supplementary Table S4; Fig. 3). The number of total markers in each HG ranged from 15 in HG 19 to 144 in HG 14. These markers spread to a total map length of 15,192.05 cM, with a physical distance of 875,501 bp and average distance between two SNPs of 721 kb (Supplementary Table S4).
A total of 113,517 SNPs from the two enzyme digestions were used in GWAS. The SNP number in each chromosome ranged from 24,121 for chromosome 1 to 1636 for chromosome 19, and the average distance between SNPs ranged from 16,266 bp in chromosome 2 to 2849 bp in chromosome 19. The average distance between SNPs in the whole genome was 8854 bp (Table 1). To identify the genomic structure in the F1 population, we used multidimensional scaling (MDS) analysis based on the total SNPs above and showed that the genomic structure should be assessed in the following GWAS analysis (Supplementary Fig. S1).
GWAS results for flower color-related traits and candidate genes
GWAS was used to identify the SNPs correlated with floral aesthetic traits. Ten SNPs were identified for color-related traits contributing to phenotype variations for lip yellow color (LipYel), lip magenta color (LipMag), petal magenta color (PetalMag) and petal magenta area (PetalMagArea) (Figs. 4, 5, 6, 7). Of note, seven SNPs were associated with only one color-related trait: S2_195281745 on chromosome 2 and S5_43813578 on chromosome 5 were associated with the PetalMagArea trait, and SNP S5-10776122, S5-23606179, and S5-24982925 on chromosome 5 and S9-14147427 on chromosome 9 were associated with the LipYel trait. In addition, SNP S5_45647571 on chromosome 5 was related to the PetalMag trait. The remaining three SNPs affected multiple traits. For example, SNP S5_45345022 located on chromosome 5 could explain the variation in PetalMagArea and PetalMag traits. The SNPs S5_43867748 and S5_46455357 on chromosome 5 contributed to the traits LipMag, PetalMagArea and PetalMag.
We calculated the linkage disequilibrium (LD) for this population based on SNPs. The LD decay was 50 Kb, with r2 ˂ 0.12 (Supplementary Fig. S2 across all chromosomes. The sequence within 50 Kb upstream and downstream of the significant SNPs was then extracted and screened for candidate genes by using a BLAST search against the NCBI nr database (https://blast.ncbi.nlm.nih.gov/Blast.cgi). Matched genes with total score > 150 were kept and narrowed down to 35 candidate genes based on the BLAST targeted species of P. equestris. Anthocyanin biosynthesis-related genes, such as MYB genes and flavanone 3-hydroxylase (F3H5), were identified within the LD region of the associated SNPs S2_195281745, S5_10776122, S5_43813578 and S5_45647571. Other genes including a MADS box protein (SUPPRESSOR OF OVEREXPRESSION OF CO 1, SOC1), ABC transporter B family, sugar transport protein, auxin-binding protein, tubby-like F-box protein and GEM-like protein were also identified within the LD regions related to the phenotype traits LipYel, PetalMagArea, LipMag and PetalMag (Table 2).
Discussion
Flower size-related traits are highly related
Flower size and color are important aesthetic traits for ornamental flowers. We estimated the correlation of each group of flower traits, such as size-related traits and color-related traits, and found significant positive correlations within each group. This finding make sense that the width and length of each flower organ affect the entire flower size. Previous study showed flower disc diameter positively correlated with disc area in sunflower24. However, we found significant negative correlations between size-related and color-related traits. Similarly, a negative relation was discovered between petal length and red accumulation in Camissoniopsis cheiranthifolia28. In addition, carotenoid content in cultivated sunflower was found negatively correlated with flower size traits24. These results suggest that the larger the size, the lighter the yellow/red color for the F1 population of P. Intermedia. However, more studies are needed to determine the control mechanism between flower size and color.
Genetic map for P. equestris
Several genetic maps are available for Orchidaceae, with different molecular markers, including Dendrobium, Vallina, and Phalaenopsis. In Dendrobium, 349 polymorphic loci are identified from the cross between Dendrobium officinale × D. aduncum with a total length of 1580.4 cM and 19 LGs, covering 71% of the genome29. In addition, specific locus-amplified fragment sequencing was used recently to build genetic maps: a genetic map with high density was developed by the cross of D. moniliforme and D. officinale, with a longer length map of 2737.49 cM and 19 LGs30. In vanilla, a genetic linkage map with total length of 1035.85 cM and 18 LGs was built based on 225 amplified fragment length polymorphism markers31. In Phalaenopsis, 2905 SNP markers from restriction site-associated DNA sequencing with the cross between P. aphrodite and P. modesta was used to build a genetic map with a total length of 3075.8 cM and 22 LGs, with 85% coverage of the P. aphrodite genome3. In this study, we used 1191 SNPs and 23 SSRs to successfully construct a genetic map of 19 HGs with length of 15,192.05 cM and 875,501 kb, which is equivalent to the chromosome number of P. equestris, that covered 75.5% of the genome region. With the assistance of abundant BESs, the linkage map was assembled into 19 homologous maps.
Flower color-related QTL
GWAS is a state-of-art study for identifying genomic loci associated with desired traits32. Understanding the numbers and locations of loci regulating a trait is important to resolve genetic architecture and is valuable to plan a successful breeding strategy. Therefore, combining a high-throughput genotyping technology and a precise phenotyping method provides precise results for the GWAS analysis. The GBS approach we used provides rapid, cheap, high-throughput, and reliable results for genotyping hundreds of individuals in one population33. We obtained 113,517 SNPs after combining two different GBS libraries (Hinp1 I/Hae III and ApeK 1/Hae III), with an average distance between 2 markers of 8854 bp/SNP, and these markers supported a great resolution for the GWAS. However, no SNPs were identified for the association with size-related traits. Increasing the population size with enlarged flower size variations may be needed for future GWAS.
Candidate genes of red color-related QTL
Ten SNPs showed significant associations with the 4 color-related traits, and 3 were associated with more than one trait (Table 2). Pleiotropy has been found a common phenomenon in floral trait QTL34. QTL with pleiotropy may contribute to similar phenotypes, including petal length, sepal length and stamen length, or affect the phenotype for different traits, such as flower length, scent and pigmentation34. In our study, the situation of pleiotropy showed that the SNPs S5_45345022, S5_43867748 and S5_46455357 contributed to PetalMagArea and PetalMag, affecting the color magenta and the color distribution by the same QTL.
The regulation of flower color is a complex network; many factors affect the performance. For instance, plant hormones35, sugar transportation36, environmental stress37, retrotransposon activation38, carotenoid and anthocyanidin biosynthesis pathway39, and regulatory genes such as the MYB family40 and MADS-box family41 are all involved in the flower color regulation network. We identified 35 candidates within 10 QTL for 4 different color-related traits after a BLAST search of the NCBI nr database; 26 candidate genes located within 6 QTL related to 3 different traits: PetalMagArea, PetalMag and LipMag (Table 2). These genes include MYB52-like gene and the MYB11 promoter region around the significant SNPs S2_195281745 and S5_43813578, respectively, correlated with the PetalMagArea trait. Different genes in the MYB family have the function of determining red color in floral tissue by regulating anthocyanin biosynthesis; examples are OgMYB1 in Oncidium42, GmMYB-G20-I in soybean43, and the MYB117 promoter in the hybrid poplar Populus tremula × tremuloides44. MYB108 regulates the color of petal, stigma, calyx and bud in the woody plant Prunus mume25, and R2R3-MYB from Phalaenopsis controls floral pigmentation patterning11. In addition, MADS-box genes are involved in the anthocyanin pathway in potato41,45 and Phalaenopsis orchid46. We identified the MADS-box family genes AGL65, MADS5, and MADS4 within the QTL of the SNPs S2_195281745, S5_45647571, and S5_45345022, respectively (Table 2), so they may not only regulate floral morphogenesis but also be involved in the anthocyanin biosynthesis pathway.
Sucrose activation36 and sucrose signaling pathways47 also affect anthocyanin accumulation. We identified a sugar transport protein 8-like gene within the LD region of the significant SNP S5_43813578. In addition, auxin, the important plant hormone for plant growth and development, has a role in anthocyanin modification in apple48,49. We found an auxin-binding protein ABP19a-like gene within the QTL S5_45345022 for PetalMag and PetalMagArea traits. Cytochrome P450 is required for fully activating the formation of flower color50,51. We identified a cytochrome P450 71A1-like gene associated with the SNP S5_43867748 for LipMag, PetalMag and PetalMagArea traits. In addition, 2 serine/threonine-protein kinases, RIPK-like and PBL23, were identified within the LD region associated with the SNP S5_43867748 for LipMag, PetalMag and PetalMagArea traits. This result is consistent with a previous study of rose, in which a serine/threonine-protein kinase, PBS1, was a candidate gene modifying anthocyanin content in petal23. Insertion of an HORT1 retrotransposon controlling flower color was confirmed recently38. In our study, a short fragment of HORT1 was found within the QTL S5_45647571.
Candidate genes of yellow color-related QTL
Nine candidate genes located within 4 QTL contributing to the LipYel trait included flavanone 3-hydroxlase (F3H) gene and sulfoquinovosyl transferase SQD-2 like gene (Table 2). A short fragment of F3H was found in the QTL S5_45647571, also associated with the PetalMag trait. Thus, F3H may be involved in regulating both red and yellow colors in Phalaenopsis flower. The role of F3H in the anthocyanin biosynthesis pathway has been investigated carefully, and it was found to have an important role regulating red color accumulation in flower37,39,52. For regulating yellow color, less expression of F3H was found in yellow fruit of Fragaria vesca as compared with red fruit53, and minor amino acid differences caused no anthocyanin accumulation in Gentiana lutea54. In addition, F3H loss by antisense suppression produced a yellow carnation55. The evidence from these previous studies indicates that F3H may be involved in red and yellow flower color regulation and support our findings in this study. These candidate genes within the QTL most likely participate in and support their roles for these color-related traits.
Overall, most candidate genes identified by GWAS were correlated with anthocyanin biosynthesis, and the rest of them were discovered for the first time to be involved in regulating flower color; examples are ammonium transporter 1, the ABC transporter B family, GEM-like protein, histone H2B.1-like, and digalactosyldiacylglycerol synthase 1. SOC1, a MADS-box gene, was identified within QTL associated with all color-related traits, with a high total score and significant E-value, so SOC1 may participate in the flower color regulation pathway. SOC1 functions to regulate flowering time and floral development56,57. Our report is the first to reveal its function related to flower color regulation, possibly by regulating very upstream signals. However, more experiments are needed to confirm this.
In conclusion, this study provides the first marker-assisted gene identification of important agricultural traits in Phalaenopsis with GWAS and found that the associated SNPs could be used as selection markers for breeding programs of Phalaenopsis orchids.
Materials and methods
Plant materials and phenotyping
A total of 117 individuals from the F1 generation P. Intermedia and their parents, P. aphrodite and P. equestris, were used for both genotyping and phenotyping in this study. We analyzed the aesthetic phenotypes of these plants with the flower size-related traits, including width, length, and area of flower, sepal, petal, and lip and flower color-related traits and the red area in petal, petal red, lip red, and lip yellow. The traits for flower colors were collected with their CMYK values, magenta and yellow corresponding to red and yellow colors, respectively, in plant flower color. All plants were grown under normal light and the temperature was controlled between 23 °C and 27 °C in the greenhouse at Taiwan Sugar Corp. (Tainan, Taiwan). The correlation among all traits were estimated by using Pearson correlation in R58. The parental materials P. aphrodite and P. equestris, and the P. Intermedia, offspring derived from the cross between them, are all commercially available. The authenticity of these materials was verified by Taiwan Sugar Corp. These materials have been deposited in the greenhouse at National Cheng Kung University. Experimental research and field studies of plants, including the collection of plant material, comply with relevant institutional, national, and international guidelines and legislation.
Two- enzyme GBS approach
DNA extraction involved using the Qiagen DNeasy 96 Plant kit (Cat. no. 69181, Qiagen) following the manufacturer’s protocol. For the two-enzyme GBS approach, HinpI 1 and ApeK1 were paired with HaeIII for library preparation following a previous protocol59 with minor modifications. Single-end sequencing of multiplex GBS libraries involved using Mi-seq (for HinpI 1 library) and Illumina HiSeq2000 (for ApeK I library) platforms. Illumina sequence read processing, mapping, and SNP calling involved using the TASSEL-GBS pipeline60. The raw data of the above libraries and SNPs were uploaded into NCBI (accession no. GSE176215).
Genetic map construction
To construct the basic genetic map, 23 SSRs that were polymorphic between the two parents and 1,191 SNPs from the HinpI 1 GBS library were collected. The OneMap61 package implemented in R58 was used to conduct recombination fractions and linkage phase estimation and construct a linkage map. The linkage groups (LGs) were identified by a 2-point test, then the genetic map was obtained. Any pairwise markers with logarithm of odds (LOD) score > 4.0 and recombination fraction < 0.50 were considered linked between any pairwise markers. The ordering algorithms was applied to each group. An exhaustive search was performed with the “compare” function for groups with < 6 markers to obtain the best order. The “order.seq” command was used for groups with > 6 markers. The best order was used as a structure for the following inclusion of new markers. Once these groups were obtained, the “try.seq” function was used to confirm markers linked over unlinked according to the initial procedure and that likely participated in the pre-ordered groups. The LGs with > 5 markers were reconstructed by using the “ripple” algorithm within a sliding window of 5 markers as the final step. Manual adjustment was used as required throughout the map-building process. In total, 27 LGs were sketched by using MAPCHART 2.3062. The constructed genetic maps then were rearranged according to SSRs and SNP markers that corresponded to 19 putative groups of scaffolds, which were grouped in parallel with the 21,350 BAC-end sequences (BESs) from pair-end sequencing. The homologous groups (HGs) were established from LGs that contained markers from the same group of scaffolds.
GWAS
The following analyses involved using TASSEL5.060. The SNP calling involved using Bowtie 2 in TASSEL5.0 with default settings. A kinship matrix and principal component analysis (PCA) was used with all SNPs. The mixed linear model (MLM) was used to identify the association between traits and SNPs:
where y indicates the phenotype observed and stored as a vector, β an unknown fixed effect shown as a vector as well, and genetic marker and population structure (Q) are counted here. “u” is a random additive genetic effect from multiple background QTL for individuals but is an unknown vector. X and Z are the known design matrices. “e” is an unobserved vector of random residual.
LD estimation
For LD estimation, SNPs with the fewest missing data (NA) among all samples were selected every 5000 bp, then the LD was estimated among each of the collected SNPs. The LD plot and LD estimation involved using R58. The estimated LD between any two loci was based on comparing the observed and expected frequency of one haplotype, as follows:
where PAB indicates the frequency of observed haplotype and PAPB the frequency of the expected haplotype from two loci.
Candidate gene identification
To predict the candidate genes around the significant associated SNPs, sequences of 50 Kb upstream and downstream around the most significant associated SNPs were extracted based on the LD decay region. The extracted sequences were uploaded to NCBI for BLAST analysis with default settings, and only genes with a total score > 150 and belonging to P. equestris were selected as candidates.
Data availability
The datasets generated in this study are available in NCBI with the accession number GSE176215.
References
OrchidWiz. OrchidWiz encyclopedia version X4.2 June 2018 Database. Ames. IA (2018).
Cai, J. et al. The genome sequence of the orchid Phalaenopsis equestris. Nat. Genet. 47, 65–72 (2015).
Chao, Y. T. et al. Chromosome-level assembly, genetic and physical mapping of Phalaenopsis aphrodite genome provides new insights into species adaptation and resources for orchid breeding. Plant Biotechnol. J. 16, 2027–2041 (2018).
Tsai, W. C. et al. OrchidBase 2.0: Comprehensive collection of Orchidaceae floral transcriptomes. Plant Cell Physiol. 54, e7–e7 (2013).
Tsai, W.-C., Kuoh, C.-S., Chuang, M.-H., Chen, W.-H. & Chen, H.-H. Four DEF-like MADS box genes displayed distinct floral morphogenetic roles in Phalaenopsis orchid. Plant Cell Physiol. 45, 831–844 (2004).
Tsai, W.-C. et al. PeMADS6, a GLOBOSA/PISTILLATA-like gene in Phalaenopsis equestris involved in petaloid formation, and correlated with flower longevity and ovary development. Plant Cell Physiol. 46, 1125–1139 (2005).
Hsiao, Y. Y. et al. A novel homodimeric geranyl diphosphate synthase from the orchid Phalaenopsis bellina lacking a DD (X)2–4D motif. Plant J. 55, 719–733 (2008).
Hsieh, M.-H. et al. Optimizing virus-induced gene silencing efficiency with Cymbidium mosaic virus in Phalaenopsis flower. Plant Sci. 201, 25–41 (2013).
Hsieh, M.-H. et al. Virus-induced gene silencing unravels multiple transcription factors involved in floral growth and development in Phalaenopsis orchids. J. Exp. Bot. 64, 3869–3884 (2013).
Pan, Z. J. et al. Flower development of Phalaenopsis orchid involves functionally divergent SEPALLATA-like genes. New Phytol. 202, 1024–1042 (2014).
Hsu, C.-C., Chen, Y.-Y., Tsai, W.-C., Chen, W.-H. & Chen, H.-H. Three R2R3-MYB transcription factors regulate distinct floral pigmentation patterning in Phalaenopsis spp. Plant Physiol. 168, 175–191 (2015).
Chuang, Y.-C., Hung, Y.-C., Tsai, W.-C., Chen, W.-H. & Chen, H.-H. PbbHLH4 regulates floral monoterpene biosynthesis in Phalaenopsis orchids. J. Exp. Bot. 69, 4363–4377 (2018).
Yang, H. et al. Application of next-generation sequencing for rapid marker development in molecular plant breeding: a case study on anthracnose disease resistance in Lupinus angustifolius L. BMC Genomics 13, 1–12 (2012).
He, J. et al. Genotyping-by-sequencing (GBS), an ultimate marker-assisted selection (MAS) tool to accelerate plant breeding. Front. Plant Sci. 5, 484 (2014).
Jaganathan, D. et al. Genotyping-by-sequencing based intra-specific genetic map refines a ‘‘QTL-hotspot” region for drought tolerance in chickpea. Mol. Genet. Genomics 290, 559–571 (2015).
Poland, J. A. & Rife, T. W. Genotyping-by-sequencing for plant breeding and genetics. Plant Genome 5, 92–102 (2012).
Bus, A., Hecht, J., Huettel, B., Reinhardt, R. & Stich, B. High-throughput polymorphism detection and genotyping in Brassica napus using next-generation RAD sequencing. BMC Genomics 13, 1–11 (2012).
Truong, H. T. et al. Sequence-based genotyping for marker discovery and co-dominant scoring in germplasm and populations. PLoS ONE 7, e37565 (2012).
Lu, F. et al. Switchgrass genomic diversity, ploidy, and evolution: Novel insights from a network-based SNP discovery protocol. PLoS Genet. 9, e1003215 (2013).
Su, C. et al. High density linkage map construction and mapping of yield trait QTLs in maize (Zea mays) using the genotyping-by-sequencing (GBS) technology. Front. Plant Sci. 8, 706 (2017).
Zhang, H. et al. Identification of QTLs for resistance to leaf spots in cultivated peanut (Arachis hypogaea L.) through GWAS analysis. Theor. Appl. Genet. 133, 2051–2061 (2020).
Prochnik, S. et al. The cassava genome: Current progress, future directions. Trop. Plant Biol. 5, 88–94 (2012).
Schulz, D. F. et al. Genome-wide association analysis of the anthocyanin and carotenoid contents of rose petals. Front. Plant Sci. 7, 1798 (2016).
Dowell, J. A. et al. Genome-wide association mapping of floral traits in cultivated sunflower (Helianthus annuus). J. Hered. 110, 275–286 (2019).
Zhang, Q. et al. The genetic architecture of floral traits in the woody plant Prunus mume. Nat. Commun. 9, 1–12 (2018).
Sumitomo, K. et al. Genome-wide association study overcomes the genome complexity in autohexaploid chrysanthemum and tags SNP markers onto the flower color genes. Sci. Rep. 9, 1–9 (2019).
Hsu, C.-C. et al. An overview of the Phalaenopsis orchid genome through BAC end sequence analysis. BMC Plant Biol. 11, 1–11 (2011).
Button, L., Villalobos, A. L., Dart, S. R. & Eckert, C. G. Reduced petal size and color associated with transitions from outcrossing to selfing in Camissoniopsis cheiranthifolia (Onagraceae). Int. J. Plant Sci. 173, 251–260 (2012).
Lu, J., Wang, S., Zhao, H., Liu, J. & Wang, H. Genetic linkage map of EST-SSR and SRAP markers in the endangered Chinese endemic herb Dendrobium (Orchidaceae). Genet. Mol. Res. 11, 4654–4667 (2012).
Lepers-Andrzejewski, S., Causse, S., Caromel, B., Wong, M. & Dron, M. Genetic linkage map and diversity analysis of Tahitian vanilla (Vanilla× tahitensis, Orchidaceae). Crop Sci. 52, 795–806 (2012).
Lu, J. et al. High-density genetic map construction and stem total polysaccharide content-related QTL exploration for Chinese endemic Dendrobium (Orchidaceae). Front. Plant Sci. 9, 398 (2018).
Korte, A. & Farlow, A. The advantages and limitations of trait analysis with GWAS: a review. Plant Methods 9, 1–9 (2013).
Sonah, H. et al. An improved genotyping by sequencing (GBS) approach offering increased versatility and efficiency of SNP discovery and genotyping. PLoS ONE 8, e54603 (2013).
Smith, S. D. Pleiotropy and the evolution of floral integration. New Phytol. 209, 80–85 (2016).
Li, L. et al. Co-regulation of auxin and cytokinin in anthocyanin accumulation during natural development of purple wheat grains. J. Plant Growth Regul. 182, 1–13 (2020).
Hara, M., Oki, K., Hoshino, K. & Kuboi, T. Enhancement of anthocyanin biosynthesis by sugar in radish (Raphanus sativus) hypocotyl. Plant Sci. 164, 259–265 (2003).
Winkel-Shirley, B. Biosynthesis of flavonoids and effects of stress. Curr. Opin. Plant Biol. 5, 218–223 (2002).
Hsu, C.-C., Su, C.-J., Jeng, M.-F., Chen, W.-H. & Chen, H.-H. A HORT1 retrotransposon insertion in the PeMYB11 promoter causes harlequin/black flowers in Phalaenopsis orchids. Plant Physiol. 180, 1535–1548 (2019).
Ohmiya, A. Molecular mechanisms underlying the diverse array of petal colors in chrysanthemum flowers. Breed. Sci. 2018, 17075 (2018).
Takahashi, R., Yamagishi, N. & Yoshikawa, N. A MYB transcription factor controls flower color in soybean. J. Hered. 104, 149–153 (2013).
Liu, F., Yang, Y., Gao, J., Ma, C. & Bi, Y. A comparative transcriptome analysis of a wild purple potato and its red mutant provides insight into the mechanism of anthocyanin transformation. PLoS ONE 13, e0191406 (2018).
Chiou, C.-Y. & Yeh, K.-W. Differential expression of MYB gene (OgMYB1) determines color patterning in floral tissue of Oncidium Gower Ramsey. Plant Mol. Biol. 66, 379–388 (2008).
Takahashi, R., Benitez, E. R., Oyoo, M. E., Khan, N. A. & Komatsu, S. Nonsense mutation of an MYB transcription factor is associated with purple-blue flower color in soybean. J. Hered. 102, 458–463 (2011).
Ma, D. et al. Poplar MYB117 promotes anthocyanin synthesis and enhances flavonoid B-ring hydroxylation by upregulating the flavonoid 3’, 5’-hydroxylase gene. J. Exp. Bot. 72, 3864–3880 (2021).
Lalusin, A. G., Nishita, K., Kim, S.-H., Ohta, M. & Fujimura, T. A new MADS-box gene (IbMADS10) from sweet potato (Ipomoea batatas (L.) Lam) is involved in the accumulation of anthocyanin. Mol. Genet. Genomics 275, 44–54 (2006).
Hsu, H.-F. et al. Multifunctional evolution of B and AGL6 MADS box genes in orchids. Nat. Commun. 12, 1–12 (2021).
Van den Ende, W. & El-Esawe, S. K. Sucrose signaling pathways leading to fructan and anthocyanin accumulation: a dual function in abiotic and biotic stress responses?. Environ. Exp. Bot. 108, 4–13 (2014).
Ji, X.-H., Zhang, R., Wang, N., Yang, L. & Chen, X.-S. Transcriptome profiling reveals auxin suppressed anthocyanin biosynthesis in red-fleshed apple callus (Malus sieversii f. niedzwetzkyana). Plant Cell Tissue Organ Cult. 123, 389–404 (2015).
Wang, Y.-C. et al. Auxin regulates anthocyanin biosynthesis through the Aux/IAA–ARF signaling pathway in apple. Hort. Res. 5, 1–11 (2018).
de Vetten, N. et al. A cytochrome b5 is required for full activity of flavonoid 3′, 5′-hydroxylase, a cytochrome P450 involved in the formation of blue flower colors. Proc. Natl. Acad. Sci. 96, 778–783 (1999).
Tanaka, Y. & Brugliera, F. Flower colour and cytochromes P450. Philos. Trans. R. Soc. B Biol. Sci. 368, 20120432 (2013).
Zhao, D. & Tao, J. Recent advances on the development and regulation of flower color in ornamental plants. Front. Plant Sci. 6, 261 (2015).
Zhang, Y. et al. Transcript quantification by RNA-Seq reveals differentially expressed genes in the red and yellow fruits of Fragaria vesca. PLoS ONE 10, e0144356 (2015).
Berman, J. et al. Red anthocyanins and yellow carotenoids form the color of orange-flower gentian (Gentiana lutea L. var. aurantiaca). PLoS ONE 11, e0162410 (2016).
Zuker, A. et al. Modification of flower color and fragrance by antisense suppression of the flavanone 3-hydroxylase gene. Mol. Breed. 9, 33–41 (2002).
Moon, J. et al. The SOC1 MADS-box gene integrates vernalization and gibberellin signals for flowering in Arabidopsis. Plant J. 35, 613–623 (2003).
Lee, J. & Lee, I. Regulation and function of SOC1, a flowering pathway integrator. J. Exp. Bot. 61, 2247–2254 (2010).
R Core Team. R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. URL http://www.R-project.org/ (2013).
Elshire, R. J. et al. A robust, simple genotyping-by-sequencing (GBS) approach for high diversity species. PLoS ONE 6, e19379 (2011).
Bradbury, P. J. et al. TASSEL: Software for association mapping of complex traits in diverse samples. Bioinform. 23, 2633–2635 (2007).
Margarido, G. R., Souza, A. P. & Garcia, A. A. OneMap: Software for genetic mapping in outcrossing species. Hereditas 144, 78–79 (2007).
Voorrips, R. MapChart: Software for the graphical presentation of linkage maps and QTLs. J. Hered. 93, 77–78 (2002).
Acknowledgements
The authors thank the Taiwan Sugar Crop. for assistance in taking care of the F1 population of P. Intermedia from the cross between P. aphrodite and P. equestris.
Funding
The funding was provided by Ministry of Science and Technology, Taiwan, MOST 108-2313-B-006-006-MY3.
Author information
Authors and Affiliations
Contributions
C.C.H. and S.Y.C. performed the experiments and wrote the draft of the original manuscript. S.Y.C. assisted SNP analysis, C.Y.L. helped on the statistical analysis, P.H.L. supported the phenotyping, T.S. and A.H.P. participated in the GWAS analysis, W.L.W. provide the SSRs for linkage analysis, W.H.C. contributed to the population crossing, H.H.C. assisted in the experimental design, interpreted the data and finished the final draft of the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Hsu, CC., Chen, SY., Chiu, SY. et al. High-density genetic map and genome-wide association studies of aesthetic traits in Phalaenopsis orchids. Sci Rep 12, 3346 (2022). https://doi.org/10.1038/s41598-022-07318-w
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-022-07318-w
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.