Identification of basic/helix-loop-helix transcription factors reveals candidate genes involved in anthocyanin biosynthesis from the strawberry white-flesh mutant

As the second largest transcription factor family in plant, the basic helix-loop-helix (bHLH) transcription factor family, characterized by the conserved bHLH domain, plays a central regulatory role in many biological process. However, the bHLH transcription factor family of strawberry has not been systematically identified, especially for the anthocyanin biosynthesis. Here, we identified a total of 113 bHLH transcription factors and described their chromosomal distribution and bioinformatics for the diploid woodland strawberry Fragaria vesca. In addition, transcription profiles of 113 orthologous bHLH genes from various tissues were analyzed for the cultivar ‘Benihoppe’, its white-flesh mutant ‘Xiaobai’, and the ‘Snow Princess’ from their fruit development to the ripening, as well as those under either the ABA or Eth treatment. Both the RT-PCR and qRT-PCR results show that seven selected FabHLH genes (FabHLH17, FabHLH25, FabHLH27, FabHLH29, FabHLH40, FabHLH80, FabHLH98) are responsive to the fruit anthocyanin biosynthesis and hormone signaling according to transcript profiles where three color modes are observed for strawberry’s fruit skin and flesh. Further, prediction for the protein interaction network reveals that four bHLHs (FabHLH25, FabHLH29, FabHLH80, FabHLH98) are involved in the fruit anthocyanin biosynthesis and hormone signaling transduction. These bioinformatics and expression profiles provide a good basis for a further investigation of strawberry bHLH genes.

Strawberry (Fragaria × ananassa Duch.) is well recognized universally as a delicious and healthy food 30 . In recent years, white strawberry is more and more favored by consumers, such as 'Xiaobai' 31 , 'Snow Princess' and 'Tokun' varieties. As a result, numerous researchers have been casting their eyes on the fruit ripening, ABA (abscisic acid) signaling pathway [32][33][34][35][36] and anthocyanin biosynthesis 37,38 . Roles of MYB transcription factors have been highlighted in the anthocyanin biosynthesis 20,37,38 , while very few reports on bHLH transcription factors have been made [38][39][40] and they are mostly limited to the single bHLH. For example, anthocyanin biosynthesis is essentially regulated by the FvDFR (F. vesca DFR, dihydroflavonol 4-reductase) and FvUFGT (F. vesca UFGT, 3-O-glucosyltransferase), which can be activated by FvbHLH33 (F. vesca bHLH33) with the co-expression of FvMYB10 (F. vesca MYB10) 39 . Moreover, FabHLH3 (F. ananassa bHLH3) and FabHLH3∆ (encode putative negative regulator), by interacting with the four MYBs, are found to be involved in the proanthocyanidins biosynthesis for strawberry 38 . In order to systematically explore the molecular basis of bHLH from all of FvbHLHs involved in the anthocyanin biosynthesis and hormone response pathway, we will first analyze the bioinformation of 113 bHLH genes for the diploid woodland strawberry, F. vesca, and reveal their structure, evolution and function. Furthermore, we will study the transcript profiles of FabHLH genes from various tissues for the cultivar 'Benihoppe' , its white flesh mutant 'Xiaobai' , and the 'Snow Princess' from their fruit development to the ripening period, as well as those under either the ABA or Eth (ethephon) treatment. We finally discover that seven FabHLHs are crucial to the anthocyanin biosynthesis and fruit ripening for the strawberry fruit. We hope that this work will serve as a solid foundation for further investigations into functions of bHLH genes for the anthocyanin biosynthesis.

Identification and annotation of bHLH transcription factors in strawberry.
To identify bHLH transcription factors for F. vesca, a total of 166 bHLH members for strawberry via the BLAST-P (Basic Local Alignment Search Tool) search in the database of NCBI (National Center for Biotechnology Information) were obtained by comparing with the 112 strawberry bHLH amino acid sequences from the Plant Transcription Factor Database for the diploid woodland strawberry accession Hawaii-4 genome. Subsequently, to verify the reliability of the selection, a survey was conducted to confirm the presence of the conserved bHLH domain in protein sequences using the online CDD (Conserved Domains Database), SMART (Simple Modular Architecture research tool), and InterProScan database. The unique hits are kept, and duplications and similar DNA or protein sequences (with several bases different) are ruled out with only one of them left 10 . For example, there are four alternative variants for the sequence of FvbHLH64, only the longest variant is kept for the further analysis. In the end, 113 out of the 166 FvbHLH members are eventually selected (Table 1) out, forming the bHLH family for strawberry. The first 107 genes are renamed from FvbHLH1 to FvbHLH107 according to their distributions on the chromosome 1-7 from NCBI database 8,41 (Table 1; Fig. 1). In particular, the left 6 on unknown chromosome are renamed from FvbHLH108 to FvbHLH113 by their position value from the minimum to the maximum( Table 1). The acquired 113 bHLH genes will be further used to study their bioinformation and biofunction, specially for the anthocyanin biosynthesis.

Phylogenetic analysis and multiple sequence alignments of the strawberry FvbHLH proteins.
Reflecting on the past researches, the exact number of the classified subfamily for bHLH proteins has barely been reported 8 . To investigate the classification and evolution as well as to gain insights into the potential function of FvbHLH proteins for strawberry, we constructed a phylogenetic tree (Fig. 2) for the 113 FvbHLHs from F. vesca and 158 AtbHLHs from Arabidopsis. 26 of bHLH subfamilies are further classified according to the nomenclature protocol proposed by Heim et al. 5 , with some modifications. For example, I(a + b) is divided into Ia and Ib, and IIIa and IIIc are combined into III(a + c); bHLHs that are not located in any of the 24 subfamilies are classified as "orphans" (Fig. 2). We find that FvbHLH protein is persistently present in all subfamilies and the number of it varies hugely from subfamily to subfamily. For instance, each of the smallest group II, IVd, XIII and XIV contains one FvbHLH gene, while the largest clade group XII contains twelve. Consequently, the classification of bHLH genes provides an evidence for relationships among genes during their evolution. two helix regions and one loop region. We find that residues of His-2, Glu-6, Arg-7, Arg-9, Arg-10, Leu-20, Leu-23, Leu-36, Leu-46, etc., in the bHLH domain are conserved, implying that the amino acid residues may play an important role in strawberry's evolution. In addition, we notice that the basic region of the bHLH domain can bind to DNA and it is critical to the gene biofunction 4 . It also has been known that both Glu-6 and Arg-9 in basic region of bHLH domain play important roles in the DNA binding 4,9,13 and recognition of G-box and E-box (binding mode). As a result, we divided the FvbHLH binding into three modes: G-box (with the presence of His/ Lys-2, Glu-6 and Arg-9), E-box (with the presence of Glu-6 and Arg-9) and non-E-box (without the simultaneous presence of Glu-6 and Arg-9) binding 42 . As is demonstrated in Fig. S1, FvbHLH proteins are divided into three types: 57 for the G-box-binding, 25 for the E-box-binding, 31 for the non-E-box-binding.
Gene structure and conserved motif analysis of FvbHLH genes. Gene structure and conserved motif analysis of Arabidopsis and strawberry bHLH were performed to acquire more information about gene families 5 .
By scanning all aspects of gene structure and conserved motif, genes within each subfamily are discovered to contain a similar number of intron and conserved motif, while the number of them is strikingly different on genes from different subfamily (Figs S2; S3), in consistent with the previous bootstrap analysis [43][44][45] . For instance, each gene from III(d + e) subfamily contains one exon except for FvbHLH97 and AtbHLH14 genes. In sharp contrast to this, 77.8% of bHLH genes from Ia subfamily contain three exons and two introns.
It has been pointed out that part of motifs, acting as activation domain, are important for the interaction with other modules of the transcription complex, and are the targets of signal transduction chains 5,10 . It might be inspiring to see how the motif structure is related to the gene classification. Thus, we searched 24 conserved motifs by MEME (Multiple Expectation Maximization for Motif Elicitation) program to obtain their distributions on bHLH sequences (Figs S2; S3). As is shown in Fig. S2, the bHLH proteins identified from the same subfamily share similar conserved motif. For example, motif 21 is exclusively located in all members from the XIII subfamily, whereas all bHLH sequences from IVc subfamily contain motif 1, motif 2, motif 10 and motif 15 at the C-terminal region. As bHLH is composed of motif 1 and motif 2, both of which are consistently identified in all strawberry and Arabidopsis bHLH proteins (Figs S2; S3). Hence, the classification of 26 subfamilies is thus further supported by the gene structure and motif analysis. Transcript patterns of FabHLH genes during the fruit development and ripening for the whiteflesh mutant strawberry. In order to identify bHLH genes involved in the color formation of strawberry fruit, three cultivated strawberry varieties were used in this study: Benihoppe, Xiaobai and Snow Princess. Colors of both the fruit flesh and skin of 'Benihoppe' are red. As the mutant of 'Benihoppe' , 'Xiaobai' carries on the white or yellow color for its flesh with its fruit skin red or pink 31 . White is found for the color of 'Snow Princess' fruit flesh and skin (Fig. 4A). Additionally, strawberry fruit development and ripening are divided into seven stages: S1 small green fruit, S2 middle green fruit, S3 large green fruit, S4 white fruit, S5 initial red, S6 partial red, S7 full red (Fig. 4B). Because of the strong correlation between the gene expression pattern with its function, transcript patterns of 113 FabHLH genes for the color formation during the fruit development and ripening stages for the three varieties are tracked and summarized in Fig. 5B, in which the synthesis of anthocyanin is recorded from the turning stage to the red stage 28 . To examine the transcript of FvbHLH genes involved in the anthocyanin biosynthesis, both the RT-PCR (semi-quantitative reverse-transcription PCR) and qRT-PCR (quantitative RT-PCR) techniques are adopted to analyze genes' expression level. Figure 5B reveals that the number of up-regulated expression of FabHLH genes from 'Benihoppe' is 71 during the fruit ripening and this number from the 'Snow Princess' and 'Xiaobai' continuously falls down to 45 and 24, respectively. Depending on the consistency between the expression level of the up-regulated genes and anthocyanin content (Fig. 4B), 7 FabHLH genes are chosen out of the 113 genes to further investigate the possible expression patterns of bHLHs involved in the anthocyanin biosynthesis (Fig. 5): FabHLH17, FabHLH25,  FabHLH27, FabHLH29, FabHLH40, FabHLH80, FabHLH98. In the following will be reported three relevant gene expression patterns: First, we will focus on the FabHLH25. Its expression is significantly up-regulated during all stages for 'Benihoppe' fruit, in accordance with its color of fruit skin and flesh, indicating that FabHLH25 promotes the anthocyanin biosynthesis for 'Benihoppe'; for 'Xiaobai' fruit, it is up-regulated at S2 stage and subsequently down-regulated at S5 stage, in discordance with the color of fruit skin while coinciding with the color of fruit flesh, suggesting that FabHLH25 is not relevant to the anthocyanin biosynthesis for 'Xiaobai'; however, the expression of FabHLH25 is always down-regulated in the whole life for 'Snow Princess' fruit, agreeing well    Asterisk symbol corresponds to each column above, which stands for the percentage of presence of amino acids at each site and the color of the asterisk symbol corresponds bHLH regions from the top insert. The analysis of the amino acids composition at each site marked by the asterisk indicates that the conservation of conserved amino acids is over 50%. with the color of fruit skin and flesh, implying that FabHLH25 is barely related to the anthocyanin biosynthesis for 'Snow Princess' . As a consequence, expression level of FabHLH25 shows significant difference between 'Benihoppe' and 'Xiaobai' , and no observable difference between 'Xiaobai' and 'Snow Princess' is found from S4 to S7. This result implies that the FabHLH25 might be involved in the anthocyanin biosynthesis for the fruit flesh. Second, we will turn to FabHLH27 gene. Its expression is up-regulated during the overall stages for both the 'Benihoppe' and 'Xiaobai' fruits. This mode coincide with the color of fruit skin for 'Benihoppe' and 'Xiaobai' and the color of fruit flesh for 'Benihoppe' , and is inconsistent with the color of fruit flesh for 'Xiaobai' . The consistency here indicates that FabHLH27 promotes the anthocyanin biosynthesis for both the 'Benihoppe' and 'Xiaobai' . Nevertheless, FabHLH27 gene's expression is always down-regulated for 'Snow Princess' fruit, in perfect agreement with the color of fruit skin and flesh for 'Snow Princess' , implying that FabHLH27 is not in charge of the anthocyanin biosynthesis for 'Snow Princess' . In brief, expression level of FabHLH27 shows significant difference among 'Benihoppe' , 'Xiaobai' and 'Snow Princess' from S4 to S7. This feature signifies that the FabHLH27 could promote the anthocyanin biosynthesis for the fruit skin. Third, we will cast our eyes on the FabHLH80 gene. Its expression is constantly down-regulated for 'Benihoppe' fruit, in good accordance with the color of fruit skin and flesh for 'Benihoppe' , suggesting that FabHLH80 is not involved in the anthocyanin biosynthesis for 'Benihoppe' . FabHLH80 gene's expression is up-regulated at S2 stage and subsequently down-regulated at S5 stage for 'Xiaobai' fruit, going inversely with the color of fruit skin and flesh for 'Xiaobai' , indicating that FabHLH80 does not promote the anthocyanin biosynthesis for 'Xiaobai'; nevertheless, FabHLH80 becomes down-regulated at S2 stage and up-regulated at S4 stage for 'Snow Princess' fruit, in good accordance with the color of fruit skin and flesh for 'Snow Princess' , implying that FabHLH80 does not promote the anthocyanin biosynthesis for 'Snow Princess' either. As a short summarize, expression level of FabHLH80 shows significant difference from S4 to S7 for three varieties. Such a mode leads us to the conclusion that the FabHLH80 may inhibit the anthocyanin biosynthesis. Based on those observations and our more extensive data on expression patterns of the 7 previously selected bHLH genes, it is shown that they are indeed related to the anthocyanin biosynthesis.

Transcript patterns of
Transcript patterns of the FabHLHs genes' response to hormone treatment. Regarding 25 genes are discovered to be simultaneously responsive for the three varieties. For example, the expression level of FabHLH29 from IIIf subfamily strikingly increases at the initial stage (0.5 hpt (hour post treatment) to 2 hpt) and maintains a high value afterwards in response to ABA treatment for 'Xiaobai' and 'Snow Princess' , while it decreases thoroughly under the ABA treatment for 'Benihoppe' . When subjected to the Eth, FabHLH29 expresses highly for 'Benihoppe' and keeps relatively low yet higher than the control for both 'Xiaobai' and 'Snow Princess' . In addition, expression level of FabHLH98 from IIIf subfamily is invariably high for the three varieties under both treatments compared with the control: the increase of it is significantly induced at early stages (0.5 hpt to 2 hpt), and it reaches the peak at later stages (4 hpt to 9 hpt) in response to the ABA treatment for 'Benihoppe' and 'Xiaobai' . However, it is induced and starts to reach its maximum from 6 hpt to 9 hpt in response to ABA treatment for 'Snow Princess'; under the treatment of Eth, FabHLH98 's expression is induced and begins to reach the peak at later stages (4 hpt to 12 hpt) for the three varieties. Besides, bHLH genes from III(d + e) subfamily are realized to be responsive to both treatments for the three varieties as well. This finding demonstrates that subfamilies of III(d + e) and IIIf might be involved in the fruit ripening and plant response to abiotic stress.
Network interaction analysis of FabHLHs response to anthocyanin biosynthesis and hormone stress. The above results argue that 7 FabHLH genes are highly possible to be involved in the anthocyanin biosynthesis and hormone response pathway for strawberry as a result of the interaction between bHLH and other proteins. Network interaction analysis has been recently demonstrated to be a powerful method to study the gene function. Online software of STRING 10 is used to reconstruct the interaction network of the 7 FvbHLH based on the orthologous gene of Arabidopsis. Only 4 bHLHs (FvbHLH25, FvbHLH29, FvbHLH80, and FvbHLH98) are proved to be able to predict the interacting proteins ( Fig. 8; Table S2). According to the database of STRING 10, they are involved in the control of flavonoid pigmentation, epidermal cell fate specification and regulation of ABA-inducible genes under drought stress conditions. As is shown in Fig. 8; Table S2, FvbHLH25 (homologous to AT4G1640 for Arabidopsis) can be associated with MYB113, which could combine with several bHLH proteins in the anthocyanin biosynthesis 48 . Besides, FvbHLH25 also interacts with JAZ5 (JASMONATE ZIM-Domain 5) and JAZ6, which are the repressor of jasmonate response. FvbHLH29 (homologous to TT8 for Arabidopsis) can interact with MYB75, which promotes the synthesis of anthocyanin biosynthesis by activating the expression of DFR (dihydroflavonol-4-reductase) such that it is eventually involved in the control of flavonoid pigmentation. Moreover, FvbHLH80 (homologous to MYC2 for Arabidopsis) could react with MYB2 in the regulation of ABA-induced genes under drought stress conditions, as well as with MYC3 and MYC4 in the control of subsets of JA-dependent responses. In addition, FvbHLH98 (homologous to EGL3 for Arabidopsis) participates in the anthocyanin accumulation in Arabidopsis 1,48,49 and tomato 21 . These results show that 4 FvbHLHs are involved in the fruit ripening and hormone response pathway 25,34,38,47,50 .

Discussion
With the functionality being the transcription, bHLH family are involved in the regulatory process of fruit ripening, hormone signaling and abiotic stress 12 . In the past few decades, features and functions of the bHLH gene family have been identified and investigated for several plant species 3,8,12 . Though as one of the most important horticultural crops grown worldwide providing ingredient for processed foods like jams and juices, strawberry has been barely studied for its bHLH family, who participates in the anthocyanin biosynthesis in the fruit SCIeNTIFIC RepoRts | (2018) 8:2721 | DOI:10.1038/s41598-018-21136-z ripening. Very few bHLHs have been investigated for the strawberry, such as FabHLH3 38 , FaSPT (spatula) 40 and FvbHLH33 39 . In the present study, we first identified a total of 113 bHLH genes based on the F. vesca genome (Table 1 and Fig. 1), and further implemented their bioinformation analysis (Figs 2; 3; S2) followed by the expression pattern classification during the fruit ripening under hormone treatments for three varieties (Figs 5; 6; 7).
With the rapid development of bioinformation analysis, the information stored in various genomes can be decoded to elucidate mechanisms that regulate fruit ripening and response to abiotic stress 4 . We firstly identified 113 unique bHLH proteins using the conserved motif of bHLH by filtering candidate genes according to the criteria described by Sun et al. 3 . Next, based on the phylogenetic analysis of FvbHLH, the selected FvbHLHs were classified into 26 subfamilies (Fig. 3) with the methodology similar to the classification of Arabidopsis (26 subfamilies), tomato (26 subfamilies) and Chinese cabbage (26 subfamilies) [2][3][4]13 . Moreover, the analysis of motif and gene structure is performed to gain evidence to support phylogenetic relationship for gene families.
Most bHLH proteins identified so far are mostly functionally characterized for Arabidopsis and tomato, with the revealing of their effects on the regulation of plant development, fruit ripening, anthocyanin biosynthesis and hormone signaling responses 6,16 . Those results prove that transcript pattern of a gene is closely related to its function, based on which we designed to examine the expression patterns of 113 FvbHLH genes from tissues, at fruit ripening stage, as well as those under the treatment of hormone (Figs 5; 6; 7). We discover that the expression patterns for the 78 out of the 113 genes from various tissues for the three varieties are similar to each other. To Figure 6. Transcript accumulation patterns of 113 bHLH genes for the three strawberry varieties under hormone stress (ABA and Eth). FvActin, FvRib413 and FvGAPDH2 were used as an internal control. The transcript accumulation profiles were generated by semi-quantitative PCR and were visualized as heat maps. The color scale represents the relative transcript level with increased (red) and decreased (green) transcript abundance. The FvbHLH genes marked by red asterisk indicate their candidacy in the anthocyanin biosynthesis.
comprehensively understand the role of bHLH genes on the anthocyanin biosynthesis, RT-PCR and qRT-PCR analyses for the three varieties with different fruit flesh and skin colors were performed (Figs 4; 5B; 6; 7). 7 FabHLHs are found to be highly responsive for the anthocyanin biosynthesis depending on their different expression levels : FabHLH17, FabHLH25, FabHLH27, FabHLH29, FabHLH40, FabHLH80, FabHLH98. For example, the expression level of FabHLH27 is high for both 'Benihoppe' and 'Xiaobai' (red or pink skin) at the later stages (S5 → S7), while it stays low for 'Snow Princess' (white skin) at the similar stage S5. This implies that this gene is involved in the anthocyanin biosynthesis of fruit skin. It has been reported that IIIf subfamily matters for the fruit color formation. Hereby, we focus on the 2 out of the 7 candidate FabHLHs that fall into the IIIf subfamily: FabHLH29 and FabHLH98. We found that FabHLH29 is relevant to the anthocyanin biosynthesis according to its expression pattern during the fruit ripening for the three varieties. Besides, gene sequence of FabHLH29 is highly similar to that of AtTT8 (AtbHLH42), which has been reported to be involved in anthocyanin biosynthesis 1,6,15 . Moreover, the FabHLH29 also is responsive to both the ABA and Eth treatments, thought with certain difference (down-regulated for 'Benihoppe' under ABA treatment, up-regulated for rest cases), for the three varieties. More evidence for the involvement FabHLH29 in the anthocyanin biosynthesis comes from the interaction network. Proteins (F3H (Flavanone 3-hydyroxylase), DFR, TTG1 and MYB), located in the pathway of anthocyanin biosynthesis, are predicated to interact with FabHLH29 (AtTT8) (Fig. 8). Researchers have realized that the TT8 from subfamily IIIf is active in regulating the synthesis of anthocyanin and proanthocyanidin for Arabidopsis 1,6,50,51 by forming a stabilized MBW complex with TT2 and TTG1, and it is involved in the anthocyanin biosynthesis for rice as well 22 . We also find that the expression pattern of FabHLH98 (homologous to EGL3) shows no significant difference during the fruit ripening for 'Benihoppe' , 'Xiaobai' , and 'Snow Princess' , which denies the participation of FabHLH98 in the anthocyanin biosynthesis. However, FabHLH98 is responsive to the abiotic stress with the implement of ABA and Eth, which seems to suggest its involvement in the fruit ripening. What's more, analysis of interaction network of FabHLH98 demonstrates that it also plays a role in the activation of anthocyanin biosynthesis, possibly with MYB75/PAP1, inconsistent with previous results from the analysis of expression pattern during the fruit ripening in this study, yet in good agreement with the precursor reports 1,6,27 . In brief, expression pattern analysis under hormone treatments fits well with results from the interaction network investigation for the three varieties. However, both are inconsistent with expression pattern results during the fruit ripening. Consequently, FabHLH98 is selected as the candidate gene for the study of anthocyanin biosynthesis and a further study on its precise role is still in demand.
Previous papers inform that genes from bHLH subfamily III(d + e) take part in JA signal pathway, resulting into the regulation of plant defense during developmental process for Arabidopsis 23,25,26 and the promotion of anthocyanin biosynthesis 24,27 for apple. Moreover, the function of bHLH subfamily IIId, including bHLH3, can negatively regulate JA-mediated plant defence and development 13 , while the function of bHLH subfamily IIIe can activate JA-induced leaf senescence 25 . In addition, as a repressor in the JA signaling pathway, MdJAZ can be phosphorylated by MdSnRK1.1 (Snf1-Related protein Kinases) to facilitate its 26S proteasome-mediated degradation, releasing MdbHLH3 which will bind to promoters of the anthocyanin biosynthesis genes MdDFR and MdUFGT, thus finally promotes the biosynthesis of anthocyanin and proanthocyanidin 24,27 . In our experiments, we find that FabHLH25 from III(d + e) subfamily might be correlated with the anthocyanin biosynthesis of fruit flesh (Figs 5B, 7) from the analysis of the expression pattern for the three varieties during their ripening. Moreover, the FabHLH25 (homologous to AT4G16430, FabHLH3 and MdbHLH3) protein strongly interact with MYB113, JAZ5 and JAZ6 proteins (Fig. 8) according to results from interaction network analysis, in consistent with the known knowledge that FabHLH25 is able to interact with MYB and form the MBW complex to regulate the expression of genes involved in the proanthocyanidin biosynthesis 38 . What's more, it has been mentioned that MdMYC2 positively regulates anthocyanin biosynthesis by modulating the expression of positive regulators in JA signaling (MdMYB1, MdbHLH3, MdbHLH33) for the apple 52 . From our observation, the transcript pattern and interaction network analysis evidence that the FabHLH80 (homologous to MYC2) from III(d + e) subfamily might also be present in the anthocyanin biosynthesis. Therefore, our research hereby paves the way for further studies and understandings of bHLH genes function in the fruit ripening and anthocyanin biosynthesis for strawberry.
In conclusion, the first comprehensive and systematic analysis of strawberry bHLH transcription factors is performed. First, 113 bHLH transcription factors from the entire strawberry genomes are identified as candidate genes responsible for the anthocyanin biosynthesis and further renamed based on their chromosome distribution. Next, the selected genes are divided to 26 subfamilies according to phylogenetic analyses, gene structures and protein motifs. Third, expression patterns of 113 FabHLHs obtained during fruit development and ripening, as well as those under either the ABA or Eth treatment, suggest that seven FabHLHs (FabHLH17, FabHLH25, FabHLH27,  FabHLH29, FabHLH40, FabHLH80, FabHLH98) are involved in the anthocyanin biosynthesis of strawberry fruit. Finally, results of interaction network analyses of the four FabHLH genes (FabHLH25, FabHLH29, FabHLH80, FabHLH98) reveal that bHLHs proteins might participate in the anthocyanin biosynthesis during the fruit ripening and in the hormone response pathway. This study will provide an insight into a further understanding of functions of bHLH members in the color formation for fruits.

Materials and Methods
Identification of bHLH transcription factors for strawberry. To 53,54 and InterProScan program (http://www.ebi. ac.uk/inter-pro/search/sequence-search) to confirm their completeness and the presence of bHLH domain. Details about the bHLH sequences, such as length of amino acid sequences, theoretical molecular weights (Mw) and isoelectric point (pI), were obtained from ExPASy Proteomics server (http://web.expasy.org/compute_pi/).

Bioinformatic analysis of bHLH transcription factors for strawberry. Chromosomal localization
data was retrieved from NCBI Map Viewer (https://www.arabidopsis.org/mapview/). Genes were mapped to the chromosomes using MapDraw. These genes were renamed from FvbHLH1 to FvbHLH113 according to their position, from the top to bottom, on the F. vesca chromosome 8,41 . Multiple domain alignments of strawberry bHLH proteins and domains were performed using ClustalX 2.0.12 with default settings for obtained sequences of the FvbHLH domains, and alignment results were shown and drew by OriginPro 8 9 . To compare the evolutionary relationship between Arabidopsis (AtbHLH) and strawberry (FvbHLH), we obtained the phylogenetic tree for bHLH proteins using MEGA5.1 with the neighbor-joining method and the following parameters: complete deletion, p-distance model and 1000 replicates of bootstrap method 4,9 . 26 subfamilies were identified according to the clade support values, topology of the trees, branch lengths, visual inspection of the bHLH amino acid sequences and classification of strawberry 2,4,10 . The online Gene Structure Display Server (GSDS 2.0, http://gsds.cbi.pku.edu.cn/) was used to investigate the exon-intron structure of the FvbHLH transcription factors based on each coding sequence (CDS) and corresponding genomic sequence. Conserved motifs in FvbHLH transcription factors were identified from the online MEME (http://meme-suite.org/tools/meme). The FvbHLH25, FvbHLH29, FvbHLH80 and FvbHLH98 protein sequences were employed as queries for the BLAST-P search in Arabidopsis Information Resource (TAIR, https://www.arabidopsis.org/) to obtain protein sequences of AT4G16430, AtTT8, AtMYC2 and AtEGL3, respectively. Specific interaction network with experimental evidences of AT4G16430, AtTT8, AtMYC2 and AtEGL3 was constructed using online STRING 10 (http://string-db.org/) with option value >0.700 or 0.900.

Plant materials, growth conditions and treatments. Three octoploid cultivated strawberry varieties (F.
ananassa Duch. 'Benihoppe'; F. ananassa Duch. 'Xiaobai' , the white-flesh mutant of 'Benihoppe'; F. ananassa 'Snow Princess' with white fruit skin and flesh.) were used in this study (Fig. 4A). Plantlets of the three varieties were grown in the strawberry germplasm resource greenhouse of Zhengzhou Fruit Research Institute, Chinese Academy of Agricultural Sciences, Zhengzhou, Henan, China (Fig. 4A). Strawberry plantlets were transplanted into a plastic pot (diameter: 17 cm, height: 15 cm) containing soil mix (perlite: peat, 1: 4, v/v) and grown in greenhouse with temperatures ranging from 8 °C to 28 °C, relative humidity ranging from 55% to 70%, and without supplemental lighting.
To analyze transcript patterns of bHLH transcription factors, strawberry organs/tissues (roots, young leaves, mature leaves, runners, runner tips, runner with tips and one leaf, anthotaxy, flowers, small green fruit, middle green fruit, large green fruit, white fruit, initial red fruit, partial red fruit, full red fruit) were obtained from different developmental stages. Various vegetative and reproductive tissues were collected and stored at −80 °C for tissue-specific experiments. To analyze the expression level of bHLH transcription factors to different hormones, strawberry plantlets at the stage of the sixth leaf fully expanded were sprayed with ABA at 0.1 mM, Eth at 0.5 g/L, and water, respectively. Leaf samples were collected for RNA extraction at 0, 0.5, 1, 2, 4, 6, 9 and 12 hpt. Leaves with water treatment at 0 hpt were used as control. Each time for each treatment, one leaf from each of the three separate plants, thus three leaves in total, was picked up to conduct one analysis, and all treatments were performed thrice independently.