Abstract
Micro-exons are exons of very small size (usually 3–30 nts). Some micro-exons are alternatively spliced. Their functions, regulation and evolution are largely unknown. Here, we present an example of an alternatively spliced 3 bp micro-exon (micro-Ex8) in the homothorax (hth) gene in Drosophila. Hth is involved in many developmental processes. It contains a MH domain and a TALE-class homeodomain (HD). It binds to another homeodomain Exd via its MH domain to promote the nuclear import of the Hth-Exd complex and serve as a cofactor for Hox proteins. The MH and HD domains in Hth as well as the HTh-Exd interaction are highly conserved in evolution. The alternatively spliced micro-exon lies between the exons encoding the MH and HD domains. We provide clear proof that the micro-Ex8 is produced by alternative splicing from a 48 bp full-length exon 8 (FL-Ex8) and the micro-Ex8 is the first three nt is FL-Ex8. We found that the micro-Ex8 is the ancient form and the 3 + 48 organization of alternatively spliced overlapping exons only emerged in the Schizophora group of Diptera and is absolutely conserved in this group. We then used several strategies to test the in vivo function of the two types of isoforms and found that the micro-Ex8 and FL-Ex8 isoforms have largely overlapping functions but also have non-redundant functions that are tissue-specific, which supports their strong evolutionary conservation. Since the different combinations of protein interaction of Hth with Exd and/or Hox can have different DNA target specificity, our finding of alternatively spliced isoforms adds to the spectrum of structural and functional diversity under developmental regulation.
Similar content being viewed by others
Introduction
In eukaryotes, most genes contain exons that are separated by non-coding introns. Mature mRNAs are formed from primary transcripts by excising introns and splicing exons together. Pre-mRNA splicing is conducted by the spliceosome that recognizes the consensus motifs including the 5′ splice site GU, the branch point followed by a short polypyrimidine tract and the 3′ splice site AG1,2. The spliceosome is a large ribonucleoprotein complex comprised of five small nuclear ribonucleoproteins (snRNPs, U1, U2, U4/U6, and U5) with small nuclear RNAs (snRNAs) and a number of associated proteins3. Alternative splicing confers the flexibility of the pre-mRNA splicing process, and is considered an important trigger for proteome expansion and gene diversity during evolution4,5.
Exons of very small size (usually 3–30 nts) are called micro-exons6,7. Examples include a 6 bp exon in the invected gene in Drosophila8,a 9 bp exon in wheat 1-SST gene9,a 9 bp exon in potato invertase genes10,a 1 bp exon in Arabidopsis APC-11 gene11,a 3 bp exon in Drosophila troponin T gene12,two 5 bp exons in Drosophila Ubx gene6,two 9 bp exons in Drosophila Fasciclin 1 gene13,an alternatively spliced 3 bp exon in mouse and rat NCAM gene14,15,a 6 bp exon in chicken cardiac troponin T gene 1616. Special algorithms were developed to systematically identify the micro-exons, especially from the RNA-Seq data17,18,19,20,21,22. These studies revealed that micro-exons are widely present in eukaryotic genomes7. Many of these micro-exons are alternatively spliced and their inclusion or exclusion in transcripts are often highly regulated during development and these regulations are generally conserved in evolution19,22.
Alternatively spliced micro-exons, if in-frame, may lead to local changes in protein sequence. If it is not in-frame, it may cause a drastic change of protein sequence or lead to protein truncation. Most micro-exons have a length in multiples of three, therefore are in-frame19,22. The local change in protein sequence can affect protein structure, subcellular localization, post-translational modification, enzymatic activity, or protein–protein interactions. Examples are known for effect on protein–protein interaction19,23 and post-translational modification24. Misregulation of micro-exon alternative splicing may lead to diseases, e.g. autism25. Therefore, the identification and functional study of micro-exons in developmentally important genes is vital to the understanding of the full spectrum and diversity of the gene’s functions and regulation.
Homothorax (Hth) is the Drosophila homolog of the vertebrate Meis homeodomain (HD) protein family26,27,28. Hth can bind to another homeodomain protein Extradenticle (Exd) to promote the nuclear translocation of Exd, thus making Exd functional in the nucleus by binding to target genes as Exd-Hth complex. The Exd-Hth complex in turn prevents the degradation of the Hth protein28,29,30,31. The most important feature for Exd-Hth is their function as cofactors of different Hox genes to increase DNA target specificity and contribute to developmental specificity 32,33,34. In addition, Hth-Exd also have Hox-independent functions in development 28,31,35.
Hth contains a highly conserved Meis-Hth (MH) domain (also called Homothorax-Meis (HM) domain26 and a highly conserved TALE class homeodomain (HD)26,27,28. The N-terminal MH domain interacts with Exd while the C-terminal HD is for DNA-binding to target genes31,36. The vertebrate Hth homolog Meis and Prep1 and the Exd homolog Pbx interact similarly37,38,39,40.
The hth transcription unit spans 132,008 bp and generates seven isoforms28,35,41 (FlyBase 2020_2). These can be grouped into three classes, the long isoforms (Hth-A, Hth-C and Hth-H) that encodes protein containing both MH and HD domains, the MH-only (or HDless) isoform (Hth-E, Hth-F and Hth-I) and the HD-only isoform (Hth-G) (Fig. 1). The Hth-E and Hth-F corresponds to the 7′ and 6′ isoform of Noro et al. (2006)35, respectively. The functions of the MH-only and HD-only isoforms have been analyzed35,41.
In this study, we report that the hth long forms have either a 48 bp exon 8 (FL-Ex8) or a 3 bp micro-exon 8 (micro-Ex8). We provide clear evidence that the micro-Ex8 was generated by alternatively splicing from the FL-Ex8. The micro-Ex8 is actually the first 3 bp in the FL-Ex8, so is an example of alternative splicing of overlapping alternative exons. The functions of these isoforms were examined in vivo by expressing specific isoforms in transgenic flies and by specific knockdown of different isoforms. Interestingly, we found that Hth-A (contains micro-Ex8) and Hth-C (contains FL-Ex8) possess isoform-specific functions although they share similarities in most cellular functions. We showed that the micro-Ex8 is conserved among Drosophila species, thereby allowing us to trace the evolutionary history of this alternatively spliced exon to its insect ancestors. The evolutionary conservation provides further support for the functional importance of the alternatively spliced 3 bp micro-Ex8 and the 48 bp FL-Ex8.
Results
hth isoforms with a 3 bp micro-exon
The Hth-C isoform has 14 exons and encodes a protein of 487 amino acids with a N-terminal MH domain and a C-terminal HD. It has a 48 bp exon 8 (FL-Ex8). In contrast, the Hth-A, Hth-H and Hth-G isoforms have a 3 bp micro-exon 8 (micro-Ex8). Their encoded proteins differ by 15 amino acids (SDPDGKFLSSLNPSELTYDGRW vs. SDPDGRW), which is located in a region between the highly conserved MH and HD domains. UAS-Hth constructs have been made using Hth-C28 and Hth-H42,43. RT-PCR analysis revealed that both isoforms are expressed from all developmental stages (Supplementary Fig. 1). By analyzing RNA-seq data, isoforms containing the FL-Ex8 and micro-Ex8 were expressed in adult males and females and in embryos (Fig. 2). The micro-Ex8 is slightly more abundant than the FL-Ex8 transcripts in adults, but much less abundant than FL-Ex8 in hemocytes from embryos. The HDless isoforms (isoforms E, F and I) are more abundant than the HD-containing isoforms (isoforms A, C, G and H) (Supplementary Fig. 2).
Micro-Ex8 is alternatively spliced from exon 8
The micro-Ex8 has only three nucleotides and the sequence of these nucleotides is ATG (Fig. 1a). Intriguingly, the first three nucleotide sequence in FL-Ex8 is also ATG, which is flanked by consensus splice acceptor and donor sites (AGATGGT)44. These observations suggested that the micro-Ex8 could be generated from the FL-Ex8 through an alternative splicing process. It is possible that these two exons belonged to (1) mutually exclusive exons or (2) usage of alternative 5′ donor sites. To check whether there may be two mutually exclusive exon 8, we addressed three possibilities. The first possibility is that there might be a separate micro-exon located in intron 7 or intron 8. By searching the AGATGGT sequence from exon 7 to exon 9 (including introns 7 and 8), only one such sequence was found in exon 8. We therefore hypothesized that the micro-Ex8 is produced by the AGATGGT sequence located in exon 8.
A second possibility is that there might be a genetic polymorphism in exon 8 in the fly population. Although there is a single hth gene in the fly genome, it is possible that the two types of isoforms were generated from two hth alleles in the fly population. Six fly stocks of independent origins, which are mutants independently isolated by different laboratories, were selected for analysis. Each of these carries a viable mutation on the third chromosome, so the third chromosome was isogenized during mutant isolation. If the two isoforms were from different hth alleles, then these fly stocks were expected to each carry only one of the alleles. We checked the six homozygous mutant stocks by PCR of genomic DNA. PCR primers were designed to locate in the introns 7 and 9, thus insuring only genomic DNA can be amplified. A single PCR product of 605 bp was generated from all six fly strains, which contained the full length 48 bp exon 8 (Fig. 3a). The expected product of 540 bp from the putative micro-Ex8 allele was not detected. Since there is a unique NdeI restriction site in the 45 bp stretch that is present in FL-Ex8 but absent in micro-Ex8, only the FL-Ex8 can be digested. Indeed, the PCR products from all six mutant and two wild-type strains, OR and CS-7, contained a NdeI restriction enzyme site and produced the expected products of 373 bp and 232 bp (Fig. 3a,b). The PCR products were also purified from gel and sequenced as a population and confirmed that there was only one type of exon 8 in hth in these fly populations (data not shown). These results conclude that micro-Ex8 is not generated by genetic polymorphism in different alleles.
Third, it is possible that polymorphism might exist within the junctions between exon 7/intron 7 or intron 8/exon 9. If this is the case, the 3 bp ATG may be provided by polymorphism at the end of exon 7 or in the beginning of exon 9. PCR primers were designed to amplify the junctions of exon 7/intron 7 and of intron 8/ exon 9. Among the six mutant strains, a single band was detected in the expected size for both PCR products (Supplementary Fig. 3a,b). By sequencing the PCR products, no extra ATG in these junctions was found.
Ruling out the possibilities for the existence of mutually exclusive exons, our results strongly suggest that the micro-Ex8 is generated from FL-Ex8 by alternative RNA splicing. This is further supported by the following analysis of hth mutation. hthH321 carries an A to T transition (GTAAGT to GTTAGT, Fig. 3a) and changes Lys286 to a stop codon in FL-Ex845,46, thus generating a truncated protein without the HD domain. This same site also constitutes an U1-snRNP recognition site (GTRAGT; R stands for purines)44 located immediately downstream of micro-Ex8 (Fig. 3a). The hthH321 mutation changed the U1-snRNP recognition site from GTAAGT to GTTAGT. Indeed, no micro-Ex8 isoform was detected by RT-PCR in hthH321 while FL-Ex8 level was elevated (Fig. 3c). This result implies that the production of micro-Ex8 is dependent on the U1-snRNP recognition site.
These results showed that micro-Ex8 is derived from the full length exon8 and its generation requires RNA splicing, thus supporting the idea that micro-Ex8 is generated by alternatively RNA splicing from the FL-Ex8. Here we provide an example of alternative usage of 5′ splice donor sites to choose one exon within another exon. The alternative splicing is in-frame, thus producing two Hth proteins that differ in 15 amino acids stretch located between the two conserved MH and HD domains.
Evolutionary history of the hth micro-exon
We then addressed whether these two alternative spliced isoforms exist among insect species. Indeed, this 3 bp micro-Ex8 within a 48 bp FL-Ex8 (designated as 3 + 48) is conserved in all sequenced Drosophila species (Fig. 4). The 3 + 48 exon 8 also exist in the house fly Musca domestica (Diptera: Muscidae). By using RT-PCR and genomic sequencing to trace the evolutionary origin of this exon, we found that the 3 bp micro-Ex8 (ACG), but not the 48 bp FL-Ex8, exists in Bombyx mori and Aedes aegypti. Acyrthosiphon pisum has a 6 bp exon 8. Apis mellifera does not have an exon 8. Based on genomic sequence analysis, another mosquito Anopheles gambiae also only has the 3 bp micro-Ex8. On the phylogenetic tree (Fig. 4, Supplementary Fig. 4), we found that the exon began as 6 bp (AATTGG) in the pea aphid Acyrthosiphon pisum (Hemiptera), then becomes 9 bp (ACGCCATGG) in the jewel wasp Nasonia vitripenis (Hymenoptera), the red flour beetle Tribolium castaneum (Coleoptera), the silk worm Bombyx mori (Lepidoptera), and then becomes 3 bp (ACG) in the Culicidae branch of Diptera which includes the mosquitos Anopheles gambiae, Aedes aegypti and Aedes albopictus. The honeybees (Apis mellifera, Apis dorsata and Apis florea; Hymenoptera) does not have an exon 8. Although the length and coding sequence of exon 8 vary in different insects, the reading frame is preserved in all cases. Altogether, these results suggest that a small exon 8 (9 bp or less) is ancient. The small exon 8 seems to be a constitutive exon. The emergence of the 48 bp FL-Ex8 coincides with the emergence of the Schizophora group 47, which include the Drosophilidae and Tephritidae (Fig. 4). The 3/48 bp micro-Ex8/FL-Ex8 combination becomes an alternative splice of overlapping exons.
Specific 5′ splice site rule for hth splicing in insects
Interestingly, we found that the exon boundary of all internal exons in hth are defined by the CAG-exon-GTGAGT or CAG-exon-GTAAGT, which is a specific subset of the 5′ splice site motifs48. Following this 5′ splice site rule, the annotated exon–intron structure can be re-constituted. For example, the original annotated exon–intron structure of hth in Anopheles gambiae was based on the CAG-exon-GTGAGG and showed that the Hth proteins are 78% identical in Aedes aegypti and Anopheles gambiae. After using the 5′ splice site rule to re-constitute the exon boundaries in hth, the exon length became better matched between Aedes aegypti and Anopheles gambiae (Table 1). The Hth protein now shares 91% identity between the two species. Moreover, we found that this rule is applicable for the hth gene in Diptera, Lepidoptera, Coleoptera, Hymenoptera and Hemiptera (Table 1).
Isoform-specific knockdown by microRNAs
To study the functions of distinct Hth isoforms, we generated UAS transgenes of microRNAs targeting to different regions of the hth transcripts (Fig. 1c,d). miR-C targets the 3′ portion of FL-Ex8 and is thus expected to specifically knockdown the FL-Ex8 isoform (Hth-C). miR-A targets the junction in Ex7/micro-Ex8/Ex9, so is expected to specifically knockdown the micro-Ex8 isoforms (Hth-A, Hth-H and Hth-G).
We generated clones that express each of these microRNAs in imaginal discs of hth1422–4/ + and used anti-Hth antisera to determine the effects by knockdown of distinct Hth isoforms. An antiserum (dG-20)49,50 generated against the C-terminal part of Hth, cannot detect the three HDless isoforms35. Clonal expression of miR-A significantly reduced the dG-20 signal (Fig. 5b,b′). Similarly, clonal expression of miR-C significantly reduced the dG-20 signal (Fig. 5d,d′). When miR-A and miR-C were coexpressed, the dG-20 signal was eliminated (Fig. 5e,e′). These results indicated that the microRNA knockdown was efficient, at least in an hth1422–4/ + background when the hth gene dosage is reduced. The results suggest that both Hth-A, Hth-C and Hth-G contribute to adequate level of Hth, which is also supported by our previous findings (Fig. 2 and Supplementary Fig. 1).
An anti-HthFL antiserum, generated against the full length Hth28, is expected to detect all Hth isoforms. In clones that express miR-A or miR-C alone, the anti-HthFL signal was only slightly reduced (Fig. 5a,a′,c,c′). When miR-A and miR-C were coexpressed, the anti-HthFL signal was not significantly reduced (data not shown). This result suggest that the long isoforms (Hth-A, Hth-H and Hth-C) constitute only a minor proportion of the endogenous Hth, consistent with the previous finding that the HDless isoforms make up a major proportion of the total Hth level35. This is also supported by RNA-seq data (Supplementary Fig. 2).
The miR-A sequence contains 19 nucleotides in the exon 7, raising the possibility that it may target to all isoforms with exon 7 (Hth-A, Hth-C, Hth-H and Hth-G) (Fig. 1a). To better understand the knockdown specificity of miR-A and miR-C, we combined ectopic expression and knockdown. We used the ap-Gal4 to ectopically express Hth-C (abbreviated as ap > Hth-C) in the dorsal region of the wing disc, which is largely non-overlapping with the endogenous Hth in the outer region of the prospective wing pouch (Fig. 5f, visualized by expressing GFP). In ap > miR-A + Hth-C, the Hth level was elevated significantly in the ap expression domain (Fig. 5h–h’, compare with f), indicating that the miR-A could not knockdown Hth-C. In ap > miR-C + Hth-C, the Hth was not detected in the ap expression domain (Fig. 5g, compare with h), except where the hth and ap expression domains overlap. In this overlap region, the Hth level was slightly reduced (Fig. 5g, compare with f). These results indicate that the miR-C can efficiently knockdown Hth-C. These results indicate that the FL-Ex8 (represented by Hth-C) can be efficiently knocked down by miR-C but not by miR-A, and also ruled out the possibility that miR-A might affect both Hth-A and Hth-C. This result is consistent with the conclusion that the sequence at 5′ end of micro RNA is less important for its RNAi function51.
Functional differences between isoforms containing FL-Ex8 and micro-Ex8
We asked whether the micro-Ex8 and FL-Ex8 isoforms may have distinct functions in vivo. First, we compared the effects of overexpressing the different isoforms. When Hth-A or Hth-C were ectopically expressed outside of the endogenous hth expression pattern (driven by ey-Gal4, ap-Gal4, dpp-Gal4, en-Gal4, and Dll-Gal4), both isoforms gave similar phenotypes. ey > Hth-C and ey > Hth-A similarly caused reduction or absence of eyes. Dll > Hth-C and Dll > Hth-A similarly caused absence of antennal a3 segment, truncated legs with no tarsus and tibia, and shorter palp. ap > Hth-C and ap > Hth-A similarly caused significant pupal lethality. The escapers have smaller notum and scutellum, less bristles and smaller and blistered wings (Fig. 7g,h). dpp > Hth-C and dpp > Hth-A similarly caused absence of eye, arista and palp, less bristles on the notum, truncated legs with no distal segments or no tibia but with tarsal segments. en > Hth-C and en > Hth-A similarly caused shorter femur, tibia and tarsus. Therefore, ectopic expression of Hth isoforms outside of the hth expression domain does not reveal functional differences between the FL-Ex8 and micro-Ex8 isoforms.
However, when Hth-A or Hth-C were overexpressed in the endogenous hth expression domain (hth > Hth-A and hth > Hth-C), some different phenotypes could be detected. Both genotypes caused significant pupal lethality (Table 2). The hth > Hth-A and hth > Hth-C escapers showed similar defects in multiple organs, such as the reduction or absence of eyes; deformed ocellar region; deformed or absence of palps; antenna with shorter a3 segment; some with ectopic antennal a2 segment in eye region; some with no or only one haltere; and disordered vibrissae (Fig. 6a,b). The only difference is that hth > Hth-A caused headless or wingless phenotypes with 12% and 6% penetrance, respectively (Fig. 6c,d), while hth > Hth-C does not. Since all the other phenotypes were similar, these differences cannot be attributed to a difference in protein expression level. The headless and wingless phenotypes caused by hth > Hth flies (UAS-GFP-Hth; Hth-H)43 have been described35. Therefore, the micro-Ex8 isoforms (Hth-A and Hth-H) have overexpression effect on head and wing development which is not induced by overexpression of the FL-Ex8 Hth-C.
Secondly, we examined the effect of specific knockdown of each isoform using hth-GAL4 (Table 2). hth > miR-A and hth > miR-C caused 100% pupal lethality but has normal head capsule and clypeus and no antenna-to-leg transformation. Reducing hth dosage in hth1422–1/ + did not enhance the phenotype. Since knockdown of either FL-Ex8 or micro-Ex8 isoforms caused pupal lethality, it suggests that the two isoforms have non-redundant functions and both are required. Knockdown of both FL-Ex8 and micro-Ex8 isoforms caused developmental defects not induced by single knockdown, suggesting that for many developmental processes, the two types of isoforms have redundant functions and both have to be knocked down to cause the developmental defects.
Thirdly, we checked whether the knockdown of one type of isoform can be functionally substituted by overexpressing another type (Table 2). We combined microRNA knockdown and overexpression, using the same hth-GAL4. In many cases we added UAS-GFP to equalize the number of UAS constructs. Coexpression of GFP did not alter the phenotypes. Knockdown by miR-A or miR-C all caused 100% pupal lethality. In hth > miR-A, coexpressing Hth-A or Hth-C gave nearly complete rescue of the pupal lethality and significant rescue of the developmental defects. This suggests that Hth-C (FL-Ex8) can functionally replace Hth-A (micro-Ex8) to a large extent. In hth > miR-C, coexpressing Hth-C gave significant rescue of the pupal lethality (58%) and significant rescue of the developmental defects. However, coexpressing Hth-A failed to rescue (98% lethality) and the pharates have phenotypes similar to those of hth > Hth-A. The lack of rescue indicates that Hth-A (micro-Ex8) cannot functionally replace the loss of Hth-C (FL-Ex8). These results suggest a functional difference between isoforms with FL-Ex8 or micro-Ex8. This difference is unlikely due to difference in expression level nor affected by the number of UAS constructs.
ap > Hth-A and ap > Hth-C caused similar developmental defects (Fig. 7g,h). As expected, the ap > Hth-A overexpression phenotype could be rescued by coexpressing miR-A (ap > miR-A + Hth-A; Fig. 7i), and the ap > Hth-C overexpression phenotype could be rescued by coexpressing miR-C (ap > miR-C + Hth-C; data not shown). Significantly, the ap > Hth-C phenotypes could not be rescued by coexpressing miR-A (ap > miR-A + Hth-C) (Fig. 7j,k,l).
Hth is also involved in tracheal development. At the end of larval stage, the large larval cells in the tracheal metamere Tr2 can divide and become small imaginal cells and incorporate into the adult tracheal system and this repopulation requires hth52. We checked which isoforms is required for tracheal development. Hth-A or Hth-C was knocked down in the tracheal cells by the btl-GAL4 in a hth1422–4/ + heterozygous background (Fig. 7a–f). Tracheal development was not affected in btl > miR-C flies (Fig. 7b,e) while btl > miR-A induced a delay of cell repopulation of tracheae (Fig. 7c,f, compare with Fig. 7a,d). Notably, 96% of btl > miR-C pharate adult has eclosed from the pupa while only 45% of btl > miR-A has eclosed. Taken together, these results imply that Hth-A plays a primary role and Hth-C might play a minor role in the repopulation in Tr2 and TC.
Discussion
A 3 bp micro-exon generated by alternative splicing in the hth gene
In this study, we have provided unequivocal evidence that the Hth-A and Hth-C isoforms of hth differ by the choice of micro-Ex8 or FL-Ex8 through alternative splicing. We first showed that both isoforms are actually expressed, evident by RT-PCR and RNA-Seq. We then ruled out possibilities other than alternative splicing that may explain the existence of two isoforms. Finally, we showed that alteration in the U1-snRNP recognition site located immediately downstream of micro-Ex8 completely abolished the production of micro-Ex8. Therefore, micro-Ex8 and FL-Ex8 are produced by alternative splicing. This is an interesting example of alternative choice of overlapping exons.
Previous studies showed that the short-length of micro-exons are difficult to be recognized by the splicing machinery and can cause exon skipping53. More recently, it has been reported that micro-exons have stronger 5′ and 3′ splice sites and higher density of splicing enhancers. These may promote the association of RNA binding proteins and cause inclusion or exclusion of alternative exons22. The inclusion or exclusion of alternative exons can affect protein function such as altering protein structure, modulating post-translational modification or facilitating degradation54,55 and may contribute to human disease. In the Hth case reported here, the alternative 5′ splice site choice of Hth results in a 15 aa deletion (from FL-Ex8 to micro-Ex8) in the Hth protein. This is similar to the human Wilms’ tumor suppressor gene WT, which has an alternatively spliced exon 9, resulting in the inclusion or exclusion of three amino acids (KTS)56. Mice expressing only + KTS or only − KTS splice isoforms have distinct and severe phenotypes in kidney and gonad formation57, indicating these two isoforms have differential functions on urogenital development. Mutations affecting the alternative slicing results in the Frasier syndrome with gonadal dysgenesis and kidney dysfunction58. Hence, micro-exons are attractive model for studying alternative splicing to aid the understanding of biological functions and processes.
Evolutionary history of the micro-exon 8
Our analysis of available genomic sequences showed that a small and constitutive exon 8 (9 bp or less) in the hth gene is ancient. It appeared in the holometabolous insects. Although the length and coding sequence of exon 8 vary in different insects (0, 3, 6, 9), the reading frame is preserved in all cases. The emergence of the 48 bp FL-Ex8 in the Schizophora group, which include the Drosophilidae and Tephritidae (Fig. 4), suggests it is more derived feature.
The isoforms A and C differ by 15 amino acids (GKFLSSLNPSELTYD) located between the conserved MH and HD domains. This 15 amino acids stretch is 100% conserved among Diptera from Drosophila species to Tephritidae (Ceratitis capitate, Bactrocera latifrons, Zeugodacus cucurbitae), and differs by one residue (GKFLSSLNPTELTYD) in Muscidae (Musca domestica, Stomoxys calcitrans, Lucilia cuprina). There is no homology outside of these groups, consistent with our finding that the 48 bp FL-Ex8 is a newly evolved feature in these groups. Such strong conservation of the 15 amino acids stretch suggests that it may be functional. The strong conservation of the 3/48 bp micro-Ex8/FL-Ex8 also suggest that this alternative splicing is important, perhaps keeping a balance of the level of the two isoforms.
Functional diversity of Hth isoforms by alternative splicing
We tested the functions of isoform Hth-A and Hth-C in vivo by (1) expressing specific isoforms in transgenic flies, (2) specific knockdown of different isoforms and (3) knockdown specific isoform and determine whether the expression of another isoform can rescue the phenotypes. In the ectopic expression experiments, Hth-A and Hth-C caused similar phenotypes in multiple tissues when ectopically expressed out of hth endogenous expression domain. However, when overexpressed in the hth endogenous expression domain, Hth-A (micro-Ex8) caused additional phenotypes (headless and wingless) not seen in Hth-C (FL-Ex8). In knockdown experiments, knockdown of either FL-Ex8 or micro-Ex8 isoforms caused pupal lethality, suggesting that the two isoforms are both required, so must have non-redundant functions. Also, the micro-Ex8 isoform, but not FL-EX8, is required for tracheal development. In the rescue experiments, Hth-C (FL-Ex8) can functionally substitute for the loss of Hth-A (micro-Ex8), but not vice versa. Altogether these results suggest a functional difference between isoforms with FL-Ex8 or micro-Ex8. They are both essential for Drosophila development. This is consistent with the absolute conservation of the 3/48 overlapping alternative exons organization in insects. Since micro-Ex8 is the ancient form, the newer FL-Ex8 has lost certain functions, so it cannot functionally substitute for the loss of micro-Ex8 (for tracheal development and in overexpression). FL-Ex8 must have acquired new and unique functions, so that it is also required for pupa survival.
Both Hth and Exd contain HD of the TALE (Three Amino acids Loop Extension) classes34,59. Hth belongs to the MEIS subclass that includes the Meis1, Meis2, Meis3, Prep1, and Prep2 in mammals and unc-62 and psa-2, in the nematode C. elegans. Exd belongs to the PBC subclass that includes the mammalian Pbx1, Pbx2, Pbx3, Pbx4 genes in mammal and ceh-20, ceh-40, ceh-60 in C. elegans. MEIS proteins can heterodimerize with PBC proteins, heterodimerize with Hox proteins, and can also form MEIS/PBC/Hox trimeric complex32,34,60. The MEIS proteins can function with Hox proteins and also have Hox-independent functions33,43,61,62,63,64. The functional diversity may partly rely on the combination of interactions with diverse protein partners through different regions. The 15 amino acid stretch produced by FL-Ex8 lies in between the MH and HD domains. Although X-ray structure of MEIS proteins has not been reported, the 15 residue difference between micro-Ex8 and FL-Ex8 may potentially affect either the Hth protein structure or its interaction with Exd or other proteins. Therefore, the functional difference between the micro-Ex8 and FL-Ex8 isoforms may stem from differential protein interactions through this region. The alternatively spliced micro-Ex8 and FL-Ex8 exons adds to the structural diversity and potential diversity in protein interactions, providing additional layer of regulation.
Connection with cancers
The mammalian MEIS and PBC genes are involved in the formation of various cancers65. Several HOX genes are also known to be involved in carcinogenesis66. Drugs aimed at affecting the HOX/PBX dimerization are being developed65. Therefore, understanding the structural and functional diversity of the MEIS, PBC and Hox proteins may provide drug targets for specific manipulation of specific protein interactions. Since each of these genes does not produce a single protein species, understanding their alternative splicing isoforms is crucial to understand their full spectrum of structural and functional diversity.
Material and mmethods
Drosophila genetics
Fly culture and crosses were performed according to standard procedure at 25 °C unless otherwise noted. Clonal analysis was generated by FLP/FRT-mediated mitotic recombination. Fly stocks used were hth[1422–4] 27,28,hth[H321]45,46,hth[P1-K6–1]28 and hth-GAL467.
Immunohistochemistry
Antibody staining for imaginal discs was as previously described28. Primary antibodies used were dG-20 (Santa Cruz Biotechnology, Inc) that specifically recognizes the C-terminal of Hth and anti-Hth-FL28. Species-matched Cy2- and Cy3-conjugated secondary antibodies were from Jackson ImmunoResearch. Confocal imaging was performed using Zeiss LSM microscopy.
hth miRNA generation
The sequences of hth isoforms were obtained from FlyBase2020_01. Targeting sequences for miR-A: 5′-atacatcagaccctgatggacg; miR-C: 5′-tcggtccttcagaactaacata; The miRNA construct was generated as described in Yao et al., 200868, using the method of Chen et al., 200769.
RNA-sequencing data analysis
The RNA-seq datasets were obtained from the modENCODE project GSE2807870 and ERP119517. The sequencing reads from the National Center for Biotechnology Information short read archive (https://www.ncbi.nlm.nih.gov/sra) were aligned to the Drosophila melanogaster genome version Release 6 + ISO1 MT/dm6 using STAR 2.6.1a with Ensembl reference71. Sequence reads were aggregated into a count for hth gene using htseq-count. Then read count values were transformed to RPKM with the following formula: RPKM = total exon reads/mapped reads (millions)*exon length (KB). Total reads were calculated with Flagst tools.
References
Mount, S. M. A catalogue of splice junction sequences. Nucleic Acids Res. 10(2), 459–472 (1982).
Shapiro, M. B. & Senapathy, P. RNA splice junctions of different classes of eukaryotes: sequence statistics and functional implications in gene expression. Nucleic Acids Res. 15(17), 7155–7174 (1987).
Will, C. L. & Lührmann, R. Spliceosome structure and function. Cold Spring Harb. Perspect. Biol. 3(7), a003707 (2011).
Lareau, L. F., Green, R. E., Bhatnagar, R. S. & Brenner, S. E. The evolving roles of alternative splicing. Curr. Opin. Struct. Biol. 14, 273–182 (2004).
Ule, J. & Blencowe, B. J. Alternative splicing regulatory networks: functions, mechanisms, and evolution. Mol Cell. 76(2), 329–345 (2019).
Beachy, P. A., Helfand, S. L. & Hogness, D. S. Segmental distribution of bithorax complex proteins during Drosophila development. Nature 313, 545–551 (1985).
Ustianenko, D., Weyn-Vanhentenryck, S. M. & Zhang, C. Microexons: discovery, regulation, and function. Wiley Interdiscip. Rev. RNA. 8(4), 1418 (2017).
Coleman, K. G., Poole, S. J., Weir, M. P., Soeller, W. C. & Kornberg, T. The invected gene of Drosophila: sequence analysis and expression studies reveal a close kinship to the engrailed gene. Genes Dev. 1(1), 19–28 (1987).
Yoshida, M., Lin, D. & Kawakami, A. A mini exon in the sucrose: sucrose 1-fructosyltransferase gene of wheat. J. Plant Physiol. 161(11), 1277–1279 (2004).
Simpson, C. G. et al. Requirements for mini-exon inclusion in potato invertase mRNAs provides evidence for exon-scanning interactions in plants. RNA 6(3), 422–433 (2000).
Guo, L. & Liu, C. M. A single-nucleotide exon found in Arabidopsis. Sci. Rep. 5, 18087 (2015).
Benoist, P., Mas, J. A., Marco, R. & Cervera, M. Differential muscle-type expression of the Drosophila troponin T gene. A 3-base pair microexon is involved in visceral and adult hypodermic muscle specification. J. Biol. Chem. 273(13), 7538–7546 (1998).
McAllister, L., Rehm, E. J., Goodman, G. S. & Zinn, K. Alternative splicing of micro-exons creates multiple forms of the insect cell adhesion molecule fasciclin I. J. Neurosci. 12(3), 895–905 (1992).
Santoni, M. J. et al. Differential exon usage involving an unusual splicing mechanism generates at least eight types of NCAM cDNA in mouse brain. EMBO J. 8(2), 385–392 (1989).
Reyes, A. A., Small, S. J. & Akeson, R. At least 27 alternatively spliced forms of the neural cell adhesion molecule mRNA are expressed during rat heart development. Mol Cell Biol. 11(3), 1654–1661 (1992).
Cooper, T. A. & Ordahl, C. P. A single cardiac troponin T gene generates embryonic and adult isoforms via developmentally regulated alternate splicing. J. Biol. Chem. 260(20), 11140–11148 (1985).
Volfovsky, N., Haas, B. J. & Salzberg, S. L. Computational discovery of internal micro-exons. Genome Res. 13(6A), 1216–1221 (2013).
Wu, T. D. & Watanabe, C. K. GMAP: a genomic mapping and alignment program for mRNA and EST sequences. Bioinformatics 21(9), 1859–1875 (2005).
Irimia, M. et al. A highly conserved program of neuronal microexons is misregulated in autistic brains. Cell 159(7), 1511–1523 (2014).
Wu, J., Anczuków, O., Krainer, A. R., Zhang, M. Q. & Zhang, C. OLego: fast and sensitive mapping of spliced mRNA-Seq reads using small seeds. Nucleic Acids Res. 41(10), 5149–5163 (2013).
Yan, Q. et al. Systematic discovery of regulated and conserved alternative exons in the mammalian brain reveals NMD modulating chromatin regulators. Proc. Natl. Acad. Sci. USA 112(11), 3445–3450 (2015).
Li, Y. I., Sanchez-Pulido, L., Haerty, W. & Ponting, C. P. RBFOX and PTBP1 proteins regulate the alternative splicing of micro-exons in human brain transcripts. Genome Res. 25(1), 1–13 (2015).
Tsyba, L. et al. Alternative splicing affecting the SH3A domain controls the binding properties of intersectin 1 in neurons. Biochem. Biophys. Res. Commun. 372(4), 929–934 (2008).
Zibetti, C. et al. Alternative splicing of the histone demethylase LSD1/KDM1 contributes to the modulation of neurite morphogenesis in the mammalian nervous system. J Neurosci. 30(7), 2521–2532 (2010).
Quesnel-Vallières, M., Irimia, M., Cordes, S. P. & Blencowe, B. J. Essential roles for the splicing regulator nSR100/SRRM4 during nervous system development. Genes Dev. 29(7), 746–759 (2015).
Rieckhof, G. E., Casares, F., Ryoo, H. D., Abu-Shaar, M. & Mann, R. S. Nuclear translocation of extradenticle requires homothorax, which encodes an extradenticle-related homeodomain protein. Cell 91(2), 171–183 (1997).
Kurant, E. et al. dorsotonals/homothorax, the Drosophila homologue of meis1, interacts with extradenticle in patterning of the embryonic PNS. Development 125(6), 1037–1048 (1998).
Pai, C. Y. et al. The Homothorax homeoprotein activates the nuclear localization of another homeoprotein, extradenticle, and suppresses eye development in Drosophila. Genes Dev. 12(3), 435–446 (1998).
Aspland, S. E. & White, R. A. H. Nucleocytoplsmic localisation of extradenticle protein is spatially regulated throughout development in Drosophila. Development 124(3), 741–747 (1997).
Abu-Shaar, M., Ryoo, H. D. & Mann, R. S. Control of the nuclear localization of Extradenticle by competing nuclear import and export signals. Genes Dev. 13(8), 935–945 (1999).
Jaw, T. J. et al. Direct interaction of two homeoproteins, homothorax and extradenticle, is essential for EXD nuclear localization and function. Mech. Dev. 91(1–2), 279–291 (2000).
Mann, R. S. & Affolter, M. Hox proteins meet more partners. Curr. Opin. Genet Dev. 8(4), 423–429 (1998).
Moens, C. B. & Selleri, L. Hox cofactors in vertebrate development. Dev Biol. 291(2), 193–206 (2006).
Mann, R. S., Lelli, K. M. & Joshi, R. Hox specificity unique roles for cofactors and collaborators. Curr. Top. Dev. Biol. 88, 63–101 (2009).
Noro, B. et al. Distinct functions of homeodomain-containing and homeodomain-less isoforms encoded by homothorax. Genes Dev. 20(11), 1636–1650 (2006).
Ryoo, H. D., Marty, T., Casares, F., Affolter, M. & Mann, R. S. Regulation of Hox target genes by a DNA bound Homothorax/Hox/Extradenticle complex. Development 126(22), 5137–5148 (1999).
Chang, C. P. et al. Meis proteins are major in vivo DNA binding partners for wild-type but not chimeric Pbx proteins. Mol Cell Biol. 17(10), 5679–5687 (1997).
Knoepfler, P. S., Calvo, K. R., Chen, H., Antonarakis, S. E. & Kamps, M. P. Meis1 and pKnox1 bind DNA cooperatively with Pbx1 utilizing an interaction surface disrupted in oncoprotein E2a-Pbx1. Proc. Natl. Acad. Sci. USA 94(26), 14553–14558 (1997).
Berthelsen, J., Zappavigna, V., Ferretti, E., Mavilio, F. & Blasi, F. The novel homeoprotein Prep1 modulates Pbx-Hox protein cooperativity. EMBO J. 17(5), 1434–1445 (1998).
Knoepfler, P. S. et al. A conserved motif N-terminal to the DNA-binding domains of myogenic bHLH transcription factors mediates cooperative DNA binding with pbx-Meis1/Prep1. Nucleic Acids Res. 27(18), 3752–3761 (1992).
Corsetti, E. & Azpiazu, N. Functional dissection of the splice variants of the Drosophila gene homothorax (hth). Dev Biol. 384(1), 72–82 (2013).
Azpiazu, N. & Morata, G. Functional and regulatory interactions between Hox and extradenticle genes. Genes Dev. 12(2), 261–273 (1998).
Casares, F. & Mann, R. S. Control of antennal versus leg development in Drosophila. Nature 392(6677), 723–726 (1998).
Mount, S. M. et al. Splicing signals in Drosophila: intron size, information content, and consensus sequences. Nucleic Acids Res. 20(16), 4255–4262 (1992).
Salzberg, A. et al. Mutations affecting the pattern of the PNS in Drosophila reveal novel aspects of neuronal development. Neuron 13(2), 269–287 (1994).
Kurant, E., Eytan, D. & Salzberg, A. Mutational analysis of the Drosophila homothorax gene. Genetics 157(2), 689–698 (2001).
Wiegmann, B. M. et al. Episodic radiations in the fly tree of life. Proc. Natl. Acad. Sci. USA 108(14), 5690–5695 (2011).
Yeo, G. & Burge, C. B. Maximum entropy modeling of short sequence motifs with applications to RNA splicing signals. J. Comput. Biol. 11(2–3), 377–394 (2004).
Peng, H. W., Slattery, M. & Mann, R. S. Transcription factor choice in the Hippo signaling pathway: homothorax and yorkie regulation of the microRNA bantam in the progenitor domain of the Drosophila eye imaginal disc. Genes Dev. 23(19), 2307–2319 (2009).
Wang, C. W. & Sun, Y. H. Segregation of eye and antenna fates maintained by mutual antagonism in Drosophila. Development 139(18), 3413–3421 (2012).
Brennecke, J., Stark, A., Russell, R. B. & Cohen, S. M. Principles of microRNA-target recognition. PLoS Biol. 3(3), e85 (2005).
Sato, M., Kitada, Y. & Tabata, T. Larval cells become imaginal cells under the control of homothorax prior to metamorphosis in the Drosophila tracheal system. Dev Biol. 318(2), 247–257 (2008).
Dominski, Z. & Kole, R. Selection of splice sites in pre-mRNAs with short internal exons. Mol Cell Biol. 11(12), 6075–6083 (1991).
Tsyba, L. et al. Alternative splicing affecting the SH3A domain controls the binding properties of intersectin 1 in neurons. Biochem. Biophys. Res. Commun. 372(4), 929–934 (2008).
Zibetti, C. et al. Alternative splicing of the histone demethylase LSD1/KDM1 contributes to the modulation of neurite morphogenesis in the mammalian nervous system. J Neurosci. 30(7), 2521–2532 (2010).
Barbaux, S. et al. Donor splice-site mutations in WT1 are responsible for frasier syndrome. Nat Genet. 17(4), 467–470 (1997).
Hammes, A. et al. Two splice variants of the Wilms’ tumor 1 gene have distinct functions during sex determination and nephron formation. Cell 106(3), 319–329 (2001).
Klamt, B. et al. Frasier syndrome is caused by defective alternative splicing of WT1 leading to an altered ratio of WT1 +/−KTS splice isoforms. Hum. Mol. Genet. 7(4), 709–714 (1998).
Bürglin, T. R. & Affolter, M. Homeodomain proteins: an update. Chromosoma 125(3), 497–521 (2016).
Merabet, S. & Mann, R. S. To be specific or not: the critical relationship between Hox and TALE proteins. Trends Genet. 32(6), 334–347 (2016).
Bessa, J. et al. meis1 regulates cyclin D1 and c-myc expression, and controls the proliferation of the multipotent cells in the early developing zebrafish eye. Development 135(5), 799–803 (2008).
Casares, F. & Mann, R. S. The ground state of the ventral appendage in Drosophila. Science 293(5534), 1477–1480 (2001).
Jiang, Y., Shi, H., Amin, N. M., Sultan, I. & Liu, J. Mesodermal expression of the C. elegans HMX homolog mls-2 requires the PBC homolog CEH-20. Mech Dev. 125(5–6), 451–461 (2008).
Laurent, A. et al. Identification of a new type of PBX1 partner that contains zinc finger motifs and inhibits the binding of HOXA9-PBX1 to DNA. Mech Dev. 124(5), 364–376 (2007).
Morgan, R. et al. Targeting HOX/PBX dimers in cancer. Oncotarget 8, 32322–32331 (2017).
Li, B., Huang, Q. & Wei, G. H. The role of HOX transcription factors in cancer predisposition and progression. Cancers 11(4), 528 (2019).
Wernet, M. F. et al. Homothorax switches function of Drosophila photoreceptors from color to polarized light sensors. Cell 115(3), 267–279 (2003).
Yao, J. G. et al. Differential requirements for the Pax6(5a) genes eyegone and twin of eyegone during eye development in Drosophila. Dev Biol. 315(2), 535–551 (2008).
Chen, C. H. et al. A synthetic maternal-effect selfish genetic element drives population replacement in Drosophila. Science 316(5824), 597–600 (2007).
Graveley, B. R. The developmental transcriptome of Drosophila melanogaster. Nature 471(7339), 473–479 (2011).
Dobin, A. & Gingeras, T. R. Mapping RNA-seq Reads with STAR. Curr. Protoc. Bioinformatics. 51, 11141–111419 (2015).
Acknowledgements
We thank the Bloomington Stock Center, Drosophila Genomics Resource Center, and Taiwan FlyCore for providing fly stocks and cDNA clones, and the Imaging Core, Genomics Core and Bioinformatics-Biology Service Core of IMB, and the Neuroscience Core Facility of Academia Sinica (AS-CFII-108-106, Taiwan) for technical support. We thank Kuo-Chen Fang (IBMS, Academia Sinica) for helping with RNA-seq analysis. We thank Dr. Cheng-ting Chien, Chi-Kuang Yao and Trees-Juen Juang for helpful suggestions on the manuscript. This work was supported by grants from the Ministry of Science and Technology to Y.H.S. (NSC 91-2312-B-001-006, NSC 92-2312-B-001-006, NSC 96-2321-B-001-002, NSC 97-2321-B-001-002, NSC 98-2321-B-001-034, MOST 107-2311-B-001-018-MY3) and to L.-H.W. (MOST-107-2311-B-016-001-MY3 and MOST- 108-3111-Y-016-008) and postdoctoral fellowship from MOST to L.-W.C. (NSC-97-2811-B-001-073).
Author information
Authors and Affiliations
Contributions
I-C.T. conducted the validation of alternative splicing of micro-exon 8 (Figs. 1a, 3, Supp Fig 1,3) L-W.C. conducted the in vivo functional analysis (Fig. 1c,d, 5–7, Tables 1, 2). L-H.W. conducted the RNA-seq analysis (Fig. 2, Supp. Fig. 2). L-W.C. and L-H.W. conducted the evolutionary analysis (Fig. 4). L-H.W. prepared Fig. 1b and Supp. Fig. 4. Y.H.S. prepared Fig. 1a. I-C.T., L-W.C., L-H.W. and Y-H.S. all contributed to the design of the study and the writing of the manuscript. All authors reviewed the manuscript.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Chang, LW., Tseng, IC., Wang, LH. et al. Isoform-specific functions of an evolutionarily conserved 3 bp micro-exon alternatively spliced from another exon in Drosophila homothorax gene. Sci Rep 10, 12783 (2020). https://doi.org/10.1038/s41598-020-69644-1
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-020-69644-1
This article is cited by
-
Pervasive misannotation of microexons that are evolutionarily conserved and crucial for gene function in plants
Nature Communications (2022)
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.