Many long Drosophila introns are processed by an unusual recursive strategy. The presence of ~200 adjacent splice acceptor and splice donor sites, termed ratchet points (RPs), were inferred to reflect ‘zero-nucleotide exons’, whose sequential processing subdivides removal of long host introns. We used CRISPR–Cas9 to disrupt several intronic RPs in Drosophila melanogaster, some of which recapitulated characteristic loss-of-function phenotypes. Unexpectedly, selective disruption of RP splice donors revealed constitutive retention of unannotated short exons. Assays using functional minigenes confirm that unannotated cryptic splice donor sites are critical for recognition of intronic RPs, demonstrating that recursive splicing involves the recognition of cryptic RP exons. This appears to be a general mechanism, because canonical, conserved splice donors are specifically enriched in a 40–80-nt window downstream of known and newly annotated intronic RPs and exhibit similar properties to a broadly expanded class of expressed RP exons. Overall, these studies unify the mechanism of Drosophila recursive splicing with that in mammals.
This is a preview of subscription content, access via your institution
Open Access articles citing this article.
Differential fates of introns in gene expression due to global alternative splicing
Human Genetics Open Access 14 December 2021
Access Nature and 54 other Nature Portfolio journals
Get Nature+, our best-value online-access subscription
$29.99 per month
cancel any time
Subscribe to this journal
Receive 12 print issues and online access
$209.00 per year
only $17.42 per issue
Rent or buy this article
Get just this article for as long as you need it
Prices may be subject to local taxes which are calculated during checkout
De Conti, L., Baralle, M. & Buratti, E. Exon and intron definition in pre-mRNA splicing. Wiley Interdiscip. Rev. RNA 4, 49–60 (2013).
Singh, J. & Padgett, R. A. Rates of in situ transcription and splicing in large human genes. Nat. Struct. Mol. Biol. 16, 1128–1133 (2009).
Hollander, D., Naftelberg, S., Lev-Maor, G., Kornblihtt, A. R. & Ast, G. How are short exons flanked by long introns defined and committed to splicing? Trends Genet. 32, 596–606 (2016).
Hatton, A. R., Subramaniam, V. & Lopez, A. J. Generation of alternative Ultrabithorax isoforms and stepwise removal of a large intron by resplicing at exon-exon junctions. Mol. Cell 2, 787–796 (1998).
Burnette, J. M., Miyamoto-Sato, E., Schaub, M. A., Conklin, J. & Lopez, A. J. Subdivision of large introns in Drosophila by recursive splicing at nonexonic elements. Genetics 170, 661–674 (2005).
Duff, M. O. et al. Genome-wide identification of zero nucleotide recursive splicing in Drosophila. Nature 521, 376–379 (2015).
Sibley, C. R. et al. Recursive splicing in long vertebrate genes. Nature 521, 371–375 (2015).
Cook-Andersen, H. & Wilkinson, M. F. Molecular biology: Splicing does the two-step. Nature 521, 300–301 (2015).
Kelly, S. et al. Splicing of many human genes involves sites embedded within introns. Nucleic Acids Res. 43, 4721–4732 (2015).
Bender, W. et al. Molecular genetics of the bithorax complex in Drosophila melanogaster. Science 221, 23–29 (1983).
Fambrough, D., Pan, D., Rubin, G. M. & Goodman, C. S. The cell surface metalloprotease/disintegrin Kuzbanian is required for axonal extension in Drosophila. Proc. Natl. Acad. Sci. USA 93, 13233–13238 (1996).
Kondo, S. & Ueda, R. Highly improved gene targeting by germline-specific Cas9 expression in Drosophila. Genetics 195, 715–721 (2013).
Kondo, S. et al. New genes often acquire male-specific functions but rarely become essential in Drosophila. Genes Dev. 31, 1841–1846 (2017).
Mohn, F., Sienski, G., Handler, D. & Brennecke, J. The rhino-deadlock-cutoff complex licenses noncanonical transcription of dual-strand piRNA clusters in Drosophila. Cell 157, 1364–1379 (2014).
McMahon, A. C. et al. TRIBE: hijacking an RNA-editing enzyme to identify cell-specific targets of RNA-binding proteins. Cell 165, 742–753 (2016).
Chen, Y. A. et al. Cutoff suppresses RNA Polymerase II termination to ensure expression of piRNA precursors. Mol. Cell 63, 97–109 (2016).
Rodriguez, J., Menet, J. S. & Rosbash, M. Nascent-seq indicates widespread cotranscriptional RNA editing in Drosophila. Mol. Cell 47, 27–37 (2012).
Sienski, G., Dönertas, D. & Brennecke, J. Transcriptional silencing of transposons by Piwi and maelstrom and its impact on chromatin state and gene expression. Cell 151, 964–980 (2012).
Rozhkov, N. V., Hammell, M. & Hannon, G. J. Multiple roles for Piwi in silencing Drosophila transposons. Genes Dev. 27, 400–412 (2013).
Ferrari, F. et al. “Jump start and gain” model for dosage compensation in Drosophila based on direct sequencing of nascent transcripts. Cell Rep. 5, 629–636 (2013).
Wang, W. et al. Slicing and binding by Ago3 or Aub trigger Piwi-bound piRNA production by distinct mechanisms. Mol. Cell 59, 819–830 (2015).
Kim, D. et al. TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions. Genome Biol. 14, R36 (2013).
Reese, M. G., Eeckman, F. H., Kulp, D. & Haussler, D. Improved splice site detection in Genie. J. Comput. Biol. 4, 311–323 (1997).
We thank the Bloomington Drosophila Stock Center and the Developmental Studies Hybridoma Bank for fly stocks and antibodies used in this study. S.K. was supported by the Mochida Memorial Foundation for Medical and Pharmaceutical Research. Work in E.C.L.’s group was supported by the National Institutes of Health (R01-NS083833 and R01-GM083300) and MSK Core Grant P30-CA008748.
The authors declare no competing interests.
Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Integrated supplementary information
Supplementary Figure 1 Evidence and mechanistic models for recursive splicing.
(a) Sawtooth RNA-seq patterns are indicative of recursive splicing intermediates. Generally, RNA-seq coverage in introns is reflective of nascent transcription and resembles right-angled triangles, with highest coverage at the 5’ end and lowest at the 3’ end of the intron. However, introns that undergo recursive splicing consist of multiple intronic segments, each with its own right-angled triangle coverage, producing a sawtooth pattern. This property has been exploited to infer recursive splicing and annotate RPs. (b) Models for processing introns with RPs. It is conceivable that introns that contain RPs will be processed in one of two ways. First, the RP is utilized constitutively (path 1) and the intron is removed in two sequential steps. Second, the RP is skipped such that the entire intron is spliced out in one step (path 2). (c) A molecular model for recursive splicing. We propose that recursive splicing proceeds by first defining a cryptic RP exon, which is specified by the RP splice acceptor and a downstream cryptic splice donor. Definition of the cryptic RP exon allows removal of the first intron segment and production of the recursive intermediate. In the second splicing reaction, we propose that the regenerated RP splice donor outcompetes the cryptic splice donor, thereby removing the whole intron and ligating neighboring exons.
Supplementary Figure 2 Genes with RPs were manipulated to identify cryptic exons.
(a) UCSC genome browser screenshots display the three genes (Bx, kuz, and Ubx) manipulated in this study, including the approximate genomic locations of RPs within long host introns. (b) Nature of mutant alleles along with sequence alignments. (c) UCSC genome browser nucleotide-level screenshots of mutated RPs (grey highlight) along with cryptic exons detected in mutants (yellow highlight). *Ubx[RP] is an insertion mutant, which separates the RP splice acceptor and donor sites by 38nt.This 38nt insertion is retained in mutant animals. However, bioinformatic analysis has identified a naturally occurring cryptic exon and splice donor as indicated.
Supplementary Figure 3 kuz cryptic exon is retained in S2-R+ cells.
Sashimi plots were used to display the usage of the cryptic exon in modENCODE total RNA-seq data from S2-R+ cells, L1 stage larvae and 22-24hr embryos. Spliced reads can only be detected into and out of the cryptic exons in S2-R+ cells, suggesting selective inclusion and/or stabilization here.
Supplementary Figure 4 Quantification of relative cryptic exon inclusion ratios from wild-type and mutant recursive splicing minigenes.
x and kuz minigenes that contained the indicated mutations (see Fig 2E, G) were transfected into S2 cells and subjected to rt-PCR analysis. Relative exon inclusion was calculated by normalizing the intensity of the mRNA band to all indicated bands in the same lane, and then scaled to total expression observed in wt lane. Representative gels are shown (see also main Figure 2).
Supplementary Figure 5 Nascent RNA-seq datasets are more suited for detection of RPs.
(a, b) Nascent RNA-, total RNA- and mRNA-seq datasets from S2 cells were evaluated for different criteria. (a) Nascent RNA datasets have higher coverage at intronic loci. Long intronic segments - with no overlapping genes - were identified and reads mapping to these regions were summed and normalized. (b) Junction spanning reads that mapped to RPs identified in Duff et al. were summed and normalized. (c) Junction spanning reads found from all nascent RNA-seq and GRO-seq were merged and those with 3’ ends mapping to intronic regions were stratified by junction split (intron length) into three categories. For each category, pie charts were drawn to indicate tetranucleotide distributions at the 3’ junction end. Note that AGGT, which resembles minimal ratchet point sequences are enriched only within the long intron category.
Supplementary Figure 6 Novel RPs share sequence, structural, and evolutionary properties of known RPs.
(a) Comparison of average phyloP scores for “0-nt” RPs. New RPs were grouped into two categories based on whether they had sawtooth patterns in RNA-seq data or not. (b) Average length of host introns for RPs found in Duff et all and this study. (c) Sequence logos for categorized RPs.
Supplementary Figure 7 Example of intronic RP and RP-exon annotation in the msi gene.
The BLAT tool in UCSC genome browser was used to map RNA-seq reads to musashi (msi). 5’ ends of reads map to a msi 5' exon (zoomed in shot highlighted in red). 3’ ends of one read maps to an intronic RP (blue highlight) and a zoomed in nucleotide-level screenshot is included in blue. 3’ ends of the other read maps to an RP exon (green highlight) and a zoomed in nucleotide-level screenshot is included in green. Note the RNA-seq coverage in screenshots and that RP exons have distinct exon coverage, whereas intronic RPs have sawtooth coverage pattern. The core AGGT splice acceptor-donor pairs are marked in gray, while the larger splice consensus motifs are highlighted in yellow.
Supplementary Figure 8 Conservation and coding properties of RP exons by subcategory.
(a) Distribution of RP exons according to location in gene models. (b) RP exons were divided according to their location in 5’UTR, CDS, and 5’UTR/CDS (ones that contained alternate 5’UTR/start sites). The fully coding RP exons have a high level of evolutionary conservation, and the set with partial coding potential exhibit an intermediate level of conservation.
Supplementary Figure 9 Positional bias and conservation of cryptic donors stratified by splice scores.
(a) Splice donors found downstream of RPs (cryptic splice donors) or 1000 control AGGTs sites were grouped based on NNSPLICE splice site strength. Plotted is the positional bias of splice donor site position relative to ratchet points. Substantial enrichment is observed in the ~40-80 window downstream of ratchet points, but not control AGGT sequences, at NNSPLICE scores down to 0.5-0.6 (b) Average phyloP scores of splice donor sites downstream of ratchet points (RPs) segregated into those that are <100nt from RPs and >100nt away from RPs. Clear local conservation is observed amongst groups of cryptic donors scored down to ~0.6. (c) Table showing the number of non-overlapping RPs with cryptic splice sites grouped by splice site score.
Supplementary Text and Figures
Supplementary Figures 1–9 and Supplementary Note 1
Supplementary Dataset 1
Uncropped gels reported in this study.
Supplementary Dataset 2
Summary of datasets analyzed in this study.
Supplementary Dataset 3
Annotations of Drosophila intronic RPs and RP exons.
Supplementary Dataset 4
Examples of Drosophila RP exons. Shown are UCSC Genome Browser screenshots with gene models and RNA-seq evidence at cassette exons that can regenerate conserved 5’ splice sites.
Supplementary Dataset 5
UCSC genome browser screenshots of intronic RPs and their associated cryptic exons. Shown are 50 examples of intronic RPs with high-scoring cryptic splice donors; the inferred cryptic exons are highlighted in red.
Supplementary Dataset 6
Gene Ontology analyses of Drosophila genes undergoing recursive splicing. Hypergeometric tests and Bonferroni corrections were used to obtain adjusted P values.
Rights and permissions
About this article
Cite this article
Joseph, B., Kondo, S. & Lai, E.C. Short cryptic exons mediate recursive splicing in Drosophila . Nat Struct Mol Biol 25, 365–371 (2018). https://doi.org/10.1038/s41594-018-0052-6
This article is cited by
Differential fates of introns in gene expression due to global alternative splicing
Human Genetics (2022)