Skip to main content

Thank you for visiting You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

Genome-wide identification of zero nucleotide recursive splicing in Drosophila



Recursive splicing is a process in which large introns are removed in multiple steps by re-splicing at ratchet points—5′ splice sites recreated after splicing1. Recursive splicing was first identified in the Drosophila Ultrabithorax (Ubx) gene1 and only three additional Drosophila genes have since been experimentally shown to undergo recursive splicing2,3. Here we identify 197 zero nucleotide exon ratchet points in 130 introns of 115 Drosophila genes from total RNA sequencing data generated from developmental time points, dissected tissues and cultured cells. The sequential nature of recursive splicing was confirmed by identification of lariat introns generated by splicing to and from the ratchet points. We also show that recursive splicing is a constitutive process, that depletion of U2AF inhibits recursive splicing, and that the sequence and function of ratchet points are evolutionarily conserved in Drosophila. Finally, we identify four recursively spliced human genes, one of which is also recursively spliced in Drosophila. Together, these results indicate that recursive splicing is commonly used in Drosophila, occurs in humans, and provides insight into the mechanisms by which some large introns are removed.

This is a preview of subscription content

Access options

Buy article

Get time limited or full article access on ReadCube.


All prices are NET prices.

Figure 1: Identification and validation of recursive splice sites in Drosophila.
Figure 2: Identification of recursive lariat introns in Drosophila.
Figure 3: Characteristics of Drosophila ratchet points.
Figure 4: Expression characteristics of recursively spliced Drosophila genes.

Accession codes

Primary accessions

Sequence Read Archive

Data deposits

Sequences are available from and have been deposited in the Short Read Archive under accession numbers SRP056962, SRP056965 and SRP056969 and those listed in Supplementary Table 1.


  1. Hatton, A. R., Subramaniam, V. & Lopez, A. J. Generation of alternative Ultrabithorax isoforms and stepwise removal of a large intron by resplicing at exon-exon junctions. Mol. Cell 2, 787–796 (1998)

    CAS  Article  Google Scholar 

  2. Burnette, J. M., Miyamoto-Sato, E., Schaub, M. A., Conklin, J. & Lopez, A. J. Subdivision of large introns in Drosophila by recursive splicing at nonexonic elements. Genetics 170, 661–674 (2005)

    CAS  Article  Google Scholar 

  3. Conklin, J. F., Goldman, A. & Lopez, A. J. Stabilization and analysis of intron lariats in vivo. Methods 37, 368–375 (2005)

    CAS  Article  Google Scholar 

  4. Mackay, T. F. et al. The Drosophila melanogaster Genetic Reference Panel. Nature 482, 173–178 (2012)

    CAS  ADS  Article  Google Scholar 

  5. Graveley, B. R. et al. The developmental transcriptome of Drosophila melanogaster. Nature 471, 473–479 (2011)

    CAS  ADS  Article  Google Scholar 

  6. Brown, J. B. et al. Diversity and dynamics of the Drosophila transcriptome. Nature 512, 393–399 (2014)

    CAS  ADS  Article  Google Scholar 

  7. Oesterreich, F. C., Bieberstein, N. & Neugebauer, K. M. Pause locally, splice globally. Trends Cell Biol. 21, 328–335 (2011)

    CAS  Article  Google Scholar 

  8. Hollins, C., Zorio, D. A., MacMorris, M. & Blumenthal, T. U2AF binding selects for the high conservation of the C. elegans 3′ splice site. RNA 11, 248–253 (2005)

    CAS  Article  Google Scholar 

  9. Sibley, C. R. et al. Recursive splicing in long vertebrate genes. Nature (2015)

  10. Bieberstein, N. I., Carrillo Oesterreich, F., Straube, K. & Neugebauer, K. M. First exon length controls active chromatin signatures and transcription. Cell Rep. 2, 62–68 (2012)

    CAS  Article  Google Scholar 

  11. Huff, J. T., Plocik, A. M., Guthrie, C. & Yamamoto, K. R. Reciprocal intronic and exonic histone modification regions in humans. Nature Struct. Mol. Biol. 17, 1495–1499 (2010)

    CAS  Article  Google Scholar 

  12. Kolasinska-Zwierz, P. et al. Differential chromatin marking of introns and expressed exons by H3K36me3. Nature Genet. 41, 376–381 (2009)

    CAS  Article  Google Scholar 

  13. Trapnell, C., Pachter, L. & Salzberg, S. L. TopHat: discovering splice junctions with RNA-Seq. Bioinformatics 25, 1105–1111 (2009)

    CAS  Article  Google Scholar 

  14. Langmead, B., Trapnell, C., Pop, M. & Salzberg, S. L. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol. 10, R25 (2009)

    Article  Google Scholar 

  15. Taggart, A. J., DeSimone, A. M., Shih, J. S., Filloux, M. E. & Fairbrother, W. G. Large-scale mapping of branchpoints in human pre-mRNA transcripts in vivo. Nature Struct. Mol. Biol. 19, 719–721 (2012)

    CAS  Article  Google Scholar 

  16. Crooks, G. E., Hon, G., Chandonia, J. M. & Brenner, S. E. WebLogo: a sequence logo generator. Genome Res. 14, 1188–1190 (2004)

    CAS  Article  Google Scholar 

  17. Berriz, G. F., Beaver, J. E., Cenik, C., Tasan, M. & Roth, F. P. Next generation software for functional trend analysis. Bioinformatics 25, 3043–3044 (2009)

    CAS  Article  Google Scholar 

  18. Tilgner, H. et al. Nucleosome positioning as a determinant of exon recognition. Nature Struct. Mol. Biol. 16, 996–1001 (2009)

    CAS  Article  Google Scholar 

Download references


This work was supported by NHGRI grant U54HG006994 to S.E.C. (PI) and B.R.G. (co-PI), and R01GM095296 to B.R.G.

Author information

Authors and Affiliations



B.R.G. and S.E.C. supervised data production. S.O., A.O., M.B. and S.C.G. performed experiments. M.O.D., X.W., A.P., M.B., S.C.G. and B.R.G. performed computational analysis. B.R.G. wrote the paper with input from all authors.

Corresponding author

Correspondence to Brenton R. Graveley.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Extended data figures and tables

Extended Data Figure 1 Two approaches for identifying recursive splice sites.

a, Identification of recursive splice sites by parsing alignments. RNA-seq reads were mapped to the genome using TopHat in a manner that allowed for novel splice junctions to be predicted. The alignments were then parsed for splice junction reads where the 5′ splice site mapped to an annotated 5′ splice site, but the 3′ splice site was unannotated. b, De novo identification of recursive splice sites. A database was generated in which each annotated 5′ splice site was spliced to all AG/GT sequences in an intron that did not correspond to an annotated 3′ splice site. All RNA-seq reads were aligned to this database and the alignments parsed to find cases where reads mapped perfectly with at least three distinct offsets and at least an 8 nt overhang.

Extended Data Figure 2 Characteristics of Drosophila ratchet points.

a, Distribution of the number of ratchet points per recursive intron. b, Size distribution (log10(bp)) of all (red) and recursive (blue) introns. c, Size distribution (in kb) of the individual intron segments removed by recursive splicing binned by the number of segments per intron.

Extended Data Figure 3 RT–PCR validations of Drosophila recursive splicing events.

am, RT–PCR validation of ratchet points (red dots) from the indicated genes using primers in the upstream constitutive exon and flanking the putative ratchet points. The RP primers are expected to yield RT–PCR products if the constitutive exon is spliced to the ratchet point. The URP primers, which are upstream of each ratchet point, serve as negative controls. The identity of all RT–PCR products was verified by Sanger sequencing. Although the URP control RT–PCR reactions yielded a product for hppy RP1 and pum RP2, we were not able to generate sequence from them and therefore consider them to be amplification artefacts.

Extended Data Figure 4 Number of mapped reads per sample used for gene expression analysis.

Extended Data Figure 5 Chromatin marks associated with recursive splice sites.

a, Examples of chromatin marks at the luna gene locus, which contains 5 recursive splice sites (red triangles) within a single long intron. b, Heat maps show relative ChIP-seq enrichment for H3K4me3 (top, red), H3K79me2 (middle, green) and H3K36me3 (bottom, blue), within 2 kb of the indicated gene features from 171 genes containing at least one ratchet point. Heat maps are centred around gene features, which include the transcription start site of the first exon (first exon, arrow), the 5′ splice site of the exon upstream of the recursive splice site (upstream exon, black rectangle), the ratchet point (red triangle), the 3′ splice site of the exon downstream of the recursive splice site (downstream exon, black rectangle), and the poly(A) site of the last exon (last exon, red octagon); the average exon of each gene feature is drawn to scale. Genes are sorted from top to bottom by decreasing expression level. For genes containing more than one ratchet point, the first, upstream, downstream and last exons are represented multiple times. c, Histogram illustrating the intron positions the ratchet points reside in based on RefSeq annotations.

Extended Data Table 1 Summary of recursive intron lariats identified by directed RT–PCR and sequencing
Extended Data Table 2 Summary of total RNA-seq data from ldbr and U2AF RNAi experiments
Extended Data Table 3 Summary of total RNA-seq data from related Drosophila species
Extended Data Table 4 Summary of ratchet points experimentally validated in other Drosophila species
Extended Data Table 5 Summary of total RNA-seq data from human tissues

Supplementary information

Supplementary Information

This file contains Supplementary Figure 1 and full legends for Supplementary Tables 1-7. (PDF 8163 kb)

Supplementary Table 1

This table contains a summary of D. melanogaster RNA Sequencing Samples Generated for these studies – see Supplementary Information file for full legend. (XLSX 33 kb)

Supplementary Table 2

This table contains summary information for 197 D. melanogaster ratchet points – see Supplementary Information file for full legend. (XLSX 63 kb)

Supplementary Table 3

This table contains a comparison of ratchet points identified in this study and in Burnette et al. – see Supplementary Information file for full legend. (XLSX 43 kb)

Supplementary Table 4

This table contains a summary of recursive intron lariats identified from total RNA-Seq data – see Supplementary Information file for full legend. (XLSX 14 kb)

Supplementary Table 5

This table contains details of ratchet points experimentally validated in other Drosophila species – see Supplementary Information file for full legend. (XLSX 62 kb)

Supplementary Table 6

This table contains summary information for 5 human ratchet points– see Supplementary Information file for full legend. (XLSX 39 kb)

Supplementary Table 7

This table contains GO analysis of recursively spliced Drosophila genes – see Supplementary Information file for full legend. (XLSX 73 kb)

PowerPoint slides

Rights and permissions

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Duff, M., Olson, S., Wei, X. et al. Genome-wide identification of zero nucleotide recursive splicing in Drosophila. Nature 521, 376–379 (2015).

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI:

Further reading


By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.


Quick links

Nature Briefing

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing