Diversity and dynamics of the Drosophila transcriptome

Brown, James B.; Boley, Nathan; Eisman, Robert; May, Gemma E.; Stoiber, Marcus H.; Duff, Michael O.; Booth, Ben W.; Wen, Jiayu; Park, Soo; Suzuki, Ana Maria; Wan, Kenneth H.; Yu, Charles; Zhang, Dayu; Carlson, Joseph W.; Cherbas, Lucy; Eads, Brian D.; Miller, David; Mockaitis, Keithanne; Roberts, Johnny; Davis, Carrie A.; Frise, Erwin; Hammonds, Ann S.; Olson, Sara; Shenker, Sol; Sturgill, David; Samsonova, Anastasia A.; Weiszmann, Richard; Robinson, Garret; Hernandez, Juan; Andrews, Justen; Bickel, Peter J.; Carninci, Piero; Cherbas, Peter; Gingeras, Thomas R.; Hoskins, Roger A.; Kaufman, Thomas C.; Lai, Eric C.; Oliver, Brian; Perrimon, Norbert; Graveley, Brenton R.; Celniker, Susan E.

doi:10.1038/nature12962

Download PDF

Article
Open access
Published: 16 March 2014

Diversity and dynamics of the Drosophila transcriptome

James B. Brown^1,2^na1,
Nathan Boley¹^na1,
Robert Eisman³^na1,
Gemma E. May⁴^na1,
Marcus H. Stoiber¹^na1,
Michael O. Duff⁴,
Ben W. Booth²,
Jiayu Wen⁵,
Soo Park²,
Ana Maria Suzuki^6,7,
Kenneth H. Wan²,
Charles Yu²,
Dayu Zhang⁸,
Joseph W. Carlson²,
Lucy Cherbas³,
Brian D. Eads³,
David Miller³,
Keithanne Mockaitis³,
Johnny Roberts⁸,
Carrie A. Davis⁹,
Erwin Frise²,
Ann S. Hammonds²,
Sara Olson⁴,
Sol Shenker⁵,
David Sturgill¹⁰,
Anastasia A. Samsonova^11,12,
Richard Weiszmann²,
Garret Robinson¹,
Juan Hernandez¹,
Justen Andrews³,
Peter J. Bickel¹,
Piero Carninci^6,7,
Peter Cherbas^3,8,
Thomas R. Gingeras⁹,
Roger A. Hoskins²,
Thomas C. Kaufman³,
Eric C. Lai⁵,
Brian Oliver¹⁰,
Norbert Perrimon^11,12,
Brenton R. Graveley⁴ &
…
Susan E. Celniker²

Nature volume 512, pages 393–399 (2014)Cite this article

61k Accesses
434 Citations
147 Altmetric
Metrics details

Subjects

Abstract

Animal transcriptomes are dynamic, with each cell type, tissue and organ system expressing an ensemble of transcript isoforms that give rise to substantial diversity. Here we have identified new genes, transcripts and proteins using poly(A)⁺ RNA sequencing from Drosophila melanogaster in cultured cell lines, dissected organ systems and under environmental perturbations. We found that a small set of mostly neural-specific genes has the potential to encode thousands of transcripts each through extensive alternative promoter usage and RNA splicing. The magnitudes of splicing changes are larger between tissues than between developmental stages, and most sex-specific splicing is gonad-specific. Gonads express hundreds of previously unknown coding and long non-coding RNAs (lncRNAs), some of which are antisense to protein-coding genes and produce short regulatory RNAs. Furthermore, previously identified pervasive intergenic transcription occurs primarily within newly identified introns. The fly transcriptome is substantially more complex than previously recognized, with this complexity arising from combinatorial usage of promoters, splice sites and polyadenylation sites.

Diverse cell-specific patterns of alternative polyadenylation in Drosophila

Article Open access 13 September 2022

Seungjae Lee, Yen-Chung Chen, … Eric C. Lai

Paralog transcriptional differentiation in the D. melanogaster-specific gene family Sdic across populations and spermatogenesis stages

Article Open access 20 October 2023

Bryan D. Clifton, Imtiyaz Hariyani, … José M. Ranz

A complete temporal transcription factor series in the fly visual system

Article 06 April 2022

Nikolaos Konstantinides, Isabel Holguera, … Claude Desplan

Main

Next-generation RNA sequencing (RNA-seq) has permitted the mapping of transcribed regions of the genomes of a variety of organisms^1,2. These studies demonstrated that large fractions of metazoan genomes are transcribed, and they also catalogued individual elements of transcriptomes, including transcription start sites³, polyadenylation sites^4,5, exons and introns⁶. However, the complexity of the transcriptome arises from the combinatorial incorporation of these elements into mature transcript isoforms. Studies that inferred transcript isoforms from short-read sequence data focused on a small subset of isoforms, filtered using stringent criteria^7,8. Studies using complementary DNA (cDNA) or expressed sequence tag (EST) data to infer transcript isoforms have not had sufficient sampling depth to explore the diversity of RNA products at most genomic loci⁹. Although the human genome has been the focus of intensive manual annotation¹⁰, analysis of strand-specific RNA-seq data from human cell lines reveals over 100,000 splice junctions not incorporated into transcript models¹¹. Thus, a large gap exists between genome annotations and the emerging transcriptomes observed in next-generation sequence data. In Drosophila, we previously described a non-strand-specific poly(A)⁺ RNA-seq analysis of a developmental time course through the life cycle⁶ and cap analysis of gene expression (CAGE) analysis of the embryo¹², which discovered thousands of unannotated exons, introns and promoters, and expanded coverage of the genome by identified transcribed regions, but not all elements were incorporated into full-length transcript models. Here we describe an expansive poly(A)⁺ transcript set modelled by integrative analysis of transcription start sites (CAGE and 5′ rapid amplification of cDNA ends (RACE)), splice sites and exons (RNA-seq), and polyadenylation sites (3′ expressed sequence tags (ESTs), cDNAs and RNA-seq). We analysed poly(A)⁺ RNA data from a diverse set of developmental stages⁶, dissected organ systems and environmental perturbations; most of this data is new and strand-specific. Our data provide higher spatiotemporal resolution and allow for deeper exploration of the Drosophila transcriptome than was previously possible. Our analysis reveals a transcriptome of high complexity that is expressed in discrete, tissue- and condition-specific messenger RNA and lncRNA transcript isoforms that span most of the genome and provides valuable insights into metazoan biology.

A dense landscape of discrete poly(A)⁺ transcripts

To broadly sample the transcriptome, we performed strand-specific, paired-end sequencing of poly(A)⁺ RNA in biological duplicate from 29 dissected tissue samples including the nervous, digestive, reproductive, endocrine, epidermal and muscle organ systems of larvae, pupae and adults. To detect RNAs not observed under standard conditions, we sequenced poly(A)⁺ RNA in biological duplicate from 21 whole-animal samples treated with environmental perturbations. Adults were challenged with heat-shock, cold-shock, exposure to heavy metals (cadmium, copper and zinc), the drug caffeine or the herbicide paraquat. To determine whether exposing larvae resulted in RNA expression from previously unidentified genes, we treated them with heavy metals, caffeine, ethanol or rotenone. Finally, we sequenced poly(A)⁺ RNA from 21 previously described¹³ and three ovary-derived cell lines (Supplementary Methods). In total, we produced 12.4 billion strand-specific read pairs and over a terabase of sequence data, providing 44,000-fold coverage of the poly(A)⁺ transcriptome.

Reads were aligned to the Drosophila genome as described⁶, and full-length transcript models were assembled using our custom pipeline termed GRIT¹⁴, which uses RNA-seq, poly(A)⁺seq, CAGE, RACE¹², ESTs¹⁵ and full-length cDNAs¹⁶ to generate gene and transcript models (Supplementary Methods). We integrated these models with our own and community manual curation data sets to obtain an annotation (Supplementary Information, section 12) consisting of 304,788 transcripts and 17,564 genes (Fig. 1a and Supplementary Fig. 1), of which 14,692 are protein-coding (Supplementary Data 1 and updates available at http://fruitfly.org). Ninety per cent of genes produce at most 10 transcript and five protein isoforms, whereas 1% of genes have highly complex patterns of alternative splicing, promoter usage and polyadenylation, and may each be processed into hundreds of transcripts (Fig. 1a, b). Our gene models span 72% of the euchromatin, an increase from 65% in FlyBase 5.12 (FB5.12), the reference annotation at the beginning of the modENCODE project (Supplementary Table 1 compares annotations in 2008–13). There were 64 euchromatic gene-free regions longer than 50 kb in FB5.12, and 25 remaining in FB5.45. Our annotation includes new gene models in each of these regions. Newly identified genes (1,468 total) are expressed in spatially and temporally restricted patterns (Supplementary Fig. 2), and 536 reside in previously uncharacterized gene-free regions. Others map to well-characterized regions, including the ovo locus, where we discovered a new ovary-specific, poly(A)⁺ transcript (Mgn94020, Supplementary Data 1 and 2), extending from the second promoter of ovo on the opposite strand and spanning 107 kb (Fig. 1c). Exons of 36 new genes overlap molecularly defined mutations with associated phenotypes (genome structure correction (GSC) P value ∼0.0002), indicating potential functions (Supplementary Table 2). For example, the lethal P-element insertions l(3)L3051 and l(3)L4111 (ref. 17) map to promoters of Mgn095159 and Mgn95009, respectively, indicating these may be essential genes. Nearly 60% of the intergenic transcription we previously reported⁶ is now incorporated into gene models.

**Figure 1: Overview of the annotation of the *Drosophila melanogaster* transcriptome.**

Transcript diversity

Over half of spliced genes (7,412; 56%) encode two or more transcript isoforms with alternative first exons. Most of such genes produce alternative first exons through coordinated alternative splicing and promoter usage (59%, 4,389 genes, hypergeometric P value < 1 × 10⁻¹⁶); however, a substantial number of genes use one, but not both mechanisms (Fig. 2a). Only 1,058 spliced genes have alternative first exons that alter protein-encoding capacity and increase the complexity of the predicted proteome. Some genes, such as G protein β-subunit 13F (Gβ13F, Fig. 2b and Supplementary Fig. 3) have exceptionally complex 5′ UTRs, but encode a single protein.

**Figure 2: Splicing complexity across the gene body.**

We measured splicing efficiency using the ‘per cent spliced in’ (Ψ) index—the fraction of isoforms that contain the particular exon⁶. Introns flanked by coding sequence are retained at an average Ψ = 0.7, whereas introns flanked by non-coding sequence are retained > fivefold more often, with an average Ψ = 3.8 (P < 1× 10⁻¹⁶ subsampling/two-sample t-test), and is most frequent in 5′ UTRs (mean Ψ = 5.1, Fig. 2c).

Despite the depth of our RNA-seq, these data show that 42% of genes encode only a single transcript isoform, and 55% encode a single protein isoform (Supplementary Methods). In mammals, it has been estimated that 95% of genes produce multiple transcript isoforms^18,19, (estimates for protein-coding capacity have not been reported).

The majority of transcriptome complexity is attributable to forty-seven genes that have the capacity to encode >1,000 transcript isoforms each (Supplementary Table 3), and account for 50% of all transcripts (Fig. 3a). Furthermore, 27% of transcripts encoded by these genes were detected exclusively in samples enriched for neuronal tissue, and another 56% only in the embryo (83% total). To determine their tissue specificities we conducted embryonic in situ expression assays (Fig. 3b) and found that 18 of 35 are detected only in neural tissue (51% compared with 10% genome-wide, hypergeometric P value < 1 × 10⁻¹⁶, Supplementary Table 4). Of these genes, 48% have 3′ UTR extensions in embryonic neural tissue²⁰ (5% genome-wide, P < 1× 10⁻¹⁶). Furthermore, 44% are targets of RNA editing (4% genome-wide⁶, P < 1 × 10⁻¹⁶, with 18 of 21 validated²¹), and 21% have 3′ UTR extensions and RNA editing sites (10 of 65 genome-wide, P < 1× 10⁻¹⁰⁰). The capacity to encode thousands of transcripts is largely specific to the nervous system and coincides with other classes of rare, neural-specific RNA processing.

**Figure 3: Complex splicing patterns are mainly limited to neural tissues.**

Tissue- and sex-specific splicing

To examine the dynamics of splicing, we calculated switch scores or ΔΨ, for each splicing event by comparing the maximal and minimal Ψ values across all samples, and in subsets including just the developmental and tissue samples. In contrast to the median Ψ values, the distribution of ΔΨ values is strikingly different between the developmental and tissue samples. Among the developmental samples, 38% of events have a ΔΨ ≥ 50%, whereas between the tissue samples 63% of events have a ΔΨ ≥ 50%. This difference is even more pronounced at higher ΔΨ thresholds—only 6% of events have a ΔΨ ≥ 80% between the developmental samples, whereas 31% of events have a ΔΨ ≥ 80% between the tissue samples. Thus, most splicing events are highly tissue-specific. Of the 17,447 alternative splicing events analysed (Supplementary Information, section 19), we find that 56.6% changed significantly (ΔΨ > 20%, Bayes factor >20). Clustering revealed groups of splicing events that are co-ordinately regulated in a tissue-specific manner. For example, 1,147 splicing events are specifically included in heads and excluded in testes or ovaries, whereas 797 splicing events are excluded in heads but included in testes or ovaries (Fig. 4a).

**Figure 4: Sex-specific splicing is mainly tissue-specific splicing.**

We identified hundreds of sex-specific splicing events from adult male and female RNA-seq data⁶. To further explore sex-specific splicing, we compared the splicing patterns in male and female heads enriched for brain tissues. There were striking differences in gene expression levels, however, only seven splicing events were consistently differentially spliced at each time point after eclosion (average ΔΨ > 20%), and these largely corresponded to genes in the known sex-determination pathway (Supplementary Information, section 19A). We find few examples of head sex-specific splicing. This is in contrast to previous studies, which have come to conflicting conclusions and used either microarrays analysing only a subset of splicing events or single read 36-bp RNA-seq^22,23 with an order of magnitude fewer reads²⁴.

We identified 575 alternative splicing events that are differentially spliced in whole male and female animals (ΔΨ > 20%) and analysed the tissue-specific splicing patterns of each event (Fig. 4b). We found that 186 of the 321 male-biased splicing events were most strongly included in testes or accessory glands, and 157 of 254 female-biased exons were ovary-enriched. Consistent with the extensive transcriptional differences observed in testes compared to other tissues, the genes containing male-specific exons are enriched in functions related to transcription. In contrast, the female-specific exon containing genes are enriched in functions involved in signalling and splicing ((http://reactome.org)²⁵, Supplementary Table 6). Together, these results indicate that the majority of sex-specific splicing is due to tissue-specific splicing in tissues present only in males or females.

Long non-coding RNAs

A growing set of candidate long non-coding RNAs (lncRNAs) have been identified in Drosophila^6,26,27. In FB5.45 there were 392 annotated lncRNAs, and it has been suggested that as many as 1,119 lncRNAs may be transcribed in the fly²⁸. However, this number was based on transcribed regions, not transcript models, and used non-stranded RNA-seq data²⁸. We find 3,880 genes produce transcripts with ORFs encoding fewer than 100 amino acids. Of these, 795 encode conserved proteins (Methods) longer than 20 amino acids. For example, a single exon gene on the opposite strand and in the last intron of the early developmental growth factor spätzle encodes a 42-amino-acid protein that is highly conserved across all sequenced Drosophila species. We identified 1,875 candidate lncRNA genes producing 3,085 transcripts, 2,990 of which have no overlap with protein-coding genes on the same strand (Supplementary Data 2). Some of these putative lncRNAs may encode short polypeptides, for example, the gene tarsal-less encodes three 11-amino-acid ORFs with important developmental functions²⁹. We determined protein conservation scores for each ORF between 20 and 100 amino acids (Supplementary Table 6). Of the 1,119 predicted lncRNAs²⁸, we provide full-length transcript models for 246 transcribed loci; the remainder were expressed at levels beneath thresholds used in this study. This is not surprising, the expression patterns of lncRNAs are more restricted than those of protein-coding genes: the average lncRNA is expressed (bases per kilobase per million mapped bases⁶ (BPKM) > 1) in 1.5 developmental and 3.2 tissue samples, compared to 6.6 and 17 for protein-coding genes, respectively. Many lncRNAs (563 or 30%) have peak expression in testes, and 125 are detectable only in testes. Similarly restricted expression patterns have been reported for lncRNAs in humans and other mammals^30,31.

Interestingly, all newly annotated genes overlapping molecularly defined mutations with phenotypes are lncRNAs (Supplementary Table 2). For instance, the mutation D114.3 is a regulatory allele of spineless (ss) that maps 4 kb upstream of ss³² and within the promoter of Mgn4221. Similarly, Mgn00541 corresponds to a described, but unannotated 2.0 kb transcript overlapping the regulatory mutant allele ci⁵⁷ of cubitus interruptus³³. It remains to be determined whether these mutations are a result of the loss-of-function of newly annotated transcripts or cis-acting regulatory elements (for example, enhancers) or both.

Antisense transcription

Drosophila antisense transcription has been reported³⁴, but the catalogue of antisense transcription has been largely limited to overlapping mRNAs transcribed on opposite strands. We identify non-coding antisense transcript models for 402 lncRNA loci that are antisense to mRNA transcripts of 422 protein-coding genes (for example, prd, Fig. 5a), and 36 lncRNAs form ‘sense-antisense gene-chains’ overlapping more than one protein-coding locus, as observed in mammals^30,35. In Drosophila, 21% of lncRNAs are antisense to mRNAs, whereas in human 15% of annotated lncRNAs are antisense to mRNAs (GENCODE v.10). We assembled antisense transcript models for 5,057 genes (29%, compared to previous estimates of 15%³⁴). For 67% of these loci, antisense expression is observable in at least one cell line, indicating that sense/antisense transcripts may be present in the same cells. LncRNA-mediated antisense accounts for a small minority of antisense transcription: 94% of antisense loci correspond to overlapping protein-coding mRNAs transcribed on opposite strands, and of these, 323 loci (667 genes) share overlapping CDSs. The majority of antisense is due to overlapping UTRs: 1,389 genes have overlapping 5′ UTRs (divergent transcription), 3,430 have overlapping 3′ UTRs (convergent transcription), and 540 have both, meaning that, as with many lncRNAs, they form gene-chains across contiguously transcribed regions. A subset of antisense gene-pairs overlap almost completely (>90%), which we term reciprocal transcription. There are 13 such loci (Supplementary Fig. 5) and seven are male-specific (none are female-specific).

The mRNA/lncRNA sense-antisense pairs tend to be more positively correlated in their expression than mRNA/mRNA pairs, (mean r = 0.16 compared with 0.13, Kolmogorov–Smirnov (KS) two-sample one-sided test P < 10⁻⁹), and although this mean effect is subtle, the trend is clearly visible in the quantiles (95th percentile lncRNA/mRNA 0.729 versus mRNA/mRNA 0.634, Supplementary Fig. 6a). This effect is stronger when the analysis is restricted to cell line samples (Supplementary Fig. 6b).

Even in homogenous cell cultures, evidence for sense-antisense transcription does not guarantee that both transcripts exist within individual cells: transcription could originate from exclusive events occurring in different cells. Cis-natural antisense transcripts (cis-NATs) are a substantial source of endogenous siRNAs³⁶, and their existence directly reflects the existence of precursor dsRNA. Cis-NAT-siRNA production typically involves convergent transcription units that overlap on their 3′ ends, but other documented loci generate siRNAs across internal exons, introns or 5′ UTRs^37,38,39. Analysis of head, ovary and testis RNAs showed that 328 unique sense/antisense gene pair regions generated 21-nucleotide RNAs indicative of siRNA production (Supplementary Table 8), and these were significantly enriched (Supplementary Fig. 7a, Supplementary Methods) for pairs showing positively correlated expression between sense and antisense levels across tissues (P = 2 × 10⁻⁵), embryo developmental stages (P = 4 × 10⁻³), conditions (P = 9 × 10⁻⁴) and across all samples (P = 3 × 10⁻⁵). The tissue distribution of these cis-NAT-siRNAs showed a bias for testis expression (Supplementary Fig. 7b), with fourfold greater number relative to ovaries (P = 2 × 10⁻¹⁷, binomial test) and sevenfold relative to heads (P = 4 × 10⁻²⁵) and expression levels of siRNAs were substantially higher in testes than other tissues (Supplementary Fig. 7c).

Over 80% of cis-NAT-siRNAs were derived from 3′-convergent gene pairs. Abundant siRNAs emanate from an overlap of the gryzun and CG14967 3′ UTRs (Supplementary Fig. 5). The remainder were distributed amongst CDSs, introns and 5′ UTRs. We identified abundant testis-enriched siRNA production from a 5′-divergent overlap of Cyt-c-d and CG31808 (Fig. 5b) and from the entire CDS of dUTPase and its antisense non-coding transcript Mgn99994.

Transcriptional effects of environmental stress

Whole-animal perturbations each exhibited condition-specific effects, for example, the metallothionein genes were induced by heavy metals (Fig. 6a), but not by other treatments (Supplementary Table 9). The genome-wide transcriptional response to cadmium (Cd) exposure involves small changes in expression level in thousands of genes (48 h after exposure), but only a small group of genes change > 20-fold, and this group includes six lncRNAs (the third most strongly induced gene is CR44138, Fig. 6a, Supplementary Fig. 8a). Four newly modelled lncRNAs are differentially expressed (1% false discovery rate (FDR)) in at least one treatment, and constitute newly described eco-responsive genes. Furthermore, 57 genes and 5,259 transcripts (of 811 genes) were detected exclusively in these treatment samples. Although no two perturbations revealed identical transcriptional landscapes, we find a homogeneous response to environmental stressors (Fig. 6b, Supplementary Fig. 8b). The direction of regulation for most genes is consistent across all treatments; very few are upregulated in one condition and downregulated in another. Classes of strongly upregulated genes included those annotated with the GO term “Response to Stimulus, GO:0050896” (most enriched, P value < 1 × 10⁻¹⁶, Supplementary Fig. 8c), and those that encode lysozymes (> tenfold), cytochrome P450s, and mitochrondrial components mt:ATPase6, mt:CoI, mt:CoIII (> fivefold). Genes encoding egg-shell, yolk and seminal fluid proteins are strongly downregulated in response to every treatment except ‘cold2’ and ‘heat shock’ (Supplementary Fig. 8d). For these two stressors, samples were collected 30 min after exposure, corresponding to an ‘early response test’ showing suppression of germ cell production is not immediate.

**Figure 6: Effects of environmental perturbations on the *Drosophila* transcriptome.**

Discussion

Most transcriptional complexity in Drosophila occurs in tissues of the nervous system, and particularly in the functionally differentiating central and peripheral nervous systems. A subset of ultra-complex genes encodes more than half of detected transcript isoforms and these are dramatically enriched for RNA editing events and 3′ UTR extensions, both phenomena largely specific to the nervous system. Our study indicates that the total information output of an animal transcriptome may be heavily weighted by the needs of the developing nervous system.

The improved depth of sampling and spatiotemporal resolution resulted in the identification of more than 1,200 new genes not discovered in our previous study of Drosophila development⁶. A large fraction of the new genes are testes-specific, and many of these are antisense RNAs, as previously described in mammals³⁰. Some new lncRNAs, such as Mgn94020 (Fig. 1), form sense/antisense gene-chains that bring distant protein-coding genes into transcriptional relationships, another phenomenon previously described only in mammals⁴⁰. Whenever Mgn94020 is detectably transcribed, the genes on the opposite strand in its introns are not, indicating that its transcription may serve a regulatory function independent of the RNA transcribed. The presence of short RNAs at many regions of antisense transcription indicates that sense and antisense transcripts are present in the same cells at the same times. Many of these Drosophila antisense transcripts correspond to ‘positionally equivalent’³⁰ antisense transcripts in human. In the two species we found antisense lncRNAs opposite to orthologous protein-coding genes. The apparent positional equivalence of fly and human antisense transcription at genes like Monocarboxylate transporter 1 (MCT1), even-skipped (EVX1), CTCF (CTCF), Adenosine receptor (ADORA2A), and many others^10,31 across 600 million years of evolution suggests a conserved regulatory mechanism basal to sexual reproduction in metazoans.

Perturbation experiments identified new genes and transcripts, but perhaps more importantly, a general response to stress that is broader than the heat shock pathway. A similar study conducted on marsh fishes in the wake of the Deepwater Horizon incident in the Gulf of Mexico⁴¹ demonstrated that the killifish response to chronic hydrocarbon exposure included induction of lyzosome genes, P450 cytochromes and mitochondrial components, and the downregulation of genes encoding eggshell and yolk proteins⁴¹. This overlap of expressional responses by gene families across phyla suggests a conserved metazoan stress response involving enhanced metabolism and the suppression of genes involved in reproduction.

We defined an extensive catalogue of putative lncRNAs. However, many genes are known to encode poorly conserved, short polypeptides, including genes specific to the male gonad and accessory gland. Analysis of ribosome profiling initially indicated that a number of mammalian lncRNAs may be translated⁴², but this observation has been difficult to validate by proteomics⁴³, and further analysis has suggested that although lncRNAs have signatures of ribosome occupancy, they are not translated⁴⁴. Therefore, while we refer to these RNAs as ‘non-coding’, additional data are needed to determine if they produce small polypeptides.

The biological consequences of many of the phenomena reported here, including the observation that many genes encoding RNA binding proteins exhibit extraordinary splicing complexity, often within their 5′ UTRs, require further study. The splicing factor pUf68 encodes more than 100 alternatively spliced 5′ UTR variants, but encodes a single protein. The idea that splicing factors may regulate one another to generate complex patterns of splicing is consistent with recent computational models⁴⁵. More generally, the role of complex splicing in the adult and developing nervous system is unclear. To answer the questions that come with increasingly complete transcriptomes in higher organisms, it will be necessary to study gene regulation downstream of transcription initiation, including the regulation of splicing, localization and translation.

Methods Summary

Animal staging, collection and RNA extraction

Tissues were dissected from Oregon R larval, pupal and adult staged animals synchronized with appropriate age indicators. Pupal and adult animals were treated with a number of environmental stresses. RNA was isolated using TRIzol (Invitrogen), treated with DNase and purified on a RNAeasy column (Qiagen). Poly(A)⁺ RNA was prepared from an aliquot of each total RNA sample using an Oligotex kit (Qiagen).

RNA-seq

Libraries were generated and sequenced on an Illumina Genome Analyzer IIx or HiSeq 2000 using paired-end chemistry and 76-bp or 100-bp cycles. The 454 sequencing used poly(A)⁺ RNA from Oregon R adult males and females and mixed-staged y¹ cn¹bw¹ sp¹. embryos. Sequences are available from the Short Read Archive (Accession numbers available in Supplementary Table 10) and the modENCODE website (http://www.modencode.org/, Supplementary Table 10). CAGE⁴⁶ was sequenced on an Illumina Genome Analyzer IIx with 36-bp reads. Poly(A)⁺seq was generated using a custom protocol (Supplementary Methods).

Analysis

RNA-seq, CAGE and poly(A)⁺ reads were mapped and filtered¹². GRIT was used to identify transcript models¹⁴. Expression levels for genes and exons were computed in BPKM⁶. GSC P values were computed⁴⁷. Ψ values were calculated with MISO⁴⁸. Differential expression analysis was conducted with a custom method (Supplementary Methods) and with DEseq⁴⁹. RPS-BLAST was used to conduct the conserved domain search with version v3.08 of the NCBI Conserved Domains Database (CDD) (Supplementary Methods). Orthology analysis between human and fly was conducted using DIOPT (http://www.flyrnai.org/cgi-bin/DRSC_orthologs.pl). Phenotypic alleles were downloaded from FlyBase r5.50, and were selected as any allele localized to the genome with a disease phenotype.

Accession codes

Data deposits

Sequences are available from the Short Read Archive and the modENCODE website, a list of accession numbers is given in Supplementary Table 10.

References

Mortazavi, A., Williams, B. A., McCue, K., Schaeffer, L. & Wold, B. Mapping and quantifying mammalian transcriptomes by RNA-Seq. Nature Methods 5, 621–628 (2008)
Article CAS PubMed Google Scholar
Nagalakshmi, U. et al. The transcriptional landscape of the yeast genome defined by RNA sequencing. Science 320, 1344–1349 (2008)
Article CAS ADS PubMed PubMed Central Google Scholar
Takahashi, H., Kato, S., Murata, M. & Carninci, P. CAGE (cap analysis of gene expression): a protocol for the detection of promoter and transcriptional networks. Methods Mol. Biol. 786, 181–200 (2012)
Article CAS PubMed PubMed Central Google Scholar
Mangone, M. et al. The landscape of C. elegans 3′UTRs. Science 329, 432–435 (2010)
Article CAS ADS PubMed PubMed Central Google Scholar
Jan, C. H., Friedman, R. C., Ruby, J. G. & Bartel, D. P. Formation, regulation and evolution of Caenorhabditis elegans 3′UTRs. Nature 469, 97–101 (2011)
Article CAS ADS PubMed Google Scholar
Graveley, B. R. et al. The developmental transcriptome of Drosophila melanogaster. Nature 471, 473–479 (2011)
Article CAS ADS PubMed Google Scholar
Trapnell, C. et al. Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks. Nature Protocols 7, 562–578 (2012)
Article CAS PubMed PubMed Central Google Scholar
Collins, J. E., White, S., Searle, S. M. & Stemple, D. L. Incorporating RNA-seq data into the zebrafish Ensembl genebuild. Genome Res. 22, 2067–2078 (2012)
Article CAS PubMed PubMed Central Google Scholar
Carninci, P. et al. Targeting a complex transcriptome: the construction of the mouse full-length cDNA encyclopedia. Genome Res. 13, 1273–1289 (2003)
Article PubMed PubMed Central Google Scholar
Harrow, J. et al. GENCODE: the reference human genome annotation for The ENCODE Project. Genome Res. 22, 1760–1774 (2012)
Article CAS PubMed PubMed Central Google Scholar
Djebali, S. et al. Landscape of transcription in human cells. Nature 489, 101–108 (2012)
Article CAS ADS PubMed PubMed Central Google Scholar
Hoskins, R. A. et al. Genome-wide analysis of promoter architecture in Drosophila melanogaster. Genome Res. 21, 182–192 (2011)
Article CAS PubMed PubMed Central Google Scholar
Cherbas, L. The transcriptional diversity of 25 Drosophila cell lines. Genome Res. 21, 301–314 (2011)
Article CAS PubMed PubMed Central Google Scholar
Boley, N. et al. Genome guided transcript construction from integrative analysis of RNA sequence data. Nature Biotechnol. http://dx.doi.org/10.1038/nbt.2850 (2014)
Celniker, S. E. & Rubin, G. M. The Drosophila melanogaster genome. Annu. Rev. Genomics Hum. Genet. 4, 89–117 (2003)
Article CAS PubMed Google Scholar
Stapleton, M. et al. The Drosophila gene collection: identification of putative full-length cDNAs for 70% of D. melanogaster genes. Genome Res. 12 1294–1300 (2002) 2
Article PubMed PubMed Central Google Scholar
Spradling, A. C. et al. The Berkeley Drosophila Genome Project gene disruption project: single P-element insertions mutating 25% of vital Drosophila genes. Genetics 153, 135–177 (1999)
CAS PubMed PubMed Central Google Scholar
Wang, E. T. et al. Alternative isoform regulation in human tissue transcriptomes. Nature 456, 470–476 (2008)
Article CAS ADS PubMed PubMed Central Google Scholar
Pan, Q., Shai, O., Lee, L. J., Frey, B. J. & Blencowe, B. J. Deep surveying of alternative splicing complexity in the human transcriptome by high-throughput sequencing. Nature Genet. 40, 1413–1415 (2008)
Article CAS PubMed Google Scholar
Smibert, P. et al. Global patterns of tissue-specific alternative polyadenylation in Drosophila. Cell Rep. 1, 277–289 (2012)
Article CAS PubMed PubMed Central Google Scholar
St Laurent, G. et al. Genome-wide analysis of A-to-I RNA editing by single-molecule sequencing in Drosophila. Nature Struct. Mol. Biol. 20, 1333–1339 (2013)
Article CAS Google Scholar
Telonis-Scott, M., Kopp, A., Wayne, M. L., Nuzhdin, S. V. & McIntyre, L. M. Sex-specific splicing in Drosophila: widespread occurrence, tissue specificity and evolutionary conservation. Genetics 181, 421–434 (2009)
Article CAS PubMed PubMed Central Google Scholar
Hartmann, B. et al. Distinct regulatory programs establish widespread sex-specific alternative splicing in Drosophila melanogaster. RNA 17, 453–468 (2011)
Article CAS PubMed PubMed Central Google Scholar
Chang, P. L., Dunham, J. P., Nuzhdin, S. V. & Arbeitman, M. N. Somatic sex-specific transcriptome differences in Drosophila revealed by whole transcriptome sequencing. BMC Genomics 12, 364 (2011)
Article CAS PubMed PubMed Central Google Scholar
Matthews, L. et al. Reactome knowledgebase of human biological pathways and processes. Nucleic Acids Res. 37, D619–D622 (2009)
Article CAS PubMed Google Scholar
Lipshitz, H. D., Peattie, D. A. & Hogness, D. S. Novel transcripts from the Ultrabithorax domain of the bithorax complex. Genes Dev. 1, 307–322 (1987)
Article CAS PubMed Google Scholar
Tupy, J. L. et al. Identification of putative noncoding polyadenylated transcripts in Drosophila melanogaster. Proc. Natl Acad. Sci. USA 102, 5495–5500 (2005)
Article CAS ADS PubMed PubMed Central Google Scholar
Young, R. S. et al. Identification and properties of 1,119 candidate lincRNA loci in the Drosophila melanogaster genome. Genome Biol. Evol. 4, 427–442 (2012)
Article CAS PubMed PubMed Central Google Scholar
Kondo, T. et al. Small peptide regulators of actin-based cell morphogenesis encoded by a polycistronic mRNA. Nature Cell Biol. 9, 660–665 (2007)
Article CAS PubMed Google Scholar
Katayama, S. et al. Antisense transcription in the mammalian transcriptome. Science 309, 1564–1566 (2005)
Article ADS PubMed Google Scholar
Derrien, T. et al. The GENCODE v7 catalog of human long noncoding RNAs: analysis of their gene structure, evolution, and expression. Genome Res. 22, 1775–1789 (2012)
Article CAS PubMed PubMed Central Google Scholar
Duncan, D. M., Burgess, E. A. & Duncan, I. Control of distal antennal identity and tarsal development in Drosophila by spineless-aristapedia, a homolog of the mammalian dioxin receptor. Genes Dev. 12, 1290–1303 (1998)
Article CAS PubMed PubMed Central Google Scholar
Schwartz, C., Locke, J., Nishida, C. & Kornberg, T. B. Analysis of cubitus interruptus regulation in Drosophila embryos and imaginal disks. Development 121, 1625–1635 (1995)
CAS PubMed Google Scholar
Misra, S. et al. Annotation of the Drosophila melanogaster euchromatic genome: a systematic review. Genome Biology 3, research0083 (2002)
Article PubMed PubMed Central Google Scholar
Lipovich, L. et al. Activity-dependent human brain coding/noncoding gene regulatory networks. Genetics 192, 1133–1148 (2012)
Article CAS PubMed PubMed Central Google Scholar
Okamura, K. & Lai, E. C. Endogenous small interfering RNAs in animals. Nature Rev. Mol. Cell Biol. 9, 673–678 (2008)
Article CAS Google Scholar
Okamura, K., Balla, S., Martin, R., Liu, N. & Lai, E. C. Two distinct mechanisms generate endogenous siRNAs from bidirectional transcription in Drosophila melanogaster. Nature Struct. Mol. Biol. 15, 581–590 (2008)
Article CAS Google Scholar
Czech, B. et al. An endogenous small interfering RNA pathway in Drosophila. Nature 453, 798–802 (2008)
Article CAS ADS PubMed PubMed Central Google Scholar
Ghildiyal, M. et al. Endogenous siRNAs derived from transposons and mRNAs in Drosophila somatic cells. Science 320, 1077–1081 (2008)
Article CAS ADS PubMed PubMed Central Google Scholar
Engström, P. G. et al. Complex loci in human and mouse genomes. PLoS Genet. 2, e47 (2006)
Article PubMed PubMed Central Google Scholar
Whitehead, A. et al. Genomic and physiological footprint of the Deepwater Horizon oil spill on resident marsh fishes. Proc. Natl Acad. Sci. USA 109, 20298–20302 (2012)
Article CAS ADS PubMed Google Scholar
Ingolia, N. T., Ghaemmaghami, S., Newman, J. R. & Weissman, J. S. Genome-wide analysis in vivo of translation with nucleotide resolution using ribosome profiling. Science 324, 218–223 (2009)
Article CAS ADS PubMed PubMed Central Google Scholar
Bánfai, B. et al. Long noncoding RNAs are rarely translated in two human cell lines. Genome Res. 22, 1646–1657 (2012)
Article PubMed PubMed Central Google Scholar
Guttman, M., Russell, P., Ingolia, N. T., Weissman, J. S. & Lander, E. S. Ribosome profiling provides evidence that large noncoding RNAs do not encode proteins. Cell 154, 240–251 (2013)
Article CAS PubMed PubMed Central Google Scholar
Huelga, S. C. et al. Integrative genome-wide analysis reveals cooperative regulation of alternative splicing by hnRNP proteins. Cell Rep. 1, 167–178 (2012)
Article CAS PubMed PubMed Central Google Scholar
Takahashi, H., Lassmann, T., Murata, M. & Carninci, P. 5′ end-centered expression profiling using cap-analysis gene expression and next-generation sequencing. Nature Protocols 7, 542–561 (2012)
Article CAS PubMed PubMed Central Google Scholar
Bickel, P. J., Boley, N., Brown, J. B., Huang, H. & Zhang, N. R. Subsampling methods for genomic inference. Ann. Appl. Stat. 4, 1660–1697 (2010)
Article MathSciNet Google Scholar
Katz, Y., Wang, E. T., Airoldi, E. M. & Burge, C. B. Analysis and design of RNA sequencing experiments for identifying isoform regulation. Nature Methods 7, 1009–1015 (2010)
Article CAS PubMed PubMed Central Google Scholar
Anders, S. & Huber, W. Differential expression analysis for sequence count data. Genome Biol. 11, R106 (2010)
CAS PubMed PubMed Central Google Scholar
Yepiskoposyan, H. et al. Transcriptome response to heavy metal stress in Drosophila reveals a new zinc transporter that confers resistance to zinc. Nucleic Acids Res. 34, 4866–4877 (2006)
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We thank the members of the modENCODE transcription consortium, especially J. Landolin and J. Sandler for their early contributions to these studies. We also thank A. Kundaje and H. Huang for helpful discussions. This work was funded by a contract from the National Human Genome Research Institute modENCODE Project, contract U01 HG004271 and U54 HG006944, to S.E.C. (principal investigator) and P.C., T.R.G., R.A.H. and B.R.G. (co-principal investigators) with additional support from R01 GM076655 (S.E.C.) both under Department of Energy contract no. DE-AC02-05CH11231. J.B.B.’s work was supported by NHGRI K99 HG006698. Work in P.J.B.’s group was supported by the modENCODE DAC sub-award 5710003102, 1U01HG007031-01 and the ENCODE DAC 5U01HG004695-04. Work in Bloomington was supported in part by the Indiana METACyt Initiative of Indiana University, funded by an award from the Lilly Endowment. Work in E.C.L.’s group was supported by U01-HG004261 and RC2-HG005639.

Author information

James B. Brown, Nathan Boley, Robert Eisman, Gemma E. May and Marcus H. Stoiber: These authors contributed equally to this work.

Authors and Affiliations

Department of Statistics, University of California Berkeley, Berkeley, \94720, California, USA
James B. Brown, Nathan Boley, Marcus H. Stoiber, Garret Robinson, Juan Hernandez & Peter J. Bickel
Department of Genome Dynamics, Lawrence Berkeley National Laboratory, Berkeley, 94720, California, USA
James B. Brown, Ben W. Booth, Soo Park, Kenneth H. Wan, Charles Yu, Joseph W. Carlson, Erwin Frise, Ann S. Hammonds, Richard Weiszmann, Roger A. Hoskins & Susan E. Celniker
Department of Biology, Indiana University, 1001 East 3rd Street, Bloomington, Indiana 47405, USA,
Robert Eisman, Lucy Cherbas, Brian D. Eads, David Miller, Keithanne Mockaitis, Justen Andrews, Peter Cherbas & Thomas C. Kaufman
Department of Genetics and Developmental Biology, Institute for Systems Genomics, University of Connecticut Health Center, 400 Farmington Avenue, Farmington, Connecticut 06030, USA,
Gemma E. May, Michael O. Duff, Sara Olson & Brenton R. Graveley
Sloan-Kettering Institute, 1017C Rockefeller Research Labs, 1275 York Avenue, Box 252, New York, New York 10065, USA,
Jiayu Wen, Sol Shenker & Eric C. Lai
RIKEN Omics Science Center, Yokohama, Kanagawa 230-0045, Japan,
Ana Maria Suzuki & Piero Carninci
Division of Genomic Technologies, RIKEN Center for Life Science Technologies, Yokohama, Kanagawa, 230-0045, Japan,
Ana Maria Suzuki & Piero Carninci
Center for Genomics and Bioinformatics, Indiana University, 1001 East 3rd Street, Bloomington, Indiana 47405, USA,
Dayu Zhang, Johnny Roberts & Peter Cherbas
Cold Spring Harbor Laboratory, Cold Spring Harbor, 11724, New York, USA
Carrie A. Davis & Thomas R. Gingeras
Section of Developmental Genomics, Laboratory of Cellular and Developmental Biology, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, 20892, Maryland, USA
David Sturgill & Brian Oliver
Department of Genetics, Harvard Medical School, 77 Avenue Louis Pasteur, Boston, Massachusetts 02115, USA,
Anastasia A. Samsonova & Norbert Perrimon
Howard Hughes Medical Institute, Harvard Medical School, 77 Avenue Louis Pasteur, Boston, Massachusetts 02115, USA,
Anastasia A. Samsonova & Norbert Perrimon

Authors

James B. Brown
View author publications
You can also search for this author in PubMed Google Scholar
Nathan Boley
View author publications
You can also search for this author in PubMed Google Scholar
Robert Eisman
View author publications
You can also search for this author in PubMed Google Scholar
Gemma E. May
View author publications
You can also search for this author in PubMed Google Scholar
Marcus H. Stoiber
View author publications
You can also search for this author in PubMed Google Scholar
Michael O. Duff
View author publications
You can also search for this author in PubMed Google Scholar
Ben W. Booth
View author publications
You can also search for this author in PubMed Google Scholar
Jiayu Wen
View author publications
You can also search for this author in PubMed Google Scholar
Soo Park
View author publications
You can also search for this author in PubMed Google Scholar
Ana Maria Suzuki
View author publications
You can also search for this author in PubMed Google Scholar
Kenneth H. Wan
View author publications
You can also search for this author in PubMed Google Scholar
Charles Yu
View author publications
You can also search for this author in PubMed Google Scholar
Dayu Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Joseph W. Carlson
View author publications
You can also search for this author in PubMed Google Scholar
Lucy Cherbas
View author publications
You can also search for this author in PubMed Google Scholar
Brian D. Eads
View author publications
You can also search for this author in PubMed Google Scholar
David Miller
View author publications
You can also search for this author in PubMed Google Scholar
Keithanne Mockaitis
View author publications
You can also search for this author in PubMed Google Scholar
Johnny Roberts
View author publications
You can also search for this author in PubMed Google Scholar
Carrie A. Davis
View author publications
You can also search for this author in PubMed Google Scholar
Erwin Frise
View author publications
You can also search for this author in PubMed Google Scholar
Ann S. Hammonds
View author publications
You can also search for this author in PubMed Google Scholar
Sara Olson
View author publications
You can also search for this author in PubMed Google Scholar
Sol Shenker
View author publications
You can also search for this author in PubMed Google Scholar
David Sturgill
View author publications
You can also search for this author in PubMed Google Scholar
Anastasia A. Samsonova
View author publications
You can also search for this author in PubMed Google Scholar
Richard Weiszmann
View author publications
You can also search for this author in PubMed Google Scholar
Garret Robinson
View author publications
You can also search for this author in PubMed Google Scholar
Juan Hernandez
View author publications
You can also search for this author in PubMed Google Scholar
Justen Andrews
View author publications
You can also search for this author in PubMed Google Scholar
Peter J. Bickel
View author publications
You can also search for this author in PubMed Google Scholar
Piero Carninci
View author publications
You can also search for this author in PubMed Google Scholar
Peter Cherbas
View author publications
You can also search for this author in PubMed Google Scholar
Thomas R. Gingeras
View author publications
You can also search for this author in PubMed Google Scholar
Roger A. Hoskins
View author publications
You can also search for this author in PubMed Google Scholar
Thomas C. Kaufman
View author publications
You can also search for this author in PubMed Google Scholar
Eric C. Lai
View author publications
You can also search for this author in PubMed Google Scholar
Brian Oliver
View author publications
You can also search for this author in PubMed Google Scholar
Norbert Perrimon
View author publications
You can also search for this author in PubMed Google Scholar
Brenton R. Graveley
View author publications
You can also search for this author in PubMed Google Scholar
Susan E. Celniker
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

J.A., T.R.G., B.R.G., R.A.H., T.C.K. and S.E.C. designed the project. J.A., P.Ch., T.R.G., B.R.G., R.A.H., J.B.B., B.O. and S.E.C. managed the project. R.E. designed treatment protocols and prepared biological samples. T.C.K., J.A. and L.C. oversaw biological sample production. B.D.E., D.M. and J.R. prepared biological samples. D.Z. and B.E. prepared RNA samples. J.A. oversaw RNA sample production. G.E.M., S.O. and L.Y. prepared Illumina RNA-seq libraries. A.M.S. prepared CAGE libraries. P.Ca. oversaw production of CAGE libraries. C.A.D., G.E.M., S.O., L.Y., S.P. and K.H.W. performed Illumina sequencing. B.R.G. and S.E.C. managed Illumina sequencing production. R.A.H. conceived the poly(A)⁺seq method. R.W. and R.A.H. developed the poly(A)⁺seq protocol and produced the libraries. K.M. performed 454 sequencing. C.Y., S.P. and K.H.W. performed cDNA library screens and full-insert cDNA sequencing. S.E.C. oversaw cDNA production. E.F. and N.B. installed and administered computer infrastructure for data storage and analysis. J.B.B., N.B., M.H.S., M.O.D., B.W.B., D.S., J.W.C., S.S., J.W., A.A.S., N.P., E.C.L., P.J.B. and B.R.G. developed analysis methods. J.B.B., N.B., M.H.S., M.O.D., B.W.B., A.S.H., E.F., R.A.H., S.S., D.S., L.C., G.R., J.H., J.W., A.A.S., E.C.L., K.H.W., B.R.G. and S.E.C. analysed data. N.B., J.B.B. M.H.S., K.H.W. and S.E.C. generated annotations. D.S. and B.O. analysed species validation data. S.S., J.W. and E.C.L. analysed 3′ UTR and antisense data. A.S.H., E.F. and S.E.C. analysed image data. M.H.S. analysed proteomics data. M.H.S., S.S., D.S., B.O., E.C.L., T.C.K., R.E., R.A.H. and P.Ch. contributed to the text. A.S.H. assisted with manuscript preparation. J.B.B., B.R.G. and S.E.C. wrote the paper with input from all authors. All authors discussed the results and commented on the manuscript.

Corresponding authors

Correspondence to James B. Brown, Brenton R. Graveley or Susan E. Celniker.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Supplementary information

Supplementary Information

This file contains Supplementary Text and References, a guide to the Supplementary Tables and Supplementary Data files and Supplementary Figures 1-11 – see the contents page for details. (PDF 2385 kb)

Supplementary Tables

This zipped file contains Supplementary Tables 1-10 - see Supplementary Information document p.13 for more details. (ZIP 2098 kb)

Supplementary Data

This zipped file contains Supplementary Data sets 1-6 - see Supplementary Information document p.13 for more details. (ZIP 19021 kb)

Supplementary Data

This zipped file contains Supplementary Data sets 7-9 - see Supplementary Information document p.13 for more details. (ZIP 28618 kb)

PowerPoint slides

PowerPoint slide for Fig. 1

PowerPoint slide for Fig. 2

PowerPoint slide for Fig. 3

PowerPoint slide for Fig. 4

PowerPoint slide for Fig. 5

PowerPoint slide for Fig. 6

Rights and permissions

This work is licensed under a Creative Commons Attribution-Non-Commercial-ShareAlike 3.0 Unported licence. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-sa/3.0/.

Reprints and permissions

About this article

Cite this article

Brown, J., Boley, N., Eisman, R. et al. Diversity and dynamics of the Drosophila transcriptome. Nature 512, 393–399 (2014). https://doi.org/10.1038/nature12962

Download citation

Received: 20 April 2013
Accepted: 18 December 2013
Published: 16 March 2014
Issue Date: 28 August 2014
DOI: https://doi.org/10.1038/nature12962

This article is cited by

Genome-wide association in Drosophila identifies a role for Piezo and Proc-R in sleep latency
- Matthew N. Eiman
- Shailesh Kumar
- Susan T. Harbison
Scientific Reports (2024)
Slik maintains tissue homeostasis by preventing JNK-mediated apoptosis
- Chenglin Li
- Xiaojie Zhu
- Lei Xue
Cell Division (2023)
Mapping splice QTLs reveals distinct transcriptional and post-transcriptional regulatory variation of gene expression and identifies putative alternative splicing variation mediating complex trait variation in pigs
- Fei Zhang
- Deborah Velez-Irizarry
- Wen Huang
BMC Genomics (2023)
Differential adaptive RNA editing signals between insects and plants revealed by a new measurement termed haplotype diversity
- Yuange Duan
- Ye Xu
- Hu Li
Biology Direct (2023)
Comprehensive mapping of exon junction complex binding sites reveals universal EJC deposition in Drosophila
- Lucía Morillo
- Toni Paternina
- Hervé Le Hir
BMC Biology (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.