Sequences encoding C2H2 zinc fingers inhibit polyadenylation and mRNA export in human cells

Russo, Joseph; Jalkanen, Aimee L.; Heck, Adam M.; Schmidt, Caleb M.; Wilusz, Jeffrey; Wilusz, Carol J.

doi:10.1038/s41598-018-35138-4

Download PDF

Article
Open access
Published: 19 November 2018

Sequences encoding C2H2 zinc fingers inhibit polyadenylation and mRNA export in human cells

Joseph Russo¹^na1,
Aimee L. Jalkanen²^na1,
Adam M. Heck²,
Caleb M. Schmidt¹,
Jeffrey Wilusz^1,2 &
…
Carol J. Wilusz ORCID: orcid.org/0000-0001-9939-0747^1,2

Scientific Reports volume 8, Article number: 16995 (2018) Cite this article

2310 Accesses
3 Citations
Metrics details

Subjects

Abstract

The large C2H2-Zinc Finger (C2H2-ZNF) gene family has rapidly expanded in primates through gene duplication. There is consequently considerable sequence homology between family members at both the nucleotide and amino acid level, allowing for coordinated regulation and shared functions. Here we show that multiple C2H2-ZNF mRNAs experience differential polyadenylation resulting in populations with short and long poly(A) tails. Furthermore, a significant proportion of C2H2-ZNF mRNAs are retained in the nucleus. Intriguingly, both short poly(A) tails and nuclear retention can be specified by the repeated elements that encode zinc finger motifs. These Zinc finger Coding Regions (ZCRs) appear to restrict polyadenylation of nascent RNAs and at the same time impede their export. However, the polyadenylation process is not necessary for nuclear retention of ZNF mRNAs. We propose that inefficient polyadenylation and export may allow C2H2-ZNF mRNAs to moonlight as non-coding RNAs or to be stored for later use.

Improving prime editing with an endogenous small RNA-binding protein

Article Open access 03 April 2024

Nuclear mRNA decay: regulatory networks that control gene expression

Article 18 April 2024

Endogenous aldehyde-induced DNA–protein crosslinks are resolved by transcription-coupled repair

Article Open access 10 April 2024

Introduction

Zinc-finger (ZNF)-containing proteins are among the largest families of proteins in the human genome. The Cys2His2 (C2H2)-type ZNF family has over 700 members, many of which are unique to primates and have arisen through gene duplication^1,2. The vast majority of C2H2-ZNF proteins have no unique biological function assigned to them; however, they are widely accepted as DNA binding proteins and those with an associated KRAB domain can recruit the transcriptional repressor KAP1/TRIM28³. Recent studies have suggested that expansion of this family in primates occurred in part to combat the parallel expansion and evolution of transposable elements^4,5,6,7. Notably, the human C2H2-ZNF genes have on average ~10 tandem copies of the ZNF motif², which is sufficient to recognize a 30nt sequence, far longer than necessary for specificity⁸. Indeed, only ~60% of ZNF motifs were predicted to bind DNA in a recent analysis⁹. This gives rise to the possibility that some ZNF domains, or perhaps even the sequences encoding them, have other roles.

In support of the zinc finger motif having additional functions, many C2H2-ZNFs were recently identified as putative RNA-binding proteins (RBPs)¹⁰ and the C2H2-ZNF motif has also been implicated in protein-protein interactions^11,12. In addition, the linker domains between individual ZNFs are targeted by the TOPK/PBK kinase during mitosis to induce dissociation of these proteins from condensing chromatin^13,14. Thus, these highly conserved repeated protein motifs may indeed serve functions other than binding DNA.

It is also possible that the DNA or RNA sequences encoding C2H2-ZNF domains have an additional, non-coding function. Sequence elements shared between different genes can facilitate coordinated control through recruitment of regulatory factors (such as RBPs and miRNAs). Several lines of evidence suggest that C2H2-ZNF genes share regulatory elements that allow their expression to be coordinated. First, four miRNAs can target the repeated regions encoding zinc fingers found in related C2H2-ZNF mRNAs^15,16. These miRNAs can simultaneously downregulate multiple C2H2-ZNF genes through translational repression and/or mRNA decay^15,16. Second, C2H2-ZNF mRNAs are over-represented among transcripts with shorter than average poly(A) tails which could indicate unique processing or deadenylation mechanisms are at play¹⁷. Third, a large group of C2H2-ZNF transcripts are decayed more slowly in stem cells than in fully differentiated fibroblasts¹⁸. This could be facilitated by recognition of shared sequence elements by a stem cell or fibroblast-specific RBP or miRNA. Fourth, mRNAs encoding KRAB C2H2-ZNFs share similar RNA metabolism profiles including efficient synthesis and processing, and a tendency to be retained in the nucleus¹⁹. Next, the 3′ exons of C2H2-ZNF genes have unusual chromatin modification (H3me3K9) suggesting that they may auto-regulate through recruitment of the KAP1/TRIM28 transcriptional repressor^20,21. Interestingly the level of H3me3K9 modification is strongly correlated with the number of ZNF motifs in the 3′ exon²¹. Finally, C2H2-ZNF genes are significantly over-represented among genes that have acquired Alu insertions²² which again may facilitate coordinated control either at the level of transcription or through ADAR-editing²³.

Based on the evidence outlined above, we hypothesized that sequence elements found within the ORF or 3′UTR of C2H2-ZNF mRNAs are responsible for aspects of their unique metabolism. We focused specifically on the observations made through global analyses that many C2H2-ZNF mRNAs have significant populations with shorter than average poly(A) tails¹⁷ and that they have an increased propensity to be retained in the nucleus¹⁹. In mammalian cells, a poly(A) tail of ~250 nt is added to the 3′ end of the majority of nascent mRNAs²⁴ and influences downstream metabolism. Failure to polyadenylate a nascent transcript limits its ability to form a RiboNucleoprotein Complex (RNP) that is competent for export and leads to rapid decay²⁵. Once a polyadenylated transcript has been exported, the poly(A) tail, in complex with PABPC1, potentiates translation by helping to recruit initiation factors. Over time, the poly(A) tail can be shortened by various deadenylases²⁶. When the tail is shortened to such an extent that PABPC1 can no longer bind, rapid decay of the mRNA body generally ensues²⁷. Thus at steady state, most mRNAs exist as a population with poly(A) tails varying in length from ~20 to ~250 nt^28,29. Exceptions to this are the histone mRNAs³⁰, transcripts with poly(A) limiting elements (PLEs)^31,32 and mRNAs bearing cytoplasmic polyadenylation elements (CPEs)³³ which can persist with short or no poly(A) tail. Here, we show that repetitive sequence elements encoding C2H2-ZNF motifs (Zinc finger Coding Regions or ZCRs) act as poly(A) limiting elements, resulting in a sub-population of mRNAs that have short poly(A) tails. In addition, we provide evidence that C2H2-ZNF mRNAs are retained in the nucleus through mechanisms that do not require polyadenylation.

Results

The poly(A) tails of C2H2-ZNF mRNAs are bimorphic

Most mRNAs can be effectively captured by hybridization of the poly(A) tail to oligo(dT)³⁴. In contrast, histone mRNAs and many non-coding RNAs (rRNAs, tRNAs etc) fail to bind to oligo(dT) because they lack a poly(A) tail. Yang et al.¹⁷ identified a group of mRNAs in human H9 embryonic stem cells and HeLa cells that are “bimorphic”, in that a significant portion of the population fails to bind oligo(dT). Among these are many C2H2-ZNF mRNAs.

We evaluated six C2H2-ZNF mRNAs with different properties (numbers of ZNF motifs, +/−KRAB effector domain, chromosomal locations, etc), some of which were previously identified as bimorphic¹⁷ and most of which share similar metabolism profiles (“biotypes”) as classified by Mukherjee et al.¹⁹ who evaluated various properties of mRNAs including nuclear/cytoplasmic distribution, synthesis, processing, translation and decay rates (Table 1).

Table 1 Properties of C2H2-ZNF genes analyzed in this study.

Full size table

First, we assessed whether the selected transcripts exhibited bimorphic behavior with respect to binding oligo(dT). Total RNA isolated from HeLa cells was fractionated using oligo(dT) magnetic beads and the proportion of each mRNA in each fraction was quantified by qRT-PCR. Control transcripts with poly(A) tails of known length were spiked in and evaluated in the same way (Fig. S1). As shown in Fig. 1A, PPIA and GAPDH, abundant polyadenylated mRNAs, were found almost exclusively in the bound fraction. However, all six of the C2H2-ZNF mRNAs had significant populations that failed to bind oligo(dT). Interestingly, this phenomenon is also observed in primary fibroblasts and in induced pluripotent stem cells (Fig. S3) suggesting it is not unique to cancer cells.

C2H2-ZNF mRNAs can persist without a poly(A) tail

Failure of a fraction of mRNAs to bind oligo(dT) could either reflect a heterogeneous mixture of mRNAs with long and short poly(A) tails, or there may be a homogenous population having a poly(A) tail of an intermediate length (~12–25 nt) that binds inefficiently to oligo(dT). In order to distinguish these possibilities we used a qualitative RT-PCR-based assay³⁵ to evaluate the poly(A) tail length of GAPDH, PPIA, ZNF12 and ZNF627 mRNAs in the oligo(dT) bound and flow-through fractions (Fig. 1B). The bound GAPDH and PPIA transcripts had a range of poly(A) tail lengths between ~30–250 nt in length, while those found in the unbound fraction all had tails of ~27 nt (Fig. 1C). This is consistent with observations that PABPC1 protects around 23–27 adenosine residues and that most mRNAs are rapidly degraded once the poly(A) tail is shortened to the point where PABPC1 can no longer associate. Thus, no transcripts completely lacking a poly(A) tail are detected^36,37,38. Both ZNF12 and ZNF627 also have a range of tail lengths in the bound fraction, supporting that at least some of each population undergoes normal polyadenylation. However, in the major fraction that does not bind oligo(dT) these mRNAs exhibit strikingly little adenylation. The transcripts detected have very short, if any, poly(A) tails (Fig. 1B,C). This suggests that ZNF12 and ZNF627 mRNAs are relatively stable even when they lack the ability to associate with poly(A) binding proteins. Based on the results shown in Fig. 1A–C, we conclude that at least half the mRNA population for ZNF12 and ZNF627 has a poly(A) tail of less than 10 nt.

C2H2-ZNF mRNAs are retained in the nucleus

Polyadenylation is required for export of mRNAs, as well as influencing other aspects of mRNA metabolism³⁹. Moreover, KRAB-domain ZNF mRNAs were enriched among mRNAs with a profile that includes increased nuclear retention^19,40. We therefore wondered whether these ZNF transcripts with short poly(A) tails were also retained in the nucleus. We isolated nuclear and cytoplasmic RNA populations and quantified the level of ZNF and control RNAs in each fraction by qRT-PCR (Fig. 2A). Interestingly, while GAPDH and PPIA mRNAs are primarily cytoplasmic, all the ZNF transcripts had significantly larger populations retained in the nucleus. There is a clear correlation between the extent of polyadenylation and the proportion of the population exported to the cytoplasm (Fig. 2B). Nuclear retention of ZNF transcripts was also observed in fibroblasts and iPS cells (Fig. S4).

In order to address directly whether the nuclear retained transcripts also have short poly(A) tails we performed a two-step fractionation, in which we separated the nucleus and cytoplasm, isolated total RNA from each fraction and finally bound the RNA to oligo(dT) to separate polyadenylated RNAs. We quantified the amount of mRNA in each fraction by qRT-PCR as before. The results shown in Fig. S5 suggest that for five of the six ZNF mRNA the nuclear population generally exhibits slightly less polyadenylation than the cytoplasmic population. This difference is most obvious for ZNF43 and ZNF134 mRNAs where the cytoplasmic population is polyadenylated to a similar extent as GAPDH mRNA while the nuclear population is significantly less polyadenylated. This observation is consistent with the two phenotypes, nuclear retention and short poly(A) tails, being connected. However, as polyadenylated ZNF mRNAs are present in the nucleus and unadenylated ZNF mRNAs are detected in the cytoplasm we cannot draw any definitive conclusions.

Sequence elements encoding C2H2 zinc fingers influence polyadenylation and export

In order to further characterize the mechanisms responsible for restricting the length of the poly(A) tail and retaining these transcripts in the nucleus, we created a reporter constructs in which the ORF and 3′UTR of the ZNF12 mRNA were fused to Renilla Luciferase with an upstream intron and expressed under a tetracycline responsive promoter (Fig. 3A). The same promoter was used to drive expression of ZsGreen in the opposite direction as an internal control for transfection efficiency. As a negative control, we created a similar reporter with the ORF and 3′UTR of PPIA. The ZsGreen mRNA behaved as expected and was primarily polyadenylated (Fig. 3B). Both the PPIA and ZNF12 reporter RNAs also behaved similarly to their endogenous counterparts despite having heterologous 3′ end formation signals (derived from the SV40 late poly(A) signal).

The PPIA reporter was primarily polyadenylated and cytoplasmic, while the ZNF12 reporter largely failed to bind oligo(dT) and was present mainly in the nuclear fraction (Fig. 3B,C). We also evaluated the poly(A) tail of the ZNF12 reporter via RNAseH northern blotting and found that, like the endogenous ZNF12 transcript (Fig. 1B), the vast majority of mRNA produced lacks poly(A) (Fig. 3D, Supplementary Fig. S6).Taken together these results suggest that sequence elements present in the 3′UTR and/or ORF of ZNF12 can confer nuclear retention, and restrict the length of the poly(A) tail. Conversely, the 5′UTR, the poly(A) signal and downstream sequences are not involved.

We next utilized RNA-FISH to assess the subcellular localization of each reporter. As expected, the PPIA reporter was primarily cytoplasmic with minimal nuclear accumulation. In contrast, the ZNF12 reporter was distributed throughout the cell with significant nuclear staining. This pattern is consistent with the distribution of the reporter and endogenous ZNF12 between nuclear and cytoplasmic fractions and supports that sequences in the ORF and/or 3′UTR of ZNF12 can influence the localization of the reporter transcript. Interestingly, most cells exhibited accumulation of the reporter RNA in multiple nuclear foci (Fig. 3E).

In order to narrow down the region of the ZNF12 mRNA required for nuclear retention and poly(A) restriction, we cloned the ZNF12 ORF and 3′UTR separately into the RLuc reporter (Fig. 4A). Surprisingly, both of these constructs behaved similarly to the full-length reporter. These reporter mRNAs were predominantly nuclear and had a large population that failed to bind oligo(dT). The simplest explanation is that there are multiple redundant sequence elements in the 3′UTR and ORF that can influence poly(A) status and localization.

We hypothesized that perhaps the ZNF12 3′UTR might contain sequence elements encoding remnants of C2H2-ZNF motifs that were inactivated during gene duplication or other evolutionary events. This is a relatively common occurrence for C2H2-ZNF genes⁴¹. By aligning the ORF and 3′UTR to each other, we were able to identify ZNF-related sequence elements at the 5′ end of the 3′UTR as well as additional inactivated ZNF-related motifs within the 5′ half of the ORF closer to the KRAB domain (Fig. S7). These degenerate ZNF sequences are depicted as open diamonds in Fig. 4A. Thus, every construct tested to this point bears multiple sequence elements related to those encoding C2H2-ZNFs. This was strong evidence indeed that sequences encoding C2H2-ZNFs are responsible for the effects on polyadenylation and localization. We made three additional ORF constructs; one containing just the KRAB domain and no C2H2-ZNF sequences (KRAB), a second containing the degenerate ZNF motifs within the 5′ end of the ORF (degZNF) and the third containing the remaining canonical ZNF motifs (canZNF). We also made a construct bearing just the region of the 3′UTR that contains sequences that may have encoded functional C2H2-ZNF motifs in the past (utrZNF). As predicted, the KRAB only reporter behaved much like the PPIA control; the transcript was polyadenylated and exported to the cytoplasm. In contrast, all the reporters bearing ZNF-like sequences (degZNF, canZNF, utrZNF) behaved more like the full length reporter in that these mRNAs were predominantly nuclear and did not efficiently bind oligo(dT). There was a clear positive correlation between the level of polyadenylation and the cytoplasmic accumulation of the ZNF12 reporter constructs (Fig. S7B). We conclude that the sequences encoding ZNF motifs can influence poly(A) tail length and export from the nucleus.

C-rich sequence elements were recently identified as nuclear retention signals in several long non-coding RNAs^42,43. HnRNP K was implicated as a factor recognizing these signals and mediating their effects⁴³,which is notable because hnRNP K has also been linked with reduced polyadenylation of the lncRNA NEAT1-1⁴⁴ and more generally with 3′ end processing and transcription termination⁴⁵. It seemed conceivable that hnRNP K might be responsible for both the short poly(A) tail on ZNF mRNAs and their nuclear retention. There are short C-rich stretches in the Zinc Finger Coding (ZFC) regions that could possibly recruit hnRNP K (Fig. S6) but we could find no evidence in existing eCLIP datasets to suggest that hnRNP K associates with the ZFC regions of our ZNF mRNAs⁴⁶. Nevertheless, as the connections between hnRNP K and both polyadenylation and nuclear retention were intriguing, we evaluated the effect of depleting hnRNP K on ZNF mRNA localization (Fig. S8). Unfortunately, despite efficient (>70%) knockdown of hnRNP K at both the mRNA and protein level (Fig. S7A,B), there was no discernable effect on ZNF mRNA localization or that of another nuclear-retained mRNA, MLXPIL, which was reported previously to undergo export following hnRNP K knockdown (Fig. S8D)⁴³. We found that PTOV1-AS1, a lncRNA whose accumulation is dependent on hnRNP K⁴⁷, was down-regulated (Fig. S7C) supporting that hnRNP K was sufficiently depleted to have a biological impact. Based on our results, and a dearth of evidence for association of hnRNP K with ZNF mRNAs, we conclude that hnRNP K is not likely to be a major component of the pathway responsible for nuclear retention of ZNF mRNAs.

C2H2-ZNF mRNAs fail to experience polyadenylation

Thus far, we have provided compelling evidence that the ~84 nt sequence that encodes ZNF motifs can restrict polyadenylation and lead to retention of ZNF mRNAs in the nucleus. This function is apparently independent of the ZNF12 poly(A) signal itself, as the majority of our constructs contain the SV40 late poly(A) signal. As the sequences coding for ZNF motifs are targeted by miRNAs^15,16 we hypothesized that perhaps this interaction was contributing to the unusual metabolism of these mRNAs. If this were the case, then depletion of DICER (the enzyme that processes miRNAs in the cytoplasm) and the consequent reduction in miRNA levels should allow polyadenylation and export to occur. Despite strong evidence for expression of ZNF-targeting miRNAs in HeLa cells⁴⁸ (Fig. S9), depleting DICER had no discernable effect on abundance, or polyadenylation of the ZNF transcripts (Fig. 5) although it did very effectively induce expression of LIN28A, a transcript repressed by the let-7 miRNA⁴⁹. We conclude that interaction of miRNAs with the sequences encoding ZNF motifs does not influence poly(A) tail length of the ZNF mRNAs. This is consistent with the fact that the 3′UTR and utrZNF reporter mRNAs, which lack binding sites for ZNF-targeting miRNAs, still have short poly(A) tails and are retained in the nucleus (Fig. 4).

To determine whether ZNF mRNAs with very short tails fail to get polyadenylated immediately following 3′ end cleavage, or instead get polyadenylated efficiently and then deadenylated at a later time, we investigated the length of tail on nascent mRNAs. HeLa cells were treated with 4-thiouridine (4sU) for 10 minutes and then total RNA was isolated^50,51. 4sU-labeled nascent RNAs were conjugated to biotin and isolated by binding to streptavidin beads. The eluted RNA was then bound to oligo(dT) beads as before, and ZNF mRNAs were assessed in each fraction by qRT-PCR. As shown in Fig. 5D, a large fraction (~50% or more) of the nascent ZNF transcripts, but not the PPIA and GAPDH control mRNAs, fail to bind oligo(dT) suggesting that the polyadenylation process itself may be ineffective for these mRNAs. It seems that this is a stochastic phenomenon as the remainder of the population binds oligo(dT), suggesting it is polyadenylated normally.

We wondered whether the polyadenylation machinery, or factors recruited following a failure to polyadenylate, might be responsible for nuclear retention of the ZNF transcripts. To investigate this idea we wanted to evaluate reporters with 3′ ends made independent of the polyadenylation machinery. The MALAT1 non-coding RNA 3′ end is formed through cleavage by RNAse P without involvement of the canonical polyadenylation machinery⁵². Heterologous transcripts bearing the MALAT1 3′ end signal are neither adenylated nor retained in the nucleus⁵³. When we replaced the 3′ end of the full length ZNF12 and PPIA reporters with the MALAT1 3′ end signal, the PPIA reporter transcript was still efficiently exported but the ZNF12 reporter mRNA was retained in the nucleus (Fig. 5E) to a similar extent as the reporter bearing the SV40 poly(A) signal (Fig. 3C). Therefore, nuclear retention of the ZNF12 transcript is not due to aberrant or failed polyadenylation, although we cannot at this point rule out that the lack of a poly(A) tail is significant.

Discussion

In the experiments described above, we have shown that C2H2-ZNF mRNAs can be synthesized as two populations; with long/normal and very short poly(A) tails. In addition, these mRNAs exhibit a nuclear distribution consistent with failure to export, or with very rapid decay upon entry to the cytoplasm. There is a clear correlation between the level of polyadenylation and the efficiency of export for C2H2-ZNF mRNAs and the same sequence elements seem to influence both phenomena. Notably, both failure to polyadenylate and nuclear retention are conferred on a heterologous RNA by sequences capable of encoding C2H2 zinc finger motifs, which we have called Zinc finger Coding Regions or ZCRs. Furthermore, ZNF12 reporter mRNAs that do not experience polyadenylation are also retained in the nucleus supporting that nuclear retention can be separated from the polyadenylation process.

Possible mechanisms for poly(A) tail length restriction

Polyadenylation initiates with recognition of the poly(A) signal (AAUAAA) upstream of the cleavage site and a downstream element (DSE) by Cleavage Polyadenylation Specificity Factor (CPSF) and Cleavage Stimulation Factor (CstF) respectively. Following cleavage, poly(A) polymerase (PAP) begins to add adenosine residues to the 3′ end of the mRNA⁵⁴. Once the poly(A) tail becomes long enough to bind the nuclear poly(A) binding protein (PABPN1), an interaction between PAP, CPSF and PABPN1 causes the polyadenylation reaction to become more processive and the tail is rapidly extended to ~250 nt²⁴. The C2H2-ZNF mRNAs appear to undergo cleavage, but the poly(A) tail is not extended beyond a few nucleotides suggesting that recruitment or activity of PAP may be impaired by factors associating with the ZCR. Alternatively, ZCR-binding factors could prevent recruitment of PABPN1 or other factors connected with poly(A) tail length control such as ZC3H14⁵⁵.

One characterized mechanism to inhibit polyadenylation is through binding of splicing factors that can inhibit PAP close to the poly(A) site^56,57. However, this phenomenon requires binding of the inhibitory factor relatively close to the poly(A) signal while ZCRs are able to function from a distance. In addition, inhibition of polyadenylation by splicing factors generally results in reduced mRNA abundance suggesting that the unadenylated transcripts are degraded but the C2H2-ZNF mRNAs are clearly able to persist despite their short poly(A) tails. ZCRs bear more functional similarity to poly(A) limiting elements (PLE) which can be located in the coding region, within the last exon, and can restrict polyadenylation even in the presence of a heterologous strong poly(A) signal³¹. However, unlike the C2H2-ZNF mRNAs, PLE-containing mRNAs appear to be exported and translated efficiently despite lacking a poly(A) tail³². The mechanism by which PLEs limit polyadenylation is not understood, although the splicing factor U2AF65 associates with PLEs and can modulate their activity⁵⁸. Notably, when we prevented polyadenylation, by replacing the poly(A) signal with the MALAT1 3′ end sequence, the ZNF12 reporter mRNA was still retained in the nucleus suggesting that any trans-acting factors involved in nuclear retention are not recruited by the polyadenylation machinery.

Nuclear retention of RNAs

It appears there are many mechanisms for RNAs to be retained in the nucleus. In some cases, a specific and rather short sequence element (such as the C-rich SIRLOIN element in Alu containing mRNAs⁴³ or the AGCCC element in BORG⁵⁹) is sufficient to prevent export by recruiting a protein factor such as hnRNP K. Another protein able to block RNA export is the ZFC3H1 exosome adaptor which can compete with the export factor ALYREF for binding to the RNA⁶⁰. In the case of lncRNAs like MALAT1 and NEAT1, the minimal region required for nuclear retention is much longer (100 s of nucleotides) and may function by providing structural scaffold for proteins to bind and/or regions for RNA-RNA hybridization⁶¹. It appears specific properties of these lncRNAs allow them to nucleate phase separation and accumulate in subnuclear compartments⁶². Given that the ZNF12 reporter accumulates in foci (Fig. 3E) and the ZFCs have no known nuclear retention sequence in common, they may be functioning in a similar way to lncRNAs.

Nuclear export requires assembly of an export-competent RNP which generally depends on accurate and complete processing of the transcript⁶³. Failure to splice, cap or polyadenylate a nascent mRNA can lead to degradation or a delay in export due to formation of aberrant mRNPs that do not recruit the appropriate export factors. Transcripts that undergo non-canonical processing have alternative fates. For example, mRNAs lacking introns must rely on additional signals for export^64,65. Importantly, the alternative 3′ end processing pathway experienced by MALAT1 results in a transcript with no poly(A) tail, but this is not sufficient on its own to result in nuclear retention: The MALAT1 3′ end cannot specify nuclear retention of the PPIA reporter (Fig. 5E) or of a GFP mRNA⁵³. In order for an RNA bearing the MALAT1 3′ end to be retained in the nucleus, it requires some additional property conferred by a ~600 nt region of the MALAT1 RNA body⁶¹, or as shown here, by ZFCs. Importantly, the MALAT1 nuclear retention element can also function on an RNA bearing a canonical poly(A) signal, although it is not clear whether this transcript actually acquired a poly(A) tail⁶¹. Based on this result we propose that the ZFCs function through a pathway similar to that utilized by MALAT1. In this respect, we note many non-coding RNAs that undergo canonical splicing and polyadenylation, including the Xist⁶⁶, Bsr⁶⁷ and BORG⁵⁹ lncRNAs, are retained in the nucleus through various mechanisms, and in a variety of subnuclear locations⁶⁸. Certain factors binding the ZCR could perhaps reduce recruitment of export factors by partitioning these mRNAs to inaccessible nuclear domains or by preventing interactions between the export machinery and mRNA processing factors.

Alu elements are unlikely to be involved

Transcripts containing inverted repeat (IR) Alu elements are retained in the nucleus, in paraspeckles, due to extensive post-transcriptional A-to-I editing⁶⁹. Alu elements also contain C-rich sequences that drive nuclear localization through binding of hnRNP K⁴³. Although C2H2-ZNF genes are more likely to contain Alu-derived sequences than other transcripts²³, and C2H2-ZNF mRNAs are significantly enriched among a group of 333 genes with 3′UTR IR-Alu elements⁷⁰, this mechanism cannot explain our observations for several reasons. First, only one of the six ZNF mRNAs we studied here (ZNF43) retains IR-Alu sequences in the mature transcript and experiences editing⁷⁰. Second, unlike the C2H2-ZNF mRNAs, the vast majority of Alu-containing mRNAs are polyadenylated¹⁷. Third, there is no evidence for association of hnRNP K with the transcripts we evaluated⁴⁶ and hnRNP K knockdown had no effect on localization of the ZNF mRNAs (Fig. S7). Finally, the nuclear retention of Alu-containing mRNAs is abrogated in embryonic stem cells concomitant with loss of paraspeckles⁷¹; yet the ZNF mRNAs we tested are retained in the nucleus even in pluripotent cells (Fig. S4) and C2H2-ZNF mRNAs were over-represented in the bimorphic population in H9 embryonic stem cells¹⁷.

The nucleus as a storage compartment

The retention of C2H2-ZNF mRNAs within the nucleus presumably prevents them from functioning as mRNAs. Nuclear retention is an established mechanism to regulate gene expression. For example, the CTN non-coding RNA contains IR-Alu repeats that are extensively edited, leading to nuclear retention as described above. However, in response to stress, the CTN transcript is processed to remove the edited region and generate the mature mCAT2 mRNA⁷². The mCAT2 mRNA is efficiently exported and translationally competent. A more widespread pathway for delaying nuclear export is through retention of introns^73,74. Transcripts with one or more unspliced introns can remain in the nucleus until an appropriate stimulus induces completion of splicing and allows formation of an export competent RNP^74,75. Although our reporter transcripts have an intron and undergo splicing, the ZFC element can still retain RNAs within the nucleus. Further investigation is required to determine whether a poly(A) tail or the polyadenylation experience, can overcome this retention. If this were the case, then polyadenylation could perhaps induce rapid export and translation when needed, such as during certain phases of the cell cycle, or in response to stress.

Sequence elements with dual coding and non-coding functions

There are several instances where coding region sequence elements moonlight to regulate mRNA export or other aspects of mRNA metabolism⁷⁶. One particularly relevant example is the Signal Sequence Coding Region (SSCR) found at the 5′ end of transcripts encoding secretory proteins which both encodes signal peptide and enhances RNA export⁷⁷. In this case, the SSCRs role as an mRNA export element does not compete with its role in coding for protein because the two functions are required sequentially. However, in other examples a non-coding function can compete with the coding function of an mRNA. This is the case for the bifunctional Steroid Receptor RNA Activator, SRA, which acts as an RNA scaffold to assemble factors involved in nuclear receptor signaling⁷⁸. An alternatively spliced isoform of the SRA RNA encodes a protein that plays a role in trans-activation of nuclear steroid hormone receptors in the nucleus. The coding isoform is exported to the cytoplasm and translated and thus cannot act as a nuclear scaffold while the non-coding isoform lacks appropriate start codons, fails to be exported and cannot be translated to make protein⁷⁹. It seems possible that the C2H2-ZNF mRNAs have acquired a nuclear non-coding function such as acting as a scaffold or miRNA decoy.

Future experiments will aim at characterizing the ZCRs in more detail, and identifying trans-acting factors that associate with the ZCRs to prevent polyadenylation and/or export. We are also interested in characterizing the nuclear domain these transcripts are accumulating in and evaluating whether cellular conditions such as stress or the cell cycle influence export and/or polyadenylation of these RNAs. Such studies might eventually allow us to modulate the polyadenylation/export of the entire class of C2H2-ZNF bimorphic transcripts and discern the biological impact.

Methods

Plasmid Constructs

In order to generate templates for in vitro transcription of control RNAs with different lengths of poly(A) tail, fragments of the S. cerevisiae ACT1, MET3 and MET25 genes were amplified by RT-PCR using primers shown in Table S1A and cloned between the BamHI and SalI sites of pGEM4 (Promega) plasmids that had adenosine stretches of 15, 45 or 149 adenosines, respectively, inserted 3′ of the multiple cloning site. These plasmids were linearized with PvuII and the empty pGEM4 plasmid was linearized with SmaI (to generate an RNA with no poly(A) tail). Each linearized plasmid was in vitro transcribed using SP6 RNA polymerase (Thermo Fisher Scientific) and the resulting RNA was gel purified before use.

The parent vector for ZNF12 and PPIA expression constructs was generated by cloning the β-globin/IgG chimeric intron from pCI-Neo (Promega) into the ApaI site of pTRE3G-BI-ZsGreen (Clontech), and then inserting Renilla luciferase amplified from pLightSwitch (Switchgear Genomics) into the BglII/NotI sites. The empty parent vector was digested with Not1. Fragments containing ORF and/or 3′UTR sequences from ZNF12 (NM_016265.3) and PPIA (NM_021130.4) were amplified from HeLa cDNA or from previously generated plasmids. The 3′ end of the mouse MALAT1 gene (NR_002847.3) was synthesized as a gBlock (Integrated DNA Technologies) and used to replace the 3′ end of PPIA or ZNF12 in the original clones. Individual constructs were created using In-Fusion HD (Clontech) or NEBuilder HiFi (New England Biolabs) which are ligation independent cloning kits. Primers for the PCR reactions are shown in Table S1A. Fragments were combined with vector in the ratios recommended by the manufacturer and incubated at 50 °C for 20–30 min before transformation into E. coli. All plasmids were sequenced prior to use.

Cell Culture and Transfection

HeLa S3 cells adapted to adherent culture were maintained in DMEM (4.5 g/L glucose) supplemented with 10% fetal bovine serum (FBS), 2 mM L-glutamine, and 0.37% sodium bicarbonate at 37 °C in 5% CO₂. HeLa Tet-Off Advanced cells (Clontech) were maintained in DMEM supplemented with 10% Tetracycline-free FBS (Clontech), 2 mM L-glutamine, 0.37% sodium bicarbonate, and 100 μg/mL G418 to maintain expression of the transactivator.

Induced Pluripotent Stem (iPS) cells (System Biosciences Cat # SC101A-iPSC, Lot # 110415-01) were cultured on Matrigel (BD Biosciences) in mTesR1 media (STEMCELL Technologies) at 37 °C in 5% CO2. Human foreskin fibroblasts (HFFs; System Biosciences Cat # SC101A-HFF, Lot # 110509) were maintained in DMEM (4.5 g/L glucose) supplemented with 2 mM L-glutamine, 0.1 mM non-essential amino acids, and 10% FBS at 37 °C, 5% CO₂.

HeLa Tet-Off Advanced cells were transfected with reporter plasmids using jetPRIME (Polyplus) prior to allowing them to adhere to the dish. A ratio of 3 µL reagent to 1 µg of plasmid was employed according to the manufacturer’s recommendations. Cells were harvested 24–48 hr after transfection. Transfections were evaluated for ZsGreen expression using a Nikon Diaphot 200 microscope with a GFP-B filter cube (Nikon). Transfection efficiency was generally >70% at ~48 hours post transfection.

Nuclear/Cytoplasmic Fractionation

Nuclei and cytoplasm were separated as described previously⁸⁰. Cells were scraped into ice cold phosphate buffered saline (PBS), collected by centrifugation, resuspended in NP40 lysis buffer (0.5% NP-40, 10 mM Tris-HCl pH 8.5, 1.5 mM MgCl2, 10 mM EDTA, 140 mM NaCl) and incubated on ice for 5 min. Nuclei were pelleted by centrifugation for 5 min at 500 × g at 4 °C, the supernatant/cytoplasm was transferred to a fresh tube and an equal volume of TRIzol was added. After washing with NP40 lysis buffer, the nuclear pellets were lysed in TRIzol. RNA was isolated as described below. The RNA from the nucleus and cytoplasm was resuspended in an equal volume such that an equal number of cells were represented in 1 µl regardless of RNA concentration.

RNA Isolation and qRT-PCR

TRIzol was added to cells after removal of the media. RNA isolation was performed according to the manufacturer’s instructions except an additional phenol/chloroform/isoamyl alcohol (25:24:1) extraction was performed immediately prior to addition of isopropanol. Total RNA was treated with DNAse I (Thermo Fisher Scientific) or with TURBO DNAse (Thermo Fisher Scientific) and DpnI restriction enzyme if plasmid DNA had been transfected into the cells. DNase was removed by phenol/chloroform/IAA extraction and ethanol precipitation.

Reverse transcription was performed in 20 µL reactions with Improm-II Reverse Transcriptase (Promega) using 0.5 µg of random hexamers, according to the manufacturer’s instructions. 1–2.5 µL of cDNA was used to set up triplicate qPCR reactions using IQ SYBR-Green Supermix (BioRad) and primers as listed in Supplementary Table 1. A standard 2 step amplification protocol was performed using a BioRad CFX96 instrument with annealing/extension at 60 °C for 30 sec and denaturation at 95 °C for 10 sec. Amplification efficiency was determined for all primer pairs (Suppl. Table 1) and each pair produced a single product of the expected molecular weight. Relative abundance of mRNAs was determined using the BioRad CFX Manager™ software which relies on the Pfaffl method⁸¹.

Control RNAs and Oligo(dT) Selection

80 fmol of a mixture of in vitro transcribed control RNAs with poly(A) tails of 0, 15, 45 and 149 nt were spiked into 10–20 µg total RNA samples in binding buffer (20 mM Tris-HCl, pH 7.5, 500 mM LiCl, 0.5% LiDS, 1 mM EDTA, 5 mM DTT) prior to fractionation. Oligo(dT)₂₅ magnetic beads (New England Biolabs) were equilibrated in binding buffer. RNA samples were denatured at 70 °C for 2 min before being mixed with the beads and incubated at room temperature for 10 min. A magnetic field was used to retrieve the beads and the unbound fraction was reserved. The beads were washed with wash buffer (20 mM Tris-HCl, pH 7.5, 500 mM LiCl,0.1% LiDS, 1 mM EDTA, 5 mM DTT) containing 1 µL Ribolock RNase Inhibitor (Thermo Fisher Scientific) and the wash was added to the unbound fraction. This step was repeated with wash buffer and again with low salt buffer (20 mM Tris-HCl, pH 7.5, 200 mM LiCl, 1 mM EDTA), with the flow through being added to the unbound fraction each time. Finally, the poly(A)⁺ RNA was eluted in Tris-HCl, pH 7.5, 1 mM EDTA at 50 °C for 2 min. This step was repeated. The RNA in each fraction was recovered by precipitation with glycogen carrier and resuspended in 20 µL of nuclease-free water. Equivalent volumes of the bound (poly(A)+) and unbound(poly(A)−) fractions were used to make cDNA.

Labeling and Isolation of Nascent RNAs

HeLa cells were treated with 500 µM 4-thiouridine for 10 min and then scraped into TRIzol. RNA was isolated using the RNAeasy kit (Qiagen) according to the manufacturer’s protocol. 75 µg of total RNA was labeled with biotin and fractionated using streptavidin magnetic beads (Miltenyi Biotec) as described previously⁵⁰.

Linker-Ligation-Poly(A) Tail (LLM-PAT) Assay

The LLM-PAT assay was performed as described previously³⁵. Briefly, a population with no poly(A) tail was generated by treatment with oligo(dT)₁₈ and 5 U of RNAse H (Fischer-Fermentas, EN0201) for 30 minutes. Next, 1 µg of this treated sample and an untreated RNA sample were each ligated to a linker RNA (10 µM 5′rApp-TTTAACCGCGAATTCCAG-ddC-3′, Linker-3 Integrated DNA Technologies) using T4 RNA Ligase 1 (New England Biolabs) at 16 °C overnight. The RNA was recovered and converted to cDNA using a primer complementary to the linker (5′-CTGGAATTCGCGGTT-3′) and Improm II Reverse Transcriptase (Promega). Finally, the 3′ end of each transcript of interest was amplified by PCR using OneTaq Hot Start Polymerase (New England Biolabs) employing a gene-specific forward primer paired with the linker primer as the reverse primer. The annealing temperature (55–59 °C) and number of cycles (27–32) were adjusted to optimize the amount of product for easy visualization. PCR products were separated and visualized by agarose gel electrophoresis and imaged on a BioRad Gel Doc EZ Imager. Image Lab Software (BioRad) was used to determine poly(A) tail lengths.

RNase H Northern Blot Analysis

HeLa Tet-Off Advanced cells were transfected with the ZNF12 reporter plasmid as described above. Cells were then collected in TRIzol and total RNA was prepared. 10 µg of total RNA was hybridized to a gene-specific DNA oligonucleotide (Actin [5′-GTGGACTTGGGAGAGGACTG-3′] or ZNF12 [5′-TATACAACTTACCAGATGTAATT-3′]) complementary to a region just upstream of the poly(A) signal. Oligo(dT) was included as indicated. Following treatment with 5 U of RNAse H (Fischer-Fermentas, EN0201) for 30 minutes, RNA was recovered and separated on a 5% denaturing polyacrylamide gel. RNA was then transferred to a nylon membrane (Hybond N+, RPN119B GE Healthcare) followed by UV crosslinking. The blots were then pre-washed (0.1xSSC/0.1%SDS) at 65 °C for 1 hour, pre-hybridized (10x Denhart’s Solution, 6x SSC, 0.1%SDS) at 42 °C for 1 hour and then probed using ³²P-end-labeled DNA oligonucleotides (resuspended in pre-hybridization buffer) specific to the gene of interest (Actin [5′-CATAATTTACACGAAAGCAATGCTATC-3′] or ZNF12 [5′-CACAAATATGCTATGAATATAG-3′]) overnight. The following day, blots were washed (6xSSC/0.1%SDS) three times at room temperature for 5 minutes followed by an additional wash at 50 °C for 20 minutes. Following an overnight exposure, phosphor screens were scanned using a Typhoon Trio (GE Healthcare) and the results analyzed with ImageQuant software (GE Healthcare).

DICER and HNRNPK knockdown and western blotting

For DICER knockdown, HeLa S3 cells were transduced with lentiviral particles containing DNA encoding an shRNA that targets DICER (Sigma-Aldrich:TRCN0000051261) or a negative control lentiviral particles generated using the pLKO-1 empty vector⁸². Pools of stably transduced cells were selected with 10 μg/mL puromycin and then maintained in 1 μg/mL puromycin.

To transiently knockdown HNRNPK, HeLa Tet-Off Advanced cells were transfected with a pre-designed siRNA targeting HNRNPK (Sigma, Mission SASI_Hs02_00333311, target sequence: 5′-GCAAGAATATTAAGGCTCT-3′) or control siRNA (Sigma SIC001) at a final concentration of 50 nM using jetPRIME (Polyplus) reagent in accordance with the manufacturer’s protocol. Knockdown was assessed 48 hours after transfection.

In each case, knockdown was verified by qRT-PCR using primers described in Table S2 and also by western blot using rabbit anti-DICER antibodies (H212; Santa Cruz Biotechnology; 1:50 dilution in Tris-buffered saline containing 0.1% Tween-20) or rabbit anti-HNRNPK antibodies (Abcam ab18195, 1:1000). As a loading control, HSC70 was detected using rat anti-HSC70 (1B5; Santa Cruz Biotechnology; 1:1000 dilution) or GAPDH was detected using mouse anti-GAPDH (Millipore Sigma, MAB374, 1:5000). Detection was accomplished using HRP-conjugated secondary antibodies and SuperSignal West Pico Chemiluminescent Substrate (Thermo Fisher Scientific). Blots were imaged using a ChemiDoc XRS (BioRad) and quantified using ImageLab (BioRad).

RNA-fluorescent in situ hybridization (RNA-FISH)

HeLa Tet-Off Advanced cells were transfected with PPIA or ZNF12 reporters and plated on coverslips. After 24–30 hr, cells were fixed and permeabilized using 3:1 methanol:acetic acid for 10 min at room temperature and an additional 2 hr at 4 °C. After washing in Wash Buffer (10% deionized formamide in 2xSSC) the cells were treated with Stellaris RNA-FISH hybridization buffer containing 125 nM of Stellaris RNA-FISH Q570 probe targeting Renilla luciferase (VSMF-1034–5, Biosearch Technologies, Inc.) at 37 °C overnight. Coverslips were washed in Wash Buffer and then in 1xSSC before mounting with Prolong Diamond AntiFade mountant with DAPI (Thermo Fisher Scientific). The cells were visualized using an Olympus IX71 inverted fluorescent microscope at 100X magnification using the 31000 DAPI/Hoechst filter (EX360, EM460) to visualize DAPI and the 41002 TRITC (Rhodamine)/Cy3 filter (EX535, EM610) to visualize Quasar570. Images were captured using Q Imaging Retiga 2000R camera.

Data analysis

To determine the percentage of an mRNA bound to oligo(dT) the amount of each mRNA in equal volumes of the bound and flow through fraction was assessed by qRT-PCR and added to give the total. For graphing and subsequent analysis the amount bound to oligo(dT) was expressed as a percentage of this total. Similarly, to determine the percentage of an mRNA in the cytoplasm the amount of each mRNA in equal volumes of the cytoplasmic and nuclear fractions was assessed by qRT-PCR and added to give the total. The amount in the cytoplasm was expressed as a percentage of the total for graphing and analysis.

Three independent replicates were performed for each experiment unless otherwise noted. For pairwise comparisons, a paired two-tailed t-test was employed. For multiple comparisons, Levene’s test⁸³ was used to demonstrate equal variance, then a one-way ANOVA test was performed with post-hoc Tukey test⁸⁴ or Dunnett’s test as indicated. P values of less than 0.05 were considered significant.

Much of the work described was published in a thesis by Aimee L. Jalkanen⁸⁵.

References

Emerson, R. O. & Thomas, J. H. Adaptive Evolution in Zinc Finger Transcription Factors. PLOS Genet. 5, e1000325 (2009).
Article PubMed PubMed Central CAS Google Scholar
Tadepally, H. D., Burger, G. & Aubry, M. Evolution of C2H2-zinc finger genes and subfamilies in mammals: species-specific duplication and loss of clusters, genes and effector domains. BMC Evol. Biol. 8, 176 (2008).
Article PubMed PubMed Central CAS Google Scholar
Friedman, J. R. et al. KAP-1, a novel corepressor for the highly conserved KRAB repression domain. Genes Dev. 10, 2067–2078 (1996).
Article CAS PubMed Google Scholar
Jacobs, F. M. J. et al. An evolutionary arms race between KRAB zinc-finger genes ZNF91/93 and SVA/L1 retrotransposons. Nature 516, 242–245 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Imbeault, M., Helleboid, P.-Y. & Trono, D. KRAB zinc-finger proteins contribute to the evolution of gene regulatory networks. Nature 543, 550–554 (2017).
Article ADS CAS PubMed Google Scholar
Rowe, H. M. et al. KAP1 controls endogenous retroviruses in embryonic stem cells. Nature 463, 237–240 (2010).
Article ADS CAS PubMed Google Scholar
Thomas, J. H. & Schneider, S. Coevolution of retroelements and tandem zinc finger genes. Genome Res. 21, 1800–1812 (2011).
Article CAS PubMed PubMed Central Google Scholar
Wolfe, S. A., Nekludova, L. & Pabo, C. O. DNA Recognition by Cys2His2 Zinc Finger Proteins. Annu. Rev. Biophys. Biomol. Struct. 29, 183–212 (2000).
Article CAS PubMed Google Scholar
Najafabadi, H. S. et al. C2H2 zinc finger proteins greatly expand the human regulatory lexicon. Nat. Biotechnol. 33, 555–562 (2015).
Article CAS PubMed Google Scholar
Brannan, K. W. et al. SONAR Discovers RNA-Binding Proteins from Analysis of Large-Scale Protein-Protein Interactomes. Mol. Cell 64, 282–293 (2016).
Article CAS PubMed PubMed Central Google Scholar
Brayer, K. J. & Segal, D. J. Keep your fingers off my DNA: protein-protein interactions mediated by C2H2 zinc finger domains. Cell Biochem. Biophys. 50, 111–131 (2008).
Article CAS PubMed Google Scholar
Schmitges, F. W. et al. Multiparameter functional diversity of human C2H2 zinc finger proteins. Genome Res. 26, 1742–1752 (2016).
Article CAS PubMed PubMed Central Google Scholar
Rizkallah, R., Alexander, K. E. & Hurt, M. M. Global mitotic phosphorylation of C2H2 zinc finger protein linker peptides. Cell Cycle Georget. Tex 10, 3327–3336 (2011).
Article CAS Google Scholar
Rizkallah, R., Batsomboon, P., Dudley, G. B. & Hurt, M. M. Identification of the oncogenic kinase TOPK/PBK as a master mitotic regulator of C2H2 zinc finger proteins. Oncotarget 6, 1446–1461 (2015).
Article PubMed PubMed Central Google Scholar
Schnall-Levin, M. et al. Unusually effective microRNA targeting within repeat-rich coding regions of mammalian mRNAs. Genome Res. 21, 1395–1403 (2011).
Article CAS PubMed PubMed Central Google Scholar
Huang, S. et al. MicroRNA-181a modulates gene expression of zinc finger family members by directly targeting their coding regions. Nucleic Acids Res. 38, 7211–7218 (2010).
Article CAS PubMed PubMed Central Google Scholar
Yang, L. et al. Genomewide characterization of non-polyadenylated RNAs. Genome Biol. 12, R16 (2011).
Article CAS PubMed PubMed Central Google Scholar
Neff, A. T., Lee, J. Y., Wilusz, J., Tian, B. & Wilusz, C. J. Global analysis reveals multiple pathways for unique regulation of mRNA decay in induced pluripotent stem cells. Genome Res. 22, 1457–1467 (2012).
Article CAS PubMed PubMed Central Google Scholar
Mukherjee, N. et al. Integrative classification of human coding and noncoding genes through RNA metabolism profiles. Nat. Struct. Mol. Biol. 24, 86–96 (2017).
Article CAS PubMed Google Scholar
O’Geen, H. et al. Genome-wide analysis of KAP1 binding suggests autoregulation of KRAB-ZNFs. PLoS Genet. 3, e89 (2007).
Article PubMed PubMed Central CAS Google Scholar
Blahnik, K. R. et al. Characterization of the Contradictory Chromatin Signatures at the 3′ Exons of Zinc Finger Genes. PLOS ONE 6, e17121 (2011).
Article ADS CAS PubMed PubMed Central Google Scholar
Shen, S. et al. Widespread establishment and regulatory impact of Alu exons in human genes. Proc. Natl. Acad. Sci. USA 108, 2837–2842 (2011).
Article ADS CAS PubMed PubMed Central Google Scholar
Daniel, C., Silberberg, G., Behm, M. & Öhman, M. Alu elements shape the primate transcriptome by cis-regulation of RNA editing. Genome Biol. 15, R28 (2014).
Article PubMed PubMed Central CAS Google Scholar
Kühn, U. et al. Poly(A) tail length is controlled by the nuclear poly(A)-binding protein regulating the interaction between poly(A) polymerase and the cleavage and polyadenylation specificity factor. J. Biol. Chem. 284, 22803–22814 (2009).
Article PubMed PubMed Central CAS Google Scholar
Saguez, C. et al. Nuclear mRNA surveillance in THO/sub2 mutants is triggered by inefficient polyadenylation. Mol. Cell 31, 91–103 (2008).
Article CAS PubMed Google Scholar
Yan, Y.-B. Deadenylation: enzymes, regulation, and functional implications. Wiley Interdiscip. Rev. RNA 5, 421–443 (2014).
Article CAS PubMed Google Scholar
Yamashita, A. et al. Concerted action of poly(A) nucleases and decapping enzyme in mammalian mRNA turnover. Nat. Struct. Mol. Biol. 12, 1054–1063 (2005).
Article CAS PubMed Google Scholar
Chang, H., Lim, J., Ha, M. & Kim, V. N. TAIL-seq: Genome-wide Determination of Poly(A) Tail Length and 3′ End Modifications. Mol. Cell 53, 1044–1052 (2014).
Article CAS PubMed Google Scholar
Subtelny, A. O., Eichhorn, S. W., Chen, G. R., Sive, H. & Bartel, D. P. Poly(A)-tail profiling reveals an embryonic switch in translational control. Nature 508, 66–71 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Dominski, Z. & Marzluff, W. F. Formation of the 3′ end of histone mRNA: getting closer to the end. Gene 396, 373–390 (2007).
Article CAS PubMed PubMed Central Google Scholar
Gu, H., Das Gupta, J. & Schoenberg, D. R. The poly(A)-limiting element is a conserved cis-acting sequence that regulates poly(A) tail length on nuclear pre-mRNAs. Proc. Natl. Acad. Sci. USA 96, 8943–8948 (1999).
Article ADS CAS PubMed PubMed Central Google Scholar
Peng, J. & Schoenberg, D. R. mRNA with a < 20-nt poly(A) tail imparted by the poly(A)-limiting element is translated as efficiently in vivo as long poly(A) mRNA. RNA N. Y. N 11, 1131–1140 (2005).
Article CAS Google Scholar
Charlesworth, A., Meijer, H. A. & de Moor, C. H. Specificity factors in cytoplasmic polyadenylation. Wiley Interdiscip. Rev. RNA 4, 437–461 (2013).
Article CAS PubMed PubMed Central Google Scholar
Rio, D. C., Ares, M., Hannon, G. J. & Nilsen, T. W. Enrichment of poly(A)+ mRNA using immobilized oligo(dT). Cold Spring Harb. Protoc. 2010, pdb.prot5454 (2010).
PubMed Google Scholar
Garneau, N. L., Wilusz, C. J. & Wilusz, J. Chapter 5. In vivo analysis of the decay of transcripts generated by cytoplasmic RNA viruses. Methods Enzymol. 449, 97–123 (2008).
Article CAS PubMed Google Scholar
Loflin, P. T., Chen, C.-Y. A., Xu, N. & Shyu, A.-B. Transcriptional Pulsing Approaches for Analysis of mRNA Turnover in Mammalian Cells. Methods 17, 11–20 (1999).
Article CAS PubMed Google Scholar
Webster, M. W. et al. mRNA Deadenylation Is Coupled to Translation Rates by the Differential Activities of Ccr4-Not Nucleases. Mol. Cell 70, 1089–1100.e8 (2018).
Article CAS PubMed PubMed Central Google Scholar
Yi, H. et al. PABP Cooperates with the CCR4-NOT Complex to Promote mRNA Deadenylation and Block Precocious Decay. Mol. Cell 70, 1081–1088.e5 (2018).
Article CAS PubMed Google Scholar
Katahira, J. Nuclear export of messenger RNA. Genes 6, 163–184 (2015).
Article CAS PubMed PubMed Central Google Scholar
Sultan, M. et al. Influence of RNA extraction methods and library selection schemes on RNA-seq data. BMC Genomics 15, 675 (2014).
Article PubMed PubMed Central CAS Google Scholar
Nowick, K., Hamilton, A. T., Zhang, H. & Stubbs, L. Rapid sequence and expression divergence suggest selection for novel function in primate-specific KRAB-ZNF genes. Mol. Biol. Evol. 27, 2606–2617 (2010).
Article CAS PubMed PubMed Central Google Scholar
Shukla, C. J. et al. High-throughput identification of RNA nuclear enrichment sequences. EMBO J. 37 (2018).
Lubelsky, Y. & Ulitsky, I. Sequences enriched in Alu repeats drive nuclear localization of long RNAs in human cells. Nature 555, 107–111 (2018).
Article ADS CAS PubMed PubMed Central Google Scholar
Naganuma, T. et al. Alternative 3′-end processing of long noncoding RNA initiates construction of nuclear paraspeckles. EMBO J. 31, 4020–4034 (2012).
Article CAS PubMed PubMed Central Google Scholar
Mikula, M., Bomsztyk, K., Goryca, K., Chojnowski, K. & Ostrowski, J. Heterogeneous nuclear ribonucleoprotein (HnRNP) K genome-wide binding survey reveals its role in regulating 3′-end RNA processing and transcription termination at the early growth response 1 (EGR1) gene through XRN2 exonuclease. J. Biol. Chem. 288, 24788–24798 (2013).
Article CAS PubMed PubMed Central Google Scholar
Van Nostrand, E. L. et al. Robust transcriptome-wide discovery of RNA-binding protein binding sites with enhanced CLIP (eCLIP). Nat. Methods 13, 508–514 (2016).
Article PubMed PubMed Central CAS Google Scholar
Shin, C. H., Ryu, S. & Kim, H. H. hnRNPK-regulated PTOV1-AS1 modulates heme oxygenase-1 expression via miR-1207-5p. BMB Rep. 50, 220–225 (2017).
Article PubMed PubMed Central Google Scholar
Panwar, B., Omenn, G. S., Guan, Y. & Hofacker, I. miRmine: a database of human miRNA expression profiles. Bioinformatics 33, 1554–1560 (2017).
CAS PubMed PubMed Central Google Scholar
Rybak, A. et al. A feedback loop comprising lin-28 and let-7 controls pre-let-7 maturation during neural stem-cell commitment. Nat. Cell Biol. 10, 987–993 (2008).
Article CAS PubMed Google Scholar
Lee, J. E. et al. The PARN deadenylase targets a discrete set of mRNAs for decay and regulates cell motility in mouse myoblasts. PLoS Genet. 8, e1002901 (2012).
Article CAS PubMed PubMed Central Google Scholar
Windhager, L. et al. Ultra short and progressive 4sU-tagging reveals key characteristics of RNA processing at nucleotide resolution. Genome Res., https://doi.org/10.1101/gr.131847.111 (2012).
Article CAS PubMed PubMed Central Google Scholar
Wilusz, J. E., Freier, S. M. & Spector, D. L. 3′ end processing of a long nuclear-retained noncoding RNA yields a tRNA-like cytoplasmic RNA. Cell 135, 919–932 (2008).
Article CAS PubMed PubMed Central Google Scholar
Wilusz, J. E. et al. A triple helix stabilizes the 3′ ends of long noncoding RNAs that lack poly(A) tails. Genes Dev. 26, 2392–2407 (2012).
Article CAS PubMed PubMed Central Google Scholar
Neve, J., Patel, R., Wang, Z., Louey, A. & Furger, A. M. Cleavage and polyadenylation: Ending the message expands gene regulation. RNA Biol. 1–26, https://doi.org/10.1080/15476286.2017.1306171 (2017).
Article PubMed PubMed Central Google Scholar
Kelly, S. M. et al. A conserved role for the zinc finger polyadenosine RNA binding protein, ZC3H14, in control of poly(A) tail length. RNA N. Y. N 20, 681–688 (2014).
Article CAS Google Scholar
Boelens, W. C. et al. The human U1 snRNP-Specific U1A protein inhibits polyadenylation of its own pre-mRNA. Cell 72, 881–892 (1993).
Article CAS PubMed Google Scholar
Ko, B. & Gunderson, S. I. Identification of new poly(A) polymerase-inhibitory proteins capable of regulating pre-mRNA polyadenylation. J. Mol. Biol. 318, 1189–1206 (2002).
Article CAS PubMed Google Scholar
Gu, H. & Schoenberg, D. R. U2AF modulates poly(A) length control by the poly(A)-limiting element. Nucleic Acids Res. 31, 6264–6271 (2003).
Article CAS PubMed PubMed Central Google Scholar
Zhang, B. et al. A Novel RNA Motif Mediates the Strict Nuclear Localization of a Long Noncoding RNA. Mol. Cell. Biol. 34, 2318–2329 (2014).
Article PubMed PubMed Central CAS Google Scholar
Silla, T., Karadoulama, E., Mąkosa, D., Lubas, M. & Jensen, T. H. The RNA Exosome Adaptor ZFC3H1 Functionally Competes with Nuclear Export Activity to Retain Target Transcripts. Cell Rep. 23, 2199–2210 (2018).
Article CAS PubMed PubMed Central Google Scholar
Miyagawa, R. et al. Identification of cis- and trans-acting factors involved in the localization of MALAT-1 noncoding RNA to nuclear speckles. RNA 18, 738–751 (2012).
Article CAS PubMed PubMed Central Google Scholar
Nakagawa, S., Yamazaki, T. & Hirose, T. Molecular dissection of nuclear paraspeckles: towards understanding the emerging world of the RNP milieu. Open Biol. 8(10), pii: 180150, Review. PubMed PMID: 30355755, https://doi.org/10.1098/rsob.180150 (2018).
Article PubMed PubMed Central Google Scholar
Björk, P. & Wieslander, L. Integration of mRNP formation and export. Cell. Mol. Life Sci. 1–23, https://doi.org/10.1007/s00018-017-2503-3 (2017).
Article PubMed PubMed Central CAS Google Scholar
Lei, H. et al. Evidence that a consensus element found in naturally intronless mRNAs promotes mRNA export. Nucleic Acids Res. 41, 2517–2525 (2013).
Article CAS PubMed Google Scholar
Lei, H., Dias, A. P. & Reed, R. Export and stability of naturally intronless mRNAs require specific coding region sequences and the TREX mRNA export complex. Proc. Natl. Acad. Sci. USA 108, 17985–17990 (2011).
Article ADS CAS PubMed PubMed Central Google Scholar
Cohen, H. R. & Panning, B. XIST RNA exhibits nuclear retention and exhibits reduced association with the export factor TAP/NXF1. Chromosoma 116, 373–383 (2007).
Article CAS PubMed Google Scholar
Royo, H. et al. Bsr, a Nuclear-retained RNA with Monoallelic Expression. Mol. Biol. Cell 18, 2817–2827 (2007).
Article CAS PubMed PubMed Central Google Scholar
Mas-Ponte, D. et al. LncATLAS database for subcellular localisation of long noncoding RNAs. RNA N. Y. N, https://doi.org/10.1261/rna.060814.117 (2017).
Article CAS PubMed PubMed Central Google Scholar
Zhang, Z. & Carmichael, G. G. The Fate of dsRNA in the Nucleus. Cell 106, 465–476 (2001).
Article CAS PubMed Google Scholar
Chen, L.-L., DeCerbo, J. N. & Carmichael, G. G. Alu element‐mediated gene silencing. EMBO J. 27, 1694–1705 (2008).
Article CAS PubMed PubMed Central Google Scholar
Chen, L.-L. & Carmichael, G. G. Altered nuclear retention of mRNAs containing inverted repeats in human embryonic stem cells: functional role of a nuclear noncoding RNA. Mol. Cell 35, 467–478 (2009).
Article CAS PubMed PubMed Central Google Scholar
Prasanth, K. V. et al. Regulating Gene Expression through RNA Nuclear Retention. Cell 123, 249–263 (2005).
Article CAS PubMed Google Scholar
Pimentel, H. et al. A dynamic intron retention program enriched in RNA processing genes regulates gene expression during terminal erythropoiesis. Nucleic Acids Res. 44, 838–851 (2016).
Article CAS PubMed Google Scholar
Naro, C. et al. An Orchestrated Intron Retention Program in Meiosis Controls Timely Usage of Transcripts during Germ Cell Differentiation. Dev. Cell 41, 82–93.e4 (2017).
Article CAS PubMed PubMed Central Google Scholar
Ninomiya, K., Kataoka, N. & Hagiwara, M. Stress-responsive maturation of Clk1/4 pre-mRNAs promotes phosphorylation of SR splicing factor. J. Cell Biol. 195, 27–40 (2011).
Article CAS PubMed PubMed Central Google Scholar
Sampath, K. & Ephrussi, A. CncRNAs: RNAs with both coding and non-coding roles in development. Development 143, 1234–1241 (2016).
Article CAS PubMed Google Scholar
Palazzo, A. F. et al. The Signal Sequence Coding Region Promotes Nuclear Export of mRNA. PLOS Biol. 5, e322 (2007).
Article PubMed PubMed Central CAS Google Scholar
Liu, C. et al. Steroid receptor RNA activator: Biologic function and role in disease. Clin. Chim. Acta Int. J. Clin. Chem. 459, 137–146 (2016).
Article CAS Google Scholar
Emberley, E. et al. Identification of new human coding steroid receptor RNA activator isoforms. Biochem. Biophys. Res. Commun. 301, 509–515 (2003).
Article CAS PubMed Google Scholar
Weil, D., Boutain, S., Audibert, A. & Dautry, F. Mature mRNAs accumulated in the nucleus are neither the molecules in transit to the cytoplasm nor constitute a stockpile for gene expression. RNA 6, 962–975 (2000).
Article CAS PubMed PubMed Central Google Scholar
Pfaffl, M. W. A new mathematical model for relative quantification in real-time RT–PCR. Nucleic Acids Res. 29, e45 (2001).
Article CAS PubMed PubMed Central Google Scholar
Stewart, S. A. et al. Lentivirus-delivered stable gene silencing by RNAi in primary cells. RNA N. Y. N 9, 493–501 (2003).
Article CAS Google Scholar
Levene, H. Robust Tests for Equality of Variances. in Contributions to Probability and Statistics: Essays in Honor of Harold Hotelling (ed. Olkin, I.) 278–292 (Stanford University Press, 1960).
Tukey, J. Comparing Individual Means in the Analysis of Variance. Biometrics 5, 99–114 (1949).
Article MathSciNet CAS PubMed Google Scholar
Jalkanen, A. Repeated sequences encoding Cys2His2 zinc finger motifs influence mRNA polyadenylation and localization. (Colorado State University, 2017).
Marchler-Bauer, A. & Bryant, S. H. CD-Search: protein domain annotations on the fly. Nucleic Acids Res. 32, W327–331 (2004).
Article CAS PubMed PubMed Central Google Scholar
Wong, N. & Wang, X. miRDB: an online resource for microRNA target prediction and functional annotations. Nucleic Acids Res. 43, D146–D152 (2015).
Article CAS PubMed Google Scholar

Download references

Acknowledgements

We thank Simo Li for assistance in evaluating ZNF reporter gene expression, Tori Taillac and Alba Mariner for help with cloning, and Barbara Graham for statistical advice. This work was supported by the National Institutes of Health, National Institute for General Medical Sciences Award #GM114247 to CJW and JW. AJ was supported by the Morris Animal Foundation and National Institutes of Health Training Grant OD010437 (Principal Investigator: Edward Hoover). AMH was supported in part through NSF-NRT Award # 1450032 (PrincipaI Investigator: Tom Chen) and an NSF Graduate Research Fellowship.

Author information

Joseph Russo and Aimee L. Jalkanen contributed equally.

Authors and Affiliations

Department of Microbiology, Immunology & Pathology, Colorado State University, Fort Collins, Colorado, USA
Joseph Russo, Caleb M. Schmidt, Jeffrey Wilusz & Carol J. Wilusz
Program in Cell & Molecular Biology, Colorado State University, Fort Collins, Colorado, USA
Aimee L. Jalkanen, Adam M. Heck, Jeffrey Wilusz & Carol J. Wilusz

Authors

Joseph Russo
View author publications
You can also search for this author in PubMed Google Scholar
Aimee L. Jalkanen
View author publications
You can also search for this author in PubMed Google Scholar
Adam M. Heck
View author publications
You can also search for this author in PubMed Google Scholar
Caleb M. Schmidt
View author publications
You can also search for this author in PubMed Google Scholar
Jeffrey Wilusz
View author publications
You can also search for this author in PubMed Google Scholar
Carol J. Wilusz
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Conceptualization, C.J.W. and J.W.; Methodology, J.R., A.L.J.; Investigation, J.R., A.L.J., A.M.H., C.M.S.; Formal Analysis, A.M.H.; Resources, C.M.S.; Writing –Original Draft, C.J.W., Writing – Reviewing and Editing, J.R., A.L.J., A.M.H., C.M.S., J.W., C.J.W.; Visualization, A.M.H., A.L.J., C.J.W.; Supervision, C.J.W., J.W., J.R., A.L.J.; Project Administration, C.J.W., J.W.; Funding Acquisition, J.W., C.J.W., A.L.J., A.M.H.

Corresponding author

Correspondence to Carol J. Wilusz.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplemental Figures

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Russo, J., Jalkanen, A.L., Heck, A.M. et al. Sequences encoding C2H2 zinc fingers inhibit polyadenylation and mRNA export in human cells. Sci Rep 8, 16995 (2018). https://doi.org/10.1038/s41598-018-35138-4

Download citation

Received: 02 October 2018
Accepted: 31 October 2018
Published: 19 November 2018
DOI: https://doi.org/10.1038/s41598-018-35138-4

Keywords

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.