Surveying the global landscape of post-transcriptional regulators

Reynaud, Kendra; McGeachy, Anna M.; Noble, David; Meacham, Zuriah A.; Ingolia, Nicholas T.

doi:10.1038/s41594-023-00999-5

Article
Open access
Published: 25 May 2023

Kendra Reynaud¹, Anna M. McGeachy², David Noble², Zuriah A. Meacham² & Nicholas T. Ingolia^1,2 (ORCID: orcid.org/0000-0002-3395-1545)

Nature Structural & Molecular Biology volume 30, pages 740–752 (2023)

Abstract

Numerous proteins regulate gene expression by modulating mRNA translation and decay. To uncover the full scope of these post-transcriptional regulators, we conducted an unbiased survey that quantifies regulatory activity across the budding yeast proteome and delineates the protein domains responsible for these effects. Our approach couples a tethered function assay with quantitative single-cell fluorescence measurements to analyze ~50,000 protein fragments and determine their effects on a tethered mRNA. We characterize hundreds of strong regulators, which are enriched for canonical and unconventional mRNA-binding proteins. Regulatory activity typically maps outside the RNA-binding domains themselves, highlighting a modular architecture that separates mRNA targeting from post-transcriptional regulation. Activity often aligns with intrinsically disordered regions that can interact with other proteins, even in core mRNA translation and degradation factors. Our results thus reveal networks of interacting proteins that control mRNA fate and illuminate the molecular basis for post-transcriptional gene regulation.

Main

A network of proteins regulates the expression of messenger RNA (mRNA) to maintain homeostasis and adapt cell physiology to changing environments¹. This network includes cis-acting mRNA sequence elements and trans-acting factors that bind the transcript to regulate its fate². RNA-binding proteins (RBPs) determine whether an mRNA is translationally activated or repressed, localized to a specific region within the cell, or degraded¹. RBPs can also remodel RNA structure and act as chaperones to prevent RNA aggregation^3,4. Determining the effects of regulatory RBPs is critical to understanding post-transcriptional control of gene expression.

Efforts to identify RBPs and their mRNA targets have revealed general principles of protein-RNA interactions. Recurring RNA-binding domains (RBDs) can individually recognize four to nine nucleotide motifs in RNA, and often appear in combination to achieve greater specificity^5,6,7. Protein-RNA crosslinking reveals a diverse mRNA interactome that includes many proteins without canonical RBDs⁸, including ~700 high-confidence RNA-protein interactions in budding yeast^8,9. Reciprocally, crosslinking and immunoprecipitation (CLIP) experiments have defined the mRNA targets for hundreds of these RBPs^2,10,11. These approaches expose a dense web of interactions, suggesting complex patterns of post-transcriptional regulation, but not the functional impact of these proteins on their target mRNAs. Measuring how individual RBPs regulate their direct mRNA targets¹² and examining how these targets change when the protein is perturbed¹³ do not provide a scalable approach to characterize the regulatory networks underlying post-transcriptional regulation.

The modular architecture of regulatory RBPs⁵ has spurred the development of the tethered function assay to bypass the endogenous RNA specificity of RBPs and instead measure their activity on a heterologous reporter transcript¹⁴. This approach can interrogate the regulatory effects of RBPs, isolated domains, or cofactors that do not bind RNA directly¹⁵. In the tethering assay, candidate regulatory proteins are targeted to a reporter transcript using the specific, high-affinity interaction between a bacteriophage coat protein and a cognate RNA hairpin, obviating whatever interactions might recruit a protein to its endogenous targets. This independence from endogenous target mRNAs and compatibility with robust reporters make the tethering assay well suited to high-throughput characterization of post-transcriptional regulators. Indeed, tethering assays have revealed over 50 regulatory proteins in a systematic analysis of 700 full-length human RBPs¹⁶. The assay is also amenable to unbiased screening, as demonstrated by the identification of almost 300 post-transcriptional regulators in trypanosomes¹⁷.

In this Article we adapt the tethering assay to survey regulatory activity across the entire yeast proteome. Our approach allowed us to identify hundreds of proteins that modulate mRNA translation and stability, including highly active, non-canonical RBPs. We subdivided proteins and mapped their regulatory activity to particular domains and regions, in some cases uncovering effects that are not apparent in the context of the full-length protein. This fine resolution allowed us to identify protein domains and short peptide motifs enriched among the most active post-transcriptional regulators. Notably, although many active regulators were canonical RBPs, their regulatory activity generally mapped outside the RBD. Our systematic, functional characterization of post-transcriptional regulators in budding yeast expands our understanding of the complex network of proteins that control RNA metabolism.

Results

Functional analysis of post-transcriptional regulators

We set out to functionally assess the RNA regulatory activity of proteins across the entire yeast proteome through the tethered-function assay. Flow cytometry provides high-throughput, single-cell phenotypic measurements and enables large, pooled screens using fluorescence-activated cell sorting (FACS). FACS analysis of tethered-function assays relies on fluorescent protein reporters¹⁷, and so we devised a budding yeast tethering assay coupled to a ratiometric fluorescence readout. We tethered a transcript encoding a yellow fluorescent protein (YFP) with five boxB hairpins in its 3′ untranslated region (UTR) to a candidate regulatory protein fused to the λN coat protein¹⁸. To control for non-specific changes in cell size and physiology, we normalized the YFP measurements against a red fluorescent protein (RFP) control expressed from a transcript that is not targeted by λN. Changes in the ratio of fluorescence intensity between the yellow reporter and the red control precisely measure specific regulatory activity affecting the targeted mRNA while controlling for global effects (Fig. 1a). To further control for the possibility that binding of λN itself affects the reporter, we normalized the fluorescence ratio of the tethered fusion constructs against a tethered HaloTag protein, which exhibits no inherent regulatory effect.
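The two normalization steps described above (YFP against RFP within each cell, then the tethered construct against the HaloTag control) can be sketched as follows. This is a minimal illustration rather than the authors' analysis code: the function names, the use of a per-cell median, and the toy intensities are all assumptions.

```python
import math
from statistics import median

def median_log2_ratio(yfp, rfp):
    """Median of per-cell log2(YFP/RFP) for one population of cells."""
    return median(math.log2(y / r) for y, r in zip(yfp, rfp))

def fold_change_vs_control(sample, control):
    """Reporter fold change of a tethered construct relative to a control.

    `sample` and `control` are (yfp, rfp) pairs of per-cell intensities.
    """
    delta = median_log2_ratio(*sample) - median_log2_ratio(*control)
    return 2 ** delta

# Toy intensities, invented for illustration only.
halo = ([100, 200, 400], [100, 200, 400])          # control: YFP/RFP = 1
activator = ([300, 600, 1200], [100, 200, 400])    # YFP/RFP = 3
# Doubling both channels mimics a larger cell; the ratio is unchanged.
bigger_cells = ([600, 1200, 2400], [200, 400, 800])

print(fold_change_vs_control(activator, halo))
print(fold_change_vs_control(bigger_cells, halo))
```

Because both channels scale together with cell size in this toy example, the ratiometric readout reports the same fold change for `activator` and `bigger_cells`, which is the point of normalizing against RFP.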

**Fig. 1: The dual reporter tethering assay reports reproducible and quantitative regulatory effects.**

We validated our assay by measuring how well characterized regulators affected reporter expression. Tethered poly(A)-binding protein (Pab1 in budding yeast) enhances reporter expression by stabilizing mRNA¹⁴ and promoting its translation¹⁹. We observed an approximately threefold target RNA activation by tethered Pab1-λN, relative to an inactive HaloTag-λN control. Conversely, the CCR4–NOT complex is responsible for the majority of cytosolic mRNA deadenylation²⁰, and tethering of the CAF1 deadenylase (Pop2 in budding yeast) greatly destabilizes target mRNAs²¹. We saw approximately fivefold reporter repression by tethered Pop2-λN (Fig. 1b,c). We further tested how the particular choice of the λN•boxB interaction pair affected our results by tethering Pab1 and Pop2 to reporters containing one PP7 hairpin using fusions with the PP7 coat protein (PP7cp)¹⁸ (Fig. 1b and Extended Data Fig. 1a). Both PP7cp fusions showed similar activity on their cognate targets as λN, although Pop2-PP7cp repression appeared weaker than Pop2-λN repression, potentially due to the use of only a single PP7 hairpin (Fig. 1c). We went on to measure the activity of the RBP Sgn1, which is linked to translation by genetic interactions and co-immunoprecipitation with Pab1 (Extended Data Fig. 1b)²². We found that Sgn1 served as a powerful activator that upregulated YFP expression by over sixfold relative to RFP (Fig. 1d), in addition to modestly increasing RFP levels and cell size (Extended Data Fig. 1c–f). Sgn1 tethering increased YFP RNA abundance ~2.5-fold (Extended Data Fig. 1g); based on the larger change we see in YFP fluorescence, we infer that it activates translation as well. These results confirm that our tethering assay provides robust and quantitative measurements of mRNA-specific regulatory activity, even in the face of additional non-specific effects on the cell, and thus provides a powerful tool for a high-throughput, proteome-wide survey of mRNA regulators.
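The inference that Sgn1 also activates translation follows from simple arithmetic, sketched below. The sketch assumes that reporter fluorescence is proportional to mRNA abundance multiplied by protein output per transcript, and that reporter protein stability is unchanged; neither assumption is stated in the text, and "over sixfold" is treated as exactly 6.0 to give a lower bound.

```python
def translation_fold_change(fluorescence_fold, mrna_fold):
    """Per-transcript protein output change implied by the two measurements.

    Assumes fluorescence ~ (mRNA abundance) x (protein made per mRNA).
    """
    return fluorescence_fold / mrna_fold

# Sgn1: >6-fold YFP/RFP change, ~2.5-fold change in YFP mRNA abundance.
sgn1_translation = translation_fold_change(6.0, 2.5)
print(f"Implied translational activation: at least {sgn1_translation:.1f}-fold")
```

Under these assumptions, the change in mRNA abundance accounts for only part of the fluorescence change, leaving roughly 2.4-fold to be explained by translation.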

A proteome-wide survey of post-transcriptional regulators

We set out to comprehensively survey the yeast proteome for post-transcriptional regulators by creating a large pool of cells that each expressed one λN fusion construct, sorting these cells into subpopulations according to their fluorescence phenotypes, and quantifying the tethering constructs in each of these sorted groups by deep sequencing. Tethering protein fusions with regulatory activity would alter the fluorescence phenotype of the host cell, shifting it into a subpopulation with an unusually low or high fluorescence ratio (Fig. 1a), and altering its distribution across the sorted cells.

We began by generating a proteome-scale library of λN fusions that would enable unbiased discovery of regulatory proteins and identification of functional domains within these regulators. We reasoned that we could construct an unbiased λN fusion library directly from randomly fragmented genomic DNA as budding yeasts have a compact and intron-poor genome, and thereby obtain a uniform representation of all proteins. However, we required an additional selection for fragments matching the correct strand and frame of a gene. We generated fragments by transposon-mediated tagmentation^23,24 and selected fragments of ~500 base pairs to capture whole protein domains, which have a typical size of ~100 amino acids²⁵ (Fig. 2a and Extended Data Fig. 2a). We captured these fragments into a vector that required in-frame translation through the fragmented sequence to express a downstream selectable marker (Fig. 2b). We found that ten out of ten individual clones encoded in-frame fusions (Supplementary Data 1). We then transferred our fragment library into a λN fusion expression vector and added random, 25-nucleotide barcodes that identify each fragment uniquely (Supplementary Data 2)^26,27. The mean fragment size in our barcoded λN fusion library was ~500 base pairs, consistent with the fragment size of the genomic DNA input (Fig. 2c), and contained at least one representative fragment from roughly half of all yeast genes.
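A quick calculation shows why random 25-nucleotide barcodes can identify fragments uniquely. The sketch below assumes barcodes are drawn uniformly from all four bases and uses a library size of 50,000, taken from the approximate fragment count given in the Abstract; the real library may carry a different number of barcodes.

```python
def barcode_space(length, alphabet_size=4):
    """Number of distinct barcodes of a given length."""
    return alphabet_size ** length

def expected_colliding_pairs(n_barcodes, space):
    """Expected number of barcode pairs that share a sequence by chance."""
    return n_barcodes * (n_barcodes - 1) / (2 * space)

space = barcode_space(25)                       # 4**25 possible sequences
pairs = expected_colliding_pairs(50_000, space)
print(f"{space:.2e} possible barcodes, {pairs:.1e} expected colliding pairs")
```

With about 10^15 possible sequences, the expected number of chance collisions in a library of this size is far below one.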

**Fig. 2: Generating an unbiased, proteome-wide survey of tethered in-frame protein fragments.**

We analyzed the regulatory activity of each individual protein fragment in our library by pooled transformation, flow sorting and sequencing. We separated a population of cells transformed with our λN fusion library into four subpopulations of equal size according to the YFP/RFP fluorescence ratio, isolated library plasmid DNA from sorted cells, and quantified the barcodes by next-generation sequencing (Fig. 2d). We expected activators to be enriched in bins with higher YFP/RFP ratios, while repressors should be enriched in bins with lower ratios.
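Sorting into four subpopulations of equal size amounts to placing gates at the quartiles of the YFP/RFP ratio. A minimal sketch follows, assuming per-cell ratios are available as a list; the function names are illustrative, and actual sorter gates are set on the instrument rather than in software like this.

```python
from bisect import bisect_right
from collections import Counter
from statistics import quantiles

def quartile_gates(ratios):
    """Three cut points that split the population into four equal bins."""
    return quantiles(ratios, n=4)

def assign_bin(ratio, gates):
    """Bin index from 0 (lowest YFP/RFP) to 3 (highest YFP/RFP)."""
    return bisect_right(gates, ratio)

# Toy population of 100 cells with ratios 1..100 (illustrative only).
population = list(range(1, 101))
gates = quartile_gates(population)
bin_sizes = Counter(assign_bin(r, gates) for r in population)
print(gates, dict(bin_sizes))
```

A construct with no regulatory activity should be spread evenly across the four bins, whereas an activator or repressor is skewed toward bin 3 or bin 0, respectively.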

Indeed, certain tethering constructs displayed a dramatic skew in their abundance across the sorted cells. For example, one fragment of the RBP Sbp1 was sorted almost entirely into the highest YFP gate, indicating that it strongly activated reporter expression (Fig. 2e). We saw a similar strong enrichment for fragments of Pab1 (Extended Data Fig. 2b), reproducing the positive effect of tethering full-length Pab1 (Fig. 1c). Conversely, fragments of the nonsense-mediated decay factor Ebs1 and the RNA destabilizing protein Cth1 acted as strong repressors that were found almost exclusively in the lowest YFP subpopulation (Fig. 2f and Extended Data Fig. 2c). To quantify this enrichment, we computed an ‘activity score’ for each fragment: a maximum likelihood estimate of its average fluorescence, expressed as a z-score relative to the overall population. These scores ranged from −1.9 for strong repressors like Ebs1 and Cth1 to +1.9 for strong activators like Sbp1 and Pab1. Most fragments in our library had activity scores close to zero, indicating little or no effect on reporter transcript expression (Fig. 2g and Supplementary Data 3). Activity scores were reproducible between two biological replicate screens (Extended Data Fig. 2d); fragments with adequate sequencing coverage (at least 1,000 total reads across all bins) in both experiments had an activity score correlation of r ≈ 0.7. We did note a linear rescaling of scores between the two screens, leading to saturation of strong activators and repressors in one replicate screen relative to the other. Because of this saturation effect, the correlation, although strong, underrepresents the actual agreement between the two screens. We relied on activity scores derived from the screen with broader dynamic range for our subsequent analysis.
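One way to turn read counts across the four sorted bins into a single score is sketched below. It is not the authors' estimator, whose details are not given in this section. The sketch assumes that the population's log fluorescence ratio is standard normal, that each fragment's cells are normally distributed with unit variance around an unknown mean, and that the mean can be found by a simple grid search; the example counts are invented.

```python
import math
from statistics import NormalDist

STD_NORMAL = NormalDist()
# Quartile gates of a standard-normal population: four equal-size bins.
GATES = [STD_NORMAL.inv_cdf(q) for q in (0.25, 0.5, 0.75)]

def bin_probabilities(mu, sigma=1.0):
    """Probability that a cell from N(mu, sigma) lands in each of the 4 bins."""
    dist = NormalDist(mu, sigma)
    edges = [0.0] + [dist.cdf(g) for g in GATES] + [1.0]
    return [hi - lo for lo, hi in zip(edges, edges[1:])]

def log_likelihood(counts, mu, sigma=1.0):
    """Multinomial log-likelihood of the bin counts (constant term omitted)."""
    probs = bin_probabilities(mu, sigma)
    return sum(c * math.log(max(p, 1e-300)) for c, p in zip(counts, probs))

def activity_score(counts, lo=-3.0, hi=3.0, steps=601):
    """Grid-search maximum likelihood estimate of the fragment mean (z units)."""
    grid = [lo + i * (hi - lo) / (steps - 1) for i in range(steps)]
    return max(grid, key=lambda mu: log_likelihood(counts, mu))

# Invented read counts, ordered from lowest to highest YFP/RFP bin.
print(activity_score([250, 250, 250, 250]))  # evenly spread: near zero
print(activity_score([10, 20, 70, 900]))     # skewed high: activator-like
print(activity_score([900, 70, 20, 10]))     # skewed low: repressor-like
```

Under this model, a fragment found only in an extreme bin has no finite maximum likelihood estimate, so any practical implementation needs a bound such as the grid limits used here; the limits chosen are arbitrary.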

We identified active fragments from many well-known regulatory proteins, such as the translation initiation factor Ded1^28,29,30 and Ngr1, which induces the decay of POR1 mRNA³¹. Our unbiased approach also uncovered post-transcriptional regulation in proteins with other well characterized cellular functions, including the small heat shock chaperone Hsp26, which also has previously identified mRNA-binding activity³². Furthermore, we uncovered regulatory regions in proteins of unknown function, like Her1, which may interact with ribosomes based on co-purification experiments³³. These results illustrate the power of our approach to discover proteins that control mRNA stability and translational efficiency and quantify how this affects gene expression.

Full-length protein activity resembles truncated fragments

We selected 12 fragments across a range of activity scores and biological functions (Fig. 3a) and directly measured their effect on reporter fluorescence. All 12 fragments shifted the fluorescence ratio in the direction expected from the large-scale survey (Fig. 3b), and the magnitude of the change correlated very well with their activity score (r = 0.91) (Fig. 3c and Extended Data Fig. 3a–e). This strong quantitative agreement demonstrates that the activity score derived from sorting and sequencing is an accurate measure of the regulatory effect of a fragment.

**Fig. 3: Protein fragment activity in the tethering screen represents real, verifiable regulatory function.**

Isolated protein fragments may have different activities than the full protein from which they are derived due to the absence of regulatory domains, altered protein-protein interactions, or other reasons. We thus selected a handful of active fragments to explore how fragment activity relates to the full protein. Sbp1 is an RBP with two RNA recognition motifs (RRMs) in addition to an arginine–glycine–glycine (RGG) motif that recruits Pab1³⁴. The fragment that we characterized as an approximately threefold activator (Fig. 2e) contained only the first RRM and the RGG motif, whereas the full-length version of the protein was a weaker, approximately twofold activator (Fig. 3d). We hypothesize that the inclusion of the second RRM interferes with Pab1 recruitment, making it a weaker activator. In other cases, such as Sro9, the full-length protein had a stronger effect than the isolated fragment. Sro9 is an RBP that contains a La-motif and is hypothesized to activate translation through recruitment of the closed-loop-forming translation initiation complex³⁵. We identified an Sro9 fragment that activated expression approximately twofold, whereas the full-length protein increased reporter expression by nearly fourfold (Fig. 3e). Tethering the entire yeast Puf-domain protein Jsn1 likewise produced a stronger repressive effect than the fragment we identified in our tethering library (Fig. 3f). In contrast, the intact version of the endocytic protein Yap1801³⁶ was less repressive than our fragment (Fig. 3g), perhaps because of differences in localization³⁷. Nonetheless, in all four cases, the full-length protein exerted an effect in the same direction as the fragment tested in our screen. Our approach is thus well suited to survey the regulatory activity contained in the native proteome and ascribe functions to RBPs.

Activity in RBPs but not RBDs

Our tethering assay can detect regulatory activity in truncated proteins lacking RBDs and in co-regulator proteins that lack intrinsic RNA-binding activity. Nonetheless, we did expect a substantial overlap between the post-transcriptional regulators detected in the screen and known RBPs. To test this hypothesis, we compiled a list of budding yeast RBPs from proteins appearing in at least two of four RNA-protein interaction datasets (Fig. 4a)^9,38,39,40,41. Fragments from these known RBPs had substantially higher absolute activity scores than the overall proteome (Fig. 4b), further confirming the relevance of our results for endogenous programs of post-transcriptional regulation controlled by these RBPs. It also raised the question of whether regulatory activity was associated with the RBDs of these RBPs.

**Fig. 4: Global analyses reveal enrichment of protein domains, motifs and protein-protein interactions amongst most active screen fragments.**

Our fragment library allowed us to ascribe quantitative regulatory effects to particular regions and domains within proteins. We were thus able to investigate which protein domains were enriched among the most active fragments in our screen, and whether these active regions coincided with RBDs. We identified fragments that contained at least 75% of some protein domain family from the Pfam database⁴² and tested each family individually to determine whether the activity scores of fragments containing that family were significantly higher or lower than the library overall (Fig. 4c).
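The 75% criterion for assigning a fragment to a domain family can be expressed as an interval overlap. The sketch below assumes 1-based, inclusive amino acid coordinates for both the fragment and the Pfam domain annotation; the function names and example coordinates are hypothetical.

```python
def domain_coverage(frag_start, frag_end, dom_start, dom_end):
    """Fraction of a domain's residues that lie inside a fragment.

    All coordinates are 1-based and inclusive (an assumption of this sketch).
    """
    overlap = min(frag_end, dom_end) - max(frag_start, dom_start) + 1
    return max(overlap, 0) / (dom_end - dom_start + 1)

def contains_domain(fragment, domain, min_fraction=0.75):
    """True if the fragment covers at least `min_fraction` of the domain."""
    return domain_coverage(*fragment, *domain) >= min_fraction

# Hypothetical coordinates: a 100-residue domain spanning residues 50-149.
domain = (50, 149)
print(contains_domain((1, 150), domain))    # whole domain inside the fragment
print(contains_domain((1, 100), domain))    # only 51 of 100 residues covered
```

Fragments passing this filter for a given family would then be pooled, and their activity scores compared with those of the library overall using whichever statistical test is appropriate.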

Dozens of protein families were associated with active regulators, and some of the strongest associations involved domains with clear connections to translation and RNA decay (Supplementary Data 6). We observed the strongest positive mean activity score among fragments derived from the translation initiation factor eIF3⁴³. We also saw a trend for activators among the DEAD-box helicase family proteins, which include the translation initiation factors eIF4A and Ded1⁴⁴. The endo/exonuclease/phosphatase family showed up among the strong repressors; these include certain subunits of the Ccr4–Not complex, for example⁴². We also saw many families encoding metabolic functions such as adenylosuccinate synthetase⁴⁵, FAD-dependent oxidoreductase and the malic enzyme N-terminal domain. Metabolic enzymes have emerged as cryptic RBPs⁹, and so it seems noteworthy that they appear to show regulatory activity as well. Notably, although many canonical RBDs such as RRMs appear in Pfam, they were not enriched in the active fragments. Canonical RNA-recognition domains appear more important for mRNA target selection, and regions outside of the RNA-interacting domains typically provide regulatory activity for RBPs.

Our screen also identified strong activity in fragments lacking an identifiable, folded domain. Indeed, many proteins contain intrinsically disordered regions (IDRs), which play important roles in post-transcriptional regulation⁴⁶. In some cases, IDRs form protein-protein interactions, as in the case of the disordered N terminus of Ded1^47,48, whereas others serve as flexible linkers^49,50. Functional IDRs can include short linear interaction motifs (SLiMs), which are often responsible for protein-protein interactions⁵¹. Although SLiMs are distinct from Pfam domains, they may be recognizable as peptide sequence motifs.

Motivated by the possibility that SLiMs could explain regulatory effects, we searched for peptide motifs enriched in active fragments using the MEME tool, and then scanned the yeast proteome for occurrences of these motifs using FIMO⁵². Some motifs were highly repetitive; although these repetitive motifs may have regulatory activity, it is difficult to interpret them, so they were excluded. We identified six non-degenerate motifs enriched among activators and repressors (Extended Data Fig. 4a,b and Supplementary Data 7 and 8), which align to genes with functions spanning many aspects of cell biology, including cell wall maintenance, cytoskeleton functions, transcription and translation. The glutamine-rich motif (repressor motif 2 in Extended Data Fig. 4a) is particularly enriched in genes involved in mRNA metabolism, such as NGR1, POP2 and PUF3, which all have diverse roles in mRNA deadenylation and decay^31,53,54. Likewise, the RGG repeat in activator motif 5 (Extended Data Fig. 4b) is widespread among RBPs and is linked to post-transcriptional regulation⁵⁵.

Regulatory RBPs often exert their effects by recruiting and activating core cellular machinery involved in translation and RNA decay. We thus expected that distinct active fragments from our screen might share common interactors. We intersected our library fragments with the physical protein-protein interactions in the BioGRID database⁵⁶ and searched for proteins with a significant over-representation of activating or repressing fragments among their interactors. We identified a dozen proteins enriched for interaction with activators (Supplementary Data 9), most tied clearly to RNA biology (Fig. 4d). Strikingly, the poly-(A) binding protein Pab1 showed one of the highest degrees of enrichment^19,57. The translation regulator Gis2^58,59 was also substantially enriched in activators, and shared many interaction partners with Pab1 (Fig. 4e). Surprisingly, the exonuclease Xrn1 exhibited the strongest enrichment in activator interactions (Fig. 4d), despite its role in mRNA decay⁶⁰. This enrichment may reflect a common core of mRNA-binding proteins that accompany transcripts during both translation and degradation. Alternatively, Xrn1 is reported to promote the translation of some transcripts encoding membrane proteins, and so this enrichment might also represent a more direct effect⁶¹.
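The search for proteins whose interactors are over-represented among activators can be framed as a one-sided enrichment test. The text does not specify which test was used, so the hypergeometric test below is an assumption, as are the function name and the example numbers.

```python
from math import comb

def hypergeom_enrichment_p(hits, n_interactors, n_activators, n_library):
    """P(X >= hits) when n_interactors proteins are drawn without replacement
    from a library of n_library proteins, n_activators of which are activators.
    """
    total = comb(n_library, n_interactors)
    upper = min(n_interactors, n_activators)
    favorable = sum(
        comb(n_activators, i) * comb(n_library - n_activators, n_interactors - i)
        for i in range(hits, upper + 1)
    )
    return favorable / total

# Invented example: 1,000 screened proteins, 100 activators, and a hub protein
# with 20 screened interactors, 10 of which are activators.
p_value = hypergeom_enrichment_p(10, 20, 100, 1000)
print(f"enrichment p-value: {p_value:.2e}")
```

Applying such a test to every protein in an interaction database requires correction for multiple testing, which this sketch omits.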

Endoplasmic reticulum/Golgi protein Gta1 is a bimodal repressor

Several overlapping, C-terminal fragments of the protein Gta1 harboring a repressor-associated peptide emerged as potent repressors (Figs. 4b and 5a and Supplementary Data 5). Although the Gta1 protein co-purifies with the translational machinery³³, genetic evidence links it to Golgi and vesicle transport^62,63, and the Gta1-GFP fusion protein localizes to the endoplasmic reticulum (ER)^64,65,66. Owing to its reported association with ribosomes and the presence of a repressive motif, we generated λN fusion constructs of the strongly repressive Gta1(603–767) fragment and the full-length Gta1 protein, and tested their effects on reporter expression (Fig. 5b).

**Fig. 5: The tethering screen identifies RNA-regulatory roles of poorly characterized proteins.**

Both full-length Gta1 and the Gta1(603–767) fragment robustly reduced median YFP and produced a strongly bimodal distribution of reporter expression (Fig. 5c and Extended Data Fig. 5a), a distinctive pattern that we did not see for any other tethering construct we examined individually. As expression of the isolated Gta1(603–767) fragment slowed cell growth, we focused our analysis on full-length Gta1. Gta1 tethering greatly reduced reporter mRNA abundance (Fig. 5d), suggesting that it promoted mRNA turnover. To track how bimodality emerged when the Gta1-λN fusion was switched on acutely, we expressed it from an inducible promoter. Levels of the YFP reporter began to decline within 1 h of inducing the tethering construct, and clear bimodality emerged within 2 h (Fig. 5e); continuing decline of reporter levels in the lower peak probably reflects the loss of pre-existing YFP through degradation or dilution. Notably, deletion of the repressor fragment that we identified in a Gta1Δ603–767-λN tethering construct abolished this effect entirely (Fig. 5f,g), confirming that the Gta1(603–767) region containing our repressive peptide motif is both necessary and sufficient for its regulatory effect.

We next tested whether the bimodal reporter expression resulted from variation in the abundance of the Gta1 tethering fusion. Indeed, we saw a broad, bimodal distribution of blue fluorescent protein (BFP) fluorescence from the Gta1-BFP-λN construct after 4 h of induction (Fig. 5h), with levels increasing uniformly in the first hour of induction, followed by the emergence of two distinct phenotypes (Fig. 5h). Notably, we saw a similar trajectory after induction of the Gta1Δ603–767-λN tethering construct (Fig. 5i), although it did not affect YFP expression (Fig. 5f). We also measured the mRNA abundance of inducible GTA1, which quickly rose upon induction, then declined substantially after 4 h in the continuous presence of the inducer (Fig. 5j). In contrast, levels of the mRNA encoding the inactive Halo-BFP-λN tethering control increased steadily in the 2 h following induction (Extended Data Fig. 5b). These mRNA abundance measurements reflect population averages, whereas flow cytometry highlights the cell-to-cell variability.

We also noted that induction of full-length GTA1, or of the GTA1Δ603–767 mutant lacking RNA destabilization activity, caused an atypical, elongated morphology and persistent clusters of cells, akin to the filamentous growth that Saccharomyces cerevisiae can undergo upon starvation (Fig. 5k)^67,68. Because Gta1Δ603–767 is not a strong repressor, but still impacts budding morphology (Fig. 5i), RNA turnover appears separable from budding effects.

IDRs mediate regulatory activity

Our library contained many fragments of Ccr4, one of two deadenylases in the Ccr4–Not complex and thus a key mRNA decay factor⁶⁹. Consistent with this role, we identified many repressive Ccr4-derived fragments; the median activity score of Ccr4 fragments was −0.5, and the strongest repressive fragment, Ccr4(2–203), had an activity score of −1.8 (Fig. 6a,b). This strongly repressive fragment originated from the N terminus of Ccr4 rather than the C-terminal nuclease domain^70,71. Indeed, the disordered N terminus yielded the strongest repressors, while the adjoining, folded leucine-rich repeat had little activity on its own (Fig. 6c and Extended Data Fig. 6a). Our results suggest a regulatory role for the disordered N terminus, which is not required for Ccr4 nuclease activity or assembly into the Ccr4–Not complex⁷¹.

**Fig. 6: The tethering screen defines functional domain boundaries of well characterized RBPs.**

A similar pattern emerged among the regulatory fragments derived from the translation initiation factor Ded1. This highly conserved RNA helicase of the DEAD-box family interacts with core translation initiation factors in the cap-binding eIF4F complex and is important for translation of many yeast mRNAs^47,48,72,73. In agreement with its positive role in mRNA expression, Ded1 fragments appeared among the strongest of the post-transcriptional activators (Fig. 6d and Extended Data Fig. 6b). Longer fragments of the disordered N terminus of Ded1 had greater activity (Fig. 6e,f), consistent with deletion analyses that identified two distinct N-terminal regions required for interactions with translation initiation factors eIF4A and eIF4E⁴⁷. Full activity of Ded1 fragments in the tethered-function assay requires both of these interactions, mediated by Ded1(30–60) and Ded1(60–100), respectively. Our analysis of Ded1 and Ccr4 emphasizes that important regulatory effects are often associated with disordered interaction motifs.

Regulatory functions of Sro9 and Cdc48

We identified powerful positive regulatory activity in an N-terminal fragment of Sro9 (Fig. 3b,e). This poorly characterized RBP, one of three La-motif-containing proteins in yeast, associates with translating ribosomes⁷⁴ and translation initiation factors^35,75. It appears to bind and stabilize target mRNAs enriched for functions in protein synthesis³⁵. Sro9 also contains an activator-associated peptide motif (Fig. 7a and Extended Data Fig. 4b), although our validated N-terminal fragment did not include this motif. We thus tested the full-length protein along with one truncation, Sro9(1–151), that encompassed our validated fragment, and a longer Sro9(1–251) truncation that included the activator-rich motif as well. We also tested the remaining C-terminal fragment, Sro9(252–434), which includes the La-motif and is implicated in RNA binding³⁵ (Fig. 7a). Inclusion of the activator-associated motif did not further increase the activity of the Sro9 N terminus, although full-length Sro9 was a substantially stronger activator (Fig. 7b). The C-terminal portion alone was essentially inactive, which is consistent with the separation of RNA-binding regions and effector domains (Fig. 7c). The stronger effect of the full-length protein cannot be explained by differences in protein abundance (Extended Data Fig. 7a), and so the context of full-length Sro9 must potentiate the positive effect of the N-terminal region.

**Fig. 7: The tethering screen reveals regulatory roles of known RBPs.**

Sro9 is reported to interact with several translation factors, including Pab1, which emerged as a common interaction partner for many activators (Fig. 4d,e)³⁵. We thus wanted to test whether the Sro9(1–151) fragment was sufficient for a stable Pab1 interaction. Indeed, we found that Pab1 co-purified with epitope-tagged N-terminal Sro9(1–151) (Fig. 7d and Supplementary Data 7b). The co-purification of Pab1 with full-length Sro9 protein was somewhat stronger than the N terminus, consistent with its stronger activation. Pab1 can enhance expression by stabilizing mRNAs or by promoting their translation. We found that Sro9 tethering increased YFP mRNA abundance by only ~1.5-fold (Fig. 7e), indicating that increased translation explains the majority of its regulatory effect. These results are consistent with the modest quantitative changes in transcript abundance observed in sro9Δ yeast³⁵.

We also observed positive effects upon tethering proteins with little known role in mRNA regulation. Notably, an N-terminal fragment of the AAA ATPase Cdc48 increased reporter expression, although Cdc48 is linked most prominently with protein degradation, including the ER-associated degradation (ERAD) pathway for quality control of transmembrane and secreted proteins⁷⁶. Cdc48 acts as an unfoldase that extracts proteins from membranes and complexes to make them available for proteasomal degradation^77,78,79,80. Cdc48 was recently reported to interact with RNA⁹, although its role in RNA regulation remains mysterious. Cdc48’s known functions suggest that it would negatively regulate protein expression.

Nonetheless, tethering the N-terminal Cdc48(1–155) fragment to a reporter transcript robustly activated its expression (Fig. 7f and Supplementary Data 5). Furthermore, this appeared to result from enhanced translation, as reporter mRNA levels increased only modestly (Fig. 7g). Interestingly, full-length Cdc48 did not show this same activity (Fig. 7f). The N terminus of Cdc48 binds to substrates, cofactors and ubiquitin, while the C-terminal domains form the hexameric AAA ATPase⁷⁶ (Fig. 7h). Our results thus implicate cofactor interactions of the isolated N terminus in translational activation. To test this, we deleted the gene encoding the cofactor Ubx2, which localizes Cdc48 to the sites of ERAD and mitochondrial protein translocation-associated degradation (mitoTAD)^81,82,83 (Fig. 7i), and saw much weaker Cdc48(1–155) activity (Fig. 7j and Extended Data Fig. 7c). Removal of its UBX-domain, which mediates the interaction between Ubx2 and Cdc48⁸², had a smaller impact. Localization of Cdc48(1–155) to endomembranes may recruit tethered transcripts and thereby modulate their expression. Alternatively, the isolated N terminus could displace binding of full-length Cdc48 and reduce protein turnover.

Discussion

We report a broad and unbiased survey of the budding yeast proteome that identifies proteins controlling mRNA translation and decay. We have recovered a wide array of active proteins that includes many known regulators, strongly enriched for RBPs. We have also delineated the active regions within these proteins, revealing that regulatory activity typically maps outside the RBDs themselves. Post-transcriptional regulators thus seem to display a modular architecture, with RBDs that determine their mRNA specificity and separate regulatory domains that modulate the expression of these target transcripts^5,84. We have found regulatory activity associated with folded domains, but also with disordered regions, highlighting the importance of functional IDRs in post-transcriptional regulation⁸⁴. Indeed, the repressive fragments of the Ccr4 deadenylase included its disordered N terminus, and disordered fragments from the N terminus of the translation initiation factor Ded1 activated expression. Two broad models have emerged to explain how such IDRs might show specific molecular functions. General patterns of amino acid composition, such as interspersed acidic and aromatic residues, seem to underlie transcriptional activation by IDRs^85,86,87. Other IDRs harbor SLiMs that act through well-defined peptide-protein docking⁸⁸. Because unstructured regions may be easier to capture in our library, or more likely to function in isolation, we could not determine whether activity was more likely to occur in folded domains or disordered regions. Nonetheless, both degenerate and specific peptide motifs emerged from our survey, suggesting that both modes of action play important roles in mRNA regulation.

Many of the regulators we identified may exert their effect through their interactions with other proteins. This pattern held even in Ccr4 and Ded1, which both contain enzymatic activities that could act directly on a tethered mRNA. Protein-protein interactions can affect expression of a target transcript by recruiting the large, multi-protein complexes involved in translation and mRNA decay or modulating their activity. Indeed, positive regulators were enriched for interaction with the poly(A)-binding protein Pab1, which stabilizes mRNAs and promotes their translation, suggesting that these regulators could converge onto core pathways controlling the fate of mRNAs. Similar patterns have been seen in organisms ranging from trypanosomes to humans, and so this convergence may reflect a general organizational principle of eukaryotic post-transcriptional regulation^16,17.

Although our approach allowed us to take a broad view across the yeast proteome, not restricted to known RBPs, it also necessitated trade-offs. We obtained fragment coverage for roughly half of the yeast proteome. This might in part reflect technical limitations of generating fragments, although we used Tn5 transposase²³, which compares favorably to other methods of random fragmentation. Selection for in-frame fragments could exclude certain proteins based on poor expression or toxicity—although these effects would also interfere with our tethering assay. Additionally, our results reflect regulatory activity on one reporter transcript in a particular growth condition. The regulatory effects of RBPs can vary based on codon optimality⁸⁹ and interactions with other regulators binding the same 3′ UTR^90,91. Understanding how post-transcriptional regulation varies between transcripts and changes in response to cell physiology remains an important challenge.

Despite these limitations, we have identified strong post-transcriptional activity for over 1,000 protein fragments, laying a foundation for future work. The fluorescence-based tethering assay offers a tool to further explore these regulatory networks, understand the mechanistic basis for post-transcriptional regulation, and decipher the functional consequences of the diverse RNA-binding proteome that has recently come into view.

Methods

Strain construction

The dual fluorescent reporter (YFP::PP7/RFP::boxB) strain NIY289 was constructed as follows. pNTI252 was integrated into BY4741 at URA3 to generate NIY106. pNTI476 was integrated into BY4742 at URA3 to generate NIY287. NIY106 was crossed with NIY287 to create NIY289. The dual fluorescent reporter (YFP::boxB/RFP::PP7) strain NIY293 was constructed as follows. pNTI282 was integrated into BY4741 at URA3 to generate NIY114. pNTI473 was integrated into BY4742 at URA3 to generate NIY286. NIY114 was crossed with NIY286 to create NIY293. Wild-type BY4741 was used for the in-frame library selection. The yKS090 dual reporter strain expressing the ZIF synthetic transcription factor was generated by integrating pNTI727 into the yeast XII-5 integration site⁹². The UBX2 mutant strains were generated as follows. We amplified the KanMX cassette from pCfB2225 with primers to generate homologous overlapping sequences to the UBX2 locus in the yeast genome, and then integrated this cassette into one locus of UBX2 in NIY293. We verified correct chromosomal integration by colony polymerase chain reaction (PCR), which indicated a heterozygous ubx2Δ/UBX2 genotype (yKS092). We then amplified the KILEU2 cassette from pCfB2189 with either homologous sequences to the remaining UBX2 locus or to the C-terminal UBX domain of UBX2, and then integrated these amplicons into yKS092 to create the ubx2Δ and ubx2Δc strains, respectively (yKS093 and yKS094). Genotypes were confirmed through colony PCR. Plasmids and strains are listed in Supplementary Tables 1 and 2, respectively.

Culturing conditions

Cultures for the single protein tethering assay were grown to mid-exponential growth phase at an optical density at 600 nm (OD₆₀₀) of 0.6 then collected via gentle centrifugation at 5,000g for 1 min for flow cytometry analysis with 30-min incubation in 4% paraformaldehyde (PFA). For in-frame fragment selection, cultures were incubated after transformation at 30 °C with shaking for 96 h in SCD-His medium with twice-daily back-dilution to avoid culture saturation. NIY293 was transformed with the in-frame tethered fragment library using the high-efficiency lithium acetate method⁹³ and then transferred to a turbidostat⁹⁴ for 48 h in SCD-His medium before collecting cells. Cultures for the inducible Gta1 tethering assay were grown to stationary phase overnight, back-diluted to OD₆₀₀ of 0.1 and allowed to reach early exponential growth phase. The tethered proteins were then induced with 5 nM β-estradiol, then collected and fixed in PFA as described above.

Flow cytometric measurements and fluorescence-activated cell sorting

The expression of YFP and RFP in the tethering assay was measured by flow cytometry on a BD LSR Fortessa X20 using BD FACSDiva version 6.2, with excitation by a 488-nm blue laser and a 561-nm yellow-green laser, captured on the FITC and PE-TexRed channels, respectively. Fluorescence measurements for 50,000 cells were collected per sample, and gates were drawn to include the ~25% of cells with modal forward- and side-scatter. FACS was performed with an Aria Fusion sorter by gating four equal-sized populations based on the ratio of FITC and PE-TexRed emission. Approximately two million cells were sorted into each gate. The sort was performed with two technical replicate libraries from the same library transformation.
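As an illustration of ratio-based gating, the following minimal sketch splits cells into four equal-sized bins according to the log2 ratio of the two fluorescence channels. It is not the sorter configuration used here: the function names `log_ratio` and `assign_gates` are hypothetical, and placing the gate boundaries at the quartiles of the measured ratios is an assumption made for illustration.

```python
import math
from statistics import quantiles


def log_ratio(yfp, rfp):
    """Return log2(YFP / RFP) for one cell; both intensities must be positive."""
    return math.log2(yfp / rfp)


def assign_gates(ratios, n_gates=4):
    """Assign each ratio to one of n_gates equal-sized bins (0 = lowest ratio).

    Assumption: gate boundaries are the quantile cut points of the data.
    """
    cuts = quantiles(ratios, n=n_gates)  # n_gates - 1 cut points
    # The gate index is the number of cut points lying below the value.
    return [sum(r > c for c in cuts) for r in ratios]


if __name__ == "__main__":
    # Hypothetical (YFP, RFP) intensities for eight cells.
    cells = [(100, 400), (120, 400), (200, 400), (250, 400),
             (400, 400), (500, 400), (900, 400), (1600, 400)]
    ratios = [log_ratio(y, r) for y, r in cells]
    print(assign_gates(ratios))
```

Working with the log ratio rather than the raw ratio treats equal fold increases and decreases symmetrically, which is convenient when repressors and activators are scored on the same scale.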

In-frame and fragment tethering library generation

Genomic yeast DNA was tagmented using the Nextera XT DNA library prep kit from Illumina, and then size-selected with Beckman AMPureXP beads. Size selection was confirmed with an Agilent Tapestation 2200 on a High Sensitivity D1000 Screentape (Extended Data Fig. 2a). BY4741 yeast were then co-transformed with the tagmented yeast gDNA and linear pKS132 and cultured as described above. After a long outgrowth in selective media, plasmids were collected with the Zymo yeast miniprep kit. The selected fragments were then amplified by PCR with primers KS605(GTAATTATCTACTTTTTACAACAAATcctgcaggGGCTCGGAGATGTGTATAAGAGACAG) and KS630(CTGTCTCTTATACACATCTGACGCcGGAAGCGGAAGCGGAAGCCGCGCCGACGCACAAAC), designed to anneal to the Nextera XT sites introduced during tagmentation, and subcloned into the SbfI-linearized tethering library vector pKS137 by Gibson assembly. This tethering library was propagated in DH10β cells. Barcodes were then introduced by Gibson assembly of N25 randomized oligonucleotide barcodes, amplified with KS633(ACGAGGCGCGTGTAAGTTACAGGCAAGCGATCCGTCCGTAATACGACTCACTATAGCACG) and KS634(GATCCTGTAGCCCTAGACTTGATAGCCATGACTTCAACTCAAGACGCACAGATATTATAA) into the BamHI-linearized tethering library. Assembly reactions were transformed into DH10β and selected in liquid cultures at varying dilutions to obtain a transformant pool with approximately three barcodes per fragment. This library was transformed into NIY293 through the lithium acetate method as described in ref. ⁹³. Transformations were used to inoculate a turbidostat and grown in selective SCD-His media for 48 h before performing FACS on live cells. Library plasmid DNA was then collected from sorted cells with the Zymo yeast plasmid prep kit, then barcode RNA was in vitro-transcribed with the HiScribe T7 High Yield RNA kit from New England Biolabs. RNA was collected with the previously described phenol chloroform method⁹⁵. All PCR reactions were performed using Q5 polymerase according to the manufacturer’s protocols. 
Barcodes were amplified through a limited-cycle PCR with Illumina dual-index primers. Barcodes were assigned to yeast fragment DNA with next-generation sequencing using the PacBio single molecule real-time (SMRT) technology (Supplementary Data 2).

Barcode quantification and sequencing analysis

Sequencing data were processed using cutadapt v1.16 to remove sequencing adapter sequences. HISAT2 v2.1.0 was used to align sequencing reads to the yeast genome to identify fragment DNA. Trimmed barcodes were then counted and tabulated as described in ref. ⁹⁶. Barcodes that lacked at least 32 counts in one of the sorted gates were filtered out.
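The count filter can be sketched as follows. This is a hypothetical illustration rather than the analysis code: the function name `filter_barcodes` and the input layout (a mapping from barcode to per-gate read counts) are assumptions, and the threshold is interpreted here as requiring at least 32 counts in at least one gate.

```python
def filter_barcodes(counts, min_count=32):
    """Keep barcodes with at least min_count reads in at least one sorted gate.

    counts: dict mapping barcode -> list of read counts, one per gate.
    Assumption: "in one of the sorted gates" means "in at least one gate".
    """
    return {
        barcode: gates
        for barcode, gates in counts.items()
        if max(gates) >= min_count
    }


if __name__ == "__main__":
    # Hypothetical counts across four gates for three barcodes.
    example = {
        "bc_A": [3, 10, 40, 5],    # kept: 40 >= 32
        "bc_B": [31, 31, 31, 31],  # removed: no gate reaches 32
        "bc_C": [0, 0, 0, 32],     # kept: exactly at threshold
    }
    print(sorted(filter_barcodes(example)))
```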

RNA quantification

Total RNA was collected from triplicate cultures of each strain using the phenol chloroform method⁹⁵. YFP reporter RNA expression in the tethering assay was quantified by RT–qPCR, comparing YFP Ct values to RFP Ct values; the resulting Ct differences for cells expressing an active tethering protein were then compared to the Ct differences for a tethered Halo protein control. The fold change in GTA1 expression in the induced cultures (Extended Data Fig. 5b) was compared to the endogenous GTA1 levels in the Halo-expressing control strain.
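The comparison of Ct differences corresponds to a standard ΔΔCt calculation, sketched below. The function names are hypothetical, and the conversion to fold change assumes that amplification doubles the product each cycle (roughly 100% primer efficiency), which is not stated in the text.

```python
def delta_ct(ct_yfp, ct_rfp):
    """Ct difference between the YFP reporter and the RFP reference."""
    return ct_yfp - ct_rfp


def fold_change(ct_yfp, ct_rfp, ctrl_ct_yfp, ctrl_ct_rfp):
    """Relative YFP mRNA level versus the tethered Halo control.

    Assumption: product doubles every cycle, so fold change = 2 ** (-ddCt).
    """
    ddct = delta_ct(ct_yfp, ct_rfp) - delta_ct(ctrl_ct_yfp, ctrl_ct_rfp)
    return 2.0 ** (-ddct)


if __name__ == "__main__":
    # Hypothetical Ct values: YFP crosses threshold one cycle earlier
    # (relative to RFP) in the sample than in the Halo control.
    print(fold_change(20.0, 22.0, 21.0, 22.0))
```

A lower Ct means more template, so a negative ΔΔCt corresponds to higher YFP mRNA abundance relative to the control.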

Domain and motif enrichment analysis

The search for domain enrichment among the tethered library fragments proceeded as follows. Active fragments were first considered as those with an activity score of less than −1 or more than +1. Active fragments that were 90% or more similar to another fragment were considered overlapping, and the most highly sequenced from a group of overlapping fragments was used in the analysis. A given Pfam protein domain was considered represented if one or more fragments covered at least 75% of the domain. The activity scores of represented protein domains were averaged and the mean value was reported for each domain. False discovery rates were calculated with the Benjamini–Hochberg procedure, and domains with an adjusted P value of less than 0.05 are reported in Extended Data Fig. 4c.
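Two steps of this procedure lend themselves to a short sketch: the 75% domain-coverage criterion and the Benjamini–Hochberg adjustment. The code below is illustrative only. The function names and the use of 1-based inclusive residue coordinates are assumptions, and the statistical test that produces the input P values is not specified here.

```python
def domain_coverage(frag_start, frag_end, dom_start, dom_end):
    """Fraction of a domain covered by a fragment.

    Coordinates are assumed to be 1-based, inclusive residue positions.
    """
    overlap = min(frag_end, dom_end) - max(frag_start, dom_start) + 1
    return max(overlap, 0) / (dom_end - dom_start + 1)


def is_represented(fragments, dom_start, dom_end, min_fraction=0.75):
    """True if any (start, end) fragment covers at least min_fraction of the domain."""
    return any(
        domain_coverage(s, e, dom_start, dom_end) >= min_fraction
        for s, e in fragments
    )


def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted P values, in the input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest P value to the smallest, enforcing monotonicity.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


if __name__ == "__main__":
    print(is_represented([(1, 100)], 26, 125))            # 75 of 100 residues
    print(benjamini_hochberg([0.01, 0.04, 0.03, 0.005]))
```

Domains whose adjusted P value falls below 0.05 would then pass the reporting threshold described above.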

To search for short peptide motifs enriched in our active fragments, we again considered active fragments as those with an activity score of less than −1 or greater than +1. We then ran MEME analysis to search for recurring motifs within the sequences of our active fragments⁵². We collapsed sequences that were 50% or more similar into the same fragment to avoid detecting a motif multiple times within the same gene. We then used FIMO⁵² to scan the yeast genome for occurrences of the motifs that were enriched in our active fragments. We manually removed two motifs that came from a single peptide sequence as these did not represent a consensus sequence from multiple distinct proteins, and we removed alignments that fell within highly repetitive genomic sequences.

Protein expression analysis via western blotting

Total protein was isolated from mid-exponentially growing yeast by rapid capture of protein expression through 5% trichloroacetic acid treatment for 10 min, followed by a wash in acetonitrile. The cell pellets were then dried at room temperature for 30 min before bead-beating in Tris-acetate-EDTA buffer for 5 min at room temperature. Samples were then resuspended in SDS loading buffer from NuPage, boiled for 5 min, loaded on 4–12% polyacrylamide Bis-Tris gels and separated by electrophoresis in MOPS buffer. Proteins were then transferred to a nitrocellulose membrane, which was blocked for 1 h in TBST (Tris-buffered saline, 0.1% Tween 20) with 5% bovine serum albumin. Membranes were incubated with primary antibodies (DYKDDDDK Tag Antibody, Cell Signaling Technology 2368S; α-Pab1 Antibodies-Online ABIN1580454 (clone 1G1)) at a 1:1,000 dilution in TBST for 1 h at room temperature, washed with TBST, then incubated for 30 min at room temperature with anti-rabbit (Cell Signaling Technology, 7074S) and anti-mouse (Cytiva NA931-1ML) HRP-linked antibodies at a 1:10,000 dilution. Membranes were developed with Pierce ECL western blotting substrate and imaged on the chemiluminescence channel on a ProteinSimple instrument.

Microscopy

Mid-exponential phase cells were collected by gentle centrifugation and then fixed for 30 min in 4% PFA. The cells were washed in 1× PBS buffer and visualized with a Leica DM IL LED microscope at ×40 magnification, acquired with Leica Application Suite v4.8.0, and processed using ImageJ 1.53t. Fields of view for saved images were randomly selected.

Statistics and reproducibility

No statistical method was used to predetermine sample sizes, but our sample sizes are similar to those reported in previous publications (ref. ⁹⁷). No data were excluded from the analyses, and all replicate experiments were successful. The experiments were not randomized. The investigators were not blinded to allocation during experiments and outcome assessment. Data distributions were assumed to be normal, but this was not formally tested.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this Article.

Data availability

High-throughput sequencing data have been deposited with the NCBI Short Read Archive. Long-read sequencing linking tethering constructs and barcodes is available at SRR10355648, and short-read sequencing quantifying the barcodes is available at SRR10353306 through SRR10353315. Publicly available datasets used here include the S. cerevisiae proteome and Pfam domain annotations from InterPro proteome UP000002311, and S. cerevisiae genome annotations from http://sgd-archive.yeastgenome.org/sequence/S288C_reference/genome_releases/S288C_reference_genome_R64-2-1_20150113.tgz. BioGRID data are from https://downloads.thebiogrid.org/Download/BioGRID/Release-Archive/BIOGRID-4.3.195/BIOGRID-MV-Physical-4.3.195.tab3.zip. Source data are provided with this paper.

Code availability

Custom software and scripts are available from Zenodo at https://doi.org/10.5281/zenodo.4963329.

References

Gebauer, F., Preiss, T. & Hentze, M. W. From cis-regulatory elements to complex RNPs and back. Cold Spring Harb. Perspect. Biol. 4, a012245 (2012).
Hafner, M. et al. Transcriptome-wide identification of RNA-binding protein and microRNA target sites by PAR-CLIP. Cell 141, 129–141 (2010).
Gerstberger, S., Hafner, M. & Tuschl, T. A census of human RNA-binding proteins. Nat. Rev. Genet. 15, 829–845 (2014).
Ott, M. Cell biology: choreography of protein synthesis. Nature 533, 472–473 (2016).
Lunde, B. M., Moore, C. & Varani, G. RNA-binding proteins: modular design for efficient function. Nat. Rev. Mol. Cell Biol. 8, 479–490 (2007).
Kramer, K. et al. Photo-cross-linking and high-resolution mass spectrometry for assignment of RNA-binding sites in RNA-binding proteins. Nat. Methods 11, 1064–1070 (2014).
Lambert, N. et al. RNA Bind-n-Seq: quantitative assessment of the sequence and structural binding specificity of RNA binding proteins. Mol. Cell 54, 887–900 (2014).
Castello, A. et al. Insights into RNA biology from an atlas of mammalian mRNA-binding proteins. Cell 149, 1393–1406 (2012).
Beckmann, B. M. et al. The RNA-binding proteomes from yeast to man harbour conserved enigmRBPs. Nat. Commun. 6, 10127 (2015).
Licatalosi, D. D. et al. HITS-CLIP yields genome-wide insights into brain alternative RNA processing. Nature 456, 464–469 (2008).
Van Nostrand, E. L. et al. A large-scale binding and functional map of human RNA-binding proteins. Nature 583, 711–719 (2020).
Hafner, M. et al. CLIP and complementary methods. Nat. Rev. Methods Prim. 1, 20 (2021).
Van Nostrand, E. L. et al. Principles of RNA processing from analysis of enhanced CLIP maps for 150 RNA binding proteins. Genome Biol. 21, 90 (2020).
Coller, J. M., Gray, N. K. & Wickens, M. P. mRNA stabilization by poly(A) binding protein is independent of poly(A) and requires translation. Genes Dev. 12, 3226–3235 (1998).
Bos, T. J., Nussbacher, J. K., Aigner, S. & Yeo, G. W. Tethered function assays as tools to elucidate the molecular roles of RNA-binding proteins. Adv. Exp. Med. Biol. 907, 61–88 (2016).
Luo, E.-C. et al. Large-scale tethered function assays identify factors that regulate mRNA stability and translation. Nat. Struct. Mol. Biol. 27, 989–1000 (2020).
Erben, E. D., Fadda, A., Lueong, S., Hoheisel, J. D. & Clayton, C. A genome-wide tethering screen reveals novel potential post-transcriptional regulators in Trypanosoma brucei. PLoS Pathog. 10, e1004178 (2014).
Keryer-Bibens, C., Barreau, C. & Osborne, H. B. Tethering of proteins to RNAs by bacteriophage proteins. Biol. Cell 100, 125–138 (2008).
Kessler, S. H. & Sachs, A. B. RNA recognition motif 2 of yeast Pab1p is required for its functional interaction with eukaryotic translation initiation factor 4G. Mol. Cell. Biol. 18, 51–57 (1998).
Collart, M. A. The Ccr4-Not complex is a key regulator of eukaryotic gene expression. Wiley Interdiscip. Rev. RNA 7, 438–454 (2016).
Finoux, A.-L. & Séraphin, B. In vivo targeting of the yeast Pop2 deadenylase subunit to reporter transcripts induces their rapid degradation and generates new decay intermediates. J. Biol. Chem. 281, 25940–25947 (2006).
Winstall, E., Sadowski, M., Kuhn, U., Wahle, E. & Sachs, A. B. The Saccharomyces cerevisiae RNA-binding protein Rbp29 functions in cytoplasmic mRNA metabolism. J. Biol. Chem. 275, 21817–21826 (2000).
Adey, A. et al. Rapid, low-input, low-bias construction of shotgun fragment libraries by high-density in vitro transposition. Genome Biol. 11, R119 (2010).
Lu, B. et al. Transposase-assisted tagmentation of RNA/DNA hybrid duplexes. eLife 9, e54919 (2020).
Heger, A. & Holm, L. Exhaustive enumeration of protein domain families. J. Mol. Biol. 328, 749–767 (2003).
Michlits, G. et al. CRISPR-UMI: single-cell lineage tracing of pooled CRISPR–Cas9 screens. Nat. Methods 14, 1191–1197 (2017).
Schmierer, B. et al. CRISPR/Cas9 screening using unique molecular identifiers. Mol. Syst. Biol. 13, 945 (2017).
Chuang, R. Y., Weaver, P. L., Liu, Z. & Chang, T. H. Requirement of the DEAD-Box protein ded1p for messenger RNA translation. Science 275, 1468–1471 (1997).
De La Cruz, J., Iost, I., Kressler, D. & Linder, P. The p20 and Ded1 proteins have antagonistic roles in eIF4E-dependent translation in Saccharomyces cerevisiae. Proc. Natl Acad. Sci. USA 94, 5201–5206 (1997).
Tarn, W.-Y. & Chang, T.-H. The current understanding of Ded1p/DDX3 homologs from yeast to human. RNA Biol. 6, 17–20 (2009).
Chang, L.-C. & Lee, F.-J. S. The RNA helicase Dhh1p cooperates with Rbp1p to promote porin mRNA decay via its non-conserved C-terminal domain. Nucleic Acids Res. 40, 1331–1344 (2012).
Swaney, D. L. et al. Global analysis of phosphorylation and ubiquitylation cross-talk in protein degradation. Nat. Methods 10, 676–682 (2013).
Fleischer, T. C., Weaver, C. M., McAfee, K. J., Jennings, J. L. & Link, A. J. Systematic identification and functional screens of uncharacterized proteins associated with eukaryotic ribosomal complexes. Genes Dev. 20, 1294–1307 (2006).
Brandariz-Núñez, A., Zeng, F., Lam, Q. N. & Jin, H. Sbp1 modulates the translation of Pab1 mRNA in a poly(A)- and RGG-dependent manner. RNA 24, 43–55 (2018).
Kershaw, C. J. et al. The yeast La related protein Slf1p is a key activator of translation during the oxidative stress response. PLoS Genet. 11, e1004903 (2015).
Wendland, B. & Emr, S. D. Pan1p, yeast eps15, functions as a multivalent adaptor that coordinates protein-protein interactions essential for endocytosis. J. Cell Biol. 141, 71–84 (1998).
Kaksonen, M. & Roux, A. Mechanisms of clathrin-mediated endocytosis. Nat. Rev. Mol. Cell Biol. 19, 313–326 (2018).
Hogan, D. J., Riordan, D. P., Gerber, A. P., Herschlag, D. & Brown, P. O. Diverse RNA-binding proteins interact with functionally related sets of RNAs, suggesting an extensive regulatory system. PLoS Biol. 6, e255 (2008).
Mitchell, S. F., Jain, S., She, M. & Parker, R. Global analysis of yeast mRNPs. Nat. Struct. Mol. Biol. 20, 127–133 (2013).
Beckmann, B. M. RNA interactome capture in yeast. Methods 118–119, 82–92 (2017).
Tsvetanova, N. G., Klass, D. M., Salzman, J. & Brown, P. O. Proteome-wide search reveals unexpected RNA-binding proteins in Saccharomyces cerevisiae. PLoS ONE 5, e12671 (2010).
El-Gebali, S. et al. The Pfam protein families database in 2019. Nucleic Acids Res. 47, D427–D432 (2019).
Aitken, C. E. et al. Eukaryotic translation initiation factor 3 plays distinct roles at the mRNA entry and exit channels of the ribosomal preinitiation complex. eLife 5, e20934 (2016).
Linder, P. & Jankowsky, E. From unwinding to clamping—the DEAD box RNA helicase family. Nat. Rev. Mol. Cell Biol. 12, 505–516 (2011).
Honzatko, R. B., Stayton, M. M. & Fromm, H. J. Adenylosuccinate synthetase: recent developments. Adv. Enzymol. Relat. Areas Mol. Biol. 73, 57–102 (1999).
Varadi, M., Zsolyomi, F., Guharoy, M. & Tompa, P. Functional advantages of conserved intrinsic disorder in RNA-binding proteins. PLoS ONE 10, e0139731 (2015).
Gulay, S., Gupta, N., Lorsch, J. R. & Hinnebusch, A. G. Distinct interactions of eIF4A and eIF4E with RNA helicase Ded1 stimulate translation in vivo. eLife 9, e58243 (2020).
Gupta, N., Lorsch, J. R. & Hinnebusch, A. G. Yeast Ded1 promotes 48S translation pre-initiation complex assembly in an mRNA-specific and eIF4F-dependent manner. eLife 7, e38892 (2018).
Oberstrass, F. C. et al. Structure of PTB bound to RNA: specific binding and implications for splicing regulation. Science 309, 2054–2057 (2005).
Diarra dit Konté, N. et al. Aromatic side-chain conformational switch on the surface of the RNA recognition motif enables RNA discrimination. Nat. Commun. 8, 654 (2017).
Balcerak, A., Trebinska-Stryjewska, A., Konopinski, R., Wakula, M. & Grzybowska, E. A. RNA-protein interactions: disorder, moonlighting and junk contribute to eukaryotic complexity. Open Biol. 9, 190096 (2019).
Bailey, T. L. et al. MEME SUITE: tools for motif discovery and searching. Nucleic Acids Res. 37, W202–W208 (2009).
Olivas, W. & Parker, R. The Puf3 protein is a transcript-specific regulator of mRNA degradation in yeast. EMBO J. 19, 6602–6611 (2000).
Parker, R. & Song, H. The enzymes and control of eukaryotic mRNA turnover. Nat. Struct. Mol. Biol. 11, 121–127 (2004).
Thandapani, P., O’Connor, T. R., Bailey, T. L. & Richard, S. Defining the RGG/RG motif. Mol. Cell 50, 613–623 (2013).
Oughtred, R. et al. The BioGRID database: a comprehensive biomedical resource of curated protein, genetic and chemical interactions. Protein Sci. 30, 187–200 (2021).
Sachs, A. The role of poly(A) in the translation and stability of mRNA. Curr. Opin. Cell Biol. 2, 1092–1098 (1990).
Rojas, M. et al. Yeast Gis2 and its human ortholog CNBP are novel components of stress-induced RNP granules. PLoS ONE 7, e52824 (2012).
Sammons, M. A., Samir, P. & Link, A. J. Saccharomyces cerevisiae Gis2 interacts with the translation machinery and is orthologous to myotonic dystrophy type 2 protein ZNF9. Biochem. Biophys. Res. Commun. 406, 13–19 (2011).
Parker, R. RNA degradation in Saccharomyces cerevisae. Genetics 191, 671–702 (2012).
Blasco-Moreno, B. et al. The exonuclease Xrn1 activates transcription and translation of mRNAs encoding membrane proteins. Nat. Commun. 10, 1298 (2019).
Costanzo, M. et al. A global genetic interaction network maps a wiring diagram of cellular function. Science 353, aaf1420 (2016).
Mattiazzi Usaj, M. et al. Systematic genetics and single-cell imaging reveal widespread morphological pleiotropy and cell-to-cell variability. Mol. Syst. Biol. 16, e9243 (2020).
Huh, W.-K. et al. Global analysis of protein localization in budding yeast. Nature 425, 686–691 (2003).
Chong, Y. T. et al. Yeast proteome dynamics from single cell imaging and automated analysis. Cell 161, 1413–1424 (2015).
Kraus, O. Z. et al. Automated analysis of high‐content microscopy data with deep learning. Mol. Syst. Biol. 13, 924 (2017).
Lorenz, M. C., Cutler, N. S. & Heitman, J. Characterization of alcohol-induced filamentous growth in Saccharomyces cerevisiae. Mol. Biol. Cell 11, 183–199 (2000).
Cullen, P. J. & Sprague, G. F. Jr The regulation of filamentous growth in yeast. Genetics 190, 23–49 (2012).
Webster, M. W. et al. mRNA deadenylation is coupled to translation rates by the differential activities of Ccr4-Not nucleases. Mol. Cell 70, 1089–1100 (2018).
Xu, K., Bai, Y., Zhang, A., Zhang, Q. & Bartlam, M. G. Insights into the structure and architecture of the CCR4-NOT complex. Front. Genet. 5, 137 (2014).
Basquin, J. et al. Architecture of the nuclease module of the yeast Ccr4-not complex: the Not1-Caf1-Ccr4 interaction. Mol. Cell 48, 207–218 (2012).
Sen, N. D., Zhou, F., Ingolia, N. T. & Hinnebusch, A. G. Genome-wide analysis of translational efficiency reveals distinct but overlapping functions of yeast DEAD-box RNA helicases Ded1 and eIF4A. Genome Res. 25, 1196–1205 (2015).
Guenther, U.-P. et al. The helicase Ded1p controls use of near-cognate translation initiation codons in 5′ UTRs. Nature 559, 130–134 (2018).
Sobel, S. G. & Wolin, S. L. Two yeast La motif-containing proteins are RNA-binding proteins that associate with polyribosomes. Mol. Biol. Cell 10, 3849–3862 (1999).
Castelli, L. M. et al. The 4E-BP Caf20p mediates both eIF4E-dependent and independent repression of translation. PLoS Genet. 11, e1005233 (2015).
Baek, G. H. et al. Cdc48: a Swiss army knife of cell biology. J. Amino Acids 2013, 183421 (2013).
Sommer, T. & Wolf, D. H. Endoplasmic reticulum degradation: reverse protein flow of no return. FASEB J. 11, 1227–1233 (1997).
Kostova, Z. & Wolf, D. H. For whom the bell tolls: protein quality control of the endoplasmic reticulum and the ubiquitin-proteasome connection. EMBO J. 22, 2309–2317 (2003).
Wolf, D. H. & Stolz, A. The Cdc48 machine in endoplasmic reticulum associated protein degradation. Biochim. Biophys. Acta 1823, 117–124 (2012).
Olszewski, M. M., Williams, C., Dong, K. C. & Martin, A. The Cdc48 unfoldase prepares well-folded protein substrates for degradation by the 26S proteasome. Commun. Biol. 2, 29 (2019).
Hartmann-Petersen, R. et al. The Ubx2 and Ubx3 cofactors direct Cdc48 activity to proteolytic and nonproteolytic ubiquitin-dependent processes. Curr. Biol. 14, 824–828 (2004).
Neuber, O., Jarosch, E., Volkwein, C., Walter, J. & Sommer, T. Ubx2 links the Cdc48 complex to ER-associated protein degradation. Nat. Cell Biol. 7, 993–998 (2005).
Mårtensson, C. U. et al. Mitochondrial protein translocation-associated degradation. Nature 569, 679–683 (2019).
Gebauer, F., Schwarzl, T., Valcárcel, J. & Hentze, M. W. RNA-binding proteins in human genetic disease. Nat. Rev. Genet. 22, 185–198 (2021).
Ravarani, C. N. et al. High-throughput discovery of functional disordered regions: investigation of transactivation domains. Mol. Syst. Biol. 14, e8190 (2018).
Staller, M. V. et al. A high-throughput mutational scan of an intrinsically disordered acidic transcriptional activation domain. Cell Syst. 6, 444–455 (2018).
Erijman, A. et al. A high-throughput screen for transcription activation domains reveals their sequence features and permits prediction by deep learning. Mol. Cell 78, 890–902 (2020).
Davey, N. E. et al. Attributes of short linear motifs. Mol. Biosyst. 8, 268–281 (2012).
Radhakrishnan, A. et al. The DEAD-Box protein Dhh1p couples mRNA decay and translation by monitoring codon optimality. Cell 167, 122–132 (2016).
Cottrell, K. A., Chaudhari, H. G., Cohen, B. A. & Djuranovic, S. PTRE-seq reveals mechanism and interactions of RNA binding proteins and miRNAs. Nat. Commun. 9, 301 (2018).
Aranda-Díaz, A., Mace, K., Zuleta, I., Harrigan, P. & El-Samad, H. Robust synthetic circuits for two-dimensional control of gene expression in yeast. ACS Synth. Biol. 6, 545–554 (2017).
Stovicek, V., Borja, G. M., Forster, J. & Borodina, I. EasyClone 2.0: expanded toolkit of integrative vectors for stable gene expression in industrial Saccharomyces cerevisiae strains. J. Ind. Microbiol. Biotechnol. 42, 1519–1531 (2015).
Kawai, S., Hashimoto, W. & Murata, K. Transformation of Saccharomyces cerevisiae and other fungi: methods and possible underlying mechanism. Bioeng. Bugs 1, 395–403 (2010).
McGeachy, A. M., Meacham, Z. A. & Ingolia, N. T. An accessible continuous-culture turbidostat for pooled analysis of complex libraries. ACS Synth. Biol. 8, 844–856 (2019).
Nilsen, T. W. The fundamentals of RNA purification. Cold Spring Harb. Protoc. 2013, 618–624 (2013).
Muller, R., Meacham, Z. A., Ferguson, L. & Ingolia, N. T. CiBER-seq dissects genetic networks by quantitative CRISPRi profiling of expression phenotypes. Science 370, eabb9662 (2020).
Reynaud, K., Brothers, M., Ly, M. & Ingolia, N. T. Dynamic post-transcriptional regulation by Mrn1 links cell wall homeostasis to mitochondrial structure and function. PLoS Genet. 17, e1009521 (2021).


Acknowledgements

We thank S. Iwasaki and G. Brar for insightful comments, R. Muller (UC Berkeley) for library plasmids, S. Fernandez for assistance with computational data visualization, and J. Lobel along with other members of the Ingolia laboratory for thoughtful scientific discussions. This work was supported by NIH grants DP2 CA195768 (N.T.I.) and R01 GM130996 (N.T.I.); by the Rose Hills Innovator Program; by the Vincent J. Coates Genomics Sequencing Laboratory at UC Berkeley, which is supported by NIH S10 OD018174 Instrumentation Grant; and by the Flow Cytometry Facility at UC Berkeley, the UC Berkeley DNA Sequencing Facility and the UC Davis Proteomics Core Facility.

Author information

Authors and Affiliations

California Institute for Quantitative Biosciences, University of California, Berkeley, Berkeley, CA, USA
Kendra Reynaud & Nicholas T. Ingolia
Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA, USA
Anna M. McGeachy, David Noble, Zuriah A. Meacham & Nicholas T. Ingolia

Authors

Kendra Reynaud
Anna M. McGeachy
David Noble
Zuriah A. Meacham
Nicholas T. Ingolia

Contributions

K.R., A.M. and N.I. conceived and designed the experiments. K.R. and A.M. carried out experiments with assistance from Z.M. K.R., N.I. and D.N. analyzed the data. N.I. supervised the project. K.R. and N.I. drafted the manuscript, with revisions from A.M.

Corresponding author

Correspondence to Nicholas T. Ingolia.

Ethics declarations

Competing interests

N.I. declares financial interests in Velia Therapeutics and Tevard Biosciences. The other authors declare no competing interests.

Peer review

Peer review information

Nature Structural & Molecular Biology thanks Gian Gaetano Tartaglia and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Primary Handling Editor: Carolina Perdigoto, in collaboration with the Nature Structural & Molecular Biology team. Peer reviewer reports are available.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data

Extended Data Fig. 1 The dual reporter tethering assay reports reproducible and quantitative regulatory effects.

a, Pat1 activity tethered to 3′ UTR of both fluorescent reporters with PP7 is reproducible between replicates and fluorophores. b, Schematic representation of Sgn1 recruiting Pab1 and eIF4G in the tethering assay. c, Comparison of RFP and YFP fluorescence with Sgn1 or a non-regulator control tethered to YFP (n = 3, one representative replicate depicted). d, YFP fluorescence with Sgn1 and the non-regulator Halo protein tethered to the 3′ UTR. e, RFP fluorescence with Sgn1 and the non-regulator Halo protein tethered to the 3′ UTR of YFP. f, Forward scatter fluorescence of Sgn1 and control protein expressing cells indicates larger cell size in the Sgn1 samples. g, RT-qPCR analysis of YFP mRNA levels with Sgn1 or the control tethered to the 3′ UTR.

Extended Data Fig. 2 Generating an unbiased, proteome-wide survey of tethered in-frame protein fragments.

a, Bioanalyzer analysis of tagmented genomic DNA library size distribution. b, As in Fig. 2e, for Pab1(188–403). c, As in Fig. 2e, for Cth1(38–91). d, Comparison of library-wide activity scores between screen replicates I and II (r = 0.7).

Extended Data Fig. 3 Protein fragment activity in the tethering screen represents real, verifiable regulatory function.

a, Sequencing read counts and activity scores for validated screen fragments. b, Schematic key depicting FACS bins based on position and color. c, Comparison of activity scores between screen replicates I and II for validated fragments. d, Exemplary FSC and SSC data showing gate in red. e, Absolute BFP fluorescence measurements by flow cytometry showing expression of validated fragment and BFP fusion proteins.

Extended Data Fig. 4 Motifs enriched amongst most active screen fragments.

a, Peptide motifs significantly enriched amongst repressor screen fragments. Counts represent significant occurrences of that motif in the yeast genome. b, As in a, for the activator screen fragments.

Extended Data Fig. 5 The tethering screen identifies RNA-regulatory roles of poorly characterized proteins.

a, Histogram of BFP fluorescence as a measure for control, Gta1 and Gta1(603–767) expression and stability in the tethering assay, normalized to control BFP levels. b, RT-qPCR analysis of induced GTA1 and control mRNA over time, normalized to expression levels at 5 minutes induction.

Extended Data Fig. 6 The tethering screen defines functional domain boundaries of well characterized RBPs.

a, As in Fig. 6a, for fragments derived from the Ccr4(1–200) disordered N-terminal domain (top), the Ccr4(60–390) leucine-rich repeat interacting domain (middle), and the Ccr4(180–340) leucine-rich repeat domain (bottom). b, As in Fig. 6b, for Ded1(1–80).

Extended Data Fig. 7 The tethering screen reveals regulatory roles of known RNA-binding proteins.

a, Western blot analysis of Sro9 full length and truncation protein expression in the tethering assay. Two independent biological replicates are shown. b, Western blot analysis of Pab1 enrichment in FLAG-tag protein purification eluate. Two independent biological replicates are shown. c, Quantification of median values in Fig. 7j for two biological replicates.


Supplementary information

Supplementary Information

Supplementary Tables 1–3.

Reporting Summary

Peer Review File

Supplementary Data

Supplementary Data 1, Exemplary fragment library sequences. Supplementary Data 2, Barcode-to-fragment assignments. Supplementary Data 3, Fragment read counts and activity scores. Supplementary Data 4, Fragment activity scores. Supplementary Data 5, Fragment protein coordinates. Supplementary Data 6, Fragment Pfam domain coordinates. Supplementary Data 7, Repressor motifs. Supplementary Data 8, Activator motifs. Supplementary Data 9, Protein-protein interactions.

Source data

Source Data Fig. 1

Statistical source data.

Source Data Fig. 3

Statistical source data.

Source Data Fig. 5

Statistical source data.

Source Data Fig. 7

Statistical source data.

Source Data Extended Data Fig. 7

Unprocessed western blots.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.


About this article

Cite this article

Reynaud, K., McGeachy, A.M., Noble, D. et al. Surveying the global landscape of post-transcriptional regulators. Nat Struct Mol Biol 30, 740–752 (2023). https://doi.org/10.1038/s41594-023-00999-5


Received: 01 July 2021
Accepted: 17 April 2023
Published: 25 May 2023
Issue Date: June 2023
DOI: https://doi.org/10.1038/s41594-023-00999-5
