Signal-noise metrics for RNA binding protein identification reveal broad spectrum protein-RNA interaction frequencies and dynamics

Recent efforts towards the comprehensive identification of RNA-bound proteomes have revealed a large, surprisingly diverse family of candidate RNA-binding proteins (RBPs). Quantitative metrics for characterization and validation of protein-RNA interactions and their dynamic interactions have, however, proven analytically challenging and prone to error. Here we report a method termed LEAP-RBP (Liquid-Emulsion-Assisted-Purification of RNA-Bound Protein) for the selective, quantitative recovery of UV-crosslinked RNA-protein complexes. By virtue of its high specificity and yield, LEAP-RBP distinguishes RNA-bound and RNA-free protein levels and reveals common sources of experimental noise in RNA-centric RBP enrichment methods. We introduce strategies for accurate RBP identification and signal-based metrics for quantifying protein-RNA complex enrichment, relative RNA occupancy, and method specificity. In this work, the utility of our approach is validated by comprehensive identification of RBPs whose association with mRNA is modulated in response to global mRNA translation state changes and through in-depth benchmark comparisons with current methodologies.

As with protein-protein interactions, protein-RNA interactions can vary substantially in their specificities, interaction lifetimes, and apparent affinities 13 .This intrinsic biological property creates methodological hurdles to establishing biologically relevant interactions, particularly when interaction energies are weak and thus readily lost during biochemical isolation.In the case of RNA-protein complexes (RNPs), chemical-or UV cross-linking methods can capture physiologically relevant interactions though as with any cross-linking method, criteria for distinguishing specific from biologically irrelevant interactions are critical.UV cross-linking is preferred due to its high specificity, though it is also inefficient, with only a fraction of interactors forming a covalent adduct [18][19][20] .The generally low crosslinking efficiencies present an analytical challenge because selective, quantitative recovery of the UV-crosslinked protein-RNA complexes (clRNPs) is necessary for accurately determining RNA occupancy states in vivo.Here we report two different sources of non-specific free protein recovery during RNA-centric enrichment of clRNPs.To distinguish RNA-bound and free protein levels, we introduce signalto-noise (S/N) metrics, where S/N is a protein-specific metric representing the ratio of RNA-bound (S) to unbound counterparts (N), and where proteins without RNA-bound counterparts (S = 0) comprise background (B).Because mass spectrometry-based proteomic analysis does not distinguish RNA-bound proteins from their unbound counterparts, S/N-based metrics provide a valuable quantitative criterion for high confidence assignment of RNA-binding function.
In this work, we identify and validate experimental approaches for evaluating S/N and use them to guide the development of a biochemical method termed LEAP-RBP (Liquid-Emulsion-Assisted-Purification of RNA-Bound Protein) for the selective isolation of total RNAbound protein.SILAC LC-MS/MS analysis of LEAP-RBP fractions demonstrated high RNA-bound protein enrichment and through comparative analyses, revealed a key metric for evaluating method specificity for RNA-bound RBPs which we term %TP S , or RNA-bound protein abundance.High %TP S is indicative of low free protein recovery and enables the accurate study of dynamic, cell state-determined changes in RBP occupancy state.Using this signal-based analytical framework, we present perspectives and strategies for studying RNAbound proteomes and their dynamics.The utility of this approach is established through benchmark comparisons of LEAP-RBP with current RNA-centric enrichment methods.

RNA-bound proteins display RNase-dependent SDS-PAGE mobility
The work reported here builds on the discovery of organic phase separation methods for the selective enrichment of clRNPs 3,4,21 .Whereas acidic guanidinium thiocyanate-phenol-chloroform (AGPC) biphasic extraction (i.e., Trizol) has been widely used for the isolation of RNA and protein from biological samples 22,23 , the finding that RBPs exhibit UV-dependent enrichment at the AGPC interphase revealed a utility for this method in RNA-protein interactome discovery (Supplementary Note 1) 3,4,24 .Previously, it was reported that repeated AGPC interphase extraction further enriches RBPs over non-RBPs 4,24 .Here, we assessed the ability of repeated AGPC extraction to enhance S/N (RNA-bound protein/free protein) of AGPC interphase proteins using an SDS-PAGE RNase-sensitivity Assay (SRA) (Fig. 1a).
Fig. 1 | RNase-dependent SDS-PAGE mobility provides a key diagnostic for identifying RNA-binding proteins.AGPC acidic guanidinium thiocyanate-phenol-chloroform, SRA SRA analysis.a Schematic illustration of the experimental approach.Repeated AGPC extraction enriches RBPs over non-RBPs; effect of repeated AGPC extraction on S/N was tested.Graphical representation of SRA: RNase-treated samples are compared to equivalent amounts of untreated samples by SDS-PAGE.RNA-bound proteins in untreated samples migrate at a higher molecular weight than their unbound counterparts.RNase digestion liberates RNA-bound RBPs allowing their mobilization into the separating gel.b Comparison of AGPC interphase samples isolated by methanol precipitation (95% v/v) following 1-6 AGPC extraction(s) of UV-crosslinked or non-crosslinked HeLa cells by SRA and Coomassie Blue (protein), SYBR Safe (RNA&DNA) staining; parallel gels.Protein (BCA) and RNA&DNA (UV-spectrophotometry) yields shown in the adjoining bar charts represent the mean ± 1 SD of three biologically independent samples, pooled (equivalent % fraction) for SRA analysis.c Immunoblot analysis of samples from (b).Sample compositions, target protein RBP-RNA interactome category, and target protein GO:RBP-annotation status indicated in the figure panel.Full bots are provided in Supplementary Fig. 1a, RNA-binding domains and list of studies reporting UV-enrichment* provided in Supplementary Data 1. d Schematic illustration of enhanced S/N output.SRA is unable to distinguish non-RBPs from RBPs with low S/N in RNP fractions isolated by methanol precipitation (95% v/v) from the final AGPC interphase.SRA supports the identification of RNA enrichment methods that further enhance S/N, evidenced by a marked decrease in RNase-insensitive protein.Graphics (a, b) were created with BioRender.com.Experiments (b, c) were performed four times with similar results.Source data are provided as a Source Data file.
As illustrated in Fig. 1a, UV cross-linked RNA-protein complexes, which comprise signal ( = S), migrate at a higher apparent molecular weight than their unbound counterparts (noise = N) in SDS-PAGE 21,25 .RNase treatment liberates RNA-bound protein, increasing the amount of observed ( = O) protein migrating at its expected molecular weight (ΔO = | S | ).Consequently, a comparison of RNase-treated (S + N) and untreated control samples (N) by SDS-PAGE reveals RNase-sensitive RBPs ( | S | > 0) and enables the determination of S/N.As described in Eq. ( 1), log 2 transformation yields RNase-dependent fold-change (Supplementary Note 2).
RNase-dependent fold-change = Δlog 2 ðOÞ = log 2 ðjSj + NÞ RNase -log 2 ðNÞ untreated ð1Þ We used SRA to evaluate UV-dependent enrichment and S/N of proteins recovered from the AGPC interphase of UV-crosslinked (0.4 J/cm 2 , 254 nm) and non-crosslinked HeLa cells, using sequential interphase extraction to maximize RBP enrichment, as previously reported (Fig. 1b) 24 .To track RNA enrichments and RNase digestion efficacy, parallel gels were run and stained with SYBR Safe.Interestingly, while RNA exhibited UV-enrichment in the AGPC interphase (lower gel, red boxes), proteins showed only modest AGPC interphase UV-enrichment and were largely RNase-insensitive (upper gel, blue boxes), indicating that most of the proteins recovered from the AGPC interphase fraction of UV-irradiated cells were not crosslinked to RNA.These data demonstrate that UV-irradiation-dependent enrichment at the AGPC interphase alone is insufficient evidence for RNA-binding activity.To distinguish between contributions of noise (RBPs lacking crosslinked RNA) and background (non-RBPs) to this phenomenon, we compared known GO-annotated RBPs binding different RNA species with non-RBPs by SRA and immunoblot 1 .Here, proteins lacking GO-annotation (GO:RBP) are designated as non-RBPs, but this may not be due to a lack of RNA-binding activity ("Methods").Protein targets were selected on the basis of prior literature, GO-RBP-annotation datasets, RNA-binding domain bioinformatic analysis, and UV cross-linking/CLIP sequencing frequency, to include canonical RBPs containing known RNA-binding domains (red font) and noncanonical RBPs (blue font) which were identified in previous studies as candidate RBPs (i.e., scored as UV-enriched) but which lack known RNA-binding domains (Fig. 1c) 8,11,26,27 .Intriguingly, all canonical RBPs exhibited some degree of UV-dependent RNase sensitivity by SRA (red box) and were recovered from the AGPC interphase, whereas noncanonical RBPs and non-RBPs were either RNase-insensitive or interphase depleted following repeat extraction.In agreement with prior findings, most glycoproteins assayed (asterisks) exhibited UV-independent enrichment at the AGPC interphase 4 , although ribophorin I (RPN1) did not (purple box; Fig. 1c).The finding that known RBPs lacking crosslinked RNA, e.g., nucleolin (NCL), and non-RBPs, e.g., β-tubulin (blue boxes) can score as UV-enriched provides a clear demonstration that orthogonal assays such as SRA are necessary for validation of candidate RBPs.
Even after six AGPC extractions, the established RBP nucleolin (NCL) reached an apparent S/N enrichment limit (gold box; Fig. 1c).We attributed this to intrinsic limitations in the ability of repeated AGPC extraction to enhance the S/N of a given RBP (Supplementary Fig. 2a and Supplementary Notes 3 and 4).This finding suggested that a subset of RNase-insensitive proteins at the final AGPC interphase may be bona fide RBPs with intrinsically low S/N.Combined, these observations revealed a need for enrichment methods that are selective for RNA-bound RBPs and thereby support quantitative S/N determinations (Fig. 1d).

LEAP-RBP provides efficient recovery of RNA-bound protein
Efforts to identify methods meeting this criterion yielded two approaches, INP (Isopropanol NaCl Precipitation) and LEAP-RBP (Liquid-Emulsion-Assisted-Purification of RNA-Bound Protein).Both are RNA-centric enrichment methods that enhance S/N by SRA without significant signal loss (Fig. 2a, b), but only LEAP-RBP (L) clearly distinguishes RBPs with low S/N (e.g., RPN1, TRAPα) from non-RBPs (red box; Fig. 2b).LEAP-RBP is notable for its high selectivity for RNAprotein complexes, with only negligible protein quantities displaying SDS-PAGE mobility in the absence of RNase digestion (Fig. 2a).This conclusion is further supported by assays of protein and RNA yields in the different enrichment methods, where LEAP-RBP yielded a substantially higher total RNA/protein than that obtained by methanol or isopropanol/NaCl precipitation.The utility of LEAP-RBP is readily apparent by SRA re-evaluation of a subset of previously identified candidate RBPs where for example the previously GO-annotated RBP and endoplasmic reticulum (ER) chaperone GRP94 scores as a false positive and the non-RBP ER integral membrane protein TRAPα as a false negative (Fig. 2b).The enhanced S/N of the LEAP-RBP method was achieved using a heterogeneous, lithium-supplemented solvent system which provides rapid and selective precipitation of RNA-bound proteins (Fig. 2c, Supplementary Figs.3a-c and 4a-d, and "Methods") 28 .
By virtue of high specificity, LEAP-RBP did not recover detectable protein from the final AGPC interphase suspension of non-crosslinked cells (Supplementary Fig. 5a).This finding allowed us to determine if protein recovery in the final AGPC interphase suspension of UV-crosslinked cells is restricted to protein-RNA adducts and/or can be UV-irradiation (signal)dependent but independent of RNA cross-linking.In these experiments, LEAP-RBP was applied to AGP input suspensions containing UV cross-linked or non-cross-linked cells (Fig. 3a, b).Here the identification of contaminant DNA (red box; Fig. 3a) prompted development of a DNA depletion step utilizing a second LEAP step to further enhance S/N (Supplementary Fig. 5b).Recovery of protein in the AGPC interphase was primarily UVdependent, with a small but significant difference between crosslinked samples prepared using a single LEAP and those prepared using the full protocol noted above (Fig. 3a).As expected, RNA recovery was mainly dependent on sample prep although UVdependent following six AGPC extractions.Importantly, comparisons of the fractions prepared using the different protocols revealed similar RNase-sensitive protein profiles, thereby validating the critical assumption that all RNA-bound protein partitions to the AGPC interphase (Fig. 3a, b).Remarkably, and as a clear demonstration of the specificity and selectivity of LEAP-RBP for protein-RNA adducts, the S/N of most RBPs isolated by direct LEAP-RBP treatment of AGP input suspensions were comparable to those isolated following six AGPC extractions (gold boxes; Fig. 3b).Together, these data establish the efficacy of the LEAP-RBP method and support the alternative hypothesis that recovery of noise is signal-dependent, not RNA-dependent (red boxes; Fig. 3b).In subsequent experiments we observed that RNA-free RBP recovery from AGP input suspensions was dependent on UVdose and independent of total RNA and protein quantity (Supplementary Fig. 6a-c and Supplementary Note 4).

Enhanced S/N decreases UV-enrichment* specificity
The improvements in S/N conferred by LEAP-RBP provided an opportunity to determine the effect of enhanced S/N on UVenrichment* specificity for GO-annotated RBPs, where asterisks denote statistical significance.To this end, heavy SILAC-labeled crosslinked (CL) and light SILAC-labeled non-crosslinked (nCL) cells were pooled prior to processing to accurately quantify UVdependent free protein recovery by LC-MS/MS and evaluate S/N (Fig. 4a and Supplementary Notes 5 and 6).UV-enriched* proteins were identified by generating log 2 (CL/nCL) ratios with sum peptide intensities (SPI) as shown in Eq. ( 2) and testing against the null hypothesis that log 2 (CL/nCL) = 0 (Fig. 4b and Supplementary Data 2 and 3).
For comparative purposes, INP isolation was performed on parallel samples (Supplementary Data 4 and 5).As shown in Fig. 4c, both INP and LEAP-RBP methods enriched for GO-annotated RBPs, but LEAP-RBP identified nearly twice as many UV-enriched* proteins (1794 vs 937) including 682 of the 719 UV-enriched* and 106 of the 156 non-enriched RBPs identified by INP (Fig. 4d).Notably, the LEAP- RBP method identified a greater number of UV-enriched* RBPs that bind all classes of RNA and/or regulate various RNA processes (Fig. 4e).However, only 270 of the 996 proteins exclusively UVenriched* by LEAP-RBP were GO-annotated RBPs, leading to a paradoxical decrease in UV-enrichment* specificity as compared to the INP method (719/937 vs 952/1794).Exploring this further, we observed a large cluster of UV-enriched* proteins, many lacking prior GO-annotation (i.e., non-RBPs), which were largely absent in INP fractions (Fig. 4f).We postulated that these may represent falsepositives arising from low-frequency, non-specific UV cross-linking which are UV-enriched* by the LEAP-RBP method by virtue of its high selectivity for protein-RNA complexes (Supplementary Fig. 6b and Supplementary Note 7) [29][30][31] .As a relevant example, the ER chaperone GRP94 was detected by immunoblot in INP fractions as an RNase-insensitive band but was exclusively UV-enriched* by LEAP-RBP method while undergoing a 50-fold decrease in abundance.Non-RBPs (GAPDH, GRP78, and β-tubulin) were UV-enriched* and undetectable by SRA in both fractions (Figs.2b and 4f), highlighting the utility of this orthogonal validation method for assigning RNAbinding activity to UV-enriched* proteins and providing a method for identifying likely low-frequency cross-linking events ("Methods").

Enhanced S/N allows the detection of RBP-RNA occupancy dynamics
A critical observation from the SILAC LC-MS/MS studies noted above is that while CL/nCL ratios provide a measure of UV-dependent enrichment, S/N ratio determinations reveal RNA-bound protein contributions across SILAC channels.This relationship is depicted in Fig. 5a and described by Eq. (3).
As graphically depicted in Fig. 5b, c, a threefold enrichment or log 2 (S/N) ratio of 0 indicates equal amounts of RNA-bound (S) and unbound (N) counterparts (50% RNA-bound) while a log 2 (CL/nCL) ≤ 0 indicates an absence of RNA-binding activity (when S = 0, N = B or background); this relationship is described by Eq. ( 4): As is apparent in the LC-MS/MS analysis, S/N is inextricably linked to the ability to detect a change in observed quantity Δlog 2 (SPI O ) in Tested against the null hypothesis that protein log 2 (CL/nCL) ratios (n = 3 biologically independent samples) are equal to 0 using unpaired, upper-tailed, heteroscedastic t tests (RNA-binding proteins are expected to be recovered from UV-crosslinked cells in greater amounts).Correction for multiple hypothesis testing was performed using the Benjamini-Hochberg approach and a false-discovery rate of 5%.c GO-enrichment analysis of proteins identified as E* in I or L fractions by SILAC (Fisher's Exact, two-tailed, correction for multiple hypothesis testing performed using the Benjamini-Hochberg approach and a falsediscovery rate of 5%).response to a change in RNA-bound quantity Δlog 2 (SPI S ).This relationship is depicted in Fig. 5d, illustrating a large decrease in statistical power for proteins displaying negative log 2 (S/N) ratios; these relationships are described by Eqs.(5-7) (Fig. 5e, "Methods").
Detectable Δ log 2 ðOÞ = t 0:95 Ã SEM of log 2 ðOÞ ð 6Þ Detectable Δlog 2 ðSÞ = log 2 ð2 ^detectable Δlog 2 ðOÞ À N=OÞ À log 2 ðS=OÞ ð7Þ A key conclusion evident from this analysis is that RBPs displaying different S/N ratios can be UV-enriched* but the ability to detect Δlog 2 (S) could differ substantially.These concepts are illustrated by comparing the RNase sensitivity (S/N) of RBPs by SRA with their SILAC LC-MS/MS-derived log 2 (S/N) ratios.In principle, RNase sensitivity represents a change in RNA-bound quantity ( | S | ) when noise is constant (N untreated = N RNase ); Eq. ( 1).This is analogous to Eq. ( 5) if S initial = 0 and N initial = N final .Experimental examples of these relationships are depicted in Fig. 5f, where both the INP and LEAP-RBP methods identified NCL, and the ER membrane proteins RPN1 and TRAPα as UVenriched*, but RNase sensitivity was more clearly evident in the LEAP-RBP fractions, which also display higher log 2 (S/N) ratios by SILAC LC-MS/MS.In contrast, RBPs displaying comparable RNase sensitivity (e.g., the ER membrane protein LRRC59) have comparable log 2 (S/N) ratios.Therefore, a change in RNA-bound quantity (S) for RBPs such as NCL, RPN1, and TRAPα will have a larger effect on their observed quantity (O) in LEAP-RBP fractions by LC-MS/MS (Supplementary Notes 5 and 6).

LEAP-RBP displays high method specificity for RNA-bound RBPs
As illustrated in Fig. 1b, e, current AGPC methods are hampered by considerable noise contributions; the interphase fractions contain many RNase-insensitive (free) proteins evident by SRA and Coomassie Blue staining.In contrast, AGPC interphase fractions enriched by the LEAP-RBP protocol would be expected to be both depleted in free proteins and thus contain a higher percentage of RNA-bound protein.This hypothesis was tested on INP and LEAP-RBP fractions using the Total Protein Approach (TPA), which provides protein abundance as a percentage of the total protein in the sample (total SPI), as described by Eq. ( 8) [32][33][34] .
By extension of Eqs. ( 3) and ( 4), TPA can be used to determine the abundance of RNA-bound (%TP S ) and free proteins (%TP N ) as a percentage of total SPI, as described by Eqs.(9-11).
Cumulatively, %TP S and %TP N represent the abundance of total RNA-bound (total SPI S ) and free protein (total SPI N ) in the sample.By this approach, 91% of the total protein in LEAP-RBP fractions is RNAbound compared to 47% for INP.This is consistent with differences in RNP composition (µg protein/µg RNA), though assumes equal noisepartitioning between SILAC channels (Fig. 5g).This finding validates Eq. ( 3); not assuming equal noise-partitioning overestimates the amount of RNA-bound protein in INP fractions by ~60% (Source Data Fig. 5g).Remarkably, GO-annotated RBPs represent 53% of proteins identified as UV-enriched* in LEAP-RBP fractions but contribute 98.3% of total SPI S (Fig. 5h).By comparison, the %TP S of INP fractions is lower (47), but RBPs still contribute 98.6% of total SPI S .
Estimating the abundance of RNA-bound proteins as a percentage of total RNA-bound protein in the sample (total SPI S ) can be represented by %TP (S) , where the parenthetical text denotes the identity of the total protein population ("Methods").While %TP S of INP fractions (47) is less than LEAP-RBP fractions (91), both methods recover near 100% of RNA-bound protein (I or L vs M, RNA yield; Fig. 2a); therefore, comparable %TP (S) contributions from RBPs (~98.5) and non-RBPs (~1.5) in both fractions is indicative of high UV cross-linking specificity.The disparity in %TP S between LEAP-RBP and INP fractions is not, however, readily illustrated by current analytical methods (Fig. 4b-f).To address this discrepancy, the abundance (log 10 (%TP)) of RBPs and non-RBPs identified by INP and LEAP-RBP were evaluated as a function of their S/N ratios (log 2 (S/N)) (Fig. 6a, b).As expected, an increase in % TP S is illustrated by the enhanced enrichment efficiency (S/N) of both RBPs and non-RBPs in LEAP-RBP fractions as compared to INP and higher abundance (%TP) of RBPs relative to non-RBPs (Supplementary Note 8).The increase in %TP S was attributed to a marked decrease in free protein recovery by the LEAP-RBP method.Consequently, LEAP-RBP provides a lower limit of detection (%TP range), thus resulting in identification of many low-abundance proteins not observed in INP fractions (exclusive; Fig. 6c).Because most of these low-abundance proteins are not GO-annotated as RBPs and UV-enriched*, enhanced S/N results in the paradoxical decrease in UV-enrichment* specificity despite a favorable increase in %TP S ("Methods").

High S/N and %TP distinguish bona fide RBPs
As noted above, we postulated that non-specific UV cross-linking, combined with the enhanced S/N provided by the LEAP-RBP method, results in the UV-enrichment* of low-abundance non-RBPs.In support of this hypothesis, all non-RBPs (undetectable by SRA) were UV-enriched* but display low S/N ratios and are less abundant than the majority of RNase-sensitive RBPs (Fig. 6d).Indeed, GO-annotated RBPs display significantly higher S/N ratios and were significantly more abundant than non-RBPs by either method (Fig. 6e), but the latter was more apparent with LEAP-RBP (high %TP S ).Critically, these results demonstrate that UV-enriched* RBPs can be distinguished from UVenriched* non-RBPs in LEAP-RBP fractions by their enrichment efficiencies (S/N) and abundance (%TP).
To help distinguish high and low-confidence RNA-binding proteins, we propose a ranking system based on an RBP-confidence score or RCS, where RCS = log 2 (S/N) * log 10 (%TP).In practice, RCS ranking prioritizes S/N over protein abundance and places proteins with S/N ratios <1 at lower ordinal rank (Fig. 6f).This scoring system accurately ranked proteins assayed by immunoblot (Fig. 6g) and placed the noncanonical ER RNA-binding protein LRRC59 among the top ten most confident "enigmRBPs" of the 163 detected in LEAP-RBP fractions (Fig. 6h) 8 .To facilitate rapid discovery of other noncanonical RBPs by SRA analysis of LEAP-RBP fractions, a full list of identified proteins, their RCS rank, and parameters discussed are included in the provided Source Data or in Supplementary Data 3.

LEAP-RBP allows robust and sensitive detection of Δlog 2 (S)
During comparative RBP profiling experiments, enhanced S/N and high %TP S allows accurate assessment of RNA-bound protein abundance.By comparison to INP, which mirrors current AGPC methods, the LEAP-RBP method allows more sensitive detection of Δlog 2 (S), representing the fold-change in RNA-bound protein quantity (S) necessary to reject the null hypothesis that Δlog 2 (S + N) = 0 (Supplementary Fig. 7a-c).This enables the use of stringent S/N-based criteria (S/N > 3, 75% RNAbound) to limit detection of Δlog 2 (N) and increase statistical power by reducing multiple hypothesis testing (Supplementary Note 6b, c).In addition, the high %TP S of LEAP-RBP fractions decreases the variability of mean-normalized samples by decreasing free protein contributions (%TP N ) of the most abundant proteins in the sample (Supplementary Fig. 7d-f).This allows the least computationally intensive and most accurate label-free LC-MS/MS approach for the detection of Δlog 2 (S) (Supplementary Note 4f) 35,36 .
To illustrate these points, we performed a comparative LEAP-RBP experiment to examine the effect of dynamic translatome remodeling on global RBP-RNA occupancy states.Using harringtonine (HT), a selective inhibitor of translation initiation, we sought to identify RBPs whose interactions with mRNAs were either sensitive to ribosome occupancy (= translation-state-dependent interactors) or whose mRNA association was not sensitive to ribosome occupancy status (= translation-state-independent interactors).Through inhibitory interactions at the ribosomal A-site, HT induces global polyribosome runoff, to yield monosomes bearing initiation codon-locked 80 S ribosomes 37,38 .We first confirmed harringtonine efficacy by sucrose gradient density gradient polyribosome profiling (Fig. 7a and "Methods").As expected, HT treatment resulted in the pronounced accumulation of 80 S monosome/mRNA complexes.Biological triplicate control and HT-treated cell cultures were then prepared by UV irradiation, LEAP-RBP fractions isolated, and comparisons of input (total protein) and clRNP fractions (total RNA-bound protein) were performed by LC-MS/MS analysis.Analysis of the proteomic datasets, with or without the proposed S/N limit, identified 23 RBPs displaying a significant change in RNA occupancy (purple data points; Fig. 7b, c and Supplementary Data 19 and 20).Application of the S/N-based criteria introduced above significantly improved the specificity and sensitivity of our analysis by limiting inclusion of proteins with significant free protein contributions and revealing additional RBPs with known roles in translation and ribonucleoprotein assembly, as demonstrated by GO analysis (gold vs teal markers; Fig. 7d, e).These findings are further detailed in Fig. 7f, which depicts protein gene name, fractional RNA occupancy, fold-change occupancy in response to HT treatment, and the presence of known RNA-binding domains and/or GO-annotated RBP status.It's noteworthy that both the observed fold-change differences as well as the fraction of RBPs whose RNA association is modulated by CDS ribosome occupancy status are generally quite modest.This suggests that CDS translation is largely determined by initiation rate frequency and regulation of ribosome processivity, with RBP-dependent regulation largely biased to 5'/3' UTR interactions 39,40 .Subsequent comparisons of the LEAP-RBPs fractions by SRA analysis confirmed that the RBPs whose association with mRNAs was modulated by ribosome loading status were due to differences in RNAbound abundance (Fig. 7g); comparing RNP fractions yielded similar results (Supplementary Fig. 8a, b).Notably, we identified RBPs such as RPS3, UPF1, SND1, and HDLBP which were previously reported to display decreased RNA binding following treatment with the oxidative stressor sodium arsenite 3 , and include additional proteins such as ABCF3 whose RNA binding is unannotated and which was identified exclusively using the more stringent S/N-based criteria (Fig. 7f, g).Importantly, and as a demonstration of the utility of the signal-based analytical approach, RBPs lying near the proposed S/N limit of 3, such as the translation elongation factor eEF2, displayed translation-statedependent differences in free protein recovery (gold box; Fig. 7g and Supplementary Fig. 8a).We also note that the LEAP-RBP method does not bias towards more abundant proteins, as those displaying S/N ratios >3 in LEAP-RBP fractions were found to be significantly less abundant than others in total protein (input) fractions; Kruskal-Wallis, H(1) = 7.82, P = 0.005; Source Data Fig. 7h).
As an additional demonstration of the utility of LEAP-RBP method for studying context-or cell type-dependent differences in RNA-bound proteomes, a LEAP-RBP analysis was performed on four different cell lines: human cervical cancer cells (HeLa), human embryonic kidney cells (293T), human hepatocyte-derived carcinoma cells (Huh7), and a rat pancreatic insulinoma cell line (832/13) (Supplementary Fig. 8c).Of note, input and clRNP fractions isolated from the different cell lines displayed discernible differences in total and RNA-bound proteomes by SRA analysis (Fig. 7i).Immunoblot analysis revealed more constitutive RNA binders (blue boxes), differentially abundant RNA binders (red boxes), and dynamic RNA binders (gold boxes) whose relative RNA-bound abundance differs from their relative total abundance (Fig. 7j and "Methods") 41,42 .This last category includes the TIA-1a isoform displaying lower total abundance in rat 832/13 cells but comparable RNA-bound abundance (gold asterisk) (Supplementary Note 6d).

Benchmarking RNA-centric methods with signal-based metrics
Comparisons of current RNA-centric approaches include overlap (Venn) analysis of UV-enriched* proteins but lack metrics such as S/N or %TP S (Supplementary Note 7).To ascertain the broader utility of the LEAP-RBP method and S/N-based rubrics, we performed benchmark comparisons of LEAP-RBP to three organic phase separation methods (XRNAX, OOPs, and Ptex), one solid phase separation method (TRAPP), and one affinity-based separation method (RIC) [2][3][4]21,50 . Excep for RIC, which selects for poly(A) RNA-binding proteins, these methods aim to isolate total RNA-protein interactomes.RNP fractions were isolated from UV-crosslinked and non-crosslinked cells according to each of the published methods (Fig. 8a and Supplementary Note 9); S/N, %TP S , and yield were evaluated by SRA analysis of RNP fractions (Fig. 8b, c) and the findings were compared with available MS data (Fig. 8d, e) 3,21,[50][51][52] .Of note, UV-dependent enrichment of free protein is expected to increase non-specific (i.e., non-RBP) %TP (S) contributions during non-SILAC LC-MS/MS experiments (Supplementary Notes 7e and 8).
By SRA analysis, XRNAX and OOPs display low to moderate UVdependent enrichment of free protein (blue boxes; Fig. 8b) and low S/N without signal loss (gold and red boxes; Fig. 8c).Ptex shows moderate UV-dependent enrichment (blue boxes; Fig. 8b), but recovered protein is not RNA-bound (gold boxes; Fig. 8c).This is consistent with prior data (Supplementary Note 8b), and available MS data (non-SILAC) indicating low %TP S (23.2) and high non-specific %TP (S) contributions (28.4;Fig. 8e).XRNAX utilizes partial tryptic digestion and repeated TiO 2 /SiO 2 enrichment to further enrich RNA-bound peptides prior to SILAC LC-MS/MS 3 .This procedure effectively enhances S/N, resulting in a favorable increase in %TP S (70.5, %TP (S), non-RBPs = 2.6; Fig. 8d, e); however, proteins such as β-tubulin or GRP94 remain UV-enriched* and because of the trypsinolysis step cannot be subsequently orthogonally validated by methods such as SRA (Supplementary Data 1).OOPs distinguish UV-enriched RBPs (gold boxes) from non-RBPs (blue boxes) at the interphase by their RNase-dependent partitioning into the organic phase (Fig. 8c), but continued partitioning of free protein decreases S/N 4 .It should be noted that the reported methodology for the OOPs protocol includes non-SILAC comparisons which are expected to result in non-specific enrichment of free protein (purple boxes; Fig. 8c) 51,53,54 .Available MS data showing high non-specific %TP (S) contributions (26.5; %TP S = 73.5)supports this assessment (Fig. 8d, e) 51 .By SRA analysis, both TRAPP and RIC display high UV-dependent enrichment of RNase-sensitive protein (blue boxes; Fig. 8b), and signaldependent recovery of noise (gold boxes; Source Data Fig. 5c), which are both indicative of high %TP S (Supplementary Note 8a).Comparisons using a higher percentage of RNP fractions isolated from UV-crosslinked cells suggest LEAP-RBP achieves higher S/N, %TP S , and yield than TRAPP or RIC methods (goldbox; Fig. 9a, b).Available MS data from RIC (non-SILAC) and TRAPP (SILAC) experiments support these observations and indicates that the LEAP-RBP method provides more sensitive detection of Δlog 2 (S) for a greater number of proteins  6b; blue markers: other; additional labels and color overlay ("label": color marker) for proteins displaying a significant difference in RNA occupancy (HT/ DMSO, clRNP, unpaired, two-tailed, homoscedastic t test; n = 3 biologically independent samples) after FDR-correction (Benjamini-Hochberg approach, falsediscovery rate of 5%) with S/N limit ("gold hits": gold marker), without S/N limit ("teal hits": teal marker), or both ("purple hits": purple marker).b Volcano plot showing significance values of proteins before FDR-correction as a function of log 2 (HT/DMSO), (clRNP, non-SILAC).c Scatterplot comparing log 2 (HT/DMSO) ratios of proteins identified in both input and clRNP fractions (shared, non-SILAC).d Observed protein abundances (clRNP, non-SILAC) as a function of log 2 (S/N) (clRNP, SILAC).e GO-enrichment analysis of gold, teal, and purple hits; shown are the top ten most enriched GO terms for purple hits (Fisher's Exact, two-tailed, no correction).f Category-distributed comparison of hits showing % RNA-bound (clRNP, SILAC), observed fold-change (clRNP, non-SILAC), presence of RNA-binding domain (blue: no; red: yes); and RBP-annotation (GO:RBP) status (blue: no; red: yes; red with blue outline: yes, inferred from UV-enrichment* in RIC-like (non-SILAC) experiments 2,55 ).g SRA and immunoblot of pooled (equivalent µg protein) input and clRNP fractions quantified in Supplementary Fig. 8d-g and analyzed by LC-MS/ MS (non-SILAC); includes observed FC(HT/DMSO) and S/N ratios (clRNP, SILAC); asterisk: significant FC(HT/DMSO) after FDR-correction with S/N limit.h Scatterplot comparing observed protein abundances in input and clRNP fractions (shared, non-SILAC).i Comparison of input and clRNP fractions isolated from four different cell lines by SRA and Coomassie Blue (protein), SYBR Safe (RNA&DNA) staining.j Immunoblot analysis of samples from (i), n = 3 biologically independent samples, pooled (equivalent % fraction) for SRA.Experiments were performed once (a-h), or three times with similar results (i, j).Source data are provided as a Source Data file.
(Fig. 9c-g).It's important to note that available MS data for TRAPP is from yeast; therefore, non-specific %TP (S) contributions are due to incomplete RBP annotations (GO:RBP) for ribosomal proteins (Supplementary Note 8b).Curiously, TRAPP shows efficient recovery of RNA-bound ribosomal proteins (blue boxes) but relatively poor recovery of others (red boxes; Fig. 9b), consistent with available MS data (Fig. 9h, i).We speculate that this RBP-specific signal loss may occur because of the stringent denaturing washes employed in the purification process.The optimal amount of UV-energy (254 nm) for TRAPP was discussed by these authors, based on GO analysis of UVenriched* proteins, with "lower doses being potentially less noisy but at the cost of recovering fewer proteins with annotated functions in RNA biology" 50 .From an S/N perspective, higher UV doses increase % TP S but decrease UV-enrichment* specificity (Fig. 9j).As expected, RIC recovers RNA-bound mRNA binders more efficiently than TRAPP (red boxes; Fig. 9b), but still recovers RNA-bound ribosomal proteins (blue box) and rRNA (Fig. 8a).More stringent protocols utilizing LNA probe capture (eRIC) have been developed to address these concerns (Fig. 9k, l) 52 .However, unless rRNA is entirely removed from the sample, UV-enrichment* of rRNA-bound protein is likely to occur.Indeed, ribosomal proteins are less abundant in eRIC fractions compared to RIC (1.2 vs 4.1) but are detected and UV-enriched* (Fig. 9m).A similar trend is observed for exclusive DNA binders which are less abundant in LEAP-RBP fractions compared to other methods but were also UV-enriched* in greater numbers (Fig. 9n).Surprisingly, while RIC and eRIC are aimed at selective recovery of mRNA binders, LEAP-RBP identified a greater number of mRNA binders and they are more abundant in LEAP-RBP fractions (Fig. 9i, m).Nonetheless, ribosomal proteins are less abundant in RIC (4.1) and eRIC (1.2) fractions than LEAP-RBP fractions (7.9).These data indicate that eRIC-and to a lesser extent, RIC-are selective for mRNA binders, while LEAP-RBP provides a comprehensive assessment of the RNAbound proteome.

Discussion
We report LEAP-RBP as a highly selective and cost-efficient method for the purification of RNA-bound protein from biological samples.We identify S/N and %TP S (RNA-bound protein abundance) as key metrics for evaluating RNA-bound protein enrichment and method specificity for RNA-bound RBPs.Importantly, we present practical, experimentally accessible strategies for the accurate determination of in vivo RNA-binding activity and for robust profiling of RNA-bound proteomes at steady-state and following dynamic cell state transitions.
A S/N-based comparative analysis of RBP profiling data generated by LEAP-RBP and other RNA-centric methods revealed the complexity and challenges inherent in accurate identification of direct RNA binders based on their UV-enrichment* and assessment of RNA-binding activity based on protein recovery alone 2,3,21,26,42,50-52,55,56 .These method-intrinsic challenges can be compounded by low method specificity and/or non-SILAC comparisons, both of which result in apparent UV-dependent enrichment of free protein.While RBP enrichment methods utilizing SILAC LC-MS/MS and stringent sample washes achieve higher %TP S , the benchmark comparisons performed here reveal both reduced yields and biases in signal recovery which were previously unrecognized.These observations provide insights into why non-poly(A) RNA binders such as ribosomal proteins can represent a large fraction of MS spectra 52 .The high selectivity of LEAP-RBP achieves high %TP S without the need for high-stringency washes, and thus provides a more specific, selective portrait RNA interactomes.
RNA-binding proteins containing well-established canonical RNAbinding domains display higher S/N ratios and RNA-bound abundance, which greatly simplifies study of their RNA interactome dynamics, largely independent of limitations in existing methods.A primary challenge in the field however is the study of candidate RBPs lacking canonical RNA-binding domains, known functions in RNA biology, relatively low UV cross-linking frequencies, and/or significant free protein contributions in phase separation-based RNA-centric methods, all of which can hinder interpretation as well as meta-analysis of RNA interactomes and their dynamic regulation (Supplementary Notes 4-6).The signal-based analytical framework described here addresses these limitations and provides experimental avenues for the discovery and study of potentially novel RNA interactors with previously unknown roles in RNA biology.We highlight the noncanonical integral membrane RBP candidates LRRC59, TRAPα, and RPN1, all of which are resident proteins of the endoplasmic reticulum and which may function in mRNA and/or ribosome localization to the ER, as a representative example of the utility of LEAP-RBP 49 .We also note a primary conclusion from our study of the effect of selective reduction in CDS ribosome occupancy status on RNA interactome composition, where global inhibition of translation initiation and ribosome runoff elicited RNA occupancy changes in only a small fraction of the RNA interactome.For those RBPs whose RNA interactions were sensitive to global translation initiation inhibition, differences in RNA-bound protein abundances were relatively modest, suggesting that for the supermajority of the RNA interactome, regulatory RBP-RNA interactions are biased to interactions at the 5' and 3' UTRs 39,40,57 .The successful application of this approach to identify and validate the dynamic responses of bona fide RBPs involved in translation initiation and uncover additional RNA interactors with previously unknown roles in RNA regulation provides strong experimental evidence of its utility for biological discovery.
The results presented herein suggest that the number of RNAbinding proteins currently thought to comprise the RNA interactome (~4925 human RBPs) 54 and/or those with GO-RBP annotations (~1693) is an overestimation.This perspective is consistent with findings in a recent CLIP-based RNA-protein cross-linking frequency analysis, where a crosslink frequency threshold that distinguishes bona fide and lowsignificant protein-RNA interactions was reported 11 .Importantly, LEAP-RBP combined with quantitative proteomic and SRA analysis provides direct experimental evidence of RNA binding and orthogonal validation of RBP activity 15,41,58 .The principle behind SRA has been previously reported and performed in combination with silica-based RBP purification methods 21,25 .Yet, biases in RBP-RNA adduct recovery or low sensitivity ( | S | /µg RNA) and/or low S/N can confound detection of many bona fide RBPs by SRA analysis alone (e.g., pAbPC1 and XRN1) (T; Fig. 9b).To that end, high %TP S and efficient, unbiased recovery of RNAbound protein is critical to the accurate identification of RNA interactomes and their state change dynamics, and is a goal largely met by LEAP-RBP (Supplementary Note 3).The inability to validate metabolic enzymes such as GAPDH which are frequently identified as candidate RBPs by other RBP profiling methods such as RIC and eRIC may indicate Fig. 8 | Benchmark comparisons illustrate utiity of LEAP-RBP and S/N, %TP S metrics.N/AGPC neutral/acidic guanidinium thiocyanate-phenol-chloroform, AGP acidic guanidinium thiocyanate-phenol, L LEAP-RBP, X XRNAX, O OOPs, P Ptex, T TRAPP, R RIC, SILAC, SILAC LC-MS/MS analysis, SRA SRA analysis, E* significantly UV-enriched*, NE not E*. a Experimental flow outlining the main steps of LEAP-RBP and five referenced RNA-centric methods.TBE gel analysis was performed on 1 µg of RNA isolated by NGPC extraction of proteinase K-treated RNP fractions isolated from UV-crosslinked cells and an equivalent % fraction of their corresponding noncrosslinked samples.UV-dependence of protein and RNA recovery as well as S/N were evaluated by SRA with SYBR Safe (RNA&DNA), Coomassie Blue (protein), and silver stain (RNA, DNA, and protein) staining (b) or immunoblot (c).Sample compositions and/or normalization values are indicated in figure panels b, c.Immunoblot targets found UV-enriched* in referenced studies (Supplementary Data 1) were marked with gold asterisks in c; black asterisk (T): no yeast homolog, hence n/a; n.d.: not detected.d Specificity and selectivity of the different methods for RNA-bound RBPs was evaluated by comparing observed abundances of RBPs and non-RBPs as a function of their log 2 (S/N) ratios (SILAC and non-SILAC).Differences in corresponding frequency curves plotted as a function of log 2 (S/N) (solid) or log 10 (%TP) (dashed) indicate differences in protein-RNA adduct enrichment efficiency (S/N) or abundance (%TP), respectively.e Stacked bar charts showing the number of RBPs and non-RBPs identified as E* or NE by each method, or the estimated %TP S and %TP N contributions of RBPs and non-RBPs in each RNP fraction.Experiments were performed once (a-c); n = 1 (X, O, P, T, R); n = 3 biologically independent samples (L), pooled (equivalent % fraction) for SRA.Input samples isolated from AGP input suspensions containing UV-crosslinked and/or non-crosslinked HeLa cells were used as inter-run controls during SRA analysis, n = 1.Complete MS datasets and parameters discussed in this study for each of the referenced studies are provided as individual Excel files (Supplementary Data 10-20).Source data are provided as a Source Data file.a limitation of our methodology for validation of low-frequency RNA interactors (Supplementary Note 7d).In these scenarios, validation of RNA-binding function in situ using complementary and/or orthogonal methods may be preferable.The high specificity and selectivity of the LEAP-RBP method for RNA-bound protein allows efficient capture of broad-spectrum RNA interactors from biological samples.Potential applications beyond those demonstrated here, including PAR and chemical cross-linking approaches, are reasonable to consider using the provided strategies (Supplementary Figs.9-42, Supplementary Notes 1-8, and "Methods").

Methodical and analytical framework
A description of sample types, terminologies, quantitative metrics, and analytical approaches are provided in the Supplementary Methods.Analytical approaches: evaluating UV-dependent enrichment and S/N by SDS-PAGE RNase-sensitivity Assay (SRA); estimating RBP-specific UV cross-linking efficiencies and S/N ratios by SDS-PAGE and immunoblot; evaluating total protein and total RNA-bound protein abundance by SDS-PAGE; MS data analysis; RCS rank analysis.

Criteria for assignment of RNA-binding activity
Protein displaying CL/nCL ratios >0 in LEAP-RBP fractions by SILAC LC-MS/MS were considered high or low-confidence RBPs based on their observed enrichment efficiency (S/N) and abundance (%TP).However, only those displaying discernible RNase sensitivity by SRA and immunoblot were considered bona fide RNA-binding proteins.Proteins which remained RNase-insensitive by SRA or undetectable were not considered bona fide RBPs regardless of GO-annotation (e.g., GRP94, a GO-annotated RBP).However, because the inability to detect a protein by SRA and immunoblot could be due to their low RNAbound protein abundance, negative data were not considered formal confirmation of an absence of RNA-binding activity.To this point, validation of RNA-binding activity with LEAP-RBP and SRA requires that RNA-protein interactors are susceptible to UV cross-linking.

LEAP-RBP optimization and quality control
The ability of LEAP-RBP to rapidly (< 5') recover total RNA-bound protein from AGP suspensions with near 100% recovery is supported by a lack of quantifiable RNA and RNase-sensitive bands in the unprecipitated fraction by SRA and Coomassie Blue (protein) staining (Supplementary Fig. 3a).Total RNA-bound protein recovery was further validated by performing repeated LEAP steps without a significant, discernible decrease in protein-bound RNA yield (Supplementary Fig. 3b and Supplementary Note 1b).The importance of the liquid-liquid interphase during the LEAP step was evidenced by a significant decrease in RNA recovery (30-50%) when the solvents were quickly mixed (Supplementary Fig. 3c).RNA-dependence of LEAP-RBP was validated by performing LEAP-RBP on RNase-treated clRNP fractions resulting in a loss of detectable protein by SDS-PAGE and Coomassie Blue (protein) staining.In addition, performing LEAP-RBP on proteinase K-treated clRNP fractions resulted in recovery of RNA, but not protein (determined by Coomassie Blue staining), thereby demonstrating RNA-centricity (Supplementary Fig. 3d).Efficiency of DNA depletion and signal recovery during MS sample prep steps were validated by qPCR and SRA analysis (Supplementary Fig. 5b).
An RNA-seq analysis of small RNA composition was performed to determine if small RNA species are recovered by LEAP-RBP from final AGPC interphase suspensions of UV-crosslinked cells.RNA samples were found to be of high integrity (RIN > 9) and contained diverse sRNA species displaying broad genome distributions (Supplementary Fig. 4a-d).As noted by others, small RNA species were expected to be depleted following repeated AGPC extraction relative to other larger RNA species due to lower UV cross-linking efficiencies and depletion of free RNA 3,4 .Therefore, assessing the abundance of different RNA biotypes in clRNP fractions relative to their abundance in total RNA samples was considered uninformative.Nonetheless, SDS-PAGE of LEAP-RBP fractions isolated from AGP input suspensions demonstrate recovery of 60-100 bp RNA species visible as RNase-sensitive bands by SYBR Safe (RNA&DNA) staining migrating between 17 and 30 kD (nCL, w/o repeated AGPC extractions; Fig. 3a).For additional supporting information on the LEAP-RBP method, see Supplementary Notes 1-3.

Acidic guanidinium thiocyanate-phenol-chloroform extraction
400 µL of acidic phenol (0981, VWR) were added to 800 µL GT cell extracts.Alternatively, cells were lysed in 1.2 mL Trizol reagent (15596026, Invitrogen) and transferred to a 2 ml microcentrifuge tube.Cell lysates were prepared by passage through a 19 ga 1-1/2" needle fifteen times (305187, BD).For AGPC extraction, 240 µL chloroform (CX-1060-1, Millipore) or ~3/5th vol of phenol were added to samples and vigorously vortexed for 10 s.Samples were centrifuged at 10,000 × g for 10 min at 4 °C with slow brake setting and ~80% (v/v) of the aqueous and organic phases were removed.For repeated AGPC extraction, 800 µL of fresh acidic guanidinium thiocyanate-phenol (2:1) buffer and 160 µL chloroform were added to the AGPC interphase and the process was repeated.The final AGPC interphase was resuspended in 1.0-1.5 mL fresh acid guanidinium thiocyanate-phenol (2:1) buffer.If AGP suspensions appeared cloudy, an additional AGPC extraction was performed.Additional protocol information is included in the Supplementary Methods.
Precipitation of RNA from aqueous phase samples Sodium chloride (5 M) was added to aqueous phase samples to a final concentration of 0.6 M and mixed by brief vortexing.One part isopropanol was added to a final concentration of 50% and samples were mixed by brief vortexing.Samples were incubated on a rotator for 15 min at 4 °C and centrifuged at 18,000 × g for 15 min at 4 °C with slow brake setting.Following removal of the supernatant, pellets were washed three times with ice-cold 75% ethanol (twice the volume of precipitation mixture), incubated for 5 min on ice with occasional agitation, and centrifuged at 18,000 × g for 5 min at 4 °C with slow brake setting.Pellets were air-dried and resuspended at the desired volume with DEPC-treated water or TE buffer.For long-term storage, precipitates were stored in 75% ethanol at −80 °C.Final working sample concentrations ranged from 0.2-2.0 µg of RNA/µL.

Methanol precipitation (95% v/v)
Samples were mixed with 19 parts room temperature (RT) 100% methanol, incubated on a rotator for 1 h at RT, and centrifuged at 20,000 × g for 10 min at 20 °C with slow brake setting.Following removal of the supernatant fraction, precipitates were washed twice with 1.0 mL RT 95% methanol (for up to 100 µg protein).For each wash, samples were vortexed for 5 sec, incubated on a rotator for 10 min at RT, and centrifuged at 20,000 × g for 10 min at 20 °C with slow brake setting.Three 400 µL aliquots of RT 95% methanol were used to recover precipitates adhering to the sides of the tubes and combined in a 1.5 mL microcentrifuge tube.The tubes were then stored vertically at 4 °C overnight or at RT for 30 min to allow precipitates to settle at the bottom of the tube.Samples were centrifuged at 20,000 × g for 10 min at 20 °C with slow brake setting and supernatants were removed.Pellets were air-dried and resuspended at the desired concentration with 1% lithium dodecyl sulfate (LiDS) (J32816, Thermo) in TE.For long-term storage, samples were stored as precipitates in 95% methanol or as 1% LiDS TE suspensions at −80 °C.Working concentrations of methanol precipitated samples in 1% LiDS TE ranged from 0.1-5.0 µg protein/µL, 0.1-8.0 µg of RNA/µL, or 0.1-2.0 µg of protein-bound RNA/µL.

Isolation of RNP fractions by INP
Final AGPC interphase suspensions were split between 2 mL microcentrifuge tubes (160 µL each).AGP suspensions were either stored at −80 °C or used immediately for precipitation.For precipitation, the following reagents were added to each AGP suspension in order while mixing by brief vortexing (5 sec) after each addition: 3 µL of GlycoBlue (AM9515, Invitrogen), 640 µL of 1% LiDS TE, 96 µL 5.0 M NaCl, and 899 µL isopropanol.Samples were vortexed for 5 sec and incubated on a rotator for 15 min at 4 °C.Samples were centrifuged at 14,000 × g for 15 min at 4 °C with slow brake setting.Following removal of the supernatant fraction, samples were washed three times with 1 mL icecold 75% ethanol, incubated for 5 min on ice, and centrifuged at 14,000 × g for 5 min at 4 °C with slow brake setting.Samples were then washed twice with 1 mL RT 95% methanol, incubated on a rotator for 10 min at RT and centrifuged at 20,000 × g for 10 min at 20 °C with slow brake setting.Supernatants were removed.Precipitates were airdried and resuspended at the desired concentration in 1% LiDS in TE.
For long-term storage, precipitates were stored in 95% methanol or as 1% LiDS TE suspensions at −80 °C.Working concentrations of INPprecipitated RNPs ranged from 1.0 to 3.0 µg of protein-bound RNA/µL.

Isolation of RNP fractions by LEAP-RBP
AGP input suspensions or final AGPC interphase suspensions were aliquoted (200 µL) across 1.5 mL microcentrifuge tube and stored at −80 °C or used immediately for precipitation.Chloroform was added to a final concentration of ~7% v/v and the sample was mixed by vortex to form an emulsion (after step A; Fig. 2c).Four parts of a precipitation solution containing 3.75 M LiCl (10515, VWR) and 50% isopropanol (v/v) were layered onto the AGPC mixtures, and the tubes were closed.Samples were slowly inverted to 90 degrees and/or until the AGPC mixture was displaced from the bottom of the tube, and then returned to an upright position followed by incubation for 1 min.This process was repeated at least four times, switching the direction of inversion, increasing the angle, and increasing the speed during reversion.Samples were then homogenized by vigorous vortexing, centrifuged at 14,000 × g for 5 min at 20 °C, and supernatants were removed.RNP pellets were rinsed twice with 1 mL RT 95% methanol by inverting the tube multiple times and removing the supernatant.RNP pellets were then washed with 1 mL RT 95% methanol by incubating for 5 min at RT with occasional inversion.Following removal of the final 95% methanol wash, pellets were air-dried and resuspended at the desired concentration with 1% LiDS TE.Additional protocol information is included in the Supplementary Methods.Working concentrations of LEAP-RBP isolated RNP fractions ranged from 0.1 to 4.0 µg of protein-bound RNA/µL.

LEAP-RBP DNA depletion step
DNA digestion was performed using the Turbo DNase kit (Thermo, AM2238).RNP pellets containing <55 µg RNA&DNA were fully resuspended in 15 µL of TE buffer and 5 µL of a master mix containing 10× Turbo DNase buffer, TE buffer, and Turbo DNase were added to a final concentration of 1× Turbo DNase buffer and 1 µL of Turbo DNase/10 µg DNA.Samples were incubated at 37 °C for 15 min and nine parts (180 µL) fresh acid guanidinium thiocyanate-phenol (2:1) buffer were added.Samples were precipitated according to the LEAP-RBP protocol using 14 µL of chloroform and resuspended in 1% LiDS TE at the desired concentration.Additional protocol information is included as part of the Supplementary Methods.

RNA and protein quantitation
Samples containing more than 1.5 µg RNA/µL were diluted 1:5 in their respective buffers for RNA quantitation by UV-spectrophotometry (Thermo Scientific, Nanodrop ND-1000).For samples where DNA contamination is expected to impact RNA quantitation by more than 10%, "RNA&DNA" was used in place of "RNA" for figure panels.Protein concentrations were determined by BCA protein assay (23225, Thermo) using a microplate 96-well format and BSA as a protein standard.Typically, 1% LiDS TE sample suspensions were clarified prior to protein quantitation: sample suspensions were incubated at 55 °C for 20 s, mixed by brief vortex, centrifuged at 3000 × g for 20 s at 20 °C, and clarified supernatants (~90% v/v) were transferred to a new tube.Two 2 µL aliquots of the clarified sample suspensions, typically containing between 0.1 and 1.0 µg protein were added to separate wells and mixed with 200 µL working reagent (Pierce BCA kit, 50:1 A:B) for BCA quantitation.

Sample preparation for MS proteomic analysis
Digestion and depletion of RNA and/or DNA from input samples and RNP fractions were necessary prior to MS-based proteomic analysis (Supplementary Figs.5b and 9a-c).For SILAC LC-MS/MS experiments, RNP suspensions were normalized to 3 µg/µL protein-bound RNA in 1% LiDS TE, and 33 µL were used per 100 µL reaction containing 13.3 µL RNase Cocktail, 1× RNase digest buffer (10 mM Tris-HCl pH 7.5, 100 mM NaCl, and 1 mM EDTA pH 8.0), and 1× protease inhibitors (11836153001, Roche).For the comparative LEAP-RBP experiment, 10 µL of input samples containing 2.0 µg protein/µL were used per 25 µL reaction containing 0.6 µL RNase Cocktail, 1× RNase digest buffer, and 1× protease inhibitors; 20 µL of clRNP fractions containing 0.2 µg RNAbound protein/µL were used per 50 µL reaction containing 3.1 µL RNase Cocktail, 1× RNase digest buffer, and 1× protease inhibitors.RNase digests were incubated for 2 h at 37 °C and then precipitated with 95% methanol v/v as described above.Each input sample was suspended in TE buffer and DNA was digested using 2.5 µL Turbo DNase (Thermo, AM2238) and a final concentration of 1× Turbo DNase buffer in a 50 µL reaction (15 min, 37 °C) and precipitated with 95% methanol v/v as described above.Pellets were air-dried and submitted for proteomics analysis.For long-term storage, we recommend storing precipitates in 95% methanol at −80 °C.

SILAC LC-MS/MS analysis of LEAP-RBP and INP fractions
Prior to LC-MS/MS analysis, samples were supplemented with 50 µL 8.0 M urea in 50 mM ammonium bicarbonate and subjected to two rounds of probe sonication.Next, samples were spiked with either a total of 120 or 240 fmol of bovine casein, supplemented with 15 µL 20% SDS, reduced with 10 mM dithiothreitol for 30 min at 45 °C and alkylated with 20 mM iodoacetamide for 45 min at RT.Then, samples were supplemented with a final concentration of 1.2% phosphoric acid and 543 µL of S-Trap (Protifi) binding buffer (90% methanol/100.0mM TEAB).Proteins were collected on the S-Trap, digested using 20 ng/µL sequencing grade trypsin (Promega) for 1 h at 47 °C, and eluted using 50 mM TEAB, followed by 0.2% FA, and lastly using 50% ACN/0.2%FA.All samples were then lyophilized to dryness and resuspended in 12 µL 1% TFA/2% acetonitrile containing 12.5 fmol/µL yeast alcohol dehydrogenase (ADH_YEAST).
Quantitative LC-MS/MS was performed on 1 µg of each sample, using a nanoAcquity UPLC system (Waters Corp) coupled to a Thermo Orbitrap Fusion Lumos high-resolution accurate mass tandem mass spectrometer (Thermo) equipped with a FAIMSPro device via a nanoelectrospray ionization source.Briefly, peptides were trapped on a Symmetry C18 20 mm × 180 µm trapping column (5 µL/min at 99.9/ 0.1 v/v water/acetonitrile), after which the analytical separation was performed using a 1.8 µm Acquity HSS T3 C18 75 µM × 250 mm column (Waters Corp.) with a 90-min linear gradient of 5 to 30% acetonitrile with 0.1% formic acid at a flow rate of 400 nanoliters/minute (nL/min) with a column temperature of 55 °C.Data collection on the Fusion Lumos mass spectrometer was performed for three difference compensation voltages (40 V, 60 V, 80 V).Within each CV, a datadependent acquisition (DDA) mode of acquisition with a r = 120,000 (m/z 200) full MS scan from m/z 375 to 1500 with a target AGC value of 4e5 ions was performed.MS/MS scans were acquired in the ion trap in rapid mode from m/z 100 with a target AGC value of 2e4 and max fill time of 100 ms.The total cycle time for each CV was 1 s, with total cycle times of 3 s between like full MS scans.A 45 s dynamic exclusion was employed to increase depth of coverage.The total analysis cycle time for each fraction injection was ~2 h.Data were imported into Proteome Discoverer 2.5 (Thermo Scientific Inc.) and all LC-MS/MS runs were aligned based on the accurate mass retention time of detected ions ("features") which contained MS/ MS spectra using Minora Feature Detector algorithm in Proteome Discoverer.Relative peptide abundance was calculated based on areaunder-the-curve (AUC) of the selected ion chromatograms of the aligned features across all runs.A filter was applied which required each peptide to be measured in at least 2 unique samples and in at least 50% of at least one of the unique biological groups.The MS/MS data was searched against the SwissProt H. sapiens database (downloaded 11/2019) and an equal number of reversed sequence "decoys" for falsediscovery rate determination.Mascot Distiller and Mascot Server (v 2.5, Matrix Sciences) were utilized to produce fragment ion spectra and to perform database searches.Database search parameters included fixed modification on Cys (carbamidomethyl) and variable modifications on Meth (+16, oxidation) and Arg/Lys (+10/ +8 for heavy SILAC residues K + 8, R + 10).Peptide Validator and Protein FDR Validator nodes in Proteome Discoverer were used to annotate the data at a maximum 1% protein false-discovery rate based on q value calculations.Note that peptide homology was addressed using razor rules in which a peptide matched to multiple different proteins was exclusively assigned to the protein that has more identified peptides.Protein homology was addressed by grouping proteins that had the same set of peptides to account for their identification.Following database searching and peptide scoring using Proteome discoverer validation, the data was annotated at a 1% protein false-discovery rate.Supplementary Data 6.For RIC and eRIC, a pseudo-third replicate was added by averaging non-zero SPI values of replicates 1 and 2 (Supplementary Data 14 and 15) 52 .Because XRNAX was performed without replicates, samples were first mean-normalized and the average nonzero SPI values of 12 different samples were used for MS data analyses (Supplementary Data 10) 3 ; MCF7, HEK293, and HeLa; half-confluent and confluent; 15 min and 30 min partial digestion prior to silica purification (3 × 2 × 2 = 12 different samples).For the remaining MS datasets, proteins not detected in all UV-crosslinked samples were excluded from downstream sample normalization procedures and data analysis.Replicate samples were mean-normalized to total SPI and SPI nCL values equal to 0 were replaced with the average non-zero SPI nCL value of the same protein ID.

LC-MS/MS analysis (comparative LEAP-RBP experiment)
Prior to LC-MS/MS analysis, samples were supplemented with 50 µL 8.0 M and subjected to two rounds of probe sonication.Next, samples were spiked with either a total of 120 or 240 fmol of bovine casein, supplemented with 7.9 µL 20% SDS, reduced with 10 mM dithiolthreitol for 30 min at 32 °C and alkylated with 20 mM iodoacetamide for 45 min at RT.Then, samples were supplemented with a final concentration of 1.2% phosphoric acid and 472 µL of S-Trap (Protifi) binding buffer (90% methanol/100.0mM TEAB).Proteins were collected on the S-Trap, digested using 4 or 20 ng/µL (for clRNP fractions containing 4 µg protein or input samples containing 20 µg protein, respectively) sequencing grade trypsin (Promega) for 1 h at 47 °C, and eluted using 50 mM TEAB, followed by 0.2% FA, and lastly using 50% ACN/0.2%FA.All samples were then lyophilized to dryness and resuspended in 12 or 60 µL (for clRNP fractions or input samples, respectively) of 1% TFA/2% acetonitrile containing 12.5 fmol/µL yeast alcohol dehydrogenase (ADH_YEAST).
Quantitative LC-MS/MS was performed on 3 µL (1 µg) of each sample, using a nanoAcquity UPLC system (Waters Corp) coupled to a Thermo Orbitrap Fusion Lumos high-resolution accurate mass tandem mass spectrometer (Thermo) equipped with a FAIMSPro device via a nanoelectrospray ionization source.Briefly, peptides were trapped on a Symmetry C18 20 mm × 180 µm trapping column (5 µL/min at 99.9/0.1 v/v water/acetonitrile), after which the analytical separation was performed using a 1.8 µm Acquity HSS T3 C18 75 µM × 250 mm column (Waters Corp.) with a 90-min linear gradient of 5 to 30% acetonitrile with 0.1% formic acid at a flow rate of 400 nanoliters/minute (nL/min) with a column temperature of 55 °C.Data collection on the Fusion Lumos mass spectrometer was performed for three different compensation voltages (40 V, 60 V, 80 V).Within each CV, a data-dependent acquisition (DDA) mode of acquisition with a r = 120,000 (m/z 200) full MS scan from m/z 375 to 1500 with a target AGC value of 4e5 ions was performed.MS/MS scans were acquired in the Orbitrap at r = 50,000 (m/z 200) from m/ z 100 with target AGC value of 1e5 and max fill time of 35 ms.The total cycle time for each CV was 1 s, with total cycle times of 3 sec between like full MS scans.A 45 s dynamic exclusion was employed to increase depth of coverage.The total analysis cycle time for each fraction injection was approximately 2 h.
Following 15 total UPLC-MS/MS analyses (excluding conditioning runs, but including three replicate SPQC samples), data were imported into Proteome Discoverer 3.0 (Thermo Scientific Inc.), and individual LC-MS data files were aligned based on the accurate mass retention time of detected precusor ions ("features") using Minora Feature Detector algorithm in Proteome Discoverer.Relative peptide abundance was measured based on peak intensities of the selected ion chromatograms of the aligned features across all runs.The MS/MS data was searched against the SwissProt H. sapiens database (downloaded 08/2022), a common contaminant/spiked protein database (bovine albumin, bovine casein, yeast ADH, etc.), and an equal number of reversed sequence "decoys" for false-discovery rate determination.Sequest was utilized to produce fragment ion spectra and to perform the database searches.Database search parameters included fixed modification on Cys (carbamidomethyl) and variable modification on Met (oxidation).Search tolerances were 2 ppm and 0.8 Da product ion with full trypsin enzyme rules.Peptide Validator and Protein FDR Validator nodes in Proteome Discoverer were used to annotate the data at a maximum 1% protein false-discovery rate based on q value calculations.Note that peptide homology was addressed using razor rules in which a peptide matched to multiple different proteins was exclusively assigned to the protein that has more identified peptides.Protein homology was addressed by grouping proteins that had the same set of peptides to account for their identification.A master protein within a group was assigned based on % coverage.

LC-MS/MS data processing (comparative LEAP-RBP experiment)
Initial data processing and generation of sum peptide intensities were done separately for each fraction and each sample group (input or clRNP and DMSO or HT).Peptide intensities of common contaminants and spike-ins (human keratins, BSA, porcine trypsin, yeast alcohol dehydrogenase) were manually curated from protein lists.Proteins not detected in all three replicates of both sample groups (DMSO and HT) for a given fraction (input or clRNP) and containing at least two unique peptide matches were excluded from downstream sample normalization and data analysis.Samples were mean-normalized to total SPI and log 2 normalized SPI values were used to test for differences in protein recovery between samples groups (DMSO vs HT) for each fraction (input or clRNP) using independent two-tailed homoscedastic t tests (Supplementary Data 8 and 9).Correction for multiple hypothesis testing was performed with the Benjamini-Hochberg approach and a false-discovery rate of 5% on total protein IDs (no S/N limit) or only those which displayed S/N ratios >3 in LEAP-RBP (clRNP) fractions by SILAC LC-MS/MS analysis.

Gene-Ontology (GO) enrichment analysis
GO-enrichment analyses were performed for UV-enriched* proteins identified by INP and LEAP-RBP using PANTHER V17.0 60 .For additional analyses of GO-annotated groups, lists were exported and assigned primary Uniprot IDs as described in Supplementary Data 6.The resulting GO-annotated protein lists were used to sort protein IDs (e.g., RBP vs non-RBP) for downstream analyses and can be found in Supplementary Data 7.

Sample preparation for sRNA-seq and data analysis
Two independent samples (HeLa) were UV-crosslinked with 0.4 J/cm 2 (254 nm).clRNPs were isolated from the final (6th) AGPC interphase suspension by LEAP-RBP and resuspended in TE buffer.Ca. 6 µg of protein-bound RNA was treated with Turbo DNase as outlined above (LEAP-RBP DNA depletion step) without performing the second LEAP step.Then, 20 µL of 2× proteinase K buffer and 3 µL proteinase K stock (20 mg/mL) were added, and samples were processed as described above (Proteinase K digestion).
Library construction, quality control, and sRNA sequencing For sRNA library construction, 3' and 5' adapters were ligated to 3' and 5' ends of small RNAs, respectively.First-strand cDNA was synthesized after hybridization with a reverse transcription primer and doublestranded cDNA libraries generated via PCR enrichment.After purification and size selection, libraries with insertions between 18-40 bp were selected.Library concentrations and QC was performed via Qubit and real-time PCR for quantitation and Bioanalyzer for size distribution analysis.Quantified libraries were pooled and sequenced on Illumina platforms in SE50 mode.

Fig. 2 |
Fig. 2 | LEAP-RBP provides rapid and efficient recovery of RNA-bound protein and identification of RBPs by SDS-PAGE RNase-sensitivity Assay (SRA).AGPC acidic guanidinium thiocyanate-phenol-chloroform, M methanol precipitation (95% v/v), I INP, L LEAP-RBP, SRA SRA analysis.a Comparison of RNP fractions isolated by M, I, and L methods from final AGPC interphase suspensions of UVcrosslinked cells by SRA and Coomassie Blue (protein), SYBR Safe (RNA&DNA) staining.Protein (BCA) and RNA (UV-spectrophotometry) yields shown in the adjoining bar charts represent the mean ± 1 SD of three biologically independent samples, pooled (equivalent % fraction) for SRA analysis.Effect of isolation method on protein yield analyzed using one-way ANOVA (Fisher's PLSD post hoc test, unpaired, two-tailed, homoscedastic, no correction): significant main effect of isolation method (m*), F(2, 6) = 41.91,P < 0.001 (MvI*, P = 0.018; IvL*, P = 0.001; MvL*, P < 0.001).Effect of isolation method on RNA yield analyzed using a one-way ANOVA (Fisher's PLSD post hoc test, unpaired, two-tailed, homoscedastic, no correction): non-significant main effect of isolation method (n.s.), F(2, 6) = 0.04, P = 0.965 (post hoc testing not applicable).b Immunoblot analysis of samples from a, sample compositions indicated in figure panel.c Pictorial representation of LEAP-RBP method: (A) Addition of chloroform and vortexing.(B) Addition of precipitation solution (LiCl and isopropanol), inversion and incubation for 1 minute; repeated 5+ times.(C) Vortexing (D) Centrifugation and rinsing with 95% methanol (v/v).Experiments (a, b) were performed twice with similar results.Source data are provided as a Source Data file.

Fig. 4 |
Fig. 4 | Enhanced S/N decreases UV-enrichment* specificity.AGPC acidic guanidinium thiocyanate-phenol-chloroform, I INP, L LEAP-RBP, SILAC SILAC LC-MS/ MS analysis, SRA SRA analysis, E* significantly UV-enriched*, NE not E*. a Schematic illustration of the experimental approach.Heavy SILAC-labeled UV-crosslinked and light SILAC-labeled non-crosslinked cells were mixed prior to repeated AGPC extraction and isolation of RNP fractions by L or I methods.Graphic prepared in BioRender.b Volcano plots showing proteins identified as E* (red) or NE (blue) in I or L fractions by SILAC.Log 2 (CL/nCL) ratios were generated with SPI CL values and average SPI nCL values.Tested against the null hypothesis that protein log 2 (CL/nCL) ratios (n = 3 biologically independent samples) are equal to 0 using unpaired, upper-tailed, heteroscedastic t tests (RNA-binding proteins are expected to be recovered from UV-crosslinked cells in greater amounts).Correction for multiple hypothesis testing was performed using the Benjamini-Hochberg approach and a false-discovery rate of 5%.c GO-enrichment analysis of proteins identified as E* in I or L fractions by SILAC (Fisher's Exact, two-tailed, correction for multiple

d
Venn diagram showing overlap of total and E* proteins identified in I and/or L fractions by SILAC.Pie charts showing the number of RBPs and non-RBPs identified as E* or NE in I or L fractions by SILAC.For information on protein ID assignment and protein lists for each category or GO term used in this study, see Supplementary Data 6 and 7. e Stacked bar charts showing the number of proteins with annotated RNA-binding or RNA-related functions identified as E* or NE in I or L fractions by SILAC.f Histograms showing average log 2 (CL/nCL) ratios of RBPs and non-RBPs identified in I or L fractions by SILAC.Font color for individual proteins reflects conclusions from SRA and immunoblot analysis of total clRNP fractions; red text: RNase-sensitive RBP; blue text: undetected or RNase-insensitive protein.Asterisks: E*.I and L SILAC LC-MS/MS experiments (a) were performed once with three biologically independent samples for each SILAC label group (CL, nCL).Source data are provided as Source Data file.

Fig. 5 |
Fig. 5 | SILAC LC-MS/MS analysis of LEAP-RBP fractions demonstrates high RNA-bound protein enrichment.I INP, L LEAP-RBP, SILAC SILAC LC-MS/MS analysis, SRA SRA analysis, E* significantly UV-enriched*, NE not E*. a Predicted relationship between CL/nCL and S/N ratios for a specific RBP or non-RBP identified in RNP fractions isolated from pooled UV-crosslinked (red) and non-crosslinked (blue) samples by SILAC.b Relationship from a shown as a continuous function.c Predicted relationship between log 2 (S/N) and the percentage (%) of observed protein quantity during SILAC LC-MS/MS experiments that is RNA-bound.d Predicted change in observed quantity Δlog 2 (S + N) in response to a change in RNA-bound quantity Δlog 2 (S) for proteins displaying increasing S/N ratios.e Estimated Δlog 2 (S) to successfully reject the null hypothesis that Δlog 2 (S + N) = 0 increases with decreasing S/N and increasing SD.f Log 2 (S/N) ratios quantified by SILAC were used to estimate the observed quantity of nucleolin and three noncanonical endoplasmic reticulum candidate RBPs in RNase-treated ( | S | + N) and untreated (N) I and L fractions by SDS-PAGE and immunoblot.Immunoblots were selected from Fig. 2b; and performed twice with similar results.g Estimated amount of RNA-bound and free protein in I and L fractions by SILAC is more accurate when assuming equal noise-partitioning between SILAC channels.h Stacked bar charts showing the number of RBPs and non-RBPs identified as E* or NE by each method, or the estimated %TP S and %TP N contributions of RBPs and non-RBPs in each RNP fraction.Graphics prepared in BioRender (a-c).Source data are provided as a Source Data file.

Fig. 6 |
Fig. 6 | High method specificity for RNA-bound RBPs allows accurate RCS ranking of RNA-binding proteins.I INP, L LEAP-RBP, SILAC SILAC LC-MS/MS analysis, SRA SRA analysis.Specificity of I (a) and L (b) methods for enrichment of RNA-bound RBPs was evaluated by comparing observed abundances of RBPs and non-RBPs as a function of their log 2 (S/N) ratios (SILAC).Distributions of immunoblot targets reported in Fig. 5f demonstrates significance of vertical intercept.c L exclusives fall within the lower range of detection (%TP range).d, b With labeled immunoblot targets; red font: RNase-sensitive RBP; blue text: undetected or RNaseinsensitive protein.e Cumulative frequency curves for RBPs (red) and non-RBPs (blue) identified in I (dashed) and L (solid) fractions as a function of log 2 (S/N) or log 10 (%TP) (SILAC).Effect of GO-annotation (GO:RBP) on analyzed using independent Kruskal-Wallis tests: significant effect of GO-annotation on log 2 (S/N) ratios of proteins identified in I and L fractions (SILAC), H(1) = 194.63,P < 0.001, H(1) = 436.62,P < 0.001, respectively; significant effect of GO-annotation on log 10 (% TP) of proteins identified in I and L fractions (SILAC), H(1) = 111.11,P < 0.001, H(1) = 632.29,P < 0.001, respectively.f, b With color overlay based on ordinal RCS rank.g RCS as a function of RCS rank.Protein IDs represent proteins examined by SRA and immunoblot in Fig. 2b; red font: RNase-sensitive RBP; blue font: undetected or RNase-insensitive proteins with positive RCS (i.e., log 2 (S/N) > 0) are considered more representative of their RNA-bound counterparts.h, f With top 10 RCS ranked enigmRNPs.Source data are provided as a Source Data file.

Fig. 9 |
Fig. 9 | LEAP-RBP provides a high-stringency, comprehensive portrait of the RNA-bound proteome.L: LEAP-RBP: TRAPP; R: RIC: E: eRIC: SILAC: SILAC LC-MS/ MS analysis; SRA: SRA analysis; E*: significantly UV-enriched*; NE: not E*.a, b UVdependent enrichment and S/N were evaluated by SRA with SYBR Safe (RNA&DNA), Coomassie Blue (protein), and Silver Stain (RNA, DNA, and protein) staining (a) or immunoblot (b).Sample compositions and/or normalization values included in figure panel b.c Stacked bar charts showing the number of RBPs and non-RBPs identified as E* or NE by each method, or the estimated %TP S and %TP N contributions of RBPs and non-RBPs in each RNP fraction.d Evaluation of method performance by RCS rank analysis.Proteins are ranked by their RBP-confidence scores and binned (n = 100 per bin).The number (#) of GO-annotated RBPs, RBPconfidence scores (RCS), %TP contributions, log 2 (S/N) ratios, # of unique peptides (i.e., coverage), and detectable Δlog 2 (S) are compared.For methods with high performance and/or %TP S , RCS rank predicts log 2 (S/N), %TP, coverage, and detectable Δlog 2 (S).e-g Method specificity for RNA-bound RBPs was evaluated by comparing observed abundances of RBPs and non-RBPs as a function of their log 2 (S/N) ratios (SILAC and non-SILAC).h Comparison of the 40 most abundant (% TP) proteins identified by each method with their estimated %TP S and %TP N contributions in each RNP fraction.i Stacked bar charts showing the number of proteins for different classes of RNA binders and their estimated %TP S and %TP N contributions in each RNP fraction.j Comparison of TRAPP experiments performed at differing UV cross-linking energies as described in (c).k Comparison of RIC (R) and eRIC (E) as described in c shows eRIC achieves a higher %TP S than RIC.l Evaluation of eRIC specificity as done in (e-g).m Comparison of mRNA and rRNA binders identified by RIC (R) and eRIC (E) as described in (i).n Comparison of exclusive DNA binders identified by TRAPP (T), RIC (R), eRIC (E), INP (I), and LEAP-RBP (L) as described in (i).Experiments were performed once (a, b); n = 1 (T, R); n = 3 biologically independent samples (L), pooled (equivalent % fraction) for SRA.Source data are provided as a Source Data file.