Large-scale analysis of small molecule-RNA interactions using multiplexed RNA structure libraries

Nagasawa, Ryosuke; Onizuka, Kazumitsu; Komatsu, Kaoru R.; Miyashita, Emi; Murase, Hirotaka; Ojima, Kanna; Ishikawa, Shunya; Ozawa, Mamiko; Saito, Hirohide; Nagatsugi, Fumi

doi:10.1038/s42004-024-01181-8

Download PDF

Article
Open access
Published: 01 May 2024

Large-scale analysis of small molecule-RNA interactions using multiplexed RNA structure libraries

Communications Chemistry volume 7, Article number: 98 (2024) Cite this article

1564 Accesses
21 Altmetric
Metrics details

Subjects

Abstract

The large-scale analysis of small-molecule binding to diverse RNA structures is key to understanding the required interaction properties and selectivity for developing RNA-binding molecules toward RNA-targeted therapies. Here, we report a new system for performing the large-scale analysis of small molecule–RNA interactions using a multiplexed pull-down assay with RNA structure libraries. The system profiled the RNA-binding landscapes of G-clamp and thiazole orange derivatives, which recognizes an unpaired guanine base and are good probes for fluorescent indicator displacement (FID) assays, respectively. We discuss the binding preferences of these molecules based on their large-scale affinity profiles. In addition, we selected combinations of fluorescent indicators and different ranks of RNA based on the information and screened for RNA-binding molecules using FID. RNAs with high- and intermediate-rank RNA provided reliable results. Our system provides fundamental information about small molecule–RNA interactions and facilitates the discovery of novel RNA-binding molecules.

De novo generation of multi-target compounds using deep generative chemistry

Article Open access 06 May 2024

Decrypting the molecular basis of cellular drug phenotypes by dose-resolved expression proteomics

Article Open access 07 May 2024

Chemoproteomic discovery of a covalent allosteric inhibitor of WRN helicase

Article 24 April 2024

Introduction

Targeting RNA with small molecules represents an attractive medicinal approach for treating gene-related and infectious diseases^1,2,3,4,5. For example, drugs targeting specific RNA splice sites have been approved to alleviate the symptoms of spinal muscular atrophy^6,7. Further, human precursor microRNAs (pre-miRNAs)^{8,9,10,11,12,13}, various repetitive RNAs, such as CUG^14,15,16,17 and UGGAA¹⁸ repeats, and structured RNA elements of infectious pathogens^19,20,21 are considered promising drug targets. When developing new RNA-binding molecules, profiling the small molecule-binding landscapes of various types of RNA structures is critical for gaining deep insights into their binding properties and selectivities^22,23,24. One powerful way to profile the binding of small molecules is an analysis based on massively parallel DNA sequencing. For example, Disney’s group developed a computational approach, Inforna, based on their screening methods and massive sequencing analysis, that has led to the discovery of various regulatory RNA-binding molecules in RNA-related disease models^10,11,12,25. Their binding profiles focused on the sequence variants within internal loops and bulge structures. More recently, Sugimoto’s group implemented RNA-capturing microsphere particles to establish a new sequencing-based RNA-selection method that does not require any ligand labeling for the RNA-binding fluorescent molecules^26,27. Although these methods are valuable, they could produce inaccurate results in the profiling of specific or stable RNA structures, such as G-quadruplex (G4) structures, owing to structure-dependent amplification biases. This is because polymerase tends to pause at structured RNA sites during reverse transcription or polymerase chain reactions (PCR)^28,29. Therefore, different approaches that do not involve reverse transcription or PCR are required for the profiling of small-molecule binding to diverse RNA structures, particularly highly structured RNAs exhibiting naturally occurring sequences.

Recently, we developed a new method, folded RNA element profiling with structure library (FOREST)³⁰, for the large-scale analysis of protein–RNA interactions using a multiplexed RNA structure library. FOREST quantifies interactions using a DNA barcode microarray that can capture RNA probes in an RNA structure library (Fig. 1) that is designed by extracting structured motifs from RNA structure datasets. In this system, a stabilizing common stem, a unique RNA barcode (5′ terminus), and Cy5 or Cy3 (3′ terminus) were attached to each RNA structure (Fig. 1a). Employing this system, we revealed the interaction landscape of RNA-binding proteins (RBPs) using the RNA structure library that was extracted from human pre-miRNAs, human 5′ UTRs, and the HIV-1 RNA genome. FOREST drives amplification-free quantification, thus facilitating the bias-free detection of different RNA structures and their interactors (e.g., G4 and G4-binding RBPs). Notably, we identified cross-reactive interactions among some of the tested RBPs. For example, we observed that three G4-binding proteins exhibited different binding preferences to G4 and interacted with non-G4 RNA motifs (e.g., the r(GAA)_n motif) with different selectivity. Thus, we hypothesized that our method could be used as a platform for profiling the RNA-binding landscapes of small molecules.

**Fig. 1: Method overview and the tested small molecules.**

In this study, we introduced a systematic and large-scale approach for investigating small molecule–RNA interaction profiles. By subjecting small molecules to FOREST, our system is advantageous for analyzing large-scale datasets of diverse RNA structures derived from naturally occurring sequences. As the detection of the binding affinities of different RNA structures is based on microarray analysis, FOREST avoids sequencing and structure-dependent amplification biases. Additionally, the results include not only high-affinity interactions but intermediate- and low-affinity ones. Therefore, our datasets will be invaluable resources for understanding the fine determinants of small molecule–RNA interactions.

Results and discussion

Design of the platform for the large-scale analysis of small molecule–RNA interactions

Regarding the first RNA structure library for the analysis (Library-1), we designed 1824 RNA structural motifs by extracting the terminal loops of human pre-miRNAs and adding several repetitive and control sequences³⁰. Five different barcodes were allocated to each motif structure to exclude the outliers representing non-specific binding to the barcode sequences. Thereafter, the small molecule was immobilized onto beads via biotin–streptavidin interactions (Fig. 1a). We performed the pull-down process by mixing the RNA structure library and immobilizing the small molecule, followed by the washing and elution steps to collect the bound RNAs. The RNAs that were pulled down were quantified by a DNA barcode microarray to obtain the fluorescence intensity of each RNA structure because of the correlation of fluorescence intensities with binding affinities after background subtraction by no-ligand-conjugated streptavidin control samples³⁰.

In this study, we selected G-clamp and thiazole orange (TO) derivatives as the binding molecules (Fig. 1). G-clamp can recognize an unpaired guanine base in RNA loop structures by forming four hydrogen bonds (Fig. 1b)^31,32,33. G-clamp was used to validate our system because it binds strongly to a wide range of RNAs. Conversely, the TO derivatives, TO-PRO-1 and TO-PRO-3, are known as fluorescent light-up probes for imaging and fluorescent indicator displacement (FID) assays (Fig. 1c)^{34,35,36,37,38}. FID represents a high-throughput method for identifying novel RNA-binding molecules^{39,40,41,42,43,44,45}. For example, TO-PRO-3, a deep-red fluorescent indicator, was used in an FID assay to screen for compounds that bind to the bacterial A-site, influenza A virus RNA, and G4 DNA^37,38,46. However, the binding information of these fluorescent indicators and their target RNA sequences is still limited. We believed that it would be beneficial to determine the RNA binding profiles of such conventionally used indicators to further expand the repertoire of target RNA sequences that can be used in FID assays. Based on the structure of TO-PRO-1, we designed the N₃-modified TO–N₃ and TO–N₃-2 exhibiting different linker positions (Fig. 1d). Similarly, we designed TO-3–N₃ and TO-3–N₃-2. These N₃-modified molecules were conjugated to biotin via a strain-promoted azide–alkyne cycloaddition (SPAAC) with DBCO–biotin (Figs. 1a, S1, and S2)^47,48 and used for the large-scale analysis.

Large-scale analysis of the interaction of G-clamp-N₃ with Library-1

First, we ranked the RNA motifs from Library-1 based on their G-clamp binding (Supplementary Data 1). In Supplementary Data 1, the sequences, binding scores, Z-scores, and CVs are shown in order of rank. To understand the binding properties of G-clamp, the numbers of bases in the single-stranded (ss) and double-stranded (ds) RNA regions were investigated using the secondary structures of the pre-miRNA loops predicted by RNAsubopt in the ViennaRNA package⁴⁹ (Fig. 2). The ssRNA region refers to the terminal loop, bulge, or internal loop. Boxes were generated for each of the five subpopulations based on their rankings. Regarding ssRNA, the G count of high-ranking RNAs (1–360) was significantly higher than that of all the pre-miRNAs in Library-1. Contrarily, the G count of the low-ranking RNAs (1441–1800) was significantly lower than that of all the examined pre-miRNAs. Conversely, the C counts of the high- and low-ranking RNAs were lower and higher than those of all the pre-miRNAs in Library-1, respectively. The U count of the high-ranking RNAs was lower than that of all the pre-miRNAs, and the A count of ssRNA was not significantly different among the rank sections. Regarding dsRNA, the four bases exhibited smaller differences among the ranks compared with ssRNA. The C and U counts were inversely proportional to the G count, as C and U in the ssRNA region can form base pairs with the neighboring G bases. Furthermore, the percentage of the unpaired G count highlighted an unpaired-G selectivity (Fig. S3). Five or more unpaired Gs were mainly observed in high-ranking RNAs (1–180), and the percentage decreased gradually as the rank decreased. Contrarily, few RNAs without any or only a single unpaired Gs were observed in the high-ranking group, and the percentage gradually increased as the rank decreased. These results corresponded to the fact that G-clamp mostly recognizes G base in the ssRNA regions³².

**Fig. 2: Box plots of the number of bases in single-stranded RNA (ssRNA) and double-stranded RNA (dsRNA), as determined by RNA secondary structure prediction.**

Next, to validate our screening platform for RNA structures, we selected 17 sequences from the high-affinity (top 100), intermediate-affinity (101–1000), and low-affinity (1001–1824) groups and measured their apparent dissociation constants (K_Dapp) by fluorescence titration (Fig. S4). To shorten the common stem and keep the RNA motif structures stable in the titration assays, a shorter common stem (three base pairs) was attached to the motifs (5’-AGC-motif-GCU-3’). A histogram of Z-scores and the correlation between the Z-scores and K_Dapp values are shown in Fig. 3a and b and Table S1. The minimum free energy structures of the selected RNAs are shown in Figs. 3c and S5. The ranks 1 and 2 RNAs (Fig. 3c, top) contained unpaired guanine bases in their loop structures and exhibited strong G-clamp binding (K_Dapp = 0.024 and 0.022 μM, respectively). For the rank 1 RNA (hsa-mir-4520-1 loop), we performed the G mutation assay using two G-mutated hsa-mir-4520-1 loops (hsa-mir-4520-1-mutG2A and -mutG7A). Although mutG2A exhibited strong binding (K_Dapp = 0.011 μM) similar to the wild type, mutG7A exhibited weaker binding (K_Dapp = 15 μM). The double mutant mutG2,7A also exhibited weaker binding (K_Dapp = 3.7 μM) than the wild type, indicating that G7 contributes to the strong interaction with G-clamp. Surface plasmon resonance (SPR) analysis also showed the same binding tendency as the values obtained by fluorescence titration experiments, although the values slightly increased (Fig. S6). While the wild-type and mutG2A exhibited strong binding (K_Dapp = 0.10 ± 0.02 and 0.044 ± 0.008 μM, respectively), mutG7A exhibited much weaker binding (K_Dapp > 50 μM). To consider the selectivity of G7, the molecular modeling of the complex structure between hsa-mir-4520-1 and G-clamp–N₃ was performed using RNAComposer^50,51 and MacroModel (Fig. 3d). When G-clamp is bound to 7G by hydrogen bonds, it can interact with neighboring bases. We considered that these interactions, such as stacking with CG base pair and a hydrogen bond with G base at the top of the stem (Fig. 3e), would facilitate strong binding in addition to the formation of the hydrogen bonds with the target G base. When G-clamp was bound to 2 G by hydrogen bonding, stacking interactions were not observed with neighboring bases (Fig. S7). These results indicate that G-clamp does not recognize all Gs on the loop (G-clamp recognizes specific Gs). The high number of G bases in the ssRNA region of high-ranking RNAs probably increased the probability of the presence of G bases that bind to G-clamp strongly (Fig. S3). In the high-affinity group, two of the selected RNA motifs contained the G4 structure. The K_Dapp values of the hsa-mir-6850 loop (rank 28) and G4_(GGGU)₆ (rank 38) were 0.19 and 0.15 μM, respectively. This may be because G-clamp intercalated on G4 RNAs. In the intermediate-affinity group, even though hsa-mir-548ba (rank 522) exhibited a loop that was similar to that in hsa-mir-4520-1, its K_Dapp value (10 μM) was much higher. Comparing the modeling structures of hsa-mir-4520-1 and hsa-mir-548ba (Fig. S8) revealed that G-clamp–N₃ cannot strongly interact with adjacent bases when it forms hydrogen bonds with a G base on the loop structure of hsa-mir-548ba. In the low-affinity group, the loops without any G bases, such as hsa-mir-4773-1 (rank 1192), hsa-mir-4282 (rank 1775), exhibited weak binding (K_Dapp > 20 μM) and common stem sequence with four Us in the terminal loop also exhibited weak binding (K_Dapp = 9 μM) (Figs. S4 and S5). Within the group of selected RNAs, only (CUG)₁₆ (rank 43) deviated from our expectations in the fluorescence titration experiment (Fig. 3b, green color). Overall, we observed a good correlation between the Z-scores and observed K_Dapp (Fig. 3b, Spearman’s correlation coefficient: −0.86); the coefficient without considering (CUG)₁₆ exhibited an even higher correlation (−0.95). The G4 structures, which are susceptible to bias when using sequencing-based methods, were evaluated and ranked. These results indicate that our system for the large-scale analysis of the RNA structure libraries can ensure accurate assessments of small molecule–RNA interactions.

**Fig. 3: Large-scale analysis of the interaction of G-clamp–N₃ with Library-1 (1824 different sequences).**

Large-scale analysis of the interaction of the thiazole orange derivatives with Library-2

Next, we investigated the binding of different RNA motifs to the TO derivatives using our second RNA structure library, Library-2 (Supplementary Data 2–5). Library-2 contains 3000 RNA structural motifs that were designed by extracting the terminal loops of human pre-miRNAs, along with SARS-CoV-2 and influenza A virus RNAs and several repetitive and control sequences. Compared with the G-clamp binding profile, TO and TO-3 exhibited distinct profiles (Fig. 4a), although a significant correlation was observed between their binding profiles (Fig. 4b). These data indicate that the TO derivatives exhibited similar selectivities, which were unique compared with the G-clamp, as expected. The correlation coefficient between TO–N₃ and TO–N₃-2 with different linker positions (r = 0.78) was lower than that between TO–N₃ and TO-3–N₃ with the same linker position (r = 0.91), suggesting that the linker positions affect the binding profile (Fig. 4b). The high-affinity group of RNAs for the TO derivatives was mainly populated with G4 RNAs. The kernel density estimation of the Z-scores of the TO derivatives indicated the significant enrichment of the G4 control RNAs (Fig. S9).

**Fig. 4: Analysis of the binding properties of the TO derivatives.**

To understand the binding properties of the TO derivatives, the numbers of bases in the ssRNA and dsRNA regions were quantified using the predicted secondary structure of the pre-miRNA loops similar to the analysis of the G-clamp (Fig. 4c). For ssRNA, the G count of the high-ranking RNAs (1–360) was significantly higher than that of all the pre-miRNAs in Library-2. Contrarily, the ssRNA counts of the other bases were not significantly different among the different ranks. Regarding dsRNA, the G and C counts of the high-ranking RNAs (1–360), as well as the A and U counts of the low-ranking RNAs (1441–1800), were significantly higher than that of all the pre-miRNAs. The count tendencies of TO-3–N₃ and TO–N₃ were similar. Overall, these results altogether suggest that the TO derivatives prefer G-rich ssRNA and G/C-rich rigid stem structures, such as hsa-mir-5091 and -4437 (Fig. 4d). Regarding ssRNA, we further examined the total number of nucleotides in the internal and terminal loops (Fig. 4e). Although high-ranking RNAs exhibited more G and A bases in their internal loops, the terminal loops of high-ranking RNAs only exhibited a preference for more G but no other bases. These results suggest that the TO derivatives prefer the G/A bases in the internal and G-rich terminal loops. A likely explanation is that the internal loops comprising G/A bases may create a binding pocket that is ideal for intercalation, whereas the G-rich terminal loops may form G4-like structures. To confirm the preference of the TO derivatives for internal loops comprising G/A bases, we compared the K_Dapp values of hsa-mir-4437 and its internal loop (AGG to UCC) mutant, mir-4437-mut (Figs. 4d and S10). Although the K_Dapp values of TO–N₃ and TO-3–N₃ for the wild type hsa-mir-4437 loop were relatively low, 4.4 and 11 μM, respectively, the K_Dapp values of mir-4437-mut were much higher (>40 μM), suggesting that the G/A bases in the internal loop are crucial to the strong binding of the TO derivatives to the hsa-mir-4437 loop at least.

To further validate the binding profiles of the TO derivatives that were generated by our screening platform, the K_Dapp values of TO–N₃ and TO-3–N₃ interacting with 15 RNAs (pre-miRNAs, G4 RNAs, and virus RNAs) were measured by fluorescence titration (Figs. S11 and S12 and Table S2). For the high-ranking RNAs (top 100), the K_Dapp values correlated well with the Z-scores of TO–N₃ and the Spearman correlation coefficient was −0.95 (Fig. 5a). Contrarily, no strong binding was observed for the low-ranking RNAs (K_Dapp > 40 μM). Similarly, the K_Dapp values of TO-3–N₃ also correlated well with the Z-scores of TO-3–N₃ of high-ranking RNAs (top 100), as the coefficient was −0.85 (Fig. 5b). These results confirm that our system can provide accurate assessments of different binding modes of ligands and structured RNAs containing G4 structures.

**Fig. 5: Correlation between the Z-score in FOREST and K_Dapp value of the TO derivatives.**

Additionally, we extended this analysis to the commercially available indicators, TO-PRO-1 and TO-PRO3, by measuring their K_Dapp values to the 16 selected RNAs and calculating the correlations with the Z-scores of TO–N₃ and TO-3–N₃, respectively (Figs. 5c, d and S13–S15 and Tables S3 and S4). Regarding TO-PRO-1, the K_Dapp values exhibited weak and improved correlations with the Z-scores of TO–N₃ (r = −0.60) and TO–N₃-2 (r = −0.71), respectively, indicating that the binding profile of TO–N₃-2 may reflect TO-PRO-1 binding by various RNA motifs more accurately (Fig. 5c). Conversely, for TO-PRO-3, there were significant correlations between the K_Dapp values and Z-scores of TO-3–N₃ (r = −0.89) and TO-3–N₃-2 (r = −0.90) (Fig. 5d). Taken together, these binding profiles will benefit the selection of the proper combinations of target RNA and fluorescent indicators for FID assays.

Screening of the novel RNA-binding molecules by fluorescent indicator displacement assay using TO-PRO-1 and TO-PRO-3

Based on the binding profiles of the TO derivatives, we selected the intermediate-affinity-ranked combinations of the indicator and disease-related human pre-miRNAs previously observed to be dysregulated in several tumors, hsa-mir-221, -191, and -21, for the FID assay (Fig. 6)^52,53,54. As a high-rank G4 RNA control, hsa-mir-6850 was selected. Additionally, as a low-rank control, the terminal loop motifs from hsa-mir-374a and SARS-CoV-2 RNA (SARS-low) were selected. The predicted RNA secondary structures are shown in Fig. 6a, and the K_Dapp values of TO-PRO-1 and TO-PRO-3 to these target and control RNAs are listed. The signal-to-background (S/B) ratios of TO-PRO-1 and TO-PRO-3 for these RNAs are summarized in Fig. 6b. The S/B ratios of the low-rank RNAs were significantly lower than the others. A low S/B ratio is not favorable for performing an accurate FID assay. To identify the small molecules that bind to the target human pre-miRNAs listed above, we employed FID to screen a commercially available chemical library comprising 118 oxidation–reduction compounds (Targetmol) (Supplementary Data 6–8). In this library, chelerythrine chloride (Che)^55,56,57 is a known intercalating molecule with large π-plane and cationic sites and will be used as a positive control. The fluorescence emission of TOs depends on the RNA binding: free TOs exhibit low fluorescence, although the intensity increases upon RNA binding. Thus, the fluorescence emission of TOs decreases when a test compound interacts with a target RNA via the same site as the fluorescent indicator, thereby identifying it as a hit compound (Fig. 6c). We defined the hit threshold as the mean subtracted by twice standard deviations (mean−2σ). Through this screen, we identified a total of four hit compounds that disrupted TO–RNA interactions (Figs. 6d, e and S16). Although three of these compounds—baicalein (Bai), myricetin (Myr), and Che—were hits obtained from the assay when using TO-PRO-1, Bai did not meet our selection criteria when TO-PRO-3 was used as the indicator; rather, AS 602801 (AS) became a hit compound (Fig. 6d). This is probably because TO-PRO-3 differs in size and/or fluorescent properties compared with TO-PRO-1, indicating that diverse fluorescent indicators should be included to avoid false negatives and positives. Regarding the hit compounds, Myr^58,59,60 and Che^55,56,57 have been reported as DNA or RNA binders, whereas AS has not been reported.

**Fig. 6: Fluorescent indicator displacement (FID) assay using TO-PRO-1 or TO-PRO-3.**

The RNA binding of the four hit compounds was validated by measuring their K_Dapp values by fluorescence titrations (Fig. 7). These experiments revealed that Bai exhibits weak RNA binding (K_Dapp > 40), indicating that it is a false-positive compound for targeting disease-related human pre-miRNAs when using TO-PRO-1 (Figs. 7a, b and S17). The structurally similar flavonoid, Myr, exhibited moderate binding (K_Dapp = 16–25) to target RNAs, as the indicators revealed (Figs. 7 and S18). Unexpectedly, Myr bound strongly to hsa-mir-6850, which forms a G4 structure, although it was not identified as a hit compound when TO-PRO-3 was used (Fig. 7a and c). This suggests that Myr and TO-PRO-3 might have different binding sites. When using low-rank RNAs, Myr exhibited weak RNA binding (K_Dapp > 40) even though the indicators exhibited positive. Moreover, we observed that Che was bound to all the RNAs (K_Dapp = 2.6–16) though the indicators exhibited negative for low-rank RNAs (Figs. 7 and S19). Overall, predictably unreliable results were obtained when low-rank RNAs were used. The precisions of demonstrating the reliability of the assay data across the investigated RNAs became worse as the RNA ranking decreased (Fig. S20), suggesting that our binding profiles offered insight into the selection of applicable RNA targets for indicators in FID assays.

**Fig. 7: Validation of fluorescent indicator displacement (FID) assay results.**

Finally, we observed AS binding to hsa-mir-191, -21, and -6850 (K_Dapp = 14, 20, and 4.5, respectively). Interestingly, this compound exhibited strong light-up properties (Figs. 8 and S21): although free AS exhibited almost no fluorescence (Φ_free = 0.00063), strong fluorescence was observed after RNA binding (Φ_bound = 0.054). The methine tautomer⁶¹ likely contributes to this light-up property. TO-PRO-1 could not detect the RNA binding of this compound because of the interference of its strong light-up property at a similar wavelength range with the detection of the fluorescence originating from TO-PRO-1. These characteristics make AS an interesting seed compound for developing novel RNA binders and fluorescence probes.

**Fig. 8: Fluorescence spectra of AS in the titration assay.**

Conclusions

We developed the large-scale analytical platform for investigating small molecule–RNA interactions by subjecting the small molecules to FOREST. The affinity profiles generated by FOREST include not only high-affinity interactions but intermediate and low-affinity ones on the wide range of RNA structures that were derived from naturally occurring sequences. Additionally, compared with methods using massively parallel DNA sequencing, FOREST—by using microarray analysis to determine the binding affinities of RNA structure libraries—presents the affinity profiles of small molecules without any structure-dependent amplification bias³⁰. First, we validated our system using the unpaired G-specific binding property of the G-clamp (Figs. 2 and 3). The FOREST system ranked the G-clamp bindings of high-, intermediate-, and low-affinity RNA targets. The mutation experiments using rank 1 RNA (hsa-mir-4520-1 loop) showed that G-clamp forms hydrogen bonds with specific Gs. For further studies that will reveal detailed complex structures, such as X-ray crystallography or NMR, the large-scale affinity profile would help select suitable sequences for structure determination because the difficulty of these structural analyses differs depending on the sequence. Second, we generated the binding profiles of the TO derivatives using this platform (Figs. 4 and 5). Employing FOREST profiling, G4 structures, which are susceptible to bias by sequencing-based methods, were evaluated and ranked as top-tier interactors of the TO derivatives. Additionally, the analysis of the affinity profiles reveals a binding preference of the TO derivatives for RNA motifs containing G-rich terminal loops, internal loop G/A bases, and/or G/C-rich stem structures (Fig. 4c–e).

The library-wide binding landscape and profiles were also applicable to commercially available fluorescent indicators, TO-PRO-1 and TO-PRO-3, for FID assay (Fig. 6). Since our knowledge of fluorescent indicator–RNA combinations remains limited, the profiles generated by this system can benefit the selection of optimal combinations and further expand the repertoire of target RNA sequences for FID assays. In this study, we conducted FID assays using different ranks of RNA and TO-PRO-1 or TO-PRO-3 as target RNA and fluorescent indicators, respectively. The FID assays using these indicators and low-rank RNAs could not provide accurate hit compounds, while high- and intermediate-rank RNA provided reliable results (Fig. 7), demonstrating that our binding profiles are valuable for selecting applicable combinations for the FID assay. Moreover, we demonstrated the utility of this screening approach by identifying AS 602801 as an RNA binder that binds hsa-mir-191, -21, and -6850 with remarkable light-up properties (Figs. 7a, 8, and S21). Considering AS 602801 was identified using only TO-PRO-3 and FID assays have the limitation that they can basically detect compounds with similar binding sites and modes, the use of multiple fluorescent indicators is recommended for FID assays. In addition, the development of new fluorescent indicators that differ from known ones will be important to address the limitation in hit compound types. For example, indicators that conjugate fluorescent units whose fluorescence changes with an RNA binding event to an RNA-binding molecule^45,62,63 with various binding modes are expected to provide new hit compounds that have been overlooked by existing indicators in FID assays. In this case, FOREST will be valuable for obtaining RNA-binding information, designing the conjugated indicator, and understanding its binding preference.

The FOREST system in this study provides the basis for future efforts to identify new small molecule–RNA interactions, investigate the binding profiles and selectivities of various RNA-binding molecules, and aid the design of novel RNA-binding molecules through FID assays.

Methods

In silico RNA motif extraction

All motifs, including human pre-miRNA in library-1 and -2 were extracted from miRBase as detailed previously³⁰. To design library-2, the human pre-miRNA motifs were filtered based on length (<107 nt), with 1804 species collected in total. Next, we obtained RNA secondary structure datasets as determined by SHAPE-MaP or DMS-MaPseq with structural analysis^64,65. Predicted structures and conserved elements of SARS-CoV2 were obtained from a published study⁶⁶. From the collected datasets, we divided long continuous RNAs into terminal motifs and defined them as structural units using FOREST.py (https://github.com/KRK13/FOREST2020). In total, 1099 motifs were collected from the transcripts of SARS-CoV2 and Influenza A viruses. As controls, selected RNA structural motifs, aptamers, and defective mutants were collected and loaded into the libraries.

Design of a template pool of RNA structure library and DNA barcode microarray

Multiplexed single-stranded DNA sequences were used as templates for RNA probes in the library. The extracted RNA motifs were attached with T7 promoter, RNA barcodes, and stabilizing stem sequences for detection and hybridization to the DNA barcode microarray as previously described³⁰. The ssDNA templates were synthesized by SurePrint oligonucleotide library synthesis (Agilent Technologies). The size of the oligo template was limited to 170 nt for RNA structure library-1 and 190 nt for library-2. After assigning barcodes to RNA structures, the DNA reverse complementary strands of RNA barcodes were used by SureDesign (Agilent Technologies), a custom CGH array design service, to synthesize DNA barcode microarrays. The probe replication factor was set to 5× and 3×.

3’-Terminal labeling with Cy5 or Cy3

All RNA probes in the RNA structure libraries were labeled with a fluorescent dye at the 3’ end. Ten micromolar RNA structure library, 100 μM pCp-Cy5 or pCp-Cy3 (Jena Bioscience), and 0.5 U/μL T4 RNA Ligase (Thermo Fisher Scientific) were mixed in 100 μL of 1× T4 Ligase Buffer (Thermo Fisher Scientific). The mixture was incubated at 16 °C for 48 h on a ThermoMixer (Eppendorf) with ThermoTop (Eppendorf). After incubation, the labeled RNA was purified using Zymo RNA Clean and Concentrator (Zymo Research) and stored at −28 °C until use.

Synthesis of N₃-modified RNA binders

The N₃-modified G-clamp–N₃, TO–N₃, and TO-3–N₃ were synthesized using N₃–PEG₃–NH₂ as an N₃ linker after preparing the carboxylic acid intermediates (Supplementary Methods and Schemes S1–S3). TO–N₃-2 and TO-3–N₃-2 were synthesized using N₃–PEG₄–NHS ester as an N₃ linker after preparing the amine intermediates (Supplementary Methods and Schemes S4 and S5)^67,68.

RNA pull-down

The RNA structure library was prepared in 1× Binding buffer (20 mM phosphate pH 7.0, 20 mM NaCl, 80 mM KCl)³⁰. For folding, RNA was heated at 95 °C and cooled to 4 °C on a ProFlex Thermal Cycler (Thermo Fisher Scientific) with a ramp rate of −6 °C/s. During the folding step, 100 pmol of small molecules and 50 μL of Streptavidin Mag Sepharose (Cytiva) were mixed in 900 μL of 1× Binding buffer to prepare the small molecule-conjugated beads. The mixture was incubated on a ThermoMixer (Eppendorf) at 25 °C for 60 min with vortex mixing at 1200 rpm. The tube was placed on a magnetic rack to remove the supernatant and 1 μg of the refolded RNA structure library in 1 mL of 1× Binding buffer was added. A mixture containing only the beads was prepared as a control for background subtraction. The mixture was incubated on a ThermoMixer at 25 °C for 60 min with vortex mixing at 1200 rpm. The mixture was washed three times with 1× Binding buffer when the reaction ended. Two hundred microlitres of 1× Elution buffer (1% SDS, 10 mM Tris–HCl, 2 mM EDTA) was added to the magnetic beads, and the mixture was heated at 95 °C for 3 min. The bound RNA structures were collected from the supernatant by removing the magnetic beads and purified with phenol-chloroform extraction and ethanol precipitation.

Hybridization and microarray scanning

Eighteen microlitres of the bound RNA structures were mixed with 4.5 μL of 10× Blocking Agent (Agilent Technologies) and 22.5 μL of Hi-RPM Hybridization Buffer (Agilent Technologies). The samples were incubated for 5 min in a heat block set at 104 °C, then rapidly cooled and incubated for 5 min in ice water. The samples were applied to an 8 × 60K Agilent microarray gasket slide (Agilent Technologies). The prepared gasket slide and CGH custom array 8 × 60K (Agilent Technologies) were assembled with SureHyb. Hybridization was performed for 20 h at a temperature of 55.5 °C at 20 rpm. The microarray slide was washed for 5 min with Gene Expression Wash Buffer 1 (Agilent Technologies) in a glass container at room temperature following hybridization. The microarray slide was moved to a glass container containing Gene Expression Wash Buffer 2 (Agilent Technologies), which was immersed in a thermostatic bath at 37 °C. The washing step was performed for 5 min. Fluorescence scanning was performed on the microarray, and fluorescence image data were acquired using SureScan (Agilent Technologies). The acquired images were converted to numeric fluorescence intensities for each spot by Feature Extraction (Agilent Technologies) and GeneSpringGX (Agilent Technologies).

Calculation of binding intensity

The binding intensities of each RNA structure were calculated by subtracting the fluorescence intensities of the no-ligand control samples. To alleviate the effect of undesired interactions with the RNA barcode, we calculated the mean fluorescence intensities of each RNA structure from the intensities of three RNA probes that had the same RNA structure but different RNA barcodes. For this reason, we filtered the maximum and minimum values from a set of five intensities.

Statistics

For testing statistical significance, the two-tailed Brunner–Munzel test with Bonferroni correction was performed using Julia 1.6. standard error (SE) was calculated using the three probes of the RNA structure library. The binding strength is normalized as a Z-score using Eq. (1): μ is the mean value of the library population, σ is the standard deviation, and x is the binding intensity of each probe in the library.

$${{Z\,{score}}}_{x}=\frac{x-\mu }{\sigma }$$

(1)

Fluorescence binding assay

A solution (100 μL) of the binder (0.01 or 0.1 μM for G-clamp, 0.1 μM for TO–N₃ and TO-PRO-1, 1 μM for TO-3–N₃, 0.1 or 0.5 μM for TO-PRO-3) in 1x phosphate buffer (1% DMSO, 20 mM phosphate, 20 mM NaCl and 80 mM KCl) was transferred to a micro quartz cell with a 1-cm path length. Serial aliquots of a concentrated solution of RNA in 1× buffer were added to the binder solution and allowed to equilibrate for 2 min. The excitation wavelength was set at 360 nm for G-clamp, 501 nm for TO–N₃ and TO-PRO-1, 623 nm for TO-3–N₃ and TO-PRO-3, and the emission was recorded at 20 °C. Fluorescence measurements were performed with a JASCO-6500 spectrofluorometer (JASCO, Tokyo, Japan).

The data from the titrations were analyzed according to the independent-site model by non-linear fitting to Eqs. (2) or (3), in which F₀ is the initial fluorescence intensity in the absence of RNA, Q (=F_max/F₀) is the fluorescence enhancement upon saturation, A = K_Dapp/C_ligand and X = nC_RNA/C_ligand (n is the putative number of binding sites on RNA and n = 1 was used)⁶⁹. The parameters Q and X were determined by KaleidaGraph (Synergy Software, PA). The K_Dapp values in the main text show the mean values of two or three experiments.

$${{{{{\rm{F}}}}}}/{{{{{\rm{F}}}}}}_{0}=1+({{{{{\rm{Q}}}}}}-1)/2\{{{{{{\rm{A}}}}}}+1+{{{{{\rm{X}}}}}}-[({{{{{\rm{X}}}}}}+1+{{{{{\rm{A}}}}}})^{2}-4{{{{{\rm{X}}}}}}]^{1/2}\}$$

(2)

$${{{{{{\rm{or}}}}}}}\,\Delta {{{{{\rm{F}}}}}}={{{{{\rm{F}}}}}}-{{{{{\rm{{F}}}}}}}_{0}={{{{{\rm{{F}}}}}}}_{0}({{{{{{\rm{Q}}}}}}}-1)/2\{{{{{{\rm{{A}}}}}}}+1+{{{{{\rm{{X}}}}}}}-[({{{{{\rm{X}}}}}}+1+{{{{{\rm{A}}}}}})^{2}-4{{{{{\rm{X}}}}}}]^{1/2}\}$$

(3)

SPR analysis

Immobilization: 5′-biotinylated RNA (hsa-mir-4520-1 loop, mutG2A, or mutG7A) was diluted to 1 μM in 1× Binding buffer (20 mM phosphate pH 7.0, 20 mM NaCl, and 80 mM KCl), and the solution was heated at 95 °C for 5 min and cooled on ice. The folded RNAs were injected over a streptavidin-coated sensor chip (Series S Sensor chip SA, Cytiva) at 60 μL/min to reach an immobilized level of 1481, 1379, and 1387 RU for the hsa-mir-4520-1 loop, mutG2A, and mutG7A, respectively.

Binding analysis by single-cycle kinetics: the RNA binder (G-clamp-N₃) in 1× Binding buffer (20 mM phosphate pH 7.0, 20 mM NaCl, and 80 mM KCl) was injected at increasing concentrations (100, 200, 300, 400, and 500 nM for hsa-mir-4520-1 loop, 20, 40, 60, 80, and 100 nM for mutG2A, or 1, 2, 3, 4, and 5 μM for mutG7A) to the RNA-immobilized sensor surface without a regeneration step between each concentration. The RNA binder was injected with a flow rate of 60 μL/min, contact time of 30 s, and dissociation time of 120 s using the running buffer at 25 °C. All sensorgrams were corrected by subtracting the blank flow cell and buffer injection responses. All kinetics were obtained by Biacore T200 evaluation software.

Binding analysis by multi-cycle kinetics: the RNA binder (G-clamp-N₃) in 1× Binding buffer (20 mM phosphate pH 7.0, 20 mM NaCl, and 80 mM KCl, 1%DMSO) was injected at increasing concentrations (1, 2, 3, 5, 10, 20, 30, and 50 μM for mutG7A) to the RNA-immobilized sensor surface with a regeneration step between each concentration. The RNA binder was injected with a flow rate of 60 μL/min, contact time of 30 s, and dissociation time of 120 s using the running buffer at 25 °C. A regeneration step was conducted with a flow rate of 60 μL/min and contact time of 30 s using 1 M NaCl solution. All sensorgrams were corrected by subtracting the blank flow cell and buffer injection responses. SPR response values at 20 min were used to compute the K_Dapp value using the 1:1 binding equation {y = (B_max + x)/(K_Dapp + x)}, where y is the SPR response, B_max is the maximum SPR response, K_Dapp is the apparent dissociation constant, and x is the concentration of the added RNA binder.

RNA secondary structure prediction and visualization

The forna website⁷⁰ was used to generate illustrations of the RNA secondary structures predicted by RNAfold 2.4.13 in the ViennaRNA package⁴⁹ with the temperature set to 25 °C. The RNA structures extracted from the long transcripts (5’ UTR and HIV-1 genome) included in library-2 were taken from a previous study³⁰.

Structural preference analysis

Following previous studies⁷¹, secondary structure prediction of RNA motifs in the RNA structure library was performed by RNAsubopt 2.4.13 in the ViennaRNA package⁴⁹ with parameters set to the following: (command: RNAsubopt --temp=25 --stochBT=30). Each nucleotide (A, G, U, C) of each base pair state (ssRNA or dsRNA) or each structural motif (terminal loop, inner loop, or stem) was counted using the secondary structures generated by RNAsubopt as input.

Molecular modeling

The RNA 3D structures were predicted using RNAComposer^50,51. The energy minimization of complex structures between RNA and G-clamp–N₃ was performed using MacroModel (Schrödinger) after setting G-clamp–N₃ to face the G base so that hydrogen bonds could be formed. OPLS3e and water were used as the force field and solvent, respectively.

FID assay

Fluorescence intensities in FID assays were measured with a microplate reader Infinite® 200 PRO (TECAN Group Ltd., Mannedorf, Switzerland) using i-control® and LBS-coated Optiplate^TM-96F as 96-well plates. Buffer solution (20 mM phosphate pH 7.0, 20 mM NaCl, 80 mM KCl) was added to each well (49.5 μL for blank well and negative control well, 49 μL for positive control well and sample well), followed by the addition of 0.25 μL of ligand solution (20 μM for TO-PRO-1 and 100 μM for TO-PRO-3) to each well except for blank wells. RNA solution (0.5 μL of 10 μM for TO-PRO-1 and 50 μM for TO-PRO-3) in Binding buffer was dispensed in positive control and sample wells. DMSO was added to the control (negative and positive, 0.25 μL) and blank (0.5 μL) wells; while 0.25 μL of compound solution in DMSO (1 mM, Targetmol) was added to each sample well and mixed with RNA-ligand solutions. Fluorescence intensities of the mixtures were measured after incubating for 30 min. The excitation wavelength was set at 485 nm for TO-PRO-1 or 620 nm for TO-PRO-3. Normalized fluorescence intensity (F) was calculated using Eq. (4) described below:

$${ {Normalized}\; F}=\frac{{F}_{({ {indicator}}+{{RNA}}+{ {test}\; {compounds}})}-{F}_{({ {buffer}}+{{indicator})}}}{{F}_{({ {indicator}}+{ {RNA}})}-{F}_{({ {buffer}}+{{indicator}})}}$$

(4)

Hits were selected based on a reduction of TO-PRO-1 or TO-PRO-3 signal by less than two standard deviations (2σ) from the mean. Normalized fluorescence intensities >1.5 were excluded from calculations for the mean and σ.

Calculation of fluorescent quantum yield

The fluorescent quantum yields (QY) of AS 602801 in the presence of RNA were calculated using quinine sulfate in 0.1 M H₂SO₄ as a standard (Φ = 0.55). Absorbance and fluorescence values were recorded 3 min after mixing RNA and AS 602801. For calculating QY, conditions for absorbance measurement were as follows: [AS 602801] = 2.5 μM, [RNA] = 5 μM, and ε366; and for fluorescence measurement: [AS 602801] = 1 μM, [RNA] = 2 μM, emission spectrum area of 380–600 nm was used for integration. QY values were calculated according to Eq. (5):

$${\phi }_{{{sam}}.}={\phi }_{{{ref}}.}\times \frac{{\varepsilon }_{{{ref}}.}}{{\varepsilon }_{{ {sam}}.}}\times \frac{{c}_{{{ref}}.}}{{c}_{{ {sam}}.}}\times \frac{{({n}_{{ {sam}}.})}^{2}}{{({n}_{{{ref}}.})}^{2}}\times \frac{{F}_{{ {sam}}.}}{{F}_{{{ref}}.}}$$

(5)

where Φ_sam. is quantum yield of the sample, Φ_ref. is the quantum yield of the reference compound, ε_sam. is the molar extinction coefficient of the sample, ε_ref. is the molar extinction coefficient of the reference compound, c_ref is the concentration of the reference compound, c_sam is the concentration of the sample, n_sam. is the refractive index of the sample solution, n_erf. is the refractive index of the reference solution, F_sam. is the fluorescence intensity of the sample solution, and F_ref. is the fluorescence intensity of the reference solution.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

The datasets of FOREST are available in Supplementary Data 1–5. The compound structures from the chemical library are available in Supplementary Data 6. The datasets of the FID assay are available in Supplementary Data 7 and 8.

Code availability

The custom codes for terminal motif extraction and designing the RNA structure library are available on the Github page (https://github.com/KRK13/FOREST2020/). The other codes used in this study are available from the corresponding authors upon request.

References

Warner, K. D., Hajdin, C. E. & Weeks, K. M. Principles for targeting RNA with drug-like small molecules. Nat. Rev. Drug Discov. 17, 547–558 (2018).
Article CAS PubMed PubMed Central Google Scholar
Sztuba-Solinska, J., Chavez-Calvillo, G. & Cline, S. E. Unveiling the druggable RNA targets and small molecule therapeutics. Bioorg. Med. Chem. 27, 2149–2165 (2019).
Article CAS PubMed PubMed Central Google Scholar
Guan, L. & Disney, M. D. Recent advances in developing small molecules targeting RNA. ACS Chem. Biol. 7, 73–86 (2012).
Article CAS PubMed Google Scholar
Bush, J. A. et al. Systematically studying the effect of small molecules interacting with RNA in cellular and preclinical models. ACS Chem. Biol. 16, 1111–1127 (2021).
Article CAS PubMed PubMed Central Google Scholar
Hargrove, A. E. Small molecule–RNA targeting: starting with the fundamentals. Chem. Commun. 56, 14744–14756 (2020).
Article CAS Google Scholar
Cheung, A. K. et al. Discovery of small molecule splicing modulators of survival motor neuron-2 (SMN2) for the treatment of spinal muscular atrophy (SMA). J. Med. Chem. 61, 11021–11036 (2018).
Article CAS PubMed Google Scholar
Sturm, S. et al. A phase 1 healthy male volunteer single escalating dose study of the pharmacokinetics and pharmacodynamics of risdiplam (RG7916, RO7034067), a SMN2 splicing modifier. Br. J. Clin. Pharmacol. 85, 181–193 (2019).
Article CAS PubMed Google Scholar
Bose, D. et al. The tuberculosis drug streptomycin as a potential cancer therapeutic: inhibition of miR-21 function by directly targeting its precursor. Angew. Chem. Int. Ed. 51, 1019–1023 (2012).
Article CAS Google Scholar
Vo, D. D. et al. Targeting the production of oncogenic microRNAs with multimodal synthetic small molecules. ACS Chem. Biol. 9, 711–721 (2014).
Article CAS PubMed Google Scholar
Velagapudi, S. P., Gallo, S. M. & Disney, M. D. Sequence-based design of bioactive small molecules that target precursor microRNAs. Nat. Chem. Biol. 10, 291–297 (2014).
Article CAS PubMed PubMed Central Google Scholar
Velagapudi, S. P. et al. Design of a small molecule against an oncogenic noncoding RNA. Proc. Natl Acad. Sci. USA. 113, 5898–5903 (2016).
Article CAS PubMed PubMed Central Google Scholar
Liu, X. et al. Targeted degradation of the oncogenic microRNA 17-92 cluster by structure-targeting ligands. J. Am. Chem. Soc. 142, 6970–6982 (2020).
Article CAS PubMed PubMed Central Google Scholar
Yan, H., Bhattarai, U., Guo, Z.-F. & Liang, F.-S. Regulating miRNA-21 biogenesis by bifunctional small molecules. J. Am. Chem. Soc. 139, 4987–4990 (2017).
Article CAS PubMed PubMed Central Google Scholar
Wong, C.-H. et al. Targeting toxic RNAs that cause myotonic dystrophy type 1 (DM1) with a bisamidinium inhibitor. J. Am. Chem. Soc. 136, 6355–6361 (2014).
Article CAS PubMed PubMed Central Google Scholar
Rzuczek, S. G. et al. Precise small-molecule recognition of a toxic CUG RNA repeat expansion. Nat. Chem. Biol. 13, 188–193 (2017).
Article CAS PubMed Google Scholar
Reddy, K. et al. A CTG repeat-selective chemical screen identifies microtubule inhibitors as selective modulators of toxic CUG RNA levels. Proc. Natl Acad. Sci. USA. 116, 20991–21000 (2019).
Article CAS PubMed PubMed Central Google Scholar
Lee, J. et al. Intrinsically cell-penetrating multivalent and multitargeting ligands for myotonic dystrophy type 1. Proc. Natl Acad. Sci. USA. 116, 8709–8714 (2019).
Article CAS PubMed PubMed Central Google Scholar
Shibata, T. et al. Small molecule targeting r(UGGAA)n disrupts RNA foci and alleviates disease phenotype in Drosophila model. Nat. Commun. 12, 236 (2021).
Article CAS PubMed PubMed Central Google Scholar
Howe, J. A. et al. Selective small-molecule inhibition of an RNA structural element. Nature 526, 672–677 (2015).
Article CAS PubMed Google Scholar
Fedorova, O. et al. Small molecules that target group II introns are potent antifungal agents. Nat. Chem. Biol. 14, 1073–1078 (2018).
Article CAS PubMed PubMed Central Google Scholar
Rangan, R. et al. De novo 3D models of SARS-CoV-2 RNA elements from consensus experimental secondary structures. Nucleic Acids Res. 49, 3092–3108 (2021).
Article CAS PubMed PubMed Central Google Scholar
Velagapudi, S. P. et al. Defining RNA–small molecule affinity landscapes enables design of a small molecule inhibitor of an oncogenic noncoding RNA. ACS Central Sci. 3, 205–216 (2017).
Article CAS Google Scholar
Ursu, A. et al. Gini coefficients as a single value metric to define chemical probe selectivity. ACS Chem. Biol. 15, 2031–2040 (2020).
Mukherjee, H. et al. PEARL-seq: a photoaffinity platform for the analysis of small molecule–RNA interactions. ACS Chem. Biol. 15, 2374–2381 (2020).
Article CAS PubMed Google Scholar
Disney, M. D. Targeting RNA with small molecules to capture opportunities at the intersection of chemistry, biology, and medicine. J. Am. Chem. Soc. 141, 6776–6790 (2019).
Article CAS PubMed PubMed Central Google Scholar
Endoh, T., Ohyama, T. & Sugimoto, N. RNA-capturing microsphere particles (R-CAMPs) for optimization of functional aptamers. Small 15, 1805062 (2019).
Article Google Scholar
Satpathi, S., Endoh, T., Podbevšek, P., Plavec, J. & Sugimoto, N. Transcriptome screening followed by integrated physicochemical and structural analyses for investigating RNA-mediated berberine activity. Nucleic Acids Res. 49, 8449–8461 (2021).
Article CAS PubMed PubMed Central Google Scholar
Kwok, C. K., Marsico, G., Sahakyan, A. B., Chambers, V. S. & Balasubramanian, S. rG4-seq reveals widespread formation of G-quadruplex structures in the human transcriptome. Nat. Methods 13, 841–844 (2016).
Article CAS PubMed Google Scholar
Murat, P., Guilbaud, G. & Sale, J. E. DNA polymerase stalling at structured DNA constrains the expansion of short tandem repeats. Genome Biol. 21, 209 (2020).
Article CAS PubMed PubMed Central Google Scholar
Komatsu, K. R. et al. RNA structure-wide discovery of functional interactions with multiplexed RNA motif library. Nat. Commun. 11, 6275 (2020).
Article CAS PubMed PubMed Central Google Scholar
Lin, K.-Y. & Matteucci, M. D. A cytosine analogue capable of clamp-like binding to a guanine in helical nucleic acids. J. Am. Chem. Soc. 120, 8531–8532 (1998).
Article CAS Google Scholar
Murase, H. & Nagatsugi, F. Development of the binding molecules for the RNA higher-order structures based on the guanine-recognition by the G-clamp. Bioorg. Med. Chem. Lett. 29, 1320–1324 (2019).
Article CAS PubMed Google Scholar
Murase, H., Nagatsugi, F. & Sasaki, S. Development of a selective ligand for G–G mismatches of CGG repeat RNA inducing the RNA structural conversion from the G-quadruplex into a hairpin-like structure. Org. Biomol. Chem. 20, 3375–3381 (2022).
Article CAS PubMed Google Scholar
Krishnamurthy, M., Schirle, N. T. & Beal, P. A. Screening helix-threading peptides for RNA binding using a thiazole orange displacement assay. Biorg. Med. Chem. 16, 8914–8921 (2008).
Article CAS Google Scholar
Asare-Okai, P. N. & Chow, C. S. A modified fluorescent intercalator displacement assay for RNA ligand discovery. Anal. Biochem. 408, 269–276 (2011).
Article CAS PubMed Google Scholar
Tran, T. & Disney, M. D. Identifying the preferred RNA motifs and chemotypes that interact by probing millions of combinations. Nat. Commun. 3, 1125 (2012).
Article PubMed Google Scholar
Sato, Y. et al. Trimethine cyanine dyes as deep-red fluorescent indicators with high selectivity to the internal loop of the bacterial A-site RNA. Chem. Commun. 55, 3183–3186 (2019).
Article CAS Google Scholar
Sato, Y. et al. Strong binding and off–on signaling functions of deep-red fluorescent TO-PRO-3 for influenza A virus RNA promoter region. ChemBioChem 20, 2752–2756 (2019).
Article CAS PubMed Google Scholar
Zhang, J., Umemoto, S. & Nakatani, K. Fluorescent indicator displacement assay for ligand−RNA interactions. J. Am. Chem. Soc. 132, 3660–3661 (2010).
Article CAS PubMed Google Scholar
Murata, A., Harada, Y., Fukuzumi, T. & Nakatani, K. Fluorescent indicator displacement assay of ligands targeting 10 microRNA precursors. Biorg. Med. Chem. 21, 7101–7106 (2013).
Article CAS Google Scholar
Fukuzumi, T., Murata, A., Aikawa, H., Harada, Y. & Nakatani, K. Exploratory study on the RNA-binding structural motifs by library screening targeting pre-miRNA-29a. Chem. Eur. J. 21, 16859–16867 (2015).
Article CAS PubMed Google Scholar
Wicks, S. L. & Hargrove, A. E. Fluorescent indicator displacement assays to identify and characterize small molecule interactions with RNA. Methods 167, 3–14 (2019).
Article CAS PubMed PubMed Central Google Scholar
del Villar-Guerra, R., Gray, R. D., Trent, J. O. & Chaires, J. B. A rapid fluorescent indicator displacement assay and principal component/cluster data analysis for determination of ligand–nucleic acid structural selectivity. Nucleic Acids Res. 46, e41 (2018).
Article PubMed PubMed Central Google Scholar
Das, B., Murata, A. & Nakatani, K. A small-molecule fluorescence probe ANP77 for sensing RNA internal loop of C, U and A/CC motifs and their binding molecules. Nucleic Acids Res. 49, 8462–8470 (2021).
Article CAS PubMed PubMed Central Google Scholar
Shibata, T. et al. Fluorescent indicator displacement assay for the discovery of UGGAA repeat-targeted small molecules. Chem. Commun. 59, 5071–5074 (2023).
Article CAS Google Scholar
Largy, E., Hamon, F. & Teulade-Fichou, M.-P. Development of a high-throughput G4-FID assay for screening and evaluation of small molecules binding quadruplex nucleic acid structures. Anal. Bioanal. Chem. 400, 3419–3427 (2011).
Article CAS PubMed Google Scholar
Agard, N. J., Prescher, J. A. & Bertozzi, C. R. A strain-promoted [3 + 2] azide−alkyne cycloaddition for covalent modification of biomolecules in living systems. J. Am. Chem. Soc. 126, 15046–15047 (2004).
Article CAS PubMed Google Scholar
Debets, M. F., van der Doelen, C. W., Rutjes, F. P. & van Delft, F. L. Azide: a unique dipole for metal-free bioorthogonal ligations. ChemBioChem 11, 1168–1184 (2010).
Article CAS PubMed Google Scholar
Lorenz, R. et al. ViennaRNA Package 2.0. Algorithms Mol. Biol. 6, 26 (2011).
Article PubMed PubMed Central Google Scholar
Popenda, M. et al. Automated 3D structure composition for large RNAs. Nucleic Acids Res. 40, e112 (2012).
Article CAS PubMed PubMed Central Google Scholar
Biesiada, M., Pachulska-Wieczorek, K., Adamiak, R. W. & Purzycka, K. J. RNAComposer and RNA 3D structure prediction for nanotechnology. Methods 103, 120–127 (2016).
Article CAS PubMed Google Scholar
Mukohyama, J. et al. miR-221 targets QKI to enhance the tumorigenic capacity of human colorectal cancer stem cells. Cancer Res. 79, 5151–5158 (2019).
Article CAS PubMed PubMed Central Google Scholar
Elyakim, E. et al. hsa-miR-191 is a candidate oncogene target for hepatocellular carcinoma therapy. Cancer Res. 70, 8077–8087 (2010).
Article CAS PubMed Google Scholar
Si, M. L. et al. miR-21-mediated tumor growth. Oncogene 26, 2799–2803 (2007).
Article CAS PubMed Google Scholar
Bai, L.-P., Hagihara, M., Nakatani, K. & Jiang, Z.-H. Recognition of chelerythrine to human telomeric DNA and RNA G-quadruplexes. Sci. Rep. 4, 6767 (2014).
Article PubMed PubMed Central Google Scholar
Basu, P. & Suresh Kumar, G. Small molecule–RNA recognition: binding of the benzophenanthridine alkaloids sanguinarine and chelerythrine to single stranded polyribonucleotides. J. Photochem. Photobiol. B: Biology 174, 173–181 (2017).
Article CAS PubMed Google Scholar
Chen, H. et al. Chelerythrine as a fluorescent light-up ligand for an i-motif DNA structure. New J. Chem. 45, 28–31 (2021).
Article Google Scholar
Mondal, S., Jana, J., Sengupta, P., Jana, S. & Chatterjee, S. Myricetin arrests human telomeric G-quadruplex structure: a new mechanistic approach as an anticancer agent. Mol. Biosyst. 12, 2506–2518 (2016).
Article CAS PubMed Google Scholar
Das, A., Majumder, D. & Saha, C. Correlation of binding efficacies of DNA to flavonoids and their induced cellular damage. J. Photochem. Photobiol. B: Biology 170, 256–262 (2017).
Article CAS PubMed Google Scholar
Khan, E. et al. Myricetin reduces toxic level of CAG repeats RNA in Huntington’s disease (HD) and spino cerebellar ataxia (SCAs). ACS Chem. Biol. 13, 180–188 (2018).
Article CAS PubMed Google Scholar
Gaillard, P. et al. Design and synthesis of the first generation of novel potent, selective, and in vivo active (benzothiazol-2-yl)acetonitrile Inhibitors of the c-Jun N-terminal kinase. J. Med. Chem. 48, 4596–4607 (2005).
Article CAS PubMed Google Scholar
Sato, Y., Saito, H., Aoki, D., Teramae, N. & Nishizawa, S. Lysine linkage in abasic site-binding ligand–thiazole orange conjugates for improved binding affinity to orphan nucleobases in DNA/RNA hybrids. Chem. Commun. 52, 14446–14449 (2016).
Article CAS Google Scholar
Pei, R., Rothman, J., Xie, Y. & Stojanovic, M. N. Light-up properties of complexes between thiazole orange-small molecule conjugates and aptamers. Nucleic Acids Res. 37, e59 (2009).
Article PubMed PubMed Central Google Scholar
Simon, L. M. et al. In vivo analysis of influenza A mRNA secondary structures identifies critical regulatory motifs. Nucleic Acids Res. 47, 7003–7017 (2019).
Article CAS PubMed PubMed Central Google Scholar
Manfredonia, I. et al. Genome-wide mapping of SARS-CoV-2 RNA structures identifies therapeutically-relevant elements. Nucleic Acids Res. 48, 12436–12452 (2020).
Article CAS PubMed PubMed Central Google Scholar
Rangan, R. et al. RNA genome conservation and secondary structure in SARS-CoV-2 and SARS-related viruses: a first look. RNA 26, 937–959 (2020).
Article CAS PubMed PubMed Central Google Scholar
Ikeda, S., Kubota, T., Yuki, M. & Okamoto, A. Exciton-controlled hybridization-sensitive fluorescent probes: multicolor detection of nucleic acids. Angew. Chem. Int. Ed. 48, 6480–6484 (2009).
Article CAS Google Scholar
Ikeda, S. et al. Hybridization-sensitive fluorescence control in the near-infrared wavelength range. Org. Biomol. Chem. 9, 4199–4204 (2011).
Article CAS PubMed Google Scholar
Stootman, F. H., Fisher, D. M., Rodger, A. & Aldrich-Wright, J. R. Improved curve fitting procedures to determine equilibrium binding constants. Analyst 131, 1145–1151 (2006).
Article CAS PubMed Google Scholar
Kerpedjiev, P., Hammer, S. & Hofacker, I. L. Forna (force-directed RNA): simple and effective online RNA secondary structure diagrams. Bioinformatics 31, 3377–3379 (2015).
Article CAS PubMed PubMed Central Google Scholar
Dominguez, D. et al. Sequence, structure, and context preferences of human RNA binding proteins. Mol. Cell 70, 854–867.e859 (2018).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We thank Kelvin Hui (Kyoto University) for critical reading of the manuscript. We thank Yoshikazu Tanaka, Kanami Takahashi and Eiko Hanzawa (Tohoku University) for supporting SPR analysis and Asako Murata (Kyushu University) for advice regarding SPR analysis. This work was supported in part by Grant-in-Aid for Scientific Research on Innovative Areas “Middle Molecular Strategy” (No. JP15H05838 to F.N.), “ncRNA neotaxonomy” (No. JP17H05601 to H.S.), “Frontier Research on Chemical Communications” (No. JP20H04762 to K.O.), Transformative Research Areas (A) “Biophysical Chemistry for Material Symbiosis” (No. JP23H04051 to K.O.), Scientific Research (B) (JP19H02845 to K.O. and JP20H02855 and JP23H02076 to F.N.), Specially Promoted Research (No. JP20H05626 to H.S.), Challenging Exploratory Research (No. JP19K22387 to H.S. and No. JP21K19038 to F.N.) from the Japan Society for the Promotion of Science (JSPS); Japan Science and Technology Agency (JST) FOREST program (No. JPMJFR2002 to K.O.) and SPRING program (No. JPMJSP2114 to R.N.); the Takeda Science Foundation (K.O.), the Uehara Memorial Foundation (K.O.), the Noguchi Foundation (K.O.), the Tokyo Biochemical Research Foundation (K.O.), the Naito Foundation (K.R.K.), the Mitsubishi Foundation (H.S.), and the research program of ̏Crossover Alliance to Create the Future with People, Intelligence and Materials ̋ from MEXT, Japan.

Author information

These authors contributed equally: Ryosuke Nagasawa, Kazumitsu Onizuka, Kaoru R. Komatsu.
These authors jointly supervised this work: Kazumitsu Onizuka, Hirohide Saito, Fumi Nagatsugi.

Authors and Affiliations

Institute of Multidisciplinary Research for Advanced Materials, Tohoku University, Miyagi, 980-8577, Japan
Ryosuke Nagasawa, Kazumitsu Onizuka, Hirotaka Murase, Kanna Ojima, Shunya Ishikawa, Mamiko Ozawa & Fumi Nagatsugi
Department of Chemistry, Graduate School of Science, Tohoku University, Miyagi, 980-8578, Japan
Ryosuke Nagasawa, Kazumitsu Onizuka, Kanna Ojima, Shunya Ishikawa & Fumi Nagatsugi
Division for the Establishment of Frontier Sciences of Organization for Advanced Studies, Tohoku University, Miyagi, 980-8577, Japan
Kazumitsu Onizuka
Center for iPS Cell Research and Application (CiRA), Kyoto University, Kyoto, 606-8507, Japan
Kaoru R. Komatsu, Emi Miyashita & Hirohide Saito

Authors

Ryosuke Nagasawa
View author publications
You can also search for this author in PubMed Google Scholar
Kazumitsu Onizuka
View author publications
You can also search for this author in PubMed Google Scholar
Kaoru R. Komatsu
View author publications
You can also search for this author in PubMed Google Scholar
Emi Miyashita
View author publications
You can also search for this author in PubMed Google Scholar
Hirotaka Murase
View author publications
You can also search for this author in PubMed Google Scholar
Kanna Ojima
View author publications
You can also search for this author in PubMed Google Scholar
Shunya Ishikawa
View author publications
You can also search for this author in PubMed Google Scholar
Mamiko Ozawa
View author publications
You can also search for this author in PubMed Google Scholar
Hirohide Saito
View author publications
You can also search for this author in PubMed Google Scholar
Fumi Nagatsugi
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

K.O. and K.R.K. designed the experiments. K.O., H.S. and F.N. mentored the research. R.N., H.M., K. Ojima, and S.I. synthesized compounds. R.N., K.R.K., E.M. and M.O. performed analytical experiments. R.N., K.O., K.R.K., and E.M. analyzed the results. R.N., K.O. and K.R.K. mainly wrote the manuscript. All authors discussed the results and provided feedback on the study and manuscript.

Corresponding authors

Correspondence to Kazumitsu Onizuka, Hirohide Saito or Fumi Nagatsugi.

Ethics declarations

Competing interests

K.R.K. and H.S. own shares of xFOREST Therapeutics Co., Ltd. All other authors declare no competing interests.

Peer review

Peer review information

Communications Chemistry thanks the anonymous reviewers for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Peer Review File

Supplemental Information

Description of Additional Supplementary Files

Supplementary Data 1

Supplementary Data 2

Supplementary Data 3

Supplementary Data 4

Supplementary Data 5

Supplementary Data 6

Supplementary Data 7

Supplementary Data 8

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Nagasawa, R., Onizuka, K., Komatsu, K.R. et al. Large-scale analysis of small molecule-RNA interactions using multiplexed RNA structure libraries. Commun Chem 7, 98 (2024). https://doi.org/10.1038/s42004-024-01181-8

Download citation

Received: 20 September 2023
Accepted: 17 April 2024
Published: 01 May 2024
DOI: https://doi.org/10.1038/s42004-024-01181-8

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.