Dynamic characterization and interpretation for protein-RNA interactions across diverse cellular conditions using HDRNet

Zhu, Haoran; Yang, Yuning; Wang, Yunhe; Wang, Fuzhou; Huang, Yujian; Chang, Yi; Wong, Ka-chun; Li, Xiangtao

doi:10.1038/s41467-023-42547-1

Download PDF

Article
Open access
Published: 26 October 2023

Dynamic characterization and interpretation for protein-RNA interactions across diverse cellular conditions using HDRNet

Haoran Zhu¹,
Yuning Yang²,
Yunhe Wang³,
Fuzhou Wang⁴,
Yujian Huang⁵,
Yi Chang¹,
Ka-chun Wong ORCID: orcid.org/0000-0001-6062-733X⁴ &
…
Xiangtao Li ORCID: orcid.org/0000-0002-8716-9823¹

Nature Communications volume 14, Article number: 6824 (2023) Cite this article

4262 Accesses
5 Citations
4 Altmetric
Metrics details

Subjects

Abstract

RNA-binding proteins play crucial roles in the regulation of gene expression, and understanding the interactions between RNAs and RBPs in distinct cellular conditions forms the basis for comprehending the underlying RNA function. However, current computational methods pose challenges to the cross-prediction of RNA-protein binding events across diverse cell lines and tissue contexts. Here, we develop HDRNet, an end-to-end deep learning-based framework to precisely predict dynamic RBP binding events under diverse cellular conditions. Our results demonstrate that HDRNet can accurately and efficiently identify binding sites, particularly for dynamic prediction, outperforming other state-of-the-art models on 261 linear RNA datasets from both eCLIP and CLIP-seq, supplemented with additional tissue data. Moreover, we conduct motif and interpretation analyses to provide fresh insights into the pathological mechanisms underlying RNA-RBP interactions from various perspectives. Our functional genomic analysis further explores the gene-human disease associations, uncovering previously uncharacterized observations for a broad range of genetic disorders.

Inferring gene regulatory networks from single-cell multiome data using atlas-scale external data

Article Open access 12 April 2024

Assessing GPT-4 for cell type annotation in single-cell RNA-seq analysis

Article Open access 25 March 2024

scGPT: toward building a foundation model for single-cell multi-omics using generative AI

Article 26 February 2024

Introduction

RNA-binding proteins (RBPs) are an essential group of proteins that interact with RNA by recognizing specific RNA-binding domains, and are involved in post-transcriptional regulation of RNA splicing, translocation, sequence editing, intracellular localization, and translational control^1,2,3. Accurately identifying RBP binding states in specific cellular conditions is a significant challenge that required for unraveling the underlying regulatory mechanisms and understanding their biological function. Traditional experiment-based biological methods such as systematic evolution of ligands by exponential selection (SELEX)⁴, RNAcompete⁵, and RNA Bind-n-Seq⁶ have been developed to characterize the sequence preferences of RBPs in vitro, while RNA immunoprecipitation (RIP)⁷ and other immunoprecipitation-based technologies^8,9,10 were proposed to identify RBP binding sites in vivo. Unfortunately, these laboratory-based experiments are time-consuming, labor-intensive, and susceptible to measurement errors. Therefore, developing high-throughput, accurate, and robust approaches to investigate RBP binding modes is of great importance¹¹.

Thanks to the development of cross-linked immunoprecipitation sequencing technology^8,12, many RBP-RNA binding targets have been uncovered^13,14, enabling us to develop effective data-driven computational methods¹⁵. These methods can be broadly classified into two categories¹⁶, either predicting RNA-binding sites on the protein surface¹⁷ or modeling the preferred RNA sequences of RNA-binding proteins¹⁸. From the protein perspective, several computational tools have been developed to predict the RBP-RNA binding sites at the protein level. For instance, SCRIBER, uses predictions of binding residues for several partner types to effectively reduce cross-prediction of the output protein-binding residues, combining novel and previously used input types¹⁹, while aPRBind, developed by Liu et al., combines protein sequence and structural features for RNA-binding residue prediction²⁰. DRNApred²¹, a fast sequence-based method that accurately predicts and discriminates DNA- and RNA-binding residues was proposed by Yan et al., with regression that penalizes cross-predictions, and a two-layered architecture. However, these statistical or machine learning-based algorithms often encounter performance limitations as the size of the dataset continues to grow. This is primarily due to their inability to effectively capture complex patterns and relationships in large-scale data, which can lead to unsatisfactory predictive performance. Recently, to address such limitations, deep learning methods have been developed; for instance, DeepSite employs a 3D deep convolutional neural network to predict the binding site using protein structure²². Xia et al. proposed GraphBind, an end-to-end graph neural network that uses hierarchical graph neural networks to identify nucleic-acid-binding residues on proteins²³. Zhang et al.²⁴ presented DeepDISOBind, a deep multi-task architecture that accurately predicts DNA-, RNA- and protein-binding regions from protein sequences. Most recently, Lam et al. introduced NucleicNet, a deep learning model that predicts the binding preference of RNA backbone constituents and different bases from local physicochemical characteristics of the protein structure surface²⁵. These methods have collectively enhanced our understanding of the binding properties of RBPs at the protein surface level.

In parallel, the establishment of links between RBPs and their targets from the perspective of RNA sequences enables understanding of the regulation mechanism. A variety of efforts have been developed to address it in identical cellular conditions, referred to as static protein-RNA interactions; for instance, Deepbind²⁶ was developed to understand RBP binding preferences in RNA sequence data using a deep neural network (DNN). Ilan et al. developed DLPRB²⁷, a new DNN approach based on convolutional neural networks (CNN) and recurrent neural networks (RNN) for learning intrinsic RBP binding preferences and predicting novel interactions. Pan et al. developed iDeep²⁸, a hybrid convolutional neural network and deep belief network-based model to predict the RBP interaction sites and motifs on RNAs. Daniel et al. proposed GraphProt²⁹ that integrates sequence and computationally predicted RNA secondary structure information into graph-kernel features. Laverty et al. introduced PRIESSTESS³⁰, a universal RNA motif-finding/scanning strategy capable of identifying enriched RNA sequence and/or structure motifs that are subsequently reduced to a set of core motifs by logistic regression with LASSO regularization. However, these methods only account for a specific cellular condition and are hence limited in their ability to predict RNA-protein interactions in other cell lines while considering the dynamic contexts.

Indeed, the binding behavior of RBP-RNA interactions has been demonstrated to be dynamic in different cell types, as it is influenced by different cellular or tissue environments³¹. In other words, the binding sites of the RBP that are exclusively present in specific cells or tissues can be designated as dynamic protein-RNA interactions. Recently, a new deep learning-based method called PrismNet³², was developed to accurately predict dynamic RBP binding in various cellular conditions by integrating in vivo experimental RNA structure information³³. Nonetheless, unfortunately, we conceive that there are still room for improvements; for instance, the architectural design of the deep neural network within PrismNet can still be investigated in the context of neural network architecture search, indicating the potential for further improvements in RBP-RNA binding prediction. In addition, the one-hot encoding representation in PrismNet exhibits limitations that may hinder its ability to capture RNA features globally. Furthermore, the heterogeneity of transcripts results in widely sequential relationships across different contexts, such as various cell lines, tissues, and normal or disease conditions. The one-hot encoding may overlook this contextual information, treating each position independently and potentially missing crucial sequence patterns. Therefore, it is crucial to address these limitations by exploring alternative coding methods to leverage the contextual and sequential nature of RNA sequences.

The contextual relationship has been a focus of increasing attention, as word embedding techniques^34,35,36 have proven to be effective frameworks for automatically encoding RNA sequences due to their syntactic, lexical, and semantic similarities to human language³⁷. On this basis, much effort has been devoted to the application of advanced NLP techniques in RNA-related problems^38,39,40. However, most existing research on the application of NLP techniques in RNA-related problems has relied on static embedding models such as Word2Vector⁴¹, GloVe³⁵, Doc2Vector³⁶, and FastText⁴². Similar to one-hot representation, static embedding methods maintain the same nucleotide encoding across all contexts, which leads to a failure to capture the context-based dynamic semantic information of RNA sequences and therefore cannot represent RNA sequence features globally. Moreover, static embedding does not capture the underlying information of nucleotide sequences in different cellular contexts, thus limiting the ability of dynamic prediction in other cellular conditions. To address this knowledge gap, we propose to adopt and customize the Bidirectional Encoder Representations from Transformers (BERT) model⁴³ to learn the context-dependent information of RNA sequences, which can generate robust expressions containing the global contextual information by pre-training with a large-scale unlabeled text corpus in a self-supervised fashion. In contrast to static embeddings where nucleotide embeddings remain the same regardless of the context, the primary advantage of dynamic embeddings lies in their ability to generate context-specific features in different nucleotide contexts. Moreover, considering the heterogeneity of transcripts under different cellular, tissue, or physiological conditions (e.g., normal or diseased), dynamic embeddings can overcome the limitation of a single representation for diverse contextual sequences, enriching the global features of the sequences and surpassing the performance bottleneck in dynamically predicting RBP binding sites. Therefore, by leveraging the advantages of the transformer model, we adopt the BERT model to encode RNA sequences in dynamic embedding vectors that then contain rich, global contextual semantic information for identifying dynamic RNA-binding events between different cell lines, tissues, or physiological conditions (normal or diseased).

Here, we propose HDRNet (High-throughput Dynamic Cellular RNA-binding Event Identification using Deep Neural Network), a new end-to-end deep learning model for identifying RNA-binding interactions from eCLIP-seq data in various cellular conditions. To capture the hierarchical relationships between nucleotide sequences, we adopt multi-source biological information, including in vivo RNA secondary structure information and bio-language features, to characterize both the sequence and structural features of RNA. Then, we combine biological information from different sources using the unified alignment model to uncover possible relationships between nucleotide sequences and latent structural information. Additionally, hierarchical multi-scale residual networks (HMRN) are leveraged to comprehend the contextual dependencies between the nucleotides, and deep protein-RNA binding predictor (DPRBP) is developed to extract the contextual significance of the nucleotide sequences by stacking several pyramid convolutional blocks, and incorporating batch normalization and residual shortcut connections into the network to boost robustness and reduce overfitting. We evaluate HDRNet on 261 linear RNA datasets and compare it with other baseline methods. We demonstrated its validity and scalability in both static and dynamic cellular conditions. In addition, we performed motif and interpretation analyses to gain new insights into the pathological mechanisms of RNA-RBP interactions. Our functional genomic analysis revealed the association between genes and human diseases, leading to previously unknown observations on a wide range of inherited diseases.

Results

Overview of HDRNet

The HDRNet framework has the capability to perform accurate prediction of RBP binding events by leveraging robust features from multi-source biological information to aid in the identification of high-attention binding peaks and subsequent analysis of RBP binding data, as depicted in Fig. 1. The HDRNet pipeline consists of four key components aimed at achieving the reliable prediction of RNA-binding protein (RBP) interactions. (1) The dynamic global contextual information and in vivo RNA secondary structure information are extracted to characterize both the sequence and structural properties of RNA; (2) A unified alignment of multi-source feature representation is developed to embed the dynamic contextual information and icSHAPE contour vectors with dimensional homogeneity to generate the potential feature representations; (3) We establish the hierarchical multi-scale residual network (HMRN) to extract the sequence and structural information and then calculate the binding score of the RNA-RBP interaction using the deep protein-RNA binding predictor (DPRBP), which picks the most prominent nucleotide characteristics progressively; (4) The latent embedded representation learned by the HDRNet model allows capturing the high-focus binding peaks and binding patterns of RNA sequences to investigate the association between dynamic binding sites and human diseases. In addition, it is also possible to jointly interpret gene-level knowledge in a transcriptomic context, providing insights into disease regulatory mechanisms.

**Fig. 1: The network architecture of the HDRNet algorithm.**

The HDRNet framework starts by generating multiple sources of biological information to represent the RNA sequence or structure, where the dynamic global contextual embedding representation comes from tagging the input RNA sequence as genetic codon tokens using the k-mer method and then encoding each token as a dynamic embedding vector using the pre-trained deep bidirectional transformer model (BERT). Dynamic embedding implies that the same token has different encoding in different contexts and therefore contains more nucleotide contextual information as well as long-distance dependencies. Meanwhile, the icSHAPE-pipeline is employed to analyze and generate the in-vivo RNA secondary structure profile, which is a numerical vector of the same length as the RNA sequence, providing the model with valuable information on the structural binding preferences of RBP.

Afterward, HDRNet utilizes a unified alignment of multi-source feature representation, which consists of two convolutional neural network (CNN) modules, to preliminary extract the underlying features of both the sequence and structure, and to unify their feature dimensionality, respectively. HDRNet then adopts two HMRNs to analyze the sequence and structural features independently. Each extractor comprises of CNN modules with distinct kernel size to comprehend the contextual dependencies between the nucleotides and their structures at varying distance, for accurate interaction prediction. Thereafter, HDRNet amalgamates all the extracted features and implements the DPRBP with a multi-layer feature selector module, which learns the underlying representation and selects the crucial nucleotide tokens in each layer of the module. Furthermore, HDRNet employs a sigmoid activation function on the network’s output to forecast the binding of the RNA sequence to the protein.

Finally, we can identify and align the binding motifs of RNA binding proteins by exploring the interpretability of HDRNet. Further, we delved into the underlying connections between the dynamic binding sites and human diseases by identifying the high-attention binding regions and their potential variants. To support this finding, our study also integrated transcriptomics to unveil disease regulatory mechanisms, including gene ontology enrichment, KEGG pathway analysis, protein-protein interaction network analysis, transcription factor-gene interaction analysis, miRNA-gene interaction analysis, and drug prediction analysis. Our analyses provide new insights into the interpretation of RBP datasets and the treatment of human diseases.

HDRNet provides better performance than baseline methods in static cellular protein-RNA interaction identification

We evaluated the performance of our proposed HDRNet by comparing it with five other state-of-the-art computational methods, namely PrismNet³², PRIESSTESS³⁰, DMSK³⁹, iDeep²⁸, DeepBind²⁶, and Graphprot²⁹ on 261 static RBP binding site datasets. The area under the receiver operating characteristic curve (AUC) was adopted as the performance metric for all computational methods. For each RBP dataset, we partitioned the binding sites into training and test sets. Then, we used the test set to benchmark all prediction methods³². Note that we used the data from a uniform pipeline³², which ensures that the data is accurate and consistent and that the comparison is fair.

Circos plot in Fig. 2a shows the overall experimental results of HDRNet compared to the other methods across the 261 RBP datasets. As depicted in this figure, we notice that HDRNet consistently outperformed other approaches. In particular, our model substantially enhanced the performance in identifying FMR1 and FXR2 binding sites in HEK293 cells; for instance, FMR1: PrismNet = 0.67 vs. HDRNet = 0.80; FXR2: PrismNet = 0.71 vs. HDRNet = 0.87. Moreover, we conducted additional analyses on the associated characteristics of FMR1 and FXR2, identifying their specific features (Supplementary Note 6, Supplementary Fig. 8). Meanwhile, we notice that HDRNet outperformed PRIESSTESS on almost all datasets. This discrepancy may arise from the fact that PRIESSTESS is highly dependent on the motif-extracting process in the initial stage, and may not be able to identify salient patterns across the datasets with insufficient binding patterns, resulting in a limited training feature set. Moreover, the size of the dataset also plays a crucial role in influencing the efficiency of motif recognition. When the dataset is relatively small, it can potentially result in inaccuracies or the failure to recognize motifs, consequently diminishing the predictive performance. In addition, the logistic regression model based on LASSO regularization within PRIESSTESS could yield suboptimal performance when confronted with nonlinear decision boundaries, while deep learning methodologies tend to exhibit more favorable outcomes. Although PRIESSTESS demonstrated suboptimal performance, its unique Motif extraction process could potentially enhance HDRNet, as illustrated in Supplementary Note 3 and Supplementary Figs. 4 and 5. In the violin plot of Fig. 2b, we observe that our proposed HDRNet performed more consistently and had better prediction performance on the majority of the datasets than the competing approaches. The reason for the improved performance may come not only from the self-supervised capability of the transformer that captures the global contextual and semantic information of the RNA sequences but also from the ability of the proposed network architecture to learn and transform long-range dependencies. We also provide the receiver operating characteristic (ROC) curve of the first four datasets that are plotted in Fig. 2c. The ROC curve analyses demonstrated that HDRNet had a higher true positive rate (TPR) compared to the other methods, indicating that HDRNet has a higher sensitivity for identifying RBP binding sites. Moreover, we also identified existing sub-groups of binding events that are better characterized by HDRNet, as discussed in Supplementary Note 5 and Supplementary Fig. 7. In addition, we used the t-SNE clustering method to analyze the validity of the output feature of HDRNet. As shown in Fig. 2d, we clearly observe that HDRNet provided the best clustering results compared with the other baseline methods, where the positive (RNA fragment that is an RBP binding site) and negative (RNA fragment that is not an RBP binding site) samples are separated, demonstrating the superior feature extraction capability of HDRNet. In summary, these results indicate the high effectiveness and feasibility of the proposed HDRNet.

**Fig. 2: HDRNet predicts RBP binding events more accurately than other state-of-the-art methods.**

New insights by characterizing RNA binding events between different cell lines in a dynamic manner

The binding of RBPs is influenced by different cellular environments, and therefore, this binding is expected to be dynamic in diverse cell lines. BERT extracts the dynamic semantic information from RNA sequences globally, and is capable of revealing different RNA-protein interactions in particular cellular conditions. We evaluated the performance of HDRNet in predicting dynamic RNA-protein interactions on 62 RBP datasets obtained from K562 and HepG2 cell lines.

Specifically, we trained HDRNet on the RBP datasets of the K562 cell line. We then used the corresponding RNA sequence information generated by BERT and in vivo secondary structure information to predict the RBP binding sites in the HepG2 cell line, which was performed as an independent test set. Subsequently, we trained HDRNet on the HepG2 cell line data, and used data on K562 cell line as an independent test set. We then compared the dynamic prediction performance of HDRNet with other state-of-the-art deep learning methods, including PrismNet, PRIESSTESS, iDeep, DMSK, GraphProt, and Deepbind. Figure 3a provides the heatmap of the dynamic prediction results, from which we can see that the proposed HDRNet outperformed the other methods for both K562 and HepG2 predictions while Deepbind performed the worst across the majority of datasets, with either the lowest AUC value or being incapable of making dynamic predictions. When predicting the RBP sites in K562 cells after training on HepG2 cells, the average AUC result of HDRNet was 0.79, which is 4% higher than PrismNet. In particular, the AUC results of RBM15 and XRN2 RBPs are 12% and 9% higher than PrismNet, respectively. Similarly, predicting HepG2 cells binding sites after training on K562 cells, the average AUC result of HDRNet was also 4% better than PrismNet. The results for RBM15, SF3B4 and SLTM also showed improvement over PrismNet, with the performance gains of 9%, 11% and 9%, respectively. Meanwhile, we also notice that HDRNet substantially outperformed PRIESSTESS for the dynamic tasks with higher AUC values while PRIESSTESS outperformed Deepbind, GraphProt, iDeep and DMSK in terms of dynamic prediction. Figure 3b shows the predicted vs observed binding sites of HNRNPA1 on the MT-ND5 transcript. According to eCLIP, the MT-ND5 transcript contains 11 HNRNPA1 binding sites in HepG2 cells, and 6 binding sites in K562 cells. We found that HDRNet correctly predicted all 11 binding sites within the MT-ND5 transcript in HepG2 cells with no false positives, by using the model trained on K562 cells. In contrast, DeepBind and GraphProt, correctly predicted only 2 of the 11 sites, and iDeep and DMSK correctly predicted 4 of the 11 sites, and PrismNet and PRIESSTESS correctly predicted 7 of the 11 binding sites. Figure 3c depicts the ROC curves of the first two datasets. Similar to the before AUC results, HDRNet had superior dynamic prediction than the other methods supported by better TPR values. Furthermore, we also used t-SNE to represent the significance of HDRNet features in dynamic prediction tasks. As shown in Fig. 3d, we see that the clustering results of HDRNet were still the best under the dynamic prediction task, illustrating the superior feature learning ability and robustness of HDRNet in cross-cell prediction. In addition, we compared HDRNet with seven machine-learning algorithms. As shown in Supplementary Note 1 and Supplementary Fig. 1, HDRNet exhibited better performance compared to these algorithms, in both static prediction and dynamic prediction tasks. Moreover, HDRNet had better performance for RBPs with high and low expression levels and for target RNA events with high and low expression levels in different cellular contexts, as illustrated in Supplementary Note 7 and Supplementary Figs. 9 and 10. In summary, the results indicate that the proposed dynamic contextual information representation scheme contains comprehensive features that are valuable for the identification of dynamic RBP binding sites.

**Fig. 3: HDRNet successfully performs dynamic RBP binding predictions in both K562 cells and HepG2 cells.**

HDRNet predicts dynamic RNA-protein interactions across tissues in normal and disease conditions and captures significant binding regions

We went on to explore the dynamic prediction capabilities of HDRNet between different in vivo tissue contexts, especially in normal and disease conditions. Specifically, we collected the MBNL2 (Muscleblind Like Splicing Regulator 2) binding data⁴⁴ (GEO accession: GSE68890) from POSTAR⁴⁵, which studied the regulation of MBNL proteins in human brain tissues. In particular, a total of 5 datasets were obtained from autospy tissues (hippocampus and frontal cortex) of patients with myotonic dystrophy type 1 (DM1, 2 datasets), myotonic dystrophy type 2 (DM2, 1 dataset of hippocampus), and control patients (2 datasets). HDRNet and the baseline models, including PrismNet, DMSK, iDeep, GraphProt, DeepBind, and PRIESSTESS were then trained on these datasets separately, and the trained models were tested on different tissues with different control context. As illustrated in Fig. 4a, HDRNet had the best performance in dynamic prediction across tissues comparing with baseline models. Moreover, we notice that PRIESSTESS failed to optimize the binding sites in the frontal cortex of DM1 dataset, and thus cannot perform dynamic prediction. In addition to dynamic prediction in the same control condition, we also observe that HDRNet demonstrated the capability of dynamic prediction between control context. For instance, as depicted in Fig. 4b, when predicting the DM1-frontal cortex RBP binding sites using the control-hippocampus model, HDRNet achieved the highest AUC of 0.8, whereas the other baseline methods only reached up to 0.7. This indicates that HDRNet could provide potential insights into disease-related biological analyses. Indeed, as revealed in refs. ^44,46, MBNL2 directly interacts with DM1 expanded CUG repeats and DM2 CCUG expansion RNAs in the brain, which functionally depletes the MBNL proteins. As illustrated in Fig. 4c and Supplementary Fig. 11a, by employing the SHAP tool⁴⁷ to extract high-attention dynamic semantic information, we discovered that HDRNet successfully captured these disease-related high-attention regions. For example, in both DM1 datasets, HDRNet highlights continuous segments of CUG expansions, while in the DM2 dataset, HDRNet also detected significant regions of continuous CCUG expansions. These findings further confirm the superior biological interpretability of HDRNet and its potential to provide theoretical support for pathological research.

**Fig. 4: HDRNet Predicts dynamic binding sites across tissues.**

Besides tissue-specific dynamic prediction, we asked whether the HDRNet model trained using cell line data is sufficient for predicting dynamic interactions in tissues. To clarify this, we retrieved two additional eCLIP RBP datasets from ENCODE, DGCR8, and HNRNPU, which were both derived from adrenal gland tissue. We then employed the models trained on K562 and HepG2 cell line data to validate these tissue RBP binding data. As expected, Supplementary Fig. 11b demonstrates that HDRNet performed best when predicting the dynamic binding in tissue data using the cell line-trained model in both of the newly retrieved datasets. Interestingly, we observed an improvement in the performance of PrismNet on the new eCLIP data compared to the previous MBNL2 data, albeit slightly inferior to HDRNet. We speculate that this improvement is due to the fact that the PrismNet model was proposed based on eCLIP data. However, its use of static encodings limits its performance on data from other platforms, which highlights the advantage of HDRNet. Moreover, HDRNet notably discerned the prominent binding domains of DGCR8 and HNRNPU, including the CGG-rich segment associated with DGCR8 and the G-quadruplex structure in the context of HNRNPU, as illustrated in Fig. 4d and Supplementary Fig. 11c. In addition, to further validate the robustness of HDRNet for dynamic prediction of RBP binding sites on different data platforms, we collected MBNL1 binding data from⁴⁶ that studied the direct regulatory targets of MBNL1 in brain, heart, muscle, and myoblasts from mice. We obtained a total of five datasets (GEO accession: GSE39911), including two from the brain, one from muscle, one from heart, and one from myoblasts (C2C12 cells). As illustrated in Fig. 4e and Supplementary Fig. 12a, HDRNet provided significant performance improvements compared to other baseline methods in each dynamic prediction task. Furthermore, HDRNet also highlighted the specific disease-related binding preferences similar to MBNL2, as depicted in Supplementary Fig. 12b. Overall, the experimental results and analyses presented above and in Supplementary Note 8 demonstrate that only HDRNet is capable of handling dynamic prediction tasks across different platforms and even different species, highlighting the remarkable effectiveness and robustness of HDRNet. Moreover, through the investigation of RBP data from diseases, we have observed that HDRNet is able to extract the salient binding regions associated with diseases, thereby providing new insights for subsequent pathological studies.

Validation of HDRNet from the contextual information and structural perspectives

We further discuss the superiority of HDRNet from the perspective of NLP methods and RNA secondary structure, respectively. We first conducted experiments comparing BERT with other existing static embedding methods, including Word2Vector⁴¹, Glove³⁵, Doc2Vector³⁶, FastText⁴⁸, and the One-Hot coding scheme. Briefly speaking, the difference between dynamic and static methods is that dynamic encoding generates different embeddings in different contexts, while the embedding of each token is fixed in the static encoding methods. Figure 5a depicts the consolidated results of static prediction on all 261 datasets using different feature descriptors. We notice that integrating BERT brought the best performance compared with the other static embedding methods, with an average AUC of 0.84 for HDRNet using BERT on the 261 RBP datasets, much higher than One-Hot (0.83), Word2Vec (0.80), Doc2Vector (0.80), Glove (0.76) and FastText (0.80). The main reason is likely due to the pre-training process and the multi-head self-attention mechanism; words consisting of multiple nucleotides contain more information about the word’s position and the connection between each nucleotide and its context. We then evaluated the performance of the dynamic embedding method and static embedding methods on the dynamic prediction task using HDRNet. First, to predict the RBP binding sites in K562 cells using HepG2 models, we observe in Fig. 5b that HDRNet using BERT as the feature descriptor performed the best of all the NLP methods, with an average AUC of 0.81, better than the other encoding schemes, One-Hot (0.71), W2V (0.75), D2V (0.74), Glove (0.75), and FastText (0.75). Indeed, we found that the static NLP methods do not provide better RNA-RBP binding recognition performance, also indicating that static coding schemes do not represent the contextual information of nucleotides. Second, we evaluated the performance of predicting binding sites in HepG2 cells using the K562-trained model. As indicated in Fig. 5b, BERT still achieved the highest performance (0.78) despite a decreased difference in results between BERT and static NLP methods. In addition, Fig. 5c visualizes the correlation heatmap of features generated by HDRNet using different NLP descriptors, where we observed that the features generated by HDRNet using BERT showed a stronger correlation than the other static descriptors, indicating that HDRNet effectively learns the contextual dependence of the RNA sequence via the dynamic contextual information generated by BERT. These evaluation results demonstrate the superior adaptability of the dynamic global contextual information generated by BERT, which contains rich context dependencies of nucleotide sequences in different cellular conditions, providing better adaptability in dynamic prediction tasks.

**Fig. 5: Validation Study of HDRNet on contextual and structure information.**

Then, to further investigate the advantage of adding in vivo secondary structure information in vivo, we conducted several experiments to dissect how RNA sequence and structural information contribute to the accurate prediction by HDRNet. Firstly, we evaluated the performance of HDRNet with or without the secondary structure information in static prediction tasks, respectively. As expected, Fig. 5d shows that the HDRNet model adding secondary structure features outperformed the model without structural information for almost all RBP binding prediction, indicating that RBPs incorporate structural preferences for binding recognition, and that secondary structure features enrich the static representation from a biological perspective leading to improved prediction results. In addition, we evaluated the impact of secondary structure information on the performance of dynamic binding site predictions. As depicted in Fig. 5e, we observed a significant reduction in accurate dynamic prediction when only sequence features were used as input data in HDRNet. Moreover, we extended our evaluations to include other in vivo or computationally predicted secondary structure features, and evaluated the performance of HDRNet using various combinations of these different structural feature descriptor. As illustrated in Supplementary Note 2 and Supplementary Figs. 2 and 3, the original HDRNet consistently demonstrated optimal performance, indicating that the structure is relevant to RBP-RNA binding and the robustness of HDRNet.

Evaluation of hyperparameter selection and ablation study

We explored the impact of several key parameters on HDRNet functioning. We first investigated the performance of different k-mer sizes in HDRNet, including 3bp to 6bp. As visualized in Fig. 6a, we note that the 3-mer model achieved the best AUC value of 0.875, surpassing the 4-mer, 5-mer, and 6-mer models, achieving 0.868, 0.862, and 0.869, respectively. We found surprisingly only a slight decrease in performance as the value of K increased, which is in contrast to the results of previous studies⁴⁹. We speculate that the reason for this is the powerful feature learning capability of HDRNet, able to extract important information from dynamic embeddings at different levels. After the Hierarchical Multi-scale Residual Network and before DPRBP, we also tried a different way to combine features by adding sequence features and structural features together instead of stringing them together. The experimental results are summarized in Fig. 6b, we observe that the concatenated features outperformed the summed features, and we speculate that this is because summing up two features destroys the feature construction learned by the hierarchical network, which leads to a decrease in performance.

Then, to assess the contribution of each component in our proposed deep network architecture, we ablated each component of HDRNet as follows: (1) We first removed the Hierarchical Multi-scale Residual Network and DPRBP and replaced them with a fully connected network, called HDRNet_DUSO; (2) We employed only the DPRBP stage of HDRNet for prediction, called HDRNet_DPRBP; (3) We tested HDRNet without the Hierarchical Multi-scale Residual Network, called HDRNet${}_{{N}_{{{{{{{{\rm{MVRS}}}}}}}}}}$; (4) We replaced the max-pooling layer of DPRBP in HDRNet with an average-pooling layer, called HDRNet_AVGpool. As depicted in Fig. 6c, the performance of HDRNet outperformed all the ablated frameworks, with an optimal AUC value of 0.88. Indeed, with a fully connected network, HDRNet_DUSO is unable to provide accurate predictions, indicating that DPRBP has significant feature extraction and prediction ability. Furthermore, using only DPRBP, the performance of HDRNet_DPRBP decreased by 4%, demonstrating that multiple deep learning mechanisms are effective in learning and integrating the underlying features of sequence and structure information. Moreover, the proposed HDRNet is higher than HDRNet${}_{{N}_{{{{{{{{\rm{MVRS}}}}}}}}}}$ without the Hierarchical Multi-scale Residual Network (0.86). In depth, to intuitively visualize the feature maps of HDRNet and HDRNet${}_{{N}_{{{{{{{{\rm{MVRS}}}}}}}}}}$, the correlation heatmaps are depicted in Fig. 6d. We can clearly observe that there was a significant positive and negative correlation in the Hierarchical Multi-Scale Resnet output, allowing us to accurately identify RNA-RBP binding. Additionally, we highlighted the advantages of the hierarchical structure, as discussed in Supplementary Note 4 and depicted in Supplementary Fig. 6, demonstrating the necessity of the Hierarchical Multi-scale Residual Network.

Furthermore, after each DPRBP block, the sequence length is halved using pooling layers that preserve the most important features. We conducted an experiment to investigate the impact of various pooling layers, employing either the maximum pooling layer or the average pooling layer. Figure 6c shows that the average AUC of HDRNet with the maximum pooling layer and the average pooling layer is 0.88 and 0.86, respectively. To further demonstrate the influence of the maximum pooling layer, we projected the embedding layer into two dimensions using t-SNE and annotated them with the true labels in order to display the features extracted by the two pooling layers of HDRNet as summarized in Fig. 6e. We can clearly see that the positive and negative samples were separated, and the clustering effect of HDRNet using the maximum pool was clearly better than that using the average pooling. A potential reason could be that the maximum pooling layer always selects the features of the most prominent tokens, resulting in the most prominent features being kept until the end, whereas the average pooling layer fuses the features of each token together, resulting in the loss of important information. In conclusion, the results indicate the validity and reasonableness of each HDRNet component.

HDRNet’s attenations reveal biologically meaningful interpretable learning patterns and motif inferences

The self-attentive mechanism can capture critical RNA sequence fragments, hence enhancing the ability to recognize motifs⁵⁰. To identify motifs by our HDRNet, we put all of the RNA sequences through BERT’s fine-tuning process and generated an attention vector for each sequence. Then, we used the attention vectors to find consecutive high-attention segments using sliding windows and aligned related subregions as the final binding motifs. Once all high-attention sequence fragments were recovered, their nucleotide occurrences were counted and transformed into position weight matrices (PWMs). As visualized in Fig. 7a, a total of 172 motifs were found that matched the known RNA-binding motifs of RBPs. Interestingly, RBPs involved in the same RNA regulatory pathways were generally grouped together via hierarchical clustering. We then utilized the TOMTOM tool⁵¹ in the MEME Suite to match the authenticated motifs in the ATtRACT database with the motifs discovered by the transformers. As can be seen in Supplementary Table 1, the extracted motifs from the transformers are very comparable to known motifs. We also showed that the dynamic contextual embedding takes into account the location of global words and supports out-of-vocabulary words by implementing a multi-headed self-focus mechanism, thus flexibly adapting to multiple cases of motif pattern extraction.

**Fig. 7: Interpretation study of HDRNet.**

Moreover, the SHAP tool⁴⁷ was employed to extract high-attention dynamic semantic information. We found unexpectedly that HDRNet captured the binding peaks associated with the extracted motifs and identified the structural preferences of the binding events. Taking the input sequence of RBP TIA1 in Hela cells as an example, Supplementary Fig. 13 reflects the impact of each token of the input dynamic contextual feature and the icSHAPE structural information, where HDRNet successfully captured the poly-U binding motifs with structural preference of single-stranded, which is in line with the confirmations of earlier investigations³² (Supplementary Note 10). In addition, HDRNet can successfully identify specific binding events in different cellular conditions. The saliency maps of the dynamic prediction of RBP LIN28B are presented in Fig. 7b, where the top strip plots the potential binding motifs; the second strip is the heatmap of the sequence attention scores; the third strip indicates the specific sequences; and the bottom strip contains the icSHAPE scores represented as a line plot and structural attention heat-map. From this we understand that HDRNet was capable of identifying dynamic binding events of the same transcript in different cells, as depicted in Fig. 7b, showing that the binding probability of the local region is 0.948 in HepG2 cells and 0.002 in K562 cells, respectively, indicating the possible existence of diverse gene expression in the different cells. Then, as shown in Fig. 7c, we used the HepG2 model to scan the K562 data, and we observed that the LIN28B binding fragment with high attention was highlighted by HDRNet (GAGAAGA). Similarly, we used the K562 model to scan HepG2 data and we obtained the same binding peak as with the K562 data. Moreover, we noticed that LIN28B RBP shares the same structural binding preference (single-stranded) in both K562 and HepG2 cells, since the icSHAPE score of the obtained binding peaks regions was >0.8. These results demonstrate the potential of HDRNet for biological interpretability tasks. We also evaluated the advantages of adopting the dynamic global contextual embedding, by exploring the distribution of attention weights in BERT from HDRNet. As shown in Supplementary Note 11 and Supplementary Fig. 14, we see that the selected tokens did not decay noticeably with increasing distance, indicating that BERT successfully learned and preserved the long-distance dependencies and short-distance context of the sequence.

In addition, we explored the contribution of the refined in-vivo biological features and the dynamic contextual information for RNA-RBP binding event identification. The analyses results are depicted in Fig. 7d, e. First, we extracted the output feature matrix of each phase of HDRNet during the training process and projected it onto a two-dimensional space using t-SNE to better explain the learning process of HDRNet. As shown in Fig. 7d, the first subplot represents the t-SNE clustering results of the output features of the dimensionality reduction and up-sampling network, where the prototype of each cluster appeared; the second subplot displays the t-SNE results of the output features of the multi-scale ResNet, where the features have a regular distribution. The last subplot reveals the t-SNE results after processing by DPCNN, where we can clearly observe a distinct clustering result, demonstrating the robustness of the HDRNet architecture. Figure 7e depicts the impact of dynamic contextual information of RNA sequence and icSHAPE structural features on identification of RNA-RBP binding events after the learning of the deep neural network in HDRNet, where the higher SHAP values denote that the particular feature plays a greater role in the final prediction decision, namely the high-attention region. From Fig. 7e, it is evident that the regions of high attention for both dynamic contextual information and secondary structure information were located in the same continuous tokens, forming a binding peak that may represent the final binding point. In addition, we observed that the dynamic contextual information was given greater weight with higher SHAP values than structural information, indicating that RNA sequence information is more relevant than RNA secondary structure information for predicting RNA-RBP interaction events. On this basis, we confirmed the effectiveness of the suggested deep network architecture and the possible biological interpretability of HDRNet.

Identification and visualization of TDP-43 binding patterns under potential genetic variants

Genetic variants (GVs), primarily including Single-Nucleotide Polymorphisms (SNPs) and Single-Nucleotide Variations (SNVs), are permanent changes in the nucleotides of DNA sequence that makes up a gene. The alternative alleles of a GV may confer different binding specificity to an RBP since each RBP has its own sequence specificity, implying that GVs may disrupt the identification of RNA substrates by the RBP, resulting in allele-specific functional consequences and leading to severe diseases⁵².

TARDBP, or TDP-43 (TAR DNA binding protein 43) protein that binds to DNA and RNA, plays an important function in intracellular RNA transcription, selective shearing, and regulation of mRNA stability⁵³. Many studies have demonstrated that TDP-43 is associated with cancers⁵⁴ and severe neurological disorders such as epilepsy, amyotrophic lateral sclerosis (ALS), frontotemporal dementia (FTLD) and Alzheimer’s disease (AD)^55,56,57,58. To further elaborate the relationship between TDP-43 binding properties and disease-related genetic variants, we first used HDRNet to scan the TDP-43 dataset and obtained a total of 10589 TDP-43 binding sites (5231 binding sites in K562 cells and 5298 binding sites in HEK293 cells, respectively). By comparing with the dbSNP⁵⁹ and COSMIC databases⁶⁰, we found that the TDP-43 binding sites were enriched with a substantial number of SNP and SNV mutant alleles, especially in the high-attention regions identified by HDRNet. To elucidate the correlation between diseases and potential genetic variants identified by HDRNet, we conducted experiments to observe binding behavior after applying a transformation to the variant alleles. As shown in Fig. 8 and Supplementary Fig. 15, we found a decreasing trend of TDP-43 binding to RNA in most of the transcripts, where alteration in the variant resulted in significant changes of binding events in the high-attention region; for example, in transcript ENST00000547986 from K562 cells shown in Fig. 8a, a mutation of the poly-U binding peak at position 94,971,529 on chromosome 12 resulted in a significant reduction of binding in this local region, leading to a risk of breast and lung tumor; in transcript ENST00000533549 of K562 cells shown in Fig. 8c, the SNV mutation associated with large intestine tumor also affects the TDP-43 binding event in the adjacent region, suggesting that this genetic variant may affect gene regulation and thus lead to possible disease pathogenesis. Notably, we observed that nucleotide mutations lead to potential RNA structural changes, as demonstrated in Supplementary Note 13 and Supplementary Figs. 16-17. In addition, as depicted in Fig. 8f in transcript ENST00000533549 of HEK293 cells, intron variant located at chr10:100517088 also reduced the binding tendency and is also associated with large intestine tumor.

**Fig. 8: Identification and visualization of potential disease-causing genomic variants.**

Notably, we show that the dynamic binding events of Fig. 8c, f located in the NDUFB8 transcript were highly associated with the large intestine cancer, indicating that the disruption of such dynamic RNA-RBP interactions are potential links with carcinogenesis. In summary, we can conclude that these analyses validate the biological interpretability of HDRNet to identify potential genetic variants and reveal underlying associations of the RBP binding sites with human diseases.

Characterizing human disease-associated RNA and protein interactors with TDP-43 from a transcriptomic angle

After examining the latent relationship between RBP binding sites and human disease, we continued to study the link between RBP dynamic binding and human disease with a view of functional genomic and to find possible medical interventions from a transcriptomic angle. After obtaining TDP-43 binding sites, we mapped the binding transcript IDs to gene symbols and defined these genes as TDP-43-binding genes. In particular, the shared mapped binding sites were defined as dynamic binding genes. Firstly, we examined the interrelationships between the TDP-43 binding sites in K562 and HEK293 cells. Among the identified binding sites, 1063 binding genes (BGs) for K562 cells and 1894 BGs for HEK293 cells were recoginzed. The Venn diagram⁶¹ in Fig. 9a shows that the two cell lines shared 544 TDP-43 BGs. Then, we used Enrichr⁶² to perform a gene ontology enrichment analysis on the 544 TDP-43 binding genes of K562 and HEK293 cells. This helped us to understand what they might mean biologically and which pathways were enriched. Then, we used the GO database as an annotation source to annotate three types of gene ontology analysis: biological processes, molecular function, and cellular components. The top 10 terms of these three categories based on p-value are depicted in Fig. 9b. For the biological processes, cytoplasmic translation (41 genes) and gene expression (68 genes) were among the top GO terms. Indeed, it has been disclosed that in ALS motor neurons, especially neurons with mislocalized TDP-43, the amount of TARDBP mRNA is increased in the cytoplasm⁶³, and is involved in other cellular processes such as microRNA biogenesis, apoptosis, and cell division⁶⁴. In the molecular function experiment, we saw RNA binding (220 genes) and mRNA binding (55 genes) were the two top GO terms. It is known that RNA binding proteins (RBPs) are highly associated with neurological diseases⁶⁵, corroborating the crucial need for studying and predicting RNA-RBP interactions. The nucleus (265 genes) and intracellular membrane-bounded organelles (283 genes) were the top GO terms for cellular components. Previous research has demonstrated a significant loss of neurons within the lateral part of the pedunculopontine nucleus in individuals with idiopathic Parkinson’s disease and in individuals with combined Parkinson’s and Alzheimer’s diseases⁶⁶, revealing the association between the nucleus and the neurological disease. In addition, variations in intracellular membrane-bounded organelles are likewise a potential cause of neurological disorders^67,68.

**Fig. 9: Transcriptomic analysis of human disease-associated RNAs and proteins interacting with TDP-43.**

Further, we performed KEGG pathway analysis on the 544 shared TARDBP binding genes between K562 cells and HEK293 cells, revealing that the organism is capable of reacting to inherent modifications. KEGG pathway analysis can demonstrate the interaction between various diseases through basic molecular or biological processes⁶⁹. The most significant pathways of the genes in K562 cells and HEK293 cells of the TARDBP binding sites evaluated by p-value are summarized in Fig. 9b. Among the top 10 KEGG human pathways listed, we observe that they were highly associated with neurological diseases, including Parkinson’s disease (PKD)⁷⁰, Amyotrophic lateral sclerosis (ALS)⁵⁶, Huntington’s Disease (HD)⁷¹, Prion Disease (PD)⁷², and Alzheimer’s disease (AD)⁷³.

In another context, we fed the TARDBP binding genes into the STRING⁷⁴ to build a protein–protein interaction (PPI) network to visualize the interactions and adhesion pathway. As depicted in Supplementary Fig. 18 and Supplementary Note 14, a total of 537 nodes and 1851 edges were obtained, where neurological disease-related genes are highlighted (PKD - red, ALS - blue, HD - green, PD - yellow, AD - pink, a total of 68 genes, see Supplementary Figs. 18 and 19), and we observed that these genes are highly correlated and clustered. Figure 9c depicts the most significant disease-related genes, including a total of 28 genes. Then, the Molecular Complex Detection (Mcode) plugin⁷⁵ in Cytoscape was adopted to identify the most important modules, as shown in Fig. 9d. Further, the top three hub genes were selected by the Cytohubba plugin in Cytoscape⁷⁶ using the MCC method⁷⁶. As shown in Fig. 9e, the top three hub genes were NDUFA12, NDUFB9 and NDUFB8, and mutations in these genes are new causes of complex I deficiency^77,78,79, which is known to be associated with Parkinson’s disease⁸⁰. Moreover, focusing on these top hub TARDBP binding genes, we performed the pan-cancer analyses to characterize the differential expression of these genes in various cancers using TIMER⁸¹ (Supplementary Note 15). As depicted in Supplementary Fig. 20, we observe that the human disease-related TDP-43 binding genes were also significantly differentially expressed between tumors and normal tissues in human cancers. Notably, some of the cancers identified (Breast Cancer (BRCA), Lung Squamous Cell Carcinoma (LUSC), Lung Adenocarcinoma (LUAD) and Colon adenocarcinoma (COAD) were consistent with those we found in RBP binding site mutations, thus corroborating the potential association of RBPs with cancer. In addition, Supplementary Fig. 21 in Supplementary Note 16 depicts the hub gene-disease association network, where we found a possible link between liver tumors and subunits of NADH dehydrogenase. Supplementary Figs. 22–23 in Supplementary Note 17 showed the interaction network of TFs and miRNAs and the disease-related genes, and we indeed observed diseases related to the nervous system. Based on the identified disease-linked binding genes, we finally investigated possible drug molecules for treatment of neurological disorders, as tabulated in Supplementary Table 2 and Supplementary Note 18.

Discussion

Cross-linked immunoprecipitation sequencing technology enables the high-throughput measurements of RNA-binding protein (RBP) binding patterns at the transcript level, accounting for the dynamic cellular conditions. However, current computational methods have limitations in considering the diversity of cellular conditions, which poses a significant challenge for predicting the cross-prediction events between RNAs and proteins in different cells. Here, we proposed an end-to-end deep learning-based framework to precisely predict dynamic RBP binding events across diverse cellular conditions. Specifically, we adopted multi-source biological information including the dynamic global contextual embedding and the in-vivo RNA secondary structure profile to characterize both the sequence and structural properties of RNA. Then, a unified alignment of multi-source feature representation was employed to generate potential feature maps with dimensional homogeneity for both biological representation. After that, we proposed a hierarchical multi-scale residual network to comprehend the contextual dependencies between the nucleotides and their structures at varying distances. Finally, a deep protein-RNA binding predictor with a multi-layer feature selector module was developed to learn and select the underlying key nucleotides tokens and employed a sigmoid function to forecast the binding of the proteins to RNA sequences.

We compared HDRNet with five RBP prediction methods in static cellular protein-RNA interaction identification. The experimental results showed that HDRNet had the best prediction performance, outperforming all compared methods on most benchmark datasets. In particular, HDRNet was able to efficiently learn implicit representations of RNA sequences and structural information. Meanwhile, HDRNet brought new insights by characterizing RNA binding events between different cell lines in a dynamic manner. Specifically, we trained HDRNet on the RBP dataset of K562 or HepG2 cells, and then predicted the RBP binding sites in the other cell line as an independent test set. We compared the dynamic prediction performance of HDRNet with the 5 benchmark methods, and found that HDRNet outperformed other methods on both K562 and HepG2 cells. HDRNet successfully predicted all binding sites of RBP HNRNPA1 on the MT-ND5 transcript with no false positives. Moreover, HDRNet pioneeringly accomplished cross-tissue dynamic prediction tasks and successfully highlighted the significant binding regions, illustrating the robustness of HDRNet in dynamic prediction.

In addition, HDRNet can also elucidate the underlying binding motifs from the dynamic global contextual embedding. We obtained a total of 172 motifs and discovered that RBPs involved in the RNA regulatory pathways were generally grouped together via hierarchical clustering. In addition, we found that HDRNet can detect the specific binding-peak by capturing the attention of the input data, thus enabling the identification of specific binding events in different cellular conditions. Then, through comparison with the genomic variants dataset, we identified that HDRNet captured high attention regions enriched with mutable alleles, and that alterations in these variants had significant changes in binding events in these high attention regions, revealing a potential link between RBP binding sites and human diseases. We also investigated the interrelationships between RBP binding sites and human diseases from a transcriptomic perspective by mapping the binding sites onto gene symbols and performing a series of genomic analyses, and provided possible drug molecules for disease treatment.

In summary, HDRNet discards the traditional RNA sequence representation and uses multi-source biological information to characterize the binding patterns of RNA binding proteins. As a new deep-learning method, HDRNet can simultaneously perform both static and dynamic RBP binding prediction, potential feature extraction, binding-peak identification, and interpretability analyses.

Methods

Data processing

For each RBP dataset, the resulting peaks were defined as binding sites, and the top 5000 binding sites with the most confident peaks were reserved for training and test sets as positive samples³². Among these, the length of each binding site was fixed at 101 nt while a region shorter than 101 nt was extended from the middle to both sides and a region longer than 101 nt was cut off from both sides. In addition, 10000 negative samples were generated by randomly selecting 101 nt from the whole transcriptome. Then, we labeled the positive samples as ’1’ and negative samples as ’0’. For each RBP dataset, we constructed a HDRNet model for each cell line separately. During the training of each RBP dataset, 20% of the samples were randomly selected as an independent test set to evaluate the performance of the model. Among the remaining 80% of the training samples, 20% of the training samples were randomly selected as the validated set to optimize the model parameters, while the rest of the data was considered as the training set. For convenience, we have provided all datasets and the BERT model on the website https://doi.org/10.6084/m9.figshare.24132423.

In-vivo RNA secondary structure representation

In general, the function of RNA is closely related to its complex folded secondary and tertiary structures. In addition, RNA secondary structure motifs are essential for the regulation of several biological procedures, such as protein binding, subcellular localization, and RNA decay⁸². Therefore, it is essential to effectively identify RNA secondary structure features for precise RBP binding prediction. Although computational methods of structural analysis can also yield predictions of RNA structure from sequences with a certain degree of accuracy, the predicted RNA structures do not reflect the real situation under dynamic cellular conditions, which greatly limits dynamic RBP prediction. In vivo click selective 29-hydroxyl acylation and profiling experiment icSHAPE⁸³ pipeline allows for the generation of RNA structure profiles, providing insights into the dynamic nature of RNA structure across the entire transcriptome from in vivo experiments, enabling the global and accurate characterization of the relationship between RNA structure and RNA binding protein (RBP) interactions^84,85. In particular, icSHAPE is a chemical modification-based approach, which is known for providing the natural reflection on RNA folding.⁸². Lastly, another key factor is the widespread acceptance and usage within the research community of icSHAPE^84,85, and this tool has garnered significant support and is backed by high-quality data. Thus, in our study, we adopted the icSHAPE technique to characterize the in vivo RNA secondary structure in different cellular conditions.

Briefly, the icSHAPE structure score R is calculated based on reverse transcription (RT) counts and polymeric sequencing coverage (base density, BD) on individual bases, where each mapped read contributes an RT value for the first base upstream of the starting point of the mapping and a BD value for all bases covered. For each sequence, the window of size wSize is initialized in the 5’ direction and slid to the 3’ direction with window step wStep. Indeed, the RT and BD between two replicates of the DMSO library and the NAI library of each base are combined by direct addition as follows:

$${r}_{i}^{C}={r}_{i}^{C1}+{r}_{i}^{C2}$$

(1)

$${r}_{i}^{T}={r}_{i}^{T1}+{r}_{i}^{T2}$$

(2)

$${b}_{i}^{C}={b}_{i}^{C1}+{b}_{i}^{C2}$$

(3)

$$1\le i\le wSize$$

(4)

where ${r}_{i}^{C}$ is the RT of the DMSO library, ${r}_{i}^{T}$ is the RT of the NAI library, and ${b}_{i}^{C}$ is the BD of the DMSO library. Then, these values are normalized by dividing by the average of the previous 90% to 95% values:

$${r}_{i}^{C}={r}_{i}^{C}/{r}_{q95}^{C}$$

(5)

$${r}_{i}^{T}={r}_{i}^{T}/{r}_{q95}^{T}$$

(6)

$${b}_{i}^{C}={b}_{i}^{C}/{b}_{q95}^{C}$$

(7)

$$1\le i\le wSize;$$

(8)

where ${r}_{q95}^{C}$, ${r}_{q95}^{T}$, and ${b}_{q95}^{C}$ are the calculated normalization factors for each replicate. On this basis, the enrichment signal is computed as follows:

$${e}_{i}=\frac{{r}_{i}^{T}-\alpha \times {r}_{i}^{C}}{{b}_{i}^{C}}$$

(9)

where α is the subtraction factor to measure the effect of background noise from the DMSO sample in the signal of the NAI-N3 sample. Finally, the resulting icSHAPE score is normalized to [0, 1] as below:

$${R}_{i}=\left\{\begin{array}{l}\min \left(1,\max \left(0,\frac{{e}_{i}-{e}_{q5}}{{e}_{q95}-{e}_{q5}}\right)\right),\quad {b}_{i}^{C} \, \ge \, 200 \\ \quad \quad \quad NULL,\qquad \quad \qquad \quad{b}_{i}^{C} < \, 200\end{array}\right.$$

(10)

where e_q5 is the bottom 5% signal value and e_q95 is the top 95% value. In the final icSHAPE score, it is noteworthy that only nucleotides with more than 200x coverage are considered as effective quality control. Accordingly, we process the RNA sequences using icSHAPE, and each RNA fragment generates an equal-length vector of real numbers representing secondary structural features as the input of the identification of RNA binding events in various cellular conditions.

Dynamic global contextual embedding

To characterize the sequence information of RNA, traditional one-hot encoding methods for RNA sequence information characterization focus solely on the sequential alignment of nucleotides, neglecting the biological properties and contextual information of RNA sequences, thus restricting the performance of the model. We thus adopted the Bidirectional Encoder Representations from Transformers (BERT) model⁴³, a self-attention-based language representation model, within the HDRNet framework to generate the dynamic global contextual information for RNA sequences in diverse cellular conditions. Benefitting from the multi-head self-attention mechanism, BERT has been widely used for capturing the long-range dependencies between tokens in sequential data (discussed in Supplementary Note 19). Inspired by ref. ⁵⁰, we first converted all the input data into k-mer tokens. For instance, given an RNA sequence ’ACGUGA’, we can obtain fragments {ACG, CGU, GUG, UGA} after 3-mer processing. Then, we added two special tokens, [CLS] and [SEP], denoting the beginning and end of the sequence respectively, to each RNA sequence. Following this, all the input tokens were embedded into dynamic feature vectors of 768 dimensions, resulting in the construction of a feature matrix X for each sequence. To generate the weighted sum of the feature vectors across all tokens, we utilized a multi-head self-attention module. This module employs a multi-headed attention mechanism to capture the dependencies between tokens, resulting in a weighted sum of feature vectors as follows:

$${{{{{{{\bf{Output}}}}}}}}=\,{{\mbox{MultiHead}}}\,({{{{{{{\bf{Q}}}}}}}},\, {{{{{{{\bf{K}}}}}}}},\, {{{{{{{\bf{V}}}}}}}})=\,{{\mbox{Concat}}}\,({{{{{{{{\bf{head}}}}}}}}}_{1},\, ...,\, {{{{{{{{\bf{head}}}}}}}}}_{h}){{{{{{{{\bf{W}}}}}}}}}^{O}$$

(11)

$${{{{{{{{\bf{head}}}}}}}}}_{i}=\,{{\mbox{Attention}}}\,({{{{{{{\bf{Q}}}}}}}},\, {{{{{{{\bf{K}}}}}}}},\, {{{{{{{\bf{V}}}}}}}})$$

(12)

$$\,{{\mbox{Attention}}}\,({{{{{{{\bf{Q}}}}}}}},\, {{{{{{{\bf{K}}}}}}}},\, {{{{{{{\bf{V}}}}}}}})=\,{{\mbox{softmax}}}\,\left(\frac{{{{{{{{\bf{Q}}}}}}}}{{{{{{{{\bf{K}}}}}}}}}^{T}}{\sqrt{{d}_{k}}}\right)\cdot {{{{{{{\bf{V}}}}}}}}$$

(13)

$$\left\{\begin{array}{l}{{{{{{{\bf{Q}}}}}}}}={{{{{{{\bf{X}}}}}}}}\cdot {{{{{{{{\bf{W}}}}}}}}}_{i}^{Q}\\ {{{{{{{\bf{K}}}}}}}}={{{{{{{\bf{X}}}}}}}}\cdot {{{{{{{{\bf{W}}}}}}}}}_{i}^{K}\\ {{{{{{{\bf{V}}}}}}}}={{{{{{{\bf{X}}}}}}}}\cdot {{{{{{{{\bf{W}}}}}}}}}_{i}^{V}\end{array}\right.$$

(14)

where Q, K, V denote Query, Key and Value, respectively, which are projected through h diverse transformers encoder. $\sqrt{{d}_{k}}$ stands for the scaling factor to control the magnitude of the dot product. ${\{{{{{{{{{\bf{W}}}}}}}}}_{i}^{Q},\, {{{{{{{{\bf{W}}}}}}}}}_{i}^{K},\, {{{{{{{{\bf{W}}}}}}}}}_{i}^{V}\}}_{i=0}^{h}$ are trainable parameters matrices of Query, Key, and Value vectors of the i-th head, respectively, and W^O is the learnt weight matrix of the final linear projection of multi-heads. After L such transformation layers, the model is capable of learning rich global semantics and encoding tokens into dynamic embedding representations. On this basis, we extracted the hidden states from the final layer of the model as numerical representations and removed the special markers [CLS] and [SEP] that are added at the beginning and end of the sequence. Additionally, the BERT architecture used in our work has 12 transition layers, each of which has 768 hidden units and 12 self-attention heads, and the weights of the model were obtained from ref. ⁵⁰. This attention mechanism enables the model to dynamically focus on the crucial nucleotide in the sequences under different cellular conditions and capture the contributions of features at individual tokens to facilitate the final prediction.

Hierarchical deep neural networks

After obtaining multi-source biological characteristics of RNA sequences, an end-to-end hierarchical deep neural network was designed to simultaneously learn feature representation and identification via explicit modeling of those RBP binding data, as depicted in Fig. 1.

Unified alignment of multi-source feature representation

As expounded in the preceding section, we presented two distinct methods for characterizing RNA sequences. The first approach involved in vivo secondary structure representation produced by icSHAPE, while the second approach utilized dynamic global contextual sequence embedding generated by BERT. However, as the distributions of the two feature descriptors differ, a unified alignment model was developed to extend and extract structural and sequence features respectively, which can effectively capture global features of the in vivo secondary structure representation and local features of dynamic global contextual sequence embedding.

Considering the structural information, we first embedded the RNA sequences of the RNA-RBP binding datasets into a one-dimensional vector of size d, which is defined as S = [s₁, s₂, s₃, s₄, ⋯ , s_d], where s_i denotes the i-th in-vivo RNA secondary structure score. We can clearly observe that the structural feature vector has a small number of channels, which limits the global structural information contained in the in-vivo RNA secondary structure representation. To address this limitation, we first fed such one-dimensional vector into a CNN block to enrich the global dependence of the structural information, as follows:

$$\tilde{{{{{{{{\bf{S}}}}}}}}}={F}_{C}({{{{{{{\bf{S}}}}}}}})$$

(15)

$${F}_{C}({{{{{{{\bf{x}}}}}}}})=\,{{\mbox{ReLU}}}({{\mbox{BN}}}({{\mbox{Conv}}}\,({{{{{{{\bf{x}}}}}}}})))$$

(16)

$$\,{{\mbox{Conv}}}\,({{{{{{{\bf{x}}}}}}}})=\left\{\begin{array}{c}\mathop{\sum }\limits_{i=1}^{l} {{{{{{{{\bf{W}}}}}}}}}_{c,i} * {{{{{{{{\bf{x}}}}}}}}}_{n,\, i-{p}_{l}}\quad \quad \,{{\mbox{if}}}\,\quad {p}_{l} \, < \, i \, < \, {p}_{l}+l\\ 0\quad \quad {{{{{{{\rm{else}}}}}}}}\end{array}\right.$$

(17)

where F_C is the customized sequential CNN block designed to detect the local sensitive regions of RBP binding. Conv(x) denotes a 1D-convolutional layer with a learnable convolutional kernel W ∈ R^d of c channels and a padding p_l to ensure that the input and output sizes are identical. After that, to enhance the ability of the model to detect sensitive regions and to avoid overfitting, we add a batch normalization layer (BN)⁸⁶ after the convolution layer as follows:

$$\,{{\mbox{BN}}}\,({{{{{{{\bf{x}}}}}}}})=\gamma \left(\frac{x-E({{{{{{{\bf{x}}}}}}}})}{\sqrt{\,{{\mbox{Var}}}\,({{{{{{{\bf{x}}}}}}}})+\varepsilon }}\right)+\beta$$

(18)

where γ and β are learnable parameters in the batch normalization layer. E(x) and Var(x) represent the mini-batch average and variance, respectively, and ε is added for numerical stability. During the training process, the batch normalization layer continuously updates running estimates of the mean and variance, which are used for normalization during evaluation. These running estimates have a default momentum value of m=0.1. The current estimate $\hat{{{{{{{{\bf{x}}}}}}}}}$ is calculated as a combination of the normalized input x, the previous estimate ${\hat{{{{{{{{\bf{x}}}}}}}}}}_{p}$, and the momentum value m using the formula:

$${{{{{{{\bf{x}}}}}}}}=(1-m)\times {\hat{{{{{{{{\bf{x}}}}}}}}}}_{p}+m\times \hat{{{{{{{{\bf{x}}}}}}}}}$$

(19)

Then, to retain and enhance the learned RBP-sensitive regions and to prevent the gradient from vanishing, the Rectified Linear Unit activation function (ReLU)⁸⁷ was employed to efficiently transfer the gradient, thus accelerating the convergence of the model as follows:

$$\,{{\mbox{ReLU}}}\,(x)=\left\{\begin{array}{ll}x & {{{{{{{\rm{if}}}}}}}} \; \; x \, > \, 0 \hfill \\ 0 & {{{{{{{\rm{else}}}}}}}}\end{array}\right.$$

(20)

In line with the approach taken for structural features, we applied a similar procedure to the BERT features. However, unlike secondary structure features, BERT features possess a higher number of channels, which presents an issue of global information being overly rich, thus disregarding the local dependence of nucleotides. Therefore, for BERT features, we also utilized CNN modules with varying numbers of channels to extract and enrich the local information of the BERT features.

Hierarchical multi-scale residual network

To capture the hierarchical relationships between the nucleotides and their structure, we developed a hierarchical multi-scale residual network to capture the contextual dependencies between the nucleotides and their structure in an exhaustive manner. In comparison with traditional conventional convolutional neural networks (CNNs), residual neural networks (ResNet) have been shown to improve information flow and prevent the vanishing gradient problem, which often leads to overfitting in deep neural networks⁸⁸ (also discussed in Supplementary Note 19). As mentioned previously, the Unified Alignment of Multi-source Feature Representation was utilized to obtain the unified multi-source biological features. To further investigate the potential representations of these multi-source biological features, we constructed a hierarchical multi-scale CNN network with multiple sequential CNN modules, each of which has a different scale, to capture the potential different distance dependencies, which is represented as follows:

$$\,{{\mbox{MultiScale}}}\,({{{{{{{\bf{x}}}}}}}})=\left|{\tilde{O}}_{1}({{{{{{{\bf{x}}}}}}}}),\, {\tilde{O}}_{2}({{{{{{{\bf{x}}}}}}}}),\, {\tilde{O}}_{3}({{{{{{{\bf{x}}}}}}}}),\, {\tilde{O}}_{4}({{{{{{{\bf{x}}}}}}}})\right|$$

(21)

$${\tilde{O}}_{j}({{{{{{{\bf{x}}}}}}}})=\mathop{\prod}\limits_{j}\left({F}_{{C}_{j}}({{{{{{{\bf{x}}}}}}}})\right)$$

(22)

$${F}_{{C}_{j}}({{{{{{{\bf{x}}}}}}}})={F}_{j} * {{{{{{{{\bf{x}}}}}}}}}_{i+p-1}=\,{{\mbox{ReLU}}}\,\left(BN\left({{{{{{{{\bf{w}}}}}}}}}_{j}\cdot {{{{{{{{\bf{x}}}}}}}}}_{i+p-1}+b\right)\right.$$

(23)

$$j\in \{1,\, 2,\, 3,\, 4\}$$

(24)

where ∣∣ ⋅ ∣∣ denotes the concat operation, ${\tilde{O}}_{j}({{{{{{{\bf{x}}}}}}}})$ denotes the output of each scale network, F_C denotes a CNN block consisting of a one-dimensional convolutional layer, BN layer and ReLU activation, w is a learnable convolutional kernel, and b is a bias term. ∏_j denotes j sequential CNN blocks in scale ${\tilde{O}}_{j}({{{{{{{\bf{x}}}}}}}})$, each of which has the identical number of channels.

Through multi-scale learning, the model is able to efficiently learn the different contextual dependencies of structural features at different distances to capture their unique binding properties. After that, we employed a residual mechanism to fuse the uniformly aligned secondary structural or dynamic global contextual embedding features with their corresponding multiscale biological features to enhance their local information, which is formulated as follows:

$$\,{{\mbox{ResidualBlock}}}\,({{{{{{{\bf{x}}}}}}}})=\,{{\mbox{ReLU}}}\,({{{{{{{\bf{x}}}}}}}}+\,{{\mbox{MultiScale}}}\,({{{{{{{\bf{x}}}}}}}}))$$

(25)

Deep protein-RNA binding predictor

After applying the structural and sequence information to the hierarchical multi-scale residual network, we first fused the two enriched features by concatenating them to obtain a comprehensive feature matrix. However, the restriction on sequence length still makes it challenging to make precise predictions. To address this challenge, we developed the Deep protein-RNA binding predictor based on deep pyramid convolutional neural network⁸⁹, which is capable of capturing global dependence of long-distance nucleotides at the token-level resolution. In our deep protein-RNA binding predictor, we set the sequence length halved after each layer, which specifically carries out a max-pooling layer and produces a new internal representation of the RNA sequence by taking the token-wise maximum over three consecutive internal vectors, which shortens the sequence while maintaining all its information. Since the number of feature channels is fixed, the computational complexity is also halved after each layer as the sequence length is halved. Moreover, unlike the original DPCNN⁸⁹, in DPRBP we set up two independent CNN blocks instead of sharing their parameters to dynamically learn global context dependencies. The specific representation is as follows:

$$t({{{{{{{\bf{x}}}}}}}})={F}_{{C}_{1}}\left({F}_{{C}_{2}}({{{{{{{\bf{x}}}}}}}})\right)$$

(26)

$$r({{{{{{{\bf{x}}}}}}}})=\,{{\mbox{ReLU}}}\,(x+t({{{{{{{\bf{x}}}}}}}}))$$

(27)

where t denotes two consecutive distinct CNN blocks C₁ and C₂, which are capable of dynamically learning the spatial relationships between sequences and structures. In addition, a residual connection r was added to enhance the perception of DPRBP in the high-attention region of sequences and structures. Subsequently, a max-pooling layer of size 3 with a step size of 2 was utilized to identify the specific binding of RBPs from the token-level loci, as follows:

$$g({{{{{{{\bf{x}}}}}}}})=\,{{\mbox{MaxPool}}}\,(r({{{{{{{\bf{x}}}}}}}}))$$

(28)

Through this max-pooling layer, the length of the sequence is halved in such a way that the most significant nucleotides in the sequence are retained. Simultaneously, since each CNN layer has the same number of channels, the corresponding computational complexity is also halved. After ${\log }_{2}(L)$ layer, the length of the sequence is finally reduced to 1, and a feature vector h is generated that is enriched with the structure-sequence space relationship and contains the global information of the whole sequence, denoted as follows:

$${{{{{{{\bf{h}}}}}}}}=\,{{\mbox{ReLU}}}\,\left(\mathop{\prod}\limits_{{\log }_{2}l}\left({g}_{j}({{{{{{{\bf{x}}}}}}}})\right)\right)$$

(29)

Afterwards, to predict the interaction of RBP with RNA, the obtained h will be fed into a fully connected neural network:

$$P=\sigma \left({{{{{{{{\bf{W}}}}}}}}}_{d}\times {{{{{{{\bf{h}}}}}}}}\right)$$

(30)

$$\sigma (x)=\frac{1}{1+{e}^{-x}}$$

(31)

where d denotes the dimension of the vector h and σ(⋅) is the sigmoid activation function. Finally, the output P is considered as the probability of whether it is an RBP binding site or not.

Training of HDRNet

Generally, we trained an HDRNet model for identifying dynamic cellular RNA-binding events, including 261 RBP datasets for 172 human proteins. First, we used RNA sequences as the input data with BERT and icSHAPE secondary structures for multi-source feature representation. During the training process, we randomly selected 20% of the samples as the independent test set for performance evaluation, and the rest were considered as the training set, where 20% of the training set was randomly selected as the validation set. HDRNet was trained to learn the parameters that minimize the binary cross-entropy loss function:

$$\,{{\mbox{Loss}}}\,({{{{{{{\bf{P}}}}}}}},\, {{{{{{{\bf{Y}}}}}}}})=-\frac{1}{N}\mathop{\sum }\limits_{i=1}^{N}[{y}_{i} * \log {p}_{i}+(1-{y}_{i}) * \log (1-{p}_{i})]+\lambda | W{| }_{2}$$

(32)

where N denotes the mini batch size, y_i is the true label and p_i is the predicted binding probability. λ∣W∣₂ denotes the L₂ norm on all parameters, which acts as the weight decay term to reduce overfitting in the training model. In addition, to demonstrate the performance of our model, the area under the receiver operating characteristic curve (AUC) is chosen for performance evaluation.

Parameter settings

In the experimental section, we describe the implementation details of HDRNet. Sequence features obtained from BERT were first reduced in dimensionality using 128 one-dimensional convolutional kernels with a kernel size of 1. In addition, RNA secondary structure features were upsampled using 128 one-dimensional convolutional filters with a kernel size of 3. The resulting sequence and structural features were then separately fed into a Hierarchical multi-scale ResNet with identity maps for information enrichment, which contains one-dimensional convolutional layers with kernel sizes of 1, 3, 5 and 7 and CNN blocks of 1, 2, 3 and 4 in each view, totaling 32 channels. After concatenating the feature maps, they were passed through the deep protein-RNA binding predictor module, which had a kernel size of 5 and 256 channels, followed by a fully connected layer with one hidden unit and a Sigmoid activation function. All CNN layers were arranged in the order of Conv-BN-ReLU and had no bias. The model is trained on PyTorch 1.10 with parameters initialized using Kaiming initialization⁹⁰. To prevent overfitting, dropout layers with a rate of 0.3 were added after each activation layer. The network was trained end-to-end using the Adam optimizer with a batch size of 32, a learning rate of 0.001, and a weight decay of 1e-6. In addition, we employed a linearly scaled warm-up scheme to calibrate the learning rate to overcome optimization challenges in the early training. Our model was trained on an NVIDIA GeForce RTX 3090 GPU with 24 GB of memory and early stops were controlled by validation loss, and taking ~80 h on all 261 RBP datasets.

Competing methods

HDRNet was implemented in Python, and the core model was built on the PyTorch framework that is publicly available at https://github.com/zhuhr213/HDRNet. To elucidate the effectiveness of our proposed model, we compared HDRNet with several deep learning models and machine learning algorithms as follows:

PrismNet³² (https://github.com/kuixu/PrismNet) is a recent study that developed a convolutional neural network (CNN)-based deep learning approach, which effectively incorporates in vivo RNA structure data and RNA-binding protein (RBP) binding data to make precise predictions of RBP binding sites. This method applied an “attention” strategy to precisely identify RBP binding nucleotides. Notably, PrismNet is the first tool designed for dynamic prediction tasks.
PRIESSTESS³⁰ (https://github.com/kaitlin309/PRIESSTESS) introduces a universal RNA motif-finding/scanning strategy capable of identifying enriched RNA sequences and/or structure motifs. PRIESSTESS consists of two steps. The first step generates a large collection of enriched motifs encompassing both RNA sequence and structure. The second step produces an aggregate model, which combines the motif scores into a single value, and gauges the relative importance of each motif.
iDeep²⁸ (https://github.com/xypan1232/iDeep) is proposed and leverages a novel hybrid CNN network and deep belief network to predict the RBP interaction sites and motifs on RNAs by converting the original data into a high-level abstraction feature space using multiple layers of learning blocks, where the shared representations across different domains are integrated.
DMSK³⁹ (https://github.com/Rebecca3150/DMSK) is a novel identification method based on multi-view deep learning, subspace learning, and a multi-view classifier for the identification of circRNA-RBP interaction sites involved in computationally predicted RNA secondary structures.
GraphProt²⁹ (https://github.com/dmaticzka/GraphProt) models the sequence and structure binding preferences of RBP binding sites using graph kernel features based on sequence and computationally predicted structure information.
DeepBind²⁶ (https://github.com/jisraeli/DeepBind) is a CNN-based deep learning model that predicts RBP binding sites based on only RNA sequences.

In addition, we conducted a comprehensive analysis of machine learning methods to compare performance to our proposed HDRNet. Specifically, we evaluated XGBoost⁹¹, Random Forest (RF)⁹², Logistic Regression (LR)⁹³, Artificial Neural Network (ANN)⁹⁴, ExtraTreeClassifier (ETC)⁹⁵, SGDClassifier (SGDC)⁹⁶, and GaussianNB (GNB)⁹⁷ using the same feature representation as HDRNet, including the dynamic contextual embedding and the icSHAPE secondary structure information. For the ANN implementation, we used a Pytorch environment and set the size of the hidden layers to 512, 32, and 1, respectively. XGBoost was implemented using the Python package of version 1.5.0. The remaining algorithms were implemented using the scikit-learn package⁹⁸. Meanwhile, we applied the default parameter settings for Logistic Regression, Random Forest, and XGBoost.

Tissue data pre-processing

The raw data for tissue dataset is stored in the ’.bed’ format, which records the genomic positions of each binding peak. We employed the ’bedtools getfasta’ function or ’getFastaFromBed’ to obtain the corresponding nucleotide sequences based on the chromosomal coordinates. Specifically, for human tissue data, we utilized hg38 as the reference genome, while for mouse data, we employed mm9 as reference.

Motif calculation

The multi-head self-attention mechanism has the capacity to accurately identify and decode significant genomic regions. In our study, we leverage dynamic global contextual embedding to explore the biological functionalities of RNA sequences. More explicitly, we computed the attention score for the i-th embedded k-mer token, denoted as Score_i through the summation, thereby identifying the transcript fragment in each RNA sequence that offers the most substantial contribution towards downstream classification, which can be formulated as follows:

$$Scor{e}_{i}=\mathop{\sum }\limits_{n=1}^{N}\frac{\exp \left({{{{{{{{\bf{Q}}}}}}}}}_{CLS}^{T}\cdot {{{{{{{{\bf{K}}}}}}}}}_{i}/\sqrt{d}\right)}{{\sum }_{t=1}^{T}\exp \left({{{{{{{{\bf{Q}}}}}}}}}_{CLS}^{T}\cdot {{{{{{{{\bf{K}}}}}}}}}_{t}/\sqrt{d}\right)}$$

(33)

where N symbolizes the number of attention heads, T represents the number of tokens in a given RNA sequence, ${{{{{{{{\bf{Q}}}}}}}}}_{CLS}^{T}$ denotes the query vector of the special tokens [CLS], K_i denotes the key vector of the i-th k-mer token with i ∈ {1, 2, . . . , L} where L denotes the number of input tokens, and d is the dimensionality of the feature vectors. After computing the attention score for each token, we classified RNA fragments as RBP-binding motifs based on the following three criteria: (1) The attention score of the fragment exceeds the average score of the sequence; (2) The attention score of the fragment is 10 times higher than the lowest attention score; and (3) The minimum length of the fragment is 6.

High attention region visualization

We adopted the SHAP tool⁴⁷ to calculate the specific attention score for each token of the input RNA sequence. The ’GradientExplainer’ function was employed, where a score matrix of the same size as the input dynamic context matrix coule be obtained. Each numerical value in this matrix represents the attention value at the corresponding position in the input matrix. We then selected the maximum value of each token as the final attention score, and got an attention vector Att with length 99. Subsequently, we allocated the attention scores of each token evenly across every nucleotide, as demonstrated below:

$$Scor{e}_{i}=\left\{\begin{array}{r}{{{{{{{{\bf{Att}}}}}}}}}_{i},\, i=1\\ \frac{{{{{{{{{\bf{Att}}}}}}}}}_{i-1}+{{{{{{{{\bf{Att}}}}}}}}}_{i}}{2},\, i=2\\ \frac{{{{{{{{{\bf{Att}}}}}}}}}_{i-1}+{{{{{{{{\bf{Att}}}}}}}}}_{i-2}}{2},\, i=100\\ {{{{{{{{\bf{Att}}}}}}}}}_{i-1},\, i=101 \\ \frac{{{{{{{{{\bf{Att}}}}}}}}}_{i}+{{{{{{{{\bf{Att}}}}}}}}}_{i-1}+{{{{{{{{\bf{Att}}}}}}}}}_{i-2}}{3},\, else\end{array}\right.$$

(34)

where Score_i denotes the attention score of the i-th nucleotide, and i is in range [1, 101].

Functional enrichment

We used the R package ClusterProfiler⁹⁹ to perform KEGG and GO enrichment analysis on the 544 TDP-43 binding genes. We utilized the function ’enrichKEGG’ for the KEGG analysis, where the parameters were set to “pAdjustMethod = fdr, pvalueCutoff = 0.01, qvalueCutoff = 0.05". The GO analysis was performed by function ’enrichGO’, where the parameters were set to “ont = ALL, pAdjustMethod = BH, pvalueCutoff = 0.01, qvalueCutoff = 0.05".

PPI network

We performed preliminary PPI network construction on the 544 shared TDP-43 binding genes using STRING⁷⁴ and the most significant disease-related genes, including a total of 28 genes were input to Cytoscape for visualization. To identify the most important modules in the PPI network, we adopt MCODE⁷⁵ for the network and give the top three modules.

Statistics and reproducibility

The detailed statistical tests were explained in each figure legend. Sample data were obtained from public repositories. Sample size was not predetermined and is the maximum number of samples available for each datasets. No data were excluded from the analyses. No experimental groups were assigned. Our study does not involve group allocation that requires blinding. To reproduce the results, please find the Source Data file we provided.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

We collected 261 RBP binding sites datasets for cell lines across multiple databases, including 172 RBPs constructed using the same flag-marked technique in K562, HepG2, HEK293, HEK293T, HeLa and H9 cell lines. These datasets include 65 CLLP-seq datasets for 61 RBPs from POSTAR database⁴⁵ and 196 eCLIP datasets for 111 RBPs from the ENCODE project¹⁰⁰. These data have been deposited in³². In terms of the RBP binding data in tissues, the processed MBNL2 (Muscleblind Like Splicing Regulator 2) binding peak data in human brain tissues⁴⁴ were available in POSTAR database⁴⁵ under accession code GSE68890 [https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE68890]; the DGCR8 and HNRNPU binding data in human adrenal gland were collected from ENCODE project¹⁰⁰; and the MBNL1 (Muscleblind Like Splicing Regulator 1) binding data in brain, heart, muscle, and myoblasts from mice were obtained from⁴⁶ under accession code GSE39911 [https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE39911]. We have deposited these datasets in FigShare database [https://doi.org/10.6084/m9.figshare.24132423]¹⁰¹. These datasets can also be downloaded from our HDRNet web-server at http://www.aibio-lab.com:5050/. Source data are provided with this paper.

Code availability

HDRNet is an open-source tool available at https://github.com/zhuhr213/HDRNet¹⁰², where all packages are implemented in Python. We provide a user-friendly web server for the HDRNet method at http://www.aibio-lab.com:5050/, which enables users to determine whether a given RNA sequence is a binding site for an RNA-binding protein. Users can choose the precise RBP types and the cell lines by clicking on their corresponding buttons to complete the static and dynamic predictions based on their requirements. To facilitate use, users can enter the query RNA sequences in the input box or upload a text file containing RNA sequences in any format. The submitted jobs and predicted results, including the probability of the RNA sequence binding to the RBP, are then sent to the users’ provided contact addresses. In addition, we provide all datasets along with the BERT model used in this study, which can be downloaded directly from the web server and FigShare¹⁰¹.

References

Castello, A. et al. Insights into rna biology from an atlas of mammalian mrna-binding proteins. Cell 149, 1393–1406 (2012).
Article CAS PubMed Google Scholar
Gerstberger, S., Hafner, M. & Tuschl, T. A census of human rna-binding proteins. Nat. Rev. Genet. 15, 829–845 (2014).
Article CAS PubMed Google Scholar
Gebauer, F., Schwarzl, T., Valcárcel, J. & Hentze, M. W. Rna-binding proteins in human genetic disease. Nat. Rev. Genet. 22, 185–198 (2021).
Article CAS PubMed Google Scholar
Ellington, A. D. & Szostak, J. W. In vitro selection of rna molecules that bind specific ligands. Nature 346, 818–822 (1990).
Article ADS CAS PubMed Google Scholar
Ray, D. et al. Rapid and systematic analysis of the rna recognition specificities of rna-binding proteins. Nat. Biotechnol. 27, 667–670 (2009).
Article CAS PubMed Google Scholar
Lambert, N. et al. Rna bind-n-seq: quantitative assessment of the sequence and structural binding specificity of rna binding proteins. Mol. Cell 54, 887–900 (2014).
Article CAS PubMed PubMed Central Google Scholar
Gilbert, C. & Svejstrup, J. Q. Rna immunoprecipitation for determining rna-protein associations in vivo. Curr. Protoc. Mol. Biol. 75, 27–4 (2006).
Google Scholar
Van Nostrand, E. L. et al. Robust transcriptome-wide discovery of rna-binding protein binding sites with enhanced clip (eclip). Nat. Methods 13, 508–514 (2016).
Article PubMed PubMed Central Google Scholar
Hafner, M. et al. Transcriptome-wide identification of rna-binding protein and microrna target sites by par-clip. Cell 141, 129–141 (2010).
Article CAS PubMed PubMed Central Google Scholar
Licatalosi, D. D. et al. Hits-clip yields genome-wide insights into brain alternative rna processing. Nature 456, 464–469 (2008).
Article ADS CAS PubMed PubMed Central Google Scholar
Hansen, T. B., Venø, M. T., Damgaard, C. K. & Kjems, J. Comparison of circular rna prediction tools. Nucleic Acids Res. 44, e58–e58 (2016).
Article PubMed Google Scholar
Ule, J. et al. Clip identifies nova-regulated rna networks in the brain. Science 302, 1212–1215 (2003).
Article ADS CAS PubMed Google Scholar
Li, J.-H., Liu, S., Zhou, H., Qu, L.-H. & Yang, J.-H. starbase v2. 0: decoding mirna-cerna, mirna-ncrna and protein–rna interaction networks from large-scale clip-seq data. Nucleic Acids Res. 42, D92–D97 (2014).
Article CAS PubMed Google Scholar
Anders, G. et al. dorina: a database of rna interactions in post-transcriptional regulation. Nucleic Acids Res. 40, D180–D186 (2012).
Article CAS PubMed Google Scholar
Si, J., Cui, J., Cheng, J. & Wu, R. Computational prediction of rna-binding proteins and binding sites. Int. J. Mol. Sci. 16, 26303–26317 (2015).
Article CAS PubMed PubMed Central Google Scholar
Wei, J., Chen, S., Zong, L., Gao, X. & Li, Y. Protein–rna interaction prediction with deep learning: structure matters. Brief. Bioinform. 23, bbab540 (2022).
Article PubMed Google Scholar
Yan, J., Friedrich, S. & Kurgan, L. A comprehensive comparative review of sequence-based predictors of dna-and rna-binding residues. Brief. Bioinform. 17, 88–105 (2016).
Article CAS PubMed Google Scholar
Miao, Z. & Westhof, E. A large-scale assessment of nucleic acids binding site prediction programs. PLoS Comput. Biol. 11, e1004639 (2015).
Article ADS PubMed PubMed Central Google Scholar
Zhang, J. & Kurgan, L. Scriber: accurate and partner type-specific prediction of protein-binding residues from proteins sequences. Bioinformatics 35, i343–i353 (2019).
Article CAS PubMed PubMed Central Google Scholar
Liu, Y. et al. aprbind: protein–rna interface prediction by combining sequence and i-tasser model-based structural features learned with convolutional neural networks. Bioinformatics 37, 937–942 (2021).
Article PubMed Google Scholar
Yan, J. & Kurgan, L. Drnapred, fast sequence-based method that accurately predicts and discriminates dna-and rna-binding residues. Nucleic Acids Res. 45, e84–e84 (2017).
PubMed PubMed Central Google Scholar
Jiménez, J., Doerr, S., Martínez-Rosell, G., Rose, A. S. & De Fabritiis, G. Deepsite: protein-binding site predictor using 3d-convolutional neural networks. Bioinformatics 33, 3036–3042 (2017).
Article PubMed Google Scholar
Xia, Y., Xia, C.-Q., Pan, X. & Shen, H.-B. Graphbind: protein structural context embedded rules learned by hierarchical graph neural networks for recognizing nucleic-acid-binding residues. Nucleic Acids Res. 49, e51–e51 (2021).
Article CAS PubMed PubMed Central Google Scholar
Zhang, F., Zhao, B., Shi, W., Li, M. & Kurgan, L. Deepdisobind: accurate prediction of rna-, dna-and protein-binding intrinsically disordered residues with deep multi-task learning. Brief. Bioinform. 23, bbab521 (2022).
Article PubMed Google Scholar
Lam, J. H. et al. A deep learning framework to predict binding preference of rna constituents on protein surface. Nat. Commun. 10, 4941 (2019).
Article ADS PubMed PubMed Central Google Scholar
Alipanahi, B., Delong, A., Weirauch, M. T. & Frey, B. J. Predicting the sequence specificities of dna-and rna-binding proteins by deep learning. Nat. Biotechnol. 33, 831–838 (2015).
Article CAS PubMed Google Scholar
Ben-Bassat, I., Chor, B. & Orenstein, Y. A deep neural network approach for learning intrinsic protein-rna binding preferences. Bioinformatics 34, i638–i646 (2018).
Article CAS PubMed Google Scholar
Pan, X. & Shen, H.-B. Rna-protein binding motifs mining with a new hybrid deep learning based cross-domain knowledge integration approach. BMC Bioinform. 18, 1–14 (2017).
Article Google Scholar
Maticzka, D., Lange, S. J., Costa, F. & Backofen, R. Graphprot: modeling binding preferences of rna-binding proteins. Genome Biol. 15, 1–18 (2014).
Article Google Scholar
Laverty, K. U. et al. Priesstess: interpretable, high-performing models of the sequence and structure preferences of rna-binding proteins. Nucleic Acids Res. 50, e111–e111 (2022).
Article CAS PubMed PubMed Central Google Scholar
Freeberg, M. A. et al. Pervasive and dynamic protein binding sites of the mrna transcriptome in saccharomyces cerevisiae. Genome Biol. 14, 1–20 (2013).
Article Google Scholar
Sun, L. et al. Predicting dynamic cellular protein–rna interactions by deep learning using in vivo rna structures. Cell Res. 31, 495–516 (2021).
Article CAS PubMed PubMed Central Google Scholar
Spitale, R. C. et al. Structural imprints in vivo decode rna regulatory mechanisms. Nature 519, 486–490 (2015).
Article ADS CAS PubMed PubMed Central Google Scholar
Du, S., Lee, J., Tian, Y., Singh, A., and Poczos, B.. Gradient descent learns one-hidden-layer cnn: don’t be afraid of spurious local minima. In International Conference on Machine Learning, p. 1339–1348. PMLR (2018).
Pennington, J., Socher, R., and Manning, C. D. Glove: Global vectors for word representation. In Proceedings of the 2014 Conference On Empirical Methods In Natural Language Processing (EMNLP), p. 1532–1543 (2014).
Le, Q. and Mikolov, T. Distributed representations of sentences and documents. In International Conference On Machine Learning, p. 1188–1196. PMLR (2014).
Zhang, S. et al. A deep learning framework for modeling structural features of rna-binding protein targets. Nucleic Acids Res. 44, e32–e32 (2016).
Article PubMed Google Scholar
Yang, Y., Hou, Z., Ma, Z., Li, X. & Wong, K.-C. icircrbp-dhn: identification of circrna-rbp interaction sites using deep hierarchical network. Brief. Bioinform. 22, bbaa274 (2021).
Article PubMed Google Scholar
Li, H. et al. circrna-binding protein site prediction based on multi-view deep learning, subspace learning and multi-view classifier. Brief. Bioinform. 23, bbab394 (2022).
Article PubMed Google Scholar
Wang, Z. & Lei, X. Prediction of rbp binding sites on circrnas using an lstm-based deep sequence learning architecture. Brief. Bioinform. 22, bbab342 (2021).
Article PubMed Google Scholar
Mikolov, T., Chen, K., Corrado, G., and Dean, J. Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781 (2013).
Bojanowski, P., Grave, E., Joulin, A. & Mikolov, T. Enriching word vectors with subword information. Trans. Assoc. Comput. Linguist. 5, 135–146 (2017).
Article Google Scholar
Devlin, J., Chang, M.-W., Lee, K., & Toutanova, K. Bert: pre-training of deep bidirectional transformers for language understanding. Preprint at arXiv:1810.04805 (2018).
Goodwin, M. et al. Mbnl sequestration by toxic rnas and rna misprocessing in the myotonic dystrophy brain. Cell Rep. 12, 1159–1168 (2015).
Article CAS PubMed PubMed Central Google Scholar
Zhu, Y. et al. Postar2: deciphering the post-transcriptional regulatory logics. Nucleic Acids Res. 47, D203–D211 (2019).
Article CAS PubMed Google Scholar
Wang, E. T. et al. Transcriptome-wide regulation of pre-mrna splicing and mrna localization by muscleblind proteins. Cell 150, 710–724 (2012).
Article CAS PubMed PubMed Central Google Scholar
Lundberg, S. M. and Lee, S.-I. A unified approach to interpreting model predictions. Advances In Neural Information Processing Systems, 30 (2017).
Joulin, A., Grave, E., Bojanowski, P. & Mikolov, T. Bag of tricks for efficient text classification. In Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 2, Short Papers, 427–431 (Association for Computational Linguistics, Valencia, Spain, 2017).
Yang, Y. et al. Hcrnet: high-throughput circrna-binding event identification from clip-seq data using deep temporal convolutional network. Brief. Bioinform. 23, bbac027 (2022).
Ji, Y., Zhou, Z., Liu, H. & Davuluri, R. V. Dnabert: pre-trained bidirectional encoder representations from transformers model for dna-language in genome. Bioinformatics 37, 2112–2120 (2021).
Article CAS PubMed Google Scholar
Bailey, T. L. et al. Meme suite: tools for motif discovery and searching. Nucleic Acids Res. 37, W202–W208 (2009).
Article CAS PubMed PubMed Central Google Scholar
Yang, E.-W. et al. Allele-specific binding of rna-binding proteins reveals functional genetic variants in the rna. Nat. Commun. 10, 1338 (2019).
Article ADS PubMed PubMed Central Google Scholar
Tollervey, J. R. et al. Characterizing the rna targets and position-dependent splicing regulation by tdp-43. Nat. Neurosci. 14, 452–458 (2011).
Article CAS PubMed PubMed Central Google Scholar
Postel-Vinay, S. et al. Common variants near tardbp and egr2 are associated with susceptibility to ewing sarcoma. Nat. Genet. 44, 323–327 (2012).
Article CAS PubMed Google Scholar
White, M. A. et al. Tdp-43 gains function due to perturbed autoregulation in a tardbp knock-in mouse model of als-ftd. Nat. Neurosci. 21, 552–563 (2018).
Article CAS PubMed PubMed Central Google Scholar
Kabashi, E. et al. Tardbp mutations in individuals with sporadic and familial amyotrophic lateral sclerosis. Nat. Genet. 40, 572–574 (2008).
Article CAS PubMed Google Scholar
Van Deerlin, V. M. et al. Tardbp mutations in amyotrophic lateral sclerosis with tdp-43 neuropathology: a genetic and histopathological analysis. Lancet Neurol. 7, 409–416 (2008).
Article PubMed PubMed Central Google Scholar
Kabashi, E. et al. Gain and loss of function of als-related mutations of tardbp (tdp-43) cause motor deficits in vivo. Hum. Mol. Genet. 19, 671–683 (2010).
Article CAS PubMed Google Scholar
Sherry, S. T. et al. dbsnp: the ncbi database of genetic variation. Nucleic Acids Res. 29, 308–311 (2001).
Article CAS PubMed PubMed Central Google Scholar
Tate, J. G. et al. Cosmic: the catalogue of somatic mutations in cancer. Nucleic Acids Res. 47, D941–D947 (2019).
Article CAS PubMed Google Scholar
Bardou, P., Mariette, J., Escudié, F., Djemiel, C. & Klopp, C. jvenn: an interactive venn diagram viewer. BMC Bioinform. 15, 1–7 (2014).
Article Google Scholar
Kuleshov, M. V. et al. Enrichr: a comprehensive gene set enrichment analysis web server 2016 update. Nucleic Acids Res. 44, W90–W97 (2016).
Article CAS PubMed PubMed Central Google Scholar
Koyama, A. et al. Increased cytoplasmic tardbp mrna in affected spinal motor neurons in als caused by abnormal autoregulation of tdp-43. Nucleic Acids Res. 44, 5820–5836 (2016).
Article CAS PubMed PubMed Central Google Scholar
Buratti, E. et al. Multiple roles of tdp-43 in gene expression, splicing regulation, and human disease. Front. Biosci. 13, 867–878 (2008).
Zhou, H., Mangelsdorf, M., Liu, J., Zhu, L. & Wu, J. Y. Rna-binding proteins in neurological diseases. Sci. China Life Sci. 57, 432–444 (2014).
Article CAS PubMed Google Scholar
Zweig, R. M., Jankel, W. R., Hedreen, J. C., Mayeux, R. & Price, D. L. The pedunculopontine nucleus in parkinson’s disease. Ann. Neurol. 26, 41–46 (1989).
Article CAS PubMed Google Scholar
Boassa, D. et al. Mapping the subcellular distribution of α-synuclein in neurons using genetically encoded probes for correlated light and electron microscopy: implications for parkinson’s disease pathogenesis. J. Neurosci. 33, 2605–2615 (2013).
Article CAS PubMed PubMed Central Google Scholar
Afghah, Z., Chen, X. & Geiger, J. D. Role of endolysosomes and inter-organellar signaling in brain disease. Neurobiol. Dis. 134, 104670 (2020).
Article CAS PubMed Google Scholar
Wittig, U. & De Beuckelaer, A. Analysis and comparison of metabolic pathway databases. Brief. Bioinform. 2, 126–142 (2001).
Article CAS PubMed Google Scholar
Rayaprolu, S. et al. Tardbp mutations in parkinson’s disease. Parkinsonism Relat. Disord. 19, 312–315 (2013).
Article PubMed Google Scholar
Schwab, C., Arai, T., Hasegawa, M., Yu, S. & McGeer, P. L. Colocalization of transactivation-responsive dna-binding protein 43 and huntingtin in inclusions of huntington disease. J. Neuropathol. Exp. Neurol. 67, 1159–1165 (2008).
Article PubMed Google Scholar
Smethurst, P., Sidle, K. C. L. & Hardy, J. Prion-like mechanisms of transactive response dna binding protein of 43 kda (tdp-43) in amyotrophic lateral sclerosis (als). Neuropathol. Appl. Neurobiol. 41, 578–597 (2015).
Article CAS PubMed Google Scholar
Amador-Ortiz, C. et al. Tdp-43 immunoreactivity in hippocampal sclerosis and alzheimer’s disease. Ann. Neurol. 61, 435–445 (2007).
Article CAS PubMed PubMed Central Google Scholar
Szklarczyk, D. et al. The string database in 2021: customizable protein–protein networks, and functional characterization of user-uploaded gene/measurement sets. Nucleic Acids Res. 49, D605–D612 (2021).
Article CAS PubMed Google Scholar
Bader, G. D. & Hogue, C. W. V. An automated method for finding molecular complexes in large protein interaction networks. BMC Bioinform. 4, 1–27 (2003).
Article Google Scholar
Chin, C.-H. et al. cytohubba: identifying hub objects and sub-networks from complex interactome. BMC Syst. Biol. 8, 1–7 (2014).
Article Google Scholar
Torraco, A. et al. Novel ndufa12 variants are associated with isolated complex i defect and variable clinical manifestation. Hum. Mutat. 42, 699–710 (2021).
Article CAS PubMed Google Scholar
Haack, T. B. et al. Mutation screening of 75 candidate genes in 152 complex i deficiency cases identifies pathogenic variants in 16 genes including ndufb9. J. Med. Genet. 49, 83–89 (2012).
Article CAS PubMed Google Scholar
Piekutowska-Abramczuk, D. et al. Ndufb8 mutations cause mitochondrial complex i deficiency in individuals with leigh-like encephalomyopathy. Am. J. Hum. Genet. 102, 460–467 (2018).
Article CAS PubMed PubMed Central Google Scholar
Schapira, A. H. V. et al. Mitochondrial complex i deficiency in parkinson’s disease. J. Neurochem. 54, 823–827 (1990).
Article CAS PubMed Google Scholar
Li, T. et al. Timer2. 0 for analysis of tumor-infiltrating immune cells. Nucleic Acids Res. 48, W509–W514 (2020).
Article CAS PubMed PubMed Central Google Scholar
Flynn, R. A. et al. Transcriptome-wide interrogation of rna secondary structure in living cells with icshape. Nat. Protoc. 11, 273–290 (2016).
Article CAS PubMed PubMed Central Google Scholar
Li, P., Shi, R. & Zhang, Q. C. icshape-pipe: a comprehensive toolkit for icshape data analysis and evaluation. Methods 178, 96–103 (2020).
Article CAS PubMed Google Scholar
Chan, D., Feng, C. & Spitale, R. C. Measuring rna structure transcriptome-wide with icshape. Methods 120, 85–90 (2017).
Article CAS PubMed PubMed Central Google Scholar
Chen, L., Chang, H. Y. & Artandi, S. E. Analysis of rna conformation in endogenously assembled rnps by icshape. STAR Protoc. 2, 100477 (2021).
Article CAS PubMed PubMed Central Google Scholar
Ioffe, S. and Szegedy, C. Batch normalization: Accelerating deep network training by reducing internal covariate shift. In International Conference On Machine Learning, p. 448–456. PMLR (2015).
Krizhevsky, A., Sutskever, I. & Hinton, G. E. Imagenet classification with deep convolutional neural networks. Commun. ACM 60, 84–90 (2017).
Article Google Scholar
He, K., Zhang, X., Ren, S., and Sun, J. Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition, p. 770–778 (2016).
Johnson, R. and Zhang, T. Deep pyramid convolutional neural networks for text categorization. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), p. 562–570 (2017).
He, K., Zhang, X., Ren, S., and Sun, J. Delving deep into rectifiers: Surpassing human-level performance on imagenet classification. In Proceedings of the IEEE International Conference On Computer Vision, p. 1026–1034 (2015).
Chen, T. and Guestrin, C. Xgboost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD International Conference On Knowledge Discovery And Data Mining, p. 785–794 (2016).
Breiman, L. Random forests. Mach. Learn. 45, 5–32 (2001).
Article MATH Google Scholar
Hosmer Jr, D. W., Lemeshow, S., & Sturdivant, R. X. Applied Logistic Regression, Vol. 398 (John Wiley & Sons, 2013).
Jain, A. K., Mao, J. & Mohiuddin, K. M. Artificial neural networks: a tutorial. Computer 29, 31–44 (1996).
Article Google Scholar
Geurts, P., Ernst, D. & Wehenkel, L. Extremely randomized trees. Mach. Learn. 63, 3–42 (2006).
Article MATH Google Scholar
Zadrozny, B. and Elkan, C. Transforming classifier scores into accurate multiclass probability estimates. In Proceedings of the Eighth ACM SIGKDD International Conference On Knowledge Discovery And Data Mining, p. 694–699 (2002).
Kamel, H., Abdulah, D., and Al-Tuwaijari, J. M. Cancer classification using gaussian naive bayes algorithm. In 2019 International Engineering Conference (IEC), p. 165–170. (IEEE, 2019).
Pedregosa, F. et al. Scikit-learn: machine learning in python. J. Mach. Learn. Res. 12, 2825–2830 (2011).
MathSciNet MATH Google Scholar
Yu, G., Wang, L.-G., Han, Y. & He, Q.-Y. clusterprofiler: an r package for comparing biological themes among gene clusters. OMICS 16, 284–287 (2012).
Article CAS PubMed PubMed Central Google Scholar
Van Nostrand, E. L. et al. A large-scale binding and functional map of human rna-binding proteins. Nature 583, 711–719 (2020).
Article ADS PubMed PubMed Central Google Scholar
Zhu, H. & Li, X. Dynamic characterization and interpretation for protein-RNA interactions across diverse cellular conditions using HDRNet. https://doi.org/10.6084/m9.figshare.24132423 (2023a).
Zhu, H. & Li, X. zhuhr213/HDRNet: Dynamic characterization and interpretation for protein-RNA interactions across diverse cellular conditions using HDRNet. https://doi.org/10.5281/zenodo.8341649 (2023b).

Download references

Acknowledgements

The work described in this paper was substantially supported by the National Natural Science Foundation of China under (Grant No. 62076109) and the Jilin Province Outstanding Young Scientist Program (Grant No. 20230508098RC), and also funded by “the Fundamental Research Funds for the Central Universities, JLU".

Author information

Authors and Affiliations

School of Artificial Intelligence, Jilin University, 130012, Changchun, China
Haoran Zhu, Yi Chang & Xiangtao Li
Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, ON, Canada
Yuning Yang
School of Artificial Intelligence, Hebei University of Technology, Tianjin, China
Yunhe Wang
Department of Computer Science, City University of Hong Kong, Hong Kong, Hong Kong SAR
Fuzhou Wang & Ka-chun Wong
College of Computer Science and Cyber Security, Chengdu University of Technology, 610059, Chengdu, China
Yujian Huang

Authors

Haoran Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Yuning Yang
View author publications
You can also search for this author in PubMed Google Scholar
Yunhe Wang
View author publications
You can also search for this author in PubMed Google Scholar
Fuzhou Wang
View author publications
You can also search for this author in PubMed Google Scholar
Yujian Huang
View author publications
You can also search for this author in PubMed Google Scholar
Yi Chang
View author publications
You can also search for this author in PubMed Google Scholar
Ka-chun Wong
View author publications
You can also search for this author in PubMed Google Scholar
Xiangtao Li
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

X.L. conceived and supervised the project. H.Z. developed and implemented the algorithms under the guidance of X.L. and H.Z. wrote the manuscript. H.Z. conducted the experiments. X.L., H.Z., and K.W. did the biological interpretation. H.Z., Y.Y., and Y.W. completed the figures and manuscript. H.Z., X.L., and K.W. revised the manuscript. F.W., Y.H., and Y.C. provided advice on method development. All authors approved the manuscript.

Corresponding authors

Correspondence to Ka-chun Wong or Xiangtao Li.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks the anonymous reviewer(s) for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Reporting Summary

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Zhu, H., Yang, Y., Wang, Y. et al. Dynamic characterization and interpretation for protein-RNA interactions across diverse cellular conditions using HDRNet. Nat Commun 14, 6824 (2023). https://doi.org/10.1038/s41467-023-42547-1

Download citation

Received: 23 February 2023
Accepted: 13 October 2023
Published: 26 October 2023
DOI: https://doi.org/10.1038/s41467-023-42547-1

This article is cited by

RNA structure: implications in viral infections and neurodegenerative diseases
- Suiru Lu
- Yongkang Tang
- Lei Sun
Advanced Biotechnology (2024)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.