Abstract
Chronic kidney disease (CKD) represents a significant global health burden. Currently employed CKD biomarkers are influenced by various factors and lack accuracy in reflecting early-stage renal fibrosis severity. Consequently, there is an urgent need for the identification of early, noninvasive CKD biomarkers. Urine, easily collectible and kidney-derived, has demonstrated potential as a diagnostic source for various kidney diseases by leveraging its RNA content. To address this, we obtained RNA-seq data pertaining to urinary RNAs from both CKD patients and healthy controls via the Gene Expression Omnibus database (GEO). The DEseq2 software was utilized to identify differentially expressed RNAs (DE-RNAs). To evaluate the overall accuracy of these DE-RNAs in urine, we performed Receiver Operating Characteristic analysis (ROC). Selected urinary RNAs were subsequently validated using reverse-transcription quantitative real-time Polymerase Chain Reaction (qRT-PCR) in conjunction with ROC analysis. Computational and experimental analyses revealed significant increases in miR-542-5p, miR-33b-5p, miR-190a-3p, miR-507, and CSAG4 within the urine of CKD patients, exhibiting high AUC values. In conclusion, our findings suggest that urinary RNAs hold promise as diagnostic biomarkers for CKD.
Similar content being viewed by others
Introduction
Chronic kidney disease (CKD) presents as a progressive condition, starting with inflammation and leading to fibrosis. It has emerged as a significant global health burden, affecting approximately 10% of adults1. The rising prevalence of diabetes is further contributing to an increase in diabetic nephropathy patients, thereby amplifying the overall burden of CKD. The absence of early clinical signs prior to irreversible kidney dysfunction poses a primary challenge in effectively managing CKD patients2. Currently, serum creatinine, urea nitrogen, and protein are the most commonly used biomarkers for diagnosing CKD. However, these biomarkers are influenced by factors such as age, diet, and infection status, limiting their diagnostic accuracy. Additionally, they are inadequate in detecting early renal fibrosis, which is crucial for timely intervention. Renal biopsy provides a gold standard for precise renal pathological examination; however, it is invasive, discomforting, and associated with potential complications. Consequently, it cannot be routinely repeated or applied as a screening tool for large populations. Therefore, there exists an urgent need to identify early and non-invasive biomarkers for CKD that can enhance disease management and improve patient outcomes.
Urine, as a product excreted by the kidneys, holds potential as a valuable source for exploring noninvasive biomarkers. Within urine, various types of RNAs, including mRNA, lncRNA, circRNA, miRNA, piRNA, and tRNA, have been identified3. Due to the ease of urine collection, urinary RNAs have emerged as promising noninvasive biomarkers for detecting kidney diseases. Previous studies have demonstrated the diagnostic utility of urinary RNAs in conditions such as bladder cancer4, renal cancer5, and prostate cancer6. Nonetheless, there remains a critical research gap concerning whether RNAs present in urine can serve as biomarkers for diagnosing CKD.
Therefore, in this study, we aimed to investigate urinary RNAs from both individuals with CKD and healthy controls in order to identify potential RNA biomarkers for diagnosing CKD. Through computational analysis of RNA-sequencing data, we characterized a diverse range of RNA classes present in the urine, including mRNAs, lncRNAs, circRNAs, miRNAs, piRNAs, and tRNAs. Furthermore, our analysis revealed several specific urinary RNAs that exhibited promising diagnostic potential for CKD.
Methods
RNA-sequencing (RNA-seq) data of urinary RNAs from CKD and healthy controls in GEO database
RNA-seq data pertaining to urinary RNAs in relation to CKD and healthy controls were retrieved from the GEO database (http://www.ncbi.nlm.nih.gov/geo/). We selected RNA-seq datasets based on the following criteria: (a) a minimum sample size of 10 samples, and (b) examination of RNA expression specifically in urine. Datasets that did not provide relevant or useful data for our analysis were excluded. Eventually, two GEO datasets, namely GSE121978 (consisting of 80 urine samples derived from CKD patients) and GSE128359 (including 47 urine samples obtained from healthy controls), met our inclusion criteria. Both of these GEO datasets were generated using Illumina HiSeq 2000 technology.
Analysis of RNA-seq data
The quality of the RNA-seq data was assessed using the FastQC software. To remove adapter sequences, the Cutadapt software (V1.9.3) was employed. For circRNA and mRNA analysis, Hisat2 software7 was utilized, utilizing clear reads along with the circBase database8. Regarding miRNA data analysis, we followed a workflow previously described8. Adapter sequences were trimmed using Cutadapt software (V1.9.3), discarding reads larger than 14 nucleotides. These processed reads were then mapped against precursor miRNA sequences retrieved from miRBase (Release 21) via the Shrimp algorithm. In the case of other small noncoding RNAs encompassing piRNAs and tsRNAs, the reads were aligned against the human genomic sequence hg38 (GRCh38) employing Bowtie2 v2.2.79. The resulting read alignment files were used for quantifying the expression of ncRNA annotations derived from Gencode v2410 and DASHR database11.
Identification of dysregulated RNAs (DE-RNAs)
To mitigate batch effects and unwanted variations in RNA counts within each sample of the two GEO datasets, the SVA package was employed. Subsequently, the DEseq2 software was utilized to identify differentially expressed RNAs (DE-RNAs). In our analysis, RNAs meeting the following criteria were considered differentially expressed: reads per million greater than 1, p-value less than 0.01, and the absolute value of log2 fold change (FC) higher than 1.
Collection of urine samples and RNA isolation
Twenty biopsy-proven CKD patients and twenty healthy volunteers were recruited from Jurong People Hospital, and their whole stream early morning urine samples were collected. The demographic and clinical characteristics of the CKD patients and healthy volunteers are presented in Table 1. Healthy controls were defined as individuals without abnormalities identified through routine urinalysis and having normal renal function [estimated glomerular filtration rate (eGFR) > 90 ml/min/1.73 m2]. Subsequently, the urine samples were centrifuged at 3000 g for 30 min at 4 °C, and the supernatant was carefully collected and stored at − 80 °C. To extract RNA from the supernatant, the TRIzol Reagent (Ambion, Life Technologies) was employed following the manufacturer's protocol (Ambion, Life Technologies, USA). Furthermore, the concentration and purity of the extracted RNA were assessed using the absorbance ratio at 260/280 measured with a NanoDrop 2000 spectrophotometer (Thermo, USA).
Real-time RT-qPCR
RT-PCR was performed using specific primers of miR-542-5p, miR-33b-5p, miR-190a-3p, miR-507 and CSAG4 (Table 2). After RT (50 °C, 30 min), hot start (95 °C, 15 min) and 40 cycles of PCR (95 °C, 1 min; 60 °C, 1 min; and 72 °C, 1 min) was executed.
Statistical analysis
SPSS 18.0 software was employed for statistical analysis, and GraphPad Prism 6.0 (GraphPad Software, San Diego, CA, USA) was utilized for graph generation. To assess the significance of differences in urinary RNA expression between groups, the Mann–Whitney U test was applied. A p-value less than 0.05 was considered statistically significant.
Ethics approval
This study was approved by the Ethics Committee of Jurong People Hospital. Informed consent was waived by the Ethics Committee of Jurong People Hospital. All the experiments were performed in accordance with the approved guidelines and complied with the Declaration of Helsinki. The urine samples were collected from the residual urine after routine urine test, which belongs to the exemption of informed consent. Therefore, the participants didn’t provide written informed consent.
Results
Differential abundance analysis of urinary RNAs in CKD patients compared to healthy controls
To explore the abundance of urinary RNAs, we performed an analysis using a dataset consisting of 80 CKD samples (GSE121978) and 47 healthy controls (GSE128359). This analysis encompassed a wide range of RNA types, including 36 circRNAs, 58,427 long RNAs (comprising mRNAs and lncRNAs), 1,636 miRNAs, 531 piRNAs, and 24 tRNAs (Table S1). Notably, the application of tSNE analysis revealed clear clustering of the CKD and healthy control samples based on their urinary RNA profiles (Fig. 1A). Furthermore, through visual examination of the volcano plot and heat map, distinct patterns of urinary RNA expression were observed, effectively differentiating CKD patients from healthy controls (Fig. 1B,C).
Using the DESeq2 software, we identified a total of 138 differentially expressed RNAs (DE-RNAs) meeting the significance criteria of |log2 fold change (FC)| ≥ 1 and a p-value ≤ 0.01. Out of these, 16 RNAs were found to be upregulated, while 122 were downregulated in CKD samples compared to healthy controls (Table S2).
Selection of candidate urinary RNAs for the diagnosis of CKD
To assess the overall accuracy of 138 differentially expressed RNAs (DE-RNAs) in discriminating between various groups of CKD patients and healthy controls, ROC analysis was conducted. Specifically, these DE-RNAs were selected from a pool of urinary RNAs with reads above 1 per million. The analysis revealed that among the 100 RNAs under investigation, each exhibited distinct area under the curve (AUC) values, with the highest AUC (AUC = 0.85625) observed for hsa-miR-542-5p (Table S3).
To identify the optimal combination of diagnostic markers for CKD, further scrutiny was focused on RNAs with an AUC exceeding 0.75. As a result, six out of the 100 RNAs were shortlisted: miR-542-5p, miR-33b-5p, miR-190a-3p, miR-507, Chondrosarcoma-Associated Gene 2/3 Protein Pseudogene (CSAG4), and miR-95-5p (Table S3). These selected RNAs demonstrated promising diagnostic potential for CKD.
Identification of the six urinary RNAs for the diagnosis of CKD
To validate the differential expression of the six candidate RNAs in urine, we conducted further analysis using the Mann–Whitney U test and ROC analysis. As shown in Fig. 2A–F, the expression levels of miR-542-5p, miR-33b-5p, miR-190a-3p, miR-507, CSAG4, and miR-95-5p were significantly elevated in CKD patients when compared to healthy controls (Fig. 2A–F). To assess the diagnostic value of these six urinary RNAs for CKD detection, the ROC curve analysis was subsequently performed. As shown in Fig. 3A–F, the AUC values of these six urinary RNAs were 0.8563 (95% CI 0.7913–0.9212) for miR-542-5p, 0.7625 (95% CI 0.6819–0.8431) for miR-33b-5p, 0.8438 (95% CI 0.7764–0.9111) for miR-190a-3p, 0.8188 (95% CI 0.7468–0.8907) for miR-507, 0.7750 (95% CI 0.6962–0.8538) for CSAG4, and 0.7938 (95% CI 0.7178–0.8697) for miR-95-5p (Fig. 3A–F). Then, the diagnostic value of the combination of these urinary RNAs was evaluated by a logistic regression model. Remarkably, the combination of miR-542-5p, miR-33b-5p, miR-190a-3p, miR-507, and CSAG4 yielded the highest AUC of 0.9625 (95% CI 0.9281–0.9969) (Fig. 4). Moreover, the sensitivity and specificity of the combination were 100% and 92.50%, respectively. Based on the outcomes of the aforementioned logistic regression, we computed the risk score (Logit(p)) based on the combination of miR-542-5p, miR-33b-5p, miR-190a-3p, miR-507, and CSAG4 (Table 3):
The obtained results strongly suggest that the combination of miR-542-5p, miR-33b-5p, miR-190a-3p, miR-507, and CSAG4 in urine serves as an ideal biomarker for the detection of CKD.
Validation the six urinary RNAs for the diagnosis of CKD
A total of 40 urine samples were collected, consisting of 20 CKD patients and 20 healthy volunteers. The expression levels of miR-542-5p, miR-33b-5p, miR-190a-3p, miR-507, and CSAG4 in the urine were measured using RT-qPCR. Consistent with the findings from previous RNA sequencing results, the expression levels of miR-542-5p, miR-33b-5p, miR-190a-3p, miR-507, and CSAG4 in the urine of CKD patients were significantly up-regulated (Fig. 5). Subsequently, ROC curve analysis was conducted to assess the diagnostic value of urinary RNAs in the diagnosis of CKD, and the results revealed that urinary miR-542-5p, miR-33b-5p, miR-190a-3p, miR-507, and CSAG4 exhibited an AUC of 0.8750 (95% CI, 0.7552–0.9948, p < 0.001), 0.7625 (95% CI, 0.6819–0.8431, p < 0.001), 0.8438 (95% CI, 0.7764–0.9111, p < 0.001), 0.8188 (95% CI, 0.7468–0.8907, p < 0.001) and 0.7750 (95% CI, 0.6962–0.8538, p < 0.001), respectively (Fig. 6A–E). The AUC of the combination of miR-542-5p, miR-33b-5p, miR-190a-3p, miR-507, and CSAG4 was 1.0000 (95% CI 1.0000–1.0000) (Fig. 6F). Thus, urinary miR-542-5p, miR-33b-5p, miR-190a-3p, miR-507, and CSAG4 could serve as ideal diagnostic biomarker for CKD.
Discussion
Until now, the diagnosis of CKD has primarily relied on urinary protein analysis and assessing changes in the glomerular filtration rate. Diagnostic biomarkers at the gene transcription level, widely employed in the diagnosis of tumors and various diseases, have not been extensively utilized for CKD due to the invasive nature of kidney biopsy. However, as growing evidence indicates the presence of RNAs in urine, similar to serum/plasma samples, the analysis of urinary RNAs presents a new avenue to gain insights into CKD. This non-invasive approach offers a promising opportunity to explore and understand CKD in a novel way.
In this study, high-throughput RNA-sequencing was employed to analyze urinary RNAs with the aim of identifying potential biomarkers for CKD diagnosis. Urine, being easily collectible, has been regarded as an ideal source for kidney-related disease biomarkers. Our analysis revealed the presence of diverse RNA classes in urine, including mRNA, lncRNA, circRNA, miRNA, piRNA, and tRNA. Notably, miR-542-5p, miR-33b-5p, miR-190a-3p, miR-507, and CSAG4 in urine emerged as promising biomarkers for CKD. Both plasmatic transrenal and postrenal cell-free RNA were identified in urine; however, the focus of existing studies has predominantly been on RNA released into urine postrenally through mechanisms such as apoptosis, necrosis, and active secretion. Previous investigations have successfully detected survirin, cytokeratin 20, mucin 7, and Ki-67 mRNAs in the urine of patients with bladder cancer and various urinary tract infections, highlighting the cells of the urogenital tract as major contributors of urinary RNAs12. Functional analysis of the identified miR-542-5p, miR-33b-5p, miR-190a-3p, miR-507, and CSAG4 biomarkers revealed their involvement in signal pathways relevant to renal damage, such as TNF receptor signaling pathway and p38 MAPK signaling pathway (Fig. 7). These findings suggest that miR-542-5p, miR-33b-5p, miR-190a-3p, miR-507, and CSAG4 might be actively or passively released by the kidneys during the progression of CKD. However, further research is necessary to elucidate the exact release mechanisms of these five indicators by the kidneys.
The detection and identification of RNAs in urine can be accomplished using various techniques such as RNA sequencing, microarray technologies, or reverse-transcription quantitative real-time PCR (qRT-PCR). In our study, we utilized RNA sequencing to identify dysregulated RNAs in the urine of CKD patients. However, for practical clinical application, RNA sequencing poses challenges when applied to urine samples. Currently, qRT-PCR is the most commonly used and convenient method for RNA detection. Therefore, further extensive studies are necessary to establish a qRT-PCR-based detection method for miR-542-5p, miR-33b-5p, miR-190a-3p, miR-507, and CSAG4 in urine before they can be effectively utilized in clinical practice.
Conclusions
In conclusion, our study identified five urinary RNAs, specifically miR-542-5p, miR-33b-5p, miR-190a-3p, miR-507, and CSAG4, which exhibited significantly higher expression levels in CKD patients compared to healthy controls. These findings suggest that these five urinary RNAs hold great potential as diagnostic biomarkers for CKD.
Data availability
All data generated or analyzed during this study are included in this published article. Raw and processed data are available upon request from Li-Zhi Lv (844713944@qq.com).
References
Jha, V. et al. Chronic kidney disease: Global dimension and perspectives. Lancet 382, 260–272 (2013).
Meng, X. M., Nikolic-Paterson, D. J. & Lan, H. Y. Inflammatory processes in renal fibrosis. Nat. Rev. Nephrol. 10, 493–503 (2014).
Martinez-Fernandez, M., Paramio, J. M. & Duenas, M. RNA detection in urine from RNA extraction to good normalizer molecules. J. Mol. Diagn. 18, 15–22 (2016).
Ng, K., Stenzl, A., Sharma, A. & Vasdev, N. Urinary biomarkers in bladder cancer: A review of the current landscape and future directions. Urol. Oncol. Semin. Ori. 39, 41–51 (2021).
Oto, J. et al. Urinary microRNAs: Looking for a new tool in diagnosis, prognosis, and monitoring of renal cancer. Curr. Urol. Rep. 21, 1–8 (2020).
Bronimann, S. et al. An up-to-date catalogue of urinary markers for the management of prostate cancer. Curr. Opin. Urol. 30, 684–688 (2020).
Kim, D., Landmead, B. & Salzberg, S. L. HISAT: A fast spliced aligner with low memory requirements. Nat. Methods 12, 357-U121 (2015).
Glazar, P., Papavasileiou, P. & Rajewsky, N. circBase: A database for circular RNAs. RNA 20, 1666–1670 (2014).
Langmead, B. & Salzberg, S. L. Fast gapped-read alignment with Bowtie 2. Nat. Methods 9, 357-U354 (2012).
Harrow, J. et al. GENCODE: The reference human genome annotation for The ENCODE Project. Genome Res. 22, 1760–1774 (2012).
Leung, Y. Y. et al. DASHR: Database of small human noncoding RNAs. Nucleic Acids Res. 44, D216–D222 (2016).
Gyuraszova, M., Kovalcikova, A., Babickova, J., Hodosy, J. & Tothova, L. Cell-free nucleic acids in urine as potential biomarkers of kidney disease. J. Appl. Biomed. 16, 157–164 (2018).
Acknowledgements
We thank the Nanjing infinite Medical Technology Co., Ltd. (Nanjing, China) for the gift of the kits for urinary RNA isolation.
Author information
Authors and Affiliations
Contributions
L.-Z.L. designed the experiments. L.-Z.L. performed the experiments and analyzed the results. L.-Z.L. wrote the manuscript.
Corresponding author
Ethics declarations
Competing interests
The author declares no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Lv, LZ. The urinary RNA atlas of patients with chronic kidney disease. Sci Rep 13, 19084 (2023). https://doi.org/10.1038/s41598-023-46555-5
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-023-46555-5
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.