Using Machine Learning to Predict Sensorineural Hearing Loss Based on Perilymph Micro RNA Expression Profile

Shew, Matthew; New, Jacob; Wichova, Helena; Koestler, Devin C.; Staecker, Hinrich

doi:10.1038/s41598-019-40192-7

Download PDF

Article
Open access
Published: 04 March 2019

Using Machine Learning to Predict Sensorineural Hearing Loss Based on Perilymph Micro RNA Expression Profile

Matthew Shew¹,
Jacob New²,
Helena Wichova¹,
Devin C. Koestler³ &
…
Hinrich Staecker¹

Scientific Reports volume 9, Article number: 3393 (2019) Cite this article

4574 Accesses
28 Citations
5 Altmetric
Metrics details

Subjects

Abstract

Hearing loss (HL) is the most common neurodegenerative disease worldwide. Despite its prevalence, clinical testing does not yield a cell or molecular based identification of the underlying etiology of hearing loss making development of pharmacological or molecular treatments challenging. A key to improving the diagnosis of inner ear disorders is the development of reliable biomarkers for different inner ear diseases. Analysis of microRNAs (miRNA) in tissue and body fluid samples has gained significant momentum as a diagnostic tool for a wide variety of diseases. In previous work, we have shown that miRNA profiling in inner ear perilymph is feasible and may demonstrate distinctive miRNA expression profiles unique to different diseases. A first step in developing miRNAs as biomarkers for inner ear disease is linking patterns of miRNA expression in perilymph to clinically available metrics. Using machine learning (ML), we demonstrate we can build disease specific algorithms that predict the presence of sensorineural hearing loss using only miRNA expression profiles. This methodology not only affords the opportunity to understand what is occurring on a molecular level, but may offer an approach to diagnosing patients with active inner ear disease.

Putting the “mi” in omics: discovering miRNA biomarkers for pediatric precision care

Article 29 July 2022

Prediction of hearing recovery with deep learning algorithm in sudden sensorineural hearing loss

Article Open access 29 August 2024

Large-scale annotated dataset for cochlear hair cell detection and classification

Article Open access 23 April 2024

Introduction

Hearing loss is the most common neurodegenerative disease worldwide and is estimated to affect over 432 million adults and 34 million children worldwide¹. Unaddressed hearing loss is estimated to pose an annual global cost of over 750 billion US dollars¹. Despite the significant disease burden and economic impact of hearing loss, diagnosing and treating this condition remains a significant challenge because of the limited ability to perform biopsies in order to understand what aberrant mechanisms are occurring on a molecular level.

There are myriad of etiologies that can lead to hearing loss, including: genetic, infectious, noise trauma, and multifactorial disorders such as presbycusis. Clinicians often rely on an assortment of objective testing, including audiometry and vestibular testing, to guide diagnosis and treatment. While these tests provide a measure of function, they do not provide a molecular diagnosis and often do not correctly reflect the cellular site of lesion². MiRNAs are 19–23 base pair single stranded RNA sequences that regulate post translational gene expression³. These molecules have been identified in all body fluids and are recognized for their promising role as a diagnostic and prognostic marker for neurodegenerative diseases such as Alzheimer’s and various cancers^4,5,6,7,8. We recently demonstrated that miRNA profiling within the inner ear is a feasible methodology and can potentially offer insight into what is occurring on a cellular and molecular level in various inner ear pathologies⁹. In our search for specific hearing loss related biomarkers, we were able to demonstrate that various inner ear diseases, from Meniere’s disease to otosclerosis, express different and distinct miRNA profiles⁹. Similarly, recent investigations have also identified several key and distinct miRNAs within the venous blood in patients with sudden sensorineural hearing loss compared to healthy controls¹⁰. However, one of the challenges facing analysis of miRNA data from the inner ear is the immense and variable expression patterns across various diseases that may not be common to all cases.

Machine learning (ML) is a subdiscipline of artificial intelligence (AI) and borrows from multiple disciplines including mathematics, statistics, and computer science^11,12. The field of ML is broadly concerned with two types of tasks: supervised and unsupervised learning. Supervised learning uses prior information on the outcome of interest (labeled data) with the goal of learning a function that, given data on the both the outcome and predictor variables, best approximates the relationship between the predictors and outcome. Supervised learning can be further subdivided into classification and regression depending on the nature of the outcome variable; the former being used when the outcome is categorical and the latter when the outcome is continuous. Conversely, unsupervised learning does not use labeled outputs, but rather seeks to learn and infer the underlying structure present within a set of data. An example unsupervised learning would the use of gene expression microarray data to identify molecular subtypes of a given disease, or otherwise groups/clusters of subjects with a similar gene expression profile. Simply put, ML methods are used by researchers to analyze large amounts of data to find patterns, and in doing so, better solve problems. With the ever-advancing nature of computing power, ML has gained significant popularity within the scientific community. While ML has been slow to assimilate into healthcare, we have seen ML applications ranging from diagnosing skin cancer¹³, diagnosing brain tumors through MRI¹⁴, diagnosing glaucoma¹⁵, optimizing drug therapies¹⁶, analyzing large genome sequencing^17,18, to predicting various diseases and clinical outcomes^12,19,20,21.

In the current study, we used ML to build disease specific algorithms to predict the presence of sensorineural hearing loss in different inner ear pathologies based on perilymph-derived miRNA expression profiles of the inner ear. Subsequently, we applied our algorithms to de-identified patient samples and established the presence and varying degree of sensorineural hearing loss through miRNA expression profile alone. This methodology offers a promising approach for inner diagnosis, prognosis, and monitoring for various neurotologic diseases. Likewise, using this approach we may be able to understand what may be occurring on a molecular level in inner ear disease specific states in a manner that was previously not possible.

Results

We collected perilymph from a total of sixteen patients. Four patients had otosclerosis and perilymph was collected while undergoing stapedectomy. These patients had a pure conductive hearing loss and served as controls since sampling perilymph from patients without an active ear disorder that requires a surgery is not possible at this point in time. Twelve patients underwent cochlear implantation in which perilymph was collected upon opening up the round window for electrode insertion. Seven out of twelve patients had profound SNHL and were classified as not having any residual hearing (PTA > 80 dB), with a mean PTA of 108.9 dB. A total of five out of twelve patients with severe SNHL were classified as having residual hearing (PTA < 80 dB), with a mean PTA of 69.4 dB (Fig. 1). As the use of ML to predict inner ear disease using miRNA signatures represents a novel application, we elected to build and test multiple ML models to ascertain if one was superior to others.

Conductive Hearing Loss versus Sensorineural Hearing Loss

A ML model was constructed to differentiate between SNHL (patients undergoing cochlear implantation) and CHL (patients undergoing stapedectomy). Using a 70/30 split of the data into training/testing sets, both the decision forest and logistic regression ML models were able to differentiate between SNHL and CHL with 100% accuracy in the testing set. The decision jungle and neural network ML were able to differentiate between SNHL and CHL with 80% accuracy in the testing set. We cross validated the models using a leave-one-out cross approach, and all 4 models had a misclassification error of 6.25%. In order to understand how the model was built, we applied the permutation feature of importance to the ML models. The permutation feature of importance is a feature option within Azure ML software that allows the model to understand and evaluate the weighted importance of each data point, in this case miRNA, within the constructed model. The most heavily weighted miRNAs used to construct the ML model included: human miRNA 4767, miRNA 182 5p, miRNA 6754 5p, miRNA 6797 3p, miRNA 6806 3p, miRNA 6860, miRNA 4278, miRNA 3975, miRNA 4655, and miRNA 4732. The miRNAs were considered significant if they were used in the construction of two or more ML models. The miRNAs identified were compared to gene expression in the inner ear, and miRNA function and potential relationship to hearing loss was evaluated using Ingenuity Pathway Analysis (IPA) software. miRNA 4767, 6754 5p, 6797 3p, 6860, and 4732 had no significant known interactions identified. On the other hand, miRNA 182 5p, 6806 3p, 4278, 3975, and 4655 were identified as high probability or with proven interactions in previous microRNA – gene interaction experiments. Table 1 summarizes the key miRNAs with predicted downstream interactions identified that were significant in building ML model to differentiate SNHL from CHL.

Table 1 Critical miRNA identified and predicted downstream gene expression targets for sensorineural hearing loss vs conductive hearing loss.

Full size table

Severity of sensorineural hearing loss

A ML model was constructed to differentiate between cochlear implant candidates with different degrees of sensorineural hearing loss. Groups were categorized as either having residual hearing (cochlear implantation patients with PTA < 80 dB) or no residual hearing (cochlear implant patients with PTA > 80 dB). All four ML models, which included: decision forest, decision jungle, logistic regression, and neural networks, were able to differentiate between cochlear implant candidates with and without residual hearing with 100% accuracy (Fig. 2B). To cross validate these models, we used a leave-one-out cross validation approach. The misclassification error for each model is: decision forest, 0%; logistic regression, 8.33%; decision jungle, 25%; and neural network, 41.67%. In order to understand how the model was built, we applied the permutation feature of importance to the ML models. The most heavily weighted miRNAs used to construct the ML model included human miRNA 184, miRNA 660, miRNA let 7a 5p, miRNA 3142, and miRNA 335. The miRNAs were considered significant if they were used in the construction of two or more ML models. The miRNAs identified were compared to gene expression in the inner ear, and miRNA function and potential relationship degree of SNHL were evaluated using IPA software. Table 2 summarizes the key miRNA and predicted downstream targets which included miRNA 184, miRNA 660, and miRNA let 7a 5p. No significant known interactions were predicted using IPA software for miRNA 3142 and miRNA 335.

Table 2 Critical miRNA identified and predicted downstream gene expression targets for sensorineural hearing loss with and without residual hearing.

Full size table

A regression ML model was constructed using pure tone average (PTA) in decibels to further study the relationship between miRNA and severity of SNHL. The root mean squared error for our best model was 21.26 dB, allowing us to predict hearing loss with an expected error in the 21 dB range. We observed a decrease in sensitivity of the model when comparing directly with PTA as a continuous variable, compared to categorical characterization using 80 dB as a cut off. The decreased accuracy is consistent with the heterogenous nature of hearing loss within our patient cohort.

Discussion

MiRNAs were initially discovered in 1993 and have since been shown to play an essential role in post translational regulation of gene expression through messenger RNA (mRNA) degradation and splicing^3,22. Because miRNA play a critical role in cell gene expression, miRNA activity has been increasingly recognized as a critical component to many disease states making them a promising biomarker^5,23. Interestingly, miRNA profiles have shown unique expression patterns to different diseases states, from heart failure²⁴ to individual cancers⁷ (colon^25,26,27, ovarian²⁸, and clear cell carcinoma²⁹), neurodegenerative diseases⁴, different cell types³⁰, and play an integral role embryonic development⁸. Pertinent to hearing loss, miRNA have shown to be a promising biomarker and diagnostic marker for otherwise difficult to diagnose neurodegenerative diseases such as Alzheimer’s and Parkinsons^4,5,31. Additionally, miRNA have shown to play a critical role in inner ear development, demonstrate tissue and site specific expression, and may exhibit expression patterns specific to SNHL^10,32,33,34. Taken together, miRNA expression profiling may serve as a promising diagnostic and prognostic tool for the inner ear. We sought to see if ML could differentiate between distinct inner ear miRNA expression profiles specific to different active inner ear disease states.

To date, we have no biopsy equivalent for inner ear disease and have no objective insight into what is occurring on a molecular level in patients with active inner ear disease. The knowledge we have assimilated to date is based on objective testing and the study of various otologic and neurotologic pathologies using temporal bone pathology and animal models. Here, we have shown that various inner ear diseases may demonstrate specific and unique miRNA expression profiles in the very simple model of presence or absence of sensorineural loss, and a model differentiating varying severity of sensorineural hearing loss. Utilizing the unique expression profiles, we can use machine learning to build algorithms to differentiate between different degrees of sensorineural hearing loss with high accuracy, opening the door to further refining the application of this methodology.

Artificial intelligence (AI) and its subdiscipline machine learning (ML) have seamlessly integrated themselves into our modern-day culture, but they have been slow to assimilate into health care. ML enables one to analyze large amounts of data, understand pattern recognition, and make predictions using what it has learned. ML borrows from many subdisciplines including statistics and computer science. However while statistics primarily focuses on inferences related to causation and examines how a system of components relate to one another, ML is novel in that it makes predictions based on large sets of data and experiences³⁵. We can supply an un-analyzed perilymph miRNA profile into our ML algorithms and subsequently make accurate predictions to discern between different degrees of sensorineural hearing loss. By analyzing the function of miRNAs identified by this process we may be able to indirectly identify the molecular pathways involved in different inner ear diseases.

ML is optimally used when applied to large and complex datasets, such as gene expression profiles, where it can analyze various patterns to make predications without being specifically programmed to do so. Using the permutation feature of importance we can gain significant insight into how the different decision algorithms are built and how certain factors are weighted (Tables 1 and 2). ML and permutation feature of importance also offers a novel way to analyze miRNA that may be playing a critical role in various inner ear pathologies. For example, looking at SNHL vs CHL, we inputted the critical miRNA identified (4278, 3975, 4655) into the IPA software, and isolated downstream targets that have been experimentally proven. We then analyzed downstream targets that overlap with two or more of the critical miRNA identified. Of interest we identified KCNJ10, HCN, and Otoferlin. All three genes have been experimentally proven with the individual miRNAs and shown to play critical roles in SNHL on a molecular level. KCNJ10 has been shown within the stria vascularis to have a critical role in generating endocochlear potentials³⁶. Hyperpolarized-activated cyclic nucleotide cation (HCN) channels have been shown to play an essential role within the spiral ganglia and propagating post synaptic potentials³⁷. Otoferlin has been shown to play a critical role in Ca2+ evoked vesicular exocytosis within inner hair cells³⁸. Similarly, miRNA Let 7 family was critical in differentiating CI patients with and without residual hearing. Investigators have experimentally shown how Let 7 miRNA family plays a critical role in RAS signaling³⁹, and along similar lines RAS/MAPK pathway is crucial in inner hair survival following noise induced hearing loss⁴⁰. The unique miRNA profiles identified through ML can offer significant understanding to aberrant disease mechanisms on a molecular level and also potentially identify site specific pathology.

Furthermore, using SNHL with and without residual as a categorical differentiation we are able to construct ML models with nearly 100% accuracy. However, when analyzing PTA as a continuous variable we lose some accuracy with a root mean squared error of 21.26 dB, allowing us to predict +/−21 dB. While these results do raise some inconsistency issues using miRNAs as a direct measure to varying degrees of SNHL, they are not surprising given the heterogeneity of hearing loss within our patient cohorts. PTA is an average hearing loss across multiple frequencies. While patients may have similar PTAs, they can have significantly different patterns and frequencies affected. Secondly patients have different underlying disease mechanisms leading to hearing loss, such as presbycusis versus early onset genetic hearing loss. Taking both these factors into account, the decreased accuracy using PTA as a continuous variable is not surprising. As we continue to grow our patient perilymph database and homogenize our SNHL patients with similar patterns and frequencies, we would expect an improvement within our regression ML model.

Despite the increasing recognition of the unique role miRNAs play in development and various disease states, identifying and validating miRNA target genes has been a great challenge^41,42. Computational analysis on miRNAs and regulation of gene expression are based on aligning and predicting 5′ miRNA and 3′ complimentary known mRNA sequences. While this methodology is the mainstay of miRNA discovery and gene expression analysis, it does have its limitations^42,43. As we continue to grow our miRNA perilymph database, key and critical miRNA identified will need to be validated through miRNA/mRNA target validation, co expression, and ultimately their regulatory effect on gene expression through luciferase and other validated expression assays⁴¹. Validation experiments will be critical in moving forward, particularly as the key miRNA unique to different inner ear disease states are pursued as potential drug therapy targets.

While the ML methodologies employed herein offer an exciting prospect, we must acknowledge several limitations. Definitive conclusions are limited by our small sample size of 16 patients. As we continue to grow our database and include a wider range of pathologies, we do not expect to maintain 100% accuracy in our predictions; however, we do expect the sensitivity of our algorithms to improve. ML algorithms are adaptive, therefore, as we continue to recruit additional patients, the ML algorithms will continue to learn from these new “experiences” and adapt its output accordingly. However, our first goal was to demonstrate that inner ear miRNA expression profiling may be unique to disease specific inner ear pathologies. Using ML, we were able to successfully delineate these unique miRNA expression profiles and use them to predict ongoing inner ear pathologies in patients in real time, a methodology not previously possible on a molecular level.

Finally, one of the major limitations of ML is the “black box” that are the predictive algorithms. We can control the data that goes in and the desired predictions we wish to build, but we have limited insight into the exact mechanics behind each algorithm⁴⁴. One of the shortcomings of ML as compared to traditional statistical methodologies is that we cannot put any significantly meaningful inferences behind how the model is built. There are no confidence intervals or odds ratios equivalent for ML. The only meaningful insight we have into the algorithm is using the different analytical features such as permutation feature of importance to assess the contribution of each miRNA to the constructed model. While we can understand which limited number of the miRNA being used and at what weighted fraction, unfortunately we do not know if its upregulation or downregulation of a pathway for each specific miRNA. There are many ongoing efforts to understand and unlock this “black box” but unfortunately we still have yet to find a definitive solution⁴⁵.

Conclusions

MiRNAs’ unique role and expression profiles are well recognized to play an integral role in different disease states from cancer to neurodegenerative states making them an exciting biomarker. MiRNAs are well established to play a critical role in inner ear development and some preliminary investigations have shown its potential role in SNHL. In this study, we demonstrate using ML we can delineate the miRNA expression profile unique to different inner ear pathologies. This methodology not only provides an understanding of inner ear pathology on a molecular level, but may also offer a novel method to diagnose and prognose patients with active inner ear disease. Similar to how a patient undergoes a lumbar puncture to collect CSF for meningitis, one could theoretically undergo a “round window tap” to diagnose, prognose, and monitor therapeutic interventions for Meniere’s disease or SNHL. While our methodology may offer an exciting prospect for stratification of patients with inner ear disease, multiple safety and validation studies will need to be performed. Additional patient recruitment will be needed to continue to grow our perilymph miRNA database to improve the predictive algorithms and its potential ability to differentiate various inner ear pathologies.

Methods

Human perilymph sampling was approved by the University of Kansas Human Studies Committee and Institutional Review Board (IRB). All experiments performed were in accordance with relevant guidelines and regulations approved by the University of Kansas IRB. Patients were recruited if they were undergoing a surgical procedure in which the inner ear was opened, and informed consent was obtained prior to perilymph collection. Procedures in which patients were recruited included patients undergoing stapedectomy for otosclerosis or cochlear implantation for sensorineural hearing loss (SNHL). All patients received standard of care and underwent standard surgical treatment for either stapedectomy or cochlear implantation. Only difference is when patient’s inner ear was opened for their indicated procedure, a small sterile glass capillary tube was used to collect approximately 2–5 µL volume of perilymph⁹.

Sampling for stapedectomy

The skin of the external auditory canal was injected with 0.5 ml of 1% lidocaine + 1:100,000 epinephrine. Using a round knife, a cut was made in the skin of the external canal and the dependent middle flap was carefully elevated medially revealing the middle ear structures. Using the Omniguide™ CO2 laser with a power setting a four watts, 0.1 second single bursts, the stapes superstructures were removed. Using the laser, a rosette fenestration was made in the stapes footplate. Upon making the fenestration, perilymph could be seen coming out laterally from the vestibula. We then removed excess perilymph with a sterile glass capillary tube. After successful collection of perilymph fluid, the stapes footplate fenestration was enlarged to accommodate the stapes prosthesis. The prosthesis was then hooked around the incus and the surgery was completed in a standard fashion.

Sampling for cochlear implantation

Through a post auricular incision, a mastoidectomy and facial recess exposure of the round window were completed in a standard fashion. The wound was irrigated with antibiotic solution. The oval and round windows were identified, and the bony overhang of the round window niche was removed with a 1 mm micro drill. Using an angled pick, the round window was opened. At this point there was free flow of excess perilymph out of the cochlea which was sampled using a sterile glass capillary tube. The implant electrode was then inserted into the cochlea.

microRNA analysis

The perilymph was collected as described above. Total RNA was extracted with Trizol reagent (Thermofisher, cat #15596018) and purified by centrifuging with phase lock heavy gel (Tiagen, cat # WMS-2302830). RNA was analysed using The Agilent RNA6000 Pico kit using an Agilent Bioanalyzer 2100 yielding on average 0.5–2 ng of total RNA per sample. Samples were processed and analysed with an Affymetrix miRNA 4.0 array to determine the presence of micro RNAs. The Affymetrix miRNA 4.0 array interrogates all miRNA sequences listed in miRBase Release 20; interrogating 30,434 mature miRNAs from 203 organisms of which 2,578 are from humans. The arrays were background corrected, normalized and gene-level summarized using the Robust Multichip Average (RMA) algorithm⁹. This normalization step makes inferences on miRNA expression across conditions possible. In order to ascertain which miRNAs were significantly expressed in each array, for each miRNA probe in the array, a detection p-value was computed based on a Wilcoxon Rank-Sum test of the miRNA probe set signals compared to the distribution of a GC matched background signal comprising of anti-genomic probes in the same array. The detection p-values were adjusted for multiple hypothesis testing (FDR) using the Benjamini and Hochberg method. These analyses were performed using Affymetrix Expression Console Software. miRNAs with a normalized log2 signal intensity ≥7 and an adjusted detection p-value (FDR) ≤ 0.05 were considered significantly expressed in the assay and were considered in downstream ML model development.

Data analysis

Patient selection and audiometry

All patients underwent pure tone audiometry prior to their respective procedures. Air and bone conduction thresholds were determined, and a CT scan of the temporal bone was performed for all patients with diagnosis of otosclerosis. Only patients with conductive hearing loss (CHL) and radiologically confirmed otosclerosis were included in the CHL control group (n = 4). For patients that met audiologic criteria for cochlear implantation, the frequency pure tone average (PTA) was determined. A PTA of 70 dB was used as a limit identifying patients with residual hearing (PTA < 80 dB) (n = 5) and no residual hearing (PTA > 80 dB) (n = 7) (Fig. 1).

Machine learning analysis

In order to analyse the miRNA dataset unique to various inner ear pathologies, we constructed a supervised machine learning classification model using the opensource Azure Machine Learning Studio (Microsoft Corporation). Data was uploaded form the Affymetrix miRNA 4.0 array, and formatted for Azure ML. The log2 transformed signal intensities were used to construct the ML models. The data was randomly split 70/30 into training and testing sets. ML models were built using the training data and subsequently tested on the remaining 30% of data comprising the testing set. We considered multiple multiclass decision models, including: multiclass decision forest (minimum samples per lead node 1; number of random splits 128; maximum depth of decision tree 32; number of decision trees 8), multiclass decision jungle (number of optimization steps per decision DAG layer 2048; maximum width of decision DAGs 128; maximum depth of the decision DAGs 32; number of decision DAGs 8), multiclass logistic regression, and multiclass neural network. Boostrap aggregation was built into model generation for each of the fitted models. “Tune model hyperparameters”, an option within Azure ML software, was used to apply an entire grid wide sweep and determine the optimum parameters settings. After the model was built and tuned, it was then scored and evaluated using the testing set data (Figs 2 and 3). A leave-one-out cross validation approach was subsequently used to better evaluate each model’s performance. To represent the results of the cross validation, the leave-one-out misclassification error rate was determined for each model as the ratio of the number of misclassified samples to the total number of samples. A permutation-based approach was applied to assess feature importance within each multiclass trained decision model in order to analyse which miRNAs had the most influence, along with their corresponding weight in construction of the respective trained model. The metric used in the permutation feature analysis was accuracy, and permutation feature importance scores are determined as: permutation feature importance score = base model accuracy − model accuracy after shuffling a given feature. Thus, a high permutation feature importance score is indicative of a feature with a large influence on the model’s accuracy. We compared the ability of the models to distinguish between patients with CHL and SNHL (stapedectomy vs. cochlear implant patients) and within the cochlear implant group evaluated the ability of the models to distinguish between patients with and without residual hearing.

Evaluation of permutation features

Data analysis was performed along two fronts to understand pathogenesis and look for patterns of expression unique to different disease types. miRNAs identified as significantly influencing the model development were identified. Using Ingenuity Pathway Analysis (IPA) software (Qiagen Bioinformatics) these miRNAs were analysed alongside a human inner ear cDNA library. Known and highly predicted miRNA cochlear mRNA interactions were identified and mapped out to delineate and understand the different regulatory pathways.

References

World Health Organization : Deafness and Hearing Loss, http://www.who.int/news-room/fact-sheets/detail/deafness-and-hearing-loss (2018).
Landegger, L. D., Psaltis, D. & Stankovic, K. M. Human audiometric thresholds do not predict specific cellular damage in the inner ear. Hearing research 335, 83–93, https://doi.org/10.1016/j.heares.2016.02.018 (2016).
Article PubMed PubMed Central Google Scholar
Vidigal, J. A. & Ventura, A. The biological functions of miRNAs: lessons from in vivo studies. Trends in cell biology 25, 137–147, https://doi.org/10.1016/j.tcb.2014.11.004 (2015).
Article CAS PubMed Google Scholar
Burgos, K. et al. Profiles of extracellular miRNA in cerebrospinal fluid and serum from patients with Alzheimer’s and Parkinson’s diseases correlate with disease status and features of pathology. PLoS One 9, e94839, https://doi.org/10.1371/journal.pone.0094839 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Weber, J. A. et al. The microRNA spectrum in 12 body fluids. Clinical chemistry 56, 1733–1741, https://doi.org/10.1373/clinchem.2010.147405 (2010).
Article CAS PubMed PubMed Central Google Scholar
Hamam, R. et al. Circulating microRNAs in breast cancer: novel diagnostic and prognostic biomarkers. Cell death & disease 8, e3045, https://doi.org/10.1038/cddis.2017.440 (2017).
Article Google Scholar
Hayes, J., Peruzzi, P. P. & Lawler, S. MicroRNAs in cancer: biomarkers, functions and therapy. Trends in molecular medicine 20, 460–469, https://doi.org/10.1016/j.molmed.2014.06.005 (2014).
Article CAS PubMed Google Scholar
Chadly, D. M. et al. Developmental profiling of microRNAs in the human embryonic inner ear. PLoS One 13, e0191452, https://doi.org/10.1371/journal.pone.0191452 (2018).
Article CAS PubMed PubMed Central Google Scholar
Shew, M. et al. Feasibility of microRNA profiling in human inner ear perilymph. Neuroreport 29, 894–901, https://doi.org/10.1097/wnr.0000000000001049 (2018).
Article CAS PubMed Google Scholar
Li, Q. et al. RNA sequencing uncovers the key microRNAs potentially contributing to sudden sensorineural hearing loss. Medicine 96, e8837, https://doi.org/10.1097/md.0000000000008837 (2017).
Article CAS PubMed PubMed Central Google Scholar
Kourou, K., Exarchos, T. P., Exarchos, K. P., Karamouzis, M. V. & Fotiadis, D. I. Machine learning applications in cancer prognosis and prediction. Computational and structural biotechnology journal 13, 8–17, https://doi.org/10.1016/j.csbj.2014.11.005 (2015).
Article CAS PubMed Google Scholar
Sajda, P. Machine learning for detection and diagnosis of disease. Annual review of biomedical engineering 8, 537–565, https://doi.org/10.1146/annurev.bioeng.8.061505.095802 (2006).
Article CAS PubMed Google Scholar
Esteva, A. et al. Dermatologist-level classification of skin cancer with deep neural networks. Nature 542, 115–118, https://doi.org/10.1038/nature21056 (2017).
Article ADS CAS PubMed Google Scholar
Lao, J. et al. A Deep Learning-Based Radiomics Model for Prediction of Survival in Glioblastoma Multiforme. Scientific reports 7, 10353, https://doi.org/10.1038/s41598-017-10649-8 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Rahimy, E. Deep learning applications in ophthalmology. Current opinion in ophthalmology 29, 254–260, https://doi.org/10.1097/icu.0000000000000470 (2018).
Article PubMed Google Scholar
Huang, C., Mezencev, R., McDonald, J. F. & Vannberg, F. Open source machine-learning algorithms for the prediction of optimal cancer drug therapies. PLoS ONE 12, e0186906, https://doi.org/10.1371/journal.pone.0186906 (2017).
Article CAS PubMed PubMed Central Google Scholar
Libbrecht, M. W. & Noble, W. S. Machine learning applications in genetics and genomics. Nature reviews. Genetics 16, 321–332, https://doi.org/10.1038/nrg3920 (2015).
Article CAS PubMed PubMed Central Google Scholar
Liu, J., Wang, X., Cheng, Y. & Zhang, L. Tumor gene expression data classification via sample expansion-based deep learning. Oncotarget 8, 109646–109660, https://doi.org/10.18632/oncotarget.22762 (2017).
Article PubMed PubMed Central Google Scholar
Churpek, M. M. et al. Multicenter Comparison of Machine Learning Methods and Conventional Regression for Predicting Clinical Deterioration on the Wards. Critical care medicine 44, 368–374, https://doi.org/10.1097/CCM.0000000000001571 (2016).
Article PubMed PubMed Central Google Scholar
Montazeri, M., Montazeri, M. M. & Beigzadeh, M. A. Machine learning models in breast cancer survival prediction. Technology and Health Care 24, 31–42 (2016).
Article Google Scholar
Sato, F. et al. Prediction of survival in patients with esophageal carcinoma using artificial neural networks. Cancer 103, 1596–1605, https://doi.org/10.1002/cncr.20938 (2005).
Article PubMed Google Scholar
Yates, L. A., Norbury, C. J. & Gilbert, R. J. The long and short of microRNA. Cell 153, 516–519, https://doi.org/10.1016/j.cell.2013.04.003 (2013).
Article CAS PubMed Google Scholar
Goodall, E. F., Heath, P. R., Bandmann, O., Kirby, J. & Shaw, P. J. Neuronal dark matter: the emerging role of microRNAs in neurodegeneration. Frontiers in cellular neuroscience 7, 178, https://doi.org/10.3389/fncel.2013.00178 (2013).
Article CAS PubMed PubMed Central Google Scholar
Naga Prasad, S. V. et al. A unique microRNA profile in end-stage heart failure indicates alterations in specific cardiovascular signaling networks. PLoS One 12, e0170456, https://doi.org/10.1371/journal.pone.0170456 (2017).
Article CAS PubMed PubMed Central Google Scholar
Moler, E. J., Chow, M. L. & Mian, I. S. Analysis of molecular profile data using generative and discriminative methods. Physiological genomics 4, 109–126, https://doi.org/10.1152/physiolgenomics.2000.4.2.109 (2000).
Article CAS PubMed Google Scholar
Furey, T. S. et al. Support vector machine classification and validation of cancer tissue samples using microarray expression data. Bioinformatics (Oxford, England) 16, 906–914 (2000).
Article CAS Google Scholar
Liu, Y. Active learning with support vector machine applied to gene expression data for cancer classification. Journal of chemical information and computer sciences 44, 1936–1941, https://doi.org/10.1021/ci049810a (2004).
Article CAS PubMed Google Scholar
Segal, N. H. et al. Classification and subtype prediction of adult soft tissue sarcoma by functional genomics. The American journal of pathology 163, 691–700, https://doi.org/10.1016/s0002-9440(10)63696-6 (2003).
Article ADS CAS PubMed PubMed Central Google Scholar
Segal, N. H. et al. Classification of clear-cell sarcoma as a subtype of melanoma by genomic profiling. Journal of clinical oncology: official journal of the American Society of Clinical Oncology 21, 1775–1781, https://doi.org/10.1200/jco.2003.10.108 (2003).
Article CAS Google Scholar
Kuosmanen, S. M., Kansanen, E., Sihvola, V. & Levonen, A.-L. MicroRNA Profiling Reveals Distinct Profiles for Tissue-Derived and Cultured Endothelial Cells. Scientific reports 7, 10943, https://doi.org/10.1038/s41598-017-11487-4 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Eacker, S. M., Dawson, T. M. & Dawson, V. L. Understanding microRNAs in neurodegeneration. Nature reviews. Neuroscience 10, 837–841, https://doi.org/10.1038/nrn2726 (2009).
Article CAS PubMed PubMed Central Google Scholar
Friedman, L. M. & Avraham, K. B. MicroRNAs and epigenetic regulation in the mammalian inner ear: implications for deafness. Mammalian genome: official journal of the International Mammalian Genome Society 20, 581–603, https://doi.org/10.1007/s00335-009-9230-5 (2009).
Article CAS Google Scholar
Rudnicki, A. & Avraham, K. B. microRNAs: the art of silencing in the ear. EMBO molecular medicine 4, 849–859, https://doi.org/10.1002/emmm.201100922 (2012).
Article CAS PubMed PubMed Central Google Scholar
Pang, J. et al. Circulating miR-34a levels correlate with age-related hearing loss in mice and humans. Experimental gerontology 76, 58–67, https://doi.org/10.1016/j.exger.2016.01.009 (2016).
Article CAS PubMed Google Scholar
Bzdok, D., Altman, N. & Krzywinski, M. Statistics versus machine learning. Nature Methods 15, 233, https://doi.org/10.1038/nmeth.4642 (2018).
Article CAS PubMed PubMed Central Google Scholar
Wangemann, P. et al. Loss of KCNJ10 protein expression abolishes endocochlear potential and causes deafness in Pendred syndrome mouse model. BMC medicine 2, 30, https://doi.org/10.1186/1741-7015-2-30 (2004).
Article PubMed PubMed Central Google Scholar
Yi, E., Roux, I. & Glowatzki, E. Dendritic HCN channels shape excitatory postsynaptic potentials at the inner hair cell afferent synapse in the mammalian cochlea. Journal of neurophysiology 103, 2532–2543, https://doi.org/10.1152/jn.00506.2009 (2010).
Article CAS PubMed PubMed Central Google Scholar
Beurg, M. et al. Control of exocytosis by synaptotagmins and otoferlin in auditory hair cells. The Journal of neuroscience: the official journal of the Society for Neuroscience 30, 13281–13290, https://doi.org/10.1523/jneurosci.2528-10.2010 (2010).
Article CAS Google Scholar
Johnson, S. M. et al. RAS is regulated by the let-7 microRNA family. Cell 120, 635–647, https://doi.org/10.1016/j.cell.2005.01.014 (2005).
Article CAS PubMed Google Scholar
Kurioka, T. et al. ERK2 mediates inner hair cell survival and decreases susceptibility to noise-induced hearing loss. Scientific reports 5, 16839, https://doi.org/10.1038/srep16839 (2015).
Article ADS CAS PubMed PubMed Central Google Scholar
Kuhn, D. E. et al. Experimental Validation of miRNA Targets. Methods (San Diego, Calif.) 44, 47–54, https://doi.org/10.1016/j.ymeth.2007.09.005 (2008).
Article CAS Google Scholar
Gomes, C. P. C. et al. A Review of Computational Tools in microRNA Discovery. Frontiers in Genetics 4, 81, https://doi.org/10.3389/fgene.2013.00081 (2013).
Article CAS PubMed PubMed Central Google Scholar
Lindow, M. & Gorodkin, J. Principles and limitations of computational microRNA gene and target finding. DNA and cell biology 26, 339–351, https://doi.org/10.1089/dna.2006.0551 (2007).
Article CAS PubMed Google Scholar
Yu, M. K. et al. Visible Machine Learning for Biomedicine. Cell 173, 1562–1565, https://doi.org/10.1016/j.cell.2018.05.056 (2018).
Article CAS PubMed Google Scholar
Altmann, A., Tolosi, L., Sander, O. & Lengauer, T. Permutation importance: a corrected feature importance measure. Bioinformatics (Oxford, England) 26, 1340–1347, https://doi.org/10.1093/bioinformatics/btq134 (2010).
Article CAS Google Scholar

Download references

Acknowledgements

The Ingenuity Pathways Analysis (IPA) software used in this publication was supported by the Biostatistics and Informatics Shared Resource, funded by the National Cancer Institute Cancer Center Support Grant P30 CA168524, and the Kansas IDeA Network of Biomedical Research Excellence Bioinformatics Core, supported in part by the National Institute of General Medical Science award P20GM103418.

Author information

Authors and Affiliations

University of Kansas School of Medicine, Department of Otolaryngology-Head and Neck Surgery, Kansas City, KS, USA
Matthew Shew, Helena Wichova & Hinrich Staecker
University of Kansas School of Medicine, Kansas City, KS, USA
Jacob New
University of Kansas School of Medicine, Department of Biostatistics, Kansas City, KS, USA
Devin C. Koestler

Authors

Matthew Shew
View author publications
You can also search for this author in PubMed Google Scholar
Jacob New
View author publications
You can also search for this author in PubMed Google Scholar
Helena Wichova
View author publications
You can also search for this author in PubMed Google Scholar
Devin C. Koestler
View author publications
You can also search for this author in PubMed Google Scholar
Hinrich Staecker
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M.S., D.C. and H.S. wrote the main manuscript text. M.S. and H.W. helped collect specimens and collect data. M.S., J.N., D.C. and H.S. helped perform the machine learning and prepare Figures 1–3 and Tables 1–2. All authors assisted with experimental design and review of the final manuscript.

Corresponding author

Correspondence to Matthew Shew.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Shew, M., New, J., Wichova, H. et al. Using Machine Learning to Predict Sensorineural Hearing Loss Based on Perilymph Micro RNA Expression Profile. Sci Rep 9, 3393 (2019). https://doi.org/10.1038/s41598-019-40192-7

Download citation

Received: 26 October 2018
Accepted: 07 February 2019
Published: 04 March 2019
DOI: https://doi.org/10.1038/s41598-019-40192-7

This article is cited by

Structure-constrained deep feature fusion for chronic otitis media and cholesteatoma identification
- Cong Cao
- Jian Song
- Muzhou Hou
Multimedia Tools and Applications (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.