Introduction

Tinnitus denotes the audiological phantom perception of a sound in the absence of an external source1. Tinnitus is a common, yet often severe worldwide health problem that substantially affects quality of life for millions of people2,3. European studies estimate a tinnitus prevalence between 12% and 30%4. Besides potential hearing loss5, chronic tinnitus is associated with psychological epiphenomena, including anxiety4,6, other somatoform disorders7,8, insomnia9 and, first and foremost, depression10,11,12. Reported prevalence rates of depression in patients with chronic tinnitus differ considerably, ranging from 14%13 over 25.6%14 up to 59.3%15. In clinical practice, it is often difficult to identify whether depressive symptomatology leads to higher tinnitus distress, or whether higher tinnitus distress causes a persistent depressive mood. The question of comorbid depression in chronic tinnitus is hence of vital interest – both regarding the conceptualisation and measurement of distress and the identification of possible obstacles to tinnitus treatment in the face of major depressive disorder. Therefore, it is important to identify the set of variables that should be assessed at baseline to predict clinically relevant depression in tinnitus patients.

At their first visit to an outpatient clinic, patients with (chronic) tinnitus usually undergo comprehensive medical and psychological assessments concerning tinnitus distress, loudness and frequency as well as the presence and severity of psychological distress. However, completing multiple lengthy questionnaires can be tedious and cumbersome for patients – often at the expense of accuracy. Hence, it is of interest to identify the most relevant questions clinicians should focus on, thereby reducing the overall number of questions within a questionnaire. Reducing the burden of questionnaire completion may improve the quality of answers and thus the assessment’s accuracy.

The traditional approach to extracting the most important questionnaire items requires medical researchers to carefully formulate hypotheses on the relationship between one or more independent variables and the outcome, which are subsequently validated statistically. However, due to the increasingly large volume of data assessed for each patient, this approach becomes impractical, since it is very likely to miss important observations. Hence, to automatically generate new hypotheses, in this study we utilise machine learning to build an accurate prediction model that captures the inherent relationships between its features (the independent variables) and a defined outcome (the dependent variable). The quality and interpretability of such models depend considerably on the selection of relevant features. Ideally, a model is highly accurate while using only a small number of features. Often, there is a trade-off between a complex, highly predictive model and a less complex, yet more interpretable and generalisable model that uses fewer features.

While machine learning methods have been extensively used to develop prediction models for depression, e.g. in diabetes patients16 and in the general population17, we particularly focus on patients with chronic tinnitus.

In this study, we use novel machine learning algorithms (a) to create an accurate model for depression severity after treatment using data extracted from questionnaire answers before treatment, and (b) to minimise the set of predictive features by incrementally removing features with little impact on predictive performance. By trading off high predictive accuracy against low model complexity, our results can help to identify the most important questions that patients may need to answer to accurately assess their depression status.

Methods

We extracted 185 features from 7 tinnitus-related questionnaires and socio-demographic data for a cohort of 1,490 patients during screening. For these patients, we computed the depression severity after treatment (which lasted 7 days). For prediction of depression severity after treatment, we used the workflow depicted in Fig. 1.

Figure 1

Workflow. We extracted a total of 185 features from the answers to 7 questionnaires from 1,490 patients. We trained multiple classification models to predict depression status after outpatient therapy using data collected prior to therapy commencement only. Cross-validation was used for performance evaluation. We embedded model training and evaluation in an incremental feature selection wrapper that retained only features identified as important for the model.

Features

We used a total of 185 features for data analysis, including single items, sub-scales and total scales from 7 questionnaires: (a) General Depression Scale - long form18,19 (“Allgemeine Depressionsskala” - Langform; ADSL), (b) Perceived Stress Questionnaire20 (PSQ), (c) Short Form 8 Health Survey21 (SF8), (d) German version of the Tinnitus Questionnaire22 (TQ), (e) Tinnitus Localisation and Quality23 (TLQ), (f) visual analogue scales measuring tinnitus loudness, frequency and distress (TINSKAL), and (g) a sociodemographics questionnaire24 (SOZK). Most questionnaire items comprised multiple-choice questions with answers on a Likert scale. The associated ordinal features were handled as numerical features in the analysis. Categorical features, e.g. sex, marital status and graduation, were binarised using one-hot encoding. A brief overview of all features is provided in Supplementary-A.
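
As a minimal illustration of this preprocessing, the following Python sketch (with hypothetical column names; the actual item names are listed in Supplementary-A) keeps Likert items numeric and one-hot encodes categorical features:

```python
import pandas as pd

# Hypothetical excerpt of the questionnaire data
df = pd.DataFrame({
    "ADSL_adsl06": [0, 1, 3, 2],                                 # Likert item, kept numeric
    "SOZK_sex": ["f", "m", "f", "m"],                            # categorical
    "SOZK_marital": ["single", "married", "widowed", "single"],  # categorical
})

# Ordinal Likert answers are treated as numerical features as-is;
# categorical features are binarised via one-hot encoding.
encoded = pd.get_dummies(df, columns=["SOZK_sex", "SOZK_marital"], dtype=int)
print(encoded.columns.tolist())
```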

Dataset

We used data from a cohort of a total of 4,117 tinnitus patients who had been treated at the Tinnitus Center, Charité Universitaetsmedizin Berlin, Germany, between January 2011 and October 2015. All included patients had been suffering from tinnitus for 3 months or longer, were 18 years of age or older and had sufficient knowledge of the German language. Treatment comprised an intensive, multimodal and tinnitus-specific 7-day programme that included informational counselling, detailed ENT as well as psychosomatic and psychological diagnostics, cognitive-behaviour therapy interventions, relaxation exercises, and physiotherapy. Ethical approval was granted by the Charité Universitaetsmedizin Ethics Committee (reference number EA1/115/15) and informed written consent was received from all patients. All methods were performed in accordance with the relevant guidelines and regulations. Prior to the analyses, all data had been anonymised. Patients who did not complete all 7 questionnaires both before and after outpatient therapy were excluded from data analysis. From the remaining 1,502 patients, 12 patients with any missing values were excluded, leaving 1,490 patients in the analysis. Tinnitus distress was measured by the TQ total score22 with a distress cutoff value of 4622 distinguishing between “compensated” (0–46) and “decompensated” (47–84) tinnitus. Table 1 depicts baseline characteristics of all 1,490 included patients before treatment with respect to their tinnitus distress status. The distribution of the defined outcome, the discrete additive depression score (ADSL_adsl_sum), prior to and after treatment is shown in Fig. 2. The mean score upon commencing the therapy was 18.2 \(\pm \) 11.7, which was significantly larger (\(p < 0.001\)) than the mean score at the end of the therapy (13.2 \(\pm \) 10.7), indicating a positive effect of the multimodal treatment. The target variable “depression status” was created by dichotomising the depression score using a cutoff of 1619, distinguishing between “subclinical” (0–15) and “clinical” (16–60) depression. The rate of clinical depression in the 755 female patients (58.6%) was significantly larger than in the 735 male patients (45.7%; \(p < 0.001\), Chi-square test). The mean patient age was 49.8 years (SD 12.2 years).
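
For illustration, the two dichotomisations can be expressed as follows (a sketch assuming hypothetical column names for the post-treatment ADSL sum score and the baseline TQ total score):

```python
import numpy as np
import pandas as pd

data = pd.DataFrame({
    "ADSL_adsl_sum_T1": [5, 17, 30, 12],  # depression score after treatment
    "TQ_total_T0": [20, 50, 61, 44],      # tinnitus distress at baseline
})

# ADSL cutoff 16: "subclinical" (0-15) vs. "clinical" (16-60) depression
data["depression_status"] = np.where(
    data["ADSL_adsl_sum_T1"] >= 16, "clinical", "subclinical")

# TQ cutoff 46: "compensated" (0-46) vs. "decompensated" (47-84) tinnitus
data["tinnitus_distress"] = np.where(
    data["TQ_total_T0"] > 46, "decompensated", "compensated")
print(data)
```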

Table 1 Baseline characteristics of patients before treatment commencement (T0).
Figure 2

Relationship between depression score after therapy and other features. Graphical representation of the relationship between the ADSL depression score at the end of therapy (y-axis) with other features (x-axis). Higher values on y-axis represent higher depression severity. Background color represents subclinical (blue) or clinical (red) depression status at the end of therapy. Slight jittering was applied to the points to mitigate overplotting. Marginal histograms depict univariate feature distributions.

Classification model development

We employed eleven machine learning algorithms for classifier training: LASSO25 (lasso), RIDGE26 (ridge), weighted k-nearest neighbour classifier27 (wknn), Naïve Bayes classifier (nb), support vector machine28 (svm), a feed-forward neural network with one single hidden layer29 (nnet), generalised partial least squares30 (gpls), CART decision tree31 (cart), C5.0 decision tree32 (c5.0), random forest33 (rf) and gradient boosted trees34 (gbt). 10-fold stratified cross-validation was used for classifier evaluation. In \(k\)-fold cross-validation, the data is split into \(k\) partitions. Each partition serves once as test set for the model which is trained on the remainder of the partitions. Finally, the \(k\) performance results are averaged. A grid search was employed for hyperparameter tuning using area under the ROC curve (AUC) as evaluation measure. A detailed description of all tuned parameter values can be obtained from Supplementary-B.
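
The evaluation protocol can be sketched with scikit-learn as follows (an illustrative re-implementation on synthetic data, not the study's original code; an L1-penalised logistic regression stands in for the lasso classifier):

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV, StratifiedKFold

# Synthetic stand-in for the 1,490 x 185 questionnaire feature matrix
X, y = make_classification(n_samples=1490, n_features=185, random_state=0)

lasso = LogisticRegression(penalty="l1", solver="liblinear", max_iter=1000)

# Grid search over the regularisation strength, scored by AUC within
# 10-fold stratified cross-validation
grid = GridSearchCV(
    estimator=lasso,
    param_grid={"C": [0.01, 0.1, 1.0, 10.0]},
    scoring="roc_auc",
    cv=StratifiedKFold(n_splits=10, shuffle=True, random_state=0),
)
grid.fit(X, y)
print(grid.best_params_, round(grid.best_score_, 3))
```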

Feature selection

We created a novel incremental feature selection wrapper. In particular, we adapted the feature importance score for random forests33 and its generalisation to any model type35, which is referred to as “model reliance”. The model reliance estimates the change in the model error after a feature’s values are randomly permuted in the dataset. An estimate of the model reliance for a feature \(f\in F\) with respect to a model \(\zeta \), a target vector \(y\), a dataset \({\bf{X}}\) and a loss function \(L(y,\zeta ({\bf{X}}))\) is calculated as follows. First, the model error on the original training data \({{\bf{X}}}_{orig}={\bf{X}}\) is calculated: \({e}_{orig}=L(y,\zeta ({{\bf{X}}}_{orig}))\). Secondly, the values of \(f\) are randomly permuted and the model error on the perturbed dataset \({{\bf{X}}}_{perm}\) is calculated: \({e}_{perm}=L(y,\zeta ({{\bf{X}}}_{perm}))\). Finally, the model reliance \(MR(f,\zeta )\) is calculated as the ratio of the model error with the permuted feature to the model error on the original data: \(MR(f,\zeta )=\frac{{e}_{perm}}{{e}_{orig}}\). A \(MR\) value greater than 1 suggests that \(f\) is important, since randomly permuting its values breaks its relationship with the predicted target and thereby increases the model error. Since feature permutation involves a degree of randomness, \(MR\) estimates can be improved by repeating the whole procedure \(k\) times and averaging the \(k\) \(MR\) scores. In this study, \(MR\) was calculated as the average over 10 runs.
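
This estimate translates directly into code. The following sketch (assuming a fitted classifier with a `predict` method, a user-supplied loss function with nonzero error on the original data, and a NumPy feature matrix) averages the error ratio over repeated permutations:

```python
import numpy as np

def model_reliance(model, X, y, feature, loss, n_repeats=10, seed=None):
    """Estimate MR(f) = e_perm / e_orig, averaged over n_repeats permutations."""
    rng = np.random.default_rng(seed)
    e_orig = loss(y, model.predict(X))           # error on the original data
    ratios = []
    for _ in range(n_repeats):
        X_perm = X.copy()
        # Permute only the values of the inspected feature column
        X_perm[:, feature] = rng.permutation(X_perm[:, feature])
        e_perm = loss(y, model.predict(X_perm))  # error on the perturbed data
        ratios.append(e_perm / e_orig)
    return float(np.mean(ratios))
```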

In iteration \(i=1\), our incremental feature selection wrapper begins by training an initial model \({m}_{1}\) on the full feature set \({F}_{1}=F\). For each feature, the model reliance \(MR(f,{m}_{i})\) is calculated. Features with \(MR(f,{m}_{i}) > 1\) are retained for iteration \(i+1\) while the remaining features are dropped. This procedure continues until either none of the \(MR\) values exceed 1, i.e., \(\forall f\in {F}_{i}:MR(f,{m}_{i})\ \le \ 1\), or the feature set in iteration \(i\) is identical to the feature set in iteration \(i-1\), i.e., \({F}_{i}={F}_{i-1}\).
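
A compact sketch of this wrapper, reusing the `model_reliance` helper above (`train` is a hypothetical function that fits a classifier on the given feature columns):

```python
def incremental_feature_selection(train, X, y, loss, n_repeats=10):
    features = list(range(X.shape[1]))    # F_1 = F, the full feature set
    while True:
        model = train(X[:, features], y)  # train m_i on F_i
        scores = [model_reliance(model, X[:, features], y, j, loss, n_repeats)
                  for j in range(len(features))]
        kept = [f for f, mr in zip(features, scores) if mr > 1.0]
        # Stop if no MR value exceeds 1, or if F_{i+1} equals F_i
        if not kept or kept == features:
            return features, model
        features = kept                   # F_{i+1}: retained features only
```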

Results

Distribution of responses

More than half (52.2%) of the 1,490 subjects suffered from clinical depression at the start of treatment (T0) (Fig. 2). The average difference in ADSL score between T0 and the end of treatment (T1) was 5.0 points (SD 8.2). Roughly one fifth of the patients (22.7%) showed symptoms of clinical depression at T0, but not at T1. Nearly half the subjects (44.4%) reported subclinical depression at both time points, whereas only a minor fraction of patients (3.4%) reported an increase in depression severity. We found a strong correlation between the ADSL sum scores at the two time points (Spearman \(\rho =0.71\)). While we found no correlation between the ADSL score at T1 and patient age (\(\rho =-\,0.01\)), we identified a moderate correlation between the former and the initial values of the TQ total score (\(\rho =0.53\)), the PSQ stress score (\(\rho =0.53\)) and the SF8 general health score (\(\rho =-\,0.48\)).

Predictive performance of classification models

The classification models predicted depression status after therapy based on questionnaire answers and social data acquired prior to therapy with high AUC. Table 2 depicts the performance of all classification methods across iterations. The lasso classifier constructed the best overall model (iteration \(i=1\), AUC: 0.87 \(\pm \) 0.04; mean \(\pm \) SD), followed by ridge (\(i=1\), AUC: 0.86 \(\pm \) 0.04) and gbt (\(i=1\), AUC: 0.86 \(\pm \) 0.04). The AUCs of each classifier’s best model were similar, ranging from 0.81 (c5.0) to 0.87 (lasso).

Table 2 Classification performance.

Classification using the best model (lasso, \(i=1\)) based on a probability threshold of 0.5 resulted in an accuracy of 0.79, a true positive rate (sensitivity) of 0.61, a true negative rate (specificity) of 0.88, a precision of 0.72 and a negative predictive value of 0.82. The final model retained 40 features with nonzero coefficients. Fig. 3 shows the median model coefficients of these features across 10 cross-validation folds. From the ADSL questionnaire, 16 single items were included in the final model; thus, this questionnaire contributed most to the model prediction. Notably, 5 items from the tinnitus-tailored TQ questionnaire were also included in the model. Further, the model utilised 6 items from the socio-demographics questionnaire (SOZK): nationality (SOZK_nationality), which had the highest absolute model coefficient, graduation (SOZK_graduate), tinnitus duration (SOZK_tindur), employment (SOZK_job), marital status (SOZK_unmarried) and partnership status (SOZK_partnership). Table 3 provides a description of each of the 25 features with the largest absolute model coefficients in the lasso model (\(i=1\)). The complete list of features included in the final model can be consulted in Supplementary-C.
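
These operating-point metrics follow directly from the confusion matrix at the chosen probability threshold; a small sketch (with hypothetical labels and predicted probabilities) for reference:

```python
import numpy as np

def threshold_metrics(y_true, y_prob, threshold=0.5):
    y_pred = (y_prob >= threshold).astype(int)
    tp = np.sum((y_pred == 1) & (y_true == 1))
    tn = np.sum((y_pred == 0) & (y_true == 0))
    fp = np.sum((y_pred == 1) & (y_true == 0))
    fn = np.sum((y_pred == 0) & (y_true == 1))
    return {
        "accuracy":    (tp + tn) / (tp + tn + fp + fn),
        "sensitivity": tp / (tp + fn),  # true positive rate
        "specificity": tn / (tn + fp),  # true negative rate
        "precision":   tp / (tp + fp),
        "npv":         tn / (tn + fn),  # negative predictive value
    }
```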

Figure 3

Coefficients and relative inclusion of features in cross-validation of lasso model. Median (\(\pm \) median absolute deviation) coefficients (top) and absolute frequency of inclusion of features (bottom) over 10 cross-validation iterations for the best lasso model. From 185 features, the depicted 40 features exhibit a nonzero model coefficient. The average frequency of feature inclusion is represented as horizontal line in the bottom subplot. Line ranges depict MAD (right). TQ: German version of the Tinnitus Questionnaire22; PSQ: Perceived Stress Questionnaire20; SF8: Short Form 8 Health Survey21; ADSL: General Depression Scale Questionnaire - long form19; SOZK: sociodemographics questionnaire24.

Table 3 Top-25 features of lasso model.

Stability of classifiers on smaller feature sets

With the exception of svm, all classifiers showed high stability when trained on smaller feature subsets. For example, the difference in AUC between lasso on 185 features (\(i=1\)) and lasso on 6 features (\(i=7\)) was only 0.017 (a 2% drop). Several classifiers even benefitted from feature selection with respect to predictive performance. For five classifiers (gpls, nnet, cart, c5.0 and rf), the AUC of the model at the second or a later iteration was larger than the AUC of the first-iteration model that used all 185 features. The two decision tree variants cart and c5.0 profited the most from feature selection, since their best performance was reached on their smallest feature subsets, with cardinalities of 22 and 10, respectively.

Complexity-interpretability tradeoff

Our incremental feature selection wrapper reduced the number of features from 185 to 6 without substantial quality loss. The lasso model of iteration \(i=7\) provides a reasonable trade-off between a clinically useful predictive quality (AUC: 0.85 \(\pm \) 0.05) and a low model complexity (6 features) in comparison with the best overall lasso model (AUC: 0.87 \(\pm \) 0.04). Figure 4 depicts a graphical representation of the distribution of these 6 features with respect to depression_status. Patients with clinical depression reported a significantly higher mean tinnitus distress score TQ_distress (49.8 \(\pm \) 15.4) than patients with subclinical depression (33.15 \(\pm \) 15.2) (t-test, \(\alpha =0.05\)). Analogously, the means of the stress sum score PSQ_psq_sum (clinical dep.: 0.58 \(\pm \) 0.16 vs. subclinical dep.: 0.40 \(\pm \) 0.17) and the demand score PSQ_demand (clinical dep.: 0.56 \(\pm \) 0.16 vs. subclinical dep.: 0.46 \(\pm \) 0.17) were significantly higher for patients with clinical depression. Additionally, three single items that showed significant differences with respect to depression_status (Chi-square test, \(\alpha =0.05\)) were included in the model. For the seventh and tenth questions of the ADSL questionnaire (ADSL_adsl07: “During the past week I felt that everything I did was an effort”; ADSL_adsl10: “During the past week I felt fearful”), the proportion of patients with clinical depression ticking the answers “occasionally” and “most” was higher than for “rarely” and “some”. Accordingly, patients with clinical depression answered the fifth question of the SF8 questionnaire (SF8_sf05: “During the past 4 weeks, how much energy did you have?”) more often with “a little” or “none” than with “very much”, “quite a lot” or “some”.
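
The group comparisons reported here correspond to standard two-sample tests; a brief SciPy sketch on synthetic data shaped like the reported group statistics (counts in the contingency table are invented for illustration):

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)

# t-test for a continuous score (TQ_distress), clinical vs. subclinical group
tq_clinical = rng.normal(49.8, 15.4, 300)
tq_subclinical = rng.normal(33.2, 15.2, 300)
t_stat, p_val = stats.ttest_ind(tq_clinical, tq_subclinical)
print(f"t-test: p = {p_val:.3g}")

# Chi-square test for a single Likert item: answer categories (rows)
# by depression status (columns: subclinical, clinical)
table = np.array([[120,  40],   # "rarely"
                  [100,  60],   # "some"
                  [ 50,  90],   # "occasionally"
                  [ 30, 110]])  # "most"
chi2, p_val, dof, expected = stats.chi2_contingency(table)
print(f"chi-square: p = {p_val:.3g}")
```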

Figure 4

Predictive features. Distribution of features included in the lasso model of iteration \(i=7\) for the patients with subclinical and clinical depression. Green squares and labels represent mean of continuous features. ADSL: General Depression Scale Questionnaire - long form19; PSQ: Perceived Stress Questionnaire20; SF8: Short Form 8 Health Survey21; TQ: German version of the Tinnitus Questionnaire22.

Discussion

Machine learning has been used to create prediction models for depression severity based on structured patient interviews36,37. Despite their high predictive performance, we assume that our current models provide a good fit for our sample only, while other subpopulations have yet to be investigated. Nevertheless, our models are promising and may serve as a starting point for the timely prediction of depression severity and treatment course with only a small number of questionnaire items.

In agreement with previous studies, the strong association between TQ_distress and depression status confirms the link between tinnitus-related distress and depressive symptomatology as measured by the ADSL38. In addition, the large model coefficients for the PSQ overall score and demand score suggest subjective stress as a major contributing factor to depression in tinnitus patients12. From a clinical point of view, the inclusion of features from different questionnaires indicates the importance of combining items from several questionnaire types in order to accurately predict depression status. Hence, emotional epiphenomena and other sequelae must be addressed to optimally meet patients’ needs.

A previous study39 reported high sensitivity in depression recognition using a questionnaire with only two questions. One of the two questions was “During the past month, have you often been bothered by feeling down, depressed, or hopeless?”39, which closely resembles the item ADSL_adsl06 (“During the past week I felt depressed.”) that exhibited the second-largest absolute coefficient in the best lasso model (\(i=1\)) in our study.

In general, caution has to be taken when interpreting model coefficients. For example, the lasso model (\(i=1\)) identified a positive relationship between non-German citizenship and depression severity (coefficient of \(-0.370\) for the binarised nationality feature; Table 3, Fig. 3). Although ethnic differences in depression have been reported in some studies40,41, this result rather suggests a higher perceived social stress of the predominantly Turkish-born foreign patients, due to a higher unemployment rate, larger families, inferior housing, etc. in this demographic group. Further, these results may also be an effect of overfitting, since only 5.0% of the cohort population were non-German citizens. Moreover, the feature had a model reliance score below 1.0 and was consequently dropped in iteration 2. Although the age feature is included in 8 of the 11 feature sets associated with the best model per classifier, its lack of correlation with the response leaves the effect of age on the predicted depression status unclear.

With respect to the stability of models on a small number of features, it is encouraging that much simpler models are only marginally inferior to the most predictive model. In fact, 5 out of 11 classification algorithms even improved through feature selection, i.e., the AUC at the second or a later iteration was larger than at the first iteration that used all features; this includes the two decision tree variants, which reached their highest performance on their smallest feature subsets. It is promising that a model (lasso, i = 7) that used only 6 features from 4 questionnaires was only slightly inferior (AUC = 0.850) to the best overall model (AUC = 0.867). Notably, neither features on tinnitus localisation and quality nor sociodemographic features were included in this model. This result could be used to reduce the number of questions, or whole questionnaires, that patients have to answer before and after treatment.

The presented study aims to be a first step in providing physicians with guidance for therapy decisions concerning clinical depression in patients with chronic tinnitus. The models could be used to devise a suitable treatment pathway. When applying the models in practice, it is important to note that they are learned on cross-sectional data, i.e., the model separates between subclinical and clinical depression based on questionnaire answers and socio-demographics before administration of a treatment. Also, the term “clinical depression” refers to how it was modelled in this study, i.e., the depression status after treatment. One also has to take into account that the median time difference between the start and the end of the treatment programme was 7 days.

The dataset used for model development might be subject to a selection bias, since patients who did not complete all seven questionnaires both at admission and after treatment were excluded from the present data analyses. We do not treat these data as “missing values” because this might lead to the problematic suggestion of using imputation methods. We cannot use imputation because (i) a proportion of patients did not complete whole questionnaires (rather than just single items), and (ii) we do not know whether the data are missing at random. However, given that the number of patients is large, we consider our results sufficiently robust. In future work, we will investigate potential systematic differences between included and excluded patients. Further, the patient population was obtained from only one German hospital. Hence, the model needs to be externally validated on data from different populations and hospitals.

As another limitation, the incremental feature selection mechanism may miss global optima due to its greedy procedure. At each iteration, only features that are identified to contribute to the predictive performance of the classifier are retained, and the remaining features are dropped. Once a feature has been eliminated from the feature set, it is not reconsidered at any later iteration. It is possible that the inclusion of a removed feature in classifier training at a later iteration would lead to a better model. One possible solution to this problem would be to implement a mechanism that allows for backtracking or revisiting previous iterations. Alternatively, the \(MR\) cutoff value for discarding features could serve as an additional tuning parameter. By testing alternative feature sets at each iteration, a model with higher predictive performance could be generated.

Motivated by this limitation, future work includes a comparison with other feature selection algorithms. Generally, feature selection algorithms can be roughly divided into embedded methods, filter methods and wrapper methods. Embedded methods are classification methods that internally handle feature selection during model training, e.g., tree- and rule-based classifiers and regularised methods like LASSO. Filter methods are classifier-independent and quantify the relevance of a feature before model training by a scoring function. Popular filter approaches are Relief-based methods42,43, correlation-based feature selection44 and simple statistical scores, e.g., the p-value of a \(t\)-test, chi-squared test or Wilcoxon signed-rank test. (Search-based) wrapper methods define a “space” of candidate feature sets. Each candidate feature set is evaluated by a search algorithm that is wrapped around the classifier. To avoid exhaustive search, the search algorithm usually utilises a heuristic to guide the search from the previous best feature set to the next candidate set. Well-known wrapper methods include simple forward/backward selection, recursive feature elimination45, simulated annealing46,47 and genetic algorithms48. The feature selection mechanism introduced in this study can be categorised as a wrapper method.

Another limitation of this study is the lack of an independent cohort. In future work, the model needs to be externally validated, i.e., tested on data from different centres. Since the use of cross-sectional data currently limits interpretation of the depression status prediction beyond the end of therapy, the model also needs to be validated on longitudinal data in the future.