Automatic detection of major depressive disorder using electrodermal activity

Kim, Ah Young; Jang, Eun Hye; Kim, Seunghwan; Choi, Kwan Woo; Jeon, Hong Jin; Yu, Han Young; Byun, Sangwon

doi:10.1038/s41598-018-35147-3

Article
Open access
Published: 19 November 2018

Scientific Reports volume 8, Article number: 17030 (2018)

Abstract

Major depressive disorder (MDD) is a common psychiatric disorder and the leading cause of disability worldwide. However, current methods used to diagnose depression mainly rely on clinical interviews and self-reported scales of depressive symptoms, which lack objectivity and efficiency. To address this challenge, we present a machine learning approach to screen for MDD using electrodermal activity (EDA). Participants included 30 patients with MDD and 37 healthy controls. Their EDA was measured during five experimental phases consisting of baseline, a mental arithmetic task, recovery from the stress task, a relaxation task, and recovery from the relaxation task, which elicited multiple alterations in autonomic activity. Selected EDA features were extracted from each phase, and differential EDA features between two distinct phases were evaluated. By using these features as input data and performing feature selection with SVM-RFE, our decision tree classifier achieved 74% accuracy, 74% sensitivity, and 71% specificity. The most relevant features selected by SVM-RFE included differential EDA features and features from the stress and relaxation tasks. These findings suggest that automatic detection of depression based on EDA features is feasible and that monitoring changes in physiological signals when a subject is experiencing autonomic arousal and recovery may enhance discrimination power.

Introduction

Major depressive disorder (MDD) is one of the most common psychiatric disorders, affecting more than 300 million people worldwide. According to recent estimates from the World Health Organization (WHO), depression is predicted to become the most common disease experienced by individuals of all ages, and the third largest contributor to disease burden by 2020¹. Major depression is characterized by consistent irritability or feelings of sadness, and is associated with various symptoms, including sleep disturbances, loss of interest, persistent fatigue, reduced appetite, anxiety, and physical aches². When severe, these symptoms can cause professional disability, which imposes a substantial economic burden on society owing to impaired work productivity³. Depression is also a risk factor for suicide and, if left untreated, can lead to elevated mortality risk, making it a serious public health concern⁴.

To diagnose a patient with MDD, psychiatrists use standard clinical criteria, such as those outlined in the Diagnostic and Statistical Manual of Mental Disorders (DSM)⁵. Although the DSM provides clear descriptions of symptoms and MDD diagnostic guidelines, diagnoses are often limited as they rely on patients’ subjective symptom reports, derived from clinical interviews and self-report questionnaires. As such, these do not provide an assessment of depression-related physiology or allow for an objective diagnosis⁶. Furthermore, the heterogeneous nature of depression makes it difficult to diagnose, with some reporting that even highly trained clinicians are only able to agree on an MDD diagnosis between 4 and 15% of the time^7,8,9. Consequently, there has been great interest in developing more reliable methods to evaluate depression, which can significantly improve diagnostic accuracy and facilitate more precise treatment for MDD.

The use of physiological signals to assess autonomic nervous system (ANS) activity has attracted great interest in association with MDD, as accumulating evidence suggests that depression is related to ANS dysfunction. For example, previous research has shown that heart rate variability (HRV) was significantly altered in patients with MDD¹⁰. Similarly, electrodermal activity (EDA), which reflects sympathetic nervous system activity, is also sensitive to changes in clinical status¹¹. Specifically, patients with depression exhibited lower skin conductance levels (SCLs) during rest than did healthy control subjects¹². Similarly, stress-induced autonomic arousal, as measured by EDA, was found to be significantly reduced in MDD, indicating that depression may be associated with decreased autonomic responses to stimuli^13,14. Also, EDA among MDD participants was found to be distinguishable from those with other psychopathologies, such as generalized anxiety disorder (GAD) or panic disorder (PD), as individuals with GAD and PD tended to exhibit autonomic hyper-activation^13,14. These results suggest that autonomic activity, as represented by various physiological signals, serves as a quantitative marker of depression.

Based on these findings, recent studies on physiological measures have proposed automatic detection of depression using machine learning methods. For instance, HRV features, when combined with serum proteomics data, have been used to create support vector machine (SVM) algorithms for the diagnosis of MDD¹⁵. Valenza et al. used HRV parameters to assess depressive states in patients with bipolar disorder (BD) and also found that mood changes could be predicted using HRV nonlinear dynamics^16,17. Ongoing work has focused on further developing data-driven strategies for the diagnosis of psychopathologies due to the recent success of machine learning in various medical and healthcare fields¹⁸.

While previous clinical research has suggested a potential role for EDA as a biomarker of MDD, only a few studies have investigated how EDA data and machine learning methods can be used to objectively assess depression symptoms and render MDD diagnoses. The results in the literature pertaining to relevant EDA features in MDD have been inconsistent¹⁹. This poor consistency is likely a result of the heterogeneous presentation and multifactorial etiology of MDD, which substantially complicates research on this disorder²⁰. Recently, Picard and colleagues predicted the severity of depressive symptoms in 12 patients with MDD by analyzing EDA, sleep patterns, motion, and other activities assessed via built-in smartphone sensors²¹. While this work is encouraging, the study was conducted without a healthy control comparison group and data were collected across multiple modalities to test prediction models. To the best of our knowledge, a more comprehensive clinical study, specifically focused on distinguishing patients with MDD from healthy controls solely using EDA data, has rarely been reported.

Therefore, the objective of the present study was to investigate the feasibility of automatic differentiation of patients with depression from healthy controls using EDA and machine learning approaches. We measured EDA signals during five consecutive experimental phases: baseline, a mental arithmetic stress task, recovery from the stress task, a relaxation task, and recovery from the relaxation task (Fig. 1). Monitoring sympathetic activity and responses to various external stimuli may enable a better understanding of dysfunctional autonomic control in patients with MDD, possibly resulting in a more accurate diagnosis of their depression^22,23,24,25. To represent autonomic activity, mean amplitude of SCL (MSCL), standard deviation of SCL (SDSCL), skewness of SCL (SKSCL), and non-specific skin conductance response (NSSCR) were extracted from each of the five phases (Supplementary Table S1). We also calculated differences in EDA features between two phases and used these parameters as input data to test their effectiveness in automatic discrimination (Figs 2 and 3). These multiple autonomic alterations may reveal abnormal autonomic control in patients with MDD^24,25.

We have demonstrated that patients with MDD were differentiated from healthy control subjects with an accuracy of 74% using a decision tree classifier. Feature selection performed using support vector machine recursive feature elimination (SVM-RFE) revealed that differential EDA features and features acquired from the stress and relaxation tasks were highly relevant. These findings suggest that automatic detection of depression based on EDA is feasible and that monitoring changes in physiological signals when a subject is experiencing autonomic arousal and recovery may enhance discrimination power.

Results

Descriptive statistics of subjects

Participants for the present study included 37 healthy controls (21 females) and 30 patients with MDD (22 females). Table 1 shows participants’ descriptive demographic and clinical characteristic statistics. There were no significant differences in sex, age, years of education, marital status, body mass index (BMI), or smoking status. Significant differences were detected in alcohol (P^a = 0.006) and caffeine use (P^a = 0.033) between the two groups. The MDD group had significantly higher Hamilton depression rating scores (HAM-D) (P^a < 0.001), Hamilton anxiety rating scores (HAM-A) (P^a < 0.001), and stress response inventory scores (SRI) (P^a < 0.001) than did the control group.

Table 1 Demographic and clinical characteristics of patients with MDD and healthy control subjects.

Statistical analyses of EDA features

Participants were instructed to perform five different tasks while their EDA was measured (Fig. 1). The influence of both group and task on EDA features was statistically examined (Supplementary Table S2). We performed the non-parametric equivalent of a repeated-measures ANOVA as EDA features violated the normality assumption required for an ANOVA (for further details, see the Methods section)²⁶. There were significant main effects of group and task on MSCL, SDSCL, and NSSCR. SKSCL was significantly affected by the task. No features revealed significant interactions between group and task.

The difference in an EDA feature between two experimental phases may be less affected by personal variation in ANS activity when compared to the same feature extracted from a single phase^27,28. In the present study, four differential EDA features (dMSCL, dSDSCL, dSKSCL, and dNSSCR) were assessed from two distinct phases. We selected four pairs of phases, as shown in Fig. 3, to further account for various alterations in autonomic activity (for further details, see the Methods section). To avoid confusion, we use the term “primary dataset” to refer to the set of EDA features that were extracted from an individual phase (P1 to P5 in Fig. 3) and the term “derived dataset” to refer to the set of differential EDA data calculated from a pair of phases (D1 to D4 in Fig. 3). As a result, a total of 16 differential EDA features were assessed. When differential EDA features from the control and MDD groups were statistically compared, no significant differences were observed (Supplementary Table S3).

Classification of control and MDD group participants based on EDA features

Four supervised machine learning algorithms were implemented to classify control and MDD participants based on their EDA features: SVM, decision tree, k-nearest neighbors (k-NN), and Naïve Bayes. Feature selection was performed using SVM-RFE, through which the most relevant features of a total of 36 (i.e., 20 features from primary datasets and 16 differential features from derived datasets) were identified. Performance measures were evaluated using 5-fold cross-validations repeated 200 times.

Figure 4 shows the classification accuracy as a function of the number of selected features. The decision tree outperformed SVM, k-NN, and Naïve Bayes in all tested subsets of features. Performance measures of the decision tree are summarized in Table 2. The best performance (73.71% accuracy, 73.74% sensitivity, and 71.15% specificity) was achieved using the 11 most relevant features. Supplementary Table S4 summarizes performance measures for SVM, k-NN, and Naïve Bayes classifiers. Receiver operating characteristic (ROC) curves for the decision tree classifier using the optimal features are shown in Fig. 5, which compares results from the training and test sets. The areas under the curve (AUCs) for the training set and the test set were 0.88 and 0.78, respectively.

Table 2 Performance measures of the decision tree classifier assessed using 5-fold cross-validations repeated 200 times.

Table 3 shows a list of the 36 EDA features sorted in descending order of average rank determined by SVM-RFE. The rank of each feature was averaged over 5-fold cross-validations repeated 200 times. The most relevant feature was MSCL from P1, followed by dMSCL from D2, dSKSCL from D3, SKSCL from P4 (relaxation task) and MSCL from P2 (stress task).

Table 3 Average ranks of the 36 EDA features determined by SVM-RFE.

Discussion

Our results suggest that EDA features measured during autonomic arousal and recovery may provide a promising biomarker for MDD. In the present study, we added a relaxation task to our experimental protocol, in addition to a stress task, to improve our classifier’s discrimination power. We also investigated the potential benefits of including differential EDA features in prediction. As summarized in Table 3, feature ranking performed using SVM-RFE revealed that differential features were highly informative. For example, the second and third most relevant features were differential EDA features, which were dMSCL from D2 and dSKSCL from D3, respectively. Six of the top 10 features were differential EDA features. In addition, the fourth and fifth-ranked features were measured during the relaxation task (SKSCL from P4) and stress task (MSCL from P2), respectively. These results suggest that EDA features, which account for various autonomic alterations, may play a major role in developing a predictive model of MDD.

The MDD group assessed here had significantly different MSCL, SDSCL, and NSSCR values from the control group (Supplementary Table S2). These results were consistent with previous studies, which have also reported significantly altered SCL and SCR in patients with MDD¹⁹. We also found a significant main effect of the task in MSCL, SDSCL, SKSCL, and NSSCR. In general, external stimuli elicit autonomic responses, causing physiological measures, such as SCL, to deviate from the basal activity¹³. Differential EDA features did not differ between the MDD and control groups, although some differential EDA features were highly ranked by SVM-RFE. Since machine learning does not require prior assumptions about the relationships between variables, successful classification is not necessarily based on significant mean differences between the groups.

In the present study, we measured EDA to conduct machine learning-based detection of MDD. In previous studies that have used machine learning, HRV and electroencephalogram (EEG) have been widely studied as candidate physiological markers of psychiatric disorders and have demonstrated promising results. For example, HRV indices classified MDD and control groups with an accuracy of 88% using a linear discriminant analysis approach²⁵. The prediction of mood states in BD patients achieved an accuracy of 60–83% using HRV parameters with an SVM classifier¹⁷. Additionally, biomarkers extracted from EEG and analysis performed using an artificial neural network distinguished between four patient classes (MDD, BD, schizophrenia, and normal controls) with an average accuracy of 85%²⁹. A similar study involving EEG predicted antidepressant treatment effectiveness for patients with MDD with an accuracy of 88%³⁰.

The results of the present study suggest that the relatively simple measurement of EDA can also provide clinically valuable information for assessing MDD status in patients. Measuring EDA requires only two electrodes, which can be attached to minimally-obtrusive skin surfaces, such as the forearm³¹. Given this, our approach may be suitable for obtaining data while participants perform daily activities, as well as in more traditional clinical or experimental contexts. If a wearable device capable of real-time measurement of physiological signals is developed and integrated with prediction software, this technology could be implemented as a personalized healthcare system for real-time monitoring of patients. For example, a wrist-worn device with EDA and accelerometry sensors and machine learning capabilities has been successfully used to detect generalized tonic-clonic (GTC) seizure, demonstrating that wearable EDA sensors can indeed be used as a real-time disease tracking system³².

A limitation of the present study is that the classifications employed were based on a relatively small number of subjects (MDD = 30, control = 37). We are currently recruiting additional subjects and have been tracking longitudinal changes in EDA responses during a 3-month follow-up period. Further study of this cohort will allow us to extend our findings to develop a machine learning technique for disease diagnosis and severity assessment. We believe that these efforts will help us to build a more robust predictor of MDD for future use.

Conclusion

We have demonstrated here, through proof-of-principle experiments, that EDA features can be used as a biomarker for MDD. Patients with MDD and healthy control participants were classified with 74% accuracy using a decision tree algorithm. The current study was specifically designed to test the feasibility of an EDA-based classification of patients with MDD. To increase discrimination power, EDA was measured while subjects underwent both stress-inducing and relaxation tasks. In addition to extracting EDA features from each phase, we also calculated differential features that represented differences in EDA between two distinct phases. Feature selection performed using SVM-RFE revealed that differential EDA features and features measured during the stress and relaxation tasks were highly useful for discrimination. Finally, these findings suggest that the machine learning method proposed and employed here, which accounts for multiple alterations in EDA, offers great potential as an objective marker of MDD which may ultimately improve patient diagnosis and treatment.

Methods

Subjects

This study was conducted between December 2015 and October 2016 at Samsung Medical Center, Seoul, South Korea. Participants included 30 patients with MDD and 37 healthy controls. Senior psychiatrists diagnosed all patients using DSM-IV MDD criteria. On screening, those patients who scored greater than or equal to 16 on HAM-D were recruited into the MDD group³³. Healthy participants without any medical history of psychiatric disorders who responded to our advertisements were recruited into the control group. All subjects were informed about the purpose and method used in the present experiment and signed a written informed consent form. They were also financially compensated ($50) for their participation. This study was approved by the Institutional Review Board of Samsung Medical Center of Seoul, Korea (No. 2015-07-151) and performed according to all relevant guidelines.

Procedure

The experimental protocol used in the current study was designed to assess autonomic responses to stress and relaxation tasks (Figs 1 and 3). The whole procedure consisted of five phases, each of which lasted 5 min. We recorded physiological signals, including EDA, while subjects performed a specific task in each experimental phase. The first phase (‘REST’) consisted of a rest period during which subjects were instructed to sit comfortably while minimizing any movement. The second phase (‘MAT’) used a mental arithmetic test to induce mental stress in participants^34,35. Here, we asked subjects to subtract serial 7s, starting from 500, and verbally report their answers. If a mistake was made, the experimenter told the subject to repeat the calculation. In the third phase (‘REC1’), subjects were instructed to stop the arithmetic calculations and rest. During this phase, autonomic recovery from the stress task was assessed. During the fourth phase (‘RLX’), subjects were asked to relax while watching 10 consecutive images presented on a PC monitor. Each image depicted natural scenery and lasted for 30 sec. In the fifth and final phase (‘REC2’), image presentation ceased, and subjects were instructed to rest. During this phase, recovery from relaxation was assessed.

Physiological recordings

Before measurement began, subjects were seated in a comfortable armchair with a headrest. The experimenter then explained the measurement procedure to the subject. Electrodes used to sense physiological signals were attached. Subjects were then allowed to acclimate to the laboratory environment. We used ProComp Infiniti (SA7500, Computerized Biofeedback system, Thought Technology, Canada) to record EDA, as well as patient electrocardiogram (ECG), photoplethysmogram (PPG), respiration, skin temperature, and electroencephalogram (2-channel EEG for Fp1, Fp2) measures. Although only EDA values were analyzed in the present study, the other measures were taken as part of a larger study. EDA was continuously recorded using SC-Flex/Pro sensors (SA 9308 M, Thought Technology, Canada). A constant electrical voltage (0.5 V) was applied between two dry Ag/AgCl electrodes, which were strapped to the distal phalanges of the index and ring fingers of the subject’s non-dominant hand. The sampling rate was 256 Hz. Measurements were made in a humidity- and temperature-controlled room (23 °C and humidity below 50%).

Pre-processing and feature extraction

All EDA data were processed in MATLAB (Mathworks, MA, USA). Figure 2 depicts an overview of our data processing pipeline. After removing motion artifacts, EDA signals were filtered using a second-order Butterworth low-pass filter (1 Hz cutoff, IIR) and a moving average filter to reduce signal noise. Then, the EDA signals were decomposed using a convex optimization approach (cvxEDA)³⁶. The cvxEDA model describes EDA as the sum of three components: a tonic component, a phasic component, and additive white Gaussian noise. The cvxEDA algorithm is available online from Mathworks File Exchange (www.mathworks.com/matlabcentral/fileexchange/53326-cvxeda). In the present study, fixed values of α = 0.008, τ₁ = 0.7 s, τ₂ = 2 s, and γ = 0.01 were used for cvxEDA parameters. The tonic component, represented by skin conductance level (SCL), reflects the overall degree of arousal, which increases with alertness and decreases with relaxation³⁷. The phasic component, represented by skin conductance response (SCR), reflects short-term responses to external stimuli (1–5 sec after stimulus onset)^38,39. From these, we calculated four features: mean amplitude of SCL (MSCL), standard deviation of SCL (SDSCL), skewness of SCL (SKSCL), and non-specific SCR (NSSCR), as summarized in Supplementary Table S1³⁸.
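For illustration only, the three SCL statistics named above might be computed from an already-decomposed tonic signal as in the following Python sketch. It is not the MATLAB code used in the study. The function name `scl_features` is hypothetical, the choice of a sample standard deviation (n − 1) and a population-moment skewness is an assumption that mirrors MATLAB's default `std` and `skewness` behaviour, and NSSCR is omitted because its exact definition is given only in Supplementary Table S1.

```python
from statistics import mean, stdev


def scl_features(scl):
    """Return (MSCL, SDSCL, SKSCL) for one window of tonic SCL samples.

    `scl` is assumed to be the tonic component already produced by the
    decomposition step; filtering and cvxEDA are not reproduced here.
    """
    n = len(scl)
    if n < 2:
        raise ValueError("need at least two samples")
    mscl = mean(scl)
    sdscl = stdev(scl)  # sample SD (divides by n - 1); an assumption
    # Second and third central moments, dividing by n (population form).
    m2 = sum((x - mscl) ** 2 for x in scl) / n
    m3 = sum((x - mscl) ** 3 for x in scl) / n
    skscl = m3 / m2 ** 1.5 if m2 > 0 else 0.0
    return mscl, sdscl, skscl
```

A positive SKSCL indicates a window whose SCL values have a longer tail toward high conductance, while a symmetric window gives a value of zero.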

As explained above, the experimental protocol was composed of five phases (REST, MAT, REC1, RLX, REC2). A 75-sec interval was selected for each phase, and EDA features during this interval were calculated (Fig. 3). We selected specific time windows to ensure that EDA features accurately reflected activated or recovered levels of autonomic activity in each phase. The length of the window was 75 sec, which was 25% of the total duration of the phase, which was 300 sec (5 min) long. A similar approach was used in previous studies. For example, Brouwer et al. used 2-min long stimuli to induce emotional states and evaluated skin conductance during the first 30-sec period, which was 25% of the total length of the stimulus interval⁴⁰. Pruneti et al. used three experimental phases consisting of 6-min rest, 4-min stress, and 6-min recovery phases. They used the last minute of the rest and recovery phases each (1/6 = 16.7%) and the first minute of the stress phase (1/4 = 25%) to evaluate SCL¹⁴. In our study, we chose a conservative ratio, i.e., 25%, to define the window length. In the present study, the final 75-sec period was selected to represent baseline activity for the REST phase. For the MAT phase, the first 75-sec period was used to measure activation before habituation could affect measured activation levels. For the REC1 and REC2 phases, the final 75-sec period was used to allow sufficient recovery time. Similarly, we selected the final 75-sec period of the RLX phase to provide participants with enough time to respond to the relaxation task. The set of EDA features obtained from an individual phase is referred to as the “primary dataset”, as shown in Fig. 3.
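The window selection described above can be sketched in Python as follows. This is a minimal illustration, not the study's implementation: the function `select_window` and the mapping `WINDOW_POSITION` are hypothetical names, and the window length is left as a parameter so that the sketch does not commit to a particular value. Only the first/final placement per phase and the 256 Hz sampling rate are taken from the text.

```python
# Which end of each phase the analysis window is taken from, per the text:
# the first part of MAT (before habituation), the final part of the others.
WINDOW_POSITION = {
    "REST": "final",
    "MAT": "first",
    "REC1": "final",
    "RLX": "final",
    "REC2": "final",
}


def select_window(samples, fs_hz, window_sec, position):
    """Return the first or final `window_sec` seconds of one phase."""
    n = int(round(window_sec * fs_hz))  # window length in samples
    if n <= 0 or n > len(samples):
        raise ValueError("window does not fit inside the phase")
    if position == "first":
        return samples[:n]
    if position == "final":
        return samples[-n:]
    raise ValueError("position must be 'first' or 'final'")
```

With recordings sampled at 256 Hz, a call such as `select_window(phase_samples, 256, window_sec, WINDOW_POSITION["MAT"])` would return the opening segment of the stress phase.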

Also, four differential EDA features (dMSCL, dSDSCL, dSKSCL, and dNSSCR) were calculated from two distinct phases. In the present study, we selected four pairs of phases and used the term “derived dataset” to refer to a set of differential EDA data calculated from a pair of phases (Fig. 3). To measure the extent of reaction to the mental stress phase, EDA features from the REST phase were subtracted from those from the MAT phase (D1). To estimate the difference in autonomic activity before and after stress, EDA features from the REST phase were subtracted from those from the REC1 phase (D2). Similarly, reactivity to the relaxation task was measured by subtracting EDA features from the RLX phase from those from the REC1 phase (D3). To estimate the difference in autonomic activity before and after the relaxation task, EDA features from the REC2 phase were subtracted from those from the REC1 phase (D4). A total of 16 differential EDA features were calculated.
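As a rough sketch, the 16 differential features could be assembled from the per-phase features as shown below. The names `PAIRS` and `differential_features`, the dictionary layout, and the key format are assumptions made for illustration. The subtraction order follows the wording of the paragraph above, and the fourth pair is labelled D4 in line with the D1 to D4 naming of Fig. 3.

```python
FEATURES = ("MSCL", "SDSCL", "SKSCL", "NSSCR")

# (derived dataset, phase subtracted from, phase subtracted), as worded above.
PAIRS = (
    ("D1", "MAT", "REST"),   # reaction to the stress task
    ("D2", "REC1", "REST"),  # before versus after stress
    ("D3", "REC1", "RLX"),   # reactivity to the relaxation task
    ("D4", "REC1", "REC2"),  # before versus after relaxation
)


def differential_features(primary):
    """Map {phase: {feature: value}} to 16 differential feature values."""
    derived = {}
    for label, minuend, subtrahend in PAIRS:
        for name in FEATURES:
            key = f"{label}_d{name}"
            derived[key] = primary[minuend][name] - primary[subtrahend][name]
    return derived
```

Together with the 20 primary features (four features in each of five phases), this yields the 36 candidate inputs passed to feature selection.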

Statistical analyses

Statistical analyses were performed using MATLAB and R software (The R Foundation for Statistical Computing, Vienna, Austria). Mann-Whitney U tests were used to compare age, years of education, BMI, alcohol use, caffeine use, HAM-D, HAM-A, and SRI between the MDD and control groups, as these factors were not normally distributed. Sex, marital status, and smoking between the two groups were compared by chi-square tests. All four EDA features violated the normality assumption required for an ANOVA. Therefore, the effects of group and task on EDA features were tested by the non-parametric equivalent of a repeated-measures ANOVA using the R statistics package “nparLD”²⁶. This method was developed to assess longitudinal data from repeated measurements based on a factorial design. The non-parametric test provided ANOVA-type statistics for examination of the following hypotheses: no between-subjects effect, no within-subject effect, and no interaction. In the present study, group was used as the between-subjects factor and task as the within-subject factor. Further methodological details can be found in the previous study conducted by Brunner and Puri⁴¹. Mann-Whitney U tests were used to compare differential EDA features between the MDD and control groups, as these features did not meet the normality assumptions. To control type I error of multiple comparisons, P-values were adjusted for the false discovery rate (FDR) using the Benjamini-Hochberg method at a level of 0.05⁴². For all statistical tests, an adjusted P^a-value of less than or equal to 0.05 was considered significant.
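The Benjamini-Hochberg adjustment mentioned above can be illustrated with a short Python sketch. The analyses in the study were run in MATLAB and R, so this is only a stand-in showing the standard step-up procedure; the function name `benjamini_hochberg` is hypothetical.

```python
def benjamini_hochberg(p_values):
    """Return FDR-adjusted p-values in the same order as the input."""
    m = len(p_values)
    # Indices of the p-values sorted from smallest to largest.
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * m / rank)
        adjusted[i] = running_min
    return adjusted
```

Each adjusted value is then compared with the chosen significance level in place of the raw P-value.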

Feature selection and classification

We used four supervised machine learning algorithms, support vector machine (SVM), decision tree, k-nearest neighbors (k-NN), and Naïve Bayes, to classify the MDD and control groups using EDA features as input data. As shown in Fig. 2, the 5-fold cross-validation method was used to split data and evaluate classifier performance. Training was performed using 80% of the total data set (4 of 5 folds) and testing was performed with the remaining 20%. We employed an SVM-RFE algorithm as a feature selection method; this algorithm ranked features based on a backward sequential selection method, which removed the features one by one⁴³. To ensure that all classifiers were trained using the same training data and the same subset of features, SVM-RFE was used to first determine the feature rankings for a given training data set. Then, the ranking results and the same training set were applied to train SVM, decision tree, k-NN, and Naïve Bayes models. The prediction error of each classifier model was evaluated using the unseen test data set. This process was repeated five times for each fold to complete the 5-fold cross-validation. It is important to note that cross-validation was external to the feature selection process to evaluate the prediction error accurately. To reduce variance in the prediction error, the 5-fold cross-validation was repeated 200 times, and validation results from all repetitions were averaged to finally report the model’s predictive performance⁴⁴. We also averaged the rank of each feature from the 200 repeats to provide an estimate of the relative importance of the feature.
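The key point of this procedure, that features are ranked using only the training folds before a classifier is fitted and scored on the held-out fold, can be sketched as follows. This is a simplified Python outline under several assumptions: the folds are formed by plain shuffling (the text does not say whether stratification was used), accuracy is pooled over all held-out predictions, and `rank_features` and `fit` are hypothetical caller-supplied callables standing in for SVM-RFE and the classifiers, which are not implemented here.

```python
import random


def repeated_kfold(n_samples, k=5, repeats=200, seed=0):
    """Yield (train_idx, test_idx) for every fold of every repetition."""
    rng = random.Random(seed)
    for _ in range(repeats):
        order = list(range(n_samples))
        rng.shuffle(order)
        folds = [order[i::k] for i in range(k)]  # k near-equal folds
        for held_out in range(k):
            test_idx = folds[held_out]
            train_idx = [i for f, fold in enumerate(folds)
                         if f != held_out for i in fold]
            yield train_idx, test_idx


def cross_validated_accuracy(X, y, rank_features, fit, n_keep,
                             k=5, repeats=200, seed=0):
    """Pooled held-out accuracy with feature ranking inside each split.

    rank_features(X_train, y_train) -> column indices, most relevant first.
    fit(X_train_selected, y_train)  -> function mapping one row to a label.
    """
    correct = total = 0
    for train_idx, test_idx in repeated_kfold(len(y), k, repeats, seed):
        X_train = [X[i] for i in train_idx]
        y_train = [y[i] for i in train_idx]
        # Ranking sees the training folds only, never the held-out fold.
        keep = rank_features(X_train, y_train)[:n_keep]
        predict = fit([[row[j] for j in keep] for row in X_train], y_train)
        for i in test_idx:
            correct += int(predict([X[i][j] for j in keep]) == y[i])
            total += 1
    return correct / total
```

Passing the same ranking to several `fit` callables within one split would reproduce the idea that all classifiers share the same training data and feature subset.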

In binary classification, SVM maps training data into a multidimensional feature space and finds a hyperplane in the feature space that maximizes the margin between the two classes⁴⁵. For both SVM-RFE and classification, we used a linear kernel function with the regularization parameter C = 1. A decision tree performs classification using a recursive binary partition of the predictor space^46,47. A decision tree model begins at the root node and branches to internal nodes. Decisions to split are made by impurity measures, such as the Gini index or information gain, and this process continues until a final node is reached with a predicted class value. Here, we used the Gini index for impurity measures. A k-NN model is a type of instance-based learning⁴⁸. A k-NN classifier estimates the class label of a new observation by the majority class of the k closest neighbors according to a distance metric. In the present study, the k value was set to three, and Euclidean distance was used as the distance metric. Naïve Bayes is a probabilistic learning method based on the application of Bayes’ theorem⁴⁹. It estimates the parameters for a feature’s probability distribution using training data and assuming that predictors are conditionally independent in every class. Then, with new test data, Naïve Bayes computes the posterior probability that the new data belong to each class and chooses the class with the largest posterior probability. For the present study, we assumed that EDA features followed a Gaussian distribution. We adopted accuracy, sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), and area under the curve (AUC) as the performance measures.
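Three of the quantities described above are simple enough to illustrate directly. The Python sketch below shows the Gini impurity of a node, a k-NN prediction with k = 3 and Euclidean distance, and the threshold-based performance measures computed from predicted labels. All function names are hypothetical, ties in the k-NN vote are resolved arbitrarily, AUC is not included, and none of this is the code used in the study.

```python
import math
from collections import Counter


def gini_impurity(labels):
    """Gini index of one node: 1 minus the sum of squared class shares."""
    n = len(labels)
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())


def knn_predict(train_X, train_y, query, k=3):
    """Majority label among the k training points closest to `query`."""
    nearest = sorted(zip(train_X, train_y),
                     key=lambda pair: math.dist(pair[0], query))[:k]
    return Counter(label for _, label in nearest).most_common(1)[0][0]


def _ratio(num, den):
    # Undefined ratios (empty denominator) are reported as NaN.
    return num / den if den else float("nan")


def binary_metrics(y_true, y_pred, positive=1):
    """Accuracy, sensitivity, specificity, PPV and NPV from labels."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    tn = sum(t != positive and p != positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    return {
        "accuracy": _ratio(tp + tn, len(pairs)),
        "sensitivity": _ratio(tp, tp + fn),
        "specificity": _ratio(tn, tn + fp),
        "ppv": _ratio(tp, tp + fp),
        "npv": _ratio(tn, tn + fn),
    }
```

Here the positive class would correspond to the MDD group, so sensitivity is the share of patients correctly identified and specificity the share of controls correctly identified.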

References

World Health Organization. The Global Burden of Disease: 2004 update. 2004 Update, https://doi.org/10.1038/npp.2011.85 (2008).
Luppa, M., Heinrich, S., Angermeyer, M. C., König, H.-H. & Riedel-Heller, S. G. Cost-of-illness studies of depression. J. Affect. Disord. 98, 29–43 (2007).
Wang, P. S., Simon, G. & Kessler, R. C. The economic burden of depression and the cost-effectiveness of treatment. Int. J. Methods Psychiatr. Res. 12, 22–33 (2003).
Franklin, J. C. et al. Risk Factors for Suicidal Thoughts and Behaviors: A Meta-Analysis of 50 Years of Research. Psychol. Bull. (in press), 187–232 (2016).
American Psychiatric Association. Diagnostic and statistical manual of mental disorders. (American Psychiatric Publishing, 2013).
Jacob, A. Limitations of Clinical Psychiatric Diagnostic Measurements. J. Neurol. Disord. 1 (2013).
Lieblich, S. M. et al. High heterogeneity and low reliability in the diagnosis of major depression will impair the development of new drugs. Br. J. Psychiatry Open 1, e5–e7 (2015).
Regier, D. A. et al. DSM-5 field trials in the United States and Canada, part II: Test-retest reliability of selected categorical diagnoses. Am. J. Psychiatry 170, 59–70 (2013).
Craddock, N. & Mynors-Wallis, L. Psychiatric diagnosis: Impersonal, imperfect and important. Br. J. Psychiatry 204, 93–95 (2014).
Nahshoni, E. et al. Heart rate variability in patients with major depression. Psychosomatics 45, 129–134 (2004).
Vetrugno, R., Liguori, R., Cortelli, P. & Montagna, P. Sympathetic skin response. Clin. Auton. Res. 13, 256–270 (2003).
Otto, M. W. et al. De novo fear conditioning across diagnostic groups in the affective disorders: Evidence for learning impairments. Behav. Ther. 45, 619–629 (2014).
Pruneti, C., Saccò, M., Cosentino, C. & Sgromo, D. Relevance of Autonomic Arousal in the Stress Response in Psychopathology. J. Basic Appl. Sci. 12, 176–184 (2016).
Pruneti, C., Cosentino, C., Sgromo, M. & Innocenti, A. Skin Conductance Response as a decisive variable in individuals with a DSM-IV TR Axis I diagnosis. JMED Res. 565009, https://doi.org/10.5171/2014.565009 (2014).
Kim, E. Y. et al. Diagnosis of major depressive disorder by combining multimodal information from heart rate dynamics and serum proteomics using machine-learning algorithm. Prog. Neuro-Psychopharmacology Biol. Psychiatry 76, 65–71 (2017).
Valenza, G. et al. Point-process nonlinear autonomic assessment of depressive states in bipolar patients. Methods Inf. Med. 53, 296–302 (2014).
Valenza, G. et al. Predicting Mood Changes in Bipolar Disorder Through Heartbeat Nonlinear Dynamics. IEEE J. Biomed. Heal. Informatics 20, 1034–1043 (2016).
Acharya, U. R. et al. Computer-aided diagnosis of depression using EEG signals. Eur. Neurol. 73, 329–336 (2015).
Vahey, R. & Becerra, R. Galvanic skin response in mood disorders: A critical review. Int. J. Psychol. Psychol. Ther. 15, 275–304 (2015).
Jentsch, M. C. et al. Biomarker approaches in major depressive disorder evaluated in the context of current hypotheses. Biomark. Med. 9, 277–297 (2015).
Ghandeharioun, A. et al. Objective Assessment of Depressive Symptoms with Machine Learning and Wearable Sensors Data. In International Conference on Affective Computing and Intelligent Interaction (2017).
Hatch, J. P. & Saito, I. Growth and development of biofeedback: A bibliographic update. Biofeedback Self. Regul. 15, 37–46 (1990).
Crocetti, A. et al. Psychophysiological Stress Profile: A Protocol to Differentiate Normal vs Pathological Subjects. Act. Nerv. Super. Rediviva 52, 241–245 (2010).
Sun, G., Shinba, T., Kirimoto, T. & Matsui, T. An objective screening method for major depressive disorder using logistic regression analysis of heart rate variability data obtained in a mental task paradigm. Front. Psychiatry 7, 1–7 (2016).
Matsui, T., Kakisaka, K. & Shinba, T. Impaired parasympathetic augmentation under relaxation in patients with depression as assessed by a novel non-contact microwave radar system. J. Med. Eng. Technol. 40, 15–19 (2016).
Noguchi, K., Gel, Y. R., Brunner, E. & Konietschke, F. nparLD: An R software package for the nonparametric analysis of longitudinal data in factorial experiments. J. Stat. Softw. 50, 1–23 (2012).
Rottenberg, J., Salomon, K., Gross, J. J. & Gotlib, I. H. Vagal withdrawal to a sad film predicts subsequent recovery from depression. Psychophysiology 42, 277–281 (2005).
Carroll, D., Phillips, A. C., Hunt, K. & Der, G. Symptoms of depression and cardiovascular reactions to acute psychological stress: Evidence from a population study. Biol. Psychol. 75, 68–74 (2007).
Khodayari-Rostamabad, A. et al. Diagnosis of psychiatric disorders using EEG data and employing a statistical decision model. Conf Proc IEEE Eng Med Biol Soc 2010, 4006–4009 (2010).
Khodayari-Rostamabad, A., Reilly, J. P., Hasey, G. M., de Bruin, H. & MacCrimmon, D. J. A machine learning approach using EEG data to predict response to SSRI treatment for major depressive disorder. Clin. Neurophysiol. 124, 1975–1985 (2013).
Poh, M. Z., Swenson, N. C. & Picard, R. W. A wearable sensor for unobtrusive, long-term assessment of electrodermal activity. IEEE Trans. Biomed. Eng. 57, 1243–1252 (2010).
Poh, M. Z. et al. Convulsive seizure detection using a wrist-worn electrodermal activity and accelerometry biosensor. Epilepsia 53, 93–97 (2012).
Hamilton, M. Development of a rating scale for primary depressive illness. Br. J. Clin. Psychol. 6, 278–296 (1967).
Zarjam, P., Epps, J., Chen, F. & Lovell, N. H. Estimating cognitive workload using wavelet entropy-based features during an arithmetic task. Comput. Biol. Med. 43, 2186–2195 (2013).
Noteboom, J. T., Barnholt, K. R. & Enoka, R. M. Activation of the arousal response and impairment of performance increase with anxiety and stressor intensity. J. Appl. Physiol. 91, 2093–2101 (2001).
Greco, A., Valenza, G., Lanata, A., Scilingo, E. P. & Citi, L. CvxEDA: A convex optimization approach to electrodermal activity processing. IEEE Trans. Biomed. Eng. 63, 797–804 (2016).
Nagai, Y., Critchley, H. D., Featherstone, E., Trimble, M. R. & Dolan, R. J. Activity in ventromedial prefrontal cortex covaries with sympathetic skin conductance level: A physiological account of a ‘default mode’ of brain function. Neuroimage 22, 243–251 (2004).
Boucsein, W. et al. Publication recommendations for electrodermal measurements. Psychophysiology 49, 1017–1034 (2012).
Dawson, M., Schell, A. M. & Filion, D. L. The Electrodermal System. In Cacioppo, J. T., Tassinary, L. G. & Berntson, G. G. (Eds), Handbook of Psychophysiology 159–181 (2007).
Brouwer, A.-M., van Wouwe, N., Mühl, C., van Erp, J. & Toet, A. Perceiving blocks of emotional pictures and sounds: effects on physiological variables. Front. Hum. Neurosci. 7, 1–10 (2013).
Brunner, E. & Puri, M. L. Nonparametric methods in factorial designs. Stat. Pap. 42, 1–52 (2001).
Benjamini, Y. & Hochberg, Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J. R. Stat. Soc. B 57, 289–300 (1995).
Guyon, I. Gene Selection for Cancer Classification. 389–422 (2002).
Rodríguez, J. D., Pérez, A. & Lozano, J. A. Sensitivity analysis of kappa-fold cross validation in prediction error estimation. IEEE Trans. Pattern Anal. Mach. Intell. 32, 569–575 (2010).
Vapnik, V. N. The Nature of Statistical Learning Theory. (Springer, 2000).
Moisen, G. G. Classification and Regression Trees. Encycl. Ecol. 582–588 (2008).
Speybroeck, N. Classification and regression trees. Int. J. Public Health 57, 243–246 (2012).
Manning, C. D., Raghavan, P. & Schütze, H. An Introduction to Information Retrieval. (Cambridge University Press, 2008).
John, G. H. G. & Langley, P. Estimating Continuous Distributions in Bayesian Classifiers. Proc. Elev. Conf. Uncertain. Artif. Intell. Montr. Quebec, Canada 1, 338–345 (1995).

Acknowledgements

This work was partly supported by the Institute for Information & Communications Technology Promotion (IITP) grant funded by the Korea government (MSIT) (No. 2015-0-00062, the development of skin adhesive patches for the monitoring and prediction of mental disorders) and the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (No. 2017R1C1B5017730).

Author information

Authors and Affiliations

Bio-Medical IT Convergence Research Division, Electronics and Telecommunications Research Institute (ETRI), Daejeon, Korea
Ah Young Kim, Eun Hye Jang, Seunghwan Kim & Han Young Yu
Department of Psychiatry, Depression Center, Samsung Medical Center, Sungkyunkwan University School of Medicine, Seoul, Korea
Kwan Woo Choi & Hong Jin Jeon
Department of Electronics Engineering, Incheon National University, Incheon, Korea
Sangwon Byun

Contributions

A.Y.K., E.H.J., S.B., H.J.J., H.Y.Y. and S.K. designed research. A.Y.K., E.H.J., S.B., H.Y.Y., H.J.J. and K.W.C. performed research. A.Y.K., E.H.J. and S.B. analyzed the data. A.Y.K. and S.B. wrote the paper. All authors commented on the manuscript.

Corresponding authors

Correspondence to Han Young Yu or Sangwon Byun.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Kim, A.Y., Jang, E.H., Kim, S. et al. Automatic detection of major depressive disorder using electrodermal activity. Sci Rep 8, 17030 (2018). https://doi.org/10.1038/s41598-018-35147-3

Received: 31 January 2018
Accepted: 31 October 2018
Published: 19 November 2018
DOI: https://doi.org/10.1038/s41598-018-35147-3
