Multimodal digital assessment of depression with actigraphy and app in Hong Kong Chinese

Chen, Jie; Chan, Ngan Yin; Li, Chun-Tung; Chan, Joey W. Y.; Liu, Yaping; Li, Shirley Xin; Chau, Steven W. H.; Leung, Kwong Sak; Heng, Pheng-Ann; Lee, Tatia M. C.; Li, Tim M. H.; Wing, Yun-Kwok

doi:10.1038/s41398-024-02873-4

Download PDF

Article
Open access
Published: 18 March 2024

Multimodal digital assessment of depression with actigraphy and app in Hong Kong Chinese

Translational Psychiatry volume 14, Article number: 150 (2024) Cite this article

606 Accesses
10 Altmetric
Metrics details

Subjects

Abstract

There is an emerging potential for digital assessment of depression. In this study, Chinese patients with major depressive disorder (MDD) and controls underwent a week of multimodal measurement including actigraphy and app-based measures (D-MOMO) to record rest-activity, facial expression, voice, and mood states. Seven machine-learning models (Random Forest [RF], Logistic regression [LR], Support vector machine [SVM], K-Nearest Neighbors [KNN], Decision tree [DT], Naive Bayes [NB], and Artificial Neural Networks [ANN]) with leave-one-out cross-validation were applied to detect lifetime diagnosis of MDD and non-remission status. Eighty MDD subjects and 76 age- and sex-matched controls completed the actigraphy, while 61 MDD subjects and 47 controls completed the app-based assessment. MDD subjects had lower mobile time (P = 0.006), later sleep midpoint (P = 0.047) and Acrophase (P = 0.024) than controls. For app measurement, MDD subjects had more frequent brow lowering (P = 0.023), less lip corner pulling (P = 0.007), higher pause variability (P = 0.046), more frequent self-reference (P = 0.024) and negative emotion words (P = 0.002), lower articulation rate (P < 0.001) and happiness level (P < 0.001) than controls. With the fusion of all digital modalities, the predictive performance (F1-score) of ANN for a lifetime diagnosis of MDD was 0.81 and 0.70 for non-remission status when combined with the HADS-D item score, respectively. Multimodal digital measurement is a feasible diagnostic tool for depression in Chinese. A combination of multimodal measurement and machine-learning approach has enhanced the performance of digital markers in phenotyping and diagnosis of MDD.

Emotions and brain function are altered up to one month after a single high dose of psilocybin

Article Open access 10 February 2020

Self-supervised learning for human activity recognition using 700,000 person-days of wearable data

Article Open access 12 April 2024

Effect of breathwork on stress and mental health: A meta-analysis of randomised-controlled trials

Article Open access 09 January 2023

Introduction

Depression is the leading cause of health-related burden globally, affecting an estimated 300 million population [1]. The Hong Kong Mental Morbidity Survey showed that less than 30% of people with common mental disorders had sought help from mental health services [2]. A lack of awareness, stigma [3], and inaccessibility may all contribute to the low rate of help-seeking behaviors. The current gold standard for the diagnosis of depression relies on clinical interview, which serves as a major access block. The insufficient mental health resources are particularly accentuated in a number of Asian and developing regions [4]. In the past decade, there has been a surge of interest in digital phenotyping as a promising, revolutionary, and cost-effective solution [5]. Applying multimodal digital assessment together with artificial intelligence could provide continuous, unobtrusive, and objective assessment without the clinician’s involvement [6].

Digital phenotyping includes a series of digital markers, as depression is more than simply sadness but encompasses a series of physiological, cognitive, mood, and rest-activity changes. For example, emerging data suggested the validity of automatic facial expression analysis for differentiating depression severity [7]. Prosodic language features have been commonly observed in depression [8]. Other passive digital features including sleep, physical activity, location, and phone use data were also studied [9]. The integration of various multimodal digital markers potentially offers a better discriminative power than a single modality [10, 11].

However, there are several gaps regarding the application of digital phenotyping in clinical settings. Few studies have reported the findings of integrating both passive and active features (such as language and facial features) in assessing depression. Besides, the cross-cultural/ethnicity differences in language [12] and lifestyle may suggest the need for local development and validation of digital phenotyping systems. In this study, we utilized a one-week multimodal measurement (actigraphy and our self-developed mobile app named D-MOMO to record rest-activity, facial expressions, voice, and subjective mood state) to assess depression in Hong Kong Chinese.

Methods

Study design

The study design is a case–control study that was conducted between June 2021 and March 2023. The study was conducted in compliance with the Declaration of Helsinki, and approved by the Joint Chinese University of Hong Kong—New Territories East Cluster Clinical Research Ethics Committee (Ref No: 2020.492). Informed written consent was obtained from all subjects.

Study population

The majority of the MDD subjects were recruited from a regional public psychiatric clinic in Hong Kong. These subjects were clinically diagnosed by their attending psychiatrists. In addition, around 22% of the MDD subjects were recruited from the community based on the Structured Clinical Interview for DSM-5—Clinician Version (SCID-5-CV) [13] by a trained medical researcher. The controls were recruited from both the community and sleep centers. MDD patients and controls aged 18 to 65 were recruited. The controls were free from psychiatric diagnosis based on the SCID-5 interview. Exclusion criteria for all study subjects were: (1) Lifetime history of bipolar disorder, schizophrenic-spectrum disorder, dementia, intellectual disability, and neurological disorders; (2) Presence of clear confounding factors (e.g., face injury, speech disorder, night shift workers and motor deficits).

Multimodal measurement via the D-MOMO app and actigraphy

The Android version of D-MOMO was available in June 2021, and the iOS version was published in April 2022. The app includes a mood diary with a sampling rate of 4 times per day for continuous 7 days. The interval between two time points was 4 h. Subjects preset their first recording time according to their preference and the rest of the three time slots were then determined automatically. For each time, subjects had 1-h buffer to complete the measurement. The subjects first verbally answered two questions (How is your mood right now? What have you done during the last four hours?) with cameras and microphones on. The self-evaluated happiness level was then measured on a Likert scale (0 = very unhappy to 10 = very happy). Rest-activity pattern was measured by Actiwatch Spectrum Plus or Actiwatch Spectrum PRO, Philips Respironics (1 min per epoch) for continuous 7 days. Subjects were instructed to wear the actigraphy on their non-dominant wrist [14].

Clinical assessment

A 17-item Hamilton depression scale (17-HDS) was used to assess the severity of depression. A cutoff score of 7 or lower and a lifetime history of MDD was indicative of remission [15]. Subjects also completed the Hospital Anxiety and Depression Scale (HADS) [16] for assessing depressive (HADS-D) and anxiety symptoms (HADS-A). The Insomnia severity index (ISI) [17] and reduced Morningness-eveningness questionnaire (rMEQ) [18] measured the severity of insomnia and chronotype preference.

Actigraphy data processing

Physical activity and sleep estimation were processed by Philips Actiware software 6. A mobile score was defined by 4 or more activity counts per minute. Then, the proportion of mobile time (%mobile) was calculated. Average activity counts per minute (cpm) reflected the quantitative level of physical activity. Sleep parameters included sleep duration, sleep efficiency, sleep midpoint (midpoint between sleep onset and sleep offset), intra-individual variability of sleep midpoint, and intra-individual variability of sleep duration. The circadian rhythm was analyzed using “cosinor” and “nparACT” R packages to perform the cosinor and nonparametric analysis, respectively [19, 20]. More details were provided in Supplementary Methods.

Speech analysis

Acoustic features (fundamental frequency [F0] mean, F0 variability, articulation rate, pause duration mean, pause variability, and pause rate) were determined via Praat [21] and Tencent Cloud. As background noise would interfere, we performed speech analysis only in those audio recordings under a quiet environment setting. In addition, we excluded the speech segments without pause, as pause was one of our studied outcomes. More details were provided in Supplementary Methods.

Natural language processing (NLP)

The transcripts generated by Tencent Cloud were manually checked and revised by research personnel who were native Cantonese speakers. For word segmentation, we employed a deep learning-based Chinese word segmentation engine fastHan [22]. After segmentation, words were translated into linguistic and psychologically meaningful categories using the Chinese version of the Language Inquiry and Word Count (LIWC) dictionary [23]. Two categories (self-reference and negative emotion words) were extracted based on former studies [24, 25]. Then, the proportion of words of each category relative to text length (in percentage) was calculated.

Facial expression analysis

Facial action units (AU) were extracted via OpenFace 2.2.0 software [26]. AUs are all visually discernible facial movements proposed by Paul Ekman [27], and have been widely used for facial expression analysis [7]. We used the presence information (0 or 1) to calculate the proportion of each AU for each video recording. Then, an average value for each AU was calculated based on all the videos of the whole week. Among nine AUs that are related to emotional expression [28], we selected five of them (AU1: Inner brow-raising, AU4: Brow lowering, AU6: Cheek raising, AU12: Lip corner pulling, AU15: Lip corner depressing) for analysis, as they do not occur in speech-affected facial areas [27]. As the study was conducted during the COVID pandemic period, the facial masks might interfere with facial expression, and only videos without masks were studied.

Statistical analysis

All data acquired was processed and analyzed using SPSS 27.0 and R software. Descriptive data was presented as means ± SD or frequencies (%). Shapiro–Wilk test was applied to assess normality. To investigate intergroup differences of means between two groups, a two-sample t-test and Mann–Whitney U-test were used for normally and non-normally distributed continuous variables, respectively. To test intergroup differences of means among three groups, the one-way analysis of variance (ANOVA) test, and Kruskal–Wallis test was applied. The Chi-square test was used to compare the intergroup differences for categorical variables. In post hoc analyses, the Bonferroni post hoc test was used for multiple comparisons. Since age was not matched among the three groups in actigraphy measurement, we performed linear regression analysis for actigraphy data (dependent variable) to study its association with remission/non-remission after adjustment for age and sex. A P < 0.05 was considered statistical significance, and hypothesis tests were 2 sided.

For predicting lifetime diagnosis of depression and non-remission, we used R package “cutpointr” to determine the optimal cutoff point with a compromise between sensitivity and specificity (balanced data) or sensitivity and PPV (imbalanced data), when features were one-dimensional. Machine-learning (ML) approach was applied in case of multi-dimensional features, at which the performance of seven most commonly used supervised ML methods for disease prediction (Random Forest [RF], Logistic regression [LR], Support vector machine [SVM], K-Nearest Neighbors [KNN], Decision tree [DT], Naive Bayes [NB], and Artificial Neural Networks [ANN]) [29] was compared using leave-one-out cross-validation. Details of extracted features for prediction were provided in Supplementary Table 1. The validity was assessed by F1-score, sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV). More details of statistical analysis were provided in Supplementary Methods.

Results

Actigraphy data analysis

A total of 80 MDD subjects and 76 controls were included. (Fig. 1) Table 1 summarizes demographic and clinical information. Although the average cpm level during the whole week did not differ between MDD subjects and controls, %mobile was significantly lower in MDD subjects (P = 0.006). When we further investigated physical activity levels during active periods and major rest intervals, MDD subjects and controls showed similar average cpm and %mobile levels. The only difference was a significantly longer major rest interval in MDD subjects (P = 0.027). %mobile was not associated with psychotropic medication. Age was significantly younger in the non-remitted MDD group than in the remitted group (corrected P = 0.020). (Supplementary Table 2) After adjustment for age and sex, both remitted and non-remitted MDD subjects had lower %mobile than controls (remitted MDD: P = 0.041; non-remitted MDD: P = 0.017).

**Fig. 1: Subject recruitment outline.**

Table 1 Demographic and clinical information, as well as actigraphy data between controls and MDD subjects.

Full size table

Later sleep midpoint (P = 0.047) and Acrophase (P = 0.024) were observed in MDD subjects compared to controls. (Table 1) After adjustment for age and sex, remitted MDD subjects had a delayed Acrophase (P = 0.009) and sleep midpoint (P = 0.032) than controls, while non-remitted MDD subjects showed a significantly higher intra-individual variability of sleep duration (P = 0.003) and marginally higher intra-individual variability of sleep midpoint (P = 0.088) compared to controls. Further analysis of sleep onset and sleep offset showed that the delayed sleep midpoint in remitted MDD subjects was due to a later sleep offset (P = 0.015) rather than sleep onset (P = 0.11). The sleep onset (P = 0.31) and sleep offset (P = 0.41) were comparable between non-remitted subjects and controls after adjustment for age and sex. Both remitted (ISI: P = 0.002; rMEQ: P = 0.009) and non-remitted MDD (ISI & rMEQ: P < 0.001) were associated with lower rMEQ score (tendency to be more eveningness) and higher ISI score than controls, respectively.

App measurement

Demographic and clinical information, subjective happiness level, and completion rate

Sixty-one MDD subjects and 47 controls completed the app-based measurement. (Fig. 1) Age and sex were matched between two groups (Table 2) or among three groups (Supplementary Table 3). The completion rate of overall 28 data points was 72.89% in MDD subjects and 83.36% in controls. MDD subjects had a lower average subjective happiness level (P < 0.001). The average happiness level in remitted (corrected P = 0.007) and non-remitted MDD subjects (corrected P < 0.001) were both lower than controls. Additionally, the average happiness level was moderately and negatively correlated with 17-HDS score among all subjects (standardized beta coefficients = −0.53, P < 0.001).

Table 2 Demographic and clinical information, as well as app data between controls and MDD subjects.

Full size table

Facial expression analysis

Fifty-four MDD subjects and 42 controls provided at least one valid video (mean ± SD: 15.5 ± 7.16). MDD subjects showed a significantly higher proportion of AU4 and a lower proportion of AU12 than controls (AU4: P = 0.023; AU12: P = 0.007). (Table 2) The proportion of AU4 and AU12 was not associated with psychotropic medications. The proportion of AU4 was significantly higher in non-remitted MDD subjects than controls (corrected P = 0.004). The proportion of AU4 in remitted MDD subjects was comparable to controls. The proportion of other AUs did not differ among the three groups after multiple testing corrections.

Speech analysis

Fifty-four MDD subjects and 41 controls had at least one valid speech segment (mean ± SD: 8.44 ± 5.67). MDD subjects had a significantly lower articulation rate (P < 0.001) and higher pause variability (P = 0.046) than controls. (Table 2) When the comparison was performed among three groups, the articulation rate was significantly higher in controls compared with both remitted (corrected P = 0.042) and non-remitted MDD subjects (corrected P < 0.001). In addition, articulation rate seemed to be associated with anxiolytic use (P = 0.046). Further analysis by the exclusion of subjects taking anxiolytics, the articulation rate of controls remained significantly higher than both MDD groups (remitted MDD: corrected P = 0.029; non-remitted MDD: corrected P = 0.036). Pause variability was not associated with psychotropic medications.

NLP

The NLP analysis was performed at which MDD subjects had significantly higher percentages of self-reference (P = 0.018) and negative emotion words (P < 0.001). In addition, remitted MDD subjects showed a higher percentage of self-reference than controls (corrected P = 0.048), while non-remitted MDD subjects had a higher percentage of negative emotion words than controls (corrected P < 0.001).

Multimodal detection of MDD

As shown in Supplementary Table 4, the prediction performance of ANN was generally more favorable compared to other ML methods for both lifetime diagnosis and non-remission with the fusion of all digital modalities. Therefore, ANN was further trained on multi-dimensional features in different modalities. Table 3 lists the performance of predicting lifetime history of MDD among 41 controls and 40 MDD subjects who had both app and actigraphy data. Fusion of all digital modalities resulted in a superior performance (F1-score = 0.81) than that of any single modality including HADS-D. As for predicting non-remission, the multimodal digital features had a F1 performance score of 0.64 and the addition of HADS-D improved the F1-score further to 0.70. (Table 4).

Table 3 Performance of predicting lifetime history of MDD.

Full size table

Table 4 Performance of predicting non-remission status of MDD.

Full size table

Discussion

In this study, we developed and validated a multimodal digital measurement system (actigraphy and a novel app D-MOMO) to assess depression in Hong Kong Chinese. We found that MDD subjects demonstrated a series of digital features including facial features (more brow lowering and less lip corner pulling), speech features (lower articulation rate, higher pause variability, more self-references and negative emotion words), mood features (lower subjective happiness level), and sleep and circadian features (decreased mobile time, delayed sleep midpoint and Acrophase). For the prediction of a lifetime history of MDD, the performance (F1-score = 0.81) after the fusion of all digital modalities was superior to that of individual modality. On the other hand, the predictive power for non-remission status was relatively lower with the fusion of all digital modalities (F1 score = 0.64) but the predictive power was enhanced by adding HADS-D item (F1-score = 0.70).

Our findings in facial expression analysis supported the mood-facilitation hypothesis [30], at which MDD subjects showed more AU4 (brow lowering) related to negative emotion and less AU12 (lip corner pulling) during positive emotion. Brow lowering is an important component of the omega sign, which was first described by Charles Darwin [31] as a melancholic facial sign [32]. Non-remitted instead of remitted MDD subjects displayed more brow lowering than controls, which suggested that brow lowering was more likely to be a state marker. This was consistent with the hypothesis that brow lowering is activated during hypervigilance or stress response [32, 33].

Similar to another Caucasian study [34], there was a lower articulation rate without a significant increase in pause duration in our MDD subjects. Although only 3 subjects were rated as psychomotor retardation, digital measurement might capture more subtle vocal motor signs. Interestingly, remitted MDD subjects also showed a lower articulation rate. Thus, a lower articulation rate may persist after remission of depression as a trait marker. The findings remained the same after we controlled the use of benzodiazepines [35]. Higher pause variability might reflect more frequent hesitations and stuttering [36]. For physical activity, we found that MDD subjects had decreased mobile time. It was related to longer major rest intervals (i.e., excessive bedtime), which may be contributed by insomnia, anhedonia, or fatigue. The similar cpm level during active period was in contrast with the impression that MDD subjects have reduced motor activity [37]. This might be related to the heterogeneity of depression, as about half of our MDD subjects were in remission.

As for sleep and circadian markers, actigraphy monitoring demonstrated delayed sleep midpoint and Acrophase. This association of eveningness with depression was in line with previous studies [38,39,40]. Intriguingly, we found a lower rMEQ score but no delay of Acrophase and sleep midpoint in non-remitted MDD subjects, which might be partially explained by their higher intra-individual variability of sleep duration and sleep midpoint. This particular irregular sleep pattern was more marked during a depressive episode [41], and supported by higher ISI scores as well.

More negative emotion words from NLP and lower average happiness levels were found, suggesting a smoldering mood. In line with other Caucasian studies, we found that Chinese MDD subjects tended to use more first-person singular pronouns, which reflected a universal cross-cultural phenomenon of self-focused attention in depressive patients [24, 25, 42].

For predicting the lifetime history of MDD, the best F1-score (0.81) was achieved with the fusion of all digital modalities and ANN machine-learning analysis. The improvement by multimodal fusion was also reported previously [11, 43]. In terms of predicting non-remission, the optimal F1-score (0.70) was reasonably satisfactory, albeit lower than that of lifetime depression. Overall, our results supported the potential of applying digital modalities in detecting both depression state and trait.

The main strength of this study was that our digital system was based on convenient, feasible, and remote measurements without clinician involvement. To our best knowledge, only two studies from Europe [44] and Korea [45] have similar multimodal systems that integrated both passive features (rest-activity pattern) and active features (video or audio). Besides, our app users showed a rather satisfactory adherence rate, which supported the feasibility and application to more subjects. Moreover, the clinical outcome was assessed by trained medical researchers rather than simply based on self-report questionnaires.

There were also some limitations. First, the sample size was only moderate, particularly for multimodal detection. In our future research, we plan to recruit a diverse sample of participants to examine the hypothesis regarding potential differences in the predictive power of multimodal measurements across different ages and genders. Second, age was not matched between remitted and non-remitted MDD subjects in actigraphy data analysis, albeit we made further adjustments in the analysis. Third, speech analysis and facial expression analysis in this study required a state of relative quietness and unmasking that might reduce the convenience of the app measure. Fourth, the potential burden to the subjects (repeated mood diary measurement 4 times per day in the app and 1 week of actigraphy measurement) especially with a vision for future screening and monitoring of depression in a larger population may require further improvement, such as minimization of times and duration of monitoring as well as usage of some smart prompting (e.g., virtual agent like SimSense & MultiSense [46]) in the AI system.

In summary, we have identified a series of digital features of depression. The application of digital modalities with ML provided a good predictive performance for lifetime diagnosis of depression and a relatively lower but still satisfactory performance for predicting non-remission status. Further longitudinal study will be needed to determine whether these digital markers could capture the disease progression and treatment response in depression.

Data availability

The data that support the findings of this study are available from the corresponding author upon reasonable request.

References

Herrman H, Kieling C, McGorry P, Horton R, Sargent J, Patel V. Reducing the global burden of depression: a Lancet-World Psychiatric Association Commission. Lancet. 2019;393:e42–e43.
Article PubMed Google Scholar
Lam LC, Wong CS, Wang MJ, Chan WC, Chen EY, Ng RM, et al. Prevalence, psychosocial correlates and service utilization of depressive and anxiety disorders in Hong Kong: the Hong Kong Mental Morbidity Survey (HKMMS). Soc Psychiatry Psychiatr Epidemiol. 2015;50:1379–88.
Article PubMed Google Scholar
Schnyder N, Panczak R, Groth N, Schultze-Lutter F. Association between mental health-related stigma and active help-seeking: systematic review and meta-analysis. Br J Psychiatry. 2017;210:261–8.
Article PubMed Google Scholar
Wong VT. Recruitment and training of psychiatrists in Hong Kong: what puts medical students off psychiatry–an international experience. Int Rev Psychiatry. 2013;25:481–5.
Article PubMed Google Scholar
Ebner-Priemer U, Santangelo P. Digital phenotyping: hype or hope? Lancet Psychiatry. 2020;7:297–9.
Article PubMed Google Scholar
Huckvale K, Venkatesh S, Christensen H. Toward clinical digital phenotyping: a timely opportunity to consider purpose, quality, and safety. npj Digital Med. 2019;2:88.
Article Google Scholar
Pampouchidou A, Simos PG, Marias K, Meriaudeau F, Yang F, Pediaditis M, et al. Automatic assessment of depression based on visual cues: a systematic review. IEEE Trans Affect Comput. 2019;10:445–70.
Article Google Scholar
Cummins N, Scherer S, Krajewski J, Schnieder S, Epps J, Quatieri TF. A review of depression and suicide risk assessment using speech analysis. Speech Commun. 2015;71:10–49.
Article Google Scholar
De Angel V, Lewis S, White K, Oetzmann C, Leightley D, Oprea E, et al. Digital health tools for the passive monitoring of depression: a systematic review of methods. npj Digit Med. 2022;5:3.
Article PubMed PubMed Central Google Scholar
Dibeklioglu H, Hammal Z, Cohn JF. Dynamic multimodal measurement of depression severity using deep autoencoding. IEEE J Biomed Health Inf. 2018;22:525–36.
Article Google Scholar
Dibeklioğlu H, Hammal Z, Yang Y, Cohn JF. Multimodal detection of depression in clinical interviews. Proc ACM Int Conf Multimodal Interact. 2015;2015:307–10.
Article PubMed PubMed Central Google Scholar
Yip, M. Tone. Cambridge University Press. 2002 https://scholar.google.com/scholar?, https://catdir.loc.gov/catdir/samples/cam033/2002073726.pdf.
First MB, Williams JBW, Karg RS, S RL. Structured Clinical Interview for DSM-5 Disorders, Clinician Version (SCID-5-CV). Porto Alegre: Artmed; 2017.
Google Scholar
Feng H, Chen L, Liu Y, Chen X, Wang J, Yu MWM, et al. Rest-activity pattern alterations in Idiopathic REM sleep behavior disorder. Ann Neurol. 2020;88:817–29.
Article CAS PubMed Google Scholar
Chan JW, Lam SP, Li SX, Chau SW, Chan SY, Chan NY, et al. Adjunctive bright light treatment with gradual advance in unipolar major depressive disorder with evening chronotype—a randomized controlled trial. Psychol Med. 2022;52:1448–57.
Article PubMed Google Scholar
Leung CM, Wing YK, Kwong PK, Lo A, Shum K. Validation of the Chinese-Cantonese version of the hospital anxiety and depression scale and comparison with the Hamilton Rating Scale of Depression. Acta Psychiatr Scand. 1999;100:456–61.
Article CAS PubMed Google Scholar
Chan NY, Li SX, Zhang J, Lam SP, Kwok APL, Yu MWM, et al. A prevention program for insomnia in at-risk adolescents: a randomized controlled study. Pediatrics. 2021;147:e2020006833.
Article PubMed Google Scholar
Chen SJ, Zhang JH, Li SX, Tsang CC, Chan KCC, Au CT, et al. The trajectories and associations of eveningness and insomnia with daytime sleepiness, depression and suicidal ideation in adolescents: a 3-year longitudinal study. J Affect Disord. 2021;294:533–42.
Article PubMed Google Scholar
Blume C, Santhi N, Schabus M. ‘nparACT’ package for R: a free software tool for the non-parametric analysis of actigraphy data. MethodsX. 2016;3:430–5.
Article PubMed PubMed Central Google Scholar
Sachs M. cosinor: tools for estimating and predicting the cosinor model. R package version 1.1. 2014; 2014 https://github.com/sachsmc/cosinor.
Boersma P, Weenink D. PRAAT, a system for doing phonetics by computer. Glot Int. 2001;5:341–5.
Google Scholar
Geng Z, Yan H, Qiu X, Huang X. fastHan: A BERT-based Multi-Task Toolkit for Chinese NLP. The 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing; 2021; Online. pp. 99–106.
Huang C-L, Chung CK, Hui N, Lin Y-C, Seih Y-T, Lam BCP, et al. The development of the Chinese linguistic inquiry and word count dictionary. Chin J Psychol. 2012;54:185–201.
Google Scholar
Edwards TM, Holtzman NS. A meta-analysis of correlations between depression and first person singular pronoun use. J Res Personal. 2017;68:63–8.
Article Google Scholar
Vine V, Boyd RL, Pennebaker JW. Natural emotion vocabularies as windows on distress and well-being. Nat Commun. 2020;11:4525.
Article ADS PubMed PubMed Central Google Scholar
Baltrusaitis T, Zadeh A, Lim YC, Morency LP. OpenFace 2.0: Facial Behavior Analysis Toolkit. 2018 13th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2018), Xi'an, China, 2018, pp. 59–66.
Ekman P, Friesen WV. Facial action coding system. Consulting Psychologists Press, Palo Alto, CA. 1978.
Matsumoto D, Ekman P. Facial expression analysis. Scholarpedia. 2008;3:4237.
Article ADS Google Scholar
Uddin S, Khan A, Hossain ME, Moni MA. Comparing different supervised machine learning algorithms for disease prediction. BMC Med Inf Decis Mak. 2019;19:281.
Article Google Scholar
Rosenberg EL. Levels of analysis and the organization of affect. Rev Gen Psychol. 1998;2:247–70.
Article Google Scholar
Darwin C. The expression of the emotions in man and animals. John Murray, London. 1872
Chen J, Li C-T, Li TMH, Chan NY, Chan JWY, Liu Y, et al. A forgotten sign of depression—the omega sign and its implication. Asian J Psychiatry. 2023;80:103345.
Article Google Scholar
Lee IS, Yoon SS, Lee SH, Lee H, Park HJ, Wallraven C, et al. An amplification of feedback from facial muscles strengthened sympathetic activations to emotional facial cues. Auton Neurosci. 2013;179:37–42.
Article PubMed Google Scholar
Cannizzaro M, Harel B, Reilly N, Chappell P, Snyder PJ. Voice acoustical measurement of the severity of major depression. Brain Cogn. 2004;56:30–5.
Article PubMed Google Scholar
Griffin CE 3rd, Kaye AM, Bueno FR, Kaye AD. Benzodiazepine pharmacology and central nervous system-mediated effects. Ochsner J. 2013;13:214–23.
PubMed PubMed Central Google Scholar
Mundt JC, Snyder PJ, Cannizzaro MS, Chappie K, Geralts DS. Voice acoustic measures of depression severity and treatment response collected via interactive voice response (IVR) technology. J Neurolinguist. 2007;20:50–64.
Article Google Scholar
Schuch F, Vancampfort D, Firth J, Rosenbaum S, Ward P, Reichert T, et al. Physical activity and sedentary behavior in people with major depressive disorder: a systematic review and meta-analysis. J Affect Disord. 2017;210:139–50.
Article PubMed Google Scholar
Chan JW, Lam SP, Li SX, Yu MW, Chan NY, Zhang J, et al. Eveningness and insomnia: independent risk factors of nonremission in major depressive disorder. Sleep. 2014;37:911–7.
Article PubMed PubMed Central Google Scholar
Li SX, Chan NY, Man Yu MW, Lam SP, Zhang J, Yan Chan JW, et al. Eveningness chronotype, insomnia symptoms, and emotional and behavioural problems in adolescents. Sleep Med. 2018;47:93–9.
Article CAS PubMed Google Scholar
Crouse JJ, Carpenter JS, Song YJC, Hockey SJ, Naismith SL, Grunstein RR, et al. Circadian rhythm sleep-wake disturbances and depression in young people: implications for prevention and early intervention. Lancet Psychiatry. 2021;8:813–23.
Article PubMed Google Scholar
Carpenter JS, Crouse JJ, Scott EM, Naismith SL, Wilson C, Scott J, et al. Circadian depression: a mood disorder phenotype. Neurosci Biobehav Rev. 2021;126:79–101.
Article PubMed Google Scholar
Tackman A, Sbarra D, Carey A, Donnellan M, Horn A, Holtzman N, et al. Depression, negative emotionality, and self-referential language: a multi-lab, multi-measure, and multi-language-task research synthesis. J Pers Soc Psychol. 2018;116:817–834.
Article PubMed Google Scholar
Alghowinem S, Goecke R, Wagner M, Epps J, Hyett M, Parker G, et al. Multimodal depression detection: fusion analysis of paralinguistic, head pose and eye gaze behaviors. IEEE Trans Affect Comput. 2018;9:478–90.
Article Google Scholar
Matcham F, Barattieri di San Pietro C, Bulgari V, de Girolamo G, Dobson R, Eriksson H, et al. Remote assessment of disease and relapse in major depressive disorder (RADAR-MDD): a multi-centre prospective cohort study protocol. BMC Psychiatry. 2019;19:72.
Article CAS PubMed PubMed Central Google Scholar
Hong J, Kim J, Kim S, Oh J, Lee D, Lee S, et al. Depressive symptoms feature-based machine learning approach to predicting depression using smartphone. Healthcare, 2022. 10, https://doi.org/10.3390/healthcare10071189.
Stratou G, Morency LP. MultiSense—context-aware nonverbal behavior analysis framework: a psychological distress use case. IEEE Trans Affect Comput. 2017;8:190–203.
Article Google Scholar

Download references

Acknowledgements

This study was funded by the Health and Medical Research Fund (HMRF, Ref: 09203066). The funder played no role in the study design, data collection, analysis, and interpretation of data, or the writing of this manuscript. We thank all the doctors who have helped with the recruitment of subjects and all the subjects who participated in this study.

Author information

Authors and Affiliations

Li Chiu Kong Family Sleep Assessment Unit, Department of Psychiatry, Faculty of Medicine, The Chinese University of Hong Kong, Shatin, Hong Kong SAR, China
Jie Chen, Ngan Yin Chan, Chun-Tung Li, Joey W. Y. Chan, Yaping Liu, Steven W. H. Chau, Tim M. H. Li & Yun-Kwok Wing
Department of Psychiatry, Fujian Medical University Affiliated Fuzhou Neuropsychiatric Hospital, Fuzhou, China
Jie Chen
Center for Sleep and Circadian Medicine, The Affiliated Brain Hospital of Guangzhou Medical University, Guangzhou, Guangdong, China
Yaping Liu
State Key Laboratory of Brain and Cognitive Sciences, The University of Hong Kong, Hong Kong SAR, China
Shirley Xin Li & Tatia M. C. Lee
Sleep Research Clinic and Laboratory, Department of Psychology, The University of Hong Kong, Hong Kong SAR, China
Shirley Xin Li
Department of Applied Data Science, Hong Kong Shue Yan University, Hong Kong SAR, China
Kwong Sak Leung
Department of Computer Science and Engineering, The Chinese University of Hong Kong, Hong Kong SAR, China
Kwong Sak Leung & Pheng-Ann Heng
Department of Psychology, The University of Hong Kong, Hong Kong SAR, China
Tatia M. C. Lee

Authors

Jie Chen
View author publications
You can also search for this author in PubMed Google Scholar
Ngan Yin Chan
View author publications
You can also search for this author in PubMed Google Scholar
Chun-Tung Li
View author publications
You can also search for this author in PubMed Google Scholar
Joey W. Y. Chan
View author publications
You can also search for this author in PubMed Google Scholar
Yaping Liu
View author publications
You can also search for this author in PubMed Google Scholar
Shirley Xin Li
View author publications
You can also search for this author in PubMed Google Scholar
Steven W. H. Chau
View author publications
You can also search for this author in PubMed Google Scholar
Kwong Sak Leung
View author publications
You can also search for this author in PubMed Google Scholar
Pheng-Ann Heng
View author publications
You can also search for this author in PubMed Google Scholar
Tatia M. C. Lee
View author publications
You can also search for this author in PubMed Google Scholar
Tim M. H. Li
View author publications
You can also search for this author in PubMed Google Scholar
Yun-Kwok Wing
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Jie Chen, Ngan Yin Chan, Tim MH Li, and Yun-Kwok Wing conceived the overall study design. Jie Chen performed data analyses. Jie Chen drafted the initial version of manuscript. Ngan Yin Chan, Chun-Tung Li, Joey WY Chan, Yaping Liu, Shirley Xin Li, Steven WH Chau, Kwong Sak Leung, Pheng-Ann Heng, Tatia M.C. Lee, Tim MH Li, and Yun-Kwok Wing reviewed and edited the manuscript. Yun-Kwok Wing obtained funding for the project.

Corresponding authors

Correspondence to Tim M. H. Li or Yun-Kwok Wing.

Ethics declarations

Competing interests

Yun-Kwok Wing received a consultation fee from Eisai Co., Ltd., an honorarium from Eisai Hong Kong for a lecture, travel support from Lundbeck HK limited for the overseas conference, and an honorarium from Aculys Pharma, Inc for a lecture and Joey WY Chan received a personal fee from Eisai Co., Ltd and travel support from Lundbeck HK limited for overseas conference, which is outside the submitted work. All other authors declare no financial or non-financial competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

supplementary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Chen, J., Chan, N.Y., Li, CT. et al. Multimodal digital assessment of depression with actigraphy and app in Hong Kong Chinese. Transl Psychiatry 14, 150 (2024). https://doi.org/10.1038/s41398-024-02873-4

Download citation

Received: 17 July 2023
Revised: 06 March 2024
Accepted: 12 March 2024
Published: 18 March 2024
DOI: https://doi.org/10.1038/s41398-024-02873-4

Subjects

Abstract

Similar content being viewed by others

Emotions and brain function are altered up to one month after a single high dose of psilocybin

Self-supervised learning for human activity recognition using 700,000 person-days of wearable data

Effect of breathwork on stress and mental health: A meta-analysis of randomised-controlled trials

Introduction

Methods

Study design

Study population

Multimodal measurement via the D-MOMO app and actigraphy

Clinical assessment

Actigraphy data processing

Speech analysis

Natural language processing (NLP)

Facial expression analysis

Statistical analysis

Results

Actigraphy data analysis

App measurement

Demographic and clinical information, subjective happiness level, and completion rate

Facial expression analysis

Speech analysis

NLP

Multimodal detection of MDD

Discussion

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Additional information

Supplementary information

supplementary

Rights and permissions

About this article

Cite this article

Share this article

Search

Quick links