Using language in social media posts to study the network dynamics of depression longitudinally

Kelley, Sean W.; Gillan, Claire M.

doi:10.1038/s41467-022-28513-3

Article
Open access
Published: 15 February 2022

Nature Communications volume 13, Article number: 870 (2022)

Abstract

Network theory of mental illness posits that causal interactions between symptoms give rise to mental health disorders. Increasing evidence suggests that depression network connectivity may be a risk factor for transitioning and sustaining a depressive state. Here we analysed social media (Twitter) data from 946 participants who retrospectively self-reported the dates of any depressive episodes in the past 12 months and current depressive symptom severity. We construct personalised, within-subject, networks based on depression-related linguistic features. We show an association existed between current depression severity and 8 out of 9 text features examined. Individuals with greater depression severity had higher overall network connectivity between depression-relevant linguistic features than those with lesser severity. We observed within-subject changes in overall network connectivity associated with the dates of a self-reported depressive episode. The connectivity within personalized networks of depression-associated linguistic features may change dynamically with changes in current depression symptoms.

Introduction

Network theories of mental illness propose that disorders like depression emerge from cascades of causal interactions that occur between symptoms¹. In contrast to traditional frameworks that suggest symptoms are indicators of a single underlying disease state, network theories posit that these symptoms and their interactions are actually what drive these conditions. For example, diminished feelings of worth, compounded by insomnia, may lead to a loss of energy, resulting in weight gain and a decreased ability to think and concentrate. Positive feedback among these symptoms is thought to contribute to the maintenance of a depressive episode^2,3.

Preliminary support for network theory has come from studies comparing the network structure of self-reported depressive symptoms between groups of individuals with and without a diagnosis, or before and after some intervention. Individuals with depression, compared to individuals without depression, are thought to have greater connectivity between depression symptoms, reflecting an elevated vulnerability to ‘knock-on effects’ that may result in fairly sudden and persistent changes in depression². This has been partially borne out in the data; studies have shown that patients with depression have increased connectivity among depression symptoms compared to healthy controls^4,5,6 and the same is true for several other mental health conditions^7,8,9,10. Moreover, participants who go on to have persistent depression have more strongly connected networks at baseline than those who later enter remission^11,12 and the same appears to be true for patients with eating disorders who undergo treatment¹³. As a change in a system’s state approaches, e.g., onset of a depressive episode, network connectivity is expected to increase, reflecting elevated vulnerability, as the system becomes less and less able to recover from external stress^14,15,16. One study found some evidence that network connectivity increases approaching a depressive episode, although these effects did not directly capture the onset of an episode and were not shown within-subject¹⁷.

Despite these results, recent studies have yielded some inconsistent findings. Another study failed to extend the findings of Van Borkulo and colleagues (2015) to an adolescent sample—finding instead no difference in baseline network connectivity in patients who went on to have worse outcomes¹⁸. Furthermore, several studies have failed to find evidence for one of the key predictions of network theory, that individuals who recover (e.g., following treatment) show reductions in their network connectivity. In fact, several studies have actually shown an increase in network connectivity following treatment for depression^12,19,20,21.

One explanation for the lack of consistent findings is that the majority of prior work has been based on networks constructed from group-level symptom correlations (i.e. between-subject, cross-sectional). An alternative approach is to construct networks based on repeated assessments gathered from the same individual over time (i.e. within subject, longitudinal), which allows one to characterize each individual’s network structure, sometimes referred to as a personalised network²². This may be an important distinction, because it is unclear to what extent cross-sectional networks capture how symptoms causally relate to one-another over time, within an individual. This issue was addressed empirically when researchers analysed the same dataset in multiple ways, allowing them to directly compare the structure of cross-sectional networks versus personalized ones²³. They found that the two analysis approaches can sometimes yield different results, including different associations between symptoms and finding that different symptoms were the most central.

We therefore considered if the inconsistent findings in the field of network analysis and mental health to date might stem from the over-reliance on cross-sectional methods for characterizing network structure. Longitudinal studies are needed to test some of the key predictions of network theory, such as whether network connectivity increases as someone transitions from a healthy to acutely ill state. We know of only two network studies that attempted to measure symptoms within the same individual over a long enough period to capture a naturally occurring change in mental health state. These two examples are both single-subject observational studies. Wichers et al. (2016) reported data from a single patient over 239 days, a period of time that spanned the transition into a depressed state, concluding that the patient’s network connectivity increased prior to the onset of the depressive episode, though data were insufficient for a formal analysis²⁴. In a longitudinal study of one patient with psychosis, researchers similarly observed a qualitative increase in network connectivity during both an impending and full relapse²⁵. While these studies are suggestive, larger samples and formal statistical tests are needed to address one of the most fundamental predictions of network theory of mental illness - does network connectivity increase during a depressive episode? The present study aimed to fill this gap by comparing the connectivity of networks within versus outside a depressive episode. Rather than using self-report symptoms, however, we used linguistic features associated with depression posted by users on the social media platform Twitter^26,27,28,29.

One reason that studies addressing this question are lacking is that the required data are challenging to gather; multiple assessments are required per day, per participant, over several weeks or months. Challenges are compounded by the need for a naturally occurring depressive episode to have its onset during this period. To circumvent these challenges, we utilized an alternative to ecological momentary assessment (EMA). Instead of asking participants to report their mood, motivation, sleep etc. daily over a prolonged period of time, we analysed depression-associated textual data already archived on Twitter, a social media platform. We analysed data over a 12-month period from nearly 1,000 participants that in some cases spanned the onset of a depressive episode. Central to our approach are prior observations that individuals with depression, both people with a clinical diagnosis and those with self-reported depression symptoms, have significant linguistic differences in both their writing and speech patterns compared to those without depression. For example, individuals with depression use significantly more 1^st person singular pronouns than individuals without depression in personal essays^30,31 and semi-structured interviews³², which is thought to reflect enhanced self-focused attention that occurs in a depressed state²⁷. Along with changes in pronoun use, depression is associated with negatively biased cognitive distortions, e.g., “everyone thinks that I am a loser”³³. These findings are also observable in social media posts^26,28,34,35, and include increased use of swear and negation words, anger, references to death, and changes in the use of articles and other pronouns^36,37. People with depression are also less active on Twitter in the early morning (3am–6am) than healthy controls, exhibiting an altered circadian rhythm, but they use significantly more personal pronouns, negative affect words, and rumination words during this time. 
Language usage on social media is thus not static, instead changing over time reflecting underlying changes in a participant’s mental health³⁸. By examining longitudinal data, we can examine fluctuations in time within-subjects, allowing us to ask if these depression-associated linguistic features become more connected when someone is in the midst of a depressive episode. For example, when a person is more self-focused, using words like “I”, “me” and “my”, is that person also more anxious, angry or sad? Is that association stronger when someone is currently depressed than when they feel well? This sort of data has the benefit of being objective and relatively plentiful, but the mapping of these linguistic features onto specific self-reported symptoms diagnostic of depression (such as sadness, sleep disturbances and motivation) has never been formalised and needs considerable further study. With these limitations in mind, we tested if, similar to what has been predicted for self-report symptoms, depression-associated linguistic features are more inter-connected in individuals who have greater self-report depression severity and become more connected, within-subject, during a depressive episode. We first constrained our analysis to networks constructed from 9 text features^26,34,39 that previous studies have linked to depression, to ensure the independence of our analyses, but we subsequently tested if our results generalize to a range of networks constructed from depression-associated linguistic features more broadly.

In this work, we show that participants with greater depression severity have higher overall network connectivity among a network of 9 a priori selected depression-relevant linguistic features. Among participants with self-reported depressive episodes, we found that network connectivity is higher within vs. outside an episode. These results were not dependent on our chosen network; networks constructed from random samples of depression-related linguistic features are significantly more connected during a depressive episode compared to networks of depression-irrelevant linguistic features. Our study illustrates that Twitter data, albeit noisy, can be used as an alternative to ecological momentary assessment to study depression longitudinally and, in this case, to test key predictions of network theory.

Results

We tested whether global personalised network connectivity, constructed from an a priori set of 9 depression-relevant linguistic features, is associated with baseline depression severity using Twitter data from 946 participants. In a subset of 286 participants, we sought to determine whether within-subject personalised network connectivity is greater within vs. outside a self-reported depressive episode. Finally, we checked whether changes in within-episode network connectivity generalised to 1,000 other combinations of 9-node networks.

Association of Twitter text features with current depression symptomatology

As an initial step, we verified whether Linguistic Inquiry and Word Count (LIWC) text features averaged over the past year in our sample were significantly associated with current depression symptom severity (Table 1). Language pertaining to negative emotions (Neg. Emo), use of 1^st person singular (1^st Pers. Sing.), use of 2^nd person pronouns (2^nd Pers.), swear words (Swear), and negations (Negate) were significantly positively associated with current depression symptom severity. In contrast, use of 1^st person plural pronouns (1^st Pers. Pl.), articles (Articles), and words pertaining to positive emotions (Pos. Emo) was negatively associated with depression severity. There was no significant association with 3^rd person pronouns (3^rd Pers.) (Fig. 1). The proportion of days with Tweets within-subject that contained each of the 9 text features is presented in the online supplement (Table S2). Swear words were the least frequently occurring text feature (Mean: 0.30, SD: 0.22), while articles were the most frequent (Mean: 0.80, SD: 0.14).

Table 1 Association of average use of 9 a priori text features over a 12-month period with current depression symptom severity.

**Fig. 1: Self-reported depression severity is associated with several text features derived from Tweets.**

In terms of consistency with the prior literature, negative emotions^28,34,35,40,41,42, 1^st person singular pronouns^27,28,30,35,36,40,41, swear words^34,36,37,39,40,43, and negations^36,37,40,44 were shown to be positively associated with depression severity. In contrast, positive emotions^31,36,40,43,45,46, 1^st person plural^28,36,41, 2^nd person^36,40,44, and 3^rd person pronouns^36,40,41,44,47 have been found to be negatively associated with depression. Article use has been found to be significantly associated with depression severity, although there is inconsistency regarding the direction of the effect^28,36,37,39,40. We therefore replicated previously established directional associations for 6 (negative emotions, 1^st person singular, 1^st person plural pronouns, swear words, negations, and positive emotions) of 9 LIWC text features, but were unable to replicate negative associations for 2^nd person (we found a significant positive association) and 3^rd person plural pronouns (trend-level in the direction of a positive association). Given conflicting evidence surrounding the direction of associations between article use and depression in the existing literature, our finding of a negative association can be neither confirmatory nor disconfirmatory at this point. Directionality notwithstanding, we were thus broadly assured that the text features we selected showed relevance to depression. Therefore, we used this set of linguistic features as nodes upon which to construct personalised depressive networks.

Overall depression network composition

We constructed personalised networks for each participant, based on these 9 depression-associated text features derived from Tweets posted over the 12 months preceding study enrolment. From these individual networks, we tested how network structure differed as a function of depression severity. In support of a central hypothesis of network theory, we observed a significant positive association between depression severity and global network strength (β = 0.008, SE = 0.003, p = 0.002) (Fig. 2a). That is, those individuals with the highest levels of depression had the most tightly connected networks in the sample. Participants with higher depression severity had significantly larger node strength of negative emotions, swear words and articles [Neg. Emo: β = 0.02, SE = 0.007, p = 0.007; Swear: β = 0.02, SE = 0.007, p = 0.009; Articles, β = 0.01, SE = 0.003, p < 0.001] (Fig. 2b). The overall network of depression-related linguistic features was characterised primarily by several weak positive connections, and one strong positive connection between negative emotions and swear words (Fig. 2c). There was a significant positive association between number of days and global network connectivity (β = 0.00009, SE = 0.00002, p < 0.001) (Figure S3a). However, there was no significant association between number of days in the time-series with current depression severity (β = 0.0002, SE = 0.0003, p = 0.43) (Figure S3b) and the significant association with current depression severity remained after controlling for the number of days in each personalised network (Table S3). Furthermore, our findings were not affected when networks were constructed without 3^rd person pronouns, which had no significant association with current depression severity (Figure S4).
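The quantities reported above, node strength and global network strength, can be illustrated with a minimal sketch. This section does not describe how the personalised networks were estimated, so the sketch assumes, purely for illustration, that each edge weight is the absolute Pearson correlation between two daily feature series; the feature names and toy data are hypothetical. Node strength is then the sum of a node's edge weights, and global strength is the sum over all unique edges.

```python
import math
import random


def pearson(x, y):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return 0.0
    return cov / math.sqrt(var_x * var_y)


def node_strengths(series):
    """Map each feature to the sum of its absolute edge weights.

    `series` maps a feature name to one participant's list of daily values.
    Edge weights are absolute Pearson correlations (an assumption of this sketch).
    """
    names = list(series)
    strengths = {name: 0.0 for name in names}
    for i, a in enumerate(names):
        for b in names[i + 1:]:
            weight = abs(pearson(series[a], series[b]))
            strengths[a] += weight
            strengths[b] += weight
    return strengths


def global_strength(series):
    """Sum of absolute edge weights over all unique feature pairs."""
    # Every edge is counted once for each of its two endpoints, hence the division.
    return sum(node_strengths(series).values()) / 2.0


# Hypothetical toy participant with 120 days of data and three features.
rng = random.Random(0)
neg_emo = [rng.random() for _ in range(120)]
toy = {
    "neg_emo": neg_emo,
    "swear": [v + 0.1 * rng.random() for v in neg_emo],  # tightly coupled to neg_emo
    "articles": [rng.random() for _ in range(120)],       # independent noise
}
strengths = node_strengths(toy)
total = global_strength(toy)
```

In this toy participant the coupled pair dominates the network, loosely mirroring the single strong edge between negative emotions and swear words described above. A partial-correlation or time-lagged estimator could be substituted for `pearson` without changing the strength calculations.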

**Fig. 2: The connectivity of personalised networks of depression-relevant language is associated with individual differences in self-reported depression severity.**

Within-subject changes in network connectivity during depressive episodes

To test the hypothesis that networks of depression-associated linguistic features become more tightly connected during an episode, we compared network connectivity within-subject for periods when participants were depressed versus non-depressed over the preceding 12-month period. This required the construction of two personalised networks per person, one ‘within episode’ and another ‘outside episode’, among a subset of the sample who reported an episode in the past 12 months (N = 286). In line with our hypothesis, the networks of our participants had a significantly higher global network strength if constructed using language data gathered during an episode (‘within’) versus a time when they were not currently depressed (‘outside’) (β = 0.03, SE = 0.009, p = 0.005, Fig. 3a). We found our results were robust to unequal variances in the distribution of global network strength with the Wilcoxon signed-rank test (V = 16,840, p = 0.009). We also performed the analysis using a bootstrapped sample of 80% of the data and re-did the within-subject analysis 1,000 times as a strong control for skewed strength centrality distributions. We found that the distribution of within-episode regression coefficients was significantly above zero (β = 0.03, SE = 0.0001, p < 0.001) (Figure S5). In terms of the specific nodes of the network, during a depressive episode, 1^st person singular (1^st Pers. Sing., β = 0.03, SE = 0.01, p = 0.03), 1^st person plural (1^st Pers. Pl., β = 0.04, SE = 0.01, p = 0.002), 2^nd person (2^nd Pers., β = 0.03, SE = 0.01, p = 0.04), 3^rd person (3^rd Pers., β = 0.04, SE = 0.01, p < 0.001), use of articles (Articles, β = 0.04, SE = 0.01, p = 0.01), and negation words (Negate, β = 0.05, SE = 0.01, p = 0.001) all had significantly larger node strengths than the same networks constructed during times when the participants were not currently depressed (Fig. 3b).
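The repeated 80% resampling described above can be sketched as follows. The exact resampling scheme and regression model are not given in this section, so the sketch makes two labelled assumptions: participants are drawn without replacement on each iteration, and the mean paired difference in global strength (within minus outside) stands in for the regression coefficient. The function name and toy values are hypothetical.

```python
import random
import statistics


def resampled_mean_differences(within, outside, fraction=0.8, n_iter=1000, seed=0):
    """Mean within-minus-outside difference across repeated participant subsamples.

    `within` and `outside` hold one global-strength value per participant, in
    the same participant order. Each iteration draws `fraction` of the
    participants without replacement (an assumption of this sketch).
    """
    if len(within) != len(outside):
        raise ValueError("within and outside must be paired per participant")
    rng = random.Random(seed)
    diffs = [w - o for w, o in zip(within, outside)]
    k = max(1, round(fraction * len(diffs)))
    return [statistics.fmean(rng.sample(diffs, k)) for _ in range(n_iter)]


# Hypothetical toy data: 50 participants with a small positive shift within episode.
toy_rng = random.Random(1)
outside = [toy_rng.gauss(1.0, 0.2) for _ in range(50)]
within = [o + toy_rng.gauss(0.05, 0.1) for o in outside]

estimates = resampled_mean_differences(within, outside)
share_positive = sum(e > 0 for e in estimates) / len(estimates)
```

If the within-episode increase is not driven by a handful of participants with extreme strength values, `share_positive` should stay close to 1 across subsamples.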

**Fig. 3: Personalised network connectivity increases during a depressive episode for specific symptoms.**

Changes in node strength were not due to mean increases in the text features themselves, because there were no significant differences among any of the text features within versus outside an episode (Table S4). However, within- and outside-episode networks of these participants had an average duration of 80.8 days (SD: 61.7) and 171.5 days (SD: 85.6) respectively, meaning that on average participants spent considerably more time in a non-depressed state than in a depressed one (β = −90.8, SE = 6.2, p < 0.001, Figure S3c). This gave us cause for concern as we noted a significant negative association between global network connectivity and the number of days that within-episode networks were based on (r = −0.16(286), p = 0.01) (Figure S3d). The relationship is non-linear with the largest global connectivity values found during short (i.e., under 30 days) within-episode periods despite the removal of outliers. Additionally, there was a significant interaction such that the direction of association between the number of days of data and network strength depended on whether the data was from within vs. outside an episode (β = 0.0005, SE = 0.0001, p < 0.001). We reasoned, therefore, that the difference in number of days upon which within vs. outside episode networks were based presented a potential confound to interpretation. Indeed, a permutation test that randomly shuffled the identifier (within-episode with outside-episode) within each participant showed a greater bias towards elevated connectivity for those (fake) within-episode periods than would be expected by chance (\(\hat{\beta}\) = 0.007, SE = 0.0002, p < 0.001; Figure S6). Importantly, however, 99.3% of the betas observed were smaller than in the real unshuffled data, meaning that over and above any bias introduced by differences in the number of days within/outside episode, the true designation of being within an episode led to higher network connectivity. 
Consistent with this, after adjusting for the number of days in our regression, the significant increase in global network connectivity within versus outside an episode was reduced, but still statistically significant (β = 0.02, SE = 0.01, p = 0.02). Of the individual node strengths examined, only articles (Articles, β = 0.03, SE = 0.02, p = 0.04) still had a significantly larger node strength within an episode. Thus, the finding of increased network connectivity within versus outside an episode for our a priori depression network survived correction for the number of days within episode.
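The logic of the label-shuffling permutation test can be sketched for a single participant. This is an illustrative simplification, not the analysis reported here: the function name is hypothetical, the statistic is left generic (a plain mean in the toy example, where the study would use network connectivity), and no regression across participants is fitted. Shuffling keeps the number of within-episode days fixed, so any bias caused by unequal durations is retained in the null distribution and the observed difference is judged against it.

```python
import random
import statistics


def permutation_null(values, in_episode, statistic, n_perm=1000, seed=0):
    """Null distribution of statistic(within) - statistic(outside) under label shuffling.

    `values` holds one participant's daily observations and `in_episode` the
    matching True/False episode labels. Each permutation reassigns the labels
    to days at random while preserving how many days carry each label.
    """
    rng = random.Random(seed)
    labels = list(in_episode)
    null = []
    for _ in range(n_perm):
        rng.shuffle(labels)
        inside = [v for v, flag in zip(values, labels) if flag]
        outside = [v for v, flag in zip(values, labels) if not flag]
        null.append(statistic(inside) - statistic(outside))
    return null


# Hypothetical toy participant: 30 within-episode days, then 90 outside-episode days.
values = [2.0] * 30 + [0.0] * 90
in_episode = [True] * 30 + [False] * 90

observed = statistics.fmean(values[:30]) - statistics.fmean(values[30:])
null = permutation_null(values, in_episode, statistics.fmean)
share_below = sum(d < observed for d in null) / len(null)
```

Here `share_below` plays the role of the percentage of shuffled betas that fall below the unshuffled estimate.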

Generalisability of findings to other depression networks

The network of depression-associated features that we constructed was based on features previously described in the literature, and designed to ensure the independence of our analysis from mean-level effects or indeed noise in the present dataset. But it is important to note that this is not the only depression-related linguistic network that can be constructed from these data, nor is it necessarily the best. Of the 87 LIWC text features at our disposal, 59% were significantly associated with current depression severity at an uncorrected p < 0.05 level. Bivariate correlations between all LIWC text features and current depression severity can be found in the supplementary material (Table S5). We thus tested if our results held when networks were constructed from different sets of 9 randomly selected text features associated with current depression severity. Networks of text features associated with depression have significantly larger within-episode connectivity than those of networks not associated with depression (β = 0.01, SE = 0.0005, p < 0.001, Fig. 4A). The network with the largest increase in within-episode connectivity (Fig. 4B) included the following depression-relevant features: 1^st person singular pronouns (“1^st Pers. Sing.”), clout (“Clout”: non-transparent summary variable indicating social status/leadership), personal pronouns (“Pers. Pron.”, e.g. you, they), function words (“Function”, e.g. on, and), tentative (“Tentative”, e.g. maybe), negative emotions (“Neg. Emo.”, e.g. ugly), power (“Power”, e.g. superior), negation (“Negate”, e.g. not), and achieve (“Achieve”, e.g. win). In the top 100 depression-relevant networks, time and tentative words were found in 30% of networks (Fig. 4C). Our a priori selected network is consequently not the only network with elevated within-episode connectivity nor is it the network with the largest increase in connectivity. 
Rather, it is part of a general trend in which networks constructed from depression-relevant language features have greater connectivity in the midst of a depressive episode.
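Drawing many random 9-node networks from a pool of candidate features, and tallying how often each feature appears, can be sketched as below. The feature pool is a hypothetical placeholder list rather than the LIWC categories used here, and the function name and fixed seed are illustrative choices.

```python
import random
from collections import Counter


def sample_networks(features, n_networks=1000, size=9, seed=0):
    """Draw `n_networks` random node sets of `size` distinct features each."""
    if size > len(features):
        raise ValueError("not enough features to fill one network")
    rng = random.Random(seed)
    return [tuple(sorted(rng.sample(features, size))) for _ in range(n_networks)]


# Hypothetical pool standing in for the depression-associated text features.
pool = [f"feature_{i:02d}" for i in range(20)]
networks = sample_networks(pool)

# How often each feature occurs across the sampled networks.
appearances = Counter(name for network in networks for name in network)
```

Each sampled node set would then be passed to the within- versus outside-episode connectivity comparison. Restricting `appearances` to the best-performing networks gives a frequency summary of the kind reported for the top 100 networks.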

**Fig. 4: The generality of within-subject changes in network connectivity to other depression-relevant networks.**

Discussion

The network theory of mental illness posits that causal interactions between symptoms result in positive feedback loops that lead to the development and maintenance of poor mental health episodes. This theory generates a range of predictions that have been difficult to examine in self-reported depression symptom data due to the difficulty in collecting large volumes of longitudinal self-report data. We adopted an approach to test these predictions by studying time series of linguistic features that are associated with depression, extracted from the social media platform Twitter. These linguistic features are outwardly observable indicators of a range of internal states that prior work has shown to be relevant to depression. While these linguistic features of depression cannot be directly mapped to individual clinically recognised symptoms, we nonetheless posited that they might interact and serve to reinforce one another just as has been predicted by network theory for classic symptoms of depression. We predicted that networks constructed from these depression-relevant language features would be more strongly connected in those with higher levels of depression severity and moreover that they would become even more tightly connected when people were in a depressed state.

To test these initial predictions, we took a conservative approach in using 9 a priori text features with previously established relevance to depression from archival Twitter data. We found significant associations between 8 of 9 text features selected and current depression severity, of which 6 were consistent with now well-established directionality in the literature. These included positive associations between the use of 1^st person singular pronouns and negative emotions and depression symptom severity. Next, we constructed personalized networks from these 9 features and found that higher levels of current depression severity were associated with greater connectivity of our a priori depression-associated linguistic network across participants. Crucially, we then leveraged the longitudinal nature of this dataset to study how connectivity changes within-subject as their mental health changes. Participants retrospectively reported periods of time when they had a depressive episode in the past year and we constructed networks for ‘within’ and ‘outside’ these dates. We demonstrated that the connectivity of depression-related linguistic networks increased within-subject as participants moved into periods of depression. This was true of our a priori network, but crucially also for a range of 9-node networks constructed from randomly selected text features that were related to overall cross-sectional depression symptom severity. That is, networks constructed from depression-relevant text features were more likely to become tightly connected during an episode than networks constructed from depression-irrelevant text features.

Network theory offers a compelling explanation for the heterogeneity of disorders⁴⁸ and is supported by patients’ experiences of causal relationships between symptoms^49,50 and the efficacy of cognitive behavioural therapy, which aims in part to diminish associations between symptoms⁵¹. However, there has been conflicting evidence in the literature regarding whether individuals with a mental illness have greater symptom network connectivity than healthy participants⁵². These results can partially be explained by the over-reliance on cross-sectional data, which potentially averages out individual differences in network connectivity. Two prior studies found preliminary within-subject evidence of an increase in network connectivity during an acute phase of mental illness^24,25. However, both involved only one participant. In this study, we established an increase in within-subject connectivity of a depression-relevant network during a depressive episode in a large sample. Crucially, we also leveraged Twitter text features as a tool for estimating personalised depression networks, rather than using self-report data. While that can be construed as a strength of our investigation, it is also a major weakness; there remains a critical gap in testing if self-reported symptoms would behave in a similar manner.

Using network analysis to understand individual vulnerability to depression is a promising avenue for potentially developing novel and personalized interventions. This is because symptoms with a high strength centrality are thought to have a disproportionate ability to activate or deactivate other symptoms. For example, evidence from a cross-sectional social anxiety disorder network suggests that changes in the most central symptoms in anxiety networks are predictive of more distributed changes in symptoms⁵³. In a prospective study of anorexia nervosa patients, higher levels of the most central symptoms at baseline were negatively associated with successful recovery (more so than less central symptoms)⁵⁴. However, a major caveat of research thus far is the use of cross-sectional networks to derive key insights – in some cases these align with personalised networks, but in others not²³. It is hoped that a push towards the development of more individualised approaches to network estimation will allow us to translate these basic findings into clinical practice. This might take the form of targeting symptoms that are, for the individual, most central, thereby preventing an individual from developing a disorder in the first place⁵⁵. To realise this potential, interpretable depression features will likely be essential. While we believe our data shows an interesting generalisation of network theory beyond self-report symptoms, more work will be needed to extract clinically actionable insights, if they exist, from the study of linguistic features of depression.

Our study was not without limitations and caveats. First, we are not suggesting that Twitter posts will ever (or should ever) be used to make clinical decisions. People on social media tend to selectively express their emotions, i.e. impression management, which obfuscates their true emotional state⁵⁶. Some use these platforms for work, to sell things, for self-promotion and in some cases, to vent their emotions. Therefore, the indirect assessment of depression-relevant language through text analysis will always lead to data that is substantially noisier than otherwise obtained via ESM and would never be of sufficient quality, in our view, to make individualised predictions. Indeed, the effect sizes reported here are low. This is in part because the linguistic features in tweets have low correlations with overall depression severity and also because our definition of a depressive episode was broader than is typical and based on a retrospective report. It therefore remains of key importance to establish if networks of gold-standard assessments of self-report depression symptoms display the same characteristics of the linguistic features studied here and to establish if effect sizes are clinically meaningful in such datasets. That said, we believe the present findings are of significant theoretical importance in two key ways.

First, in a large enough sample, we can use noisy data like this to test key aspects of network theory. The broad alignment of our findings with the prior literature (e.g., overall association of network connectivity and depression severity) and predictions of network theory (e.g. within subject changes in connectivity during episodes) affirm there is clear signal in these data. There is significant potential, we believe, in using such data to answer questions that are otherwise practically impossible using EMA. Second, the proof of principle established here suggests that other sources of linguistic data that are potentially more indicative of current mood (e.g., text messages, speech) could be mined to help deliver personalized warning signs to individuals. In this context, it is noteworthy to also acknowledge that our ground-truth measure with respect to depressive episodes is a retrospective report and likely subject to errors in recall. This would only serve to diminish our effect size further, which in our view, makes our results more compelling. In a real-world context, it is likely that changes in connectivity associated with episodes of depression are stronger than reported here.

Other limitations to our data include the fact that social media users tend to be younger, better educated, wealthier, and politically more liberal than the general population, posing potential generalizability problems^57,58. Additionally, online workers on platforms similar to Clickworker have their own unique sociodemographic profile and crucially, endorse higher rates of a range of mental health problems than the general public^59,60. In the present study, a large proportion of participants had received a mental health diagnosis in the past and more than half reported a depressive episode in the past year. The high rates of the latter were likely partially inflated by our decision to require just 2 symptoms of depression (low mood and reduced interest) to have been present consistently for a two-week period (instead of the usual 5 of 7)⁶¹, but are likely to be partially due to the known profile of online workers. It remains to be tested if the findings from this study will generalize to individuals recruited via other means and indeed through clinical settings. The majority of text and sentiment analysis libraries used to examine the association between language features and mental health are only available in English⁶². Significant differences have been shown in the use of negative and positive emotions, personal pronouns, articles, and other lexical attributes between social media users in western (U.S. and U.K.) and non-western (India and South Africa) countries⁶³. The vast majority of our sample came from predominantly English-speaking countries. Because of this, we do not know how these associations may generalise to different languages and cultural settings. Similar to several other personalised network papers^64,65, we found that none of the nodes in our a priori network was normally distributed. Network analysis assumes that all nodes are multivariate normally distributed⁶⁶; however, the extent to which edge and centrality estimates are affected by deviations from normality is not yet known. Finally, the LIWC is only able to account for the proportion of words in a particular category, e.g., proportion of 1st person pronouns in a passage of text. Any context or more subtle usage of language, such as irony, that would change the underlying emotional meaning of a text is not captured by this method. More broadly, there are a range of more sophisticated analytical approaches when it comes to the content and sentiments in tweets that may prove stronger indicators of depression (but see⁶⁷) and thus better candidate nodes to construct depression-relevant networks. We chose instead to use an established library and to focus on language features previously shown to relate to depression to keep a degree of independence in the datasets used to derive depression-relevant features and the one (here) used to study how their network composition changes through time. Future work might draw on alternative methods and have greater power to interrogate network dynamics in these datasets.

We found support for two of the principal predictions of network theory using a proxy for longitudinal (historical) EMA. Specifically, we found that the connectivity (partial correlation) between a set of pre-defined linguistic features of depression relates to an individual’s current depression symptom severity. Moreover, we found that this network connectivity increases within-subject during a depressive episode. Future work can utilize this methodological approach to test and refine key aspects of network theory. Elevated network connectivity within an episode was not specific to the a priori LIWC text features chosen; it generalised to a broader set of linguistic features that are associated with depression severity and future research might elect to utilize the best performing network we identified here. Whether these findings generalise to other aspects of mental health is not yet known. Recent work suggests that there are a host of commonalities across various aspects of mental health in their use of language on Twitter. Regardless of whether there is some degree of specificity of the nodes that comprise such networks, it will be interesting to determine whether within-subject increases in network connectivity occur during the acute phase of other mental illnesses, such as bipolar disorder. Given the vast amount of data available and its longitudinal archival nature, social media network analysis is a promising method for testing some of the tougher predictions of network theory, albeit using very different ‘markers’ of depression.

Methods

Participants

We recruited 1,713 participants for this study. The majority were recruited on Clickworker (N = 1,395), an online worker platform, and were paid €2.50 for their participation. A smaller number participated voluntarily (i.e. without payment) and were recruited through general advertising on Twitter and in print media (N = 318). Participants were included for analysis if they were at least 18 years old, had a Twitter account with at least 30 days of tweets, and if at least 50% of their tweets were in English. They were also required to pass an attention check, a combination of a captcha and an item with an obvious correct response (“Please select ‘A little’ if you are paying attention”). Of the 1,713 participants recruited, 99 were excluded due to failing the attention check and a further 668 participants were excluded for either not having at least 30 days of tweets or having fewer than 50% of their tweets in English. After excluding these participants, 946 participants were brought forward for analysis. Participants had a mean age of 29.6 years (SD: 10.6, range: 18-66), and a majority were female (65.2%), currently unemployed (51.6%), and resided in either the U.K. (35.9%) or U.S. (50.7%).

More than half (59.0%) of the sample reported at least one depressive episode in the past year (mean: 1.56 episodes, SD: 0.81) with an average duration of 104.06 days (SD: 97.06) and 45.7% reported being diagnosed by a physician with depression at some point in their life. Participants were asked to self-report the dates of any depressive episodes in the past year; a depressive episode was defined as a period of at least two weeks with low mood and loss of interest or pleasure in activities every day or nearly every day. Participants who reported at least one depressive episode tweeted (β = 84.7, SE = 38.4, p = 0.03) and liked the Tweets of others (β = 193.8, SE = 70.0, p = 0.006) more frequently than participants without a depressive episode, but did not retweet significantly more (β = 49.8, SE = 32.6, p = 0.13). Individuals who reported a depressive episode in the past 12 months were younger (β = −2.2, SE = 0.70, p = 0.002), more female (χ²(5, N = 946) = 26.0, p < 0.001), less likely to be employed (χ²(1, N = 946) = 4.3, p = 0.04), and more likely to have been diagnosed with depression by a physician (χ²(1, N = 946) = 178.5, p < 0.001). Individuals who reported a depressive episode were also more likely to have a lower educational attainment than those without a depressive episode (χ²(6, N = 946) = 28.0, p < 0.001). There was no significant difference in country of residence for individuals with versus without a depressive episode (χ²(5, N = 946) = 8.1, p = 0.15) (Table 2). Participants recruited through Clickworker tweeted (β = −217.8, SE = 41.5, p < 0.001), retweeted (β = −162.0, SE = 35.4, p < 0.001), and liked other posts (β = −227.0, SE = 76.5, p = 0.003) significantly less than people who were not paid for their participation (Table S1). Furthermore, although there was no difference between groups in the percentage of depressive episodes in the past year (χ²(1, N = 946) = 0.42, p = 0.52), paid participants were significantly less likely to have ever been diagnosed by a physician with depression (χ²(1, N = 946) = 6.9, p = 0.01).

Table 2 Demographic and Twitter use characteristics of sample.


Procedure

After providing informed consent, participants were asked to complete a self-report questionnaire and provide their Twitter handle which was used to collect the most recent (max 3,200) tweets and (max 3,200) likes from their account. Tweets were collected using a data collection app written in Python using the Twitter developer’s Application Programming Interface. Participants were asked to provide their age, gender, country of residence, current employment status, and highest educational attainment. They were also asked if they have ever been diagnosed by a physician with depression and if yes to provide the approximate date of diagnosis. Next, they completed a self-report depression questionnaire to establish their current symptom severity levels. In the first wave of recruitment, 263 participants completed the Centers for Epidemiologic Studies Depression scale⁶⁸ (CES-D 8). In subsequent recruitment waves, the remaining 1,450 participants completed the Zung Self-Rating Depression Scale (SDS)⁶⁹ instead. We combined scores from the two depression scales by standardizing each scale by its mean and standard deviation. Finally, participants were asked to report up to five depressive episodes in the past year. A depressive episode was defined as a period of at least two weeks in which the participant felt both low mood and loss of interest or pleasure in hobbies and activities nearly every day for most of the day. We chose this definition to increase the sensitivity for detecting depressive episodes and to reduce participant burden by only requiring the two essential components of a depression diagnosis. Episodes were recoded to be “not depressed” if they were shorter than 2 weeks in duration and were merged together if separated by fewer than 2 weeks (effectively recoding intervening days as also being depressed).
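The episode recoding rule described above can be illustrated with a minimal Python sketch. It assumes inclusive start and end dates and assumes that short episodes are dropped before neighbouring episodes are merged; the text does not state the order of these two steps, and the function name `clean_episodes` and the example dates are hypothetical.

```python
from datetime import date

MIN_EPISODE_DAYS = 14  # episodes shorter than 2 weeks are recoded as "not depressed"
MIN_GAP_DAYS = 14      # episodes separated by fewer than 2 weeks are merged


def clean_episodes(episodes):
    """Return cleaned (start, end) date pairs; both dates are inclusive."""
    # Step 1 (assumed order): drop episodes that last fewer than 14 days.
    kept = sorted(
        (start, end)
        for start, end in episodes
        if (end - start).days + 1 >= MIN_EPISODE_DAYS
    )
    # Step 2: merge neighbours separated by fewer than 14 non-depressed days,
    # which recodes the intervening days as depressed.
    merged = []
    for start, end in kept:
        if merged:
            prev_start, prev_end = merged[-1]
            gap_days = (start - prev_end).days - 1
            if gap_days < MIN_GAP_DAYS:
                merged[-1] = (prev_start, max(prev_end, end))
                continue
        merged.append((start, end))
    return merged


# Hypothetical self-reported episodes for one participant.
example = [
    (date(2020, 1, 1), date(2020, 1, 20)),   # 20 days, kept
    (date(2020, 1, 28), date(2020, 2, 20)),  # 7-day gap to the previous episode, merged
    (date(2020, 5, 1), date(2020, 5, 5)),    # 5 days, dropped
    (date(2020, 8, 1), date(2020, 8, 31)),   # kept as a separate episode
]
cleaned = clean_episodes(example)
```

Reversing the two steps could change the result when a short episode sits close to a long one, so the order is worth fixing explicitly in any reimplementation.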

Pre-processing and text analysis

We restricted our analysis to tweets published in the 12 months prior to survey completion. Before text analysis, extraneous information was removed from tweets including: reply symbol (@), hashtag symbol (#), emojis, punctuation, links (URLs), and all other non-alphanumeric characters. Periods, exclamation points, and question marks were the only punctuation retained because they are necessary to calculate the number of words per sentence. Tweets were aggregated into daily bins and text analysis was then performed on all tweets published per day per user. Daily observations were chosen to increase the amount of text for reliable estimation of text features. Text analysis of daily Tweets was carried out using the Linguistic Inquiry and Word Count (LIWC 2015) dictionary⁷⁰. The LIWC is a dictionary comprised of approximately 6,400 words and word-stems with 90 different output variables including: linguistic characteristics (e.g. articles and pronouns), psychological constructs (e.g. sadness and positive emotions), and general text information (e.g. punctuation and word count). The LIWC has been used in prior studies that reported a relationship between Twitter sentiments, text features, and depression^26,39. As an initial step to verify that these features were picking up depression symptomology, we averaged each feature over the past 12 months and then tested for correlation with current depression severity.
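As a rough illustration of the cleaning and daily aggregation steps, the following Python sketch strips URLs and every character other than ASCII letters, digits, whitespace, periods, exclamation points, and question marks, then joins each day's tweets into one text. The regular expressions, the choice to delete characters rather than replace them with spaces, and the function names are assumptions made for this example, not the authors' code.

```python
import re
from collections import defaultdict

URL_PATTERN = re.compile(r"https?://\S+|www\.\S+")
# Everything except ASCII letters, digits, whitespace and . ! ? is removed
# (this covers @, #, emojis and all other punctuation).
DISALLOWED_PATTERN = re.compile(r"[^A-Za-z0-9\s.!?]")
WHITESPACE_PATTERN = re.compile(r"\s+")


def clean_tweet(text):
    """Remove links and non-alphanumeric characters, keeping sentence-final punctuation."""
    text = URL_PATTERN.sub(" ", text)
    text = DISALLOWED_PATTERN.sub("", text)
    return WHITESPACE_PATTERN.sub(" ", text).strip()


def bin_by_day(tweets):
    """Aggregate (iso_date, text) pairs into one cleaned text per day."""
    daily = defaultdict(list)
    for day, text in tweets:
        cleaned = clean_tweet(text)
        if cleaned:  # days whose tweets clean to nothing are skipped
            daily[day].append(cleaned)
    return {day: " ".join(texts) for day, texts in sorted(daily.items())}


# Hypothetical tweets for one user.
daily = bin_by_day([
    ("2021-03-01", "So tired. #mood"),
    ("2021-03-01", "Why?"),
    ("2021-03-02", "https://t.co/x"),
])
```

Deleting apostrophes turns contractions such as "can't" into "cant", which may affect dictionary matching; a real pipeline would need to check how the chosen dictionary handles such tokens.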

Among the 9 averaged LIWC text features, any value more than 3 standard deviations from the group mean for that text feature was subsequently removed. Approximately 1.1% of data in the full sample of participants was excluded using this criterion.
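The 3 standard deviation exclusion rule can be sketched in a few lines of Python. This example assumes the sample standard deviation and marks excluded values as `None` so that the remaining values stay aligned with participants; both choices are assumptions, as is the function name.

```python
from statistics import mean, stdev


def mask_outliers(values, n_sd=3.0):
    """Replace values more than n_sd standard deviations from the mean with None."""
    centre = mean(values)
    spread = stdev(values)  # sample SD; an assumption, the text does not specify
    return [v if abs(v - centre) <= n_sd * spread else None for v in values]


# Hypothetical averaged feature values for 20 participants, one extreme value.
feature_values = [0.0] * 19 + [100.0]
masked = mask_outliers(feature_values)
```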

Feature specification

We selected 9 LIWC text features a priori for network analysis based primarily (but not wholly) on the findings of De Choudhury et al. (2013), who found that the following text features had relevance to self-reported depression severity: 1st person singular (“1st Pers. Sing.”: 24 words incl. “I”, “me”, “mine”), 1st person plural (“1st Pers. Pl.”: 12 words incl. “we”, “our”), 2nd person (“2nd Pers.”: 30 words incl. “you”, “your”), and 3rd person (“3rd Pers.”: 28 words incl. “she”, “they”) pronouns, negative (“Neg. Emo.”: 744 words incl. “hurt”, “ugly”) and positive emotions (“Pos. Emo.”: 620 words incl. “love”, “nice”), swear (“Swear”: 131 words incl. “damn”), articles (“Articles”: 3 words: “a”, “an”, “the”), and negation (“Negate”: 62 words incl. “not”, “never”) words. Specifically, these words were found to have either a significant change in mean, variance, momentum, or entropy in their sample at a stringent correction for multiple comparisons. Based on findings from prior work, however, we did not average the 1st person plural and singular pronouns together into a single 1st person pronoun category. Prior work has shown they have bidirectional associations to depression and indeed, we confirmed this in our data as well^28,36,41. For each day with tweets in the past year, we used the LIWC to calculate the proportion of words in that day's text that belonged to each of the LIWC’s 87 categories. This resulted in a time-series for each of the 87 text feature categories for each participant. Days without tweets were not considered or assigned any value and consequently participants who tweeted less often had fewer days in their time series than participants who tweeted every day.
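The idea of a dictionary-based daily feature can be shown with a small Python sketch. The LIWC dictionary is proprietary, so `TOY_CATEGORIES` below is a hypothetical stand-in with a handful of words per category; the trailing `*` convention for word stems and the tokenisation rule are likewise assumptions for illustration.

```python
import re

# Hypothetical miniature dictionary; NOT the LIWC 2015 word lists.
TOY_CATEGORIES = {
    "first_person_singular": ["i", "me", "mine", "my"],
    "negative_emotion": ["hurt", "ugly", "sad*"],  # "*" marks a word stem
    "negation": ["not", "never", "no"],
}


def matches(word, entry):
    """True if the word equals the entry, or starts with it when the entry is a stem."""
    if entry.endswith("*"):
        return word.startswith(entry[:-1])
    return word == entry


def category_proportions(text, categories=TOY_CATEGORIES):
    """Proportion of words in the text that fall into each category."""
    words = re.findall(r"[a-z0-9]+", text.lower())
    if not words:
        return {name: 0.0 for name in categories}
    return {
        name: sum(any(matches(w, e) for e in entries) for w in words) / len(words)
        for name, entries in categories.items()
    }


# One day's aggregated text gives one observation per category.
day_features = category_proportions("I am not sad. Sadness never helps me")
```

Applying `category_proportions` to each day's text, in date order, yields one time series per category, with no entry for days without tweets.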

Network analysis

Networks were constructed by examining the correlation between these text feature time series (nodes), using regularized partial correlations to determine the contemporaneous association between text features⁷¹. The contemporaneous association is based on the residuals of the lag-1 correlation and removes any temporal effects due to other variables measured at the same time point^22,72. Individual node strength, the sum of the absolute values of partial correlations into a node, and global network strength, the average of node strength across all nodes, are the primary indicators of network connectivity in psychological networks. Personalised networks were estimated for each participant using the graphicalVAR (version 0.2.4) package with LASSO regularisation. Regularized partial correlations control for associations between all other nodes in a network with high specificity. Consequently, an edge that is present in a regularized network likely represents a true edge rather than a false positive. However, as we were not focused on particular edges between nodes, but rather the broader characterisation of ‘connectedness’, we set the hyperparameter (gamma) to 0, which, although still regularised, causes the model to prefer more connections over fewer. This avoided a situation where many edges would be returned as 0 and is the same approach applied in a prior study using cross-sectional networks¹¹. A range of 10 tuning parameters (lambdas) was considered for each person’s model (nLambda = 10).
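The two connectivity measures defined above can be written out directly. The following Python sketch takes an already estimated symmetric partial correlation matrix as input; it does not perform the regularised estimation itself, and the 3-node matrix is a made-up example.

```python
def node_strengths(partial_corr):
    """Sum of absolute partial correlations attached to each node (diagonal ignored)."""
    n = len(partial_corr)
    return [
        sum(abs(partial_corr[i][j]) for j in range(n) if j != i)
        for i in range(n)
    ]


def global_strength(partial_corr):
    """Average node strength across all nodes."""
    strengths = node_strengths(partial_corr)
    return sum(strengths) / len(strengths)


# Hypothetical 3-node contemporaneous network.
example_network = [
    [1.0, 0.2, -0.1],
    [0.2, 1.0, 0.0],
    [-0.1, 0.0, 1.0],
]
strengths = node_strengths(example_network)
overall = global_strength(example_network)
```

Because absolute values are used, a negative edge contributes to connectivity as much as a positive edge of the same size.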

Network Connectivity of a priori Network and Current Depression

Using this method, we first estimated personalised networks for all participants (N = 946) and created a mean of these personalised networks to describe the network’s overall composition regarding strength, closeness, and betweenness centrality. Any individual node strength value, among the 9 LIWC text features, that was greater than 3 standard deviations from that node’s group mean was excluded from analysis. Using this exclusion criterion, approximately 2.5% of all node strength values were omitted. In order to test for the reliability of edge strengths in the network, we split our sample into two equal halves, calculated personalised networks for all participants, and then correlated the mean edge strengths between the two halves. We found that among the 36 unique edges in the network, there was a high degree of reliability between the split halves (r(36) = 0.99, p < 0.001). The edge between Neg. Emo. and Swear was much stronger than the other edges; when we excluded this edge, reliability was r(35) = 0.97, p < 0.001. Because individual networks tended to be sparse, the average of most edges tended towards zero, leading to a high correlation between split halves (Figure S1a). Global network strength was not normally distributed (Shapiro-Wilk test, W = 0.95, p < 0.001) and had a rightward skew (Figure S1b). The strength centralities of most nodes, except Swear and Neg. Emo., had a strong right skew due to the relative sparsity of those nodes (Figure S1c). Note we did not calculate closeness and betweenness centralities for individual networks due to edge sparsity and the strong correlation with strength centrality. To test if network connectivity based on the entire 12-month dataset was related to depression symptom severity, we correlated each individual’s network characteristics (i.e., network node strengths) with their current depression severity.
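The split-half reliability check can be sketched as follows. The example assumes that each participant's network has been flattened to a vector of its unique edge weights and that the halves are formed by a random split; the text does not say how the sample was divided, and the function names are hypothetical.

```python
import random
from math import sqrt
from statistics import mean


def pearson(x, y):
    """Pearson correlation; returns 0.0 if either input has zero variance."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / sqrt(sxx * syy)


def split_half_edge_reliability(edge_vectors, seed=0):
    """Correlate mean edge strengths between two random halves of the sample.

    edge_vectors: one list of edge weights per participant, all in the same edge order.
    """
    rng = random.Random(seed)
    order = list(range(len(edge_vectors)))
    rng.shuffle(order)
    half = len(order) // 2
    first = [edge_vectors[i] for i in order[:half]]
    second = [edge_vectors[i] for i in order[half:]]
    # Average each edge across the participants in a half.
    mean_first = [mean(column) for column in zip(*first)]
    mean_second = [mean(column) for column in zip(*second)]
    return pearson(mean_first, mean_second)
```

With 9 nodes there are 9 × 8 / 2 = 36 unique edges, so each participant's vector would have 36 entries.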

Change to a priori network within vs outside depressive episodes

Among participants who reported a depressive episode, we estimated two separate personalised networks for each person, representing periods when they were within and outside a depressive episode, hypothesising that networks would be more tightly connected within compared to outside of an acute episode. Individuals were required to have at least 15 days of tweets both within and outside an episode. We compared network connectivity using regression with “depressive episode” (1 = within episode, 0 = outside episode) as a within-subjects variable predicting node strength. We did the same analysis predicting global network strength.
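A simplified version of this within-subject comparison can be sketched in Python. With exactly one within-episode and one outside-episode value per participant, the regression slope for the binary episode indicator coincides with the mean paired difference, so the sketch reports that difference with a paired standard error. This is a simplification offered for illustration, not the mixed-effects model used in the study, and the function names and example values are hypothetical.

```python
from math import sqrt
from statistics import mean, stdev

MIN_DAYS = 15  # minimum days of tweets required both within and outside an episode


def is_eligible(days_within, days_outside, minimum=MIN_DAYS):
    """Inclusion rule: enough tweet days in both periods."""
    return days_within >= minimum and days_outside >= minimum


def paired_episode_effect(within, outside):
    """Mean within-minus-outside difference, its standard error, and the t statistic.

    within[i] and outside[i] are the network strengths of participant i.
    Assumes the differences are not all identical (non-zero standard error).
    """
    diffs = [w - o for w, o in zip(within, outside)]
    beta = mean(diffs)
    se = stdev(diffs) / sqrt(len(diffs))
    return beta, se, beta / se


# Hypothetical global network strengths for four participants.
beta, se, t_stat = paired_episode_effect(
    within=[0.5, 0.7, 0.6, 0.9],
    outside=[0.4, 0.5, 0.6, 0.6],
)
```

A positive `beta` corresponds to greater connectivity within an episode than outside one.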

Stability checks

Network analysis is made more robust by having fewer nodes, which is why the networks presented here are limited to 9 nodes⁷³. The number of nodes in the current study is well within the typical range (5-11) included in other network papers on depression^12,18,20,21,24. We quantified network stability for each personalised network using the bootnet package (version 1.5). Stability was assessed by repeatedly dropping up to 75% of cases in the sample and correlating the resultant strength centrality to the estimate based on the full sample. Stability was quantified with the correlation stability coefficient (CS coefficient), i.e., the maximum proportion of the sample that can be dropped while retaining a correlation of at least 0.7 with the full sample in 95% of cases, estimated from 1,000 bootstrapped samples. A simulation study by Epskamp, Borsboom, and Fried (2018) proposed using a threshold for CS coefficients above 0.50 to ensure that the ordering of centralities is interpretable. Personalised networks of all participants were found to be highly stable (mean strength CS coefficient: 0.65, SD: 0.12) along with networks constructed from within (mean strength CS coefficient: 0.50, SD: 0.23) and outside (mean strength CS coefficient: 0.64, SD: 0.13) a depressive episode.
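The logic of a case-dropping stability check can be illustrated with a pure-Python sketch. It is not the bootnet implementation: node strength is computed here from plain pairwise correlations as a stand-in for the regularised estimator, only a coarse grid of drop proportions is tried, and all function and parameter names are hypothetical.

```python
import random
from math import sqrt


def pearson(x, y):
    """Pearson correlation; returns 0.0 if either input has zero variance."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / sqrt(sxx * syy)


def correlation_strengths(rows):
    """Stand-in estimator: node strength from absolute pairwise correlations."""
    columns = list(zip(*rows))
    n = len(columns)
    return [
        sum(abs(pearson(columns[i], columns[j])) for j in range(n) if j != i)
        for i in range(n)
    ]


def cs_coefficient(rows, strength_fn=correlation_strengths,
                   drop_props=(0.25, 0.5, 0.75), n_boot=1000,
                   min_corr=0.7, min_share=0.95, seed=0):
    """Largest drop proportion for which strength still correlates at least
    min_corr with the full-sample strength in at least min_share of subsamples."""
    rng = random.Random(seed)
    full = strength_fn(rows)
    cs = 0.0
    for prop in sorted(drop_props):
        n_keep = max(3, round(len(rows) * (1 - prop)))
        ok = 0
        for _ in range(n_boot):
            subset = rng.sample(rows, n_keep)  # drop cases without replacement
            if pearson(full, strength_fn(subset)) >= min_corr:
                ok += 1
        if ok / n_boot >= min_share:
            cs = prop
    return cs
```

In a personalised network the cases being dropped are the days in one participant's time series, so `rows` would hold one row of feature values per day.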

Generalisability of findings to other depression networks

We based our initial analysis on an a priori network of text features previously linked to depression. The idea behind this was to be as conservative as possible and ensure independence of the selection of features from this dataset. Following this proof of principle, we tested if this finding would extend to other depression-associated linguistic networks (i.e. networks constructed from other text features that were significantly associated with depression severity). Crucially, this analysis controls for the possibility that networks of any kind would be more strongly connected within vs outside an episode. We thus hypothesised that networks comprising text features significantly associated with depression would show greater within-subject changes in connectivity compared to those not associated with depression. To test this, we constructed a list of all depression-relevant and depression-irrelevant text features from LIWC, based on an arbitrary threshold of p < 0.05 for their bivariate association with depression severity (see supplement Figure S5 for full list of associations). We then selected 1000 random sets of 9 features from each of the depression-relevant and irrelevant lists and estimated personalised networks for all 2000 sets. We did this twice for each participant, once based on Tweets that were published within an episode and once based on Tweets outside an episode. This allowed us to test if global network connectivity was greater within versus outside an episode for each of the 2000 networks, using regression with “depressive episode” (1 = within episode, 0 = outside episode) as a within-subjects variable predicting node strength (exactly the same analysis as for the a priori network). Finally, we took the 2000 betas for the within-subjects variable ‘depressive episode’ in these analyses (Fig. 4a: 1000 depression-relevant and 1000 depression-irrelevant betas) forward to a general linear regression to determine if the extent to which networks became more connected during an episode was greater in depression-relevant compared to depression-irrelevant networks of language use.
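The sampling scheme for this generalisability analysis can be sketched in Python. The example splits features by the p < 0.05 threshold, draws random 9-feature sets from each list, and summarises the final comparison as a difference in mean betas, which is what the slope of a linear regression with a single binary predictor estimates. Whether sampled sets were required to be unique is not stated, so duplicates are allowed here; all names are hypothetical.

```python
import random
from statistics import mean


def split_by_relevance(p_values, alpha=0.05):
    """Split feature names by their bivariate association with depression severity."""
    relevant = sorted(f for f, p in p_values.items() if p < alpha)
    irrelevant = sorted(f for f, p in p_values.items() if p >= alpha)
    return relevant, irrelevant


def sample_feature_sets(features, n_sets=1000, set_size=9, seed=0):
    """Draw n_sets random node sets, each sampled without replacement."""
    rng = random.Random(seed)
    return [tuple(sorted(rng.sample(features, set_size))) for _ in range(n_sets)]


def relevance_effect(relevant_betas, irrelevant_betas):
    """Difference in mean episode effect between the two kinds of network."""
    return mean(relevant_betas) - mean(irrelevant_betas)


# Hypothetical p-values for 24 features (12 relevant, 12 irrelevant).
p_values = {f"feature_{i}": (0.01 if i < 12 else 0.5) for i in range(24)}
relevant, irrelevant = split_by_relevance(p_values)
relevant_sets = sample_feature_sets(relevant, seed=1)
irrelevant_sets = sample_feature_sets(irrelevant, seed=2)
```

Each sampled set would then be passed through the same within versus outside episode analysis as the a priori network, giving one beta per set.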

Control analyses

A range of control analyses are presented in the online supplement. These examine several potential confounding influences in network estimation. First, we noted that participants had more data (i.e., more days) outside an episode than within one. To control for the possibility that differences in the number of days might be driving our results, we (i) conducted a permutation test that randomised the identifier “within episode” versus “outside episode” 1000 times within subject and (ii) ran an additional within-subjects regression analysis that included the number of days within and outside an episode as a covariate. Because 3rd person pronouns (she/he, they) were selected a priori, but not significantly associated with current depression severity in this sample, we repeated our analyses omitting this node to ensure our results were not affected by its inclusion. In the LIWC library, certain ‘supra-categories’ are inclusive of multiple sub-categories. For example, within the personal pronoun category are: 1st person singular/plural, 2nd person, 3rd person, and impersonal pronouns. To test if this had a material effect on our results, we repeated our analyses excluding these supra-categories and the results were unchanged (Figure S2).
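The label-shuffling idea behind the permutation test can be illustrated with a short Python sketch. Day labels are shuffled within each participant, which keeps each person's number of within-episode and outside-episode days fixed. For brevity the statistic below is a difference in a daily value rather than a re-estimated network connectivity, and the one-sided p-value is an assumption; all names are hypothetical.

```python
import random
from statistics import mean


def episode_difference(subjects):
    """Mean over subjects of (mean within-episode value - mean outside value).

    subjects: list of (values, labels) pairs with labels 1 = within, 0 = outside.
    """
    diffs = []
    for values, labels in subjects:
        within = [v for v, lab in zip(values, labels) if lab == 1]
        outside = [v for v, lab in zip(values, labels) if lab == 0]
        diffs.append(mean(within) - mean(outside))
    return mean(diffs)


def permutation_test(subjects, statistic=episode_difference, n_perm=1000, seed=0):
    """Shuffle episode labels within each subject and compare with the observed statistic."""
    rng = random.Random(seed)
    observed = statistic(subjects)
    at_least_as_large = 0
    for _ in range(n_perm):
        permuted = []
        for values, labels in subjects:
            shuffled = list(labels)
            rng.shuffle(shuffled)  # label counts per subject are preserved
            permuted.append((values, shuffled))
        if statistic(permuted) >= observed:
            at_least_as_large += 1
    # Add-one correction so the p-value is never exactly zero.
    return observed, (at_least_as_large + 1) / (n_perm + 1)
```

In the full analysis, the `statistic` callable would re-estimate within-episode and outside-episode networks from the permuted labels before comparing their connectivity.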

Mean personalised networks were visualised using the qgraph package (version 1.6.9). Between- and within-subjects regressions were performed using the glm (version 3.6.1) and lmer packages (version 3.1-3). All statistical analyses were performed in R (3.6.1).

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

The raw Twitter data are protected and are not available due to data privacy. The processed Twitter data cannot be shared publicly due to the possibility of participant identification, but are available from the corresponding author on reasonable request. The processed data used to generate figures and tables in this study are provided in the Source Data file. Source data are provided with this paper.

Code availability

The code used to analyse the data in the current study is available at: https://doi.org/10.5281/zenodo.5745764⁷⁴.

References

Borsboom, D. & Cramer, A. O. Network analysis: an integrative approach to the structure of psychopathology. Annu Rev. Clin. Psychol. 9, 91–121 (2013).
Cramer, A. O. et al. Major depression as a complex dynamic system. PLoS ONE 11, e0167490 (2016).
Smith, R., Alkozei, A., Killgore, W. D. & Lane, R. D. Nested positive feedback loops in the maintenance of major depression: An integration and extension of previous models. Brain Behav. Immun. 67, 374–397 (2018).
Lee Pe, M. et al. Emotion-network density in major depressive disorder. Clin. Psychol. Sci. 3, 292–300 (2015).
Wigman, J. T. et al. Exploring the underlying structure of mental disorders: cross-diagnostic differences and similarities from a network perspective using both a top-down and a bottom-up approach. Psychol. Med. 45, 2375–2387 (2015).
Santos, H. Jr., Fried, E. I., Asafu-Adjei, J. & Ruiz, R. J. Network structure of perinatal depressive symptoms in latinas: relationship to stress and reproductive biomarkers. Res Nurs. Health 40, 218–228 (2017).
Heeren, A. & McNally, R. J. Social anxiety disorder as a densely interconnected network of fear and avoidance for social situations. Cogn. Ther. Res. 42, 103–113 (2017).
Segal, A. et al. Changes in the dynamic network structure of PTSD symptoms pre-to-post combat. Psychol. Med. 50, 746–753 (2020).
van Rooijen, G. et al. A state-independent network of depressive, negative and positive symptoms in male patients with schizophrenia spectrum disorders. Schizophr. Res 193, 232–239 (2018).
Jimeno, N. et al. Main symptomatic treatment targets in suspected and early psychosis: new insights from network analysis. Schizophr. Bull. 46, 884–895 (2020).
van Borkulo, C. et al. Association of symptom network structure with the course of [corrected] depression. JAMA Psychiatry 72, 1219–1226 (2015).
McElroy, E., Napoleone, E., Wolpert, M. & Patalay, P. Structure and connectivity of depressive symptom networks corresponding to early treatment response. EClinicalMedicine 8, 29–36 (2019).
Smith, K. E. et al. A comparative network analysis of eating disorder psychopathology and co-occurring depression and anxiety symptoms before and after treatment. Psychol. Med. 49, 314–324 (2019).
Chen, L., Liu, R., Liu, Z. P., Li, M. & Aihara, K. Detecting early-warning signals for sudden deterioration of complex diseases by dynamical network biomarkers. Sci. Rep. 2, 342 (2012).
Dakos, V., van Nes, E. H., Donangelo, R., Fort, H. & Scheffer, M. Spatial correlation as leading indicator of catastrophic shifts. Theor. Ecol. 3, 163–174 (2009).
Scheffer, M. et al. Early-warning signals for critical transitions. Nature 461, 53–59 (2009).
van de Leemput, I. A. et al. Critical slowing down as early warning for the onset and termination of depression. Proc. Natl Acad. Sci. USA 111, 87–92 (2014).
Schweren, L., van Borkulo, C. D., Fried, E. & Goodyer, I. M. Assessment of symptom network density as a prognostic marker of treatment response in adolescent depression. JAMA Psychiatry 75, 98–100 (2018).
Bos, F. M. et al. Cross-sectional networks of depressive symptoms before and after antidepressant medication treatment. Soc. Psychiatry Psychiatr. Epidemiol. 53, 617–627 (2018).
Berlim, M. T., Richard-Devantoy, S., Dos Santos, N. R. & Turecki, G. The network structure of core depressive symptom-domains in major depressive disorder following antidepressant treatment: a randomized clinical trial. Psychol. Med., 1–15, (2020).
Snippe, E. et al. The impact of treatments for depression on the dynamic network structure of mental states: two randomized controlled trials. Sci. Rep. 7, 46523 (2017).
Epskamp, S. et al. Personalized network modeling in psychopathology: the importance of contemporaneous and temporal connections. Clin. Psychol. Sci. 6, 416–427 (2018).
Bos, F. M. et al. Can we jump from cross-sectional to dynamic interpretations of networks? Implications for the network perspective in psychiatry. Psychother. Psychosom. 86, 175–177 (2017).
Wichers, M., Groot, P. C., Psychosystems, E. & Group, E. Critical slowing down as a personalized early warning signal for depression. Psychother. Psychosom. 85, 114–116 (2016).
Bak, M., Drukker, M., Hasmi, L. & van Os, J. An n=1 clinical network analysis of symptoms and treatment in psychosis. PLoS ONE 11, e0162811 (2016).
De Choudhury, M., Gamon, M., Counts, S. & Horvitz, E. in Seventh international AAAI conference on weblogs and social media.
Edwards, T. M. & Holtzman, N. S. A meta-analysis of correlations between depression and first person singular pronoun use. J. Res. Personal. 68, 63–68 (2017).
Coppersmith, G., Dredze, M., Harman, C. & Hollingshead, K. in Proceedings of the 2nd Workshop on Computational Linguistics and Clinical Psychology: From Linguistic Signal to Clinical Reality. 1–10.
Zimmermann, J., Wolf, M., Bock, A., Peham, D. & Benecke, C. The way we refer to ourselves reflects how we relate to others: associations between first-person pronoun use and interpersonal problems. J. Res. Personal. 47, 218–225 (2013).
Rude, S., Gortner, E.-M. & Pennebaker, J. Language use of depressed and depression-vulnerable college students. Cognition Emot. 18, 1121–1133 (2004).
Molendijk, M. L. et al. Word use of outpatients with a personality disorder and concurrent or previous major depressive disorder. Behav. Res Ther. 48, 44–51 (2010).
Zimmermann, J., Brockmeyer, T., Hunn, M., Schauenburg, H. & Wolf, M. First-person pronoun use in spoken language as a predictor of future depressive symptoms: preliminary evidence from a clinical sample of depressed patients. Clin. Psychol. Psychother. 24, 384–391 (2017).
Bathina, K. C., Ten Thij, M., Lorenzo-Luaces, L., Rutter, L. A. & Bollen, J. Individuals with depression express more distorted thinking on social media. Nat. Hum. Behav. 5, 458–466 (2021).
Coppersmith, G., Dredze, M. & Harman, C. in Proceedings of the workshop on computational linguistics and clinical psychology: from linguistic signal to clinical reality. 51–60.
Eichstaedt, J. C. et al. Facebook language predicts depression in medical records. Proc. Natl Acad. Sci. USA 115, 11203–11208 (2018).
De Choudhury, M., Counts, S., Horvitz, E. J. & Hoff, A. in Proceedings of the 17th ACM conference on Computer supported cooperative work & social computing. 626-638.
Al-Mosaiwi, M. & Johnstone, T. In an absolute state: elevated use of absolutist words is a marker specific to anxiety, depression, and suicidal ideation. Clin. Psychol. Sci. 6, 529–542 (2018).
Ten Thij, M. et al. Depression alters the circadian pattern of online activity. Sci. Rep. 10, 1–10 (2020).
Reece, A. G. et al. Forecasting the onset and course of mental illness with Twitter data. Sci. Rep. 7, 13006 (2017).
De Choudhury, M., Counts, S. & Horvitz, E. in Proceedings of the 5th Annual ACM Web Science Conference. 47-56.
Lyons, M., Aksayli, N. D. & Brewer, G. Mental distress and language use: Linguistic analysis of discussion forum posts. Computers Hum. Behav. 87, 207–211 (2018).
Tsugawa, S. et al. in Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems - CHI ‘15 3187–3196 (2015).
Rodriguez, A. J., Holleran, S. E. & Mehl, M. R. Reading between the lines: the lay assessment of subclinical depression from written self-descriptions. J. Pers. 78, 575–598 (2010).
Leis, A., Ronzano, F., Mayer, M. A., Furlong, L. I. & Sanz, F. Detecting signs of depression in tweets in Spanish: behavioral and linguistic analysis. J. Med Internet Res. 21, e14199 (2019).
Capecelatro, M. R., Sacchet, M. D., Hitchcock, P. F., Miller, S. M. & Britton, W. B. Major depression duration reduces appetitive word use: an elaborated verbal recall of emotional photographs. J. Psychiatr. Res. 47, 809–815 (2013).
Lumontod, R. Z. III Seeing the invisible: Extracting signs of depression and suicidal ideation from college students’ writing using LIWC a computerized text analysis. Int. J. Res. 9, 31–44 (2020).
O’Dea, B. et al. The relationship between linguistic expression and symptoms of depression, anxiety, and suicidal thoughts: a longitudinal study of blog content. arXiv preprint arXiv:1811.02750 (2018).
Fried, E. I. & Nesse, R. M. Depression is not a consistent syndrome: An investigation of unique symptom patterns in the STAR*D study. J. Affect Disord. 172, 96–102 (2015).
Frewen, P. A., Allen, S. L., Lanius, R. A. & Neufeld, R. W. Perceived causal relations: novel methodology for assessing client attributions about causal associations between variables including symptoms and functional impairment. Assessment 19, 480–493 (2012).
Frewen, P. A., Schmittmann, V. D., Bringmann, L. F. & Borsboom, D. Perceived causal relations between anxiety, posttraumatic stress and depression: extension to moderation, mediation, and network analysis. Eur. J. Psychotraumatol. 4, https://doi.org/10.3402/ejpt.v4i0.20656 (2013).
Beck, A. T. Cognitive therapy of depression. (Guilford Press, 1979).
Hakulinen, C. et al. Network structure of depression symptomology in participants with and without depressive disorder: the population-based Health 2000–2011 study. Soc. Psychiatry Psychiatr. Epidemiol., https://doi.org/10.1007/s00127-020-01843-7 (2020).
Rodebaugh, T. L. et al. Does centrality in a cross-sectional network suggest intervention targets for social anxiety disorder? J. Consult Clin. Psychol. 86, 831–844 (2018).
Elliott, H., Jones, P. J. & Schmidt, U. Central symptoms predict posttreatment outcomes and clinical impairment in anorexia nervosa: a network analysis. Clin. Psychological Sci. 8, 139–154 (2019).
Fried, E. I. et al. Mental disorders as networks of problems: a review of recent insights. Soc. Psychiatry Psychiatr. Epidemiol. 52, 1–10 (2017).
Newman, M. W., Lauterbach, D., Munson, S. A., Resnick, P. & Morris, M. E. in Proceedings of the ACM 2011 conference on Computer supported cooperative work. 341–350.
Mellon, J. & Prosser, C. Twitter and Facebook are not representative of the general population: Political attitudes and demographics of British social media users. Research & Politics 4, https://doi.org/10.1177/2053168017720008 (2017).
Wojcik, S. & Hughes, A. Sizing up Twitter users. Washington, DC: Pew Research Center (2019).
Ophir, Y., Sisso, I., Asterhan, C. S., Tikochinski, R. & Reichart, R. The turker blues: Hidden factors behind increased depression rates among Amazon’s mechanical turkers. Clin. Psychological Sci. 8, 65–83 (2020).
Shapiro, D. N., Chandler, J. & Mueller, P. A. Using Mechanical Turk to study clinical populations. Clin. Psychological Sci. 1, 213–220 (2013).
American Psychiatric Association. Diagnostic and statistical manual of mental disorders (DSM-5®). (American Psychiatric Pub, 2013).
Medagoda, N., Shanmuganathan, S. & Whalley, J. in 2013 International Conference on Advances in ICT for Emerging Regions (ICTer). 144–148 (IEEE).
De Choudhury, M., Sharma, S. S., Logar, T., Eekhout, W. & Nielsen, R. C. in Proceedings of the 2017 ACM conference on computer supported cooperative work and social computing. 353–369.
Aalbers, G., McNally, R. J., Heeren, A., De Wit, S. & Fried, E. I. Social media and depression symptoms: a network perspective. J. Exp. Psychol.: Gen. 148, 1454 (2019).
Cuevas, A. C., Ots, C. V., Heeren, A. H. & Bentall, R. P. A temporal network approach to paranoia: a pilot study. Front. Psychol. 11, 2359 (2020).
Epskamp, S., Waldorp, L. J., Mõttus, R. & Borsboom, D. The Gaussian graphical model in cross-sectional and time-series data. Multivar. Behav. Res. 53, 453–480 (2018).
Chancellor, S. & De Choudhury, M. Methods in predictive techniques for mental health status on social media: a critical review. NPJ Dig. Med. 3, 1–11 (2020).
Turvey, C. L., Wallace, R. B. & Herzog, R. A revised CES-D measure of depressive symptoms and a DSM-based measure of major depressive episodes in the elderly. Int. Psychogeriatr. 11, 139–148 (1999).
Zung, W. W. A self-rating depression scale. Arch. Gen. Psychiatry 12, 63–70 (1965).
Pennebaker, J. W., Boyd, R. L., Jordan, K. & Blackburn, K. The development and psychometric properties of LIWC2015. (2015).
Epskamp, S., Borsboom, D. & Fried, E. I. Estimating psychological networks and their accuracy: a tutorial paper. Behav. Res Methods 50, 195–212 (2018).
Wild, B. et al. A graphical vector autoregressive modelling approach to the analysis of electronic diary data. BMC Med. Res. Methodol. 10, 1–13 (2010).
Mansueto, A. C., Wiers, R., van Weert, J. C., Schouten, B. C. & Epskamp, S. Investigating the Feasibility of Idiographic Network Models. (2020).
Kelley, S. & Gillan, C. Using linguistic features in social media posts to study the network dynamics of depression longitudinally. Twitter_Depression, https://doi.org/10.5281/zenodo.5745764 (2021).

Acknowledgements

This study was funded by the ‘Institutional Strategic Support Fund’ grant (204814/Z/16/A) to Trinity College Dublin, funded by the SFI-HRB-Wellcome Trust partnership. SWK is funded by a Provost PhD Project Award awarded to CMG. We would like to thank Caoimhe Ni Mhaonaigh and Louise Burke for their assistance in participant recruitment.

Author information

Authors and Affiliations

School of Psychology, Trinity College Dublin, Dublin, Ireland
Sean W. Kelley & Claire M. Gillan
Trinity College Institute of Neuroscience, Trinity College Dublin, Dublin, Ireland
Sean W. Kelley & Claire M. Gillan
Global Brain Health Institute, Trinity College Dublin, Dublin, Ireland
Claire M. Gillan

Contributions

S.W.K.: Conceptualization, methodology, study design, statistical analysis, writing (drafting and editing). C.M.G.: Conceptualization, methodology, study design, statistical analysis, writing (drafting, editing, and supervision).

Corresponding authors

Correspondence to Sean W. Kelley or Claire M. Gillan.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Johan Bollen, Michael Browning, and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Reporting Summary

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Kelley, S.W., Gillan, C.M. Using language in social media posts to study the network dynamics of depression longitudinally. Nat Commun 13, 870 (2022). https://doi.org/10.1038/s41467-022-28513-3

Received: 04 October 2021
Accepted: 21 January 2022
Published: 15 February 2022
DOI: https://doi.org/10.1038/s41467-022-28513-3
