Network-based prediction of the disclosure of ideation about self-harm and suicide in online counseling sessions

Xu, Zhongzhi; Chan, Christian S.; Zhang, Qingpeng; Xu, Yucan; He, Lihong; Cheung, Florence; Yang, Jiannan; Chan, Evangeline; Fung, Jerry; Tsang, Christy; Liu, Joyce; Yip, Paul S. F.

doi:10.1038/s43856-022-00222-4

Article
Open access
Published: 06 December 2022

Communications Medicine volume 2, Article number: 156 (2022)

Abstract

Background

In psychological services, the transition to the disclosure of ideation about self-harm and suicide (ISS) is a critical point warranting attention. This study developed and tested a succinct descriptor to predict such transitions in an online synchronous text-based counseling service.

Method

We analyzed two years’ worth of counseling sessions (N = 49,770) from Open Up, a 24/7 service in Hong Kong. Sessions from Year 1 (N = 20,618) were used to construct a word affinity network (WAN), which depicts the semantic relationships between words. Sessions from Year 2 (N = 29,152), including 1168 sessions with explicit ISS that met the inclusion criteria, were used to train and test the downstream ISS prediction model. We divided these sessions into blocks and classified the blocks as ISS blocks (ISSBs), blocks prior to ISSBs (PISSBs), and non-ISS blocks (NISSBs). To detect PISSBs, we adopted complex network approaches to examine the distance among different types of blocks in the WAN.

Results

Our analyses find that words within a block tend to form a module in WAN and that network-based distance between modules is a reliable indicator of PISSB. The proposed model yields a c-statistic of 0.79 in identifying PISSB.

Conclusions

This simple yet robust network-based model could accurately predict the transition point of suicidal ideation prior to its explicit disclosure. It can potentially improve the preparedness and efficiency of help-providers in text-based counseling services for mitigating self-harm and suicide.

Plain language summary

In online counseling, the help-provider is often engaging with several service users simultaneously. New tools that alert and assist the help-provider, and increase their preparedness for getting further help for service users, could therefore be useful. In this study, we developed and tested a new tool that is designed to alert help-providers to the disclosure of self-harm and suicidal thoughts, based on the words that the service user has been typing. The tool is developed on the basis that word usage may have a specific pattern when suicidal thoughts are more likely to occur. We tested our tool using two years’ worth of online counseling conversations and we show that our approach can help to predict the disclosure of suicidal thoughts. As such, we are taking a step forward in helping to improve these counseling services.

Introduction

Online psychological services have grown rapidly since their inception in the 1990s^1,2. These services provide an important avenue for preventing self-harm and suicide, which is a major public health challenge and the leading cause of death among youth worldwide.

Self-harm and suicide risk assessment and intervention are especially challenging in the context of online services because of the lack of contextual and behavioral information and the inability to intervene physically in the crisis³. Anonymity, while a major strength of some online services⁴, also adds to the complexity of providing the needed crisis intervention to those at risk. Self-harm and suicide assessment and intervention in online psychological services primarily rely on the explicit prompting and disclosure of the help-seeker^5,6,7,8,9. In many ways, such explicit forms of disclosure are postdictors rather than predictors of the ideation of self-harm and suicide (ISS)¹⁰. It would be of tremendous theoretical interest and practical use if the shift from non-ISS utterance to the disclosure of ISS could be identified earlier, especially when risk assessment and intervention are not as readily available or deployable as in face-to-face services¹⁰. If the transition point from non-ISS language to ISS language can be predicted, help-providers can prepare for the potential disclosure of ISS. Such preparedness includes, for example, deploying their skills to explore ISS, as many help-seekers tend to conceal their ISS during psychological treatment¹¹. Moreover, in contexts in which the same text-based help-provider is serving multiple texters simultaneously, the prediction of transition points can serve as an alert system that prompts the provider to prioritize and manage their allocation of attention and time. This would potentially improve the efficiency and effectiveness of text-based services by making one-to-many counseling feasible and safe without compromising users’ experience.

To date, research lending empirical support to the prediction of the transition point to ISS is rare. To our knowledge, only three recent studies have investigated the prediction of ISS in social media^10,12,13. The authors share our concern that most studies assess text-based messages that include explicit mentions of ISS, but do not address the prediction of ISS. While those studies have demonstrated some promise of predicting ISS, the deep learning algorithms adopted in them are inherently black boxes; one cannot extract any human-understandable insights from the model, as even its designers cannot explain why such an artificial intelligence arrives at a specific decision. In addition, the linguistic patterns of social media posts and counseling sessions are markedly different: social media posts are self-utterances, whereas counseling consists of messages exchanged between two people. As such, our knowledge about predicting ISS in a text-based online counseling session is still lacking, and a transparent yet robust model for such analysis is needed.

The present study aims to fill the research gap by developing and testing techniques based on complex network theory in large-scale text-based counseling data. To do so, we first construct a word affinity network (WAN) that depicts the semantic relationships between words used in text-based counseling. WAN serves as a substrate on top of which the downstream calculation of network-based distance takes place. We then use the tools of network science^14,15,16 to develop and test the accuracy of a framework predicting the transition point to ISS.

Methods

Approval of ethics

Users gave their consent to using their text data for research purposes by accepting the Privacy Policy before the commencement of service. The Privacy Policy can be seen at https://www.openup.hk/privacy-policy.htm?lang=en. Help-providers gave their consent to using their text data for research purposes by signing consent forms. The study protocol was approved by the Human Research Ethics Committee of University of Hong Kong (EA1709039).

Dataset

Open Up is a free 24/7 online text-based counseling service in Hong Kong designed to provide timely support to youth and young adults experiencing emotional distress^3,17. Since its inception in 2018 to date (March 2021), 224 counselors and volunteers have provided more than 50,000 sessions. Help-seekers have the option to access the service through the service’s website, Facebook, or instant messaging services, including SMS and WhatsApp. At any given time, each counselor can engage with more than one concurrent user. Box 1 shows a fictitious excerpt between a help-seeker and a counselor.

Box 1. An excerpt of a fictitious exchange between a help-seeker and a counselor in English translation. “H” and “C” stand for “help-seeker” and “counselor”, respectively. Refer to Supplementary Methods-A for the Chinese version

H: I am struggling every day to not jump out of the window. I am suffering and I feel so lonely.

C: Hearing this, I am worried about you, Lily

C: It seems that you feel tormented every day.

H: I feel so muddled. What is the point of life?

H: I want to die so badly. I am suffering and I feel so lonely.

C: I am here with you, Lily, you are not alone.

H: I have to wait until October before going back to the clinic for the follow-up consultation. I don’t know how to hold on.

H: I want to jump off the building right now. How can I hold on till then?

C: Is it possible to make an earlier appointment with the doctor?

C: Let them know that you are not feeling well lately?

……

Defining blocks

To introduce the counseling session inclusion criteria, we first define a block, which is the basic unit of analysis in this study. Specifically, each session was divided into consecutive blocks, each containing ten messages from the help-seeker (a block size chosen arbitrarily). Blocks were classified into three types: ideation about self-harm and suicide blocks (ISSBs), blocks prior to ISSBs (PISSBs), and non-ISS blocks (NISSBs) for the remaining blocks. Using blocks as the basic unit, we transformed the problem of predicting the transition point into the problem of detecting PISSBs. Below we define the three block types in detail.

We first define ISSBs. ISS includes thinking about, considering, or planning self-harm and suicide¹⁸. It is considered an important risk factor for self-harm and suicide attempts¹⁹. In this study, the identification of ISS blocks required two steps: Firstly, we relied on C-SSRS rules and keywords-based rules^20,21,22 to obtain a set of preliminary ISS blocks (preliminary ISSBs). Supplementary Methods-B reports words/phrases related to the explicit disclosure of ISS used in this study.

It is recognized that there is currently no one-size-fits-all set of rules for categorizing ISS. Some rules are relatively loose; others are tighter. Several typical categories of false positives are listed in Table 1 and Table S1 (Supplementary Methods-C). Therefore, as the second step, after the round of keyword-based coarse selection, we relied on human coders to refine the coding and confirm that the preliminary blocks were indeed ISSBs.

Table 1 Typical false alarms when relying on keywords matching alone.

The term PISSB is then self-explanatory: It stands for the block immediately prior to an ISSB. All remaining blocks that are neither ISSB nor PISSB were classified as NISSB.
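The block definitions above can be illustrated with a minimal Python sketch. It is not the authors' code: the function names, the `("H", text)` message format, and the choice to keep a trailing block with fewer than ten messages are all assumptions made for illustration. The per-block ISS flags are taken as given, since in the study they come from keyword rules followed by human coding.

```python
from typing import List, Tuple

BLOCK_SIZE = 10  # help-seeker messages per block, as stated in the text


def split_into_blocks(messages: List[Tuple[str, str]],
                      size: int = BLOCK_SIZE) -> List[List[str]]:
    """Group help-seeker ("H") messages into consecutive blocks of `size`.

    Assumption: a trailing block with fewer than `size` messages is kept.
    """
    seeker = [text for speaker, text in messages if speaker == "H"]
    return [seeker[i:i + size] for i in range(0, len(seeker), size)]


def label_blocks(is_iss: List[bool]) -> List[str]:
    """Label each block as ISSB, PISSB, or NISSB from per-block ISS flags."""
    labels = ["ISSB" if flag else "NISSB" for flag in is_iss]
    for i, flag in enumerate(is_iss):
        # The block immediately before an ISSB is a PISSB,
        # unless that block is itself an ISSB.
        if flag and i > 0 and labels[i - 1] != "ISSB":
            labels[i - 1] = "PISSB"
    return labels
```

For example, a session whose third block contains ISS would be labeled NISSB, PISSB, ISSB, and a session whose first block already contains ISS would have no PISSB, which is the case excluded by the inclusion criteria.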

Session inclusion criteria

The analysis included only sessions with at least ten message exchanges to provide sufficient information to make the classification. Sessions were excluded if they contained only ISSB and no PISSB (i.e., insufficient content before ISSB) as our aim was to identify PISSB, which was by definition prior to ISSB.

Network-based PISSB detection

Below, we first provide a brief description of the general framework before introducing the details of the model.

General framework

Words are not scattered randomly in written texts. Instead, they are organized following specific linguistic patterns^23,24,25. The syntactical features of texts can be reflected by complex networks, such as word affinity networks (WANs), which depict the semantic relationships (e.g., word adjacency, word similarity) between words. Words that occur adjacently in texts tend to form localized neighborhoods, known as word modules^23,24,25. Against this backdrop, we first constructed a WAN using a subset of counseling sessions as the corpus. We then examined whether the words in each block formed word modules in the WAN. If this premise was met, we proceeded to a third step, exploring whether the network-based distance from PISSB/NISSB word modules to ISSB word modules was a reliable measure for discriminating PISSBs from NISSBs. The underlying assumption was that a shorter distance between word modules indicates linguistic proximity between text blocks. Below we describe each step in detail.

Step 1: Constructing a word affinity network

According to the overall framework proposed above, we started by constructing a WAN using a subset of sessions, namely those conducted in 2019 (N = 20,618). Unlike English, written Chinese has no explicit boundaries between words. For example, “pine, apple, and elm” in Chinese looks like “pineappleandelm”. Therefore, word segmentation was the first step in constructing the WAN. Jieba, a cutting-edge Chinese word segmentation algorithm, was used to carry out this procedure. Jieba is publicly available at https://github.com/fxsjy/jieba.

The second step in constructing the WAN was to extract the important words to be included in the network. Word filtering is a routine pre-processing step in language modeling because a large proportion of words in any given corpus are usually trivial; they contribute little information and introduce noise to the downstream analyses. We extracted the top 5000 important words from the 20,618 sessions by calculating each word’s term frequency-inverse document frequency (TF-IDF), a commonly used statistic for measuring the importance of a word in a corpus. These words constituted the nodes of the WAN.
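As a rough illustration of TF-IDF-based word selection, the sketch below ranks words by their TF-IDF summed over documents and keeps the top k. The text does not state how per-document TF-IDF values were aggregated into a corpus-level ranking, so the summation, the unsmoothed logarithmic IDF, and the function name `top_words_by_tfidf` are assumptions. Each document is assumed to be an already segmented list of words.

```python
import math
from collections import Counter
from typing import List


def top_words_by_tfidf(docs: List[List[str]], k: int) -> List[str]:
    """Return the k words with the highest TF-IDF summed over all documents."""
    n_docs = len(docs)
    # Document frequency: number of documents containing each word.
    df = Counter()
    for doc in docs:
        df.update(set(doc))
    score = Counter()
    for doc in docs:
        tf = Counter(doc)
        for word, count in tf.items():
            # Relative term frequency times log inverse document frequency.
            score[word] += (count / len(doc)) * math.log(n_docs / df[word])
    # Sort by descending score; break ties alphabetically for determinism.
    return sorted(score, key=lambda w: (-score[w], w))[:k]
```

A word that appears in every document receives an IDF of zero under this formula, which is how trivial, ubiquitous words fall to the bottom of the ranking.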

After determining the words to be included in the network, we proceeded to identify meaningful relationships (i.e., links) between them. Links were identified from two aspects: (1) we considered the well-established co-occurrence relationship between words²³. The co-occurrence of words reflects syntactical features of texts; (2) we also linked up two words in terms of their semantic similarity. In particular, a Word2Vec model using CBOW was trained using these 20,618 sessions. We trained our model with 128 features using the Gensim Python library. By considering two different relationships, affinity can be interpreted as a two-fold measure: word co-occurrence and word similarity, the combination of which is more comprehensive than each one alone. A well-designed network is important because it is the fundamental base map for the whole study. A subgraph of WAN is illustrated in Fig. 1a.
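The two kinds of links can be sketched as follows. This is only an illustration under stated assumptions: co-occurrence is taken to mean that two vocabulary words appear in the same message, similarity is taken to be cosine similarity between embedding vectors above a hypothetical cutoff `min_sim`, and the two edge sets are merged by union. The text does not specify these details, and the toy vectors stand in for the 128-dimensional Word2Vec embeddings.

```python
import math
from itertools import combinations
from typing import Dict, List, Set, Tuple

Edge = Tuple[str, str]


def cooccurrence_edges(docs: List[List[str]], vocab: Set[str]) -> Set[Edge]:
    """Link two vocabulary words if they appear in the same message (assumed unit)."""
    edges: Set[Edge] = set()
    for doc in docs:
        edges.update(combinations(sorted(set(doc) & vocab), 2))
    return edges


def cosine(u: List[float], v: List[float]) -> float:
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def similarity_edges(vectors: Dict[str, List[float]], min_sim: float) -> Set[Edge]:
    """Link two words if their embedding cosine similarity reaches min_sim."""
    return {(a, b) for a, b in combinations(sorted(vectors), 2)
            if cosine(vectors[a], vectors[b]) >= min_sim}


def build_wan(docs: List[List[str]], vectors: Dict[str, List[float]],
              min_sim: float = 0.8) -> Dict[str, Set[str]]:
    """Build an undirected adjacency map from the union of both edge types."""
    graph: Dict[str, Set[str]] = {w: set() for w in vectors}
    for a, b in cooccurrence_edges(docs, set(vectors)) | similarity_edges(vectors, min_sim):
        graph[a].add(b)
        graph[b].add(a)
    return graph
```

Taking the union means a pair of words is linked if either relationship holds, which matches the description of affinity as a two-fold measure; a weighted combination would be an equally plausible reading.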

**Fig. 1: The network-based transition point prediction method.**

Step 2: Modularity test

After the WAN was constructed, we proceeded to conduct the word modularity test. Current literature suggests that words in a paragraph are not scattered randomly in WAN; instead, they tend to form localized neighborhoods, known as word modules^23,25,26. Indeed, many real-world networks are modular, characterized by the existence of tightly connected clusters²⁷. To investigate if there existed the modular topology of words within a session block, we calculated the average number of steps along the shortest paths for all possible pairs of words in the block (i.e., mean shortest path length ${l}_{{block}}$) using

$${l}_{{block}}=\frac{{\sum }_{{v}_{i},{v}_{j}\in {block}}d({v}_{i},{v}_{j})}{{n}^{2}}\tag{1}$$

where $d\left(\cdot \right)$ represents the shortest distance between word ${v}_{i}$ and word ${v}_{j}$, and $n$ denotes the number of words in the block. Correspondingly, a reference distance is defined as the mean shortest path length for a randomly selected group of words of matching size and degree distribution in the WAN. A module was deemed present if the observed distance ${l}_{{block}}$ and the reference distance were statistically different. A one-tailed z-test was performed (Fig. 1b) to test the hypothesis that the mean of the average shortest path between words within blocks was shorter than that among random words. A significance level (alpha) of 5% was used, and a one-tailed test was chosen because the hypothesis is directional. The same method was adopted in previous literature examining the modular topology of drug-targeted proteins in protein-protein interaction networks¹⁴.
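Eq. (1) and the comparison against random word sets can be sketched as below. The sketch assumes an unweighted, connected graph stored as an adjacency map, reads the sum in Eq. (1) as running over all ordered pairs (including a word with itself, at distance zero), and, as a simplification, samples reference sets uniformly rather than matching the degree distribution as the study did. The function names and the number of random draws are hypothetical.

```python
import random
import statistics
from collections import deque
from typing import Dict, List, Set


def bfs_distances(graph: Dict[str, Set[str]], source: str) -> Dict[str, int]:
    """Hop counts from source to every reachable node (breadth-first search)."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        node = queue.popleft()
        for neighbor in graph[node]:
            if neighbor not in dist:
                dist[neighbor] = dist[node] + 1
                queue.append(neighbor)
    return dist


def mean_shortest_path(graph: Dict[str, Set[str]], words: List[str]) -> float:
    """Eq. (1): sum of d(v_i, v_j) over all ordered pairs, divided by n^2.

    Assumes every pair of words is connected in the graph.
    """
    n = len(words)
    total = 0
    for vi in words:
        dist = bfs_distances(graph, vi)
        total += sum(dist[vj] for vj in words)
    return total / (n * n)


def modularity_z(graph: Dict[str, Set[str]], words: List[str],
                 n_random: int = 200, seed: int = 0) -> float:
    """z-score of the observed l_block against uniformly sampled word sets.

    Simplification: reference sets match the block in size only,
    not in degree distribution.
    """
    rng = random.Random(seed)
    nodes = sorted(graph)
    reference = [mean_shortest_path(graph, rng.sample(nodes, len(words)))
                 for _ in range(n_random)]
    observed = mean_shortest_path(graph, words)
    return (observed - statistics.mean(reference)) / statistics.stdev(reference)
```

A strongly negative z-score would indicate that the block's words sit closer together in the network than random words do, which is the direction tested by the one-tailed hypothesis.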

Step 3: Measuring the distance between blocks

We then measured the distance between blocks to investigate whether such a network-based measure reliably indicated PISSBs (i.e., predicted ISSBs). We defined the distance between two blocks x and y as the topological distance between the two blocks’ word modules in the WAN, where the topological distance was evaluated using the recently introduced separation measure, the s-score:¹⁴

$${s}_{(x,y)}=\langle {b}_{(x,y)}\rangle -\frac{\langle {l}_{x}\rangle +\langle {l}_{y}\rangle }{2}\tag{2}$$

where ${l}_{{block}}$ is defined in Eq. (1), and $\langle {b}_{(x,y)}\rangle =\frac{{\sum }_{{v}_{i}\in x,{v}_{j}\in y}d({v}_{i},{v}_{j})}{m\times n}$ is the mean shortest path length between x and y, with m and n the number of words in x and y, respectively. As such, the s-score compares the mean shortest distance within each block, $\langle {l}_{x}\rangle $ and $\langle {l}_{y}\rangle $, to the mean shortest distance $\langle {b}_{(x,y)}\rangle $ between the (x, y) block pair. Topologically, the smaller the s-score is, the closer the two blocks are. See Fig. 1c, d for two examples of the calculation of ${s}_{(x,y)}$.
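A direct transcription of Eq. (2) into Python is given below as a sketch. It assumes that pairwise shortest-path distances have already been computed and stored in a nested dictionary `dist[u][v]`; that table, and the function names, are illustrative choices rather than part of the original method description.

```python
from typing import Dict, List

Dist = Dict[str, Dict[str, float]]


def mean_within(dist: Dist, block: List[str]) -> float:
    """<l_x>: mean shortest distance among words of one block (Eq. 1)."""
    n = len(block)
    return sum(dist[a][b] for a in block for b in block) / (n * n)


def mean_between(dist: Dist, x: List[str], y: List[str]) -> float:
    """<b_(x,y)>: mean shortest distance between words of x and words of y."""
    return sum(dist[a][b] for a in x for b in y) / (len(x) * len(y))


def s_score(dist: Dist, x: List[str], y: List[str]) -> float:
    """Eq. (2): between-block distance minus the average within-block distance."""
    return mean_between(dist, x, y) - (mean_within(dist, x) + mean_within(dist, y)) / 2


def mean_s_to_issbs(dist: Dist, block: List[str], issbs: List[List[str]]) -> float:
    """Mean s-score between one block and every block in a set of ISSBs."""
    return sum(s_score(dist, block, y) for y in issbs) / len(issbs)
```

On a four-word chain a-b-c-d, the blocks {a, b} and {c, d} have a between-block mean of 2 and within-block means of 0.5 each, giving an s-score of 1.5, whereas comparing a block with itself gives 0.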

Based on the definition of distance between two blocks, we can further calculate the mean distance between two sets of blocks. In particular, the mean distance between a set of PISSBs and a set of ISSBs ${\bar{s}}_{({PISSBs},\,{ISSBs})}$ was formulated as ${\bar{s}}_{({PISSBs},\,{ISSBs})}=\frac{{\sum }_{x\in {PISSBs},y\in {ISSBs}}{s}_{{xy}}}{u\times w}$. Similarly, the mean distance between a set of NISSBs and a set of ISSBs ${\bar{s}}_{({NISSBs},\,{ISSBs})}$ was calculated by the following equation ${\bar{s}}_{({NISSBs},\,{ISSBs})}=\frac{{\sum }_{x\in {NISSBs},y\in {ISSBs}}{s}_{{xy}}}{v\times w}$. Here, $u,v,w$ denote the number of blocks in PISSB set, NISSB set, and ISSB set, respectively. A z-test was performed to examine if the mean values of the two scenarios were statistically different.

Step 4: Optimal classification threshold

The last step was to determine the specific cutoff s-score ${\bar{s}}_{(m,\,{ISSBs})}$ that could be used to evaluate whether a block m is a PISSB. A recommended approach for such a task is to find the threshold that corresponds to the highest sum of sensitivity (i.e., recall, or true positive rate) and specificity (i.e., true negative rate) across all possible threshold values. Formally, we formulated the PISSB detection problem as a binary classification problem (PISSBs vs. NISSBs). The PISSB set and NISSB set together were randomly split into a training set (80%) and a test set (20%). Given a block m in the test set, we aimed to identify whether m was a PISSB or an NISSB by comparing ${\bar{s}}_{(m,\,{ISSBs})}$ with a threshold t, where ${\bar{s}}_{(m,\,{ISSBs})}=\frac{{\sum }_{x=m,y\in {ISSBs}}{s}_{{xy}}}{1\times w}$ by definition (see Eq. (2)), and t was the optimal threshold that maximized sensitivity plus specificity in the training set. If ${\bar{s}}_{(m,\,{ISSBs})} < t$, m was classified as a likely PISSB. Otherwise, it was classified as an NISSB.
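The threshold search described above can be sketched in a few lines. The sketch assumes that each training block already has a mean s-score to the ISSB set and a boolean label (True for PISSB), tries every observed score as a candidate threshold, and applies the decision rule from the text, namely that a score below t predicts a PISSB. The function name and the candidate set are illustrative assumptions.

```python
from typing import List, Tuple


def best_threshold(scores: List[float], is_pissb: List[bool]) -> Tuple[float, float]:
    """Return (t, sensitivity + specificity) maximizing the sum on the given data.

    Decision rule: a block is predicted to be a PISSB when its score is below t.
    """
    n_pos = sum(is_pissb)
    n_neg = len(is_pissb) - n_pos
    best_t, best_sum = scores[0], -1.0
    for t in sorted(set(scores)):
        true_pos = sum(1 for s, y in zip(scores, is_pissb) if y and s < t)
        true_neg = sum(1 for s, y in zip(scores, is_pissb) if not y and s >= t)
        total = true_pos / n_pos + true_neg / n_neg
        if total > best_sum:  # keep the smallest threshold among ties
            best_t, best_sum = t, total
    return best_t, best_sum
```

Maximizing sensitivity plus specificity is equivalent to maximizing Youden's J statistic, since J equals that sum minus one.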

Statistics and reproducibility

Data preprocessing and analysis were performed using Python (version 3.6). Network was visualized using Gephi (version 0.9.2). Given its sensitivity, the raw transcript data cannot be made available to the public.

When comparing two groups of interest in this study, a one-tailed z-test was used because we aimed to test for a relationship in one direction (that is, significantly greater than or significantly less than). A two-tailed test would test both whether the mean of X is significantly greater than and whether it is significantly less than the mean of Y. In that case, the mean of X is considered significantly different from the mean of Y if the test statistic falls in either the top or the bottom tail of its probability distribution, and the direction of the difference is lost.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Results

Based on session inclusion criteria, two years’ worth of counseling sessions (N = 49,770) from Open Up were included in the analysis. Sessions from the year 2019 (N = 20,618) were used to construct the WAN (i.e., Step 1). Sessions from the year 2020 (N = 29,152), including 4583 with explicit ISS, were used to train and test the model (i.e., Steps 2, 3, and 4). Among the 4583 sessions, 3415 (74.5%) had ISSBs but no PISSBs. They were excluded from the analysis.

All valid sessions were divided into blocks. In the current analysis, we focused on the first ISSB in each session with ISS. Consequently, 1168 ISSBs and 1168 corresponding PISSBs were extracted from the 2020 dataset. The same number of NISSBs (N = 1168) were randomly sampled from the 2020 dataset as counterparts.

Modularity test

The resulting WAN from the 2019 dataset contained 5000 words selected by TF-IDF. For the 3504 blocks (i.e., the sum of 1168 ISSBs, 1168 PISSBs, and 1168 NISSBs), the average shortest path length in WAN between words within a conversation block was 2.04 (SD = 0.29), whereas the measure was 2.67 (SD = 0.36) between random words, which was 31% longer (z = −4.9, p < 0.05). This informs us that, in WAN, words within a block tend to form word modules. An example of a word module derived from a block is shown in Fig. 1e.

The s-score

The mean distance between the PISSB set (N = 1168) and the ISSB set (N = 1168) ${\bar{s}}_{({PISSBs},\,{ISSBs})}$ was 0.23 (SD = 0.03). The mean distance between the NISSB set (N = 1168) and the ISSB set ${\bar{s}}_{({NISSBs},\,{ISSBs})}$ was 0.30 (SD = 0.05). Figure 2a demonstrates the distribution of the ${\bar{s}}_{({PISSBs},\,{ISSBs})}$ and ${\bar{s}}_{({NISSBs},\,{ISSBs})}$. This suggests that network-based distance provides a potential opportunity to distinguish PISSBs from NISSBs as their distance to ISSBs is statistically different. The z-statistic (z = −8.8, p < 0.05) suggests that the observed difference between ${\bar{s}}_{({PISSBs},\,{ISSBs})}$ and ${\bar{s}}_{({NISSBs},\,{ISSBs})}$ cannot be attributed to chance.

Optimal decision threshold

Per Step 4 of the Methods, we used 80% of the blocks in the PISSB set and the NISSB set (N = 1668) as the training set to seek the optimal decision threshold t, and used the remaining PISSBs and NISSBs (N = 668) to evaluate the performance at t. The ROC curve on the training set is presented in Fig. 2b. It plots the true positive rate (i.e., sensitivity) against the false positive rate (i.e., 1–specificity) for all possible cutoff values. In this study, the threshold was 0.27 (the red circle in Fig. 2b, which corresponds to the highest sensitivity plus specificity). Under this classification threshold, the c-statistic was 0.792 for the training set and 0.787 for the test set, indicating good performance of the s-score in detecting PISSBs. We replicated the experiment 50 times. In each trial, NISSBs were re-sampled from the counterpart pool. The mean c-statistic for the test set was 0.773 (SD = 0.019). This result shows that the difference between PISSBs and NISSBs is robust and generalizable. Figure 3 illustrates a subset of the shortest paths among three randomly sampled blocks in the test set.
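For readers less familiar with the c-statistic, the sketch below computes it in its pairwise form: the proportion of (PISSB, NISSB) pairs in which the PISSB has the lower score, with ties counted as one half. This is the standard equivalence between the c-statistic and the area under the ROC curve; the function name and the toy scores are illustrative and are not data from the study.

```python
from typing import List


def c_statistic(pissb_scores: List[float], nissb_scores: List[float]) -> float:
    """Probability that a random PISSB scores lower than a random NISSB.

    Lower scores indicate PISSBs here, so a 'concordant' pair is one in
    which the PISSB's score is below the NISSB's score. Ties count 0.5.
    """
    concordant = 0.0
    for p in pissb_scores:
        for q in nissb_scores:
            if p < q:
                concordant += 1.0
            elif p == q:
                concordant += 0.5
    return concordant / (len(pissb_scores) * len(nissb_scores))
```

A value of 0.5 corresponds to chance-level ranking and 1.0 to perfect separation, so the reported values near 0.79 mean that roughly four out of five PISSB and NISSB pairs are ordered correctly by the s-score.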

**Fig. 3: Illustration of 18 shortest paths.**

Discussion

In this study, a network-based framework was developed and tested to predict disclosure of self-harm and suicide ideation before its emergence. By doing so, the framework can, in effect, identify users of a text-based online counseling service who may soon disclose ISS in the session. We found that network-based distance between a given block and ISS blocks ${\bar{s}}_{(m,\,{ISSBs})}$ was a reliable indicator of the likelihood of expressing ISS afterward. The c-statistic was 0.792 for the training set and 0.787 for the test set. This suggests that the language patterns in our study context can accurately predict the emergence of self-harm and suicide ideation. It is noteworthy that existing studies mainly focused on discriminating text with ISS from non-ISS text. What we have done in the present study was to take a step forward by identifying text that would likely precede the disclosure of ideation about self-harm and suicide.

Another contribution of this study is incorporating network science into the solution of NLP tasks. Standard deep learning models, which are the typical way of dealing with NLP problems, are often adequately accurate. Yet they are complex, computationally taxing, and essentially black boxes whose decisions are difficult to justify to end users. The (fortunate) low base rate of ISS utterances also poses the challenge of finding a needle in a haystack, because the rare samples are usually not adequate to train the large number of parameters in deep learning models. As an alternative, we made use of complex network theory to develop a network-based approach to processing language. We demonstrated that the network-based framework is simple yet powerful, adding online conversation to a growing list of fields of research that are actively capitalizing on it, such as drug combinations¹⁴, protein–protein interactions¹⁵, disease–disease relationships¹⁶, and drug repurposing for COVID-19²⁸. This study serves as an important step to advance suicide risk prediction in the growing field of online psychological services.

The model developed in this study has several important practical implications. First, like counselors in other synchronous online text-based services, Open Up counselors often need to engage multiple users simultaneously in order to meet the service demand. The proposed method can therefore improve counselors’ readiness to provide timely service to those with heightened suicidal risk by alerting them when the model detects a PISSB. This real-time alert is critically important because help-seekers who may soon express suicidal thoughts require the counselor’s full attention. Without this tool, a counselor involved in multiple concurrent chats might be caught off-guard. Monitoring the sessions using the present model could help to improve efficiency and enhance users’ experience.

We give an example of how this method may be implemented in the context of online text-based counseling services: A signal can be turned on in the provider’s platform if an impending ISS is predicted by the model. Seeing the signal, the counselor can choose to adjust his or her risk assessment/management strategies. After a while (e.g., ten exchanges from the help-seeker), the counselor can re-evaluate if the prediction is accurate: A table showing the five shortest paths appears to guide the counselor in understanding the underlying mechanisms of the model’s judgment, such that counselors can quickly understand the logic and then judge the accuracy and usefulness of the model. Table 2 demonstrates an example of a set of such shortest paths with respect to the sample conversation in Fig. 3. Path length between two words is calculated using one minus affinity score such that a larger value represents a longer path length. Lastly, by incorporating the feedback from the counselor, the accuracy and usefulness of the model can be improved recurrently.
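The explanatory shortest paths mentioned above could be retrieved with a standard weighted shortest-path search. The sketch below uses Dijkstra's algorithm with edge length equal to one minus the affinity score, as stated in the text; the nested-dictionary representation of affinities, the function name, and the toy words are assumptions made for illustration.

```python
import heapq
from typing import Dict, List, Optional, Tuple


def shortest_path(affinity: Dict[str, Dict[str, float]], source: str,
                  target: str) -> Optional[Tuple[List[str], float]]:
    """Dijkstra's algorithm with edge length = 1 - affinity (affinity in [0, 1]).

    Returns (path, length), or None if target is unreachable from source.
    """
    best = {source: 0.0}
    previous: Dict[str, str] = {}
    heap = [(0.0, source)]
    while heap:
        length, node = heapq.heappop(heap)
        if node == target:
            break
        if length > best.get(node, float("inf")):
            continue  # stale queue entry
        for neighbor, score in affinity[node].items():
            candidate = length + (1.0 - score)
            if candidate < best.get(neighbor, float("inf")):
                best[neighbor] = candidate
                previous[neighbor] = node
                heapq.heappush(heap, (candidate, neighbor))
    if target not in best:
        return None
    path = [target]
    while path[-1] != source:
        path.append(previous[path[-1]])
    return path[::-1], best[target]
```

Because high affinity translates into short edges, a path through two strongly related intermediate words can be shorter than a direct but weak link, which is what makes the listed paths interpretable to a counselor.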

Table 2 The shortest paths and their lengths.

Second, and relatedly, the implication is that the model can remind counselors, especially new recruits and volunteers, to consider altering their strategies to better elicit disclosure of suicidal ideation as people tend to conceal their suicidal thoughts during psychological support and treatment¹¹. An alert flagging a PISSB can help the counselor and their supervisor make adequate adjustments in their services.

Third, the forecasting of forthcoming suicidal ideation by the proposed method can be more sensitive and objective than human judgment because it is not influenced by subjective opinions, which may differ from counselor to counselor. This is especially relevant to new recruits and volunteers, who might possess different levels of skills and experience.

Fourth, the current study focused on suicidal ideation, the prevention of which is of the highest priority among many online counseling services. However, we are hopeful that the same method and technology can be applied to predicting the disclosure of other types of risk. Now that the model toward the prediction of suicidal ideation was developed and tested, modifying it with respect to other outcomes of interest is the next step.

Certainly, false positives might be an issue if the proposed model were to be implemented in a wider context. In our 2020 dataset, 15.7% of the sessions were annotated as containing ISS disclosures, a rate considerably higher than that in the general population^29,30. In a wider deployment, extra attention should be paid to false positives, as the base rate of ISS in the general population is likely smaller.

It should be noted that, while satisfactory, the accuracy of the model has room for further improvement. The proposed work is a retrospective study that aims to demonstrate the effectiveness of the model. Future research should consider using an up-to-date corpus to construct the WAN and conducting prospective analysis. This holds great promise for model improvement because a more recent corpus might involve unseen words and relationships, such as “COVID-19-bankruptcy”. Relatedly, the word affinity network should be updated regularly when deployed in frontline systems. Also, early detection of conversations with self-harm and suicide ideation should not be equated with the prevention of suicide itself. Because the dataset we used lacks behavioral outcome data, we must acknowledge the limitation that the absence of explicitly expressed suicidal ideation should not be interpreted as the actual absence of such thoughts or plans. Similarly, the expression of suicidal ideation should not be interpreted as imminent suicide risk per se¹⁸. The model we developed should be treated as merely an additional tool to augment the service provider’s risk assessment.

Secondly, the model developed was based on a large-scale Cantonese database. While we are optimistic that the method can be readily applied to other languages or contexts, the generalizability of our findings requires further empirical support. Relatedly, the database was from an online text-based counseling service; we do not assume the same modules or model are directly applicable to other types of text-based conversations.

Third, we recognize that what crisis workers say might be as useful as what help-seekers say in predicting suicidal ideation; the proposed model, however, cannot yet handle the two tracks of language. A more sophisticated model (e.g., a deep learning model) might deal with the two-track language patterns and further improve the accuracy, but at the cost of explainability.

Conclusion

Our study demonstrated the ability to predict suicidal ideation prior to its emergence in the context of an online text-based counseling service. By detecting and alerting the help provider before ideation emerges, our model enables the counselors to enhance their preparedness and offer timely intervention. It can potentially also help improve service efficiency through improved triaging and timely deployment of human resources.

Data availability

Given its sensitivity, the raw transcript data cannot be made available to the public. The source data underlying Figs. 2a, b and 3 can be found at https://doi.org/10.6084/m9.figshare.20292096.

Code availability

The code used to perform the analysis can be found on GitHub³¹. Data preprocessing and analysis were performed using Python (version 3.6). Network was visualized using Gephi (version 0.9.2).

References

1. Alao, A. O., Yolles, J. C. & Armenta, W. Cybersuicide: the internet and suicide. Am. J. Psychiatry 156, 1836–1837 (1999).
2. Kiesler, S. From the couch to the keyboard: psychotherapy in cyberspace. Cult Internet 87–116 (1997).
3. Yip, P. et al. A 24-hour online youth emotional support: opportunities and challenges. Lancet Reg Heal - West Pacific 4, 0–2 (2020).
4. Reeves, A., Bowl, R., Wheeler, S. & Guthrie, E. The hardest words: exploring the dialogue of suicide in the counselling process—a discourse analysis. Couns. Psychother. Res. 4, 62–71 (2004).
5. Bantilan, N., Malgaroli, M., Ray, B. & Hull, T. D. Just in time crisis response: suicide alert system for telemedicine psychotherapy settings. Psychother. Res. 0, 1–11 (2020).
6. Wilks, C. R., Coyle, T. N., Krek, M., Lungu, A. & Andriani, K. Suicide ideation and acceptability toward online help-seeking. Suicide Life-Threatening Behav. 48, 379–385 (2018).
7. Nobles, A. L., Glenn, J. J., Kowsari, K., Teachman, B. A. & Barnes, L. E. Identification of imminent suicide risk among young adults using text messages. Conf. Hum. Factors Comput. Syst. Proc. 1–11 (2018).
8. Bagroy, S., Kumaraguru, P. & De Choudhury, M. A social media-based index of mental well-being in college campuses. Conf. Hum. Factors Comput. Syst. Proc. 1634–1646 (2017).
9. Coppersmith, G., Leary, R., Crutchley, P. & Fine, A. Natural language processing of social media as screening for suicide risk. Biomed. Inform. Insights 10, 117822261879286 (2018).
10. De Choudhury, M., Kiciman, E., Dredze, M., Coppersmith, G. & Kumar, M. Discovering shifts to suicidal ideation from mental health content in social media. Conf. Hum. Factors Comput. Syst. Proc. 2098–2110 (2016).
11. Blanchard, M. & Farber, B. A. It is never okay to talk about suicide: patients’ reasons for concealing suicidal ideation in psychotherapy. Psychother. Res. 30, 124–136 (2020).
12. Wang, N. et al. Learning models for suicide prediction from social media posts. Preprint at arXiv:2105.03315 (2021).
13. Roy, A. et al. A machine learning approach predicts future risk to suicidal ideation from social media data. npj Digit. Med. 3, 1–12 (2020).
14. Cheng, F., Kovács, I. A. & Barabási, A.-L. Network-based prediction of drug combinations. Nat. Commun. 10, 1197 (2019).
15. Kovács, I. A. et al. Network-based prediction of protein interactions. Nat. Commun. 10, 1–8 (2019).
16. Menche, J. et al. Uncovering disease-disease relationships through the incomplete interactome. Science 347, 841 (2015).
17. Siu, P. et al. The Opportunities and Challenges of the First Three Years of Open Up, an Online Text-Based Counselling Service for Youth and Young Adults. 1–10 (2021).
18. Klonsky, E. D., May, A. M. & Saffer, B. Y. Suicide, suicide attempts, and suicidal ideation. Annu. Rev. Clin. Psychol. 12, 307–330 (2016).
19. Gliatto, M. F. Evaluation and treatment of patients with suicidal ideation. Am. Fam. Phys. 59, 1500–1506 (1999).
20. Gaur, M. et al. Characterization of time-variant and time-invariant assessment of suicidality on Reddit using C-SSRS. PLoS ONE 16, 1–21 (2021).
21. Gaur, M. et al. Knowledge-aware assessment of severity of suicide risk for early intervention. Web Conf. 2019 - Proc. World Wide Web Conf. WWW 2019 514–525 (2019).
22. Sawhney, R., Manchanda, P., Mathur, P., Shah, R. & Singh, R. Exploring and learning suicidal ideation connotations on social media with deep learning. Proc. 9th Workshop Comput. Approach. Subject. Sent. Soc. Media Anal. 167–175 (2019).
23. Montemurro, M. A. & Zanette, D. H. Keywords and co-occurrence patterns in the Voynich manuscript: an information-theoretic analysis. PLoS ONE 8, e66344 (2013).
24. de Arruda, H. F. et al. Paragraph-based representation of texts: a complex networks approach. Inf. Process. Manag. 56, 479–494 (2019).
25. de Arruda, H. F. et al. Representation of texts as complex networks: a mesoscopic approach. J. Complex Netw. 6, 125–144 (2018).
26. de Arruda, H. F., Costa, L. da F. & Amancio, D. R. Topic segmentation via community detection in complex networks. Chaos 26 (2016).
27. Lynn, C. W. & Bassett, D. S. How humans learn and represent networks. Proc. Natl Acad. Sci. USA 117, 29407–29415 (2020).
28. Gysi, D. M. et al. Network medicine framework for identifying drug repurposing opportunities for COVID-19. Proc. Natl Acad. Sci. USA 118, 1–11 (2020).
29. Paffard, M. Suicidal ideation. In: Acute Medicine: A Symptom-Based Approach. 415–420 (2014).
30. Dalglish, S. L., Melchior, M., Younes, N. & Surkan, P. J. Work characteristics and suicidal ideation in young adults in France. Soc. Psychiatry Psychiatr. Epidemiol. 50, 613–620 (2015).
31. Xu, Z. zhongzhixu/WAN: code in the paper (v1.0.0). https://doi.org/10.5281/zenodo.7277707 (2022).

Acknowledgements

We are grateful for the funding support from the Hong Kong Jockey Club Charities Trust and the Collaborative Research Fund (C7151-20G), which provided the resources to set up and operate Open Up. Support from the strategic theme-based research of The University of Hong Kong is also gratefully acknowledged.

Author information

Authors and Affiliations

School of Public Health, Sun Yat-sen University, Guangzhou, China
Zhongzhi Xu
Hong Kong Jockey Club Centre for Suicide Research and Prevention, The University of Hong Kong, Hong Kong SAR, China
Zhongzhi Xu, Yucan Xu, Lihong He, Florence Cheung, Evangeline Chan, Jerry Fung, Christy Tsang, Joyce Liu & Paul S. F. Yip
Department of Psychology, The University of Hong Kong, Hong Kong SAR, China
Christian S. Chan
Department of Psychology and Linguistics, International Christian University, Tokyo, Japan
Christian S. Chan
School of Data Science, City University of Hong Kong, Hong Kong SAR, China
Qingpeng Zhang & Jiannan Yang

Contributions

Z.X., Q.Z., C.S.C., and P.Y. formulated the idea. Z.X., L.H., F.C., E.C., J.F., and C.T. performed the literature review and annotated the suicidal ideation. Z.X. and Y.X. developed the model and conducted the experiments. Z.X., C.S.C., Q.Z., J.Y., and P.Y. analyzed and interpreted the results and wrote the article. All authors had full access to all data (including statistical reports and tables) in the study and take responsibility for the integrity of the data and the accuracy of the data analysis.

Corresponding authors

Correspondence to Christian S. Chan or Paul S. F. Yip.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Communications Medicine thanks Guillermo de la Parra and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Peer Review File

Supplementary Material

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Xu, Z., Chan, C.S., Zhang, Q. et al. Network-based prediction of the disclosure of ideation about self-harm and suicide in online counseling sessions. Commun Med 2, 156 (2022). https://doi.org/10.1038/s43856-022-00222-4

Received: 09 November 2021
Accepted: 23 November 2022
Published: 06 December 2022
DOI: https://doi.org/10.1038/s43856-022-00222-4
