Introduction

Over the last few decades, early identification and intervention of individuals at Ultra-High Risk (UHR) for psychosis has become a major translational research goal, in the hope of developing tailored methods to minimise the risk and impact of conversion to clinically relevant illness1,2. Whilst prediction of conversion to clinically relevant psychosis has been a main point of interest in the UHR populations, it is equally important to understand protective mechanisms present in UHR individuals who do not transition3,4,5. Social6,7,8,9 and cognitive10,11,12,13,14 functioning in UHR may have value in predicting illness progression. In particular, the covariation of social and cognitive measures across time might be useful in identifying and predicting outcome in UHR. To this end, the current study aims to apply novel computational neural network methods to longitudinal social and cognitive data in UHR to predict non-transition (i.e., remittance or maintenance of a nonclinical state).

A significant proportion of UHR individuals including those who remit from UHR conditions15, experience sustained declines in social functioning (i.e., real-life interpersonal behaviour, perceptual abilities, verbal and nonverbal communication)6,7,8,16. Deficits in social skills performance have been identified in schizophrenia. This included individuals at UHR for psychosis, individuals in early stages of psychosis, and individuals with chronic psychosis17,18,19. Studies of social functioning reported a higher degree of social skill impairment in UHR individuals who develop psychosis than in those who do not develop, although the lack of study power must be emphasised, which may indicate that a lack of social skills is a sign of susceptibility to psychosis9,17.

Cognition is another domain linked to poor performance in non-transition UHR individuals in follow-up studies10,11. This has been found within the specific neurocognitive domains of working memory, speed of processing, attention and executive functions12,13,14. Association between social and cognitive functioning factors to UHR individuals transitioning have been described cross-sectionally in the areas of theory of mind8,20, emotion recognition21,22 and attributional bias8.

To the best of our knowledge, no longitudinal studies have elucidated on social and cognitive interaction using advanced predictive tools in identifying and prognosis of non-transitioning UHR individual outcomes. This highlights the importance of longitudinal research on the predictive strength of symptom variables on social and cognitive data interaction that might have high translational potential in clinical psychiatric practice.

Advanced computational and machine learning methods have the potential to provide a method for development tools that can reliably predict individual health outcomes for multimodal and heterogeneous illnesses, such as schizophrenia23,24. Machine learning methods, such as pattern recognition, include computational algorithms that can learn from a large multivariate dataset to make accurate predictions concerning clinical outcome25. The potential of machine learning for prediction of individual outcomes in psychosis was illustrated in a study by de Nijs et al.23. The authors showed that patient-reportable baseline data (e.g., Global Assessment of Functioning (GAF) scores, psychotic symptoms, quality of life, antipsychotics use, psychosocial needs and depressive symptoms predicted global (accuracy 63.5–67.6%) and symptomatic (accuracy 62.2–64.7%) outcomes after 3-year and 6-year follow-ups. Worthington et al. (2022) applied a gradient boosting machine algorithm to clinical and demographic data to predict prodromal symptom remission in a UHR cohort. The classification accuracy was obtained at 60.0–72.0% with a sensitivity of 0.68% and specificity of 0.53%. The Worthington et al.’s study is notable as the first to examine interactive clinical and demographic predictors of symptomatic remission in people who do not progress to psychosis using an advanced data-driven techniques26.

The current study extends the use of data-driven model development to explore the use of a wide range of dynamic markers of social and cognitive variables to predict 2-year symptomatic outcomes and transition status. A subgroup of UHR’s individuals was included based on the clinical status at the final assessment (month 24) and compared them with the Healthy Control (HC) group. Examining the interaction and interrelations between social and cognitive variables over time required incorporating them into one unifying model27. To this end, the computational Spiking Neural Network (SNN) model was used for interaction and classification of social and cognitive data (using 2-year data for training and testing) and prediction (using 18 months of longitudinal data to test the model to predict the outcome at month 24). The results were also compared against other machine learning techniques that are suitable for time-series data for classifying and prognosis.

Results

The results from the designed SNN-based methodology (that is fully explained in the methods section), is depicted graphically in Fig. 1, and consists of the following steps:

  1. (1)

    Visualization of the interaction between social and cognitive variables.

  2. (2)

    UHR outcome classification and prediction.

  3. (3)

    Statistical analysis of the SNN models for assessing the model significance.

Fig. 1: The diagram of the protocol of study includes population, data preprocessing, and the designed SNN-based methodology for visualisation, classification and prediction.
figure 1

a The longitudinal data that were measured from Ultrahigh risk individuals for psychosis and healthy control over a 2-year period; b interpolation of data points between the five measurements T0 (baseline), T1 (after 6 months), T2 (after 12 months), T3 (after 18 months) and T4 (after 24 months) to get the information-trend (time series); c Illustration of the designed methodology, containing encoding the changes in the time series into spike sequences and mapping them into a 3-Dimensional space of an SNN for learning, visualisation, and interaction; d computational modelling for classification and prediction of an individual transitioning outcome using dynamic two-years social and cognitive data.

Visualization and interaction of social and cognitive data

Figure 2a–c illustrate how the SNN models were trained on the longitudinal time-series of social and cognitive data across (a) HC group; (b) Remitter group and (c) Maintained group. The 23 social-cognitive variables (reported in Table 1) were mapped into a 3D SNN, where the spatial coordinates of the input neurons were defined based on the Graph Matching method28 (GM). The GM method positions highly correlated spike sequences of the 23 variables into topologically closer input neurons in the SNN. The intensities of the SNN connections vary across the variables for each group. To demonstrate the differences between the SNN models of HC and UHR subgroups, the average-value of the connection weights (Wa) (Refers to the temporal relationship between two neurons as measured by the time of spike were extracted across every single social and cognitive variable.

Fig. 2: 3D SNN connectivity models in the form of visualization on the longitudinal time series of social and cognitive data across groups (healthy control and UHR).
figure 2

Each model was mapped with a size of 1000 neurons (10 × 10 × 10 dimensions), which were then trained using the Spike Time Dependent Plasticity learning rule (STDP) to visualize 2-year social and cognitive data for a 81 HC samples; b 58 Remitter samples; and c 30 Maintained samples. For the variable mapping to SNN, the more correlated input variables (in terms of correlation in their spike trains) were mapped into nearby input neurons in SNN. The models were initialised using small-word connectivity with a radius set to 2 neuron distance, and then trained using STDP. The blue lines are positive connections (excitatory), and the red lines are negative connections (inhibitory) based on STDP learning. For each SNN model here, the average value of connection weight is reported as a metric below each model.

Table 1 The 23 data variables along with the corresponding social and cognitive domains used in this study.

The average-value of connection weights in the SNN models of HC and Remitter groups were greater (Wij: 1.2 and Wij: 0.97 respectively) than the Maintained group (Wij: 0.52). In addition to the average-value of all the connection wights in each SNN, we also reported a greater detail on the SNN connections in Fig. 3 that better demonstrates the differences between the models: (1) “HC vs Remitter” and (2) “HC vs Maintained”. Figure 3 shows the average-value of connection weights for each variable separately. This is calculated by taking average of connection weights among neurons which are directly connected to the SNN input variables (social and cognitive variables).

Fig. 3: The average of connection weights within 23 clusters of neurons inside the SNN, each of which includes neurons that are directly connected to one of the 23 social-cognitive variables, across three groups: Control group (blue bar), Remitter group (orange bar), and Maintained group (green bar).
figure 3

The components of cognitive BACS test include verbal memory, digit sequencing, token motor task, semantic fluency, symbol coding, and tower of London; the components of social HiSoC test include affect, odd behaviour, and language, and social impersonal; the components of cognitive SNK test include target and distractor accuracy and reaction time; the components of cognitive PC include the filtered, averaged and accuracy of perceptual closure scores; and the cognitive CPT test include the total score.

As shown in Fig. 3, the most prominent discriminative cognitive variables between HC and Remitters are verbal memory and semantic fluency (vegetables) from the Brief Assessment of Cognition in Schizophrenia (BACS) test; filtered score in Perceptual Closure (PC) test; and target and distractor accuracies scores in the Snakes in the grass (SNK) test. The discriminative social variables between HC and Remitters are odd behaviour, language, and social interpersonal from the High-Risk Social Challenge (HiSoC) test. These cognitive variables and the corresponding functions including verbal memory, processing speed, attention, and inhibition; and social functioning, including communication and fluency of speech can be suggested as potential predictive markers for identifying Remitters.

Figure 3 shows that the performance of HC and Maintained groups are different across verbal memory, digit sequencing, token motor task, semantic fluency, and symbol coding from the BACS test; target and distractor accuracies from (SNK) test; all the variables from the PC test and the total score for the Continuous Performance Task (CPT). The social skills performance also differs across HC and Maintained in affect, odd behaviour, language, and social interpersonal from the HiSoC test. These findings suggest that the cognitive and social functions, including verbal memory, working memory, motor speed, processing speed, attention, inhibition, nonverbal affect, communication and fluency of speech are the most discriminative and predictive markers for identifying the Maintained subgroup.

The significance of the trained SNN models across HC and UHR subgroups were examined by calculating the number of spikes that were exchanged between the cognitive and social variables. This is demonstrated by Feature Interaction Network (FIN)29 is shown in Fig. 4. Here, every node represents a cluster of spiking neurons that are centred by one data variable. The neurons are clustered using a dynamic evolving clustering technique which is based on network theory30,31,32. The lines in FIN demonstrate spikes were exchanged between neurons of adjacent clusters during the Spike Time Dependent Plasticity (STDP) learning. The thicker lines suggest greater number of transmitted spikes, representing more temporal interaction and greater exchange of information between the variables. A stronger social and cognitive interaction in the Maintained group is shown in Fig. 4c in comparison with other groups (Fig. 4a, b). This finding can be used to study the causal interaction between variables over time and to predict important variables that may affect other variables to change in UHR individuals’ status.

Fig. 4: The amount of spike interaction during the Spike Time Dependent Plasticity (STDP) learning in the SNN between each two variables from the 23 social and cognitive variables through Feature interaction network (FIN) Model.
figure 4

a HC group; b Remitter group; and c Maintained group. Each node represents a cluster of neurons around one variable (connected directly inside the SNN). Each line represents how two clusters of neurons (representing variables) exchanged spikes during STDP learning. The thicker the line between two variables, the greater the level of spike. interaction between them.

Classification and Prediction of UHR’s individuals’ outcomes

For identification of UHR individuals, we applied classification using SNN classifier of dynamic SNN (explained in the Method section). There is a total number of 169 samples across three classes (n = 81 HC; n = 58 Remitters; and n = 30 Maintained). The classification was performed based on 30 times of cross-validation through a random 50–50% split. The balance accuracy is reported in Table 2a. A predictive modelling experiment was also carried out to investigate how early the SNN can capture the discriminative patterns of changes in social and cognitive data between groups for the purpose of predicting the individual outcomes. The class label for each individual’s data is given in relation to the outcome diagnosed in 24-month. An SNN model was tested (using only the data collected during the first 18 months) to predict the output classes for the three groups of participants at the time of 24-month (label as HC, Remission and Maintained at 24-months).

Table 2 Classification and Prediction of Healthy Control and UHR’s Individuals’ Outcomes Using Spiking Neural Network.

The prediction experiment was based on running a 2-fold cross-validation (random 50–50 split) 30 times, and the average of all 30 balance accuracies was calculated (reported in Table 2b). The 6-month ahead prediction of outcomes (HC, Remission and Maintained) is reported as 80.2% balance accuracy (Table 2b).

The classification and prediction were also conducted using other traditional methods, including k-nearest neighbours (KNN), random forest, and support-vector machine (SVM) through a random 50–50% split performing 30 times. Here, for each individual’s data sample (which is interpolated to time series), all the variables’ time points were concatenated into a single vector, which unlike the SNN model that can reactive temporal data, disregards the intrinsic temporal structure of the interpolated social and cognitive data. To perform a fair comparison, we also applied long short-term memory (LSTM) and Convolutional neural network (CNN) method that can learn from streaming timeseries (Table 3a, b).

Table 3 Classification and Prediction of UHR’s individuals’ outcomes Using Machine Learning Tools.

Statistical analysis

To evaluate the significance of the trained SNN models, a Multivariate Analysis of Variance (MANOVA) tested for differences across groups in connection weights, separately for the social and cognitive tests. Between groups variables included Sex (male, female) and Group (HC, Remitter, Maintained). Sub scores from each test were entered as separate dependent variables. Greenhouse-Geisser corrections were used to correct all violations of the assumption of sphericity. There was a significant main effect of Group in the PC test [F (3) = 3.1, p < 0.03, ƞ2 = 0.05]. There was a significant effect of Sex in PC sub-score (percentage of correctly identified items), with higher connection weights in men than women. There was a significant Group × Sex interaction effect for HiSoC Affect Dimension [F (2) = 3.07, p = 0.05, ƞ2 = 0.04] and HiSoC Social Interpersonal Dimension [F (2) = 3.21, p < 0.04, ƞ2 = 0.04]. Post-hoc tests showed higher scores for Affect in male Remitters compared to Maintained (p < 0.04). In women, on-the-other-hand, the Social Interpersonal Dimension was better in Remitters than Maintained (p < 0.01). Figure 5 shows a raincloud plot of the connections weights as a function of group and social and cognitive performance.

Fig. 5: A raincloud plot of the connections weights as a function of group (healthy control and UHR) and social and cognitive performance.
figure 5

a Distribution of the connection weights comparison between healthy control and maintained groups. b Distribution of the connection weights comparison between HC and Remitter groups.

Discussion

Using a rigorous computational SNN approach, we developed models to investigate the dynamic social and cognitive interactions and to predict 2-year outcomes of UHR’s individuals for psychosis. Findings identified the most important cognitive and social variables (particularly verbal memory, processing speed, attention, affect and interpersonal social functioning) that showed discriminative patterns in the SNN models of HC vs UHR subgroups. The explainable SNN model enabled us to have a better understanding of the interaction of data variables across groups over time (with higher social and cognitive variables interaction for UHR Maintained subgroup than Remitter subgroup). Finally, the accuracy of social and cognitive data classification was higher when using SNN models compared with traditional machine learning methods (e.g., SVM, LSTM, Random Forest and CNN).

UHR state for psychosis and non-transition

Longitudinal modelling of social and cognitive performance revealed that the majority of individuals (64.5%) were remitted during the 24-month follow-up, while about 34% of the individuals maintained their UHR status and only 2% converted to clinically relevant psychosis. These results were consistent with more recent studies that reported that the rate of transition from an UHR status to clinically relevant psychosis has dropped dramatically from over 50% to 15% with follow-ups exceeding 1 year5.

Dynamic social and cognitive changes and interactions

The results showed that participants at UHR for psychosis were differentiated from HC, and Remitters were differentiated from Maintained according to 24-months social and cognitive performance. The total connection weights value showed Control > Remitter > Maintained (Fig. 2a–c). In particular, greater connectivity weights were generated for the “PC” cognitive test in the HC group than in other UHR subgroups. Greater connection weights here reflect better performance. The “PC” task tests ability to complete what is incompletely presented in the visual field to achieve “good” or as complete a figure as possible by perception33. PC scores have previously been found lower in well-established (chronic) schizophrenia than the general population, and was interpreted as reflecting atypical attention, or inability to maintain a major set33.

In between group comparison, HC and Remitter groups differed slightly in the connectivity of symbol coding from BACS test and affect dimension from HiSoC test. This suggests cognitive and social functions in the Remitter group (attention, processing speed and affect) returned to a similar level as the HC after 24 months. Moreover, the connection weight values of some of the cognitive sub-variables increased for the Remitter group to some different degrees. For example, the connection weights for the PC test (correct answer accuracy), SNK (total score accuracy) and BACS (semantic fluency in fruits and animals and token motor task) were higher than in the HC at 24-months. These variables correspond to organisation, visual, motor speed and processing speed domains, respectively. These results were consistent with Kendler et al. (2016) study that reported an improved cognition in individuals at UHR with time34.

The difference between the HC group and the Maintained subgroup are across all the social and cognitive variables except SNK test performance. The connection weight values of SNK sub-variables increased slightly for the Maintained group in relative to HC group. These variables represent cognitive abilities related to visual attention and processing speed. An increase in variability of test performance over time suggest the possibility that the underlying cognitive architecture may have devolved in Maintained group during follow-up. Thus, measures of dedifferentiation of cognitive components may be one of the most powerful factors in later conversion in individuals at risk for psychosis35.

Through statistical analysis of the extracted connection weights, significantly higher connection weights were seen in the Remitters group for HiSoC social interpersonal dimension than in other groups at 24 months that reflect greater co-variation between social interpersonal score and other cognitive measures with time. There was also a significant effect of sex in HiSoC as the Affect dimension was protective in male Remitters and the Social Interpersonal dimension was protective in female Remitters. These findings are in alignment with McGlashan36 that found females had better premorbid social functioning than males. Vila-Rodriguez et al.37 expanded these studies and discovered that females performed better in social interpersonal skills than males. These findings suggest that social interpersonal functioning in females and affective functioning in males could be the most important social variable to differentiate the groups and sex and can be used for predicting progression.

Through FIN analysis in Fig. 4, for the Maintained group, a stronger exchange of information between social and cognitive variables was seen, suggesting that dynamic variations in social and cognitive performance interact differently with time in the different groups. This interaction was more between BACS and SNK from cognitive functioning and interpersonal and odd behaviour from social functioning. According to STDP rule in SNN model, connectivity with greater weights reflects stronger spike transmission between inputs. Thus, the STDP rule develops neural connectivity among the spatially distributed inputs in the SNN model that represents the spatiotemporal relationships between the social and cognitive data variables. Finding is Fig. 4 indicates the strength of intercorrelation between cognitive and social variables and reflect how particularly attention and processing speed deficit in Maintained group can affect the dysfunctionality of other aspects of social skills including communication, and fluency of speech.

UHR’s classification and prediction

The classification of social and cognitive data over 2-years resulted in significantly higher accuracy across groups, compared to other machine learning methods (Table 2a). The prediction accuracy of dynamic social and cognitive data for Remitter and Maintained groups was 80.2% when only 18-months of data was used which suggested the existence of discriminative patterns between the groups at one year and half earlier (Table 2b). Compared to commonly applied predictive risk models for UHR which are based on modelling static variables, the current study is one of the newly advanced studies on developing a temporal-based risk prediction model using longitudinal social and cognitive data and SNN. With machine learning techniques, the spatiotemporal variables of social and cognitive data were converted into one vector of variables. As a result, temporal connections cannot be obtained. In comparison, the computational SNN models allow the integration of dynamic interactions between social and cognitive performance during various neuropsychological tasks performance.

Limitations and future work

There are some limitations to consider. First, we only selected the common social and cognitive variables that were collected across all the five times of follow-ups; thus, we ended up using 23 social and cognitive variables. Second, the SNN models were trained and tested on a small set of data and the findings are valid for the defined scope. However, we demonstrated the potential of our model in UHR’s predictive marker detection and 6-months ahead prediction with a better balance accuracy compared to other machine learning tools. Therefore, the SNN model needs to be further tuned and calibrated using more data for the generalisation purpose. Further longitudinal studies are needed to look at a time period over the 24 months done in the current study with a larger cohort population. Future research might look at the non-transitioned individual differences in time, that is affected by other variables including ethnicity, and culture. Of the ethnic groups reported, schizophrenia is a culture-bound illness, that is, a difference in culture can influence how it manifests. The results of this paper will be further extended through developing a SNN model to learn from multimodal data including genetic, neuroimaging, clinical along with social and cognitive longitudinal data. We plan to conduct personalised profile modelling for every individual using the personalised modelling approach. Our future work also includes enhancing the interpolation phase by optimising the number artificial datapoints between original data measurements.

In conclusion, the research demonstrated the feasibility of a computational SNN approach for longitudinal outcome prediction, in the most difficult to prediction, a non-transition UHR population for psychosis, based on social and cognitive data interaction. Our findings support the importance of temporal changes (over two years) and the co-variation across certain social and cognitive variables in predicting outcomes. The SNN models resulted in a good balanced accuracy for 6-month ahead prediction of UHR outcomes (based on 18-month data) for individuals who are likely to remit and for those who are likely remain stable at 24-month period. Thus, the designed SNN architecture in this study can be suggested as a possible supportive tool for early detection of UHR individuals. The SNN approach has shown to be potential in modelling UHR data. This can benefit psychiatrists and other health professionals in identifying non-converted individuals and allow them to initiate early and appropriate targeted interventions.

Method

Participants and inclusion criteria

The original study conducted in Singapore included n = 384 HC individuals (mean age = 21.7 years and SD = 3.4) and n = 173 UHR individuals (mean age = 21.3 years and SD = 3.5). All participants were between the ages of 14 and 29.

Current inclusion criteria

Only individuals who had their data collected from all variables across all five time points T0-T4 were included in the current study. Thus, a total number of 171 individuals were selected for UHR group (n = 90) and HC group (n = 81); age mean =21.29 and SD = 3.70; 106 males and 65 females. UHR individuals were then categorised into three classes (Remitters, Converters, and Maintained), based on the clinical scores at 24-month follow-up (Fig. 1a). Remitters (n = 58) refers to those UHR participants who were UHR-positive at baseline but did not meet the UHR criteria at the 24-month. Converters (n = 2) refers to those participants who were UHR-positive at baseline and developed a first episode of psychosis within 24-months. Maintained (n = 30) refers to those participants who were UHR-positive at baseline and remained UHR-positive at 24-month period of follow-up. The Converter group was dropped from the current study due to low sample size. Hence, the final study dataset contained 169 individuals (HC, Remitters, Maintained).

Assessment

Multimodal data (demographic, clinical, social, and cognitive) were collected every six months over two years: T0 = baseline assessment, T1 = 6-months, T2 = 12-months, T3 = 18-months and T4 = 24-months follow-ups. In total, twenty-three social and cognitive variables were identified (reported in Table 1) that were determined as common variables collected at baseline (T0) and at all follow-up assessments (T0 - T4).

Clinical assessments

The Comprehensive Assessment of At-Risk Mental States (CAARMS) was employed to determine the UHR’ clinical status38. The UHR subgroup was then categorised into three classes (Remitters and Maintained), based on the CAARMS scores at 24-month follow-up.

Social assessment

The performance-based social skills was measured by the High Risk Social Challenge Task (HiSoC)39. The HiSoC has been used in UHRs for psychosis studies as a standardized and performance-based social skills measure that has demonstrated high levels of validity and reliability. Social skills are measured in terms of the display of affect, odd behaviour, and language, and social-interpersonal when evaluating the task. A 5-point Likert scale is used to rate the 16 items in the task (with higher scores indicating better social skills).

Cognitive assessment

The cognitive assessments are based on four cognitive neuropsychological batteries, including (i) The Brief Assessment of Cognition in Schizophrenia (BACS)40,41 which comprises list learning, digit sequencing, token motor task, semantic fluency, Tower of London, and symbol coding tests; (ii) The Snakes in the grass (SNK)42,43 test of fear-relevant attention (accuracy and reaction time); (iii) The Continuous Performance Test (CPT)44;and iv) Perceptual Closure test (PC)45.

Data analysis

Interpolation

To capture the 2-year trend in these 23 social-cognitive variables across groups, we applied an interpolation technique which is a common practice in clinical data generation46,47. There are examples of successfully applying interpolation techniques that improved the modelling of changes in limited measured datasets, including hippocampus-amygdala in schizophrenia48, heart rate measurement in atrial fibrillation49, and MRI in traumatic brain injury50 and MRI modelling in prediction of dementia51. Durrleman et al. shows estimated timepoints (values) generated by interpolation in a different range of longitudinal datasets inferred continuous shape trajectory from observations sparsely distributed in time52. One of the interpolation methods is linear interpolation that requires knowledge of two data points and assumes constant rate of change between them. This is a process of using known data values to estimate unknown data values. In the current study, a linear interpolation is used to generate the values between the limited number of data measurements (T0-T4) in 2 years. The interpolated data points do not alter the trend of original data and only depict the linear pattern of changes between the five follow-up measurements. In the current study, a linear interpolation is used to generate the values between the limited number of data measurements (T0-T4) in 2 years. The interpolated data points do not alter the trend of original data and only depict the linear pattern of changes between the five follow-up measurements. Interpolation technique allows to generate artificial data point per month, transforming the data from static domain to time series while preserving the original trend of changes across groups. This led to generate 245 times points (time series) for each variable associated with 24 months, plus 5 time-points from the original data collection as shown in Fig. 1b. The interpolation of all the social and cognitive variables to time series, shown in Supplementary Fig. 1, represents the trends in the time series and the gradual changes. This interpolation is conducted prior to modelling the data using machine learning models and SNNs (explained in the following section). For SNN models, the interpolated time series were then encoded into sequences of binary events, called spikes that capture significant upward and downward changes in the 2-year social-cognitive data.

Computational spiking neural network modelling

The SNN model has been developed as a connectionist network framework that both spatial and temporal components of data can be learnt and trained in one model. The SNN model has a 3-dimensional structure of artificial spiking neurons28 and a learning algorithm that learns from streaming data and adapts the connection weights, representing spatio-temporal relationship between variables30. Interpolated social and cognitive variables are used as input30 to the SNN model to learn the patterns of changes in longitudinal data in relation to the outcomes (HC and UHR subgroups).

The data modelling architecture possesses the following modules:

  1. (i)

    Input module: a threshold-based encoding algorithm53 was applied to the interpolated timeseries of each variable and converted them into sequences of spikes, representing significant changes in each variable values that occurred over time. This results in a compact representation of events (spikes) and removing noise. The variables were spatially mapped into input neurons in the SNN to feed the model with the spike sequences. The 3D mapping of the SNN and the initialisation procedure are explained Supplementary material.

  2. (ii)

    SNN training module: a twofold learning process is applied. First, the input spikes are passed into the SNN model for unsupervised learning using Spike Time dependent Plasticity (STDP)54 (Fig. 1c). Then, the trained SNN model is connected to an output layer for supervised learning to capture associations between the trained SNN model and the training sample class labels (HC and UHR subgroups). The supervised learning is based on a Dynamic Evolving SNN (deSNN), shown in (Fig. 1d)55. The detains of the STDP is explained in Supplementary material.

  3. (iii)

    Classification and prediction module: the trained SNN model was tested using the social and cognitive data of those individuals who were excluded from the training process. The testing and training sets were defined, based on a 2-fold cross validation in which 50% of samples were selected randomly as the training set, while and other 50% hold-out samples were used as testing set. This process was repeated 30 times and the random selection approach assured sample permutation. For the classification task, the whole temporal length (T0-T4) of the social and cognitive variables were used for training and testing to detect the outcomes (in month 24). However, for the prediction task, only the first 18 months (T0-T3) of the data were used to test the model for performing a 6-month-ahead prediction of outcomes (at month 24). Since the dataset is imbalanced (Remitted n = 58 and Maintained n = 30), balanced accuracy is calculated as (Sensitivity + Specificity) / 2, reported in confusion Table 2., representing the number of correctly classified and miss-classified samples.

  4. (iv)

    Visualisation, interaction and knowledge discovery module: in the trained SNN, the spatiotemporal connections between the social and cognitive variables were visualised and traditional statistics used to test for differences between groups. Differences between HC and UHRs subgroups in SNN models were also studied by computing spatio-temporal interactions between the social-cognitive variables, using a Feature Interaction Network (FIN).

Figure 1a–d presents the protocol of study, as well as the designed computational SNN-based methodology for data modelling, visualisation, interaction, classification and prediction. The details of the designed SNN computational models including mathematical formulas and algorithms are presented in the Supplementary Material, Section 1.1.

Ethical approval

All aspects of the study were completed in alignment with appropriate regulations and guidelines of Nature journal publishing. Ethical approval was granted by the Singapore National Healthcare Group’s Domain Specific Review Board. Informed consent was obtained from all participants. For those under 21 years of age, the consent was obtained from a legal representative.

Software availability

The software used for the implementation of the designed method can be found at http://www.kedri.aut.ac.nz/neucube/.