Syntactic complexity and diversity of spontaneous speech production in schizophrenia spectrum and major depressive disorders

Schneider, Katharina; Leinweber, Katrin; Jamalabadi, Hamidreza; Teutenberg, Lea; Brosch, Katharina; Pfarr, Julia-Katharina; Thomas-Odenthal, Florian; Usemann, Paula; Wroblewski, Adrian; Straube, Benjamin; Alexander, Nina; Nenadić, Igor; Jansen, Andreas; Krug, Axel; Dannlowski, Udo; Kircher, Tilo; Nagels, Arne; Stein, Frederike

doi:10.1038/s41537-023-00359-8

Download PDF

Article
Open access
Published: 29 May 2023

Syntactic complexity and diversity of spontaneous speech production in schizophrenia spectrum and major depressive disorders

Katharina Schneider¹,
Katrin Leinweber ORCID: orcid.org/0000-0002-4648-1335²,
Hamidreza Jamalabadi²,
Lea Teutenberg²,
Katharina Brosch^2,3,
Julia-Katharina Pfarr^2,3,
Florian Thomas-Odenthal^2,3,
Paula Usemann^2,3,
Adrian Wroblewski^2,3,
Benjamin Straube^2,3,
Nina Alexander^2,3,
Igor Nenadić^2,3,
Andreas Jansen^2,3,
Axel Krug ORCID: orcid.org/0000-0002-0564-2497⁴,
Udo Dannlowski⁵,
Tilo Kircher^2,3,
Arne Nagels¹^na1 &
…
Frederike Stein^2,3^na1

Schizophrenia volume 9, Article number: 35 (2023) Cite this article

1996 Accesses
4 Citations
Metrics details

Subjects

Abstract

Syntax, the grammatical structure of sentences, is a fundamental aspect of language. It remains debated whether reduced syntactic complexity is unique to schizophrenia spectrum disorder (SSD) or whether it is also present in major depressive disorder (MDD). Furthermore, the association of syntax (including syntactic complexity and diversity) with language-related neuropsychology and psychopathological symptoms across disorders remains unclear. Thirty-four SSD patients and thirty-eight MDD patients diagnosed according to DSM-IV-TR as well as forty healthy controls (HC) were included and tasked with describing four pictures from the Thematic Apperception Test. We analyzed the produced speech regarding its syntax delineating measures for syntactic complexity (the total number of main clauses embedding subordinate clauses) and diversity (number of different types of complex sentences). We performed cluster analysis to identify clusters based on syntax and investigated associations of syntactic, to language-related neuropsychological (verbal fluency and verbal episodic memory), and psychopathological measures (positive and negative formal thought disorder) using network analyses. Syntax in SSD was significantly reduced in comparison to MDD and HC, whereas the comparison of HC and MDD revealed no significant differences. No associations were present between speech measures and current medication, duration and severity of illness, age or sex; the single association accounted for was education. A cluster analysis resulted in four clusters with different degrees of syntax across diagnoses. Subjects with less syntax exhibited pronounced positive and negative symptoms and displayed poorer performance in executive functioning, global functioning, and verbal episodic memory. All cluster-based networks indicated varying degrees of domain-specific and cross-domain connections. Measures of syntactic complexity were closely related while syntactic diversity appeared to be a separate node outside of the syntactic network. Cross-domain associations were more salient in more complex syntactic production.

Testing theory of mind in large language models and humans

Article Open access 20 May 2024

Delirium

Article 12 November 2020

Mendelian randomization analyses reveal causal relationships between brain functional networks and risk of psychiatric disorders

Article 09 May 2024

Introduction

Given the limitations of current psychiatric classification, a number of studies have attempted to disentangle the heterogeneity and comorbidity across affective and psychotic disorders (i.e., major depressive disorder (MDD), schizophrenia spectrum disorder (SSD), bipolar disorder (BD)) using transdiagnostic and multivariate approaches including symptomology, neuroimaging, and blood-specimen measures^{1,2,3,4,5,6,7}. However, studies have failed to identify reproducible biomarkers for the aforementioned psychiatric disorders^8,9,10. Recent studies have highlighted the importance of speech features as speech aberrations have a high prognostic value for onset, course, chronicity, and treatment response of SSD as well as MDD¹¹. Hereof, speech is considered to be an objective and specifically quantitative measure for obtaining and analyzing that is reproducible, time efficient and non-invasive in nature^12,13.

A few typical language-related symptoms of SSD include reduced speech production^14,15 and reduced performances in verbal fluency tasks^16,17 along with difficulty in word-retrieval which lead to word approximations¹⁸, production of neologisms^19,20, and less complexity of sentences^{14,15,21,22,23,24,25}. These linguistic aberrations can even be observed in early stages prior to manifestation of the disorder^25,26,27. In MDD, speech is mainly characterized by longer response latencies and reduced spontaneous speech¹³. Additionally, depressive speech often contains a higher rate of first-person singular pronouns and self-focused language, which is characterized by words related to sad emotions and the past^28,29. A greater amount of more truncated and more impersonal sentences was detected on the syntactic level in comparison to healthy controls (HC)²⁸. These alterations in multiple domains of speech can be summarized as the qualitative rating of formal thought disorder (FTD)³⁰. FTD is not an unique symptom of SSD, but occurs in other psychiatric disorders such as MDD^31,32,33. While there is much evidence about reduced syntactic complexity in SSD, it remains unknown how individuals with SSD differ from HC and those with MDD concerning the use of different types of subordinate clauses^30,33.

The number of produced simple sentences reveals no differences between SSD and HC³⁴. The simplest syntactic version on a sentence level is a main clause without embedded sentences, whereas complex sentences consist of multiple merged clauses³⁵. Complex sentences can be considered coordinated structures which contain independent parts of sentences connected by a conjunction, and subordinate structures, which consist of a main clause and at least one subordinate clause that depends on the main clause³⁵. These subordinate clauses are often underrepresented in the speech production in SSD^{14,15,21,22,23,24}, however, the scope of complex sentences amongst studies is varied.

A recent increase in use of natural language processing (NLP) measures were shown to distinguish SSD patients from HC with accuracies between 70–94%^36,37,38,39. A multitude on various aspects of language, e.g., phonetic features, coherence, structure in written or spoken language, have already been investigated by NLP¹². In the present study, we focused on syntax (i.e., complexity and diversity) in language analysis due to its evidence-based connection to cognitive variables such as executive functioning and working memory in SSD^14,18. Thus, findings on syntax can provide insight into the underlying cognitive processes involved in speech production⁴⁰. The importance of syntactic complexity for diagnosis and monitoring of SSD has been previously demonstrated in several studies^25,40,41,42. In addition, syntax can be useful for gaining deeper insight into the nature and severity of language and communication impairments in SSD^40,42. In contrast to other NLP measures, e.g., prosodic features or idea density¹², the use of subordinate clauses enables speakers to convey coherent information and especially reflect on complex ideas in discourses⁴³. Therefore, a reduced complexity of speech leads to a restricted expression of thoughts during social communication²⁵. Listeners may show greater difficulties in drawing conclusions due to a lack of syntactic organization by subordination accompanied by questions or misunderstandings^44,45. As a result, speakers are required to provide additional information and may feel frustrated about the inability to have a smooth conversation. However, there is a lack of evidence of the production of different types of subordinate clauses in German transdiagnostic samples. Our intention, therefore, was to broaden the view on syntax of spoken language by including MDD in this analysis. An overlap in psychopathology amongst psychotic and affective disorders is well known, thus we intended to expand the knowledge on a language domain^30,33.

Based on previous studies investigating syntax, the following questions remain unanswered: (1) What is the frequency of subordinate clauses which includes all types of adverbial clauses, relative clauses, complement clauses, and indirect questions in German oral language production in HC, MDD, and SSD?, (2) How do individuals with SSD differ in producing subordinate clauses from HC and those with MDD?, (3) Do participants with a lower syntactic complexity and diversity differ in terms of language-related neuropsychology and psychopathology from those with higher syntactic performance regardless of psychiatric diagnosis? (4) What kind of sub-networks consisting of syntactic measures, language-related neuropsychology, and psychopathology can be detected in relation to the degree of syntax? To address these questions, a classification algorithm was used to examine the possibility of differentiation of syntactic complexity and diversity between SSD, MDD, HC in our sample. Furthermore, we wanted to shed light on networks of syntax, language-related neuropsychology, and psychopathology in all participants. Hereof, we hypothesized that patients suffering from SSD are less likely to produce complex speech when compared to HC^{14,15,21,22,23,24} while those with MDD are comparable to HC. Language scores indicate a significant difference in relation to their distribution in SSD when compared to MDD⁴⁶, and MDD show less symptoms of poverty of speech than SSD⁴⁷. Moreover, we anticipated a negative relationship between syntax and negative⁴⁸ and positive symptoms¹⁴.

Results

SSD yielded significantly less semantic verbal fluency (VF), alternating VF, and verbal episodic memory than MDD and HC. Moreover, SSD showed significantly more negative and positive symptoms in all subscales of the Scale for Assessment of Negative Symptoms (SANS) and Positive Symptoms (SAPS) in comparison to MDD and HC. Exclusively, the total sum of SANS indicated significant differences between MDD and HC. For more details see Table 1.

Table 1 Descriptive data of participants.

Full size table

Linguistic parameters

We used ANOVA analyses to investigate differences in syntactic speech production between SSD, MDD, and HC. The group differences of the syntactic complexity, diversity and the sub-categories of complex sentences are presented in Table 2. Furthermore, we tested whether syntactic complexity and diversity were associated with possible confounders using a correlation analysis. The extracted measures of syntactic complexity and diversity did not correlate to current medication (chlorpromazine equivalents⁴⁹, Sackeim score⁵⁰, medication load index)⁵¹, duration and severity of illness (number of hospitalizations, duration of hospitalization, and duration of current episode), age or sex (all ps > 0.05), but syntactic diversity correlated with years of education (r = 0.33, p < 0.001) (see Extended Data Table 1).

Table 2 Analyzed linguistic parameters.

Full size table

Classification

We used classification analyses to investigate the diagnostic utility of syntactic complexity and diversity. Classification accuracies for HC vs SSD, HC vs MDD, and SSD vs MDD were 0.66 (p < 0.004), 0.51 (p < 0.35), and 0.63 (p < 0.005).

Cluster analysis

Cluster analysis was used to investigate transdiagnostic clusters underlying syntactic measures. Four clusters with a Bayesian Information Criterion (BIC) of 294.88 could be shown (cluster 1: n = 39, cluster 2: n = 19, cluster 3: n = 20, cluster 4: n = 34) ranging from extremely complex to very, moderate, and slightly complex speech. Out of the total number of participants in the extremely complex cluster, 45% were HC, another 45% had MDD, and only 10% had been diagnosed with SSD. An inverse distribution of diagnoses was indicated in the slightly complex cluster: 58.8% SSD, 23.5% MDD and 17.6% HC. Nevertheless, all clusters contained participants of SSD, MDD and HC, indicating a transdiagnostic distribution. Interaction analyses were used to test if the distribution of participants to one of the four clusters was driven by clinical diagnoses. No interaction effect was found for any of the five measures representing syntactic complexity and diversity (all ps > 0.05). The identified clusters did not differ in current medication intake (chlorpromazine equivalents⁴⁹, Sackeim score⁵⁰, medication load index)⁵¹, duration and severity of illness (hospitalizations, duration of hospitalization, and duration of current episode), age or sex, but the extremely complex cluster differed from the slightly complex cluster in relation to years of education (p = 0.005) (see Extended Data Table 2). The results of one-way ANOVAs or Kruskal-Wallis tests to compare language-related neuropsychological and psychopathological data of these four groups based on clustering of syntax are listed in Extended Data Table 3.

Network analyses

The network of the full sample presented in Fig. 1A was characterized by associations within each domain (syntactic complexity, language-related neuropsychology, and psychopathology) in the range of weak to strong correlations whereas cross-domain connections were very weak. The extended relative sum of subordinate clauses appeared with highest expected influence (EI) and strength (S) (EI = 2.29; S = 2.25) in all participants. Strength measures were followed by relative sum of subordinate clauses (S = 0.87) and pure syntactic complexity (S = 0.85) (see Extended Data Table 3, Extended Data Fig. 1). The four cluster-based networks in Fig. 1B–E illustrated more impactful links between the domains; thus, sub-networks were intercorrelated in all clusters to a different extent. All networks are shown in Fig. 1 and centrality measures are listed in Extended Data Table 4 and plotted in Extended Data Fig. 1.

**Fig. 1: Networks over all participants and in clusters.**

In summary, the extended relative sum of subordinate clauses indicated a very relevant node in all networks. Furthermore, both syntactic complexity and diversity were associated with positive and negative FTD except in the moderately complex cluster. Finally, it appeared that associations between syntax, language-related neuropsychology, and psychopathology were more pronounced in more complex syntactic production (i.e., the extremely complex cluster) than in participants with reduced syntactic performance (i.e., the slightly complex and moderately complex cluster). See Extended Data Fig. 2 for an insight into the networks of HC, SSD, and MDD.

Discussion

The aim of the current study was to analyze syntax in oral speech production in individuals with SSD compared to HC and those with MDD. Moreover, we investigated networks based on a subset of syntactic, language-related neuropsychological, and psychopathological measures. Results indicated significantly higher syntax in HC and MDD when compared to those with SSD. Classification analyses revealed significant results supporting this finding, albeit with poor performance. Thus, we preferred a dimensional perspective on syntactic complexity and diversity in psychiatric disorders. Cluster analysis showed four transdiagnostic clusters ranging from extremely complex to slightly complex speech that were accompanied by higher and lower FTD, respectively. Network analyses indicated differential networks across different clusters based on syntax. Notably, network associations between syntax, language-related neuropsychology, and psychopathology were more pronounced in higher syntax; cross-domain associations between syntax, language-related neuropsychology, and psychopathology were sparse in speech with lower syntax (i.e., slightly complex and moderately complex clusters).

Our results offer several new insights. First, using direct comparisons between HC, MDD, and SSD subjects, we were able to show numerous differences between the respective diagnostic categories with most pronounced distinctive features in SSD patients. These differences corroborate with previous studies^{14,18,24,25,40,52,53} on syntax. Moreover, we were able to extend these studies by using three further measures of syntactic complexity, allowing an in-depth investigation. Interestingly, all differences in these variables between groups were limited to SSD patients compared to HC and SSD compared to MDD, unlike no differences appeared between HC and MDD. However, significant differences across diagnostic categories indicated medium effect sizes (η² ≥ 0.085). A larger sample size can lead to higher accuracy and reliability of the effect size.

Second, the multivariate pattern diagnostic classification showed weak classification rates, consequently the five variables of syntax itself represented no useful measures to classify patients regarding their clinical diagnosis. Early descriptions have shown syntactic complexity to be a marker separating SSD from mania, specifically in chronic courses^21,54,55,56, whereas others showed a progressive reduction of syntactic complexity over time in SSD irrespective of the disease course²⁵ but not in mania⁵⁷. Taking a more transdiagnostic and dimensional view into account, we investigated sub-clusters of syntactic features across HC, MDD, and SSD. This approach corresponds to recent findings, showing a high overlap across different psychiatric disorders in several domains^{1,2,3,4,5,6,32,58,59} including behavioral and biological measures. This overlap is not considered when comparing or classifying clinical diagnoses^8,9,10. In contrast, alternative (multivariate) methods are necessary to better understand psychiatric heterogeneity^6,32,60. We found four transdiagnostic clusters expanding from extremely complex to slightly complex sentences. Interestingly, all identified clusters included HC, MDD, and SSD subjects with varying distributions (i.e., more HC and MDD in more complex clusters whereas the slightly complex cluster was mainly composed of SSD patients). Hence, different levels of syntax are also reflected by differences in language-related neuropsychology and psychopathology between clusters. Specifically the slightly complex cluster can be characterized by lowest language-related neuropsychological performance, pronounced negative symptoms, and higher amount of delusions in comparison to the other clusters. In contrast, participants of the extremely complex cluster exhibited less overall negative symptoms. These differences lead to the assumption that the identification of the clusters is related to the severity of psychopathological symptoms. The latter are in line with previous studies also highlighting the impact of negative symptoms on syntactic complexity^{13,14,18,25,40,61,62}. Both subscales of SANS and SAPS for FTD are related with syntactic measures, yet they encompass a different scope of linguistic aspects. FTD is a broader concept in comparison to syntactic complexity, that focusses on a grammatical phenomenon of language³⁰. Cognitive deficits such as impairment of language, memory, and executive functioning in consequence of negative symptoms are more strongly associated with difficulties in daily routines, social interaction, and resistance to therapy than positive symptoms^63,64,65.

Third, network analyses were used to investigate the network-structure across the identified clusters. All networks showed high inter-relation and intra-relation within the parameters of the three different domains: syntax, language-related neuropsychology, and psychopathology. In all four cluster-based networks, measures of syntactic complexity were closely related while syntactic diversity appeared to be a separate node outside of the syntactic network. This emphasizes that syntactic complexity and diversity are, although related, two distinct concepts. Additionally, negative FTD and verbal episodic memory represented very relevant nodes in the networks. Both nodes mediate different domains. The connection between deficits in verbal episodic memory and SSD is well-known^66,67. In reduced syntax (i.e., slightly and moderately complex clusters) cross-domain associations appeared to be very weak or missing. Furthermore, the direction of correlations varied within and across the clusters e.g., negative FTD was negatively correlated with pure syntactic complexity, positive FTD correlated positively with pure syntactic complexity in the extreme complex cluster, and the slightly complex cluster presented positive correlations for both. A negative relation between syntactic complexity and FTD is consistent with the literature; however, linguistic effects have been studied particularly in SSD and much less in MDD^11,30,68,69. Negative FTD had a more influential function for disseminating information (high closeness centrality) than positive FTD. We assume an impact of different cluster size and common effect structures⁷⁰. Future studies should investigate syntactic clusters and networks based on a larger sample size to explore stability and validity of our results. A beneficial extension would include the analysis of syntactic complexity and diversity in written language.

Limitations

Some limitations must be noted. First, our sample size was relatively small, and the two clinical diagnoses were heterogeneous due to different disease severity. Nevertheless, speech performances were not associated with duration and severity of illness (i.e., number of hospitalizations, duration of hospitalization, and duration of current episode). Second, this study used a cross-sectional design which prohibits implications of causality. Third, education was significantly different between the extremely and the slightly complex clusters which might have influenced our results. Fourth, while some studies¹⁵ reported an impact of antipsychotic medication on syntactic complexity, others did not²⁵. We did not find any medication effects. However, we cannot exclude potential effects of lifetime intake of psychiatric medication. Fifth, using a manual analysis of syntactic complexity and diversity instead of NLP algorithms entails some disadvantages such as lower comparability and lower efficiency. Nonetheless, an in-depth analysis was only achievable by using a manual approach.

Conclusion

In conclusion, reduced syntactic complexity and diversity was mirrored in reduced performances in executive functioning, verbal fluency, and verbal episodic memory as well as in elevated positive and negative FTD. SSD produced significantly less complex sentences and significantly fewer, different types of complex sentences compared to MDD and HC. Clusters based on different degrees of syntax differed in language-related neuropsychological and psychopathological measures.

Methods

Participants

For the presented study we included N = 112 German-speaking participants (aged 20–67) who were part of the FOR2107 MACS cohort (data freeze of the October 20, 2022, for more details see Kircher et al., 2019, www.for2107.de). Patients were recruited from inpatient and outpatient facilities of the university hospital in Marburg and the departments of participating local hospitals within a 50 km radius of Marburg as well as via postings in local newspapers and flyers. The following exclusion criteria were applied: verbal IQ < 80, history of head trauma or unconsciousness, severe medical illnesses (cancer, autoimmune diseases, and infections), neurological illness, and the presence of a current substance dependence.

According to a semi-structured interview, including the Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition (DSM-IV-TR)⁷¹, n = 34 participants were diagnosed with SSD, while n = 38 fulfilled the criteria for MDD. In addition, n = 40 individuals with no current or former history of any psychiatric disorder were included as HC. All procedures were approved by the local Ethics Committee according to the Declaration of Helsinki. Prior to study participation patients gave written informed consent and received a financial compensation. Table 1 shows an overview of descriptive statistics.

Language-related neuropsychological assessment

We assessed the domains of executive functioning, VF and verbal episodic memory. VF was measured by using three different categories (60 seconds each): semantic VF (category “animals”), phonemic VF (initial letter “p”), and category alternating VF (alternating categories “sports” and “fruit”)⁷², determining semantic processing and executive functions. To test the performance of verbal episodic memory, we used the German version of the California Verbal Learning Test (VLMT)⁷³.

Psychopathological assessment

A number of psychopathological scales were assessed in the course of a semi-structured interview. Ratings were performed either during or following the interview. The level of global functioning was assessed with the Global Assessment of Functioning (GAF)⁷¹. The severity of MDD were measured by Hamilton Rating Scale for Depression (HAM-D)⁷⁴ and Hamilton Anxiety Rating Scale (HAM-A)⁷⁵. In addition, SANS⁷⁶ and SAPS⁷⁷ were administered, recording negative and positive symptoms in four subscales (see Table 1). Both SANS and SAPS include subscales for FTD that were very relevant for the following network analyses. All interviewers were familiar with and trained in the evaluation of the respective psychopathological scales. Interrater reliability was assessed with the interclass coefficient, achieving good reliability of r > 0.86 in all ratings and scales.

Assessment of syntactic complexity and diversity

Eliciting speech

To elicit spontaneous speech, we used four pictures of the Thematic Apperception Test (TAT)⁷⁸ which is in line to the procedures described by Liddle et al., 2002⁷⁹. Instead of eight one-minute spontaneous speech samples, we assessed four different TAT pictures in three-minute periods; our aim was to elicit additional speech-related abbreviations (e.g., FTD) that potentially had not been present before the one-minute time frame but might develop over time. Participants were asked to tell a story about what might be happening in the picture. They were given a one-minute break between each picture; meanwhile the instruction was repeated and then the next picture was presented. If participants stopped within the three minutes of telling a story based on the picture, the instructor used non-directive prompts (e.g., “How do people feel?”; “What could happen next?”). Speech samples were audio recorded (Olympus WS-853) and transcribed literally using the f4transkript software (https://www.audiotranskription.de/f4transkript/). It is important to note that transcribers were unaware of the participants’ diagnoses.

Analysis of transcripts

Transcripts were analyzed by total number of words (tokens), total number of different words (types), total number of sentences, mean length of utterance (MLU), type-token-ratio (TTR), simple sentences (main clauses without conjunctions or subordinations), coordinated sentences (sentences with conjunctions like “and/or” or enumerations without conjunctions) and 13 different types of complex sentences (main clause in combination with subordinate clause) (see Table 2) by KS. Complex sentences included 10 types of embedded adverbial clauses (temporal, local, modal, causal, conditional, adversative, final, consecutive, concessive and comparative), relative clauses, complement clauses and indirect questions. In contrast to Tavano et al., 2008, we excluded coordinated sentences from complex sentences, because simple sentences are often joined together by a conjunction, especially in oral speech production¹⁴. Thus, there is no embedded subordinate clause that is accompanied by a change in word order. For this reason, passive constructions were also neglected in our analysis. In addition to studies, that investigated syntactic complexity in SSD^25,40, we intended to expand the knowledge with syntactic diversity inspired by Tavano et al., 2008 and extracted syntactic complexity and diversity as follows: The sum of all main clauses embedding subordinate clauses without overlaps over the total number of sentences resulted in a meaningful relative value for syntactic complexity (i.e., relative sum of subordinate clauses). Here, we did not distinguish between different depths of embedding. Thus, a complex sentence with only one embedded clause was on a par with a sentence that contained e.g., four embedded clauses. All complete utterances were assigned to either simple sentences, coordinated sentences, or complex sentences. However, overlaps between coordinated sentences and complex sentences could occur and were classified into both categories. The number of different types of complex sentences (0–13) that were produced in the picture description and divided by the maximum of possible different types (13) represents a relative value for syntactic diversity (i.e., syntactic diversity).

In addition to the metrics provided in Tavano et al., 2008, we calculated the following scores which allowed us a more detailed and comprehensive insight into syntactic complexity: 1. The sum of all subordinate clauses, as there are several of them in one main clause in relation to the number of all produced sentences (i.e., extended relative sum of subordinate clauses), but irrespective of various types of subordinate clauses in contrast to the third supplementary value (weighted sum of subordinate clauses). 2. The total number of subordinate clauses divided by the total number of complex sentences exclusively which allowed us to investigate syntactic complexity without confounding effects of non-complex sentences (i.e., pure syntactic complexity). 3. The total number of all subordinate clauses considering different types of complex sentences. The number of each subordinate clause multiplicated with a factor, which represents the number of different types in one main sentence (i.e., weighted sum of subordinate clauses), e.g., a main sentence contains 2 relative, 2 causal and 1 complement clause, implies a factor of 3 due to 3 types of complex sentences. Therefore, the value for this example is 15. For an overview of all analyzed categories, see Table 2.

Statistical procedures

Group comparisons

Group differences in syntactic complexity and diversity between HC, MDD, and SSD groups were investigated with JASP (Version 0.16; JASP Team, 2021) using one-way ANOVA analyses. In case assumptions for parametric testing were not given, non-parametric Kruskal-Wallis test was used⁸⁰. To investigate potential medication effects, we correlated the sum score of chlorpromazine equivalents⁴⁹ (antipsychotics), Sackeim score⁵⁰ (antidepressants), and the medication load index⁵¹ assessing both type and amount of different medication classes (antidepressants, antipsychotics, mood stabilizers) with the extracted syntactic complexity and diversity measures. Likewise, a correlation analysis was employed to test the relationship between duration and severity of illness (number of hospitalizations, duration of hospitalization, and duration of current episode), education, age and sex. All relevant information was obtained in the semi-structured interview and via self-reporting questionnaires.

Classification

In the context of hypothesis testing, multivariate pattern classifications algorithm provides a mathematically solid framework to test if the combined data provides meaningful information about the variable of interest⁸¹. For the present analyses, we used five metrics related to syntactic complexity (see Tables 1, 2 for details on these measures) and support vector machines (SVM) with linear kernel⁸² to classify the data in three different combinations: HC from SSD, HC from MDD, and MDD from SSD. In each case, we used two-fold cross validation with 200 repetitions to estimate the classification accuracy. To estimate the statistical significance of accuracies, we used nonparametric permutation test during which we randomized the labels 1000 times per case and repeated the classification with permuted labels⁸³. All classification analyses were performed using MATLAB R2021b.

Cluster analysis

Cluster analysis was used to identify new sub-groups based on syntactic complexity and diversity. Therefore, we used the relative sum of subordinate clauses, extended relative sum of subordinate clauses, pure syntactic complexity, weighted sum of subordinate clauses, and syntactic diversity. The random forest algorithm implemented in JASP was used to identify clusters of participants that performed similarly in terms of syntax irrespective of a diagnosis. The random forest algorithm bases on several tree predictors that contrast the similarities and dissimilarities of measurements⁸⁴. We did not fix a number of clusters beforehand; instead, we determined optimized clusters according to the BIC. MANCOVA interaction analyses were conducted to test if clinical diagnoses affected cluster contribution. An interaction effect was investigated for all five values for syntactic complexity and diversity. Moreover, we compared extracted clusters with regard to the above medication, duration and severity of illness, education, age, and sex variables.

Next, we compared language-related neuropsychological and psychopathological data between obtained clusters to better characterize them using one-way ANOVAs or Kruskal-Wallis tests⁸⁰ (see Extended Data Table 3).

Network analyses of syntax, language-related neuropsychology and psychopathology

Further, network analyses based on the Gaussian Graphical Model (GGM) were used to investigate the relationship between multiple variables of syntax, language-related neuropsychology, and psychopathology in delineated clusters⁸⁵. Based on the literature^{14,34,35,37,86}, eleven language-related variables, i.e., five variables of syntactic complexity, four language-related neuropsychological and two psychopathological variables were chosen and networks were calculated for each cluster separately (from previous cluster analysis) and additionally for the total sample (see Fig. 1), both with 1000 permutations for non-parametric bootstrapping. All variables are visualized as nodes and significant correlations between two variables as edges. The thickness and intensity of color of the edges indicate the strength of correlations; blue edges mark positive correlations and red edges mark negative correlations. The Extended Bayesian Information Criterion (EBIC)⁸⁷ Graphical Least Absolute Shrinkage and Selection Operator (Lasso)⁸⁸ (short EBICglasso) was used for estimation, with tuning parameter of 0.25. Partial correlations between the selected parameter were estimated and small edges were reduced to zero^70,89. As opposed to a non-regularized model⁷⁰, this method leads to sparser networks with missing connections between the nodes⁹⁰. It should be noted that missing edges are the least important and non-existing edges will not be presented in a network based on GGM⁷⁰. Four centrality measures indicate different relations of nodes: 1. betweenness (i.e., how many times a node is on the shortest path between two nodes), 2. closeness (i.e., how close is a node to other nodes), 3. strength (i.e., sum of connections irrespective of negative or positive) and 4. expected influence (i.e., sum of connections, accounts for negative and positive correlations)⁹⁰. The Fruchterman-Reingold algorithm was the basis of the layout of our networks⁹¹. This algorithm led to a weighted positioning of nodes in the network⁹¹. We used JASP to perform the network analyses on the basis of bootnet⁹⁰ and the qgraph packages⁹² in R to create the graphs.

Data availability

The data and code supporting the findings of this study can be accessed by contacting the corresponding author (KS).

Code availability

The data and code supporting the findings of this study can be accessed by contacting the corresponding author (KS).

References

Goodkind, M. et al. Identification of a common neurobiological substrate for mental Illness. JAMA Psychiatry 72, 305–315 (2015).
Article PubMed PubMed Central Google Scholar
Koutsouleris, N. et al. Individualized differential diagnosis of schizophrenia and mood disorders using neuroanatomical biomarkers. Brain 138, 2059–2073 (2015).
Article PubMed PubMed Central Google Scholar
Lalousis, P. A. et al. Heterogeneity and classification of recent onset psychosis and depression: A multimodal machine learning approach. Schizophr. Bull 47, 1130–1140 (2021).
Article PubMed PubMed Central Google Scholar
Lee, P. H. et al. Genomic relationships, novel loci, and pleiotropic mechanisms across eight psychiatric disorders. Cell 179, 1469–1482.e11 (2019).
Article Google Scholar
Patel, Y. et al. Virtual histology of cortical thickness and shared neurobiology in 6 psychiatric disorders. JAMA psychiatry https://doi.org/10.1001/jamapsychiatry.2020.2694 (2020).
Article PubMed PubMed Central Google Scholar
Brosch, K. et al. Reduced hippocampal gray matter volume is a common feature of patients with major depression, bipolar disorder, and schizophrenia spectrum disorders. Mol. Psychiatry https://doi.org/10.1038/s41380-022-01687-4 (2022).
Article PubMed PubMed Central Google Scholar
David, F. S. et al. Genetic contributions to transdiagnostic symptom dimensions in patients with major depressive disorder, bipolar disorder, and schizophrenia spectrum disorders. Schizophr. Res. 252, 161–171 (2023).
Article CAS PubMed Google Scholar
Kambeitz, J. et al. Detecting neuroimaging biomarkers for schizophrenia: A meta-analysis of multivariate pattern recognition studies. Neuropsychopharmacology 40, 1742–1751 (2015).
Article PubMed PubMed Central Google Scholar
Zarogianni, E., Moorhead, T. W. J. & Lawrie, S. M. Towards the identification of imaging biomarkers in schizophrenia, using multivariate pattern classification at a single-subject level. NeuroImage Clin 3, 279–289 (2013).
Article PubMed PubMed Central Google Scholar
Thibaut, F. Controversies in psychiatry. Dialogues Clin. Neurosci. 20, 151–152 (2018).
Article PubMed PubMed Central Google Scholar
Roche, E., Creed, L., Macmahon, D., Brennan, D. & Clarke, M. The epidemiology and associated phenomenology of formal thought disorder: A systematic review. Schizophr. Bull. 41, 951–962 (2015).
Article PubMed Google Scholar
de Boer, J. N., Brederoo, S. G., Voppel, A. E. & Sommer, I. E. C. Anomalies in language as a biomarker for schizophrenia. Curr. Opin. Psychiatry 33, 212–218 (2020).
Article PubMed Google Scholar
Koops, S. et al. Speech as a biomarker for depression. CNS Neurol. Disord. - Drug Targets 20, 1–9 (2021).
Google Scholar
Tavano, A. et al. Specific linguistic and pragmatic deficits in Italian patients with schizophrenia. Schizophr. Res. 102, 53–62 (2008).
Article PubMed Google Scholar
de Boer, J. N., Voppel, A. E., Brederoo, S. G., Wijnen, F. N. K. & Sommer, I. E. C. Language disturbances in schizophrenia: the relation with antipsychotic medication. npj Schizophr 6, 1–9 (2020).
Google Scholar
Allen, H. A., Liddle, P. F. & Frith, C. D. Negative features, retrieval processes and verbal fluency in schizophrenia. Br. J. Psychiatry 163, 769–775 (1993).
Article CAS PubMed Google Scholar
Joyce, E. M., Collinson, S. L. & Crichton, P. Verbal fluency in schizophrenia: Relationship with executive function, semantic memory and clinical alogia. Psychol. Med. 26, 39–49 (1996).
Article CAS PubMed Google Scholar
Covington, M. A. et al. Schizophrenia and the structure of language: The linguist’s view. Schizophr. Res. 77, 85–98 (2005).
Article PubMed Google Scholar
Andreasen, N. C. Thought, language, and communication disorders: I. Clinical assessment, definition of terms, and evaluation of their reliability. Arch. Gen. Psychiatry 36, 1315–1321 (1979).
Article CAS PubMed Google Scholar
McKenna, K., Gordon, C. T. & Rapoport, J. L. Childhood-onset schizophrenia: timely neurobiological research. J. Am. Acad. Child Adolesc. Psychiatry 33, 771–781 (1994).
Article CAS PubMed Google Scholar
Fraser, W. I., King, K. M., Thomas, P. & Kendell, R. E. The diagnosis of schizophrenia by language analysis. Br. J. Psychiatry 148, 275–278 (1986).
Article CAS PubMed Google Scholar
Thomas, P. et al. The reliability and characteristics of the brief syntactic analysis. Br. J. Psychiatry 168, 334–336 (1996).
Article CAS PubMed Google Scholar
Oh, T. M., McCarthy, R. A. & McKenna, P. J. Is there a schizophasia? A study applying the single case approach to formal thought disorder in schizophrenia. Neurocase 8, 233–244 (2002).
Article CAS PubMed Google Scholar
Kircher, T. T. J., Oh, T. M., Brammer, M. J. & McGuire, P. K. Neural correlates of syntax production in schizophrenia. Br. J. Psychiatry 186, 209–214 (2005).
Article PubMed Google Scholar
Silva, A. M. et al. Syntactic complexity of spoken language in the diagnosis of schizophrenia: A probabilistic Bayes network model. Schizophr. Res. https://doi.org/10.1016/j.schres.2022.06.011 (2022). [Epub ahead of print].
Hollis, C. Child and adolescent (juvenile onset) schizophrenia. A case control study of premorbid developmental impairments. Br. J. Psychiatry 166, 489–495 (1995).
Article CAS PubMed Google Scholar
Nicolson, R. et al. Premorbid speech and language impairments in childhood-onset schizophrenia: Association with risk factors. Am. J. Psychiatry 157, 794–800 (2000).
Article CAS PubMed Google Scholar
Trifu, R. N., Nemeș, B., Bodea-Hațegan, C. & Cozman, D. Linguistic indicators of language in major depressive disorder (MDD). an evidence based research. J. Evidence-Based Psychother 17, 105–128 (2017).
Article Google Scholar
Xu, S. et al. Automated Verbal and Non-verbal Speech Analysis of Interviews of Individuals with Schizophrenia and Depression. in 2019 41st Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC) 225–228 (IEEE, 2019). https://doi.org/10.1109/EMBC.2019.8857071.
Kircher, T., Bröhl, H., Meier, F. & Engelen, J. Formal thought disorders: from phenomenology to neurobiology. The Lancet Psychiatry 5, 515–526 (2018).
Article PubMed Google Scholar
Kircher, T. et al. A rating scale for the assessment of objective and subjective formal Thought and Language Disorder (TALD). Schizophr. Res. 160, 216–221 (2014).
Article PubMed Google Scholar
Stein, F. et al. Psychopathological syndromes across affective and psychotic disorders correlate with gray matter volumes. Schizophr. Bull 47, 1740–1750 (2021).
Article PubMed PubMed Central Google Scholar
Stein, F. et al. Dimensions of formal thought disorder and their relation to gray- and white matter brain structure in affective and psychotic disorders. Schizophr. Bull 48, 902–911 (2022).
Article PubMed PubMed Central Google Scholar
DeLisi, L. E. Speech disorder in schizophrenia: Review of the literature and exploration of its relation to the uniquely human capacity for language. Schizophr. Bull. 27, 481–496 (2001).
Article CAS PubMed Google Scholar
Diessel, H. The Acquisition of Complex Sentences. (Cambridge University Press, 2004).
Elvevåg, B., Foltz, P. W., Rosenstein, M. & DeLisi, L. E. An automated method to analyze language use in patients with schizophrenia and their first-degree relatives. J. Neurolinguistics 23, 270–284 (2010).
Article PubMed PubMed Central Google Scholar
Corcoran, C. M. et al. Language as a biomarker for psychosis: A natural language processing approach. Schizophr. Res. 226, 158–166 (2020).
Article PubMed PubMed Central Google Scholar
de Boer, J. N. et al. Clinical use of semantic space models in psychiatry and neurology: A systematic review and meta-analysis. Neurosci. Biobehav. Rev. 93, 85–92 (2018).
Article PubMed Google Scholar
Mota, N. B. et al. Speech graphs provide a quantitative measure of thought disorder in psychosis. PLoS One 7, e34928 (2012).
Article CAS PubMed PubMed Central Google Scholar
de Boer, J. N. et al. Language in schizophrenia: relation with diagnosis, symptomatology and white matter tracts. npj Schizophr 6, 1–10 (2020).
Google Scholar
Bedi, G. et al. Automated analysis of free speech predicts psychosis onset in high-risk youths. npj Schizophr. 1, 15030 (2015).
Article PubMed PubMed Central Google Scholar
Corcoran, C. M. et al. Prediction of psychosis across protocols and risk cohorts using automated language analysis. World Psychiatry 17, 67–75 (2018).
Article PubMed PubMed Central Google Scholar
Nippold, M. A., Cramond, P. M. & Hayward-Mayhew, C. Spoken language production in adults: Examining age-related differences in syntactic complexity. Clin. Linguist. Phonetics 28, 195–207 (2014).
Article Google Scholar
Perlini, C. et al. Linguistic production and syntactic comprehension in schizophrenia and bipolar disorder. Acta Psychiatr. Scand 126, 363–376 (2012).
Article CAS PubMed Google Scholar
Cain, K. & Nash, H. M. The influence of connectives on young readers’ processing and comprehension of text. J. Educ. Psychol. 103, 429–441 (2011).
Article Google Scholar
Steinau, S. et al. Comparison of psychopathological dimensions between major depressive disorder and schizophrenia spectrum disorders focusing on language, affectivity and motor behavior. Psychiatry Res. 250, 169–176 (2017).
Article PubMed Google Scholar
Lott, P. R., Guggenbühl, S., Schneeberger, A., Pulver, A. E. & Stassen, H. H. Linguistic analysis of the speech output of schizophrenic, bipolar, and depressive patients. Psychopathology 35, 220–227 (2002).
Article CAS PubMed Google Scholar
Thomas, P., King, K. & Fraser, W. I. Positive and negative symptoms of schizophrenia and linguistic performance. Acta Psychiatr. Scand. 76, 144–151 (1987).
Article CAS PubMed Google Scholar
Woods, S. W. Chlorpromazine equivalent doses for the newer atypical antipsychotics. J. Clin. Psychiatry 64, 663–667 (2003).
Article CAS PubMed Google Scholar
Sackeim, H. A. The definition and meaning of treatment-resistant depression. J. Clin. Psychiatry 62, 10–17 (2001).
CAS PubMed Google Scholar
Redlich, R. et al. Brain morphometric biomarkers distinguishing unipolar and bipolar depression: A voxel-based morphometry-pattern classification approach. JAMA Psychiatry 71, 1222–1230 (2014).
Article PubMed PubMed Central Google Scholar
Özcan, A. et al. The production of simple sentence structures in schizophrenia. Int. J. Arts Sci. 09, 159–164 (2017).
Google Scholar
Çokal, D. et al. The language profile of formal thought disorder. npj Schizophr 4, 1–8 (2018).
Article Google Scholar
Morice, R. & McNicol, D. Language changes in schizophrenia: a limited replication. Schizophr. Bull. 12, 239–251 (1986).
Article CAS PubMed Google Scholar
Morice, R. D. & Ingram, J. C. L. Language complexity and age of onset of schizophrenia. Psychiatry Res 9, 233–242 (1983).
Article CAS PubMed Google Scholar
Thomas, P., King, K., Fraser, W. I. & Kendell, R. E. Linguistic performance in schizophrenia: A comparison of acute and chronic patients. Br. J. Psychiatry 156, 204–210 (1990).
Article CAS PubMed Google Scholar
King, K., Fraser, W. I., Thomas, P. & Kendell, R. E. Re-examination of the language of psychotic subjects. Br. J. Psychiatry 156, 211–215 (1990).
Article CAS PubMed Google Scholar
Stein, F. et al. Factor analyses of multidimensional symptoms in a large group of patients with major depressive disorder, bipolar disorder, schizoaffective disorder and schizophrenia. Schizophr. Res. 218, 38–47 (2020).
Article PubMed Google Scholar
Stein, F. et al. State of illness-dependent associations of neuro-cognition and psychopathological syndromes in a large transdiagnostic cohort. J. Affect. Disord. 324, 589–599 (2023).
Article PubMed Google Scholar
Repple, J. et al. Shared and specific patterns of structural brain connectivity across affective and psychotic disorders. Biol. Psychiatry https://doi.org/10.1016/j.biopsych.2022.05.031 (2022).
Article PubMed Google Scholar
Bedi, G. et al. Automated analysis of free speech predicts psychosis onset in high-risk youths. npj Schizophr 1, 1–7 (2015).
Article Google Scholar
Stanislawski, E. R. et al. Negative symptoms and speech pauses in youths at clinical high risk for psychosis. npj Schizophr 7, 2–4 (2021).
Article Google Scholar
Harvey, P. D., Strassnig, M. T. & Silberstein, J. Prediction of disability in schizophrenia: Symptoms, cognition, and self-assessment. J. Exp. Psychpathology 10, 1–20 (2019).
Google Scholar
Galderisi, S. et al. Persistent negative symptoms in first episode patients with schizophrenia: Results from the European First Episode Schizophrenia Trial. Eur. Neuropsychopharmacol. 23, 196–204 (2013).
Article CAS PubMed Google Scholar
Sachs, G. & Erfurth, A. Wirkungen von Cariprazin auf Negativsymptome und kognitive Störungen bei Schizophrenie. psychopraxis. neuropraxis 25, 166–171 (2022).
Article Google Scholar
Meconi, F. et al. Aberrant prefrontal beta oscillations predict episodic memory encoding deficits in schizophrenia. NeuroImage Clin 12, 499–505 (2016).
Article PubMed PubMed Central Google Scholar
Czepielewski, L. S. et al. Verbal episodic memory along the course of schizophrenia and bipolar disorder: A new perspective. Eur. Neuropsychopharmacol. 25, 169–175 (2015).
Article CAS PubMed Google Scholar
Bora, E., Yalincetin, B., Akdede, B. B. & Alptekin, K. Neurocognitive and linguistic correlates of positive and negative formal thought disorder: A meta-analysis. Schizophrenia Research 209, 2–11 (2019).
Article PubMed Google Scholar
Nagels, A. et al. Distinct neuropsychological correlates in positive and negative formal thought disorder syndromes: The thought and language disorder scale in endogenous psychoses. Neuropsychobiology 73, 139–147 (2016).
Article PubMed Google Scholar
Epskamp, S. & Fried, E. I. A tutorial on regularized partial correlation networks. Psychol. Methods 23, 617–634 (2017).
Article Google Scholar
Wittchen, H.-U., Wunderlich, U., Gruschwitz, S. & Zaudig, M. SKID I. Strukturiertes Klinisches Interview für DSM-IV. Achse I: Psychische Störungen. Interviewheft und Beurteilungsheft. Eine deutschsprachige, erweiterte Bearb. d. amerikanischen Originalversion des SKID I. (1997).
Aschenbrenner, A., Tucha, O., Lange, K. RWT Regensburger Wortflüssigkeits-Test. (Hogrefe, Göttingen, 2000).
Niemann, H., Sturm, W., Thöne-Otto, A. I. T. & Willmes, K. CVLT California Verbal Learning Test. German adaptation. Manual. (2008).
Hamilton, M. A rating scale for depression. J. Neurol. Neurosurg. Psychiatry 23, 56–62 (1960).
Article CAS PubMed PubMed Central Google Scholar
Hamilton, M. The assessment of anxiety states by rating. Br. J. Med. Psychol 32, 50–55 (1959).
Article CAS PubMed Google Scholar
Andreasen, N. C. The scale for the assessment of negative symptoms (SANS): Conceptual and theoretical foundations. Br. J. Psychiatry 155, 49–52 (1989).
Article Google Scholar
Andreasen, N. C. The Scale for the Assessment of Positive Symptoms (SAPS). (University of Iowa, 1984).
Murray, H. A. Thematic apperception test. (Harvard University Press, 1943).
Liddle, P. F. et al. Thought and language index: An instrument for assessing thought and language in schizophrenia. Br. J. Psychiatry 181, 326–330 (2002).
Article PubMed Google Scholar
Kruskal, W. H. & Wallis, W. A. Use of ranks in one-criterion variance analysis. J. Am. Stat. Assoc. 47, 583–621 (1952).
Article Google Scholar
Jamalabadi, H. et al. Classification Based Hypothesis Testing in Neuroscience: Below-Chance Level Classification Rates and Overlooked Statistical Properties of Linear Parametric Classifiers. Hum. Brain Mapp. 37, 1842–1855 (2016).
Article PubMed PubMed Central Google Scholar
Bishop, C. M. Pattern Recognition and Machine Learning. (Springer Science+Business Media, LLC, 2006).
Jamalabadi, H., Alizadeh, S., Schönauer, M., Leibold, C. & Gais, S. Multivariate classification of neuroimaging data with nested subclasses: Biased accuracy and implications for hypothesis testing. PLoS Comput. Biol. 14, e1006486 (2018).
Article PubMed PubMed Central Google Scholar
Shi, T. & Horvath, S. Unsupervised learning with random forest predictors. J. Comput. Graph. Stat. 15, 118–138 (2006).
Article Google Scholar
Epskamp, S., Waldorp, L. J., Mõttus, R. & Borsboom, D. The Gaussian Graphical Model in Cross-Sectional and Time-Series Data. Multivariate Behav. Res. 53, 453–480 (2018).
Article PubMed Google Scholar
Kuperberg, G. R. Language in Schizophrenia Part 1: An Introduction. Linguist. Lang. Compass 4, 576–589 (2010).
Article Google Scholar
Chen, J. & Chen, Z. Extended Bayesian information criteria for model selection with large model spaces. Biometrika 95, 759–771 (2008).
Article Google Scholar
Tibshirani, R. Regression shrinkage and selection via the lasso. J. R. Stat. Soc. Ser. B 58, 267–288 (1996).
Google Scholar
McNeish, D. M. Using lasso for predictor selection and to assuage overfitting: A method long overlooked in behavioral sciences. Multivariate Behav. Res. 50, 471–484 (2015).
Article PubMed Google Scholar
Epskamp, S. et al. Estimating psychological networks and their accuracy: A tutorial paper. Behav. Res. Methods 50, 195–212 (2018).
Article PubMed Google Scholar
Fruchterman, T. M. J. & Reingold, E. M. Graph drawing by force‐directed placement. Softw. Pract. Exp. 21, 1129–1164 (1991).
Article Google Scholar
Epskamp, S., Cramer, A. O. J., Waldorp, L. J., Schmittmann, V. D. & Borsboom, D. qgraph: Network Visualizations of Relationships in Psychometric Data. J. Stat. Softw. 48, 1–18 (2012).
Article Google Scholar

Download references

Acknowledgements

We are deeply indebted to all study participants and staff. A list of acknowledgments can be found here: www.for2107.de/acknowledgements.

Funding

This work is part of the German multicentre consortium “Neurobiology of Affective Disorders. A translational perspective on brain structure and function“, funded by the German Research Foundation (Research Unit FOR2107). Principal investigators are Tilo Kircher (KI588/14-1, KI588/14-2), Udo Dannlowski (DA1151/5-1, DA1151/5-2), Axel Krug (KR3822/5-1, KR3822/7-2), Igor Nenadić (NE2254/1-2, NE2254/3-1, NE2254/4-1). KL was supported by a research grant of the University Medical Center Gießen and Marburg (UKGM; Project number: 7/2020MR). Open Access funding enabled and organized by Projekt DEAL.

Author information

These authors contributed equally: Arne Nagels, Frederike Stein.

Authors and Affiliations

Department of English and Linguistics, General Linguistics, University of Mainz, Mainz, Germany
Katharina Schneider & Arne Nagels
Department of Psychiatry and Psychotherapy, University of Marburg, Marburg, Germany
Katrin Leinweber, Hamidreza Jamalabadi, Lea Teutenberg, Katharina Brosch, Julia-Katharina Pfarr, Florian Thomas-Odenthal, Paula Usemann, Adrian Wroblewski, Benjamin Straube, Nina Alexander, Igor Nenadić, Andreas Jansen, Tilo Kircher & Frederike Stein
Center for Mind, Brain and Behavior, University of Marburg, Marburg, Germany
Katharina Brosch, Julia-Katharina Pfarr, Florian Thomas-Odenthal, Paula Usemann, Adrian Wroblewski, Benjamin Straube, Nina Alexander, Igor Nenadić, Andreas Jansen, Tilo Kircher & Frederike Stein
Department of Psychiatry and Psychotherapy, University of Bonn, Bonn, Germany
Axel Krug
Institute for Translational Psychiatry, University of Münster, Münster, Germany
Udo Dannlowski

Authors

Katharina Schneider
View author publications
You can also search for this author in PubMed Google Scholar
Katrin Leinweber
View author publications
You can also search for this author in PubMed Google Scholar
Hamidreza Jamalabadi
View author publications
You can also search for this author in PubMed Google Scholar
Lea Teutenberg
View author publications
You can also search for this author in PubMed Google Scholar
Katharina Brosch
View author publications
You can also search for this author in PubMed Google Scholar
Julia-Katharina Pfarr
View author publications
You can also search for this author in PubMed Google Scholar
Florian Thomas-Odenthal
View author publications
You can also search for this author in PubMed Google Scholar
Paula Usemann
View author publications
You can also search for this author in PubMed Google Scholar
Adrian Wroblewski
View author publications
You can also search for this author in PubMed Google Scholar
Benjamin Straube
View author publications
You can also search for this author in PubMed Google Scholar
Nina Alexander
View author publications
You can also search for this author in PubMed Google Scholar
Igor Nenadić
View author publications
You can also search for this author in PubMed Google Scholar
Andreas Jansen
View author publications
You can also search for this author in PubMed Google Scholar
Axel Krug
View author publications
You can also search for this author in PubMed Google Scholar
Udo Dannlowski
View author publications
You can also search for this author in PubMed Google Scholar
Tilo Kircher
View author publications
You can also search for this author in PubMed Google Scholar
Arne Nagels
View author publications
You can also search for this author in PubMed Google Scholar
Frederike Stein
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

K.S. analyzed the data, performed statistical analyses, interpreted the data, drafted the manuscript and searched for literature. F.S. was responsible for study design, data collection, statistical analyses, and interpretation of the data, drafted the manuscript and reviewed the manuscript. A.N. reviewed the manuscript. T.K. did funding acquisition, data curation, data interpretation, study design, and reviewed the manuscript. H.J. was involved in data curation and performed classification analyses. K.L., L.T,. K.B., J.K.P., F.T.O., P.U., A.W., B.S., N.A. and A.J. were involved in data collection, data quality control, transcription, and reviewing of the manuscript. I.N., A.K., and U.D. did funding acquisition, study design, and reviewed the manuscript. All authors have contributed to the manuscript and have approved to the final version.

Corresponding author

Correspondence to Katharina Schneider.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Extended Data Table 1

Extended Data Table 2

Extended Data Table 3

Extended Data Table 4

Extended Data Figure 1

Extended Data Figure 2

Extended Data Tables and Figure legend

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Schneider, K., Leinweber, K., Jamalabadi, H. et al. Syntactic complexity and diversity of spontaneous speech production in schizophrenia spectrum and major depressive disorders. Schizophr 9, 35 (2023). https://doi.org/10.1038/s41537-023-00359-8

Download citation

Received: 18 January 2023
Accepted: 25 April 2023
Published: 29 May 2023
DOI: https://doi.org/10.1038/s41537-023-00359-8

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Linguistic parameters

Classification

Cluster analysis

Network analyses

Discussion

Limitations

Conclusion

Methods

Participants

Language-related neuropsychological assessment

Psychopathological assessment

Assessment of syntactic complexity and diversity

Eliciting speech

Analysis of transcripts

Statistical procedures

Group comparisons

Classification

Cluster analysis

Network analyses of syntax, language-related neuropsychology and psychopathology

Data availability

Code availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

Search

Quick links