Language in schizophrenia: relation with diagnosis, symptomatology and white matter tracts


Language deviations are a core symptom of schizophrenia. With the advances in computational linguistics, language can be easily assessed in exact and reproducible measures. This study investigated how language characteristics relate to schizophrenia diagnosis, symptom, severity and integrity of the white matter language tracts in patients with schizophrenia and healthy controls. Spontaneous speech was recorded and diffusion tensor imaging was performed in 26 schizophrenia patients and 22 controls. We were able to classify both groups with a sensitivity of 89% and a specificity of 82%, based on mean length of utterance and clauses per utterance. Language disturbances were associated with negative symptom severity. Computational language measures predicted language tract integrity in patients (adjusted R2 = 0.467) and controls (adjusted R2 = 0.483). Quantitative language analyses have both clinical and biological validity, offer a simple, helpful marker of both severity and underlying pathology, and provide a promising tool for schizophrenia research and clinical practice.


Language disturbances are a core symptom of schizophrenia. Since the first descriptions of schizophrenia as a mental disorder, language disturbances have been referred to as formal thought disorder (FTD)1. Kraepelin identified a subgroup of patients with severe confusion of speech, a symptom he described as “schizophasia”, characterized by “an unusually striking disorder of expression in speech, with relatively little impairment of the remaining psychic activities”1. Indeed, there is abundant evidence that language disorder is a key symptom of schizophrenia2,3,4,5,6. We aim to study the relation between language disturbances and schizophrenia, its symptomatology, and the underlying neurobiology.

Language disturbances in schizophrenia are multidimensional. Positive language symptoms include idiosyncratic semantic associations, neologisms and word approximation3,4,7. Negative language symptoms are poverty of speech (ranging from less frequent and slower to complete absence), and reduced grammatical complexity5,8,9.

Until recently, studies have used subjective observation-based instruments to investigate language disturbances in schizophrenia10. Although these rating scales have clinical utility, they do not support the assessment of subtle phenomena/deviations with respect to language form (i.e., grammar as well as sound structure). With the availability of automatic speech recognition and computational linguistic tools, which provide easy and fast quantitative analyses of phonetics, syntax and semantics11,12,13,14, language can easily be assessed in exact and reproducible measures. However, studies applying quantitative analyses of language in the field of schizophrenia have so far been scarce12,14.

To date, research on the neurobiological underpinnings of language disturbances in schizophrenia is limited. Cumulative evidence suggests that symptoms associated with schizophrenia may be the result of disordered brain connectivity15,16,17. White matter structural organization can be studied in vivo by means of diffusion tensor imaging (DTI). A recent meta-analysis from the ENIGMA Working Group on schizophrenia DTI suggests that white matter alternations in schizophrenia are widespread18; disturbances were found in almost all regions analyzed. Furthermore, FTD in schizophrenia has been associated with both structural and functional aberrations in the language network19. Moreover, language connectedness indicated by speech graph analysis was related to functional as well as structural brain markers (cortical folding patterns) in both schizophrenia and bipolar disorder20. However, to date there are no studies investigating white matter microstructure of the language pathways and their association to language disturbances specifically.

Dual-stream models associate distinct functions with left and right hemisphere language networks. The ventral stream, connecting the temporal cortex (including Wernicke’s area) with Broca’s area, supports sound-to-meaning mapping, whereas the dorsal pathway, from the posterior temporal lobe to the premotor cortex as well as the pars opercularis of Broca’s area, is taken to support auditory–motor integration21. In the current study, we used DTI to study the microstructure of the language pathways. Regions of interest (ROIs) were selected using reviews and meta-analyses on the white matter language network22,23,24. Both the dorsal stream (superior longitudinal fasciculus (SLF) and arcuate fasciculus (AF)) as well as the ventral stream (inferior longitudinal fasciculus (ILF), inferior fronto-occipital fasciculus (IFOF) and uncinate fasciculus (UF)) were included (Fig. 1). Fractional anisotropy (FA) and mean diffusivity (MD) were used as an index for white matter integrity25.

Fig. 1: Regions of interest.

Depiction of the five regions of interest (ROIs) of the The Johns Hopkins University (JHU) DTI-based white matter atlases82,83, that were analyzed in the present study in the sagittal plane (a), the coronal plane (b) and the axial plane (c).

We evaluated whether quantitative analyses of spoken language can be used to (1) classify subjects as schizophrenia patients or controls, thereby assessing its potential to aid in diagnosis. In addition, we evaluated how language characteristics relate to (2) schizophrenia symptoms and (3) structural integrity of the language pathways.



Demographic characteristics are listed in Table 1. Healthy controls and patients with schizophrenia on average did not differ in age, gender distribution or educational level. To assess the effect of antipsychotic treatment on our analyses, correlation analyses were performed. No significant relations were found between chlorpromazine equivalent dosage and Positive And Negative Syndrome Scale (PANSS)26 sub scores (all p > 0.400), language measures (all p > 0.100) or language tracts (Supplementary Tables 1, 2).

Table 1 Demographic characteristics of the study sample.

Diagnostic categories

The MANCOVA comparing both groups including age as a covariate, revealed a significant main effect of group status on language characteristics (F(11,35) = 2.565, Pillai’s trace = 0.446, p = 0.017) (Tables 2, 3). No main effect was found for age. Post hoc testing revealed that patients articulated more slowly, spoke during a smaller proportion of the interview, produced shorter utterances, had a higher type-token ratio (TTR, i.e., a measure for lexical diversity) and used fewer clauses per utterance than the healthy controls (all p’s < 0.050, Table 2).

Table 2 Description of language variables.
Table 3 Language characteristics between groups.

A binary logistic regression model was used to investigate to what extent language variables predict group status. The optimal model had high predictive power (Nagelkerke approximation: R2 = 0.733), and the Hosmer–Lemeshow test for goodness-of-fit was non-significant (p = 0.874). This model included mean length of utterance (MLU) and clauses per utterance; age and years of education were entered as covariates. Patients and healthy controls could be classified with this model with a sensitivity of 88.5% and a specificity of 81.8%.


We found a significant negative correlation between PANSS negative subscale and articulation rate (r = −0.414, p = 0.036), speaking turn duration (r = −0.420, p = 0.033), percentage of time speaking (r = −0.715, p < 0.001) and MLU (r = −0.393, p = 0.047). A significant positive association was found between PANSS negative and open-closed ratio (r = 0.397, p = 0.044). After false discovery rate (FDR) correction, only percentage of time speaking remained significant (p < 0.001). Item-based correlation analyses were performed for PANSS negative items (Supplementary Table 3). PANSS positive and general total subscales revealed no significant associations with the language variables. Exploratory post hoc analyses per PANSS item (Supplementary Table 4), showed correlations between conceptual disorganization and turn duration (r = −0.420, p = 0.033), percentage of time speaking (r = −0.715, p < 0.001), MLU (r = −0.393, p = 0.047), and open-closed ratio (i.e., a ratio of content words versus function words) (r = 0.397, p = 0.044). Excitement was associated with articulation rate (r = 0.501, p < 0.001), whereas grandiosity was positively associated with percentage of time speaking (r = 0.415, p = 0.035).

White matter integrity

Two separate MANCOVA’s were used to determine whether healthy controls and patients with schizophrenia differed on DTI measures of both the (skeletonized) language tracts and the whole brain. The results revealed no overall differences between healthy controls and patients with schizophrenia on mean FA values (F(11,35) = 0.783, Pillai’s trace = 0.197, p = 0.655) or mean MD values (F(11,33) = 1.351, Pillai’s trace = 0.310, p = 0.242). However, voxel-wise analyses with Tract-Based Spatial Statistics (TBSS) revealed significantly decreased clusters of voxels in the patients in all ROIs, as well as the corpus callosum, cingulum and the corona radiata (Fig. 2). Our primary analyses concerning FA are presented here; our secondary analyses concerning MD are presented in the supplementary material (Supplementary Tables 5, 6).

Fig. 2: Results of tract-based spatial statistics (TBSS) analysis.

The areas highlighted in red/yellow indicate significantly reduced fractional anisotropy FA values for the patient group compared to the control group after correction for multiple comparisons. These results are projected on an FMRIB58 FA standard brain and the mean FA skeleton derived from our sample (n = 48), in blue. Some of the regions of interest and other areas with significant differences are labeled in both hemispheres (L: left, R: right), among which the superior longitudinal fasciculus (SLF), inferior longitudinal fasciculus (ILS), inferior fronto-occipital fasciculus (IFOF), arcuate fasciculus (AF), the uncinate fasciculus (UF) and the corpus callosum (CC). Areas with significant differences are labeled in the sagittal plane (a), the coronal plane (b) and the axial plane (c).

Multivariate linear regression analyses revealed that language measures explained 46.7% of the variance of the mean FA of the language tracts and 51.6% of the whole brain mean FA in patients with schizophrenia (Table 4).

Table 4 Relation between language disturbances and fractional anisotropy (FA).

In healthy controls, language variables were also highly explanatory of mean FA of the language tracts (48.3%) and whole brain FA (33.1%). Regression analyses for the ROIs individually are summarized in Table 5.

Table 5 Relation between fractional anisotropy (FA) and language disturbances per ROI.


The aim of the current study was to investigate how language characteristics relate to schizophrenia pathology, symptom severity and integrity of the white matter language tracts in patients with schizophrenia and healthy controls. Patients with a schizophrenia spectrum disorder showed quantifiable language disturbances; they spoke less, their articulation rate was slower and they used less complex sentences compared to the matched healthy controls. Language analysis can be a helpful aid in diagnosis. Furthermore, there was a strong relation between these decreased language parameters and negative symptoms, suggesting that language analyses are especially helpful to detect negative symptoms. Also, quantitative properties of spoken language output are strongly related to white matter integrity of the language tracts in both patients and healthy controls.

Our results showed that patients with schizophrenia and healthy controls differed on a broad variety of language measures, including speech tempo and the amount of language produced, as well as measures of complexity. Furthermore, we found that analyzing spontaneous language production can be a powerful diagnostic tool, as it distinguishes between patients with schizophrenia and healthy controls with a sensitivity of 88.5% and a specificity of 81.8%. These sensitivity and specificity indices are in the range of blood-based molecular biomarkers (sensitivity and specificity 90%27) and neuroimaging markers using machine learning (accuracy varying between 61.1 and 95%28) for schizophrenia. The high sensitivity and specificity we found are remarkable, especially given the simplicity of the model and the small number of predictors (four variables). However, it should be noted that our sample is small and further research, including cross-validation, is necessary to assess the full potential of language variables as a diagnostic biomarker.

We showed that language disturbances are associated with PANSS negative, as well as individual items of the PANSS positive and general subscales. However, most associations were no longer significant after FDR correction. The absence of a correlation with total PANSS positive and general scores could be explained by the fact that all patients were medicated and relatively free of overt psychosis at the time of assessment.

In both patients with schizophrenia and healthy controls, several aspects of language proved highly predictive of structural integrity of the language pathways. While the patients with schizophrenia in our sample had disturbances in language production, the mean integrity of their white matter language tracts was similar to that of the healthy controls. However, our results reveal more fine-grained deviations in the integrity of the white matter tracts, revealing patterns of deviating voxel clusters in all language tracts.

More specifically, in healthy controls, the mean FA of the language tracts was predicted by the MLU, clauses per utterance and the pause to word ratio, while the whole brain FA was predicted by speaking turn duration and clauses per utterance. In patients the FA of the language tracts was predicted by pause duration, MLU and noun–verb ratio. The same pattern was found for the mean FA of the whole brain, only speaking turn duration was found as an additional predictor. These results can be explained by at least two aspects of language production. First, MLU, noun–verb ratio and clauses per utterance are measures of sentence complexity; greater utterance length and more clauses or verbs per utterance indicate more complex sentences. In child language acquisition, sentences become longer and more complex with age29,30. In general, white matter integrity increases with age in typically developing children31, which is highly correlated with language development32. Second, pause duration and pause to word ratio are thought to reflect speaking efficiency and/or processing speed. Previous research has shown that FA has been associated with information processing efficiency33. Our results confirm that white matter integrity in the language tracts is associated with increased complexity and speaking efficiency in healthy controls, and extends these findings to patients with schizophrenia.

Importantly, the relation between language variables and the integrity of the white matter tracts appears to be more specific in healthy controls than in patients with schizophrenia. In healthy controls, language is a better predictor for the language tracts than for the whole brain FA (adjusted R2 = 0.483 and 0.331, respectively). In patients, however, the language measures we used were a better (or at least similar) predictor for the whole brain FA than for the FA of the language tracts (adjusted R2 = 0.516 and 0.467, respectively). This finding can be interpreted as decreased brain specialization, since previous research has hemispheric specialization is decreased in schizophrenia34,35. Alternatively, this finding could reflect more general cognitive disturbances in schizophrenia. Schizophrenia is characterized by broad disturbances in cognition, including decreased processing speed, memory deficits and attention problems36,37. These cognitive disturbances may lead to disturbances in language that are nonspecific to language, and therefore show less clear associations with language tracts. We further showed that language measures predict white matter integrity better in some language tracts than in others. In healthy controls, up to 75.3% of the variance of the FA of the left AF is explained by aspects of spontaneous speech, while only 25% of the right UF is explained by language measures. Again, this specificity is less profound in patients with schizophrenia, where 47.8% of the variance the left AF is explained by the language measures.

Previous research revealed significant reductions in white matter integrity in patients with schizophrenia as compared to healthy controls38,39. In the current study, we did not find any group differences when looking at average FA/MD over an atlas-based ROI. This might be related to the relatively small sample size, or to the fact that most patients had recent onset psychotic illness, whereas previous research suggests that white matter deterioration in schizophrenia increases with duration of illness40. However, we did find significant group differences on clusters of voxels using voxel-wise analyses with TBSS in all language tracts, as well as the corpus callosum, cingulum and the corona radiata. These preliminary results suggest abnormalities at the microstructural level may be a part of a diffuse pattern of brain development in recent onset schizophrenia. The biologic interpretation of these microstructural anomalies remains speculative. Schizophrenia, as a neurodevelopmental disorder, has a subtly abnormal circuit underlying cortical and cerebellar functions such as, motor skills, language, cognition and emotions. Multiple mechanisms are likely to be involved41 and per individual, some mechanisms may be more important than others. A very early mechanism may stem from the innate immune system of the brain, especially microglia and complement in shaping the developing brain. Inadequate pruning may result in underdeveloped connections. With myelination to follow much used network connections, fewer well trafficked connections will become well myelinated42 In mouse models, decreased white matter integrity has been associated with acute axon and myelin damage43. However, a definite pathway for abnormal white matter connectivity in schizophrenia remains elusive.

Previous studies have proposed that FTD severity is related to integrity of white matter language tracts10,19. This hypothesis is not supported by our data, as we found language disturbances to be present in the absence of large scale white matter aberrations that were confirmed by previous research18,44,45. This difference might be related to the fact that previous studies used FTD rating scales to measure disturbances in language. A disadvantage of using symptom-based severity scores such as FTD rating scales is that these are not scored in healthy controls; therefore, this relation was not previously assessed in healthy controls. FTD rating scales may not be sensitive enough to detect subtle or preclinical deviations in language. Our results indicate that aspects of spontaneous language production are strongly related to white matter integrity in both healthy controls and patients with schizophrenia. Furthermore, we have shown that the white matter integrity of language tracts is not distinctive for patients and healthy controls, while the functional language output is. This is in agreement with research on structural and functional brain abnormalities in schizophrenia, which suggests that structural abnormalities (if observable) are modest, and that it is difficult to distinguish brains of patients from those of healthy controls46,47. Instead, schizophrenia is associated with complex alterations in regional patterns of activity in the brain, mostly in task related and resting state activity46,47,48,49.

Interestingly, in the healthy controls, language variables were more predictive of left-hemisphere language tracts than their right hemisphere counterparts. The white matter language network is generally more lateralized towards the left hemisphere in right-handed subjects50, although temporo-frontal networks in the right hemisphere support sentence-level prosody22,51. Of note, recent functional MRI studies of speech production suggest that language is not localized to the left hemisphere, instead advocating the importance of the right hemisphere during language production52,53. However, these studies involved storytelling tasks; a register of speech that involves a great deal of prosody, emotion, humor, and is not considered neutral spontaneous speech54,55. This greater emotional involvement in narrative production may in part explain the large right-hemispheric contribution during these tasks22,51. Our finding that several language tracts are more left-lateralized in healthy controls thus adheres to current views on the white matter language network22,24,50,56.

In the patients, we found no clear pattern of left-hemisphere specificity for the language variables. There is strong evidence for reduced (functional) lateralization in schizophrenia, which is evidenced by increased mixed-handedness57 and diminished language lateralization58,59,60,61. Schizophrenia patients show a reduction of left-lateralization in several white matter language tracts, including the UF62,63,64 and the IFOF64. The right-shift of FA of the UF correlates with negative symptoms64. Our results confirm these findings, as the right UF in patients was strongly associated with disfluencies and pauses, more than its left hemisphere counterpart.

This study directly assessed the relation between language disturbances and schizophrenia pathology and symptomatology, as well as the integrity of white matter language tracts in patients with schizophrenia and healthy controls. There are a number of limitations to this study. First, to date, there is no white matter atlas that includes a mask for the AF. This tract is still under investigation and the exact anatomy is still disputed65,66. Consequently, we used the temporal branch of the SLF as a mask for the AF. This mask was more of an approximation to the AF, and results concerning the AF should therefore be interpreted with caution. As there is limited research on incorporating these specific language and MRI measures, our results highlight the need for replication in a larger independent sample. Third, participants were relatively stable at time of assessment, which precluded the demonstration of correlations with positive symptoms. Further studies across ages, illness severity and disease durations are needed to understand the trajectory of language disturbances in schizophrenia. Especially replication in a group at high-risk for psychosis would be highly valuable to assess whether both white matter abnormalities and language disturbances proceed the occurrence of a psychotic disorder. Fourth, we had no control group that takes antipsychotics, without the presence of a psychotic disorder. Lastly, we used (parental) years of education as a proxy of intelligence, however, we did not control for differences in IQ between the groups. While there is currently no evidence suggesting that IQ correlates with spontaneous speech markers67, the influence of IQ on spontaneous speech remains relatively unknown. Therefore, we cannot fully rule out the influence of differences in intelligence on our results.

In conclusion, quantifiable aspects of language are a sensitive and specific tool in the classification of patients with schizophrenia and healthy controls. Furthermore, these language disturbances are associated with symptom severity, especially with negative symptoms. In both patients with schizophrenia and healthy controls, quantifiable aspects of language are highly predictive of the integrity of white matter tracts associated with language. Our current findings make an important contribution to recent to initiatives such as the Research Domain Criteria project (RDoC) and its aim towards precision psychiatry, which advocates a focus on dimensions of neurobiology and observable behavior rather than symptom-based classification systems such as the DSM and ICD68,69. Given that language analyses are non-invasive, quickly performed and low-cost, language analyses are a promising tool in schizophrenia, with both clinical and neurobiological validity.



A total of 48 participants, 26 patients with a schizophrenia spectrum disorder and 22 healthy controls, were included between 2015 and 2018 at the University Medical Center Utrecht (UMCU), the Netherlands (trial registration number: NCT01999309). Participants were included if they were (1) aged 18 years or above and (2) a native speaker of Dutch. Patients were included if they met criteria for a DSM-IV diagnosis of: 295.x (schizophrenia, schizophreniform disorder, schizoaffective disorder) or 298.9 (psychotic disorder not otherwise specified). Patients were diagnosed by their treating psychiatrist. A neuropsychologist confirmed the diagnosis using the Comprehensive Assessment of Symptoms and History (CASH) interview70. Healthy controls were screened for the absence of previous or current mental illness using the CASH by a neuropsychologist. Healthy controls were excluded if a family history of psychotic symptoms was reported. Additional exclusion criteria were the presence of uncorrected hearing disabilities or speech deficits (such as stutter), contraindications for MRI and left-handedness. The severity of psychotic symptoms was assessed in all patients with the PANSS26. This study was approved by the ethical review board of the UMCU. Written informed consent was obtained from all participants. Participants received a small monetary award for participation. Antipsychotic drug dosages were recalculated into chlorpromazine equivalents to evaluate treatment effects in the patients71.

Language data acquisition and processing

To elicit spontaneous spoken language we conducted semi-structured interviews with an average duration of fifteen minutes. Participants were informed that this was part of an analysis regarding “general experiences”; only after completion of the interview they were told that the research also focuses on the way they speak. To prevent variations in language due to the topic that was discussed, a standard set of questions was used. All questions concerned “neutral” general life experiences; topics that could be expected to have markedly different emotional valence for patients and healthy controls were not addressed. For instance, topics such as “quality of life” or “health” were avoided. If for any reason a subject did not want to answer a question, the interviewer would move on to the next question. For a list of the questions, see Supplementary Table 7.

An AKG-C544l head-worn cardioid microphone was used to record the subject’s speech. Speech was digitally recorded onto a Tascam DR40 solid state recording device at a sampling rating of 44,100 kHz with 16-bit quantization. The digitized recordings were analyzed using the Praat software72, which is standardly used for acoustic analyses of speech. Speech signals of interviewer and participant were separated by hand onto two different digital audio tracts by J.N.d.B. and A.E.V. Each stretch of speech was coded as belonging either to the participant or the interviewer. When both speakers spoke at the same time, that speech segment was coded as belonging to both speakers. The speech segments were recombined into new audio files per participant, which each thus contained only the time that an individual participant was speaking and pausing. Data files were blinded for diagnosis to prevent bias in separating the speaker. Inter-rater reliability for tier separation was 97.7%. All files were set to an average sound pressure level of 60 dB to avoid differences in the analyses based on speaking volume.

The “Praat Script Syllable Nuclei v2”73 was used to automatically obtain speech and articulation rates. The output of this script includes the total number of syllables and the total number of pauses. Pauses were defined as silences longer than 200 ms, as shorter silences in speech are often related to the articulation of particular sounds, notably plosives (e.g., the /p/, which introduces a short silence in the sound wave)74. The raw measures were recalculated as a percentage of the duration of the participants’ audio track, since they are strongly dependent on the length of the interview. The participants’ audio file was transcribed according to CHILDES-CHAT guidelines75. CLAN software applications EVAL and FLUCALC76 were used to extract a comprehensive collection of commonly used measures that reflect linguistic fluency and complexity.

This resulted in the following language measures: articulation rate, average pause duration, speaking turn duration, percentage of time speaking, MLU, TTR, clauses per utterance, noun–verb ratio, open-closed ratio, disfluencies and pause to word ratio; for additional information on these variables, see Table 2.

DTI acquisition and analysis

MRI scanning was performed by trained MR technicians using a Philips Achieva 3 tesla scanner (Philips Medical Systems, Best, the Netherlands) at the UMCU equipped with an eight-channel SENSE head coil.

Two transverse echo planar imaging diffusion-weighted single shot spin-echo scans were acquired (b-value = 1000 s/mm2, 30 non-collinear diffusion directions; 5 diffusion-unweighted (b = 0 s/mm²) volumes; field of view = 240 mm × 240 mm; acquisition matrix = 128 × 128; reconstruction matrix = 128 × 128; flip angle = 90°; slice thickness = 2 mm, 75 consecutive slices; TE = 68 ms; TR = 7011 ms; parallel imaging factor (SENSE) = 3; no cardiac gating). For the second diffusion-weighted scan the k-space readout direction was reversed (anterior–posterior) enabling a correction for susceptibility artefacts in the post processing step.

Participants were allowed to watch television and were requested not to move during the scanning procedures. DTI data was preprocessed using the Diffusion Toolbox implemented in FMRIB Software Library (FSL) release 5.0.9 (ref.77). The Brain Extraction Tool was used to remove the skull and other non-brain areas and to create a binary brain mask78. The FSL topup tool was used to compute the susceptibility correction parameters using the diffusion-unweighted volumes from both the DTI scans. These parameters where included into FSL’s eddy program enabling simultaneous correction of the DTI data for susceptibility artefacts, head motion and eddy current distortion.

A diffusion tensor model was then fitted to every voxel, and FA and MD maps were created. FA and MD are thought to reflect the coherence of the fiber orientation and free-water concentrations79,80. TBSS was used to perform voxel-wise statistical analysis on the FA and MD maps81. First, we ran a non-linear registration of all the subjects FA and MD images to the FMRIB58_FA image in MNI152 space. Next, the average FA map of all subjects was skeletonized by eliminating all voxels with FA < 0.2, and this skeleton mask was used to obtain both FA and MD data from all of our subjects. The registered FA and MD maps in standard space (computed as part of the TBSS analysis) were used for a ROI based analysis. Using the FA and MD maps, mean FA and MD values were computed for several predefined ROIs. The following ROIs were included: the SLF, AF, ILF, IFOF and UF bilaterally. The Johns Hopkins University (JHU) ICBM-DTI-81 white matter labels atlas (for bilateral SLF and UF) and the JHU white matter tractography atlas (for bilateral AF, ILF and IFOF) in MNI space were used to create masks from which average FA and MD values were extracted82,83.

In addition to exploring group differences based on the atlas masks, the skeletonized FA and MD images were used to conduct voxel-wise statistical comparison using FSL’s randomize function84. With this analysis, group differences can be seen at the voxel-level, rather than for the whole ROI. Therefore, subtle differences in clusters of voxels can also be detected.

Statistical analyses

All statistical analyses were performed using the Statistical Package for Social Sciences (SPSS, version 25). Independent samples t-tests were performed to test for group differences on demographic variables. A χ2 test was performed for categorical variables. First, group differences for the language measures were tested by performing a one-way MANCOVA, controlling for age. Next, we investigated: (1) which language variables were associated with group membership (healthy controls vs patients), by performing a backward binary logistic regression. Predictors were the language variables, as well as age, gender and education level; (2) which language variables were associated with psychotic symptoms, by performing correlation analyses were between PANSS items and language variables; (3) whether severity of language disturbances were associated with white matter aberrations in the ROIs, through backwards multivariate linear regression analyses. Mean FA values from the ROIs as well as whole brain FA and mean FA of the language tracts were entered as dependent variables and language variables as independent variables. This analysis was repeated for the MD values. To account for multiple comparisons, FDR was employed85.

Reporting Summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

The data that support the findings of this study are available on request from the corresponding author. The data are not publicly available due to them containing information that could compromise research participant privacy or consent.


  1. 1.

    Kraepelin, E., Barclay, R. M. & Robertson, G. M. Dementia Praecox (Chicago Medical Book Co., Chicago, 1919).

  2. 2.

    Kuperberg, G. R. Language in schizophrenia part 1: an introduction. Lang. Linguist. Compass 4, 576–589 (2010).

    PubMed  PubMed Central  Article  Google Scholar 

  3. 3.

    Kuperberg, G. R. Language in schizophrenia Part 2. Lang. Linguist. Compass 4, 576–589 (2011).

    Article  Google Scholar 

  4. 4.

    Covington, M. A. et al. Schizophrenia and the structure of language: the linguist’s view. Schizophr. Res. 77, 85–98 (2005).

    PubMed  Article  Google Scholar 

  5. 5.

    DeLisi, L. E. Speech disorder in schizophrenia: review of the literature and exploration of its relation to the uniquely human capacity for language. Schizophr. Bull. 27, 481–496 (2001).

    CAS  PubMed  Article  Google Scholar 

  6. 6.

    de Boer, J. N., Brederoo, S. G., Voppel, A. E. & Sommer, I. E. C. Anomalies in language as a biomarker for schizophrenia. Curr. Opin. Psychiatry. (2020).

    Article  PubMed  Google Scholar 

  7. 7.

    Ditman, T. & Kuperberg, G. R. Building coherence: a framework for exploring the breakdown of links across clause boundaries in schizophrenia. J. Neurolinguist. 23, 254–269 (2010).

    Article  Google Scholar 

  8. 8.

    Fraser, W. I., King, K. M., Thomas, P. & Kendell, R. E. The diagnosis of schizophrenia by language analysis. Br. J. Psychiatry 148, 275–278 (1986).

    CAS  PubMed  Article  Google Scholar 

  9. 9.

    Morice, R. & McNicol, D. Language changes in schizophrenia: a limited replication. Schizophr. Bull. 12, 239–251 (1986).

    CAS  PubMed  Article  Google Scholar 

  10. 10.

    Cavelti, M., Kircher, T., Nagels, A., Strik, W. & Homan, P. Is formal thought disorder in schizophrenia related to structural and functional aberrations in the language network? A systematic review of neuroimaging findings. Schizophr. Res. 199, 2–16 (2018).

  11. 11.

    Tahir, Y. et al. Non-verbal speech cues as objective measures for negative symptoms in patients with schizophrenia. PLoS ONE 14, 1–17 (2019).

    Article  CAS  Google Scholar 

  12. 12.

    Corcoran, C. M. et al. Prediction of psychosis across protocols and risk cohorts using automated language analysis. World Psychiatry 17, 67–75 (2018).

    PubMed  PubMed Central  Article  Google Scholar 

  13. 13.

    de Boer, J. N. et al. Clinical use of semantic space models in psychiatry and neurology: a systematic review and meta-analysis. Neurosci. Biobehav. Rev. 93, 85–92 (2018).

    PubMed  Article  PubMed Central  Google Scholar 

  14. 14.

    Bedi, G. et al. Automated analysis of free speech predicts psychosis onset in high-risk youths. npj Schizophr. 1, 15030 (2015).

    PubMed  PubMed Central  Article  Google Scholar 

  15. 15.

    Friston, K. J. Dysfunctional connectivity in schizophrenia. World Psychiatry 1, 66 (2002).

    PubMed  PubMed Central  Google Scholar 

  16. 16.

    Konrad, A. & Winterer, G. Disturbed structural connectivity in schizophrenia—primary factor in pathology or epiphenomenon? Schizophr. Bull. 34, 72–92 (2007).

    PubMed  PubMed Central  Article  Google Scholar 

  17. 17.

    Kanaan, R. A. A. et al. Diffusion tensor imaging in schizophrenia. Biol. Psychiatry 58, 921–929 (2005).

    PubMed  Article  PubMed Central  Google Scholar 

  18. 18.

    Kelly, S. et al. Widespread white matter microstructural differences in schizophrenia across 4322 individuals: results from the ENIGMA Schizophrenia DTI Working Group. Mol. Psychiatry 23, 1261–1269 (2018).

  19. 19.

    Cavelti, M. et al. Formal thought disorder is related to aberrations in language-related white matter tracts in patients with schizophrenia. Psychiatry Res. Neuroimag 270, 40–50 (2018).

    Article  Google Scholar 

  20. 20.

    Palaniyappan, L. et al. Speech structure links the neural and socio-behavioural correlates of psychotic disorders. Prog. Neuro-Psychopharmacol. Biol. Psychiatry 88, 112–120 (2019).

    Article  Google Scholar 

  21. 21.

    Hickok, G. & Poeppel, D. The cortical organization of speech processing. Nat. Rev. Neurosci. 8, 393 (2007).

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  22. 22.

    Friederici, A. The brain basis of language processing: from structure to function. Physiol. Rev. 91, 1357–1392 (2011).

    PubMed  Article  PubMed Central  Google Scholar 

  23. 23.

    Friederici, A. D. Towards a neural basis of auditory sentence processing. Trends Cogn. Sci. 6, 78–84 (2002).

    PubMed  Article  PubMed Central  Google Scholar 

  24. 24.

    Price, C. J. The anatomy of language: a review of 100 fMRI studies published in 2009. Ann. N. Y. Acad. Sci. 1191, 62–88 (2010).

    PubMed  Article  PubMed Central  Google Scholar 

  25. 25.

    Alexander, A. L., Lee, J. E., Lazar, M. & Field, A. S. Diffusion tensor imaging of the brain. Neurotherapeutics 4, 316–329 (2007).

    PubMed  PubMed Central  Article  Google Scholar 

  26. 26.

    Kay, S. R., Fiszbein, A. & Opfer, L. A. The positive and negative syndrome scale (PANSS) for schizophrenia. Schizophr. Bull. 13, 261 (1987).

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  27. 27.

    Chan, M. K. et al. Development of a blood-based molecular biomarker test for identification of schizophrenia before disease onset. Transl. Psychiatry 5, e601 (2015).

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  28. 28.

    Zarogianni, E., Moorhead, T. W. J. & Lawrie, S. M. Towards the identification of imaging biomarkers in schizophrenia, using multivariate pattern classification at a single-subject level. NeuroImage Clin. 3, 279–289 (2013).

    PubMed  PubMed Central  Article  Google Scholar 

  29. 29.

    Rice, M. L. et al. Mean length of utterance levels in 6-month intervals for children 3 to 9 years with and without language impairments. J. Speech Lang. Hear. Res. 53, 333–349 (2010).

  30. 30.

    Scarborough, H. S., Rescorla, L., Tager-Flusberg, H., Fowler, A. E. & Sudhalter, V. The relation of utterance length to grammatical complexity in normal and language-disordered groups. Appl. Psycholinguist. 12, 23–46 (1991).

    Article  Google Scholar 

  31. 31.

    Schmithorst, V. J., Wilke, M., Dardzinski, B. J. & Holland, S. K. Correlation of white matter diffusivity and anisotropy with age during childhood and adolescence: a cross-sectional diffusion-tensor MR imaging study. Radiology 222, 212–218 (2002).

    PubMed  PubMed Central  Article  Google Scholar 

  32. 32.

    Schmithorst, V. J. & Yuan, W. White matter development during adolescence as shown by diffusion MRI. Brain Cogn. 72, 16–25 (2010).

    PubMed  Article  Google Scholar 

  33. 33.

    Deary, I. J. et al. White matter integrity and cognition in childhood and old age. Neurology 66, 505–512 (2006).

    CAS  PubMed  Article  Google Scholar 

  34. 34.

    Ribolsi, M., Daskalakis, Z. J., Siracusano, A. & Koch, G. Abnormal asymmetry of brain connectivity in schizophrenia. Front. Hum. Neurosci. 8, 1010 (2014).

    PubMed  PubMed Central  Article  Google Scholar 

  35. 35.

    Mueller, S., Wang, D., Pan, R., Holt, D. J. & Liu, H. Abnormalities in hemispheric specialization of caudate nucleus connectivity in schizophrenia. JAMA psychiatry 72, 552–560 (2015).

    PubMed  PubMed Central  Article  Google Scholar 

  36. 36.

    Bora, E., Yucel, M. & Pantelis, C. Cognitive functioning in schizophrenia, schizoaffective disorder and affective psychoses: meta-analytic study. Br. J. Psychiatry 195, 475–482 (2009).

    PubMed  Article  Google Scholar 

  37. 37.

    Elvevag, B. & Goldberg, T. E. Cognitive impairment in schizophrenia is the core of the disorder. Crit. Rev. Neurobiol. 14, 1–21 (2000).

    CAS  PubMed  Article  Google Scholar 

  38. 38.

    Kuswanto, C. N., Teh, I., Lee, T.-S. & Sim, K. Diffusion tensor imaging findings of white matter changes in first episode schizophrenia: a systematic review. Clin. Psychopharmacol. Neurosci. 10, 13 (2012).

    PubMed  PubMed Central  Article  Google Scholar 

  39. 39.

    Fitzsimmons, J., Kubicki, M. & Shenton, M. E. Review of functional and anatomical brain connectivity findings in schizophrenia. Curr. Opin. Psychiatry 26, 172–187 (2013).

    PubMed  Article  Google Scholar 

  40. 40.

    Cropley, V., Klauser, P., Lenroot, R. K. & Bruggemann, J. Accelerated gray and white matter deterioration with age in schizophrenia. Am. J. Psychiatry 174, 286–295 (2017).

    PubMed  Article  PubMed Central  Google Scholar 

  41. 41.

    Van Kesteren, C. F. M. al. Immune involvement in the pathogenesis of schizophrenia: a meta-analysis on postmortem brain studies. Transl. Psychiatry 7, –11 e1075 (2017)..

  42. 42.

    Davis, K. L. et al. White matter changes in schizophrenia: evidence for myelin-related dysfunction. Arch. Gen. Psychiatry 60, 443 (2003).

    PubMed  Article  PubMed Central  Google Scholar 

  43. 43.

    Song, S.-K. et al. Demyelination increases radial diffusivity in corpus callosum of mouse brain. Neuroimage 26, 132–140 (2005).

    PubMed  Article  PubMed Central  Google Scholar 

  44. 44.

    Samartzis, L., Dima, D., Fusar-Poli, P. & Kyriakopoulos, M. White matter alterations in early stages of schizophrenia: a systematic review of diffusion tensor imaging studies. J. Neuroimaging 24, 101–110 (2014).

    PubMed  Article  PubMed Central  Google Scholar 

  45. 45.

    Haijma, S. V. et al. Brain volumes in schizophrenia: a meta-analysis in over 18 000 subjects. Schizophr. Bull. 39, 1129–1138 (2013).

    PubMed  Article  Google Scholar 

  46. 46.

    Chua, S. E. & McKenna, P. J. Schizophrenia–a brain disease? A critical review of structural and functional cerebral abnormality in the disorder. Br. J. Psychiatry 166, 563–582 (1995).

    CAS  PubMed  Article  Google Scholar 

  47. 47.

    Shenton, M. E., Dickey, C. C., Frumin, M. & McCarley, R. W. A review of MRI findings in schizophrenia. Schizophr. Res. 49, 1–52 (2001).

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  48. 48.

    Calhoun, V. D., Eichele, T. & Pearlson, G. Functional brain networks in schizophrenia: a review. Front. Hum. Neurosci. 3, 17 (2009).

    PubMed  PubMed Central  Article  Google Scholar 

  49. 49.

    Birur, B., Kraguljac, N. V., Shelton, R. C. & Lahti, A. C. Brain structure, function, and neurochemistry in schizophrenia and bipolar disorder—a systematic review of the magnetic resonance neuroimaging literature. npj Schizophr. 3, 15 (2017).

    PubMed  PubMed Central  Article  CAS  Google Scholar 

  50. 50.

    Takaya, S. et al. Asymmetric projections of the arcuate fasciculus to the temporal cortex underlie lateralized language function in the human brain. Front. Neuroanat. 9, 1–12 (2015).

    Article  Google Scholar 

  51. 51.

    Abdul-Rahman, M. F. et al. Arcuate fasciculus abnormalities and their relationship with psychotic symptoms in schizophrenia. PLoS ONE 7, e29315 (2012).

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  52. 52.

    Silbert, L. J., Honey, C. J., Simony, E., Poeppel, D. & Hasson, U. Coupled neural systems underlie the production and comprehension of naturalistic narrative speech. Proc. Natl Acad. Sci. USA 111, E4687–E4696 (2014).

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  53. 53.

    Stephens, G. J., Silbert, L. J. & Hasson, U. Speaker–listener neural coupling underlies successful communication. Proc. Natl Acad. Sci. USA 107, 14425–14430 (2010).

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  54. 54.

    Verma, R., Sarkar, P. & Rao, K. S. Conversion of neutral speech to storytelling style speech. In 2015 Eighth International Conference on Advances in Pattern Recognition (ICAPR) 1–6 (IEEE, 2015).

  55. 55.

    Norrick, N. R. Conversational Narrative: Storytelling In Everyday Talk Vol. 203 (John Benjamins Publishing, Amsterdam, 2000).

  56. 56.

    Friederici, A. D. & Gierhan, S. M. E. The language network. Curr. Opin. Neurobiol. 23, 250–254 (2013).

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  57. 57.

    Ravichandran, C., Shinn, A. K., Öngür, D., Perlis, R. H. & Cohen, B. Frequency of non-right-handedness in bipolar disorder and schizophrenia. Psychiatry Res. 253, 267–269 (2017).

    PubMed  PubMed Central  Article  Google Scholar 

  58. 58.

    Sommer, I. E. C., Ramsey, N. F. & Kahn, R. S. Language lateralization in schizophrenia, an fMRI study. Schizophr. Res. 52, 57–67 (2001).

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  59. 59.

    van Veelen, N. M. J. et al. Reduced language lateralization in first-episode medication-naive schizophrenia. Schizophr. Res. 127, 195–201 (2011).

    PubMed  Article  PubMed Central  Google Scholar 

  60. 60.

    Razafimandimby, A., Tzourio-Mazoyer, N., Mazoyer, B., Maïza, O. & Dollfus, S. Language lateralization in left-handed patients with schizophrenia. Neuropsychologia 49, 313–319 (2011).

    PubMed  Article  PubMed Central  Google Scholar 

  61. 61.

    Sheng, J. et al. Altered volume and lateralization of language-related regions in first-episode schizophrenia. Schizophr. Res. 148, 168–174 (2013).

    PubMed  Article  PubMed Central  Google Scholar 

  62. 62.

    Kubicki, M. et al. Uncinate fasciculus findings in schizophrenia: a magnetic resonance diffusion tensor imaging study. Am. J. Psychiatry 159, 813–820 (2002).

    PubMed  PubMed Central  Article  Google Scholar 

  63. 63.

    Kitis, O. et al. Reduced left uncinate fasciculus fractional anisotropy in deficit schizophrenia but not in non-deficit schizophrenia. Psychiatry Clin. Neurosci. 66, 34–43 (2012).

    PubMed  Article  PubMed Central  Google Scholar 

  64. 64.

    Miyata, J. et al. Abnormal asymmetry of white matter integrity in schizophrenia revealed by voxelwise diffusion tensor imaging. Hum. Brain Mapp. 33, 1741–1749 (2012).

    PubMed  Article  PubMed Central  Google Scholar 

  65. 65.

    Yagmurlu, K., Middlebrooks, E. H., Tanriover, N. & Rhoton, A. L. Fiber tracts of the dorsal language stream in the human brain. J. Neurosurg. 124, 1396–1405 (2016).

    PubMed  Article  PubMed Central  Google Scholar 

  66. 66.

    Saur, D. et al. Ventral and dorsal pathways for language. Proc. Natl Acad. Sci. USA 105, 18035–18040 (2008).

    CAS  PubMed  Article  PubMed Central  Google Scholar 

  67. 67.

    Matarazzo, J. D., Wiens, A. N. & Manaugh, T. S. IQ correlates of speech and silence behavior under three dyadic speaking conditions. J. Consult. Clin. Psychol. 43, 198 (1975).

    Article  Google Scholar 

  68. 68.

    Cuthbert, B. N. & Insel, T. R. Toward the future of psychiatric diagnosis: the seven pillars of RDoC. BMC Med. 11, 126 (2013).

    PubMed  PubMed Central  Article  Google Scholar 

  69. 69.

    Insel, T. R. The NIMH research domain criteria (RDoC) project: precision medicine for psychiatry. Am. J. Psychiatry 171, 395–397 (2014).

    PubMed  Article  PubMed Central  Google Scholar 

  70. 70.

    Andreasen, N. C., Flaum, M. & Arndt, S. The comprehensive assessment of symptoms and history (CASH): an instrument for assessing diagnosis and psychopathology. Arch. Gen. Psychiatry 49, 615–623 (1992).

    CAS  PubMed  Article  Google Scholar 

  71. 71.

    Leucht, S. et al. Dose equivalents for second-generation antipsychotics: the minimum effective dose method. Schizophr. Bull. 40, 314–326 (2014).

    PubMed  PubMed Central  Article  Google Scholar 

  72. 72.

    Boersma, P. & Weenink, D. J. M. Praat: Doing Phonetics by Computer (Version 6.0.37) (Institute of Phonetic Sciences of the University of Amsterdam, Amsterdam, 2013).

  73. 73.

    Quené, H., Persoon, I. & de Jong, N. Praat Script Syllable Nuclei v2 [Praat Script] (2011).

  74. 74.

    Rosen, S. Temporal information in speech: acoustic, auditory and linguistic aspects. Philos. Trans. R. Soc. Lond. B 336, 367–373 (1992).

    CAS  Article  Google Scholar 

  75. 75.

    MacWhinney, B. Tools for Analyzing Talk Part 1: The CHAT Transcription Format (Lawrence Erlbaum Associates, Mahwah, 2000).

  76. 76.

    Brundage, S. B. & Bernstein Ratner, N. A Clinician’s Complete Guide to CLAN and PRAAT 1–43 (2018).

  77. 77.

    Wang, R., Benner, T., Sorensen, A. G. & Wedeen, V. J. Diffusion toolkit: a software package for diffusion imaging data processing and tractography. Proc. Int. Soc. Mag. Reson. Med. 15, 3720 (2007).

    Google Scholar 

  78. 78.

    Smith, S. M. et al. Advances in functional and structural MR image analysis and implementation as FSL. Neuroimage 23, 208–219 (2004).

    Article  Google Scholar 

  79. 79.

    Kubicki, M., Westin, C. F., McCarley, R. W. & Shenton, M. E. The application of DTI to investigate white matter abnormalities in schizophrenia. Ann. N. Y. Acad. Sci. 1064, 134–148 (2005).

    PubMed  PubMed Central  Article  Google Scholar 

  80. 80.

    Alba-Ferrara, L. M. & de Erausquin, G. A. What does anisotropy measure? Insights from increased and decreased anisotropy in selective fiber tracts in schizophrenia. Front. Integr. Neurosci. (2013).

    Article  PubMed  PubMed Central  Google Scholar 

  81. 81.

    Smith, S. M. et al. Tract-based spatial statistics: voxelwise analysis of multi-subject diffusion data. Neuroimage 31, 1487–1505 (2006).

    PubMed  Article  PubMed Central  Google Scholar 

  82. 82.

    Hua, K. et al. Mapping of functional areas in the human cortex based on connectivity through association fibers. Cereb. Cortex 19, 1889–1895 (2009).

    PubMed  Article  PubMed Central  Google Scholar 

  83. 83.

    Wakana, S., Jiang, H., Nagae-Poetscher, L. M., van Zijl, P. C. M. & Mori, S. Fiber tract-based atlas of human white matter anatomy. Radiology 230, 77–87 (2004).

    PubMed  Article  PubMed Central  Google Scholar 

  84. 84.

    Winkler, A. M., Ridgway, G. R., Webster, M. A., Smith, S. M. & Nichols, T. E. Permutation inference for the general linear model. Neuroimage 92, 381–397 (2014).

    PubMed  PubMed Central  Article  Google Scholar 

  85. 85.

    Benjamini, Y. & Hochberg, Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J. R. Stat. Soc. Ser. B 57, 289–300 (1995).

    Google Scholar 

Download references


I.E.C.S. received a TOP grant from The Netherlands Organization for Health Research and Development (ZonMW, project: 91213009). We are grateful to Zoë Dalmijn, David van de Kamp, Elleke Tissink, Arlena Schippers, Ellen Collée, Janna van Egmond, Kris Snoek, Jolien Jacobs, Joyce Berkhout, Hadassa Kwetsie and Julia Oostdam for their help with data collection and preparation. We would like to thank Olga van de Goor (O.G.) for her help with translating the semi-structured questionnaire to English.

Author information




J.N.d.B. designed and coordinated the project. M.v.H and J.B. performed MRI pre-processing. J.N.d.B. took the lead in writing the manuscript. M.J.H.B. supervised inclusion of the participants. All authors provided critical feedback and helped shape the interpretation and manuscript. All authors approved the final version of this manuscript.

Corresponding author

Correspondence to J. N. de Boer.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

de Boer, J.N., van Hoogdalem, M., Mandl, R.C.W. et al. Language in schizophrenia: relation with diagnosis, symptomatology and white matter tracts. npj Schizophr 6, 10 (2020).

Download citation


Sign up for the Nature Briefing newsletter for a daily update on COVID-19 science.
Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing