Schizophrenia is associated with a range of alterations in brain structure and function1,2,3,4, some of which are related to a family history or specific genetic risk factors. Consistent relationships between delusions, hallucinations, and brain structure and function have, however, proved elusive—potentially because of power issues in relatively small clinical samples and confounds such as antipsychotic drug exposure and substance abuse. Psychotic-like experiences (PLEs) of lesser severity are present not only in patients but also in 5–8% of the general population5, and to some extent predict transition to psychiatric disorders among those with higher PLEs6; with cohort studies supporting a continuity between subclinical and clinically significant psychotic symptoms7,8. A recent study found shared genetic aetiology between PLEs and several psychiatric and neurodevelopmental disorders, indicating that PLEs may be related to a general risk for mental health disorders9. Studies of the relationship between PLEs and brain imaging metrics have, however, been scarce and characterized by small sample sizes (with N ranging from 25 for auditory hallucinations to 76 for any PLEs10,11,12,13). Functionally, PLEs have been associated with altered brain dynamics, in particular with default-mode hypoconnectivity11,14. Nonetheless, such evidence validates the study of the pathophysiology of these clinical phenotypes in non-clinical populations15, with large studies from the general population also offering increased power to detect such effects.

Aberrant functioning and organization of the salience network, default mode network (DMN), and central-executive network (CEN) are core features of several psychiatric and neurological disorders16; with patients with a diagnosis of schizophrenia showing structural and functional impairments in all three networks17. The salience network includes the insula and anterior cingulate cortex, and is involved in the identification of biological and behaviourally relevant stimuli and the subsequent coordination of neural resources to guide flexible behaviour18,19. Aberrant intrinsic functional connectivity of the salience network has been observed in schizophrenia20,21 and in individuals at clinical high risk for psychosis22, and is posited to underlie persecutory delusions in particular23. The DMN is a distributed system of fronto-temporal-parietal cortex that is activated during passive cognitive states and deactivates during several cognitive tasks24. In schizophrenia, the DMN is overactive with significant correlations between the activity of subregions of the DMN and positive symptoms25; although the evidence is somewhat inconsistent25,26,27. The CEN is a frontoparietal system coactivating the dorsolateral prefrontal cortex and the posterior parietal cortex during several cognitive tasks28. Increased functional connectivity between the DMN and CEN has been linked to hallucinations in patients with schizophrenia20.

Schizophrenia is highly polygenic, with many common alleles of small effect, and increasing numbers of genome-wide significant loci have been identified as sample sizes increase29,30,31. Summary statistics from large-scale genome-wide association study (GWAS) allow the degree of genetic liability for a heritable trait to be estimated in healthy subjects outside the population in which the original GWAS was conducted32,33. Only a small number of studies have analyzed the relationship between polygenic risk score for schizophrenia (PRSSZ) and neuroimaging biomarkers in healthy samples32,33,34,35,36,37 but some of these associations map on to regions likely to be involved in the generation of psychotic symptoms and PLEs.

Thus, in this study we investigated how PRSSZ relates to neuroanatomical properties of the salience network, DMN, and CEN, and thence to PLEs; in addition to formally testing whether the association between PRSSZ and PLEs was mediated by brain structure. We computed water diffusion magnetic resonance imaging (MRI) parameters of white matter tracts (using fractional anisotropy; FA), cortical thickness (CT), and grey matter volume (GMV) of the regions involved in these networks in a large sample of healthy participants from UKBiobank in whom any such associations would not be confounded by illness-associated factors. We employed a novel approach based on ROI–ROI analysis (derived from connectome processing) which extends our previous a priori network selection methods38,39, allowing a much finer-grained network approach than using other techniques which quantify white matter connectivity without direct subject-specific linkage to cortical or subcortical regions. In addition, previous studies have suggested that schizophrenia may be accompanied by accelerated ageing40, indicating for instance, significant declines in white matter coherence more than twice that of age-matched controls41. Therefore, to examine possible accelerated brain ageing in brain structure, we included an interaction term between age at MRI scanning and PRSSZ in all analyses.



UKBiobank ( comprises around 500,000 community-dwelling participants recruited from across the United Kingdom of Great Britain and Northern Ireland between 2006 and 2014. A subset of the participants who were part of the initial recruitment attended for head MRI scanning at an average of around 4 years after the initial visit (all data presented in this analysis were collected on the same scanner). The current study uses the 5k neuroimaging data release (Supplementary Fig. 1). UKBiobank received ethical approval from the Research Ethics Committee (reference 11/NW/0382). Those participants who had been admitted to a hospital with a diagnosis of schizophrenia or bipolar disorder with psychotic symptoms were excluded from our analysis. In order to comprehensibly study the PLEs phenotype, additional analyses comprised: (1) the exclusion of participants who had any psychiatric-related admissions (diagnoses in Supplementary Table 1) and (2) the inclusion of the whole sample (N = 3875). The present analyses were conducted as part of UKBiobank application 16,124, linked to 4844 and 10,279. All participants provided informed consent (

Polygenic risk score calculation

The details of the array design, genotyping, quality control, and imputation have been described previously42. Quality control included removal of participants based on missingness, relatedness, gender mismatch non-British ancestry, and participants based upon overlap in Psychiatric Genomics Consortium (PGC) prediction samples and schizophrenia status. Polygenic profiles were created for schizophrenia in all the genotyped participants from UKBiobank using PRSice43. PRSice calculates the sum of alleles associated with the phenotype of interest across many genetic loci, weighted by their effect sizes estimated from a genome-wide association study of that phenotype in an independent sample. These effect sizes have been estimated previously in GWAS for schizophrenia (PGC-SCZ,; 36,989 cases, 113,075 control subjects).

Before creating the scores, single nucleotide polymorphisms (SNPs) with a minor allele frequency <1% were removed and clumping was used to obtain SNPs in linkage equilibrium with an r2 < 0.25 within a 200 bp window. Five scores were created for each individual using SNPs selected according to the significance of their association with the phenotype at nominal p-value thresholds of 0.01, 0.05, 0.1, 0.5, and 1.0 (all SNPs). Our primary analyses used scores generated from a list of SNPs with a GWAS training set of p ≤ 0.1, as this threshold was shown to explain the most phenotypic variance in the discovery cohort31 (results for 0.5 and 1.0 thresholds are presented in Supplementary Results 1). Fifteen multidimensional scaling factors (estimated from SNP data) were entered into all models as additional “nuisance” covariates to control for population stratification, along with age and genotyping array.

Psychotic-like experiences information

Lifetime PLEs were taken from a Mental Health Questionnaire (MHQ; UKBiobank Category: 144) and was sent to all participants who provided an email address from July 2016 to July 2017. Responses to the following questions were dichotomized: “Did you ever believe that there was an unjust plot going on to harm you or to have people follow you, and which your family and friends did not believe existed?”, “Did you ever believe that a strange force was trying to communicate directly with you by sending special signs or signals that you could understand but that no one else could understand (for example through the radio or television)?”, “Did you ever hear things that other people said did not exist, like strange voices coming from inside your head talking to you or about you, or voices coming out of the air when there was no one around?”, and “Did you ever see something that wasn’t really there that other people could not see?”. We categorized these PLEs as persecutory delusions, delusions of reference, and auditory and visual hallucinations, respectively. This questionnaire explicitly indicated not to include those instances when the participant was dreaming, half-asleep, or under the influences of alcohol or drugs. Moreover, level of distress in relation to PLEs was defined as “Not distressing at all, it was a positive experience”, “Not distressing, a neutral experience”, “A bit distressing”, “Quite distressing”, and “Very distressing”, and was coded as a continuous variable (ranging from −1 to 3, in order of increasing distress). Participants that responded “do not know” or “prefer not to answer” were excluded from analyses in all cases.

Imaging analysis

Full details of the image acquisition and processing can be found on the UKBiobank website (, Brain Imaging Documentation (, and in ref. 44. More information regarding scan acquisition and image processing can be found in Supplementary Methods 1 and Supplementary Table 2 for a list of abbreviations of each node. Network nodes were selected based on previous studies, for the salience network we selected the following regions: thalamus, amygdala, ventral diencephalon, caudal anterior cingulate, and insula18,45; for the DMN: lateral orbitofrontal, medial orbito frontal, middle temporal, pars orbitalis, posterior cingulate, and supramarginal28,46; and for the CEN: caudal middle frontal, inferior parietal, rostral middle frontal, superior frontal, and superior parietal45,47,48. Figure 1 shows the nodes and white matter pathways selected for each of the networks studied here. For the present study, data available for participants who were unrelated, survived the quality control process and had full imaging data available, is provided in Table 1.

Fig. 1: Diagram of nodes of interest and their linking pathways derived from the connectome for the networks of interest.
figure 1

Colours were randomly assigned, and width of the links are proportional to mean FA values across all participants. Axial and coronal views of the networks with colour-coded FA gradient. L: left, R: right. A list of abbreviations is provided in Supplementary Table 2.

Table 1 Participants characteristics.

Network construction

For each subject, two networks were constructed: the number of streamline (NOS) network that was created using the number of streamlines connecting each pair of the 85 ROI (network node) pairs from the default FreeSurfer cortical49 and subcortical regions (Desikan–Killiany atlas, see Supplementary Table 2 for a list of abbreviations of each node); and the FA-weighted network that was constructed by recording the mean FA value along streamlines. The endpoint of a streamline was considered to be the first grey matter ROI encountered when tracking from the seed location. In order to reduce the number of spurious connections derived from probabilistic tractography, we applied a consistency-based threshold to the NOS matrices using the numbers of streamlines connecting all 85 ROI and preserving exclusively the top 30% white matter tracts that were most consistent across subjects50. This mask was then applied to the FA-weighted connectivity matrices. For each FA-weighted connectivity matrix for the thresholded network, the salience, DMN and CEN masks were applied based on our bilateral nodes of interest. Mean FA was computed using only the non-zero matrix elements.

Statistical analyses

We undertook an a priori network-of-interest (NOI) approach, based on the literature cited above.

Linear regressions for individual network components

Initially, we conducted linear regressions between PRSSZ and each node (CT, GMV) and edge (FA) within each of the selected NOIs. Within each model, each morphometric measure was set as the dependent variable, PRSSZ as the independent variable, controlling for age, sex, and the interaction between age and PRSSZ.

Network analyses

Next, we aimed to test whether associations with PRSSZ were best represented in the data at the network-general level (common pathway), or whether there were additional unique associations with specific network components (common + independent pathways analysis), or whether the associations were simply best described by independent pathways51. We did so in a structural equation modelling (SEM) framework. First, we fitted measurement models (i.e., models that relate the latent factor to its manifest variables) to ascertain the degree to which we could describe overall network integrity at the network level; this initial measurement model was a confirmatory factor analysis in which we tried to estimate a single latent construct of global NOI by incorporating altogether grey and white matter metrics pertaining to all nodes within a NOI. However, these models all exhibited poor fit to the data, and the latent measures of the grey and white matter did not correlate significantly.

Common model analysis

Thus, we opted to construct three measurement models for each NOI in which a latent factor was indicated by all network components separately: CT (derived from the bilateral cortical nodes), GMV (including all bilateral cortical and subcortical nodes), or white matter FA (derived from all pathways connecting nodes within each NOI). These measurement models all fitted the data well except for FA in the DMN and CEN which did not achieve acceptable model fit statistics and thus, were not included in these analyses.

For each model, we then tested associations between PRSSZ, age, the interaction PRSSZ × age, and MRI parameters. We did so by fitting multiple indicators, multiple causes (MIMIC) model52; Fig. 2a shows a simplified diagram of the SEM framework (see example of code used in Supplementary Methods 2). Within the model, each brain imaging measure was adjusted at the manifest level for sex and either whole brain average CT, whole brain total GMV, or whole brain mean FA, while PRSSZ was adjusted for sex, population stratification components, and genotyping array. Model fits were assessed using the following indices and cut-offs: Comparative Fit Index (CFI), Root Mean Square Error of Approximation (RMSEA), and Standardized Root Mean Square Residual (SRMR). We allowed for certain residual correlations between manifest variables in those cases in which their addition improved the model fit significantly. SEM analyses were conducted using the package “lavaan”53 in R with full-information maximum likelihood estimation to use all data available.

Fig. 2: Diagrams of structural equation models (SEM).
figure 2

a Multiple indicators, multiple causes (MIMIC) model52 for neurostructural properties of each network. A separate model was applied to FA, grey matter thickness (CT), and grey matter volume (GMV). From each individual bilateral node (L: left; R: right) or pathway, a latent score was calculated for FA, CT, GMV controlling for sex and whole-brain structural properties at the manifest level (i.e., whole brain FA/CT/GMV). Relation between FA/CT/GMV and PRSSZ is indicated by path β; path γ represents the association between the interaction of age and PRSSZ and FA/CT/GM factors; path δ represents the association between age and the latent factor. PRSSZ was corrected for sex and population stratification (paths not shown). The dashed line represents a possible residual correlation between nodes. b Path diagram of mediation model, where the ε coefficient represents the regression of X on M, λ coefficient the regression of M on Y, and σ coefficient the direct path of X on Y. The product of the ε and λ coefficients describes the indirect path of X on Y through M (σ′).

MIMIC common + independent pathways analysis

To assess whether associations between PRSSZ and latent constructs of network integrity were best characterized at the network-general level, or whether there were additional associations with particular brain regions beyond this, we ran a common plus independent pathways analysis51 (see details in Supplementary Methods 3).


Due to the low percentage of participants who answer “Yes” to experiencing PLEs (see Table 1), we first matched those participants to those who answered “No” by age and sex with a ratio 1:10 using the function “”, a propensity score method from the “MatchIt” package54. Linear regressions were then calculated independently for each network metric (including all network nodes or edges) with each PLEs as the response variable, covarying for either whole brain total volume (for node’s volume; to control for differences in whole-brain volume across participants), average CT (for node’s CT; to control for differences in whole-brain CT across participants) or whole brain mean FA (for edges; to control for differences in whole-brain FA across participants), age at MRI and sex. Therefore, correction for multiple comparisons was performed within each network metric and for each PLE. Raincloud plots55 were used to plot group differences between participants with PLEs and their matched controls.

Mediation analyses

We further sought to formally test whether the association between PRSSZ and PLEs was mediated by brain structure (at the manifest level). We did this in the form of a mediation model using the “lavaan” package53 with full-information maximum likelihood estimation, confidence intervals were reported (CI; constructed using 1000 bootstraps). This analysis was performed only in those cases where the path between brain structure (M) and PLEs (Y) was significant (pFDR < 0.050); mediation was observed when the change from σ to σ′ was statistically significant (confidence intervals did not include zero). Within the model, each brain region was adjusted for age, sex, and either average CT (CT analysis) or whole brain total volume (GMV analysis). Figure 2b shows a simplified diagram of the mediation framework.

Additional analyses

In order to determine whether the signal could be driven by any complication associated with any psychiatric disorder beyond a diagnosis of psychosis (i.e., distress, dysfunction, co-morbidities, medication, unhealthy lifestyles, etc.), we also performed an analysis that excluded all participants with any psychiatric diagnoses from the PLEs analysis (Supplementary Results 2). Due to the possible clinical significance of level of distress in relation to PLEs, we first computed logistic regressions with PLEs as the response variable and level of distress, age, and sex as predictor variables (using the “lm” function from the “stats v3.6.2” package). For those significant associations, we tested whether the level of distress caused by PLEs could be mediating the relationship between brain structure and PLEs (Supplementary Results 3). We also tested the hypothesis that increased number of PLEs (i.e., individual sum of these four types of PLEs) was associated with a higher level of distress56 (Supplementary Results 4).

All analyses were performed in R ( and standardized betas were reported throughout. For each section/model of statistical analysis, significance (p) values (α = 0.050) were corrected for multiple comparisons using false discovery rate (FDR)57 (using the “p.adjust” function from the “podkat” package).


Participant characteristics are presented in Table 1, and PLEs prevalence in Supplementary Table 3. Mean values for CT and GMV for each node (according to the Desikan–Killiany atlas49) are shown in Supplementary Fig. 2. All SEM models showed acceptable fit to the data (fit indices are shown in Supplementary Table 4).

PRSSZ analyses

Salience network

Linear regressions for individual network components

There were no significant associations between cortical and subcortical GMV and PRSSZ (pFDR > 0.050). For CT, nominally significant negative associations were found between the right and left insula and PRSSZ (β = −0.046, SE = 0.019, pFDR = 0.050, β = −0.039, SE = 0.018, pFDR = 0.065, respectively). Interactions between age and PRSSZ were significant for CT of the right caudal anterior cingulate (CAC) (β = −0.054, SE = 0.021, pFDR = 0.040). There were no significant associations between FA and PRSSZ (pFDR > 0.050).

MIMIC common + independent pathways analysis

The association between the latent factor for GMV and PRSSZ was not significant (r = −0.026, SE = 0.014, p = 0.242). Allowing for a direct effect of PRSSZ on left thalamus GMV significantly improved model fit (χ2[1] > 4.917, p < 0.028; independent pathway estimates: β = 0.017, p = 0.032). A direct effect of right CAC GMV on PRSSZ × age (β = −0.035, p = 0.088)—the second independent pathway added—significantly improved model fit (χ2[1] = 5.893, p = 0.015). Results are shown in Fig. 3a. In addition to the association between PRSSZ and the latent factor of CT (r = −0.069, SE = 0.015, p = 0.010), an independent pathway emerged from PRSSZ × age to right CAC CT (β = −0.043, p = 0.031), which improved the model fit (χ2[1] = 4.606, p = 0.030; Fig. 3b). There was no significant association between the latent factor for FA and either PRSSZ or PRSSZ × age interaction (p > 0.050).

Fig. 3: Diagrams of MIMIC and independent pathway models of the salience network.
figure 3

a Common + independent pathways model for latent factor of grey matter volume. b Common + independent pathways model for latent factor of grey matter cortical thickness. Dashed lines represent independent pathways while dotted lines residual correlations between nodes. Black lines represent statistically significant pathways (p < 0.050) while grey lines non-significant pathways (p > 0.050).

Default mode network

Linear regressions for individual network components

There were no significant associations between GMV, CT or FA and PRSSZ or the interaction between PRSSZ × age (pFDR > 0.050).

MIMIC common + independent pathways analysis

The associations between the latent factors for GMV and CT and PRSSZ were not significant (rGMV = −0.040, SE = 0.012, p = 0.111; rCT = −0.028, SE = 0.015, p = 0.273). An independent pathway emerged from PRSSZ × age to right supramarginal CT (χ2[1] > 5.705, p < 0.018; independent pathway estimate: β = −0.035, p = 0.008). Interactions between PRSSZ × age were not significantly associated with the latent constructs (p > 0.050).

Central executive network

Linear regressions for individual network components

No significant associations were found between the GMV, CT or FA and PRSSZ, or the interaction between PRSSZ × age (p > 0.050).

MIMIC common + independent pathways analysis

The associations between the latent factors for grey matter and PRSSZ were not significant (p > 0.050). For GMV, allowing for a direct effect of PRSSZ on right inferior parietal GMV (β = 0.039, p = 0.004) resulted in significant improvement in model fit (χ2[1] > 7.741, p < 0.006). For CT, an independent pathway emerged from PRSSZ × age to right superior frontal CT (χ2[1] > 7.512, p < 0.007; independent pathway estimate: β = 0.028, p = 0.002) and right inferior parietal CT (χ2[1] > 4.314, p < 0.039; independent pathway estimate: β = −0.024, p = 0.034). Interactions between PRSSZ × age were not significantly associated with any latent construct (pFDR > 0.050).

Psychotic-like experiences

Table 1 shows the frequencies and percentages of respondents with PLEs, Supplementary Table 3 shows sample characteristics for participants reporting PLEs versus non-PLEs. Figure 4a shows raincloud plots representing differences in CT and GMV between participants reporting PLEs and their matched controls.

Fig. 4: Raincloud plots representing differences in cortical thickness (CT) and volume (GMV) between participants reporting PLEs (blue) and matched controls (green).
figure 4

a Bilateral CT of the insula and auditory hallucinations. b GMV of right supramarginal gyrus and delusions of reference. c GMV of the left pars orbitalis and persecutory delusions. d CT of the right supramarginal gyrus and persecutory delusions. Note: CT is measured in cm and GMV in cm3. Asterisks represent significant differences between groups (*p < 0.05, **p < 0.01).

Within the salience network, significant negative associations were found between the CT of the right and left insula and auditory hallucinations (Ncases = 51, Ncontrols = 510; β = −0.114, SE = 0.035, pFDR = 0.004 and β = −0.083, SE = 0.036, pFDR = 0.045, respectively). Associations between total number of PLEs and CT/GMV of salience network nodes were all non-significant.

For the default mode network, a significant negative association was found between the GMV of the right supramarginal gyrus and delusions of reference (Ncases = 11, Ncontrols = 110; β = −0.195, SE = 0.061, pFDR = 0.022) and between persecutory delusions and GMV of the left pars orbitalis (Ncases = 12, Ncontrols = 120; β = 0.219, SE = 0.069, pFDR = 0.021). For persecutory delusions, there was a positive association with the CT of the right supramarginal gyrus (Ncases = 13, Ncontrols = 130; β = 0.177, SE = 0.048, pFDR = 0.003). We did not find any significant association between total number of PLEs and GMV/CT of nodes. No significant associations were found between CEN’s nodes and PLEs (pFDR > 0.050). The overall PLE findings are summarized in Supplementary Table 5.

Mediation analyses

The mediation models using PRSSZ and mediators met the criteria for a close-fitting model (RMSEA = 0, CFI = 1, SRMR < 0.05). We examined the hypothesis that higher PRSSZ was associated with symptom severity via the brain structures significantly associated with PLEs in the salience and DMN. First, we tested the association between PRSSZ and total number of PLEs in the whole UKBiobank sample, independently of diagnosis (NPRS = 302,575, NPLEs = 157,305, with N = 7803 answering “Yes” to experiencing at least one PLEs). However, we did not find any significant association (β = 0.020, SE = 0.018, p = 0.283). We found that the linear association between PRSSZ and auditory hallucinations was significantly mediated by CT of the right insula (from σ = 0.023 to σ′ = 0.004, CI [0.020, 0.127] with the right insular cortex mediating 82.6% of the association between PRSSZ and auditory hallucinations, Fig. 5). These results were consistent across all PRSSZ thresholds (Supplementary Results 5). No significant mediations were observed for any other ROIs.

Fig. 5: Path diagram of the mediation model, where the ε coefficient represents the coefficient of regressions of PRSSZ on cortical thickness (CT) of the right insula, and λ the coefficient of the regression of CT of the right insula on auditory hallucinations, and σ coefficient of the direct path of PRSSZ and auditory hallucinations.
figure 5

Coefficient of σ′ represents the indirect path of PRSSZ on auditory hallucinations through CT of the right insula. Note: Superscript a represents p values < 0.01 and superscript b refers to confidence intervals not including zero.


The present study aimed to investigate different structural properties of the salience, DMN, and CEN networks in relation to PRSSZ and PLEs in a large population-based cohort of healthy participants. This is, to our knowledge, the first study to investigate these polygenic-MRI associations using the structural properties of brain networks from a connectome perspective, where the properties of estimated connections that directly link distal cortical and subcortical regions in each individual are examined.

At the node level, we found nominally significant associations between thinner right and left insular cortices and higher PRSSZ. At the network level we found that higher PRSSZ was associated with reduced CT across the salience network, with some specific associations between PRSSZ and individual brain properties emerging. The findings reported here indicate both global network and domain-specific pathways implicated in schizophrenia, suggesting that those individual regions showing an independent significant association with PRSSZ may be more susceptible/resilient to the effect of PRSSZ beyond a latent factor of network integrity. We found several significant associations between PLEs—in particular regarding auditory hallucinations—and structural properties of the salience and DMN; with the right insular cortex largely mediating the association between auditory hallucinations and PRSSZ.

Impairments in white matter micro- and macrostructure are a common feature in schizophrenia with healthy relatives who are genetically at higher risk of developing schizophrenia also exhibiting impairments in FA58. Our results are in accordance to a previous study, reporting non-significant associations between a general factor of FA (including 27 major white matter tracts) and PRSSZ in a previous release of this data (n = 816)59, though the current approach offers greater brain regional fidelity and statistical power. Despite the apparent alterations in white matter structure in schizophrenia, only a small number of studies have reported significant associations between PRSSZ and white matter in healthy and clinical samples34,37,60. Significant reductions in global white matter volume in dizygotic twins discordant for schizophrenia have been reported while reductions in global grey matter were exclusively observed in schizophrenia61. Thus, the null findings of white matter observed here may partly be due to the aggregation of genetic and environmental risk factors found in affected individuals and their relatives compared with healthy individuals.

Psychotic-like experiences

The reported prevalence of PLEs in this imaging sample (4.75%) is similar to that in the whole UK Biobank sample (5%)62, and with previous population studies63,64. We did not find any significant association between PRSSZ and total number of PLEs in the entire UKBiobank sample (N = 308,693), consistent with previous studies investigating this relationship in adolescence65,66. A recent UKBiobank study found a weak association between the presence of any PLEs and PRSSZ at a threshold of p ≤ 0.05 (OR, 1.09; 95% CI, 1.06–1.12; adjusted R2 = 0.001; p = 2.96 × 10−11, N = 500,000); however, as the authors noted, this association may be biased by the possibility that their analysis included sample overlap between UKBiobank and the training sets9.

The associations between PLEs and insular cortices are consistent with its involvement in interoception; this awareness of the body’s internal state comprises emotional responses, complex cognitive states, and the sense of self67,68. Though our sample sizes were modest they are relatively large compared to the previous literature on PLEs and brain imaging in adults10,11,12,13,69 (for instance, studies on auditory hallucinations with samples of less than 30 participants compared to our sample of 51 experiencing auditory hallucinations and 134 for any PLEs). Moreover, our results are consistent with correlations between psychotic symptoms, activation, volume, and surface area of the insula in clinical70,71 and high-risk samples22; with hallucinations being associated with structural aspects of the insula72. Our mediation analyses extend these findings by showing the CT of the right insula largely mediating the association between PRSSZ and auditory hallucinations; in accordance with evidence suggesting that predominant right lateralization may discriminate auditory–verbal hallucinations from normal inner speech73. Impaired interoception has been associated with a loss of perception of the self, which may contribute to misattributions of agency in schizophrenia; indeed, higher frequency of prediction errors in sensory cues74 and attribution of inner speech to that of others have been reported in the disorder75,76. Thus, we suggest that the observed insular differences may contribute to a higher susceptibility to commit external attribution errors in subjects experiencing PLEs. Overall, our results suggest that the salience network may play a key role in PLEs compared to the DMN and CEN. This is supported by studies showing that positive symptoms in schizophrenia are related to variations in the morphology, metabolism, and neural activity of the nodes of the salience network but not of DMN or CEN77.

The positive correlation between the GMV of the right pars orbitalis—ventral subregion of the inferior frontal gyrus (IFG)—and persecutory delusions is, however, at odds with previous studies reporting reduced GMV of the pars orbitalis in high risk individuals6,78 and negative associations between its volume and positive symptoms in clinical samples79; suggesting that involvement of the pars orbitalis may be limited to clinical populations. We also found that brain structure differences in the supramarginal gyrus were associated with delusions. This is in line with the role of the supramarginal gyrus in processing auditory inputs, especially language80, and the relation between lower volume and auditory hallucinations in schizophrenia81. The supramarginal gyrus, here analyzed as a node in a salience network, is part of the somatosensory association cortex, which is involved in the interpretation of the postures and gestures of other people, including the empathic understanding of others. Disruption of the supramarginal gyrus has been linked to impaired empathy and people “projecting” emotions onto others, and such psychological processes have been implicated in delusions of persecution and reference since Freud82. Intriguingly, we have previously shown fMRI over-activation and dysconnectivity of the intraparietal sulcus bordering the supramarginal gyrus in people at high risk of schizophrenia with sub-clinical (partial) delusions and hallucinations83,84. Although the relationship between structural and functional MRI is still not clear, it may be conceivable that over-activation of the supramarginal gyrus in people with such experiences could increase CT. In non-psychotic individuals, auditory hallucinations have been linked to abnormal brain connectivity within the DMN and with auditory cortices (sample of participants experiencing auditory hallucinations of N = 25 and N = 29, respectively10,12). The involvement of right supramarginal, insula, and IFG—all of them language-related areas—in PLEs has been extensively documented, indicating that those experiencing auditory hallucinations tend to show impaired speech perception85 with an engagement of both speech production and reception circuitry86. Auditory hallucinations are commonly manifested as voices because the human auditory system is tuned to the natural priors of speech87 with its content possibly contributing to the formation of delusions by attributing meaning/agency to the experience, supported by our findings involving the supramarginal gyrus and delusions.


We limited our analysis to those networks implicated in previous functional and structural studies, translating those nodes onto a common parcellation scheme. Given issues of validity and comparability across brain atlases, and the implications of this for underlying connectivity88,89,90, further research should aim to replicate this study by computing the networks at different levels of node granularity and with different structural properties. We applied a consistency-based thresholding method in an attempt to remove spurious white matter connections, exclusively preserving the top 30% that were most consistent across subjects. Though current evidence indicates that the human brain is likely to exhibit a fairly high degree of sparsity, principles of white matter connectivity in humans are not sufficiently comprehensive (when compared to the mouse91) to allow the confident implementation of detailed anatomical priors92; some connections may have therefore been pruned erroneously. We investigated whether neurostructural properties of brain networks are associated with PLEs. However, due to the still modest frequencies of PLEs in this sample, results should be interpreted cautiously. A previous study has indicated a potential “healthy volunteer” selection bias in this sample93, in particular, people with lower socio-economic status, chronic illness, and smokers were under-represented62. This suggests that this cohort may not be representative of the sampling population.


In a large sample of largely healthy participants from UKBiobank, a higher genetic liability for schizophrenia was associated with subtle neurostructural differences; in particular, a thinner cortex across the salience network. Beyond this global relationship, some independent paths emerged in each structural network. We also found significant associations between PLEs and the insula, supramarginal gyrus, and pars orbitalis; with the insula largely mediating the association between PRSSZ and auditory hallucinations. Our results in a large healthy sample support previous studies on aberrant activation of language-related areas and the externalization of inner speech phenomena in clinical samples. Studies with greater statistical power should validate the results of this research, which indicates that the study of the non-clinical phenotype may represent a valid approach to investigate the pathophysiology of the clinical phenotype and suggests a shared genetic aetiology between the clinical and non-clinical phenotype.