Introduction

Stroke is the leading cause of long-term disability worldwide1,2. By 2047, the prevalence of stroke is estimated to increase by 27%, with a disproportionate increase of stroke events in women relative to men3,4,5. Estimations suggest a ~55,000 increase of absolute stroke events in women compared to men each year (women: 425,325 cases, men: 369,675 cases)3,4,5. The anticipated rise in stroke prevalence will put a strain on the economic burden of our health systems. The annual stroke-related social and health costs have reached >60€ billion in Europe alone6,7. Post-stroke aphasia places an overwhelming burden on many of the >25 million stroke survivors worldwide8,9. For affected individuals, the sequelae of stroke can prompt drastic life changes, with severe long-term impairments10,11. Aphasia severely impacts everyday communication and social interactions by affecting language production and comprehension. Aphasia is associated with an increased mortality rate12, longer hospitalization, poorer functional recovery, and reduced overall probability of returning to work13.

As an underappreciated fact, women typically suffer strokes with more severe consequences and language impairment even if the lesion size is comparable to men14,15. Yet, the majority of stroke lesion studies have been underpowered to detect robust sex disparities given the surfeit of small-sample investigations16,17. A recent meta-analysis argued that sex-nuanced differences in aphasia outcome might be partly explained by the typically older age of stroke onset in women compared to men18. For that reason, placing special emphasis on sex effects requires well-powered patient samples while carefully controlling for influencing factors, especially given that many sex-related neural correlates are modest in effect size19. The present investigation in a large patient cohort aims at a direct probabilistic assessment of the subtle sex effects in how the topography of white matter tissue damage leads to differences in language impairments. Such insights on sex-specific lesion–outcome associations are needed to improve personalized treatment prospects in light of the increased burden of aphasia in women.

Aphasia is a frequent consequence of embolic occlusion of the middle cerebral artery. However, brain lesion distributions are highly variable in the preserved and impaired language capacity20,21,22. In attempting to harmonize the heterogeneous clinical manifestation of aphasia, decades of research efforts have been devoted to determining the complex neural basis of language processing. Today, there is growing consensus that the human language faculty is supported by distributed large-scale networks (for meta-analyses, see refs. 23,24,25; for functional evidence, see refs. 26,27,28; for lesion studies, see ref. 29). Numerous studies have shed light on the anatomical white matter connections at play. In particular, the arcuate fasciculus (AF) has been at the center of attention for decades30,31. Early anatomical dissection outlined its anatomical trajectory, which suggested a direct fiber connection of Wernicke’s area in the left temporal lobe to Broca’s area in the left frontal lobe32,33. Applications of diffusion tensor imaging techniques promoted the idea of a further division of the AF into three segments. One direct temporo-frontal segment and two lateral segments have been proposed to compose the overall AF. The lateral tract subsegments implement an indirect fronto-parietal and temporal-parietal pathway (anterior and posterior segment, respectively)34,35. The AF may provide an anchor for the canonical perisylvian network and has been placed at the core of the neuroarchitecture of the uniquely human language abilities36.

Attempting to marry its structural and functional bases, researchers have integrated the core perisylvian pathway into different functional-anatomical modules. One popular idea is the dual-stream concept of language37,38,39,40,41,42. This view puts forward a ventral stream for mapping sound-to-meaning and a dorsal stream comprising the AF and superior longitudinal fascicle for mapping sound-to-motor representations. The ventral pathway includes the uncinate fasciculus (UF), the inferior fronto-occipital fasciculus (IFOF), and inferior longitudinal fasciculus (ILF) that together assist in processing speech signals for language comprehension31,41,43,44. The dorsal stream is thought to be left-dominant, while the ventral stream is believed to be more bilaterally organized31,38,45. However, there is strong interindividual variability in the degree of functional hemispheric dominance in general46,47,48, and accumulating evidence points towards aspects of sex-specific differentiation in particular. For example, more pronounced left lateralization in the long segment of the AF was found in men49. Sex differences have also been reported for lateralization of functional activation during carefully designed language tasks50,51,52,53.

Clinically, ischemic stroke generally affects male and female patients differently in terms of incidence, severity, survival, and potential for recovery4,54,55. In fact, women generally suffer strokes with more severe neuropsychological consequences compared to those of men14,56. Similar lesion volume effects were noted in cases of aphasia, in which smaller ischemic stroke lesions were capable of producing aphasia in women than men15. The reasons behind sex idiosyncrasies in functional outcomes after stroke remain elusive. Differences are often explained by poorer pre-stroke function, more comorbidities such as depression, and social isolation or differences in age at stroke onset (see refs. 18,57). Yet, adjustment for these factors does not adequately explain the observed discrepancies in stroke outcomes between men and women58,59.

Despite the apparent differences in stroke outcome between women and men, sex is often treated as a nuisance variable in the overwhelming majority of stroke lesion studies20,60,61. This established practice blinds researchers to detect any potential neuroanatomical effects that depend on sex. Compounding these challenges that hold barriers to understanding sex-specific brain lesion effects, lesion-symptom mapping studies often constrained the analyses to the predominantly affected hemisphere. That is, as a general trend, the left hemisphere is specifically investigated in language impairment and the right hemisphere in neglect20,60,62,63,64. To overcome several existing roadblocks, the present work takes advantage of Bayesian principles as a formal device to quantify degrees of sex divergences. In contrast to frequentist post hoc estimations, the Bayesian regime provides a direct quantification of uncertainty around all model parameters by appropriately handling all considered sources of variation, and thus enable more careful quantification of subtle sex effects, covariates, such as age, and meta-brain lesion characteristics, such as lateralization65,66,67. As a key advantage of this fully probabilistic modeling approach, the inferential capacity can thus be enhanced by accommodating dependencies between parameters at each hierarchical level. Well suited for samples with >1000 patients65,68, our Bayesian hierarchical strategy enabled a sex-aware analysis quantifying the certainty of subtle differences between female and male stroke patients, covering major white matter pathways in both hemispheres. As the accurate quantification of model uncertainty is critical for optimized decision making in medicine69, understanding how lesion location, structural language lateralization, and their clinical outcome diverge in aphasia between female and male populations can finesse personalized rehabilitation and treatment.

Results

Patient cohort: characteristics and behavioral outcome measures

Our inclusion criteria were met by 1401 patients with ischemic stroke who were recruited as a prospective multi-center cohort from 2007 to 2018 (Table 1 for details). The mean age was 67.58 (standard deviation 11.59), with 42% women. The mean age at admission was significantly higher in females than males (t = 8.9, p < 0.001, cf. Table 1). Patients had an average of 9.28 (5.92) years of education, with women amounting to fewer years on average compared to those of men (t = −17.27, p < 0.001). Based on the IQCODE score, females’ cognitive performance before the ischemic event was significantly worse compared to males (t = 3.91, p < 0.001). Stroke severity as reflected in the clinical impairment was significantly higher in females across nine cognitive tests (K-MMSE, BN, SF, PF, RCFT, RCFT delayed recall, digital symbol coding, TMT-A, and -B). Out of 15 objects to be recognized in the BN70, patients correctly named 9.92 (3.59) on average, with females scoring worse than males (t = −11.90, p < 0.001). Females also performed worse in semantic and phonemic fluency (t = −7.12 and −7.97, both p < 0.001). In sum, we observed more severe stroke-related deficits and worse premorbid cognitive performances in female patients of our cohort.

Table 1 Characteristics of the patient cohort.

Topography of ischemic stroke lesion patterns

Excellent whole-brain coverage was attested by the voxel-wise summary of lesion overlap across 1401 patients (Fig. 1a). The lesion overlap predominantly corresponded to the vascular territory of the left and right middle cerebral artery. The majority of lesions were located in the deep central white matter in both hemispheres. Most tissue lesions were concentrated in the thalamus and basal ganglia. The average lesion volume was 17.08 ml (14.81–19.36) and did not significantly differ between female and male patients within our cohort (women: 15.05 ml (11.47–18.63), men: 18.57 ml (15.63–21.51), p = 0.13; Table 1). We did not find significant differences between females and males regarding their stroke topography (Fig. 1b, c, summarized separately).

Fig. 1: Spatial topography of ischemic stroke lesions summarized from 1401 patients.
figure 1

We confirm excellent whole-brain coverage, without appreciable hemispheric differences in the voxel-level stroke topography for the whole cohort (a), or separately either for female patients (b) or for male patients (c). The overall lesion volume did not significantly differ between female and male patients within our cohort (two-sided t test, p = 0.13, cf. Table 1). Additionally, there was no significant difference in the left versus right hemispheric overall lesion distribution in females (two-sided t test, p = 0.23) or in males (two-sided t test, p = 0.63). The topographical distributions of tissue damage caused by ischemic stroke are shown in MNI reference space with z-coordinates indicated above each brain slice. Lesions predominantly overlapped in areas that correspond to the vascular territory of the left and right middle cerebral artery. The majority of lesions were localized in subcortical zones, which entails a disconnection of deep white matter tracts. Radiological view; R/L: right/left.

Anatomy of the extracted stroke lesion atoms

We first extracted hidden topographical archetypes from the high-dimensional lesion distribution in white matter at voxel resolution. Using non-negative matrix factorization (NNMF; ref. 71) for unsupervised pattern discovery, we automatically uncovered recurring spatial configurations of concurrent tissue damage that are consistently apparent across >1000 stroke cases—which we call lesion atoms (Fig. 2)56,72. Each lesion atom (component issued by NNMF application) reflects a prototypical latent pattern of stroke-inflicted white matter disconnection (for an analytical overview, see Supplementary Fig. 3). Disentanglement of this sum-of-parts representation by means of machine learning provides an important advantage for the purpose of the present study. The non-negativity constraint of the applied algorithm enabled neurobiological meaningfully and directly interpretable lesion patterns71,73,74. For each lesion atom, the underlying tract configuration is strictly positive (ranging from low to high contributions, cf. legend in Fig. 2a). The original MRI-lesion distribution is then reconstructed by the simple addition of these latent lesion atom patterns and hence is compatible with the intuitive notion of combining different parts to form a whole. In contrast, alternative dimensionality reduction tools such as principal component analysis recover a complex superposition of orthogonal components with arbitrary signs that lack intuitive meaning and direct biological interpretation74,75,76. In the present study, for example, such negative weights may contradict the physicality of stroke-induced tissue lesions as a negative amount of tract lesion damage inflicted by stroke cannot be meaningfully interpreted. Leveraging the above advantages of the NNMF approach, therefore, allows intuitive and neurobiologically meaningful interpretation of latent stroke patterns (e.g., the importance of latent patterns in the posterior cerebral artery territory in Bonkhoff et al.56,72). As a descriptive post hoc analysis, each lesion atom was assessed for hemispheric dominance of the implicated tissue lesion distribution by an index of lateralization (Fig. 2b). For most factors, there was a unique left-hemispheric lesion constellation in correspondence with a similar right-hemispheric homolog (Fig. 2c, factors 3 and 9, or 2 and 5). Overall, the spatial distribution of the derived lesion atoms emerged to be directly interpretable and biologically plausible with components underlying distinct territories of arterial supply (Fig. 2). Most of the identified lesion atoms related to infarcts in the left or right middle cerebral artery territories, and to a lesser extent to posterior cerebral artery strokes, and mixtures of both.

Fig. 2: Multivariate latent-factor discovery deconvolves unique stroke lesion atoms.
figure 2

In 1401 neurological patients, we used an unsupervised machine learning algorithm to perform a data-driven discovery of white matter lesion patterns. This sum-of-parts approach (non-negative matrix factorization, NNMF) enabled the identification of archetypical white matter lesion configurations, lesion atoms, extracted from high-resolution brain scans with a total use of lesion voxels across patients. There was no significant difference between the expression of each lesion atom in women and men (cf. Supplementary Table 1). a For each extracted lesion atom (y-axis), the underlying configuration of white matter tracts (x-axis) and their contribution from low (purple) to high (yellow) relevance are visualized from the NNMF factor matrix. Each lesion atom (represented as a row in the matrix) is visualized according to the same color scale in the 3D glass brain renderings (c). The most contributing tracts are labeled (for in-depth visualizations, see Supplementary Fig. 2). The anatomical distribution of the derived lesion atoms corresponded to directly interpretable and biologically plausible lesion patterns that are reminiscent of well-known functional systems. Each atom of coherent white matter lesion was assessed for hemispheric asymmetry (b) based on a lateralization index (LI)167. This index is computed from the weights of the latent factor loadings by LI = (L − R)/(L + R). Hemispheric asymmetry effects are indicated by left (Red) or right lateralization (Blue). Most factors showed a left-hemispheric lesion configuration in correspondence with a similar right-hemispheric homolog. Post hoc naming of the lesion atoms was based on the underlying tract contribution with the highest weights (yellow color scale). Four factors were predominantly associated with the language-related pathways (c): the dorsal perisylvian circuitry (Factor 3 and 9; R and L, respectively) and the ventral pathway (Factor 2 and 5; R and L, respectively). Notably, the core perisylvian pathway shows significant leftward asymmetry in factor 9 with the accentuated influence of the long and anterior AF. Instead, the right homolog in factor 3 predominantly relies on the anterior segment of the AF. Motor function (c) was most strongly expressed in factors 1 (L) and 7 (R), and 10 (bilateral cerebellar network). Primarily limbic factors 4 and 6 were dominated by the influence of the left and right cingulum. Factor 8 was uniquely left lateralized and captured the main effect of the fornix (c). As such, even subtle differences in lateralization align with the current neuroanatomical knowledge of the language pathways49. AF arcuate fasciculus, ILF inferior longitudinal fasciculus, UF uncinate fasciculus, IFOF inferior fronto-occipital fasciculus, CST cortico-spinal tract, IC internal capsule, CPC cortico-pontine-cerebellar tract, ICP inferior cerebellar peduncle, SCP superior cerebellar peduncle, OR optic radiation, R/L right/left.

Four lesion atoms of stroke-induced tissue damage were associated with anatomical tracts implicated in the dorsal and ventral language pathways (Fig. 2c). The canonical language tracts contributed the most to these identified lesion atoms, and hence, influenced our choice of post hoc naming. However, other tracts, e.g., motor projections within the affected arterial territory were part of the lesion atom to a lesser degree as well. Generally, the left-lateralized language atoms showed stronger lesion effects than their right-lateralized homologs. Factor 9 primarily covered white matter lesions of the core perisylvian pathway with a focus on the left hemisphere. These lesion effects predominantly rested on the AF. The level of relevance within the subsegments of the AF followed an anterior-to-posterior gradient. Here, the anterior and long segments showed higher lesion importance than the posterior segment. The core perisylvian right-hemispheric homolog was captured by factor 3. The organization of the AF’s different segments again showed an anterior-to-posterior transition. Both factors implicated the UF pathway to a lesser extent. Factor 5 resembled the left-lateralized ventral pathways, which included the ILF, the IFOF, and the UF. These lesion atom effects further implicated damage in the left optic radiation (OR), and to a lesser degree, the posterior segment of the AF. The right-hemispheric homolog to factor 5 was identified as factor 2. Right-hemispheric (Factor 2) ventral pathways were less pronounced regarding their overall lesion importance compared to the left.

Three lesion atoms resembled damage to the brain’s motor circuits (Fig. 2c bottom row). The pattern topology of lesion factor 1 included the left-lateralized motor projections of the cortico-spinal tract (CST), internal capsule (IC), and cortico-pontine-cerebellar tract (CPC). Similarly, factor 7 captured lesions in the mirrored motor projections of the right hemisphere. Factor 10 was the only factor with no hemispheric dominance and captured bilateral cerebellar connections, including the CPC, the inferior and superior cerebellar peduncle (ICP, SCP). Three further lesion atoms captured the prominent influence of the limbic system. The right cingular fibers dominated the tissue damage configuration of factor 4, next to a lesser influence of the posterior segment of the AF, IFOF, and IC. Conversely, the lesion constellation of factor 6 captured the left-hemispheric cingular influence. Lastly, factor 8 was dominantly shaped by the influence of the left fornix, less so by its right counterpart. Notably, only factor 8 did not have a clear homologous pattern on the right hemisphere. However, the right fornix appeared weakly in other lesion atoms, i.e., factors 2, 7, and 8.

Lesion atoms track interindividual differences in cognitive outcomes

To chart the clinical relevance of each lesion atom, we computed Pearson’s correlations between patient-specific expressions of a given lesion atom and a cognitive test, one for each available neuropsychological assessment (Fig. 3). The patient-specific lesion atom expressions illustrate how well the overall stroke lesion in a given patient can be recovered with the archetypal “template” captured by each lesion atom. Based on the ensuing (absolute) factor-outcome association strengths (Fig. 3a), we could rank the lesion atoms across all outcome measures of our cohort with detailed neuropsychological assessment. As a consequence of this bottom-up approach, factors 5 (Ventral pathway R), 8 (Fornix L), and 9 (Dorsal pathway L) ranked as the three most clinically relevant lesion atoms given our functional domains of cognitive outcomes. The limbic factors 4 and 6 reflected the right and left influence of the cingular fiber bundle and ranked fourth and fifth. Except for the TMT, factors 5, 8, and 9 showed pronounced negative associations across most cognitive scores (Fig. 3b). Regarding the language outcomes, damage in the left-hemispheric lesion atoms of the ventral and dorsal pathways (Factors 5 and 9, respectively) was most associated with a deterioration in Boston naming (BN), semantic fluency (SF), and phonemic fluency (PF). However, the association with their respective right-hemispheric homologs diverged: Notably, the right hemispheric homolog of the core perisylvian network (Factor 3) showed a slightly negative association with BN, but otherwise positive associations for SF and PF. In contrast, the right-hemispheric homolog of the ventral pathway (Factor 2) was negatively associated with BN and SF, with close links to deteriorated naming and speech fluency.

Fig. 3: Stroke lesion atoms are ranked from strongest to weakest associations with clinical outcomes.
figure 3

We assessed the overall clinical relevance of each lesion atom using Pearson’s correlation between subject-specific lesion atom expression and a cognitive test across outcome measures. a Based on the summed absolute contributions, factors 5 (Ventral pathway L), 8 (Fornix L), and 9 (Dorsal pathway L) ranked as the three most relevant lesion atoms. Following the top three lesion atoms, the limbic factors 4 and 6 reflect the right and left influence of the cingular fiber bundles. b All top five lesion atoms (y-axis, ranked by absolute contribution) showed pronounced negative associations with cognitive impairment across most neuropsychological assessments. Regarding language, damage in the left-hemispheric lesion atoms (Factors 5 and 9) of the ventral and dorsal pathway causes a significant degree of deterioration in the Boston naming test (BN), semantic and phonemic fluency (SF, PF). However, the association with their respective right-hemispheric homologs diverged: the right-hemispheric homolog of the dorsal core perisylvian network (Factor 3) showed a slight negative correlation in BN, but otherwise positive correlations in SF and PF. In contrast, the right-hemispheric homolog of the ventral pathway (Factor 2) is associated with a considerable deterioration in BN and SF. K-MMSE Korean-Mini Mental State Examination, BN Boston Naming, SF semantic fluency, PF phonemic fluency, SVL Seoul-Verbal Learning:, RCFT Rey Complex Figure Test, TMT A/B Korean-Trail Making Test Version A/B, R right, L left.

Sex divergences in lesion atom expressions and prediction of language outcome

Next, we turned to the direct quantification of degrees of sex differentiation in how the emerged lesion patterns predict language outcome. Based on the patient-specific constellations of lesion pattern expressions, we estimated a fully probabilistic, generative model to explain language outcome at ~3 months after stroke. In particular, our approach was able to obtain full Bayesian posteriors over the differentiation between females and males. In the present analysis, we focused on the quantitative analyses of the language outcomes of naming ability (BN) and speech fluency (SF and PF; other scores cf. Supplementary Figs. 415). Leveraging the Bayesian modeling framework, we provided an honest quantification of (a) the general effects of the derived lesion atoms on cognitive outcome after stroke and (b) the degree of sex divergence in how the lesion atoms achieve the prediction of post-stroke language outcome. All analyses were adjusted for variation that could be explained by age, age2, sex, education in years, pre-morbid cognitive performance, and total lesion volume. We also included sex as a covariate in the model to explicitly account for a priori differences between female and male stroke outcomes56. For example, worse language outcomes in females could be linked to an underlying cause independent of the lesion distribution, such as a decreased likelihood of receiving acute interventions. In this scenario, the effect would be expressed in the Bayesian posterior distribution of the sex covariate, but not in that of the female-stratified lesion atom.

Among our target language capacities, we first examined interindividual differences in naming ability as captured by the BN test. The posterior predictive checks confirmed an appropriately fitted approximation of the data variation that underlies how lesion atom expressions explain cognitive performance outcomes (R2 = 0.51, Fig. 4a). We then inspected the role of each lesion atom in forecasting future naming performance in BN at ~3 months after stroke, separately for women and men (Fig. 5a). Both sexes displayed worse naming outcomes (BN) with increasing damage to the left-lateralized ventral pathway (most prominent in factor 5), its right-hemispheric homolog (captured primarily by factor 2), and the right cingular fiber pathway (major role in factor 4). However, the inferred posterior parameter distributions also attested specific divergences between females and males regarding lesion-outcome associations: the right-hemispheric homolog of the core perisylvian network (Factor 3) and the left fornix (part of factor 8). Here, damage in the lesion pattern configurations apparent in men was predictive of worse language outcomes. In contrast, stronger impairment in females was further explained by damage in the left core perisylvian network (driving constellation of factor 9). Tissue damage in the motor projections of the right hemisphere (in factor 7) and the left cingular fiber pathway (most prominent tract of factor 6) explained more pronounced deficits in naming performance in women than men. We thus identified common and diverging effects across the subject expressions of lesion atoms to predict naming capacity between women and men.

Fig. 4: Performance of the inferred Bayesian analytical solution in predicting naming and speech fluency.
figure 4

We confirmed a well-fitted approximation of the underlying distribution as performed for model checking in previous research72,168. The posterior predictive checks are shown for the Bayesian models that were estimated to predict the inter-individual difference in (z-scored) Boston naming (a, BN), semantic, and phonemic speech fluency (b, SF, and PF). These model-based simulations of new data were then compared to the actually observed data to compute the overall explained variance (coefficient of determination, R2). This practical check of model-based outcome predictions is a well-recognized approximation to external validation given the actual data at hand164. The full model for the BN outcome explained a total variance of R2 = 0.51, and the Bayesian models for the PF and SF explained R2 = 0.37, respectively. Considering our white matter results, the predictive performance for naming and speech fluency matches previous research on gray matter lesions due to stroke72.

Fig. 5: Sex bias becomes apparent from contributions of specific lesion atoms to explaining language outcomes.
figure 5

For each lesion atom ranked by their clinical relevance (cf. Fig. 3), we show the predictive role in explaining language function separately for women and men. Figure 5 shows the mean of each marginal Bayesian posterior distribution for each lesion atom. Of note, all values reflect a certain degree of incurred dysfunction in language. However, higher mean values correspond to a lesion atom contributing to relatively more preserved language function (red) relative to the rest of our stroke cohort, while lower means explain a relatively more severe loss in function (blue) among all stroke patients in our cohort. Across sexes, effects can be similar or diverging. a For the Boston naming (BN), we observed driving loss of function common to both sexes in the left-lateralized ventral pathway (Factor 5), its right-hemispheric homolog (Factor 2), and the right cingulum (Factor 4). Diverging lesion effects between females and males arise in several atoms, including the right dorsal pathway, the left fornix, cingular tract, and the left and right motor projections (Factors 3, 8, 6, 7, and 1 respectively). b Similarly for semantic fluency (SF), the main effect common to both sexes leading to worse performance was in the left ventral pathway (Factor 5) as well. Less influential were common effects in the right ventral pathway, the left cingulum and fornix (Factors 2, 6, 8). Analogous diverging lesion-outcome predictions in the right dorsal perisylvian circuit (Factor 3) were less pronounced than in BN, leading to moderate impairment for both sexes. But lesions in the left dorsal pathway led to worse outcomes specifically for females. We further observed differences between women and men in the lesioned right cingulum (Factor 4). Here, males were predicted to have a more severe impairment. c For phonemic fluency (PF), adverse outcome is primarily explained by the contributions of the left dorsal pathway (Factor 9) and left Fornix (Factor 8) in both sexes. We observed marked differences between sexes for the right-hemispheric dorsal pathway, left and right cingulum, with males performing worse than females (Factor 3, 4, and 6). In contrast, women showed significantly worse outcomes in the lesioned left ventral pathway (Factor 5).

Out of all extracted lesion atoms, distinct factors carried explanatory weight jointly for both sexes or uniquely for the female and male patients in our stroke population. The explanatory relevance for BN was indicated by the inferred marginal posterior parameter distributions (Fig. 6a). Most relevant across both sexes, we identified a driving effect in the left-lateralized ventral pathway (Factor 5). Stroke-induced consequences were stronger in women (mean of the posterior distribution [PM] = −0.251, highest density interval [HDI] of the posterior distribution covering 80% certainty = −0.410 to −0.090) than in men (PM = −0.158, HDI −0.267 to −0.040). We found further sex-diverging lesion-outcome effects. In women, tissue damage in the left fornix (major role in factor 8) predicted a more preserved outcome (Factor 8; PM = 0.138, HDI 0.008 to 0.248). Of note, a positive PM still reflects an inflicted decrease in function. However, compared to the average of the behavioral performance distribution of the full patient cohort the behavioral injury is overall less pronounced than in other stroke patients. In males, we identified small but consistent effects: the right homolog of the ventral pathway contributed to the prediction of lost function (Factor 2; PM = −0.032, HDI −0.063 to −0.009). Further, damage to the bilateral cerebellar network (Factor 10; PM = −0.080, HDI −0.119 to −0.037) contributed to explaining naming impairments. Together, our BN findings witness a dominant lesion-outcome effect in the left ventral language pathway, consistently for both women and men. For women, the left fornix lesion was indicative of more moderate impairment of cognitive capacity ~3 months after stroke. In contrast, males only showed modest diverging lesion pattern effects located in the homologous right ventral pathway and bilateral cerebellar connections.

Fig. 6: Shared and diverging lesion effects on language performance outcomes in female and male stroke populations.
figure 6

Our model formed full posterior Bayesian estimates to infer the explanatory relevance of each lesion atom for the prediction of language outcomes in female (green) and male (blue) stroke patients. Key assessments of post-stroke language performance included the Boston Naming test (BN, a), semantic (SF, b), and phonemic fluency (PF, c). Lesion atoms were considered relevant if the highest density interval (HDI) of the posterior distribution covering 80% certainty (black line) did not overlap with zero (vertical dotted line). Deviance from zero indicates the robust directional effect (arrow): either a relative loss in function (−) or a relative preserved function (+), compared to all stroke patients in our sample. We found lesion pattern effects that were common to both sexes (left side) and that diverged between women and men (right side). All shared effects indicate a loss in function (a, b, and c: indicated by (−)) for both females and males, while the diverging lesion effects differ in direction between sexes. Here, all lesion atom effects for men indicate worse language function (indicated by − in ac). For females, instead, effects in BN and SF show a more preserved function (indicated by + in a, b), and a more severe loss in SF (−). Across both sexes, lesion topographies pointed to an overarching structural organization reflecting the dual-stream concept of language39. Here, the ventral pathway was critically involved in naming and semantic fluency deficits, and the dorsal pathway in phonemic fluency deficits, congruently for women and men. More importantly, we provide evidence for sex-nuanced lesion-outcome effects. Women primarily rely on left-dominant circuits, including structural connections of the fornix, and a female-specific vulnerability to ventral lesion damage in phonemic fluency. In contrast, men draw from bilateral ventral and cerebellar connections, and potentially integrate more domain-general cognitive and executive functions via right-lateralized cingular fiber bundle to enable efficient language processing after stroke.

Examining how SF outcome differences are underpinned by white matter tissue damage, the posterior predictive check indicated an explained variance of R2 = 0.37 (Fig. 4b). Considering the male versus female lesion-outcome associations, we identified strong effects for both sexes in the left ventral pathway (Factor 5; Fig. 5b), followed by the left cingular fibers (most prominent part in Factor 6) and the right homolog of the ventral pathway (included in factor 2). Damage captured by each of these three lesion atoms played contributing roles in lost SF function. Lesions affecting the right dorsal perisylvian circuit (most prominent in factor 3) explained a more moderate deficit for both sexes. Yet, in the SF function, too, we detected pronounced sex divergences. Men deteriorated more in SF tests as a result of tissue damage in the right cingular fibers (defining role in factor 4). In contrast, white matter damage in the left core perisylvian network (primarily in factor 9) added to difficulties with language fluency in women.

The lesion patterns predictive of SF differences resembled those explaining BN differences as evidenced by the corresponding marginal posterior distributions: our model attributed SF dysfunction in part to tissue damage in the left ventral pathway (most prominent in factor 5, Fig. 6b) in both sexes. This lesion-outcome association was more pronounced in women (PM = −0.360, HDI −0.561 to −0.186) than men (PM = −0.190, HDI −0.345 to −0.069). Patient expressions of the lesion atom in the right-hemispheric perisylvian network were informative for predicting decreased, yet in relation to the entire stroke patient cohort more preserved function in women (PM = 0.157, HDI 0.034 to 0.276) but not in men (Fig. 6b). Similar to the BN analysis, the right ventral pathway (Factor 2, PM = −0.040, HDI −0.069 to −0.010) helped predict loss in SF function in males (Fig. 6b). The left-hemispheric motor projections damage (Factor 1, PM = −0.045, HDI −0.088 to −0.012) helped predict SF dysfunction. However, we observed an even stronger lesion-outcome contribution by the right cingular fibers (defining in factor 4, PM = −0.268, HDI −0.400 to −0.121) in men. Concluding our SF findings, we found a common lesion-outcome effect in the left ventral pathway. In women, tissue damage compromising the right perisylvian pathway added to the predictability of SF performance outcomes in our patients. Damage caused to the right cingulum assisted in the prediction of future SF loss in men beyond the damage caused to the left ventral pathway.

As the last examined facet of language capacity, we quantified how patterns of tissue damage underpin interindividual differences in PF performance ~3 months after stroke. The full probabilistic model explained R2 = 0.37 of the variance in the outcome predictions based on the patient expressions of dedicated lesion atoms (Fig. 4b). Common to both sexes, our model attributed lesion damage in the left core perisylvian circuit (prominently captured in factor 9) to loss of PF function. We found marked divergences between sexes in factors 3–6 (Fig. 5). Specifically, language dysfunction in females was due, to a larger extent, to damage to the left ventral pathway (most profound in factor 5). In contrast, men showed more pronounced consequences for cognitive performance due to damage to the right and left cingulum (highest contributions in factor 5 and 6) and the right perisylvian pathway (captured mostly in factor 3).

Based on the inferred Bayesian posteriors (Fig. 6c), the left dorsal perisylvian pathway showed considerable relevance in explaining differences in PF function, which was comparable in both sexes (Factor 9, females: PM = −0.151, HDI −0.298 to −0.020; males: PM = −0.146, HDI −0.273 to −0.029). However, women’s functional loss in PF was more explained by damage in the left ventral pathway (Factor 5, PM = −0.235, HDI −0.437 to −0.043) compared to men. Additionally, lesions in the bilateral cerebellar network (Factor 10; PM = −0.058, HDI −0.114 to −0.008) tracked performance decline in PF. Further, men showed a strong negative effect in the right cingular pathway (most prominent in factor 4, PM = −0.217, HDI −0.359 to −0.077). Together, our lesion-outcome effects in PF highlighted the influence of the left-dominant perisylvian network common to both sexes. In women, the left ventral pathway added predictive value as to a loss of function in SF. In men, however, the influence of the right cingulum was further indicative of deterioration in PF.

Our sex-sensitive approach enabled us to tease apart shared and diverging lesion-outcome effects in female and male stroke populations. Across both sexes, the ventral pathway was critically involved in naming and semantic fluency deficits, while the dorsal pathway was implicated in phonemic fluency deficits. More generally, we provide evidence for sex-nuanced lesion-outcome effects. Concluding our findings across all language performance scores, women tend to rely on left-dominant circuits, including structural connections of the fornix. We further detailed a female-specific vulnerability to ventral lesion damage in phonemic fluency. In contrast, this facet of language performance in men tended to draw from bilateral ventral and cerebellar connections. Damage caused to the right cingular fiber connections assisted in the prediction of future SF loss in men beyond the damage caused to language-specific pathways.

Discussion

Language capacity is uniquely evolved in humans and a central aspect of daily life in our societies. There is now an increasing realization that brain tissue damage caused by stroke entails partly distinct consequences on language performance in women and men. We hence designed a fully probabilistic framework as a clean analytical solution to modeling the apparent degrees of sex disparity. First, we extracted neurobiologically meaningful lesion patterns from 1401 stroke patients in a data-led fashion. Second, we custom-tailored a bespoke Bayesian strategy to partition interindividual variation in caused aphasia outcomes into their lesion pattern contributions. In so doing, we offer honest uncertainty quantification of sex-biased effects in clinical language outcomes after stroke in one of the largest stroke cohorts with rarely available deep phenotyping.

In particular, (1) we identified lesion-outcome effects in men and women in the left-dominant language network39,41, highlighting ventral fiber tracts of the left ILF, the IFOF, and the UF as core lesion focus across loss of different language facets. (2) Across both sexes, our analyses of brain lesion topographies pointed to an overarching structural organization reflecting the dual-stream concept of language and revealed pronounced sex divergencies in the role of lesion patterns. (3) We provide evidence for sex-nuanced lesion-outcome effects in the domain-general supra-modal networks, which our model attributed to cortico-subcortical pathways, with strong implications of the fornix and cingular fiber bundles. In the following, we discuss our findings for the anatomical topology of the identified lesion atoms, the shared lesion-outcome effects across sexes, and diverging predictive effects between men and women in implicated domain-general supra-modal networks.

Our analytical approach offered several advantages over classical lesion-symptom mapping studies. In conventional lesion-symptom mapping, each brain voxel is usually considered in isolation62. However, analyzing the contribution of each single brain voxel individually deters intuitive interpretation by omitting interaction effects between spatially distant locations. Multivariate extensions have started to address the problem of mutual dependence between voxels63,77,78. However, these approaches remain ill-suited to detecting overarching sources of variation that are driving disparate brain effects in continuous degrees, such as brain lateralization or sex-specific lesion-behavior associations. By means of our purpose-designed generative model, we could simultaneously appreciate local tract-lesion effects (and inter-dependencies between them) and their spatially distributed configurations. Leveraging the hierarchy of our fully probabilistic approach, we can directly integrate their hemispheric and sex-specific idiosyncrasies into one unified model for clinical outcome prediction.

The estimation of our framework was supported by empirical lesion data from a large sample of 1,401 stroke patients with rarely available whole-brain coverage. In line with the existing literature, the lesions were predominantly localized in subcortical zones and led to a critical disconnection of deep white matter tracts22. Rather than providing clear-cut one-to-one mappings in all cases, the identified tract damage underlies regions vascularized by the middle and posterior cerebral artery and was previously linked to worse language deficits21,22. Our data source thus presented two thrusts forward in stroke neuroimaging modeling. First, the spatial topography of the discovered lesion constellations covered both hemispheres. In many previous stroke studies on aphasia or spatial neglect, researchers have limited the anatomical space of investigation to either the left or right hemisphere alone60,61,62,63,79,80. This unique data scenario enabled us to detect delicate predictive effects, including subtle sex differences, with regard to both brain hemispheres. Second, our findings provide comparable lesion coverage for both sexes. In contrast to the large majority of stroke lesion studies20,60,61, we explicitly model sex as a higher model effect of interest. Potential differences independent of the sex-specific brain damage-behavior relation are further accommodated by our covariates at the lower level of our Bayesian model. This strategy ensures unique lesion-symptom effects for women and men and potentially helps to explain previous seemingly contradictory findings81.

Our pattern discovery framework enabled us to identify topologically coherent and biologically meaningful spatial patterns of lesion distributions from stroke. The identified lesion patterns were defined by their arterial vasculature, e.g., Factors 3 and 9 representing mostly MCA coverage. The identified lesion atoms coincidently mirrored well-known functional brain circuits. Moreover, the respective lesion patterns generally fit well with the associated functions engaged in the applied tasks, as evidenced by neuropsychological assessments of language, motor, memory, and attention, as well as cognitive and executive control38,82,83. The fine-grained nature of the extracted lesion atoms also enabled us to bring into sharp focus fiber pathways of essential relevance for the human language capacity. The implicated lesioned anatomical tracts recapitulated the dorsal and ventral route of the dual-stream concept of language processing38,39, cerebral and cerebellar motor representations84, and visuo-spatial attention and executive control83,85. Overall, our analyses on language performance show a stronger impact of lesions in the left hemisphere, conforming to the widely held asymmetry of the human language network46.

The granularity of our lesion atoms further enabled us to dissect subcomponents of stroke effects in language-related pathways, especially the different perisylvian subsegments of the AF. Since the three-segment view of AF anatomy was proposed34, various studies ventured to examine the role that subcomponents of the AF play in realizing language function. Such studies initially suggested that the functions of the distinct fiber bundle segments differ: the direct pathway was associated with phonological processes, whereas the indirect anterior segment tended to serve semantically-based functions, and the posterior segment could serve especially language comprehension34. However, evidence from these studies have previously been hard to reconcile.

Some studies on aphasia after stroke linked the anterior segment of the left AF to impaired speech fluency, among other fiber tracts86,87. One study advocated significant language associations selectively for the posterior segment of AF88. Yet, other investigators reported no significant links between the damage to the subsegments of the AF and subsequent language function89. Based on our investigation of a large clinical cohort, we provide newly detailed evidence that dissects the major white matter pathways necessary for language production and understanding. In particular, our findings highlight the anterior and long segments as key subcomponents of the perisylvian route. Within the lesion atom of the dorsal pathway, both the anterior and long segments of the AF are further confirmed in the prediction of speech fluency in men and women. Indeed, our lesion atoms reflecting the left and right ventral language route further include the posterior segment of the AF. This finding suggests a shared essential piece of the dorsal and ventral language pathway and a contribution of the latter pathway to naming and semantic fluency deficits. Our observation ties into a previous lesion study which reported overlap between both streams in the left temporo-parietal region90. Going beyond a merely topographic interpretation, our study extends the previous findings by pointing to potentially shared underlying anatomical connections. Such an overlap seems plausible given that the associated language tasks, naming, and semantic fluency, likely invoke both fiber processing streams.

In both men and women, the strongest observed brain-behavior predictions were preferentially lateralized to the left hemisphere. We identified the fiber tracts of the left ILF, the IFOF, and the UF to be critically involved in naming and semantic fluency deficits in both sexes. The left AF was responsible for performance decline in phonemic fluency in women and men to a similar degree, which nicely mirrors earlier findings91,92. The involved anatomical tracts are in accord with incumbent theories around the dorsal and ventral language pathways38,39,41. As a limitation to these considerations, our identified lesion atoms also track, to a smaller extent, certain canonical tracts that were not typically interpretated to be related to language processing, such as motor projections. In other words, a modest lesion impact entailing motoric deficits cannot be completely ruled out, despite our umbrella-term summary as ‘dorsal/ventral language pathways’ to facilitate communication of our findings. Our lesion-outcome effects in semantic fluency and naming were shared by women and men. This observation highlights the importance of the ventral language route and some of its major white matter pathways, including the ILF, IFOF, and UF. These findings directly converge with previous analyses that collapsed across both sexes into crude average effects. In particular, our finding linking semantic fluency with the ventral stream resonates well with vast evidence from large-scale neuroimaging meta-analyses25,93, neurostimulation results in healthy participants94,95, lesion studies90,96, computational modeling studies97, and targeted intraoperative electro-stimulation of fiber tracts98,99.

Our analyses detailed brain-behavior associations of phonemic fluency to locate to the dorsal perisylvian route, congruently for both sexes. With the strong impetus of the AF fiber tract, our results are consistent with the association of the dorsal language stream with phonological processing in healthy volunteers34,41,100 and lesions in stroke patients90. Our predictive effects link the ventral and dorsal pathways to semantic and phonemic fluency, respectively. Both identified associations are aligned with findings from a variety of methodologies, including direct intraoperative electro-stimulation98,99, diffusion MRI tractography100,101, and lesion-symptom investigations in aphasia61,102,103,104,105,106. We confirm the structural relations in the dual-stream model in regard to the underlying language tasks. Our lesion findings shared by women and men are hence consistent with previous studies that were blind to sex-specific effects.

More generally, several previous efforts compounded evidence that male and female brains are distinct in the asymmetry of functional brain organization. Male brains appear to show more far-reaching segregation between the left and right hemispheric function. Instead, female brains are thought to have stronger interconnections between the left and right hemispheres50,107. According to an early hypothesis, the sharper distinction between left and right brain function, allegedly characteristic for men, was thought to promote spatial but impede verbal performance50,108. On a cognitive level, this sex divergence in brain lateralization was thought to explain why women outperformed the opposite sex in language tasks such as verbal fluency tests on average; as well as why men were supposed to typically exhibit greater spatial skills, such as in three-dimensional mental rotation tasks109,110. However, the respective effect sizes have been small and several meta-analyses called into question the organizational differences related to lateralization and language processing111,112,113.

In our sex-sensitive analyses, we found women to be more impaired by lesion damage in the left ventral pathway across different language tasks, including phonemic and semantic fluency. Moreover, we detailed a female-specific selective vulnerability to ventral lesion damage in phonemic fluency. Importantly, differences in lesion size or overall lesion topography were accounted for in our approach. As a limitation to our present study, handedness as a surrogate for the dominant hemisphere was not recorded. A relatively stronger role of the left ventral pathway in women is in line with previous work in healthy volunteers, demonstrating higher radial diffusivity in the left UF in women relative to men114.

In contrast, damage in the left-dominant ventral pathway and its right-hemispheric homolog was particularly informative about lost naming and semantic fluency capacities in men. Previous work suggested that the ventral language pathway is largely bilaterally organized, comprising parallel processing streams38,39. We expand the current literature body by adding evidence for a predominantly left-lateralized ventral dominance in women. In support of our findings, early analyses stratified by sex suggested that men have a symmetrical increase in white matter volume and concentration in the bilateral anterior temporal lobe115. Our findings receive further support in that healthy male volunteers showed increased gray matter volume and cortical thickness in several bilateral regions relative to females, although no relationship with speech tests was reported116,117. Notably, previous studies are inconclusive with respect to sex incongruencies in functional and structural substrates of language-related processes53,118. Our sex-aware analyses bear the potential to reconcile several important inconsistencies on structural lateralization effects in general and in the ventral language pathway in particular.

Indeed, our analyses brought into the open several sex-specific findings outside the canonical language circuits that shed new light on stroke-induced language deficits. Our sex-aware analysis framework offered evidence that men draw from bilateral ventral and cerebellar connections and potentially integrate more domain-general cognitive and executive functions via right-lateralized cingular fiber connections to enable efficient language processing after stroke. There is increasing evidence that many higher cognitive functions, including language processing, are supported by large-scale distributed networks82,119. Such parallel organization of networks may potentially aid recovery after aphasic stroke by the recruitment of other cognitive functions that compensate for the damaged language capacities120. In particular, additional domain-general networks may be upregulated to boost the performance of the damaged processes if the primary language system is compromised121,122,123. Damage to cingular fiber tracts may potentially undermine the increased recruitment of domain-general networks, including the “multiple-demand network” with special contributions to cognitive control83. The selective vulnerability we show for men in cingular pathways and bilateral temporal circuits may relate to previously identified sex-specific idiosyncrasies in white matter microstructure in healthy subjects117,124. The combination of core language network disruption and lesioned cingular pathways may hence reflect the complex balance of domain-specific and domain-general interplay for entertaining language functions.

In contrast, women accentuate left-dominant circuits, including structural connections of the fornix. A hippocampal–neocortical pathway through the fornix was previously suggested to be critical to the supra-modal brain network architecture of the default mode network (DMN, ref. 125). Large-scale association networks, such as the DMN, fronto-parietal control, and salience networks, are generally thought to occupy regions juxtaposed and interdigitated with the language network119,126,127. Altered supra-modal interactions with core language features via fornical fibers may hence reflect a dynamic process underlying neural remodeling in chronic aphasic stroke. Indeed, our findings emphasize a left hemispheric dominance of the fornical fibers. While the involvement of the left fornix in naming is a observation, it has already been implicated in other language facets, including semantic fluency reflecting the retrieval of episodic context128,129. Importantly, our findings capture the subacute phase of stroke; therefore, the dynamic of reorganization may still change. Possibly, the altered language-DMN unit may affect language outcomes. Previous work has described disrupted DMN function in chronic stroke patients collapsed across sexes for impairment in motor function130,131,132 as well as language function133,134,135,136. Both the language-specific as well as domain-general disruption were suggested to influence post-stroke language outcome. However, the exact associations remain to be delineated.

Our findings speak in favor of revising the view towards sex-differentiated disruption of DMN integration. General sex divergences in DMN function in healthy individuals were previously demonstrated, indicating that women have higher connectivity within the DMN117. The authors suggested that the higher intra-DMN connectivity may help explain the higher average female ability in different cognitive domains. The selective vulnerability to the fornix in women is further supported by a previous diffusion tensor investigation on healthy volunteers showing higher fractional anisotropy in women specifically in the hippocampus output pathway via the fornix124. Moreover, in chronic stroke patients, the hippocampal-neocortical connectivity to the inferior parietal lobule (IPL) was consistently found to be decreased compared to healthy controls137.

In fact, the IPL represents one potential hub for multi-sensory integration of domain-specific and supra-modal processes125. Previous work has linked the IPL with semantic aspects of language processing, social cognition, and critical switch in stimulus-driven control of attention and diversion of self-reflective thinking to processing salient external stimuli125,138,139,140. It receives axonal inputs from the posterior segment of the AF, as a short vertical tract connecting Wernicke’s with Geschwind’s areas34. Our results further emphasize the relevance of the IPL and underlying posterior AF as seen in the dominance of the ventral lesion atom in women across all applied language tasks. Our findings converge with evidence suggesting negative modulatory connections from the parietal lobule to the superior temporal gyrus in females, enabling highly efficient semantic representations53. Similarly, higher regional efficiency was reported in females in the left IPL and superior temporal regions in network connectivity analysis in neurologically intact participants141.

Building on studies on healthy individuals, we confirm the relevance of the left IPL, fornix, and DMN in ischemic stroke patients. Our lesion-outcome findings in women may potentially reconcile previous observations and offer an explanation to sex divergences in chronic aphasia outcomes. More importantly, the personalized prediction of long-term post-stroke recovery using sex-aware stroke lesion topology may bring value to future individualized treatment interventions. Intensive speech therapy was previously shown to promote more efficient DMN integration leading to consequent language improvement142,143,144,145. Targeted rehabilitation in men and women may hence advance therapy-induced neuroplasticity by enabling more efficient multimodal network integration.

Methods

Prospective stroke population resource

We analyzed a multi-center stroke registry with 1401 patients. They were retrospectively selected from the Bundang and Hallym Vascular Cognitive Impairment cohorts, which are prospectively recruited cohorts consisting of patients originally admitted to the Seoul National University Bundang Hospital or Hallym University Sacred Heart Hospital in South Korea between 2007 and 2018146. Patients were hospitalized and diagnosed with acute ischemic stroke. All patients were eligible for the present study based on the following criteria: (1) availability of brain MRI showing acute tissue infarction in the diffusion-weighted imaging (DWI) and/or fluid-attenuated inversion recovery (FLAIR), (2) successful lesion segmentation and registration, (3) no primary intracerebral hemorrhage, and (4) availability of follow-up data on key demographics and neuropsychological assessment (the 60-min Korean-Vascular Cognitive Impairment Harmonization Standards-Neuropsychology Protocol147,148). The local institutional review boards of each hospital approved the study protocol and gave waived consent requirements based on the retrospective nature of this study and minimal risks to participants.

Characteristics of stroke patient cohort

The neuropsychological performance of the patients was profiled by a rich battery of tests (cf. distributions in Supplementary Figure 1) ~3 months (Hallym: 91.4 days (SD 38.9), Bundang: 105.7 days (SD 94.1)) after the acute onset of stroke147. For the purpose of our present study, we primarily focused on two key facets of post-stroke language performance: (i) naming ability (Korean short version of the Boston Naming Test (BN)70), and (ii) speech fluency (phonemic and semantic fluency (PF, and SF, respectively; ref. 147). The BN task prompts patients to name 15 pictured nouns (short version). In the semantic fluency tasks, patients are instructed to generate as many words as possible belonging to a specific category, such as animals, within a one-minute time window. In the phonemic fluency task, in turn, patients are asked to generate words starting with a given phoneme (e.g., giut, iung, or siut, which correspond to the international phonemic alphabet phonemes of g, ŋ, and s, respectively). These fluency tests have been previously validated in the Korean language149. A diverse collection of general cognitive tests included the Korean Mini-Mental State Examination (MMSE), the Seoul Verbal Learning test (SVLT), the Rey Complex Figures test (RCFT), Digital symbol coding, and the Korean-Trail Making Tests Elderly’s version A and B (TMT). The Informant Questionnaire on Cognitive Decline in the Elderly (IQCODE) was used to capture the cognitive performance before the ischemic event150. This established retrospective assessment variant relied on health-proxy reports to evaluate the cognitive change within the period 10 years prior to the ischemic stroke. All tests and questionnaires were conducted by a trained clinical psychometrician blinded to any clinical or neuroradiological patient information151. Available sociodemographic and clinical information included age, sex, years of education, and lesion volume (Table 1). All continuous variables were adjusted to a comparable scale by mean centering to zero and variance scaling to one.

Neuroimaging data preprocessing

Whole-brain MRI scans were typically acquired within the first week after the stroke event. The scanning protocol included structural axial T1, T2-weighted spin-echo, FLAIR, and DWI sequences (3.0 T, Achieva scanner, Philips Healthcare, Netherlands; cf. Neuroimaging protocols below). Tissue damage was manually segmented on DWI or less frequently FLAIR images by experienced investigators (A.K.K. and G.A.) relying on in-house developed software based on MeVisLab (MeVis Medical Solutions AG, Bremen, Germany; ref. 152). Lesion segmentations were checked and potentially refined by two additional experienced raters. Brain scans and their corresponding lesion segmentations were linearly and nonlinearly normalized to Montreal Neurological Institute (MNI-152) reference space employing the publicly available RegLSM image processing pipeline153. Quality of normalization was rigorously controlled by an experienced rater and if necessary, manually corrected.

Neuroimaging protocols

(1) Seoul National University Bundang Hospital: The MRI protocols comprised diffusion-weighted imaging (DWI), axial T1- and T2-weighted spin echo, fluid-attenuated inversion recovery imaging (FLAIR), gradient-echo imaging, and coronal T1-weighted spin echo imaging. FLAIR imaging was acquired using a fast-spin echo sequence with the acquisition parameters: repetition time: 11,000 ms; echo time: 125 ms; inversion time: 2800 ms; slice thickness 5 mm; intersection gap 1 mm; matrix: 512 × 512; flip angle 90 degrees. DWI imaging was obtained employing an EPI-spin echo sequence with the acquisition parameters: repetition time: 5000 ms; echo time: 50 ms; diffusion b-value: 1000; slice thickness: 5 mm; intersection gap: 1 mm; matrix: 256 × 256; flip angle 90 degrees. (2) Hallym University Sacred Heart Hospital: The MRI protocols comprised DWI, axial T1- and T2-weighted spin echo, FLAIR, gradient-echo imaging, and coronal T2-weighted spin echo imaging. FLAIR imaging acquisition parameters: repetition time: 11,000 ms; echo time: 125 ms; inversion time: 2800 ms; slice thickness: 5 mm; matrix: 512 × 512; flip angle 90 degrees. DWI image acquisition parameters: repetition time: 3000 ms; echo time: 56 ms; diffusion b-value: 1000; slice thickness: 5 mm; matrix: 256 × 256; flip angle 90 degrees.

Data-led deconvolution of hidden lesion atoms

We sought to explore the coherent patterns that may combine to make up the topographical distribution of stroke lesions in white matter. First, we parsed each patient’s lesion fingerprint by summarizing the white matter damage annotations at the voxel level according to fiber tract definitions from a reference tractography atlas35,49,154. This widely-used atlas provided spatial definitions of 28 major fiber tracts in MNI standard space22,86,155,156,157. In doing so, the tract-by-tract summaries of lesion loads separately tracked the left and right hemispheres, which enabled the modeling of subtle differences in hemispheric lateralization in downstream analysis steps49,158.

Of particular relevance to the present investigation due to their roles in human-defining language capacity, we captured the neuroanatomy of the core perisylvian circuitry by the canonical arcuate fasciculus (AF), subdivided into (i) long, (ii) anterior, and (iii) posterior segments of the tract30,31,34,35. The ventral pathway consisted of the uncinate fasciculus (UF), the inferior longitudinal fasciculus (IFL), and the inferior fronto-occipital fasciculus (IFOF)25,41,44,159. Limbic pathways included the fornix and cingular white matter bundles35. Motor projections were taken into account by the internal capsule (IC), the corticospinal tract (CST), and cerebellar connections, including the cortico-pontine-cerebellar tract (CPC) and the inferior and superior cerebellar peduncle (ICP, SCP, respectively). The topography of atlas fiber tracts directly informed the labeling of the derived lesion atoms. For consistency, the resulting lesion atoms of the language system were labeled according to widely acknowledged definitions of the dual-stream model of language39. We log2-transformed and concatenated the measures of each of the 28 tract measures of lesion loads for all 1401 patients. Once each tract lesion load91,92 was recorded in each patient, we carried out non-negative matrix factorization (NNMF) as a multivariate pattern discovery strategy71. This unsupervised machine-learning algorithm can identify the topological form and patient-specific combination of lesion pattern expressions that make up the white matter disconnection inflicted by stroke. In accordance with our recent stroke studies56,72, the derived sum-of-parts representation was henceforth referred to as lesion atoms.

More formally, NNMF achieves a low-rank approximation of the data V, with V reflecting the lesion load summaries of 28 tracts, with dimensions m × n (m = number of tracts in the atlas, n = number of patients), by partitioning the interindividual variation in white matter strokes into a matrix W of k part-based factor representations. The latent pattern expression matrix H indicated how relevant each emerging lesion atom is to describe the constituent parts of an individual patient’s overall brain lesion distribution. The patient-specific expression of a given lesion atom shows how well each of the discovered lesion archetypes fits the stroke lesion found in the individual patient. W and H thus carried m × k and k × n dimensions, respectively. Given by V = WH, the latent factorization hence deconvolved the actual lesion constellation in a particular patient into a parts-based representation. Each individual patient’s brain-imaging-derived spatial lesion distribution can hence express the archetypal lesion atom configurations on a continuous scale.

In contrast to alternative dimensionality reduction tools, NNMF provided at least two important advantages for the goal of the present study. First, the non-negative value encoding of the differential white matter involvement and the inherent non-negativity constraint of the NNMF algorithm allows for intuitive neurobiologically meaningful interpretation. For each patient j, the latent pattern expression Hj offered a directly interpretable model of the lesion variation across patients in our cohort. In contrast, alternative matrix factorization algorithms, such as principal component analysis, typically recover patterns by incurring various cancellations between positive and negative pattern loadings. Such alternative treatment of the derived hidden parts-of-whole representations would have hurt intuitive neurobiological meaning. By means of strictly additive combinations of pattern contributions (allowing for no subtractions), NNMF allows for biologically more meaningful and interpretable representations of lesion constellations. Second, classical clustering approaches for dimensionality reduction would consider the effect of each differential tract involvement once only. In contrast, by embracing NNMF, damage to each white matter tract in the reference atlas has the opportunity to belong to multiple latent factors of W in continuous degrees. In this way, each tract lesion load could play a distinct role in different lesion atoms, each of which reflected extracted lesion-specific topological motifs distributed across the whole white matter.

Predicting language outcome at 3 months post-stroke from patient-specific lesion fingerprints

The thus derived lesion atoms served as a low-dimensional summary of the individual lesion fingerprints. Expression degrees of these lesion patterns, specific to each particular stroke patient, provided the input to our Bayesian model160 to explain the interindividual differences in language performance outcomes at ~3 months after the stroke event. A Bayesian modeling approach can be a richly informative procedure while automatically accounting for modeling complexity65,161. The resulting probabilistic statements quantify brain-lesion-behavior links and offer principled estimates of uncertainty around all model parameters at play67,162. Therefore, modeling dependencies between multiple sources of variability including subtle sex effects, covariates, such as age, and brain-lesion idiosyncrasies, such as lateralization, can enhance the inferential granularity of conclusions at each hierarchical level. In adopting a Bayesian generative modeling tactic, we aimed to obtain fully probabilistic parameter estimates that could inform us about effect strength and effect certainty of how each particular lesion pattern is responsible for the clinical outcome. To place a special focus on the putative effects of sex divergences on post-stroke language performance, we introduced dedicated model parameters, which allowed to estimate each pattern regression slope in a sex-specific manner. That is, we could estimate the overlapping and distinct roles of each lesion atom in tracking cognitive facets of language in women and men separately within an integrated modeling solution. The model structure was as follows:

$$\begin{array}{c}\alpha \sim Normal(\mu =0,\,\sigma =1)\\ {\beta }_{1-10}\sim Normal{(\mu =0,\sigma =1)}_{[{{{{{\rm{sex}}}}}}]}\\ {\beta }_{lesionvolume}\sim Normal(\mu =0,\,\sigma =1)\\ {\beta }_{age}\sim Normal(\mu =0,\,\sigma =10)\\ {\beta }_{age\ast age}\sim Normal(\mu =0,\,\sigma =10)\\ {\beta }_{educationyears}\sim Normal(\mu =0,\,\sigma =5)\\ {\beta }_{IQCODE}\sim Normal(\mu =0,\,\sigma =1)\\ {\beta }_{male}\sim Normal(\mu =0,\,\sigma =1)\\ {\beta }_{female}\sim Normal(\mu =0,\,\sigma =1)\\ cognitive\_outcome=\alpha +{\beta }_{1[{{{{{\rm{sex}}}}}}]}\ast Lesion\,ato{m}_{1}+\ldots +{\beta }_{10[{{{{{\rm{sex}}}}}}]}\ast Lesion\,ato{m}_{10}\\ +{\beta }_{lesionvolume}\ast Lesion\,volume+{\beta }_{age}\ast Age+{\beta }_{age\ast age}\ast Ag{e}^{2}\\ +{\beta }_{{{{{{\rm{education}}}}}}{{{{{\rm{years}}}}}}}\ast {{{{{\rm{Education}}}}}}\,{{{{{\rm{years}}}}}} +{\beta }_{{{{{{\rm{IQCODE}}}}}}}\\ \ast {{{{{\rm{IQCODE}}}}}}+{\beta }_{male}\ast Male+{\beta }_{female}\ast Female\end{array}$$
(1)

A separate probabilistic Bayesian model was estimated for each dedicated post-stroke performance score in each of the target language domains: BN, SF, and PF (cf. above; for other cognitive assessments see Supplementary Figs. 415). All analyses were appropriately adjusted for variation that could be explained by the covariates age, age2, sex, education in years, pre-morbid cognitive performance, and total lesion volume. We adjusted for age-specific effects by considering the bare age and its squared value and, therefore, captured possible u-shaped effects. Our parametric model specification also included sex as a covariate to take into account outcome differences that were independent and dependent of the lesion configurations captured in the NNMF-derived lesion atom representations (see refs. 56,72). Samples from the posterior distribution of the model parameters were drawn by the No U-Turn Sampler (NUTS, ref. 163), a kind of Monte Carlo Markov Chain algorithm (in our setting defaulted to draws = 5000).

To evaluate the practical usefulness of the thus inferred posterior parameter distributions, we computed posterior predictive checks for every analysis of a given language performance outcome. These model-based simulations of new data were then compared to the observed data to determine the predictive performance measured as coefficient of determination (R2). That is, we assessed the model-simulated outcome predictions generated by our Bayesian model to approximate an external validation based on our patient cohort. This empirical procedure is a well-recognized option for judging the adequacy of Bayesian models given the actual data at hand65,164.

Statistics and reproducibility

Data analyses were conducted in a Python 3.8.5 (IPython 7.21.0) environment and primarily relied on the packages nilearn, scikit-learn, and pymc3165,166. The probabilistic tractography atlas is available at: https://www.natbrainlab.co.uk/atlas-maps. The use of dynamic reporting guarantees full reproducibility of the results.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.