Introduction

22q11.2 deletion syndrome (22q11.2DS) is caused by a microdeletion of 1.5 to 3 Mb on chromosome 22 [1]. It is the most common pathogenic copy number variant (CNV), with an estimated prevalence of 1 in 2000–6000 live births [2]. 22q11.2DS interferes with neurodevelopment across multiple systems, resulting in cardiac malformations, facial dysmorphology, immune deficiency and seizures/epilepsy, in addition to significantly enhanced risk of neurodevelopmental and psychiatric disorders, including learning disability, ASD, ADHD, developmental coordination disorder, and schizophrenia [3,4,5]. The occurrence of this syndrome has highly variable expressivity, but may offer a unique opportunity to characterize prominent features also present in individuals with complex, idiopathic conditions, such as schizophrenia [6].

Genetic, postmortem, and neuroimaging studies in schizophrenia patients have indicated that abnormal functional integration and disruptions in white matter structural connectivity are key features of neuropathology in this disease state [7,8,9]. However, schizophrenia shares symptoms and traits with many other neurodevelopmental and psychiatric disorders; thereby challenging a disease-specific phenotype that can be measured using in vivo technologies. 22q11.2DS has long been suggested as an important genetic and developmental model for schizophrenia as it affects up to 20–40% of individuals with 22q11.2DS [3]. By taking a genotype-first approach, new insights on how 22q11.2DS relates to prominent brain phenotypes are now possible even within a small number of individuals [10].

Neuronal connectivity has been identified as a key factor in the neurobiology of 22q11.2DS [11]. Using experimental animal models, disruptions in cellular energy production have been linked to neuronal deficits, such as decreased proliferation and altered morphology of axons, dendrites, and synapses [12, 13]. In turn, this leads to lower connectivity of long-range white matter tracts and impaired synchrony of neural activity [14]. The development of white matter for optimal brain communication requires the complex coordination of neuronal and glial cell types to fine-tune the timing and synchrony of action potentials throughout the brain [15]. This includes subtle variations in axonal morphometry and packing [16], myelination [17], synaptic densities and neurochemical mediators. Thus, a better understanding of atypical connectivity in white matter requires in depth assessment of factors that might contribute to trends observed in human studies.

Diffusion MRI (dMRI) is the only imaging modality that can explore both structural connectivity and microstructure of white matter tracts in vivo. DMRI reflects diffusion of water molecules on the micrometer length scale [18, 19]; thus, water acts as an in vivo probe, sensing obstacles constructed by cellular membranes and local fiber architecture, well below the spatial resolution of MRI [19, 20]. Most commonly, diffusion tensor imaging (DT-MRI) is used to parameterize the integrity or morphology white matter [21]. DT-MRI can be acquired in a relatively short scan time and has been broadly investigated across 22q11.2DS [22], typical development [23], and schizophrenia [9, 24]. The most commonly used metrics, fractional anisotropy (FA) and mean diffusivity (\(\bar D\)), have been associated with regional variations in microstructure, previously linked to myelination, axon density, and coherence. In particular, a recent, large-scale multi-site DT-MRI consortium study reported FA was higher in people with 22q11.2DS relative to age- and sex-matched controls for most WM regions, while measures of diffusivity were consistently lower [22]. This study provides a notable benchmark for research into 22q11.2DS; however, these differences were in contrast to previous observations in schizophrenia. Patients with schizophrenia have severe and widespread deficits in FA [9]. This creates a conundrum that both positive and negative alterations in FA could be associated with such direct overlap in symptoms and traits.

Conventional DT-MRI metrics are sensitive to a vast array of biophysical processes in tissue microstructure, but specificity is distinctly lacking [25]. This limitation can now be overcome by exploiting unusually high diffusion-weightings that have only recently become available through the development of customized ultra-strong MRI gradients for human neuroimaging [26, 27]. At such strong diffusion weightings (i.e., high b-values), the dMRI signal is dominated by contributions from water within cellular compartments such as axonal or glial processes [28]. Ultra-strong gradients are advancing the development of biophysical models of diffusion in tissue that hold the promise of quantifying important cellular features such as axonal diameters, densities and orientational dispersion in vivo [29, 30].

Here, we present an in-depth characterization of white matter microstructure in both typically-developing and 22q11.2DS participants using ultra-strong gradients and an extensive multi-shell dMRI acquisition, including b-values up to 6000 s/mm2. By so doing, we conducted a multiparametric analysis to gain novel insights into the microstructural properties underlying disrupted axonal morphology, to better describe previous observations of white matter hypo-connectivity in 22q11.2DS. The findings provide insight into 1) disentangling microstructural connectivity differences in brain white matter in 22q11.2DS versus typical developing participants, and 2) establishing the sensitivity and specificity of in vivo dMRI to underlying biophysical signatures of hypo-connectivity in white matter.

Methods and materials

Subjects

This study included 92 typically-developing (TD) children (age range: 8–18 years, 49 F / 43 M) and 9 children with 22q11.2DS (age range: 11.8 to 15.2, 7 F / 2 M) (see Table 1 for full demographic information). From the TD group, a subset of children was selected based on age and sex (n = 22, age range: 12.0–15.5, 14 F / 8 M; age- and sex-matched group, ASM) for group comparison to children with 22q11.2DS. The Cardiff University School of Psychology Ethics Committee approved data collection procedures for the TD group, and each participant and/or their parent or legal guardian provided written informed consent. TD participants were recruited from the Cardiff area, and screened to exclude major neurological disorders, such as epilepsy. Participants with 22q11.2DS were recruited from the Experiences of CHildren with cOpy (ECHO) number variants study at Cardiff University (https://bit.ly/Cardiff-Echo-Study) [31]. ECHO study protocols were approved by NHS Wales Research Ethics Committee and brain imaging procedures were approved by the Cardiff University School of Medicine Ethics Committee. As part of the ECHO study, 22q11.2DS participants underwent extensive psychiatric, developmental and cognitive assessment including IQ assessment using the Weschler Abbreviated Scale Intelligence (WASI-II [32]). The ASM control group also performed an abbreviated 2-subtest WASI to establish a normative reference point in IQ scores. 22q11.2DS also provided information about social communication difficulties indicative of ASD from the Social Communication Questionnaire (SCQ [33]), and psychiatric interviews were taken using the Child and Adolescent Psychiatric Assessment (CAPA [34]). Seizure and medication history were obtained from parental questionnaires [35]. One child had a history of febrile seizures, but no other seizure history or psychiatric/epilepsy medications were reported. All participants underwent the same pre-scan procedure, which included an in-person screening for MRI safety and a practice scan in a mock MRI scanner.

Table 1 Demographics of typically developing (TD) and 22q11.2DS participants.

Image acquisition and preprocessing

All imaging data were acquired on a 3 T Connectom scanner (Siemens Healthcare, Erlangen, Germany) with 300 mT/m gradients and a 32-channel radiofrequency coil (Nova Medical, Wilmington, MA, USA) at the Cardiff University Brain Research Imaging Centre (CUBRIC).

T1-weighted anatomical images were acquired using a 3D Magnetization Prepared Rapid Gradient Echo (MP-RAGE) sequence with the following parameters: echo time of 2 ms, repetition time of 2300 ms, time to inversion of 857 ms, and flip angle of 9°. The field of view was 256 × 256, the imaging matrix 256 × 256, with 192 slices of 1 mm slice thickness. The total scan time was 5 min and 32 s.

dMRI data were acquired using a multi-shell diffusion-weighted EPI sequence. Images covered the whole brain with field of view 220 × 220 mm, imaging matrix 110 × 110, and 66 slices of 2 mm slice thickness. dMRI were acquired using the Stejskal-Tanner sequence with an anterior-to-posterior phase-encoding direction, with an echo time of 59 ms, repetition time of 3000 ms, and with diffusion-weighting gradient separation ∆ = 23.3 ms and duration δ = 7 ms. Data were acquired at five different b-values: b = 500, 1200, 2400, 4000, and 6000 s/mm2; with 30 and 60 non-collinear directions used for b ≤ 1200 s/mm2 and b ≥ 2400 s/mm2, respectively [36]. At the maximum diffusion weighting, the gradient amplitude was 285.6 mT/m. In addition, 14 non-diffusion weighted images were distributed uniformly throughout the protocol. One additional volume was acquired with reversed phase encoding for the purpose of EPI distortion correction. The total dMRI scan time was 16 min and 14 s.

The following preprocessing steps were used to reduce thermal noise and image artifacts: image denoising [37], correction for signal drift [38], motion, eddy current, and susceptibility-induced distortion correction [39], gradient non-linearities, Gibbs ringing [40], and outlier rejection [41]. The preprocessing pipeline was implemented in MATLAB (The MathWorks, Natwick, MI, USA), and depended on open source software packages from MRtrix [42] and FSL [43].

Quantitative diffusion metrics

There has been little consensus as to which dMRI metric provides the most sensitive or specific characterization of the biophysical properties of white matter. In this work, we use an inclusive approach rather than focus on a single analysis strategy, examining a large number of distinct models across the feature space of dMRI (including low, intermediate and high b-value regimes). These include (i) DT-MRI, (ii) diffusion kurtosis imaging (DK-MRI), (iii) the rotationally invariant spherical moments of the signal at various b-values (i.e., spherical harmonics), and (iv) the Biophysical Standard Model (BSM) [29]. A comprehensive overview of the individual metrics that are derived from those models and used in this study is provided in Suppl. Section: Descriptions of quantitative diffusion metrics. In brief, we will refer to the following metrics. DT-MRI includes fractional anisotropy (FA), mean diffusivity (\(\bar D\)), radial diffusivity (\({{{{{{{\mathrm{D}}}}}}}}_ \bot\)), and axial diffusivity (\({{{{{{{\mathrm{D}}}}}}}}_\parallel\)). DK-MRI includes mean kurtosis (\({{{\bar{\mathrm K}}}}\)), radial kurtosis (\({{{{{{{\mathrm{K}}}}}}}}_ \bot\)), and axial kurtosis (\({{{{{{{\mathrm{K}}}}}}}}_\parallel\)). The spherical harmonics include spherical mean at low-b (\(\dot S_\mu \left( {b = 500} \right)\), spherical mean at high-b (\(\dot S_\mu \left( {b = 6000} \right)\)), spherical variance at low-b (\(\dot S_\sigma \left( {b = 500} \right)\), spherical variance at high-b, (\(\dot S_\sigma \left( {b = 6000} \right)\)). The BSM includes the intracellular signal fraction (f), parallel intracellular diffusivity (Dc), parallel and perpendicular extracellular diffusivity (\(D_e^\parallel ,D_e^ \bot\)) and orientational alignment (κ). In addition, the precision of individual metrics is evaluated in a simulation experiment; see Suppl. Section Precision of diffusion metrics.

Unique sensitivity to axon diameter

The Connectom 3 T MRI scanner is a unique research tool with only four sites available worldwide. This scanner allowed for experimental set up that is sensitive to axon morphology, such as diameter; see Suppl. Section Sensitivity to axon morphology. The combination of strong gradients (300 mT/m versus 40–80 mT/m in clinical scanners) and short diffusion gradient duration yields an unprecedented sensitive to the inner axon diameter. Unfortunately, the direct measurement of axon diameters using dMRI remains challenging due to long scan times that might be unattainable in vulnerable cohorts. However, for the first time, the axon diameter is encoded in the dMRI data measured at high b-values due to the use of ultra-strong gradients.

Biophysical principal components

dMRI metrics are sensitive to a wide array of microscopic components, such as axonal density and packing, intra-voxel fiber orientational dispersion, membrane permeability, and myelination; however, a single metric on its own, such as FA, lacks specificity to any one particular feature of the white matter microstructure. Additional information is needed to assign more specific interpretation, but acquiring a large volume of complex dMRI data becomes difficult to analyze. To overcome this, we applied a dimensionality reduction technique to uncover hidden structures, or latent variables, that aggregate shared biophysical features across the observed data allowing for more specific interpretation without the statistical limitations imposed by a biophysical model. This approach is particularly powerful as it is not hindered by the potential correlation between metrics or redundancy that is also present in dMRI.

A principal component analysis (PCA) was used to identify latent structure in total white matter estimates [44]. The full TD cohort (n = 92) was used to train the model. The data sampling adequacy was assessed using a Kaiser-Meyer-Olkin (KMO) test and the Bartlett’s test of sphericity. Parallel analysis was used to select the optimal number of principal components. We used an oblique matrix rotation to allow underlying biophysical components to correlate, as this is likely most biologically plausible. The loadings per principal component were used to compute a composite total, or PCA component score.

Segmentation of the white matter and individual tracts

Digital brain extraction was performed using the FSL Brain Extraction Tool (BET) [43] followed by the FSL segmentation tool (FAST) [45] in which three tissue types, i.e., CSF, WM, and GM were segmented. Moreover, automated tract segmentation software (TractSeg) was used to extract eleven white matter fiber tracts using the b = 6000 s/mm2 shell for each participant [46]. Tract selection was referenced from previous developmental neuroimaging studies using dMRI [23]. For this study, these include association tracts: arcuate fasciculus (AF), cingulum (CG), inferior fronto-occipital fasciculus (IFO), inferior longitudinal fasciculus (ILF), superior longitudinal fasciculus (SLF), and uncinate fasciculus (UF); commissural tracts: genu (GCC), body (BCC), and splenium (SCC) of the corpus callosum; and projection tracts: corticospinal tract (CST), optic radiation (OR). All tract reconstructions were bilateral except for the inter-hemispheric projections of the corpus callosum; see Suppl. Fig. 1.

Tract volume quantification

The volume of each segmented fiber pathway was estimated by creating a tract density map using the associated streamlines [47], then counting the total number of voxels in which streamlines were observed. Nominal tract volumes were normalized for total brain volumes to compensate for brain size differences. Since each individual streamline is strongly susceptible to thermal noise, the estimation of volume quantification might be less accurate for fiber pathways with high surface-to-volume ratio. For our cohort, the minimally detectable effect size is 10 to 20% (cf. Suppl. Section Tract Volume Quantification, Suppl. Fig. 2). Therefore, the sensitivity of the technique is intrinsically limited to relatively large changes in typical and/or atypical development.

Along tract profiling

The estimation of the BSM parameters on a voxel level is challenging because of the poor precision and degeneracy in the model at such a low signal-to-noise ratio (SNR), i.e., multiple solutions are possible for one signal [48]. However, by adopting a novel approach of averaging the (rotationally-invariant) spherical moments within tract segments prior to model fitting, we were able to boost the SNR of our data and thus the reliability of BSM parameter estimates, thereby fostering more accurate and precise biophysical modeling. Note, the curvature of tracts does not bias the model parameters because they are derived from the rotationally invariant spherical moments of the signal.

Specifically, we projected the rotationally-invariant spherical moments of each individual tract onto twenty equidistantly-spaced nodes of the associated centerline. This in-house strategy was implemented in MATLAB, inspired by tractometry methods such as the AFQ toolbox and others [49, 50]. Each centerline was computed by fitting a three-dimensional spline through equidistantly-spaced sample points of 10,000 streamlines that constituted an individual bundle [49]. The intrinsic challenges of fiber tracking, i.e., spurious connections, might propagate in this analysis [51]. Therefore, quality control was performed using visual inspection of centerline geometry and positioning. From this, we identified the body of the corpus callosum, or BCC, to be poorly robust due to streamlines occurring in spurious directions. The BCC was removed from subsequent comparisons of 22q11.2DS participants. In addition, along-tract metric profiles for all subjects were aligned to account for varying tract lengths and head sizes using a linear translation and rescaling of the nodes.

Sensitivity to age-effects in typical development

Age-effects of dMRI are well reported in the literature and we use this analysis to establish the sensitivity of the PCA technique. The associations between age and PCA scores/individual metrics within selected tracts were assessed using the goodness-of-fit for both linear (age) and quadratic (age2) models as reported in the literature for developmental cohorts [23]. Using the corrected Akaike information criterion (AIC) to compare linear versus quadratic models, we found no statistical evidence to motivate the more complex (i.e., quadratic) model for this age range. Therefore, all age-associations for PCA scores/individual metrics in total white matter and tracts were assessed using a simple linear regression model for age. Model statistics for each comparison with age are reported as the r2 and unstandardized β coefficient. The statistical significance level (p ≤ 0.035) is determined using the Benjamini–Hochberg procedure to control the false discovery rate (FDR) in this study [52]. Predicted metric scores were computed using the regression equation for youngest (8 year old) and oldest (18 year old) participants. The difference in predicted metric scores was used to illustrate age-effects in the TD group.

Group differences in 22q11.2DS and age- and sex-matched controls

Standardized effect size was computed between age- and sex-matched controls (ASM) and 22q11.2DS participants. Here, we report Hedge’s g, which corrects for bias that might be present due to small sample sizes [53]. The statistical significance level is p ≤ 0.011 after FDR correction. The t-statistic and degrees of freedom df(N-1) are reported for independent group comparisons. Feature extraction and statistical analyses were conducted using R v3.4.1 (Single Candle) [54] and MATLAB.

Results

Multidimensional in vivo dMRI captures biophysical components of white matter

A PCA analysis was performed to characterize latent features present in dMRI data of white matter microstructure, Fig. 1. Parallel analysis in total white matter suggested that three components (dubbed here ’Biophysical Principal Components’ or BPCs) derived from eleven dMRI metrics were adequate to explain the data, together explaining 91% of the total variance in white matter. Parallel analysis was also applied to individual tracts, again confirming three components with the same groups of variables. Data sampling adequacy was verified by the KMO-statistic, KMO = 0.68, and Bartlett’s test of sphericity, x(55) = 2005, p < 0.05. Table 2 shows the standardized component loadings of each metric after oblique rotation.

Fig. 1: Age associations with the three biophysical principal components: intracellular signal (black), extracellular mobility (red), and tissue complexity (green).
figure 1

a BPC values are correlated with age in total white matter. Points are highlighted for age- and sex-matched controls (filled circles), and 22q11.2DS participants (filled squares). The black line denotes the linear regression for the TD group only (with dashed 95% confidence intervals). b Nominal values are computed for 1) estimates from the linear regression for youngest (8 year old) and oldest (18 year old) TD participants, and 2) measured BPC scores of ASM and 22q11.2DS participants. Asterisk denotes significance between group means (p < 0.05, p < 0.001).

Table 2 Standardized factor loadings for total white matter.

Inspection of the loading matrix showed that each of the three BPC’s could be described as capturing a specific biophysical dimension of microstructural features in white matter, as indicated by high component loadings for groups of variables (>0.5; Fig. 2). Here, we introduce labels to represent the most salient and unifying properties from the individual, observable metrics that contribute to each component. These are: intracellular signal, extracellular mobility, and tissue complexity.

Fig. 2: dMRI fingerprint of development. Metrics are grouped by intracellular signal (IS, blue), extracellular mobility (EM, red), tissue complexity (TC, green), and Biophysical Standard Model parameters (black).
figure 2

a The highest metric loadings are shown for each BPC. b Bar plots are encoded with linear regression statistics in total white matter. Color intensity is effect size (r2); bar length is percent difference of metric estimates from the linear regression for youngest (8 year old) and oldest (18 year old) TD participants. c Triangle plots are the same statistics for individual white matter tracts. Triangles are divided by left (L) and right (R) hemisphere for association and projection tracts. Commissural tracts span both hemispheres and are not divided. Percent differences are denoted by the scale bar, reaching a maximum of 25% difference. Triangle orientation, i.e., up or down, denotes positive or negative associations with age, respectively. Triangles are omitted that do not survive statistical significance.

The first component, intracellular signal, captured 31% of the variance in the data, and was characterized most strongly by FA, \(\dot S_\mu \left( {b = 6000} \right)\), \(\dot S_\sigma \left( {b = 500} \right)\), \(\dot S_\sigma \left( {b = 6000} \right)\). FA is generally nonspecific and has been associated with many tissue properties [25]; however, when combined with high-b (\(\dot S_\mu \left( {b = 6000} \right)\), \(\dot S_\sigma \left( {b = 6000} \right)\)) and \(\dot S_\sigma \left( {b = 500} \right)\), the resultant feature is more selective for intracellular signal. This is in part, due to diffusion weightings at high b-values (b = 6000 s/mm2) acting to suppress signals from the extracellular matrix. An in-depth technical discussion on interplay of spherical harmonic metrics and FA is provided in Suppl. Section The biophysical interpretation of the intracellular signal component.

The second component, extracellular mobility, captured 33.1% of variance in the data, with high loadings for \(\bar D\), \(D_ \bot\), \(D_\parallel\), and \(\dot S_\mu \left( {b = 500} \right)\). These metrics exist in the low-b regime (e.g., b < 2000 s/mm2), in which extra- and intra-cellular signal contribute to the signal.

And finally, the third component, tissue complexity, captured 27.7% variance in the data, with highest loadings for \(\bar K\), \(K_ \bot\), \(K_\parallel\). Kurtosis metrics have previously been used to assess tissue complexity within the white matter microenvironment. Kurtosis is quadratically more sensitive than DT-MRI and therefore a powerful tool in detecting subtle differences [55].

BPCs are sensitive to age effects

Regression scores showed sensitivity to age for all BPCs; however, the strongest association with age was found to be tissue complexity (r2 = 0.369, β = 0.002, p < 0.0001). This was closely followed by intracellular signal (r2 = 0.325, β = 0.003, p < 0.0001), and extracellular mobility (r2 = 0.222, β = −0.002, p < 0.0001) (Fig. 1a).

In total white matter, metrics associated with intracellular signal, i.e., \(\dot S_\mu\) and \(\dot S_\sigma\) at high b-values, had the largest differences versus all other metrics in predicted values between ages 8 and 18 of 20%. This large effect was reflected by strong associations with age (\(\dot S_\mu \left( {b = 6000} \right)\): r2 = 0.553, β = 0.002, p < 0.0001; and \(\dot S_\sigma \left( {b = 6000} \right)\): r2 = 0.369, β = 0.0002, p < 0.0001) (Fig. 2). Significant age-associations were also found at the tract level for both metrics, as shown in Fig. 2c. In comparison, total white matter \(\dot S_\sigma\) at low b-value and FA were less sensitive to age effects (\(\dot S_\sigma \left( {b = 500} \right)\): r2 = 0.120, β = 0.0001, p = 0.001; and FA: r2 = 0.165, β = 0.002, p = 0.0002) and had negligible effects at the individual tract level.

For extracellular mobility metrics, \(\bar D\), \(D_ \bot\), and \(\dot S_\mu \left( {b = 500} \right)\) had significant associations with age in total white matter (\(\bar D\): r2 = 0.216, β = −0.004, p < 0.0001; \(D_ \bot\): r2 = 0.294, β = −0.005, p < 0.0001; \(\dot S_\mu \left( {b = 500} \right)\): r2 = 0.248, β = 0.001, p < 0.0001). These associations were also present for most tracts. \(D_\parallel\) did not have any notable associations with age for total white matter or individual tracts.

Tissue complexity metrics had high sensitivity to age for total white matter and most individual tracts. In total white matter, \(K_\parallel\) had the strongest association (\(K_\parallel\): r2 = 0.55, β = 0.005, p < 0.0001), followed by \(\bar K\) (r2 = 0.28, β = 0.007, p < 0.0001) and \(K_ \bot\) (r2 = 0.16, β = 0.008, p = 0.0002).

Participants with 22q11.2DS have higher intracellular signal and lower extracellular mobility compared to age- and sex- matched controls

Intracellular signal fraction was increased relative to ASM controls (t(29) = 4.136, p < 0.001, g = 1.594), and lower extracellular mobility (t(29) = − 3.897, p < 0.001, g = 1.502) (Fig. 1b). There were no differences in tissue complexity (t(29) = 0.355, p = 0.725, g = 0.137).

A summary of individual quantitative diffusion metrics in total white matter and tracts is found in Fig. 3. In total white matter, intracellular signal was led by FA (t(29) = 4.611, p < 0.001, g = 1.777), followed by \(\dot S_\sigma \left( {b = 500} \right)\), (t(29) = 3.793, p < 0.001, g = 1.462); \(\dot S_\sigma \left( {b = 6000} \right)\), (t(29) = 3.789, p < 0.001, g = 1.460); and finally \(\dot S_\mu \left( {b = 6000} \right)\), (t(29) = 3.673, p = 0.001, g = 1.416).

Fig. 3: dMRI fingerprint of 22q11.2DS.
figure 3

Metrics are grouped by intracellular signal (IS, blue), extracellular mobility (EM, red), tissue complexity (TC, green), and Biophysical Standard Model parameters (black). a Bar plots are encoded by mean difference and effect size in total white matter. Color intensity is effect size (Hedge’s g); bar length is percent difference between average metric values for ASM and 22q11.2DS participants. b Triangle plots are the same statistics for individual white matter tracts. Triangles are divided by left (L) and right (R) hemisphere for association and projection tracts. Commissural tracts span both hemispheres and are not divided. Percent differences are denoted by the scale bar, reaching a maximum of 25% difference. Triangles pointed up indicate a higher metric values for 22q11.2DS relative to ASM (and vice versa for down). Data from BCC is not shown for the Biophysical Standard Model parameters due to lack of robustness in the along-tract profiling for that particular tract (marked as –). Triangles are omitted for low and moderate effect sizes, Hedge’s g < 0.4, and left blank if not significant, p < 0.011.

22q11.2DS had higher individual metric values of intracellular signal relative to ASM controls for most individual tract types. Only the low-b metric, \(\dot S_\sigma \left( {b = 500} \right)\) had consistently smaller or absent significant effects across tracts.

Extracellular mobility was decreased in 22q11.2DS compared to ASM for total white matter. The highest effect was in \(D_ \bot\) (t(29) = − 4.450, p < 0.001, g = 1.715), followed by \(\dot S_\mu \left( {b = 500} \right)\) (t(29) = 4.205, p < 0.001, g = 1.620), and \(\bar D\) (t(29) = − 3.660, p < 0.001, g = 1.411). Only \(D_\parallel\) did not reach significance threshold with the FDR correction (t(29) = 0.992, p < 0.015, g = 0.992).

At the tract level, the diffusivity metric, \(D_ \bot\) was most consistently different, being significantly less for all tracts except UF. \(\bar D\) was also consistent, with significant effects for all tracts except ILF, CST (L), BCC, and SCC. Effects remained small or absent for \(D_\parallel\) as seen in total white matter. For \(\dot S_\mu \left( {b = 500} \right)\), effects were relatively small for most tracts (here showing an increase, which corresponds to decreased diffusivity).

Tissue complexity metrics, \(\bar K\), \(K_ \bot\), \(K_\parallel\), were not sensitive to group differences for total WM or at the tract level. This is a notable omission, as kurtosis metrics were the most sensitive to age effects in the TD group.

Biophysical modelling in support of intracellular signal

For the TD cohort, intracellular signal fraction (f) showed the greatest sensitivity to age in total white matter (r2 = 0.299, β = 0.005, p < 0.0001), being 9% higher over the age range. These effects were significant for most tracts with percent differences ranging from 3 to 16%. The \(D_e^ \bot\) and κ were lower with age in total white matter (\(D_e^ \bot\): r2 = 0.094, β = −0.005, p = 0.005; κ: r2 = 0.019, β = −0.0007, p = 0.11), although κ did not reach significance here. \(D_c\) and \(D_e^\parallel\) were not sensitive to age effects.

Participants with 22q11.2DS had 5% higher f, 4.6% lower \(D_e^ \bot\) ~ 3.6% lower \(D_e^\parallel\), and 1.6% lower κ versus the ASM control group. Only f survived the statistical threshold (t(29) = 3.474, p = 0.0016, g = 1.339). The remaining metrics were not sensitive to differences, possibly reflecting poor precision inherent in fitting a large number of model parameters (Suppl. Fig. 3). At the individual tract level, f, \(D_e^ \bot\), \(D_e^\parallel\) had some sensitivity to group differences of primarily association tracts. While biophysical modelling holds the promise of more specific interpretation, it is challenged by lower precision in parameter estimates. In Suppl. Fig. 3, the precision of metrics and BSM model parameters is shown, highlighting this discrepancy, e.g., Dc and \(D_e^\parallel\). Therefore, the combination of both sensitive and specific dMRI metrics expands our inferences to allow for in-depth phenotyping of white matter microstructure.

Participants with 22q11.2DS have lower total WM and individual tract volumes compared to age- and sex-matched controls

Whole brain and individual tract volumes were computed for both TD and 22q11.2DS groups, see Fig. 4. For the TD group, there were significant age effects for lower GM volume and higher CSF volume with age (GM: r2 = 0.536, β = −0.007, p < 0.0001; CSF: r2 = 0.579, β = 0.007, p < 0.0001), aligning with previous developmental literature [56, 57]. There were no significant age effects for total brain volume or WM volume which is not consistent with other findings. In addition, the WM volume analysis per tract was not sensitive to age effects.

Fig. 4: Whole brain and tract volume fractions in TD and 22q11.2DS participants.
figure 4

a Total brain volume and gray matter (GM), white matter (WM), and cerebrospinal fluid (CSF) volume fractions are correlated with age. Points are highlighted for ASM (filled circles) and 22q11.2DS participants (filled squares). The black line denotes the linear regression line for the TD group only (with dashed 95% confidence intervals). b Nominal differences in GM, WM, and CSF are shown for volume fraction estimates from the linear regression for youngest (8 year old) and oldest (18 year old) TD participants. In addition, measured volume fractions are shown for ASM and 22q11.2DS participants (data points overlaid on bar). c Individual WM tract volume fractions are shown for (top) youngest (8 year old) and oldest (18 year old) TD, and (bottom) ASM and 22q11.2DS participants. d Percent differences in WM tract volume fractions for ASM and 22q11.2DS participants are overlaid on a single subject’s tractogram. The radius of each tract ’tube’ reflects the percent difference between groups (ranging from 0–23%). Tract colors match panel C. Asterisk denotes significance between groups (p < 0.05, p < 0.001).

Compared to ASM, 22q11.2DS children had significantly higher CSF volume (t(29) = 3.705, p < 0.001, g = 1.428) and lower WM volumes (t(29) = − 2.197, p = 0.036, g = 0.847). There were no differences in whole brain or GM volumes (Fig. 4a, b). As shown in Fig. 4c, there were significant reductions in most individual tract volumes for 22q11.2DS. Differences in tract volumes were large, ranging from 9 − 28%, and were present irrespective of tract type or location. Figure 4d highlights the global pattern of these differences across the brain.

Discussion

Replicability of previous studies

22q11.2DS has been previously studied using DT-MRI metrics for almost 20 years. The landscape of these studies is complex, resulting from differences in study design, inclusion criteria and sample sizes, clinical hardware (1.5 T to 3 T), and analysis approach. Most recently, the ENIGMA-22q11.2DS working group harmonized data across 10 study sites in the largest DT-MRI study ever completed in this population [22]. This work included 594 participants: 334 with 22q11.2DS and 260 age- and sex-matched controls (age range: 6–52 years) and serves as the most comprehensive benchmark data set for the evaluation of WM differences in 22q11.2DS.

Overall, the ENIGMA results highlight the presence of a global WM phenotype in 22q11.2DS. For this study, similar patterns of increased FA for most tracts and decreased diffusivity metrics for \(\bar D\), \(D_ \bot\), \(D_\parallel\) were observed in 22q11.2DS relative to controls. The main exceptions were three association regions - the SLF, external capsule, and fornix where the ENIGMA authors observed significantly lower FA. We did not include external capsule and fornix, however, this study did not detect lower FA in SLF, Fig. 3. This could be due to a number of differences in study design and/or methodology. For example, the age range of the ENIGMA study was much broader. White matter development is heterochronistic, meaning different regions develop at different times and rates. In this way, association tracts, such as SLF, have the most protracted development, continuing into the third decade of life and beyond [17, 23]. It is possible that competing maturational processes such as ongoing myelination may confound inferences over a broad age range, leading to the observed differences. A second possibility is that the ROIs used in the ENIGMA study do not directly correspond to the tractography approach used here. This is discussed in Suppl. Section, Comparison to ENIGMA DT-MRI results, alongside an in-depth comparison of DT-MRI metrics between these studies (Suppl. Table 1).

In-depth microstructural phenotyping of white matter in 22q11.2DS

The dMRI signal reflects a diverse array of microstructural attributes, including axonal density and packing, intra-voxel fiber orientational dispersion, membrane permeability, and myelination. Therefore, it is an inherent challenge to extract inferences on which specific process, or combination of processes, is most likely contributing to dMRI metrics. The utility of PCA techniques for grouping MRI measures in biophysically relevant ways has been demonstrated previously [44, 58, 59]. Here, this approach allowed us to probe the sensitivity of dMRI to differences in 22q11.2DS while characterizing three distinct sources, or BPCs, representative of the underlying microstructure. Accepting the risk of oversimplification, we coined these BPCs as “intracellular signal”, “extracellular mobility”, and “tissue complexity”.

By comparing the 22q11.DS to age- and sex-matched matched controls, projecting the differences onto trends of typical development in adolescence, we conclude that children with 22q11.2DS had dMRI profiles most similar to older children. This holds for intracellular signal and extracellular mobility, in addition to their composite metrics for both total white matter and individual tracts. Tissue complexity was not sensitive to group differences. This component, comprised exclusively of DKI metrics, was in contrast to strong developmental effects observed in the TD group.

Volumetric differences in WM

In addition to the diffusion metrics quantifying microstructure, we observed significant morphological differences in total WM and white matter bundles. Specifically, we identified lower white matter volume in the following tracts for 22q11.2DS: AF, CG, ILF, SLF, and OR. As seen in Fig. 4d, these tracts represent widespread decreases throughout the brain, which are also in agreement with previous studies of 22q11.2DS [60]. We did not observe significant age-related differences in WM volume across the age-span studied here in the typically developing cohort. However, the reduction in GM volume and higher CSF volume is consistent with the TD literature [57]. In what follows, we will argue that the combination of changes in dMRI metrics that were sensitized to axonal morphology at the micro-scale, and lower white matter volume in the 22q11.2DS participants at the macro-scale, provides evidence in support of the hypo-connectivity hypothesis in 22q11.2DS and a lower number of large axons in particular.

In support of hypo-connectivity

To examine the biological plausibility of our findings, we can look at converging evidence from experimental animal models of 22q11.2DS. For instance, Fernandez et al. characterized underconnectivity as limited axon and dendrite growth, and disrupted mitochondria and synaptic integrity [12]. These effects were specific to pyramidal neurons in layer 2/3 of the cortex, which primarily support long-distance cortico-cortico connections. In a separate mouse model used to report on genetic susceptibility of schizophrenia, altered axonal growth, branching, and disrupted connectivity were linked to behavioral deficits [13, 14]. In these studies, both short- and long-range connectivity was impacted. The size of terminal axonal branching correlates with the caliber of axons - therefore, large diameter axons are capable of high rates of information transfer at these junctions [61,62,63]. Axonal growth and branching are energetically demanding processes. Therefore, in 22q11.2DS, where mitochondrial dysfunction has been linked to atypical axonal morphology [12], there may be a critical susceptibility for the large neurons. To date, we are not aware of any studies measuring axon diameter distributions specifically in 22q11.2DS; however, studies of other neurodevelopmental disorders, such as ASD and Angelman Syndrome, have reported a lower number of large axons in comparison to typical development [64,65,66]. Postmortem tissue from schizophrenia subjects also shows specificity to alterations in neuron morphology, such as dendritic spine loss [67]. Therefore, we hypothesize that the lower white matter tract volumes in 22q11.2DS are representative of densely packed axons with disproportionately small diameters.

The 22q11.2 deletion is highly penetrant in affected individuals but leads to heterogeneous risk profiles and presentations. Recent work on the genetic basis of 22q11.2DS has suggested there may be a consortium of gene effects - outside the 22q11.2 locus itself - that contribute to the onset of schizophrenia or ASD [68]. In addition, environmental factors may lead to a stochastic cascade of phenotypic variation over the lifespan (reviewed in [69]). Ultimately, many factors must be considered when evaluating the expression of phenotypes related to 22q11.2DS. As this body of work continues to grow, a more in-depth characterization of established brain phenotypes can provide new insights underlying potential mechanisms of WM hypoconnectivity [10].

Sensitivity to axonal morphology

A unique feature of our study is the sensitivity to axonal morphology at b = 6000 s/mm2 achieved by exploiting the ultra-strong gradients, i.e., 300mT/m, of the Connectom scanner. Our data did not allow for the direct quantification of the axon diameter (as discussed below in Considerations for scanning). However, various metrics are sensitive to the axon diameter and are, as such, directly sensitive to differences in the axon diameter distribution, with particular sensitivity to differences in the number and diameter of large axons [70]. The key parameters are the spherical mean and variance at the highest b-value. The significant increase in both parameters in the 22q11.2DS compared to age- and sex-matched controls is consistent with a reduction in mean axon diameter. Of relevance to our observation of higher FA in 22q11.2DS and in line with the ENIGMA-22q11.2DS DT-MRI study [22], an association between higher FA (and smaller \(D_ \bot\)) and smaller axons has previously been demonstrated [71].

Moreover, there is an inverse relationship between axon diameter and axon density, i.e., larger diameter axons take up more space, due not only to larger size, but also from the space

demands of neighboring glial cells such as myelinating oligodentrocytes and astrocytes [72,73,74]. When there are fewer large diameter axons, or their morphology is less complex due to fewer branch points, the space requirements go down - in agreement with the significant loss of tract-based white matter volumes seen in our study. As a consequence, the relative proportion of smaller diameter axons increases, leading to greater axonal density and the reduced \(D_e^ \bot\). Moreover, dMRI has a reduced (or even nullified) sensitivity to smaller axons as a result of which the dMRI signal is less attenuated (see Suppl. Fig. 4). Both effects are consistent with higher values of \(\dot S_\mu\) and f as seen in 22q11.2DS relative to ASM controls. Finally, higher orientational dispersion, (i.e., drop in orientational-concentration parameter κ), in combination with lower axial diffusivity, might reflect a more tortuous path of axons that has been associated with smaller axons [75, 76]. This hypothesis was also put forward by the ENIGMA working group based on the lower axial diffusivities in their study [22]. However, a more direct measurement of tortuosity along segments of axons depends on Dc, a metric that was not estimated with sufficient precision using the Biophysical Standard Model to support this hypothesis.

Typical development

Although this work focuses on 22q11.2DS, our results also provide new insights into typical development. There is ongoing debate about the biophysical underpinnings of brain maturation: axonal growth versus myelination [77]. To this end, the extended multi-shell dMRI data, including the high b-values, complement previous observations of enhanced sensitivity to age effects (e.g., b = 3000) [78, 79]. Notably, from the high b-value data, \(\dot S_\mu\) and \(\dot S_\sigma\) are the most sensitive parameters amongst all evaluated metrics. Given that the signal from the extra-cellular space is effectively filtered out at these b-values, this result suggests that the dynamics of the neuronal and/or glial processes, rather than the extra-cellular signal, are dominating effects related to age-dependency in dMRI. As suggested by [77], white matter maturation might encompass axonal growth or swelling due to neuronal activity [80, 81] or even hormonal fluctuations [82, 83]. However, the increase in axon diameters exclusively would have resulted in a lower \(\dot S_\mu \left( {b = 6000} \right)\) (see Suppl. Fig. 4). We observed the opposite; therefore, higher values of \(\dot S_\mu \left( {b = 6000} \right)\) and f are dominated by the greater density of neuronal and/or glial processes. In addition, we observed less alignment of these processes, κ. This particular combination supports the greater density of more orientationally-dispersed cellular processes, such as those arising from glial cells. Additionally, the increase in kurtosis and reduction in mobility in the extra-cellular space are both in agreement with a greater cellular density.

Based on these observations, we hypothesize that increased complexity in astrocytic processes may be contributing to the age-dependency of the diffusion-weighted signal in typically developing children. A detectable change in astrocytes might be expected because of their abundance in the white matter [62], the size and complexity of their processes [74], and their critical role in protracted white matter development [84,85,86]. While oligodendrocytes have been well characterized in development and are known to be fundamental for myelin formation and maintenance [87,88,89], astrocytes also have many important roles, including providing trophic support of oligodendrocytes, formation of the blood brain barrier, synaptic pruning and neurotransmitter recycling [90]. Moreover, it has been demonstrated that astrocytes go through morphological and density changes during brain maturation [84, 91]. Diffusion-weighted MR spectroscopy might resolve the relative signal contributions of neuronal and glial cell types during development in the white matter [92]. It will be important in the future to conduct similar studies in larger samples, relating the microstructural findings to neurodevelopmental disorder and psychiatric risk - preferably using a longitudinal design to better understand these risks.

Strengths and limitations

A distinguishing feature of the current study is the image contrast enhancement gained from using the unique ultra-strong gradients of the Connectom 3 T scanner. The strong gradients allow a given diffusion encoding to be achieved using a shorter echo time. In this way, diffusion scans were shortened significantly, thereby reducing T2-related signal loss, and boosting SNR by 40−70% in comparison with clinical MRI scanners. This increased the statistical power of the study to detect group differences; in part compensating for the low N associated with the 22q11.2DS participant group. In addition, recruiting children with 22q11.2DS enables us to investigate the neurodevelopmental pathways based on genotype, thus mitigating some of the difficulties with genetic and environmental heterogeneity in clinically ascertained groups such as ASD or schizophrenia. Overall, efforts to maximize contrast sensitivity of dMRI in both typically developing children and children with 22q11.2DS allowed for the investigation of microstructural hypo-connectivity in brain white matter.

Children with 22q11.2DS are harder to recruit than typically developing children due to the relative rarity of the syndrome and the frequency of MRI contraindications, e.g., history of cardiac surgery. This study was also severely challenged due to changing attitudes for in-person research visits during and post-COVID. However, despite the relatively low N compared to larger studies carried out across multiple sites, we observed similar effect sizes in diffusion metrics between 22q11.2DS and ASM controls. The enhanced sensitivity of our results was conferred by the strong gradients of the Siemens Connectom 3 T scanner, where high-b contrast acts as a filter for axons and glial processes, versus DTI (using low b) where microscopic differences are often masked. In addition, children with 22q11.2DS and the ASM group were within a narrow age-range (11 to 15), limiting the influence of variance due to known age-effects. Even so, the effect sizes reported for children with 22q11.2DS are still susceptible to magnitude and sign errors, and should be interpreted cautiously [93].

The current length of the image acquisition was demanding for children, particularly those with 22q11.2DS, although this was partly mitigated by providing the opportunity to acclimatize to the scanning procedure in a mock-scanner. Despite the richness of the protocol, certain recent developments in biophysical modeling, e.g., axon diameter mapping, could not be applied to these data, as the diffusion times were insufficiently short to resolve different diameters. As such, there was no direct in vivo quantification of axon diameter which would have further strengthened the validation of disrupted axon morphology (see Suppl. Fig. 4 on sensitivity to axon diameters). Despite earlier success in using the Connectom scanner to quantify axon diameters in vivo [30, 63, 94], these measurements require extensive scan times that are currently not compatible with the time constraints of scanning children. Characterizing axon diameter distributions in this cohort would either require a dedicated study, faster imaging techniques that maximize participant comfort, or the incorporation of motion correction strategies that do not rely on image-registration based approaches [95, 96]. It would also be interesting to include myelin-sensitive contrasts, e.g. magnetization transfer imaging, as this information is orthogonal to dMRI and has been shown to be sensitive to both typical and atypical development [97, 98].

Conclusions

In summary, our findings provide support that widespread differences in white matter volumes and microstructure are related to disruptions in axonal morphology in children with 22q11.2DS. Interestingly, the dMRI fingerprint of 22q11.2DS children was most similar in terms of biophysical principal components to those of older children from the TD cohort. Similar findings were also reported in previously published large-scale dMRI studies (i.e., low-b DTI). By exploiting the unique hardware used in this study, we extend the interpretation of dMRI signal beyond microstructural sensitivity towards cellular specificity. We demonstrate evidence for hypo-connectivity in 22q11.2DS, a hypothesis put forward in a preclinical setting, but for the first time observed in vivo. This is in contrast to new insights in TD, where glial processes are implicated in observed age-effects in white matter microstructure. The sensitivity and specificity provided by the combination of high-b dMRI and biophysical modelling highlights the discriminatory power of this approach for investigating dMRI signatures of both typical and atypical developmental populations.