Detecting microstructural deviations in individuals with deep diffusion MRI tractometry

Chamberland, Maxime; Genc, Sila; Tax, Chantal M. W.; Shastin, Dmitri; Koller, Kristin; Raven, Erika P.; Cunningham, Adam; Doherty, Joanne; van den Bree, Marianne B. M.; Parker, Greg D.; Hamandi, Khalid; Gray, William P.; Jones, Derek K.

doi:10.1038/s43588-021-00126-8

Download PDF

Article
Open access
Published: 22 September 2021

Detecting microstructural deviations in individuals with deep diffusion MRI tractometry

Maxime Chamberland ORCID: orcid.org/0000-0001-7064-0984^1,2,
Sila Genc¹,
Chantal M. W. Tax^1,3,
Dmitri Shastin ORCID: orcid.org/0000-0003-3937-3675^1,4,
Kristin Koller¹,
Erika P. Raven^1,5,
Adam Cunningham⁶,
Joanne Doherty^1,6,
Marianne B. M. van den Bree⁶,
Greg D. Parker¹,
Khalid Hamandi^1,4,7,
William P. Gray ORCID: orcid.org/0000-0001-7595-8887^1,4,7 &
…
Derek K. Jones¹

Nature Computational Science volume 1, pages 598–606 (2021)Cite this article

9386 Accesses
20 Citations
39 Altmetric
Metrics details

Subjects

Abstract

Most diffusion magnetic resonance imaging studies of disease rely on statistical comparisons between large groups of patients and healthy participants to infer altered tissue states in the brain; however, clinical heterogeneity can greatly challenge their discriminative power. There is currently an unmet need to move away from the current approach of group-wise comparisons to methods with the sensitivity to detect altered tissue states at the individual level. This would ultimately enable the early detection and interpretation of microstructural abnormalities in individual patients, an important step towards personalized medicine in translational imaging. To this end, Detect was developed to advance diffusion magnetic resonance imaging tractometry towards single-patient analysis. By operating on the manifold of white-matter pathways and learning normative microstructural features, our framework captures idiosyncrasies in patterns along white-matter pathways. Our approach paves the way from traditional group-based comparisons to true personalized radiology, taking microstructural imaging from the bench to the bedside.

Radiomic tractometry reveals tract-specific imaging biomarkers in white matter

Article Open access 05 January 2024

Peter Neher, Dusan Hirjak & Klaus Maier-Hein

Comprehensive diffusion MRI dataset for in vivo human brain microstructure mapping using 300 mT/m gradients

Article Open access 18 January 2022

Qiyuan Tian, Qiuyun Fan, … Susie Y. Huang

Multicenter dataset of multi-shell diffusion MRI in healthy traveling adults with identical settings

Article Open access 27 May 2020

Qiqi Tong, Hongjian He, … Jianhui Zhong

Main

Normative modeling is an emerging statistical framework that aims to capture variability by comparing individuals with a normative population¹. The benefits of normative modeling in neuroimaging in general are well documented, with applications in atypical brain development and psychiatry (see ref. ² for a review). Applications of deep normative modeling have mostly been demonstrated at the voxel level using functional MRI data³ and volumetric data derived from standard structural magnetic resonance imaging (MRI)^4,5.

Diffusion MRI (dMRI) is another MRI modality that allows non-invasive characterization of tissue microstructure. In the brain, for example, information about the structural architecture of the white matter can be obtained by probing the random motion of water molecules⁶ and acquiring multiple magnetic resonance images with different diffusion-senitization properties. The ability to derive semiquantitative features such as fractional anisotropy⁷ or mean diffusivity⁸, or to virtually reconstruct white-matter pathways with tractography⁹ has had a huge impact on the ability to distinguish between typical and atypical brain structures in vivo in health and disease¹⁰; however, prediction modeling (or case-control differentiation), in which a group of N patients with the same disease is compared with a group of N-matched controls, is not well suited to clinically heterogeneous groups (for example, neurological disorders¹¹, psychiatric disorders such as schizophrenia or autism spectrum disorder, and rare cases). Despite decades of progress in the research domain, the primary clinical use of dMRI has been for largely limited to diagnosing acute ischemic stroke or grading and monitoring of tumor invasion. Several studies have shown success in identifying subtle but important microstructural changes at the individual patient level¹², showcasing the potential of dMRI to be applied more broadly across applications. Yet there is a scarcity of dMRI frameworks for single-patient analysis (that is, one patient versus N controls). There is an urgent yet unmet need for a paradigm shift from group-wise comparisons to individualized diagnosis, that is, detecting whether (and where) the tissue microstructure of a single participant is abnormal¹³; not only would this greatly facilitate the study of clinically heterogeneous groups^1,4, it would also facilitate the study of rare diseases and true clinical adoption (that is, making a diagnosis/prognosis in an individual patient).

As mentioned above, current efforts to apply deep normative modeling to neuroimaging have so far relied on voxel-based methods. For the assessment of white matter—which comprises continuous pathways—under conditions in which an entire pathway may be affected (for example, in developmental disorders such as schizophrenia¹⁴), fragmenting the analysis in this way is suboptimal, whereas operating on the manifold of reconstructed tracts offers a more intuitive approach. Indeed, in comparing microstructural properties between groups, many computational pipelines adopt a tractometry approach^15,16, that is, mapping measures along pathways reconstructed via tractography, either by averaging along the whole tract¹⁷ or a segment thereof^18,19,20,21. Along-tract profiling has been applied previously to investigate various brain conditions^{15,19,22,23,24}. The main advantage here is that image registration can be avoided as tractometry is performed in each patient’s native space, resulting in a set of individual tract profiles. There are, however, some limitations. First, most analyses treat tractometry measures from specific pathways as independent measures. This univariate approach has the potential to obscure key relationships between different tracts. Focusing on any particular anatomical location therefore increases the risk of losing the full picture. Although individual pathways can appear normal in isolation, by considering them as an ensemble (for example, as part of a network), any such intertract relationships could collectively help to identify outliers. Second, when analyzing multiple measures (even when derived within the same tract), statistical analysis is hampered by: (1) the multiple comparisons problem; and (2) any covariance between measurements^17,25. Here multidimensional approaches can increase statistical power by combining the sensitivity profiles of independent modalities^25,26,27.

Here we present an anomaly detection framework (Detect) that uses a data-driven, unsupervised normative modeling approach based on deep autoencoders²⁸. Autoencoders are a type of artificial neural network traditionally used for dimensionality reduction²⁸. They can capture non-linear interactions between the input features by learning a self-representation of their inputs through a low-dimensional layer (Fig. 1); the goal is to generate an output ($\hat{\textbf{x}}$) similar to the input (x) by minimizing the reconstruction error. This same representation can be exploited for anomaly detection by analyzing deviations in the reconstruction. Detect moves substantially beyond dMRI group-level analysis techniques by identifying and localizing anomalies in multiple tract profiles at once at the individual level. The proposed framework was trained to learn normative sets of features derived from healthy brain tract profiles in three independent datasets and one reproducibility dataset²⁹. Once trained, the network is then presented with unseen healthy tract profiles (for testing) and subsequently exposed to tract profiles from individuals diagnosed with neurological/psychiatric conditions. We emphasize that the model has no access to the diagnostic labels during the training phase and thus our anomaly detection approach is fully unsupervised. To assess generalization, the framework was then applied to single participants with a range of neurological and psychiatric disorders, including children and adolescents with copy number variants (CNVs) at high risk of neurodevelopmental and psychiatric disorders; patients with drug refractory epilepsy; and patients with schizophrenia (SCHZ). We compared the performance of our approach with (1) a conventional z-score-distribution approach and (2) principal component analysis (PCA) combined with the Mahalanobis distance (a widely used approach in cluster analysis and classification techniques)^21,24,27. Importantly, the analyses of the three pathological cases were performed in the browser, from the point-of-view of future Detect users.

**Fig. 1: Graphical representation of the proposed anomaly detection framework.**

Results

Framework overview

Detect is an open-source, user-facing tool built on the interactive Streamlit framework, which promotes data exploration through interactive visualizations in the browser. The framework offers three main scripts: Detect, Inspect and Relate. Detect facilitates patient comparisons using cross-validated area under the curve (AUC) computed over a set of user-defined iterations, by means of z-score, PCA or autoencoder (see Supplementary Fig. 4). The output is a single, bootstrapped anomaly score for each patient. Inspect allows the user to select a single subject and to compare it with the rest of the population. Here, feature anomalies are highlighted using a leave-one-out cross-validation approach, and only segments along the tract where two consecutive outliers occurred are reported. Results are displayed in real-time during each iteration for the user to evaluate. Furthermore, both scripts allow the visualization of tract profiles. Finally, Relate is a simple visual interface for correlating the anomaly scores obtained by the previous commands with clinical scores. In all scenarios, microstructural features included fractional anisotropy, mean diffusivity and rotationally invariant spherical harmonic features of zeroth and second orders³⁰ (RISH0 and RISH2, respectively) derived from the largest b-value data in each dataset to maximize sensitivity to the intra-axonal signal³¹.

White-matter anomaly detection in CNV participants

Discriminating power

First we investigate individual differences in white-matter microstructure in children with CNVs at high genetic risk of neurodevelopmental and psychiatric disorders³², which are relatively rare and therefore challenging to recruit for research imaging studies³³. Note that the framework was trained using data from typically developing children only. In general, the autoencoder approach showed higher AUC scores across the microstructural metrics and was better at identifying CNV patients as outliers, providing substantially higher sensitivity–specificity tradeoffs than the z-score and Mahalanobis-based approaches (Supplementary Fig. 1 and Supplementary Table 2). For example, the RISH0 feature showed higher discriminating power (AUC = 0.83 ± 0.08) compared with the mean univariate z-score (AUC = 0.80 ± 0.09) and multivariate Mahalanobis distance (AUC = 0.61 ± 0.09). In comparing the RISH0 group distributions (Fig. 2), anomaly scores derived via the autoencoder were significantly different (Kolmogorov–Smirnov test = 0.62, P < 0.003; Cohen’s d effect size = 1.39) between the CNV individuals and the typically developing patients. In particular, all CNV patients had an anomaly score larger than the typically developing patient mean and 50% of them were larger than the 95th percentile of the typically developing patient population. In comparison, the difference between the anomaly scores was less pronounced with the z-score (Kolmogorov–Smirnov = 0.34, P = 0.3, Cohen’s d = 0.38) approaches, whereas similar results were obtained by using the Mahalanobis distance (Kolmogorov–Smirnov = 0.56, P = 0.01, Cohen’s d = 1.20).

**Fig. 2: Anomaly scores for the CNV dataset.**

Tract-specific deviations

A key advantage of using deep autoencoders for anomaly detection over traditional PCA-derived approach is its unique ability to relate the anomaly back to the individual elements of the input data. More specifically, the predicted data retains the same dimensionality as the input data and thus it is possible to see which feature cannot be accurately recovered by the autoencoder. For example, if a feature has a positive reconstruction error, then one can infer that the network learned a smaller value for that feature than what was provided as input. In the context of the CNV participants, multiple regions were highlighted (by positive reconstruction errors) as deviants from the typically developing patient population. Figure 3 reveals a high anomaly rate for various association bundles such as the bilateral inferior longitudinal fasciculus (ILF), optic radiations and the left superior longitudinal fasciculus (SLF_II). This is in line with current literature where microstructural differences are expected along association pathways, in agreement with psychotic symptoms³⁴.

**Fig. 3: Along-tract anomalies in a single patient.**

White-matter anomaly detection in epilepsy

Focal cortical dysplasia (FCD)—a malformation of cortical development—is the most common etiology in drug-resistant neocortical partial epilepsies³⁵. Although complete resection is the main predictor of seizure freedom following surgery, a substantial proportion of FCDs may be missed with standard clinical imaging protocols³⁵ and the seizure-generating network may extend far beyond the visible dysplasia. Diffusion MRI contrast enhances the sensitivity of MRI to differences in the brain, but has only been reported at the group level³⁶. Here we demonstrate two key advantages of the deep autoencoder approach in a clinical context. First, it succeeded in detecting white-matter anomalies that a conventional z-score-based approach has missed, potentially due to hidden interactions between the features. Second, detection of abnormal microstructural features away from putative seizure onset zone, as demonstrated in the first example, may contribute to the mapping of epileptogenic networks in individuals; thus, although the examples shown here had radiological changes detectable with T2-weighted sequences, the method could potentially be extended to cases of MRI-negative partial epilepsy increasing the diagnostic yield.

Patient 1 is a young adult female with seizures described as a fuzzy painful sensation in the torso rising up to the head, associated with mumbling sounds, occurring 2–5 times per day. A scalp video electroencephalogram (EEG) showed left temporal interictal epileptiform discharges and left temporal EEG onset. Clinical imaging demonstrated a small area of cortical–white-matter junction blurring in the laterobasal left temporal lobe associated with a transmantle area of T2 hyperintensity, suggestive of FCD type II³⁷. Neuropsychological assessment was concordant with a left temporal deficit, also revealing preserved mesial structures manifesting in relatively preserved verbal memory performance. Subsequent stereo-EEG (SEEG) implantation confirmed ictal onset and prominent interictal discharges from neocortical contacts immediately behind the MRI lesion; furthermore, neocortical discharges were seen in SEEG contacts close to the temporal pole. Based on those clinical findings (neuropsychological assessment and SEEG), the patient proceeded to have resection with histology consistent with FCD. Five tracts of possible relevance were interrogated (Fig. 4). Microstructural anomalies were identified along the left ILF and optic radiations in the immediate proximity of the T2-weighted changes corresponding to SEEG contacts with maximal ictal EEG changes. Anomalies in the temporal portions of the left inferior fronto-occipital fasciculus and uncinate fasciculus pointed towards the temporal pole corroborating the SEEG findings that, despite normal clinical MRI, this area was part of the seizure network.

**Fig. 4: Focal cortical dysplasia anomaly detection (patient 1).**

Patient 2 is an adult female with focal onset seizures since childhood occurring daily with episodes of loss of contact, grimacing and limb stiffening hypermotor movements, including clutching at nearby object on the left side. Scalp video EEG findings were consistent with frontal onset seizure semiology. Clinical MRI showed blurring of the cortical–white-matter junction between the right posterior superior frontal gyrus and the adjacent precentral gyrus, and a transmantle sign on T2/FLAIR from the cortex reaching all the way to the lateral ventricle, consistent with FCD type II. Subsequent SEEG recordings demonstrated spatial overlap between primary motor areas and early ictal onset and hence the patient did not proceed to surgery. Five tracts of possible relevance were interrogated with our framework (Fig. 5). Anomalies were detected corresponding to radiological and electrophysiological findings along the right corticospinal tract (CST), primary motor (CC4) and superior longitudinal fasciculus (SLF-I) beyond the visible lesion. No anomalies were found along the right cingulum and primary sensorimotor (CC5) regions.

**Fig. 5: FCD anomaly detection (patient 2).**

The results are promising, with the tool identifying anomalies in concordance with clinical hypothesis in a single-patient analysis paradigm, testifying to its utility for clinical evaluation. Its extra value is highlighted by its sensitivity to outlying tract segments not detected with the conventional z-score approach. The N = 1 approach to detect deep white-matter anomalies illustrated here will facilitate the identification of individualized therapy most appropriate to that patient, suggesting additional targets for diagnostic evaluation and possible surgical treatment.

Linking brain heterogeneity with epidemiological findings in schizophrenia

The extent to which individual clinical variability in schizophrenia relates to microstructural variability remains a key challenge in neuropsychiatry³⁸, with most findings being at the group or voxel level. For the RISH0 feature, the autoencoder approach was better at identifying single SCHZ patients as outliers (AUC = 0.64 ± 0.06) when compared with PCA (AUC = 0.59 ± 0.07) or the z-score (AUC = 0.39 ± 0.06). In comparing these group distributions, anomaly scores derived from the autoencoder were found to be significantly different (t = –2.60, P = 0.01, Cohen’s d = 0.47) between the SCHZ individuals and the healthy participants (Fig. 6). In particular, 31 of the 43 SCHZ patients had an anomaly score larger than the healthy participants’s mean and nine of them were larger than the 95th percentile of the healthy participants’s population. In comparison, the difference between the anomaly scores was less pronounced for the PCA (t = –1.75, P = 0.08, Cohen’s d = 0.32) and z-score (t = 1.85, P = 0.07, Cohen’s d = 0.33) approaches (see Supplementary Fig. 1 and Supplementary Table 2). Furthermore, the above chance-level detection rates of the proposed deep autoencoder in SCHZ suggest a successful application of the tractometry-based framework in unsupervised anomaly detection. The significance of these results are even more pronounced considering the challenging task at hand; that is, where even a supervised support vector machine classifier provides similar accuracy (AUC = 0.65 ± 0.13). Finally, to provide an illustrative way in which the Detect tool can provide an important avenue for future studies, anomaly scores were correlated with clinical scores. The Hopkins anxiety index (Hopkins symptom checklist), a widely used screening instrument to study mental illness, was used as a proof of concept. For the autoencoder, PCA and z-score, the Spearman’s rank correlation coefficients were ρ = 0.38 (P = 0.012), ρ = 0.3 (P = 0.055) and ρ = –0.12 (P = 0.448), respectively (Fig. 6).

**Fig. 6: Anomaly scores for the SCHZ cohort.**

Repeatability of anomaly scores and tract profiles

Using a test-retest dataset (six patients, five time points²⁹), we assessed the repeatability of (1) the input RISH0 tract profiles and (2) the generated anomaly scores by calculating the intraclass correlation coefficient (ICC, with two-way mixed, absolute agreement for average measurements as in ref. ²⁹) and the coefficient of variation (CoV). Supplementary Fig. 2 shows the repeatability of the tract profiles, with the optic radiations being the most reproducible bundles (mean ICC = 0.95, CoV = 0.03) and the left cingulum being the least reproducible (ICC = 0.66, CoV = 0.06). In terms of anomaly scores, the proposed anomaly detection framework shows reconstruction errors that are reproducible across sessions (Supplementary Fig. 3) with an ICC of 0.96 (95% CI: 0.88, 0.99) and a CoV of 0.06. Furthermore, reliability of anomaly scores in the pediatric dataset can be estimated from the test-retest dataset. Using the two-way mixed ICC for consistency (for single measurements) of 0.85 from the test-retest study, assuming a similar measurement-related variance and accounting for the standard deviations in both cohorts, the ICC for anomaly scores in the pediatric cohort is 0.86. Even if the measurement-related variance was to increase by 30%, the ICC would still indicate good reliability (0.76). This confirms that the demonstrated group differences in anomaly scores are unlikely to have occurred due to measurement-related variance.

Discussion

Further exploration of input features and hyperparameters of the model remains to assess the generalizability of the framework and its application to other pathologies. From a generalization point of view, the key problem in biomarker research is the need for individual prediction/diagnosis. Advancing knowledge of brain pathology and related cognitive impairment at the individual level is essential for early detection and intervention. Normative models show that, if groups are too heterogeneous, it can be a challenging task to learn characteristics from a given population using supervised approaches, hence the need for unsupervised learning¹. Although the amount of data we can employ in imaging studies is relatively small in comparison with population-based studies, the framework provides a principled method to detect individual differences in tissue microstructure. With the ever-growing amount of dMRI data being acquired, the framework will make less conservative inferences as the number of data points increases. In principle, our framework can be trivially extended to tract-based assessment of non-diffusion-based microstructural metrics (for example, such as magnetization transfer and myelin water imaging^16,17), as long as the associated quantitative maps are co-registered with the diffusion space. They could be derived from any manually defined or atlas-derived regions of interest, tract-based regions of interest as done here, or even at the voxel-based level³⁹. Moreover, our framework could be applied to any set of neuroimaging features. For example, these inputs could include time series from functional MRI and cortical thickness derived from structural T1 imaging. Finally, other applications of the framework include the characterization of microstructural changes in neurological disorders without gross pathology. Furthermore, Detect could also help predict worse phenotypic outcome in those with a rare genetic disorder (for example, CNV carriers) and potentially allow for early therapeutic planning in the future.

One of the limitations of single-patient analysis is that it requires substantial amounts of healthy participant data to define a normative brain¹. Combining multisite or multiscanner dMRI datasets can greatly increase the statistical power of neuroimaging studies⁴⁰, but cross-scanner and cross-protocol variability challenges joint analysis, hence the need for data harmonization^30,41. However, dMRI harmonization is a young field, there is not yet an established method that can bring cross-scanner variability back to the level of within-patient variability. Moreover, tractography still faces considerable challenges in the field^42,43 and most commonly available tools can only track reliably within normal-appearing white matter. Although recent machine learning approaches have shown promise in reproducible tract segmentation across patients⁴⁴, therefore strengthening hope of analyzing dMRI data for group studies, there is a potential challenge: along-tract profiling approaches should only be used when a complete tract has been reconstructed. Future work will establish the utility of this approach in conditions where pathology leads to incomplete tract reconstruction. Another limitation is that autoencoders require more computation than PCA; however, PCA will also have limitations for large datasets where memory storage is an issue.

The choice of dMRI measure will also influence the capacity of the tool to detect anomalies. For example, the use of fiber-specific measurements^45,46 could help better disentangle anomalies in fiber-crossing regions. This would of course demand a model-based approach as opposed to the RISH0 features employed in this study. Furthermore, it was recently demonstrated that there is more sensitivity to individual differences at high b-values^31,47. Nevertheless, we are encouraged to see that even with more commonly used b-values (for example, b = 1,000 s mm^–²), we are still able to uncover patient/control differences in the SCHZ cohort, highlighting the potential for widespread clinical adoption of dMRI.

In the context of microstructural MRI, the demonstration of Detect through the three different scenarios goes beyond the utility of other techniques and provides compelling motivation for future application. The unsupervised multivariate framework proposed here uses state-of-the-art machine learning to approach high-dimensional data non-linearly and improve accuracy and precision over traditional anomaly detection. Our deep learning approach also provides advantages over other statistical approaches for outlier detection as it was recently shown (using functional MRI data³) that deep unsupervised approaches improve identification of psychiatric patients compared to mass-univariate normative modeling. It is also generally accepted that autoencoders tend to perform better when the middle layer (that is, bottleneck layer) is small when compared with PCA. This can potentially mean that the same accuracy can be achieved with less components and hence may be beneficial for smaller datasets.

Browser-based applications are becoming increasingly popular among the computational neuroscience community due to their ease of use and accessibility across devices²¹. Detect enables the detection of abnormalities in clinically heterogeneous groups or rare cases and ultimately improve diagnosis of neurological and neuropsychiatric disorders. Our aim was to develop and distribute an open-source framework to characterize microstructural white-matter changes at the individual level. This enables the detection of abnormalities To the best of our knowledge, other tools such as AFQ Browser²¹ compare individuals using a linear approach (z-score) that considers each tract-segment independently and ignores potential complex interactions between the features. Those tract segments are then statistically tested in an univariate manner, and as such the correction for multiple comparisons—required by the typical high dimensionality of dMRI data—will hamper the discriminating power of the analysis²⁵. Recently, PCA was employed to acknowledge the multivariate nature of dMRI data^25,26,27, but this approach still relies on linear assumptions thereby ignoring possible complex interactions between the features. We believe strongly that the proposed deep autoencoder approach goes hand-in-hand with existing browser-based dMRI analysis frameworks²¹ in encouraging reproducible research and data-driven discoveries. Finally, we encourage future users of Detect to apply the tool to their own datasets and we welcome contribution to the tool in the form of added functionalities via GitHub.

In summary, diffusion MRI offers great promise to detect subtle differences in tissue microstructure when applied at the group level; however, the goal of clinical neuroimaging is to be applicable at the individual level. The single case approach proposed here will facilitate the identification of individualized therapy most appropriate to that patient, forming a baseline biomarker for subsequent monitoring through a therapeutic process. We believe that our tractometry-based anomaly detection framework paves the way to progress from the traditional paradigm of group-based comparison of patients against healthy participants, to a personalized medicine approach, and takes us a step closer in transitioning microstructural MRI from the bench to the beside.

Methods

Detect interactive interface

Users of Detect will input demographic data that consist of comma-separated values (.csv), where each row represents a patient (ID). Example demographics columns include: group, age, gender or clinical scores. The user is given the option to correct for confounding factors (that is, by treating those attributes as covariates across the entire brain²), resulting in age-independent microstructural features, for example. The microstructural tractometry data format consists of an .xlsx spreadsheet, where each sheet represents a dMRI metric (for example, fractional anisotropy, mean diffusivity and so on). As per the demographic data, patients are stacked individually on each row. The first column denotes the ID of each patient. The remaining columns follow the following convention: bundle_hemi_section where bundle is the white-matter bundle of interest, hemi is the hemisphere (that is, left or right and void for commissural tracts), and section is the along-tract portion (for example, from 1 to 20).

Data acquisition and preprocessing

CNV dataset

Diffusion MRI data were acquired from 90 typically developing children (age 8–18 years) and eight children with CNVs at high risk of neurodevelopmental and psychiatric disorders (CNVs, 2× 15q13.3 deletion, 2× 16p11.2 deletion, 3× 22q.11.2 deletion and 1× Prader–Willi syndrome) and no apparent white-matter lesions (age 8–15 years). Data collection procedures for the typically developing and CNV groups were approved by the Cardiff University School of Psychology and School of Medicine Ethics Committees, respectively. Children under 16 and those over 16 who lacked capacity to consent given written/verbal assent and their parents or legal guardians gave written consent on their behalf. Images were acquired using a Siemens 3 T Connectom MRI scanner (32-channel radiofrequency coil, Nova Medical) with 14 b0 images, 30 directions at b = 500, 1,200 s mm^–²; 60 directions at b = 2,400, 4,000, 6,000 s mm^–²; and 2 × 2 × 2 mm³ voxels (TE (echo time)/TR (repetition time) = 59/3,000 ms; Δ/δ = 24/7 ms). The total scan time for the multishell protocol was 16 min and 14 s. Each dataset was denoised⁴⁸ and corrected for signal drift⁴⁹, motion and distortion in FSL⁵⁰, gradient non-linearities⁵¹ and Gibbs ringing⁵². Next, RISH features³⁰ were derived for each patient using the b = 6,000 s mm^–² shell to maximize sensitivity to the intra-axonal signal³¹ (zeroth and second orders only, RISH0 and RISH2, respectively). Furthermore, diffusion tensors were generated using an in-house non-linear least squares fitting routine using only the b ≤ 1,200 s mm^–² data, followed by the derivation of fractional anisotropy and mean diffusivity maps.

Epilepsy dataset

Diffusion MRI data from two epilepsy patients with FCD were acquired on a Siemens 3 T Connectom MRI scanner with 60 directions at b = 1,200, 3,000 and 5,000 s mm^–² and 1.2 × 1.2 × 1.2 mm³ voxels (TE/TR = 68/5,400 ms; Δ/δ = 31.1/8.5 ms; total scan time = 30 min and 25 s). Furthermore, data on 15 healthy participants (aged 21–41 years) from the computational diffusion MRI harmonization database were used⁴⁰. Data collection procedures for the healthy participant and FCD groups were approved by the Cardiff University School of Psychology and School of Medicine Ethics Committees, respectively. Written informed consent was obtained from all patients. Each dataset was corrected for Gibbs ringing⁵², signal drift⁴⁹, motion and distortion in FSL⁵⁰, and gradient non-linearities⁵¹. Next, RISH0 features³⁰ were derived for each patient using the b = 5,000 s mm^–² shell. A fourfold data augmentation was applied to the healthy participant tract profiles using the synthetic minority oversampling technique⁵³ resulting in 75 healthy participants.

SCHZ dataset

Diffusion MRI data from the UCLA Consortium for Neuropsychiatric Phenomics⁵⁴ was downloaded from the OpenNeuro platform (openneuro.org/datasets/ds000030/versions/00016), which also contains demographic, behavioral and clinical data. Although more focused on functional MRI, the dataset contains dMRI data from 123 healthy participants and 49 individuals with SCHZ amongst other psychiatric disorders. Data were acquired on a Siemens 3 T Tim Trio MRI scanner with one b0 image, 64 directions at b = 1,000 s mm^–² and 2 × 2 × 2 mm³ voxels. Data quality assessment was first performed, resulting in the exclusion of datasets with reduced field-of-view (preventing the reconstruction of white-matter bundles in the inferior temporal lobes) and those with substantial slice dropout (impacting estimation of diffusion metrics). A total number of 109 healthy participants (aged 21–50 years) and 43 SCHZ patients (aged 22–49 years) were used for further analysis. Diffusion data were denoised⁴⁸, corrected for patient motion in FSL⁵⁰ and distortion using the anatomical T1-weighted image as reference. Next, RISH features (RISH0, RISH2) were derived for each patient. Finally, diffusion tensors were generated using iteratively weighted least squares in MRtrix⁵⁵ followed by the derivation of fractional anisotropy and mean diffusivity maps.

Repeatability dataset

To assess repeatability, we employed the microstructural image compilation with repeated acquisitions dataset²⁹, which comprises five repeated sets of microstructural imaging in six healthy human participants (three female, aged 24–30 years). Each participant was scanned five times in the span of two weeks on a 3 T Siemens Connectom system with ultra-strong (300 mT m^–1) gradients. Multishell dMRI data were collected (TE/TR = 59/3,000 ms; voxel size = 2 × 2 × 2 mm³; b-values = 0 (14 volumes), 200 and 500 (20 directions), 1,200 (30 directions), 2,400, 4,000, and 6,000 (60 directions) s mm^–²) resulting in a total scan time of 16 min and 37 s. The same preprocessing as the CNV dataset was applied. Data collection was approved by the Cardiff University School of Psychology Ethic Committee and written informed consent was obtained from all patients.

Tractometry

For each dataset, automated white-matter tract segmentation was performed using TractSeg⁴⁴ (see Supplementary Table 1) using multishell constrained spherical deconvolution⁴⁷. For each bundle, 2,000 streamlines were generated. Tractometry¹⁶ was performed (sampling fractional anisotropy, mean diffusivity, and RISH0 and RISH2 at 20 locations along the tracts^19,23,25) using a Nextflow architecture provided by SCILPY (github.com/scilus/scilpy). Specifically, individual streamlines were reordered for all patients to ensure consistency using the following order: left-to-right for commissural tracts, anterior-to-posterior for association pathways and top-to-bottom for projection pathways. Next, a core streamline was generated and microstructural metrics at each vertex of the bundle were projected to the closest point along the core. The resulting tract profiles were concatenated to form a feature vector (x).

Artificial neural network

Our autoencoder implementation consists of a symmetric design of five fully connected layers (l_n). The input and output layers (l₁ and l₅) have exactly the same number of nodes as the number of input tracts features. The inner layers (l₂ and l₄) consecutively apply a compression ratio of two by reducing the number of nodes by half, up to the bottleneck hidden layer (l₃). For example, if using an input vector made of 100 features, l₁ and l₅ will consist of 100 nodes; l₂ and l₄ 50 nodes; and 25 nodes for the bottleneck l₃. Rectified linear units activation was used between the layers to promote sparse activation and tanh for the last layer (epochs = 25, batch size = 24, learning rate = 1 × 10^–3; optimizer, Adam; loss, mean squared error; validation split = 0.1). Using different activation functions in different layers aims at balancing the advantages and disadvantages of the two activation functions. To promote sparsity and reduce overfitting, an activation penalty was imposed to the bottleneck layer using ℓ₁-regularization (×10^–5). This is especially best suited for models that explicitly seek an efficient learned representation. The goal is to generate an output ($\hat{x}$) similar to the input (x) by minimizing the reconstruction error. Here the MAE was used as anomaly score and is defined as:

$${\mathrm{MAE}}=\frac{1}{n}\mathop{\sum }\limits_{j=1}^{n}| {x}_{i}-{\hat{x}}_{i}| ,$$

(1)

The MAE measures the average magnitude of the errors and is derived during testing by computing the absolute differences between the reconstructed microstructural features (${\hat{x}}_{i}$) and the raw input features (x_i). Due to the heavily imbalanced group ratio between healthy participants and patients (that is, CNV and epilepsy), a bootstrap was implemented to draw random samples of equal sizes from each group. Specifically, the autoencoder is trained using healthy participants data only. The entire dataset is therefore first split into a training set (80%, made of healthy controls only) and a validation set (20%, by combining the patients with a matching number of healthy participants) to create the outer fold; 10% of the training set is held out for testing during the training phase (inner fold) to evaluate the loss. Age and sex regression⁵⁶ and feature normalization (min–max) were performed on the normative training set and subsequently applied to the held-out validation set to prevent information leakage. To derive conservative estimates and assess variations within the model, we repeated this process 100 times and report the mean MAE for each patient. Finally, we compared the sensitivity versus specificity of the anomaly scores using the mean receiver operating characteristic AUC across iterations, with standard deviations used as uncertainties. Comparisons of the AUC scores between patients and controls were performed using two-tailed t-tests assuming equal variances in the case of balanced groups and Kolmogorov–Smirnov test for the unbalanced groups. The correlation between anomaly scores and clinical scores was computed using Spearman’s ρ.

Univariate approach

One of the most commonly used tools in determining outliers is the z-score. The z-score (or standard score z = x − μ)/σ) is a way of describing a data point as deviance from distribution, in terms of standard deviations from the mean of the normal distribution. Here, z-scores were computed for each tract-segment, relative to the mean of the healthy group (μ) and averaged to derive a patient-specific anomaly score at each iteration (described above). The mean over all iterations described above was retained as anomaly scores for all patients.

Multivariate linear approach

Principal component analysis was applied to the set of features by restricting the dimensionality to preserve features accounting for 85% of the variance in the data. In Detect, this number can be manually defined as the percentage of explained variance. Next, the Mahalanobis Distance (M, a multidimensional generalization of the z-score that accounts for the relationships between the white-matter bundles) was used to derive an anomaly score defined as:

$$M(x)=\sqrt{{(x-\mu )}^{^{\prime} \cdot }{C{}^{-1}}^{\cdot }(x-\mu )},$$

(2)

where x represents the feature vector of a given patient, μ is the vector of mean microstructural metrics for each tract location s, and C⁻¹ is the inverse covariance matrix of the input features. The problem of anomaly detection can be seen as a one-class classification problem and therefore, our training data only contains healthy participants to calculate C. an M score was then derived for all unseen patients in relation to the healthy participant distribution. The same dataset split as aforementioned was used to derive a bootstrapped estimate of M for each patient, which was subsequently analyzed.

Support vector machine comparison

A supervised support vector machine classifier was used for comparisons on the SCHZ dataset. Class weights were set to account for the class imbalance between healthy participants and patients. The classifier was validated using a repeated (ten times) stratified, fivefold cross-validation approach in scikit-learn (scikit-learn.org). Optimized parameters were derived for each cross-validation fold using a grid-search approach. Those included the choice of kernel ([radial basis function, linear]), regularization ([1, 10, 100, 1000]) and gamma parameters ([10⁻³, 10⁻², 10⁻¹, 1, 10¹, 10², 10³]).

Reporting Summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

Original datasets are accessible through the original publications, including the MICRA²⁹ repeatability dataset (osf.io/z3mkn/) and the CNP⁵⁴ dataset (https://openneuro.org/datasets/ds000030/versions/1.0.00). The cDMRI dataset is publicly available and information on how to obtain the data can be found on the following webpage: https://forms.office.com/r/ZyLNjuYk3Y. Source Data are provided with this paper.

Code availability

Detect is an open-source anomaly detection framework for neuroimaging data and is available through Github⁵⁷ at github.com/chamberm/Detect under the Apache License. The framework is powered by Streamlit (www.streamlit.io), an open-source app framework for machine learning and data science. The repository is regularly updated via continuous integration to contain example data (tract-profiles), a Wiki section as well as Jupyter Notebooks with the Python code used to generate the figures in this study. A live demo is also available via Streamlit sharing. TractSeg is available at github.com/MIC-DKFZ/TractSeg. MRtrix is available at www.mrtrix.org. SCILPY is available at github.com/scilus/scilpy. FiberNavigator is available at github.com/chamberm/fibernavigator.

References

Marquand, A. F., Rezek, I., Buitelaar, J. & Beckmann, C. F. Understanding heterogeneity in clinical cohorts using normative models: beyond case-control studies. Biol. Psychiatry 80, 552–561 (2016).
Article Google Scholar
Marquand, A. F. et al. Conceptualizing mental disorders as deviations from normative functioning. Mol. Psychiatry 24, 1415–1424 (2019).
Article Google Scholar
Kia, S. & Marquand, A. Neural processes mixed-effect models for deep normative modeling of clinical neuroimaging data. In Proceedings of The 2nd International Conference on Medical Imaging with Deep Learning (eds. Cardoso, M. J. et al.) Vol. 102, 297–314 (PMLR, 2019).
Wolfers, T. et al. Mapping the heterogeneous phenotype of schizophrenia and bipolar disorder using normative models. JAMA Psychiatry 75, 1146–1155 (2018).
Article Google Scholar
Pinaya, W. H. L., Mechelli, A. & Sato, J. R. Using deep autoencoders to identify abnormal brain structural patterns in neuropsychiatric disorders: a large-scale multi-sample study. Hum. Brain Mapp. 40, 944–954 (2019).
Article Google Scholar
Stejskal, E. O. & Tanner, J. E. Spin diffusion measurements: spin echoes in the presence of a time-dependent field gradient. J. Chem. Phys. 42, 288–292 (1965).
Article Google Scholar
Pierpaoli, C. & Basser, P. J. Toward a quantitative assessment of diffusion anisotropy. Magn. Reson. Med. 36, 893–906 (1996).
Article Google Scholar
Basser, P. J., Mattiello, J. & LeBihan, D. Estimation of the effective self-diffusion tensor from the NMR spin echo. J. Magn. Reson. B 103, 247–254 (1994).
Article Google Scholar
Conturo, T. E. et al. Tracking neuronal fiber pathways in the living human brain. Proc. Natl Acad. Sci. USA 96, 10422–10427 (1999).
Article Google Scholar
Jones, D. K. & Cercignani, M. Twenty-five pitfalls in the analysis of diffusion MRI data. NMR Biomed. 23, 803–820 (2010).
Article Google Scholar
Hong, S.-J. et al. Towards neurosubtypes in autism. Biol. Psychiatry 88, 111–128 (2020).
Article Google Scholar
Deleo, F. et al. Histological and mri markers of white matter damage in focal epilepsy. Epilepsy Res. 140, 29–38 (2018).
Article Google Scholar
Scholz, J., Tomassini, V. & Johansen-Berg, H. Individual differences in white matter microstructure in the healthy brain. In Diffusion MRI (Second Edition) (eds. Johansen-Berg, H. & Behrens, T. E.) Ch. 14, 301–316 (Academic, 2014); https://doi.org/10.1016/B978-0-12-396460-1.00014-7
Lv, J. et al. Individual deviations from normative models of brain structure in a large cross-sectional schizophrenia cohort. Mol. Psychiatry https://doi.org/10.1038/s41380-020-00882-5 (2020).
Jones, D. K. et al. Age effects on diffusion tensor magnetic resonance imaging tractography measures of frontal cortex connections in schizophrenia. Hum. Brain Mapp. 27, 230–238 (2006).
Article Google Scholar
Bells, S. et al. Tractometry–comprehensive multi-modal quantitative assessment of white matter along specific tracts. In Proc. ISMRM Vol. 678, 0678 (2011).
De Santis, S., Drakesmith, M., Bells, S., Assaf, Y. & Jones, D. K. Why diffusion tensor MRI does well only some of the time: variance and covariance of white matter tissue microstructure attributes in the living human brain. NeuroImage 89, 35–44 (2014).
Article Google Scholar
Jones, D. K., Travis, A. R., Eden, G., Pierpaoli, C. & Basser, P. J. PASTA: pointwise assessment of streamline tractography attributes. Magn. Reson. Med. 53, 1462–1467 (2005).
Article Google Scholar
Yeatman, J. D., Dougherty, R. F., Myall, N. J., Wandell, B. A. & Feldman, H. M. Tract profiles of white matter properties: automating fiber-tract quantification. PLoS ONE 7, e49790 (2012).
Article Google Scholar
Colby, J. B. et al. Along-tract statistics allow for enhanced tractography analysis. NeuroImage 59, 3227–3242 (2012).
Article Google Scholar
Yeatman, J. D., Richie-Halford, A., Smith, J. K., Keshavan, A. & Rokem, A. A browser-based tool for visualization and analysis of diffusion MRI data. Nat. Commun. 9, 940 (2018).
Article Google Scholar
Dayan, M. et al. Optic radiation structure and anatomy in the normally developing brain determined using diffusion MRI and tractography. Brain Struct. Funct. 220, 291–306 (2015).
Article Google Scholar
Cousineau, M. et al. A test-retest study on Parkinson’s PPMI dataset yields statistically significant white matter fascicles. NeuroImage Clin.16, 222–233 (2017).
Article Google Scholar
Sarica, A. et al. The corticospinal tract profile in amyotrophic lateral sclerosis. Hum. Brain Mapp. 38, 727–739 (2017).
Article Google Scholar
Chamberland, M. et al. Dimensionality reduction of diffusion MRI measures for improved tractometry of the human brain. NeuroImage 200, 89–100 (2019).
Article Google Scholar
Dean III, D. et al. Multivariate characterization of white matter heterogeneity in autism spectrum disorder. NeuroImage Clin. 14, 54–66 (2017).
Article Google Scholar
Taylor, P. N., Moreira da Silva, N., Blamire, A., Wang, Y. & Forsyth, R. Early deviation from normal structural connectivity: a novel intrinsic severity score for mild TBI. Neurology 94, e1021–e1026 (2020).
Article Google Scholar
Hinton, G. E. & Salakhutdinov, R. R. Reducing the dimensionality of data with neural networks. Science 313, 504–507 (2006).
Article MathSciNet MATH Google Scholar
Koller, K. et al. MICRA: Microstructural image compilation with repeated acquisitions. Neuroimage 225, 117406 (2021).
Article Google Scholar
Mirzaalian, H. et al. Inter-site and inter-scanner diffusion mri data harmonization. NeuroImage 135, 311–323 (2016).
Article Google Scholar
Genc, S. et al. Impact of b-value on estimates of apparent fibre density. Hum. Brain Mapp. 41, 2583–2595 (2020).
Article Google Scholar
Chawner, S. J. et al. Genotype–phenotype associations in children with copy number variants associated with high neuropsychiatric risk in the UK (imagine-id): a case-control cohort study. Lancet Psych. 6, 493–505 (2019).
Article Google Scholar
Villalón-Reina, J. E. et al. Altered white matter microstructure in 22q11.2 deletion syndrome: a multisite diffusion tensor imaging study. Mol Psychiatry 25, 2818–2831 (2020).
Article Google Scholar
Tylee, D. S. et al. Machine-learning classification of 22q11.2 deletion syndrome: a diffusion tensor imaging study. NeuroImage Clin. 15, 832–842 (2017).
Article Google Scholar
Lerner, J. T. et al. Assessment and surgical outcomes for mild type I and severe type II cortical dysplasia: a critical review and the UCLA experience. Epilepsia 50, 1310–1335 (2009).
Article Google Scholar
Duncan, J. S., Winston, G. P., Koepp, M. J. & Ourselin, S. Brain imaging in the assessment for epilepsy surgery. Lancet Neurol. 15, 420–433 (2016).
Article Google Scholar
Blümcke, I. et al. The clinicopathologic spectrum of focal cortical dysplasias: a consensus classification proposed by an ad hoc Task Force of the ILAE Diagnostic Methods Commission. Epilepsia 52, 158–174 (2011).
Article Google Scholar
Stephan, K. E., Friston, K. J. & Frith, C. D. Dysconnection in schizophrenia: from abnormal synaptic plasticity to failures of self-monitoring. Schizophrenia Bull. 35, 509–527 (2009).
Article Google Scholar
Smith, S. M. et al. Tract-based spatial statistics: voxelwise analysis of multi-subject diffusion data. NeuroImage 31, 1487–1505 (2006).
Article Google Scholar
Tax, C. M. et al. Cross-scanner and cross-protocol diffusion MRI data harmonisation: a benchmark database and evaluation of algorithms. NeuroImage 195, 285–299 (2019).
Article Google Scholar
Cetin Karayumak, S. et al. Retrospective harmonization of multi-site diffusion MRI data acquired with different acquisition parameters. NeuroImage 184, 180–200 (2019).
Article Google Scholar
Maier-Hein, K. H. et al. The challenge of mapping the human connectome based on diffusion tractography. Nat. Commun. 8, 1349 (2017).
Article Google Scholar
Jones, D. K., Knösche, T. R. & Turner, R. White matter integrity, fiber count, and other fallacies: the do’s and don’ts of diffusion MRI. NeuroImage 73, 239–254 (2013).
Article Google Scholar
Wasserthal, J., Neher, P. & Maier-Hein, K. H. TractSeg—fast and accurate white matter tract segmentation. NeuroImage 183, 239–253 (2018).
Article Google Scholar
Raffelt, D. et al. Apparent fibre density: a novel measure for the analysis of diffusion-weighted magnetic resonance images. NeuroImage 59, 3976–3994 (2012).
Article Google Scholar
Assaf, Y. & Basser, P. J. Composite hindered and restricted model of diffusion (CHARMED) MR imaging of the human brain. NeuroImage 27, 48–58 (2005).
Article Google Scholar
Jeurissen, B., Tournier, J.-D., Dhollander, T., Connelly, A. & Sijbers, J. Multi-tissue constrained spherical deconvolution for improved analysis of multi-shell diffusion MRI data. NeuroImage 103, 411–426 (2014).
Article Google Scholar
Veraart, J. et al. Denoising of diffusion MRI using random matrix theory. NeuroImage 142, 394–406 (2016).
Article Google Scholar
Vos, S. B. et al. The importance of correcting for signal drift in diffusion MRI. Magn. Reson. Med. 77, 285–299 (2017).
Article Google Scholar
Jenkinson, M., Beckmann, C. F., Behrens, T. E., Woolrich, M. W. & Smith, S. M. FSL. NeuroImage 62, 782–790 (2012).
Article Google Scholar
Glasser, M. F. et al. The minimal preprocessing pipelines for the Human Connectome Project. NeuroImage 80, 105–124 (2013).
Article Google Scholar
Kellner, E., Dhital, B., Kiselev, V. G. & Reisert, M. Gibbs-ringing artifact removal based on local subvoxel-shifts. Magn. Reson. Med. 76, 1574–1581 (2016).
Article Google Scholar
Chawla, N. V., Bowyer, K. W., Hall, L. O. & Kegelmeyer, W. P. SMOTE: synthetic minority over-sampling technique. J. Artif. Intelli. Res.16, 321–357 (2002).
Article MATH Google Scholar
Poldrack, R. A. et al. A phenome-wide examination of neural and cognitive function. Sci. Data 3, 1–12 (2016).
Article Google Scholar
Tournier, J.-D. et al. MRtrix3: a fast, flexible and open software framework for medical image processing and visualisation. NeuroImage 202, 116137 (2019).
Article Google Scholar
Lebel, C. et al. Diffusion tensor imaging of white matter tract evolution over the lifespan. NeuroImage 60, 340–352 (2012).
Article Google Scholar
Chamberland, M. chamberm/Detect: Detect (Zenodo, 2021); https://doi.org/10.5281/zenodo.4945138

Download references

Acknowledgements

This work was supported by a Wellcome Trust Investigator Award (grant no. 096646/Z/11/Z), a Wellcome Trust Strategic Award (grant no. 104943/Z/14/Z), and an EPSRC equipment grant (no. EP/M029778/1) to D.K.J., a Sir Henry Wellcome Fellowship (grant no. 215944/Z/19/Z) and VENI grant (no. 17331) from the Dutch Research Council (NWO) to C.M.W.T., a Wellcome Trust GW4-CAT Fellowship (grant no. 220537/Z/20/Z) to D.S., a Wellcome Trust Fellowship (grant no. 102003/Z/13/Z) to J.D. and an NIH NICDH fellowship (grant no. 1F32HD103313-01) to EPR. This study is also supported by the Brain Repair and Intracranial Neurotherapeutics (BRAIN) Unit, funded by Welsh Government through Health and Care Research Wales. M.C. was in part supported by the Radboud Excellence Initiative Fellowship. The authors thank M. Descoteaux and J.-C. Houde (Sherbrooke Connectivity Imaging Laboratory) for their useful discussions and code sharing.

Author information

Authors and Affiliations

Cardiff University Brain Research Imaging Centre (CUBRIC), School of Psychology, Cardiff University, Cardiff, UK
Maxime Chamberland, Sila Genc, Chantal M. W. Tax, Dmitri Shastin, Kristin Koller, Erika P. Raven, Joanne Doherty, Greg D. Parker, Khalid Hamandi, William P. Gray & Derek K. Jones
Donders Institute for Brain, Cognition and Behavior, Radboud University, Nijmegen, the Netherlands
Maxime Chamberland
Image Sciences Institute, University Medical Center Utrecht, Utrecht, the Netherlands
Chantal M. W. Tax
Department of Neuroscience, University Hospital of Wales (UHW), Cardiff, UK
Dmitri Shastin, Khalid Hamandi & William P. Gray
Bernard and Irene Schwartz Center for Biomedical Imaging, Department of Radiology, New York, NY, USA
Erika P. Raven
Medical Research Council Centre for Neuropsychiatric Genetics and Genomics, Division of Psychological Medicine and Clinical Neurosciences, Cardiff University, Cardiff, UK
Adam Cunningham, Joanne Doherty & Marianne B. M. van den Bree
Brain Repair and Intracranial Neurotherapeutics (BRAIN) Unit, School of Medicine, Cardiff University, Cardiff, UK
Khalid Hamandi & William P. Gray

Authors

Maxime Chamberland
View author publications
You can also search for this author in PubMed Google Scholar
Sila Genc
View author publications
You can also search for this author in PubMed Google Scholar
Chantal M. W. Tax
View author publications
You can also search for this author in PubMed Google Scholar
Dmitri Shastin
View author publications
You can also search for this author in PubMed Google Scholar
Kristin Koller
View author publications
You can also search for this author in PubMed Google Scholar
Erika P. Raven
View author publications
You can also search for this author in PubMed Google Scholar
Adam Cunningham
View author publications
You can also search for this author in PubMed Google Scholar
Joanne Doherty
View author publications
You can also search for this author in PubMed Google Scholar
Marianne B. M. van den Bree
View author publications
You can also search for this author in PubMed Google Scholar
Greg D. Parker
View author publications
You can also search for this author in PubMed Google Scholar
Khalid Hamandi
View author publications
You can also search for this author in PubMed Google Scholar
William P. Gray
View author publications
You can also search for this author in PubMed Google Scholar
Derek K. Jones
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M.C. developed the framework, conducted the experiments, analyzed the results, and wrote the manuscript, M.C., C.M.W.T. and D.K.J. conceptualized the project, A.C., J.D. and M.v.d.B. recruited the CNV patients, S.G. and E.P.R. acquired the pediatric data, M.C. and S.G. pre-processed the pediatric imaging data, G.D.P. and C.M.W.T. developed the preprocessing pipeline, W.P.G., D.S. and K.H procured the epilepsy data, M.C., C.M.W.T. and D.S. acquired and pre-processed the cdMRI and epilepsy data, K.K. acquired the MICRA repeatability data. All authors reviewed and contributed to the manuscript.

Corresponding author

Correspondence to Maxime Chamberland.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information Nature Computational Science thanks Laurent Petit, Daniel C. Alexander and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.

Editor recognition statement Handling editor: Ananya Rastogi, in collaboration with the Nature Computational Science team.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Supplementary Figs. 1–4, and Tables 1 and 2.

Reporting Summary

Source data

Source Data Fig. 2

Anomaly scores for the CNV dataset.

Source Data Fig. 3

Tract profiles for the CNV dataset.

Source Data Fig. 4

Tract profiles for the FCD dataset.

Source Data Fig. 5

Tract profiles for the FCD dataset.

Source Data Fig. 6

Anomaly scores for the CNP dataset.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Chamberland, M., Genc, S., Tax, C.M.W. et al. Detecting microstructural deviations in individuals with deep diffusion MRI tractometry. Nat Comput Sci 1, 598–606 (2021). https://doi.org/10.1038/s43588-021-00126-8

Download citation

Received: 18 February 2021
Accepted: 09 August 2021
Published: 22 September 2021
Issue Date: September 2021
DOI: https://doi.org/10.1038/s43588-021-00126-8

This article is cited by

Detecting abnormal cell behaviors from dry mass time series
- Romain Bailly
- Marielle Malfante
- Jérôme Mars
Scientific Reports (2024)
Radiomic tractometry reveals tract-specific imaging biomarkers in white matter
- Peter Neher
- Dusan Hirjak
- Klaus Maier-Hein
Nature Communications (2024)
Convolutional neural network-based classification of glaucoma using optic radiation tissue properties
- John Kruper
- Adam Richie-Halford
- Yalin Zheng
Communications Medicine (2024)
Scientific discovery in the age of artificial intelligence
- Hanchen Wang
- Tianfan Fu
- Marinka Zitnik
Nature (2023)
Efficiently pruning brain connectomes
- Xi-Nian Zuo
Nature Computational Science (2022)

Subjects

Abstract

Similar content being viewed by others

Main

Results

Framework overview

White-matter anomaly detection in CNV participants

Discriminating power

Tract-specific deviations

White-matter anomaly detection in epilepsy

Linking brain heterogeneity with epidemiological findings in schizophrenia

Repeatability of anomaly scores and tract profiles

Discussion

Methods

Detect interactive interface

Data acquisition and preprocessing

CNV dataset

Epilepsy dataset

SCHZ dataset

Repeatability dataset

Tractometry

Artificial neural network

Univariate approach

Multivariate linear approach

Support vector machine comparison

Reporting Summary

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Supplementary information

Source data

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Quick links