Introduction

Biological sex (throughout this manuscript, sex, male and female refer to biological sex assigned at birth) is a crucial variable in neuroscience research. The National Institutes of Health require all preclinical and human subject studies to account for biological variables, including sex, in the research plan. An understanding of sex differences is particularly important because sex has been shown to relate to a wide range of cognitive functions such as motor cognitive performance1,2,3, nonverbal reasoning3, working memory4,5,6 and episodic memory7,8,9. In addition, the prevalence of several neurological and neuropsychiatric disorders differs between males and females. For example, autism spectrum disorder and Tourette syndrome are more prevalent in males10,11, while disorders such as multiple sclerosis and depression are more prevalent in females12,13.

Recent advances in MRI have enabled precise, noninvasive measurement of the brain. To date, most MRI studies have documented structural differences between the sexes in terms of gross brain volume14,15 and cortical thickness16, though inconsistencies exist across reports17,18,19. Perhaps of greater interest is whether differences exist between males and females in the cellular-level organization of the brain. A better understanding of sex differences in brain microstructure would inform how biological sex influences brain health and disease. Indeed, cellular-level microstructure is known to be informative in various brain studies, including brain development, aging, and neurological diseases such as demyelination, tumor infiltration, and dementia20,21,22,23. Some studies of ex vivo samples from animal models have examined cellular features such as the density of microglia24,25; however, a true picture of the brain's cellular structure in vivo remains elusive, and ex vivo studies are limited by fixation and preparation artifacts that invariably alter the cellular matrix.

Diffusion MRI can capture microscopic features of the brain noninvasively26 and is actively used to study various neurological diseases, ranging from neurodegenerative disorders such as Alzheimer's dementia27,28 and Parkinson's disease29 to autoimmune disorders such as multiple sclerosis30. Previous studies have found evidence of sex differences in diffusion parameters. For example, across different age ranges, analyses have indicated sex differences in fractional anisotropy (FA)16,31,32,34,35,62,63, orientation dispersion16,62,63, mean diffusivity (MD)32,35,63, and axial diffusivity (AD) and radial diffusivity (RD)32,63. Sex differences have been reported in various white matter regions, including the thalamic radiation16,32, cerebellum31, superior longitudinal fasciculus31,32, corpus callosum31,32,63, and corona radiata32. However, most of these studies rely on conventional statistical analysis of selected regions-of-interest16,31,32,62,63. The results have been criticized because rigorous correction for multiple comparisons appears to diminish the power of such differences, which has raised debate and motivated the need for new approaches to study sex differences in the brain33.

Recent advances in deep neural networks provide powerful methods to capture sex differences in brain microstructure. Two recent works using neural networks with hand-crafted features derived from structural connectivity indicated sex-related differences34,35; however, the use of complex hand-crafted features has been criticized as cumbersome, prone to introducing bias, and limited in reproducibility. Moreover, because different deep neural network architectures are likely biased toward different types of features46,47, it is challenging for studies based on a single model, or on similar models, to capture the full picture. Inclusion of multiple distinct model architectures is therefore necessary.

In this work, we design a comprehensive, rigorous learning-based approach aimed at contributing new evidence and insight to the debate on whether sex-related differences exist in human brain microstructure. We hypothesize that sex differences exist in brain microstructure as various types of features, including local features and global interactions. We accomplish this by leveraging end-to-end deep neural networks and diffusion MRI, which can tap into in vivo brain microstructure. The end-to-end design obviates the need for complex a priori choices, such as ROI selection or hand-crafted feature engineering, that could bias the analysis. We register all subjects' brains to a standard template space to remove the potential influence of overall brain size and volume. In addition, we explore 3 major, popular network architectures that capture different and complementary information, so this work does not rely on a single model choice or model type. Finally, we attempt to identify the WM areas that contribute most significantly to sex classification and thereby embed the most sex-related differences. Rather than building a sex classifier per se, the goal of this work is to provide new evidence and insights regarding sex-related differences in brain tissue microstructure.

Materials and methods

Study population

The study includes 1031 healthy adult subjects (age range, 22–37 years) from the Human Connectome Project (HCP Young Adult dataset)36. Sex labels were collected through self-report, and no subject's self-reported sex differed from genetic sex. Institutional review board approval and participants' informed consent were obtained at the participating institutions. Demographic details are summarized in Table 1.

Table 1 Study cohort.

Diffusion MRI

Diffusion MR images were collected on a 3T scanner (Connectome Skyra, Siemens Medical Solutions, Erlangen, Germany) and preprocessed per the HCP protocol36,37. In brief, diffusion imaging was performed with the following parameters: 3 b-values (1000, 2000, 3000 s/mm2), 90 diffusion orientations per shell, 18 b0 (b-value = 0) images, 1.25 mm isotropic resolution, field of view = 210 mm, number of slices = 111, and TR/TE = 5520/89.5 ms. Each scan was repeated along 2 phase-encoding directions (RL/LR); further details can be found in the HCP documentation36. The diffusion data were preprocessed by the HCP to correct artifacts such as motion and eddy-current artifacts37. We use an in-house image processing tool38 to generate the diffusion metrics fractional anisotropy (FA), mean diffusivity (MD) and mean kurtosis (MK) to assess white matter microstructure. FA and MD are included because they are the two most commonly used diffusion metrics for characterizing tissue microstructure39. FA measures the directionality of water movement in brain tissue and is known to be sensitive to microstructures such as axons and myelin40; MD measures mean water diffusivity and is sensitive to characteristics such as cellularity41. We also include MK, derived from diffusion kurtosis imaging (DKI), to compactly represent the non-Gaussian behavior of water molecules as a measure of tissue complexity42. All metrics are registered to the FA template in MNI space43 using the FMRIB Software Library (FSL)44 so as to remove effects of macroscopic anatomical differences such as the size and contour of the brain itself.
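For illustration, FA and MD have simple closed-form definitions in terms of the fitted diffusion tensor eigenvalues; the short NumPy sketch below shows these definitions (the paper itself uses an in-house tool38, so the function name and example eigenvalues here are ours):

```python
import numpy as np

def fa_md_from_eigenvalues(evals):
    """FA and MD from diffusion tensor eigenvalues (shape (..., 3), mm^2/s).
    MD is the eigenvalue mean; FA is the normalized eigenvalue dispersion."""
    evals = np.asarray(evals, dtype=float)
    md = evals.mean(axis=-1, keepdims=True)            # mean diffusivity
    dev = evals - md                                   # deviation from the mean
    fa = np.sqrt(1.5 * (dev ** 2).sum(axis=-1)
                 / np.clip((evals ** 2).sum(axis=-1), 1e-12, None))
    return fa, md[..., 0]

# example: a prolate tensor typical of coherent white matter
fa, md = fa_md_from_eigenvalues([1.7e-3, 0.3e-3, 0.3e-3])
print(round(float(fa), 2), float(md))                  # ~0.80, ~7.7e-4 mm^2/s
```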

End-to-end classification models

This study employs three major model architectures: a 2D convolutional neural network (CNN)50, a 3D CNN53,54,55, and a 3D vision transformer (ViT)47,58. We choose end-to-end deep networks that act on the entire image volume to avoid any reliance on hand-crafted features or complicated feature engineering. In general, CNNs and ViTs show state-of-the-art performance broadly across image classification tasks. The two architectures have their own strengths and may be complementary: a CNN has inductive bias by design, such as locality and translation equivariance (or invariance, with pooling), making it generally more sample-efficient and, in theory, better able to capture local features of an image or volume45. A ViT lacks the inductive bias of convolutional layers, rendering it somewhat more data-hungry, but it can capture long-range interactions and more global features in an image or imaging volume46,47, which could be important for capturing potential sex differences that exist as long-range interactions. Although a 3D CNN may be the intuitive choice for a 3D imaging volume, it requires more parameters and more training samples than a 2D CNN. Thus, we also test the performance of a 2D CNN with a lighter feature-extraction backbone and greater training efficiency.

2D convolutional neural network

In this work, we use ResNet1850 as the 2D CNN backbone for feature extraction. The 2D network receives input from a small 3-slice subvolume, as ResNet18 is designed to receive color images with 3 channels (RGB). We extract features from every 3 consecutive slices and combine the features from all non-overlapping 3-slice subvolumes as input to the prediction head for classification (Fig. 1). Specifically, given input volumetric data of shape \(S \times H \times W\) (S: number of slices, H × W: slice size, with each slice in the sagittal view), we generate \(S/3\) 2D 3-channel images, each of shape \(3 \times H \times W\). The same ResNet18 is applied to extract features from each 3-slice subvolume, and the features from all \(S/3\) subvolumes are concatenated as the input to a linear prediction head. The ResNet18 architecture is shown at the bottom of Fig. 1. The input is fed to a convolutional layer (conv layer; kernel-size = 7 × 7, stride = 2, number of channels/feature maps = 64), followed by a max-pooling layer for further downsampling (kernel-size = 3 × 3, stride = 2). After the pooling, 8 residual blocks (in which the input to the block is added to the output via a residual shortcut connection) are applied; each block contains 2 conv layers with kernel-size = 3 × 3, and the channel number is doubled and the spatial size downsampled by 2 at the first conv layer of the 3rd, 5th, and 7th residual blocks. Each conv layer is followed by batch normalization51 and ReLU activation52. At the end of ResNet18, global average pooling reduces each feature map to a single value, yielding 512 features per 3-slice subvolume. Given \(S \times H \times W = 180 \times 224 \times 224\), we have \(S/3 = 60\) 3-slice subvolumes, each yielding 512 features. These 60 × 512 features are concatenated and fed to the final prediction layer, a fully-connected layer mapping the 60 × 512 features to the predicted class score.
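As a concrete illustration of this design, a minimal PyTorch sketch might look as follows (the class name and the 2-class output are our assumptions; the paper's exact implementation details are as described above and in the supplementary information):

```python
import torch
import torch.nn as nn
from torchvision.models import resnet18

class Slice2DCNN(nn.Module):
    """Shared ResNet18 over non-overlapping 3-slice subvolumes; features
    from all subvolumes are concatenated for a single linear prediction."""
    def __init__(self, num_slices=180, num_classes=2):
        super().__init__()
        backbone = resnet18(weights=None)
        backbone.fc = nn.Identity()               # keep the 512-d pooled features
        self.backbone = backbone
        self.num_subvols = num_slices // 3        # e.g. 180 / 3 = 60
        self.head = nn.Linear(self.num_subvols * 512, num_classes)

    def forward(self, x):                         # x: (B, S, H, W), sagittal slices
        b, s, h, w = x.shape
        x = x.reshape(b * (s // 3), 3, h, w)      # each 3-slice group as an RGB image
        feats = self.backbone(x)                  # (B * S/3, 512)
        return self.head(feats.reshape(b, -1))    # concatenate, then classify

logits = Slice2DCNN()(torch.randn(1, 180, 224, 224))  # one registered FA volume
```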

Figure 1

Our 2D CNN model. At the top of the figure, the imaging volume is divided into subvolumes, and a shared ResNet18 extracts 512 features from each subvolume. The features are concatenated and fed to a linear layer for the final prediction. The bottom of the figure shows the ResNet18 architecture (residual connections, ReLU activations, and batch normalization are omitted for simplicity): the input is first fed into a convolutional layer (kernel-size = 7 × 7, stride = 2, channels = 64) followed by a max-pooling layer (kernel-size = 3 × 3, stride = 2); subsequently, 8 residual blocks are applied, each containing 2 convolutional layers. Residual block parameters: conv layers in blocks 1, 2 have kernel-size = 3 × 3 and channels = 64; blocks 3, 4 have kernel-size = 3 × 3 and channels = 128; blocks 5, 6 have kernel-size = 3 × 3 and channels = 256; blocks 7, 8 have kernel-size = 3 × 3 and channels = 512; stride = 2 is applied at the first conv layer of blocks 3, 5, and 7. Global average pooling is applied at the end. The classification head is a single fully-connected layer.

3D convolutional neural network

We employ 3D ResNet-1053,54,55 as our 3D CNN backbone, with the architecture shown in Fig. 2. The 3D volume is first fed into a conv layer (kernel-size = 7 × 7 × 7, stride = 2, channels = 64) followed by a max-pooling layer (kernel-size = 3 × 3 × 3, stride = 2). Eight residual blocks are then applied, each containing 1 conv layer. The channel number is doubled at residual blocks 3, 5, and 7, with stride = 2 for block 3, dilation = 2 for block 5, and dilation = 4 for block 7. Each conv layer is followed by group normalization56 and ReLU activation52. Finally, global average pooling maps the 512 feature maps to 512 feature values, and one linear layer produces the final prediction: a fully-connected layer mapping the 512 features to the predicted class score.
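A minimal PyTorch sketch of one such residual block, following the block parameters listed above, could look like this (the class name and the choice of 8 groups for group normalization are our assumptions):

```python
import torch
import torch.nn as nn

class ResBlock3D(nn.Module):
    """One residual block of the 3D ResNet-10 variant described above:
    a single 3x3x3 conv with group normalization and ReLU, plus a shortcut."""
    def __init__(self, cin, cout, stride=1, dilation=1):
        super().__init__()
        self.conv = nn.Conv3d(cin, cout, kernel_size=3, stride=stride,
                              padding=dilation, dilation=dilation, bias=False)
        self.norm = nn.GroupNorm(num_groups=8, num_channels=cout)
        # 1x1x1 projection when the shape changes, identity otherwise
        self.proj = (nn.Conv3d(cin, cout, 1, stride=stride, bias=False)
                     if (cin != cout or stride != 1) else nn.Identity())

    def forward(self, x):
        return torch.relu(self.norm(self.conv(x)) + self.proj(x))

# the 8 blocks: channels double at blocks 3, 5, 7; stride = 2 at block 3,
# dilation = 2 at block 5 and dilation = 4 at block 7
blocks = nn.Sequential(
    ResBlock3D(64, 64), ResBlock3D(64, 64),
    ResBlock3D(64, 128, stride=2), ResBlock3D(128, 128),
    ResBlock3D(128, 256, dilation=2), ResBlock3D(256, 256),
    ResBlock3D(256, 512, dilation=4), ResBlock3D(512, 512),
)
```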

Figure 2

Our 3D CNN model based on ResNet10 (residual connections, ReLU activations, and group normalization omitted for simplicity). The 3D volume is first fed to a conv layer (kernel-size = 7 × 7 × 7, stride = 2, channels = 64) followed by a max-pooling layer (kernel-size = 3 × 3 × 3, stride = 2). Subsequently, 8 residual blocks are applied, each containing 1 conv layer. Residual block parameters: blocks 1, 2 have kernel-size = 3 × 3 × 3 and channels = 64; blocks 3, 4 have kernel-size = 3 × 3 × 3 and channels = 128; blocks 5, 6 have kernel-size = 3 × 3 × 3 and channels = 256; blocks 7, 8 have kernel-size = 3 × 3 × 3 and channels = 512; stride = 2 is used at the conv layer in block 3, dilation = 2 at the conv layer in block 5, and dilation = 4 at the conv layer in block 7. Global average pooling is applied at the end. The classification head is a single fully-connected layer.

Vision transformer for 3D input pretrained with masked autoencoders

The original 2D ViT47 is extended to extract features from a 3D volume. Given an input 3D diffusion metric \(x \in \mathbb{R}^{S \times H \times W}\), the data is reshaped into a sequence of flattened non-overlapping 3D patches \(x_{p} \in \mathbb{R}^{N \times (s \cdot h \cdot w)}\), where \((S, H, W)\) is the 3D volume size, \((s, h, w)\) is the 3D patch size, and the number of patches is \(N = SHW/shw\). As shown in Fig. 3, for each 3D patch a linear layer maps the voxel values to a latent embedding of dimension \(D\). A learnable positional embedding of the same dimension \(D\), representing each token's location, is added to the patch embedding. The resulting sequence of embeddings for all \(N\) patches is fed to the encoder, which consists of \(L\) alternating layers of multi-head attention and multi-layer perceptron (MLP) blocks. A classification token of dimension \(D\), designed as a latent representation of the entire input sample, is appended to the input embedding sequence. The output embedding of the classification token is then fed into a linear prediction head to generate a prediction. In our study, \(S \times H \times W = 182 \times 224 \times 224\), \(s \times h \times w = 6 \times 16 \times 16\), \(D = 384\), \(L = 12\), and the classification head is a fully-connected layer mapping the 384 features to the predicted class score.
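A compact PyTorch sketch of this 3D ViT classifier is given below. A strided Conv3d is a standard way to implement patch flattening plus linear projection; the class name, nhead = 6, GELU activation, the 2-class output, and the 180-slice volume size (chosen so each dimension divides evenly by the patch size) are our assumptions:

```python
import torch
import torch.nn as nn

class ViT3D(nn.Module):
    """Sketch of the 3D ViT: patchify, add positional + class tokens,
    run L transformer layers, classify from the class-token output."""
    def __init__(self, vol=(180, 224, 224), patch=(6, 16, 16),
                 dim=384, depth=12, nhead=6, num_classes=2):
        super().__init__()
        n = (vol[0] // patch[0]) * (vol[1] // patch[1]) * (vol[2] // patch[2])
        # strided conv = flatten each 3D patch and apply the linear embedding
        self.embed = nn.Conv3d(1, dim, kernel_size=patch, stride=patch)
        self.cls = nn.Parameter(torch.zeros(1, 1, dim))
        self.pos = nn.Parameter(torch.zeros(1, n + 1, dim))
        layer = nn.TransformerEncoderLayer(d_model=dim, nhead=nhead,
                                           dim_feedforward=4 * dim,
                                           activation='gelu',
                                           batch_first=True, norm_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=depth)
        self.head = nn.Linear(dim, num_classes)

    def forward(self, x):                               # x: (B, 1, S, H, W)
        tok = self.embed(x).flatten(2).transpose(1, 2)  # (B, N, D)
        tok = torch.cat([self.cls.expand(len(x), -1, -1), tok], dim=1)
        z = self.encoder(tok + self.pos)
        return self.head(z[:, 0])                       # class-token output
```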

Figure 3

Vision transformer for diffusion MRI sex classification: the input imaging volume is partitioned into non-overlapping patches. Each patch is projected to a patch embedding by a linear patch-embedding layer, and a positional embedding representing the position of the patch is added. A classification token is appended to the sequence of tokens to learn a representation of the entire input sample. The structure of the transformer encoder, shown on the right, consists of L alternating layers of multi-head attention and multi-layer perceptron (MLP) blocks. After the transformer encoder, the output corresponding to the classification token is fed to the classification head to generate the prediction; in our study, a single fully-connected layer is used as the classification head. The pretraining of the ViT is shown in the lower part of the figure, with details included in the supplementary information. The ViT encoder from pretraining is used as the transformer-encoder backbone for the sex classifier shown in the upper part of the figure.

We pretrain the ViT with a 2D + 3D masked autoencoder (MAE) modified from the 2D MAE57: a specified ratio of patches, \(r\), is randomly masked, and a ViT encoder and auxiliary decoder are trained to predict the values of the \(r \times N\) masked patches from the \((1 - r) \times N\) unmasked patches. After pretraining, the encoder is finetuned for the target sex classification task with all \(N\) patches fed into it. Since 3D patches are more difficult to predict than 2D patches (especially given the small number of available 3D volumes), we first pretrain a 2D ViT encoder with MAE on 2D slices, use the resulting weights to initialize our 3D ViT model, and then further pretrain the model with MAE on 3D volumes. In our study, the mask ratio is \(r = 0.75\) and the auxiliary decoder has \(D = 192\), \(L = 4\). Details of the pretraining can be found in the supplementary information.
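The core of MAE pretraining is the random masking step; a minimal sketch in the style of the public MAE reference code is shown below (the function name is ours; the full pipeline, including the decoder that reconstructs the masked patches, is described in the supplementary information):

```python
import torch

def random_mask(tokens, ratio=0.75):
    """Keep a random (1 - ratio) subset of patch tokens, as in MAE:
    the encoder sees only the visible tokens during pretraining."""
    b, n, d = tokens.shape
    keep = int(n * (1 - ratio))
    ids = torch.rand(b, n, device=tokens.device).argsort(dim=1)  # random permutation
    visible = torch.gather(tokens, 1,
                           ids[:, :keep, None].expand(-1, -1, d))
    # ids.argsort(1) would let the decoder restore the original patch order
    return visible, ids
```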

Model training and evaluation

The 1031 unique subjects are split into training (831 subjects), validation (100 subjects) and test (100 subjects) sets. The training, validation and test sets share the same sex and age distribution, with a relatively balanced female-to-male ratio of 27:23. Model hyperparameters are tuned based on performance on the validation set; models are then trained on the training set and evaluated on the test set for the final prediction results. Classifiers are implemented in PyTorch, and, for fair comparison, all three models use the same training/validation/test split. For the ViT, we conduct three experiments: ViT trained from scratch without MAE pretraining; linear probing, in which the encoder is frozen with weights from MAE pretraining and only the linear prediction head is trained for sex classification; and finetuned ViT, in which the whole model is refined on sex labels. The performance of linear probing reflects how well the features learned from pretraining generalize to the sex classification task, while the model trained from scratch serves as a baseline to examine whether pretraining improves performance. Details of the training are described in the supplementary information.
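For instance, linear probing can be implemented by freezing the pretrained encoder and optimizing only the prediction head (a sketch; vit refers to a pretrained classifier such as the ViT3D sketch above, and the optimizer choice and learning rate are illustrative):

```python
import torch

# freeze everything, then re-enable gradients for the prediction head only
for p in vit.parameters():
    p.requires_grad = False
for p in vit.head.parameters():
    p.requires_grad = True

optimizer = torch.optim.AdamW(vit.head.parameters(), lr=1e-3)
criterion = torch.nn.CrossEntropyLoss()
```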

Occlusion analysis and Wilcoxon signed rank test

We conduct occlusion analysis on the trained models and apply the Wilcoxon signed-rank test to identify white matter areas of the brain that contribute significantly to sex classification. Occlusion is conducted at the region level over the 48 white matter regions defined by the Johns Hopkins University ICBM-labels-1 mm atlas43. Given a trained model for a diffusion metric, we compare the predicted probability for the correct label before and after occluding each region in succession, where occlusion sets all voxels in the region to the mean white matter value. We apply the Wilcoxon signed-rank test with a one-sided alternative hypothesis to the probability changes associated with each region across all subjects in the test set, testing whether the decrease in predicted probability for the correct label is statistically significant. Regions with p-value < 0.05 are considered significant for distinguishing between males and females.
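Concretely, the procedure could be sketched as follows (a minimal illustration assuming a two-class model output; the function and argument names are ours, with region_masks holding one boolean mask per JHU atlas region):

```python
import torch
from scipy.stats import wilcoxon

def occlusion_pvalues(model, volumes, labels, region_masks, wm_mean):
    """Region-level occlusion: occlude each atlas region with the mean WM
    value and test whether the correct-class probability drops significantly."""
    model.eval()
    drops = {name: [] for name in region_masks}
    with torch.no_grad():
        for x, y in zip(volumes, labels):          # x: one subject's metric volume
            p0 = model(x[None]).softmax(-1)[0, y]  # baseline probability
            for name, mask in region_masks.items():
                xo = x.clone()
                xo[mask] = wm_mean                 # occlude one WM region
                p1 = model(xo[None]).softmax(-1)[0, y]
                drops[name].append((p0 - p1).item())
    # one-sided Wilcoxon signed-rank test: is the median probability drop > 0?
    return {name: wilcoxon(d, alternative='greater').pvalue
            for name, d in drops.items()}

# regions with p < 0.05 are flagged as significant, as in Table 3
```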

Ethics approval

This study was conducted in compliance with the Health Insurance Portability and Accountability Act and approved by the institutional review board.

Results

Classification results

We use the area under the curve (AUC) of each trained model on the test set to evaluate model performance; accuracy, precision and recall are also reported. Table 2 shows that our 2D CNN, 3D CNN and ViT (finetuned and linear probing) models all achieved promising performance, with test AUC > 0.9 for all 3 diffusion metrics. For FA and MD, the 2D CNN achieved the highest AUC (0.98 for FA and 0.97 for MD), while the 3D CNN and ViT also achieved relatively high AUC (> 0.92). For MK, all models achieved AUC above 0.96, with the 3D CNN highest at 0.98. The ViT trained from scratch yielded low AUC (< 0.8) for all diffusion metrics. The finetuned and linear probing ViTs achieved comparable AUC on all 3 diffusion metrics, indicating that the MAE-pretrained feature extractor is directly applicable to the classification task.

Table 2 Performance (test AUC/Accuracy/Precision/Recall) of sex classification models using three different diffusion MRI parametric maps as inputs (FA, MD, and MK).

Occlusion analysis

The 2D and 3D CNNs and the finetuned ViT are included in the occlusion analysis. The finetuned ViT is selected over the linear probing model, despite their similar performance, because the finetuned model is refined on the sex classification task. The numbers of regions passing the significance test are summarized in Table 3, and the identified regions are illustrated in Figs. 4, 5, and 6.

Table 3 Number of white matter regions showing significant differences between males and females in the occlusion analysis; 48 WM regions in total.
Figure 4

WM regions with significant (p < 0.05) impact on classification probability based on occlusion analysis for FA; numbered labels based on JHU-ICBM-1 mm atlas (https://identifiers.org/neurovault.image:1401); 1: middle cerebellar peduncle, 2: pontine crossing tract (a part of middle cerebellar peduncle), 3: genu of corpus callosum, 4: body of corpus callosum, 5: splenium of corpus callosum, 7: corticospinal tract (right), 9: medial lemniscus (right), 10: medial lemniscus (left), 14: superior cerebellar peduncle (left), 17: anterior limb of internal capsule (right), 20: posterior limb of internal capsule (left), 35: cingulum (cingulate gyrus) (right), 37: cingulum (hippocampus) (right), 40: stria terminalis (left), 48: tapetum (left).

Figure 5

WM regions in selected slices with significant (p < 0.05) impact on classification probability based on occlusion analysis for MD; numbered labels based on JHU-ICBM-1 mm atlas (https://identifiers.org/neurovault.image:1401); 1: middle cerebellar peduncle, 2: pontine crossing tract (a part of middle cerebellar peduncle), 3: genu of corpus callosum, 4: body of corpus callosum, 5: splenium of corpus callosum, 7: corticospinal tract (right), 13: superior cerebellar peduncle (right), 14: superior cerebellar peduncle (left), 15: cerebral peduncle (right), 17: anterior limb of internal capsule (right), 18: anterior limb of internal capsule (left), 19: posterior limb of internal capsule (right), 20: posterior limb of internal capsule (left), 22: retrolenticular part of internal capsule (left), 25: superior corona radiata (right), 26: superior corona radiata (left), 27: posterior corona radiata (right), 28: posterior corona radiata (left), 31: sagittal stratum (right), 35: cingulum (cingulate gyrus) (right), 36: cingulum (cingulate gyrus) (left), 37: cingulum (hippocampus) (right), 39: stria terminalis (right), 40: stria terminalis (left), 42: superior longitudinal fasciculus (left), 43: superior fronto-occipital fasciculus (right).

Figure 6

WM regions in selected slices with significant (p < 0.05) impact on classification probability based on occlusion analysis for MK; numbered labels based on JHU-ICBM-1 mm atlas (https://identifiers.org/neurovault.image:1401); 1: middle cerebellar peduncle, 2: pontine crossing tract (a part of middle cerebellar peduncle), 4: body of corpus callosum, 5: splenium of corpus callosum, 6: fornix (column and body of fornix), 26: superior corona radiata (left), 37: cingulum (hippocampus) (right), 38: cingulum (hippocampus) (left), 46: uncinate fasciculus (left).

Discussion

The study provides new evidence of clear sex-related differences in white matter microstructure as captured by diffusion MRI, detected consistently across 3 different end-to-end, deep learning-based image classification models. The reliability of this finding is supported by the high classification performance (test AUC 0.92–0.98) observed across 3 major network architecture types, independent of model choice and without introducing the biases of complex hand-crafted features or manual operations used in previous work. In addition, the white matter regions most influential in the models' decisions are identified, showing the location and distribution of the greatest microstructural differences.

Use of three different model architectures intentionally allows us to leverage the different strengths of each network family. For example, the 3D CNN incorporates a conventional 3D CNN backbone: while it powerfully captures local features within the volume, a recent study showed that even very deep CNNs with large numbers of parameters still have small effective receptive fields46, meaning they rely primarily on learning local features rather than longer-distance relationships. ViTs, on the other hand, capture global features more readily47, and the MAE pretraining task used here also focuses the model heavily on inter-patch correlations that span some distance across the volume. The study finds that both the 3D CNN and ViT models performed very well, suggesting that both short-distance and long-distance interactions exhibit sex-related differences in white matter microstructure.

The 2D CNN achieved the best overall performance for 2 of the 3 diffusion metrics studied. We attribute this to a design that may allow it to simultaneously capture both local features and global interactions (across all slices), letting it leverage both types of features in the classification task. Specifically, the ResNet18 extracts features from every group of 3 consecutive slices, allowing the model to learn within-slice features and short-range inter-slice features across the 3 slices; by then concatenating features across all 3-slice subvolumes (rather than averaging across them, as is most common), the model preserves local features from every 3-slice partition while the prediction head learns more global interactions across subvolumes. It is also possible that the simplicity of the 2D CNN (it has the fewest parameters, as shown in Table 4) helps its generalization capability, although the differences in test performance were nominal across all three models, suggesting that they are all comparably generalizable.

Table 4 The number of parameters for each of the three model architectures.

The occlusion results show general consistency across models and diffusion metrics, implicating central white matter tracts and ventral/dorsal hindbrain tracts in sex-related differences, though results differ slightly between metrics and models. The number and fractional volume of WM regions significantly contributing to sex classification were highest for the 2D CNN (mean number of regions: 15; mean fractional volume: 0.79) compared with the 3D CNN (mean number of regions: 2; mean fractional volume: 0.24) and ViT (mean number of regions: 7; mean fractional volume: 0.16), possibly again reflecting differences in the relative facility of these models to tap short-range interactions, long-range interactions, or both. Across the three diffusion metrics, the 3D CNN classifier focused consistently on large central white matter structures such as the middle cerebellar peduncle and corpus callosum, whereas the ViT and 2D CNN models tended to rely on a greater diversity of white matter regions. Another observation is that the corpus callosum was found to be important across all network architecture types and diffusion metrics. As sex-related regional brain structure differences have been particularly controversial17,18,19, our work provides new evidence that sex differences do in fact exist in focal regions such as the corpus callosum.

Of note, pretraining with MAE was important for the ViT. The ViT is a data-hungry architecture that is difficult to train on a limited dataset, since it lacks inductive biases such as the locality and translation invariance of CNNs. The MAE pretraining task (predicting masked patches from visible patches) enables the model to learn inter-patch interactions without supervision from data labels, and the random masking itself introduces data diversity during pretraining, further improving the generalizability of the learned features. The benefit of MAE pretraining is clearly demonstrated in the experimental results: without pretraining, the ViT trained from scratch yielded much lower performance, with test AUC < 0.80. The improvement of each of the other models over the ViT trained from scratch is statistically significant (p < 0.05, Wilcoxon signed-rank test comparing the predicted probability of the correct class). With MAE pretraining, the ViT encoder achieved test AUC 0.94–0.96. End-to-end supervised finetuning brought no additional gain over linear probing, confirming that the training set is too small to tune a data-hungry ViT with supervised end-to-end training.

Limitations include the use of only three representative diffusion metrics, though these were chosen because they are the most common and are easily obtained with a well-established diffusion kurtosis imaging acquisition. Further exploration using modeled diffusion metrics26 and neurite orientation dispersion and density imaging (NODDI)60 may yield additional information about sex-related differences in tissue microstructure and help further characterize the underlying biophysical differences between the brains of males and females. Additionally, our study is based on DWI with moderate b-values; future studies could include datasets with high b-values, which are more sensitive to restricted diffusion61. Recognizing that the age distribution differs between the female and male cohorts (the female group skews older; Table 1), we separately evaluated model accuracy on three subgroups broken down by age and found performance to be comparable across all age groups. For the narrow band of ages 26–30, where aging changes are likely to have little influence, our models continue to achieve high sex classification accuracy. In future work, combined with additional diffusion metrics such as modeled diffusion metrics26 and NODDI60, the study can be extended to examine sex differences in age ranges beyond young adulthood, shedding light on how sex differences progress across the life span. The occlusion analysis used the standard JHU-ICBM-1 mm atlas for white matter parcellation, which has sizable variation in region size and could potentially bias regional importance; any effect of region size on region importance appears limited, however, as the relative importance of regions does not correlate well with region size. In addition, our 2D CNN is based on sagittal slices; a future 2D CNN could be designed to leverage information from all three views. Finally, this study does not use the 3 diffusion metrics as combined input to the neural networks owing to GPU memory constraints; neural network architectures that can efficiently take multiple 3D volumes as combined input can be explored in future work.

Overall, our results provide new evidence and insights supporting the existence of sex differences in human brain microstructure, both in local features (e.g., within central white matter structures such as the middle cerebellar peduncle and corpus callosum) and in global features (such as long-distance interactions). Capturing such complex microstructural differences is challenging for conventional statistical methods or for a single machine learning model with hand-crafted feature inputs. Our work demonstrates a unique approach: leveraging multiple neural networks with completely different architectural designs allows us to capture complementary information and makes our results independent of any single model architecture choice. When advanced, data-hungry machine learning architectures are required, self-supervised learning can be used to pretrain models, enabling such networks to be leveraged for medical imaging studies that lack the enormous datasets available in non-medical computer vision. This framework can be further adapted to study other neurological disorders.

Conclusion

This study provides new evidence of clear sex-related differences in the brain white matter microstructure of healthy young adults, detected using in vivo diffusion MRI without hand-crafting or manually manipulating the imaging data. We show this across 3 different end-to-end deep neural networks and 3 commonly used diffusion MRI metrics. Even after registering the diffusion MR volumes to a template to remove macroscopic anatomical differences such as overall brain size and contour, the results show that sex differences exist in the diffusion anisotropy (FA), mean diffusivity (MD) and tissue complexity (MK) of brain white matter. Our experiments further suggest that both local and longer-distance microstructural organizational features differ between the sexes; the central white matter appears specifically implicated. In addition, this study provides a framework for studying microstructural differences in the human brain using multiple deep neural network architectures, which helps capture complex microscopic features that are challenging for conventional statistical methods. Further study is needed to determine whether and how these microstructural differences influence brain health and disease in both men and women.