# Multicenter dataset of multi-shell diffusion MRI in healthy traveling adults with identical settings

## Abstract

Multicenter diffusion magnetic resonance imaging (MRI) has drawn great attention recently due to the expanding need for large-scale brain imaging studies, whereas the variability in MRI scanners and data acquisition tends to confound reliable individual-based analysis of diffusion measures. In addition, a growing number of multi-shell diffusion models have been shown with the potential to generate various estimates of physio-pathological information, yet their reliability and reproducibility in multicenter studies remain to be assessed. In this article, we describe a multi-shell diffusion dataset collected from three traveling subjects with identical acquisition settings in ten imaging centers. Both the scanner type and imaging protocol for anatomical and diffusion imaging were well controlled. This dataset is expected to replenish individual reproducible studies via multicenter collaboration by providing an open resource for advanced and novel microstructural and tractography modelling and quantification.

 Measurement(s) brain measurement Technology Type(s) magnetic resonance imaging (MRI) Factor Type(s) MRI scanner Sample Characteristic - Organism Homo sapiens

Machine-accessible metadata file describing the reported data: https://doi.org/10.6084/m9.figshare.12206378

## Background & Summary

Diffusion magnetic resonance imaging (MRI), a noninvasive technique for the exploration of brain microstructures, has been widely used in scientific research and clinical diagnosis. The quantitative measures from various diffusion models reveal different types of tissue viabilities in normal populations1 and pathologies in numerous diseases2,3. Over the last two decades, advances in MRI scanners have enabled the collection of diffusion data in clinically acceptable timeframes, rendering it an essential part of standard medical exams. To explore the neurobiological mechanism of the brain, a large number of advanced diffusion models and algorithms have been proposed to interpret the complexity of the brain microstructure by introducing more tissue compartments4,5,6,7, as well as revealing a more precise white matter fiber structure and connectivity from region to region8,9,10,11.

The promising applications of diffusion imaging in recent years have also inspired many multicenter collaborations to collect data from a large sample of subjects and to share the data with investigators all over the world to solve major scientific questions regarding the brain. Several multicenter collections involving diffusion imaging have been launched, including the Human Connectome Project12, the Adolescent Brain Cognitive Development study13, the Healthy Brain Network14, and the Alzheimer’s Disease Neuroimaging Initiative15, all of which have provided public access to the information therein. With multiple imaging centers sharing responsibility for the burdens (costs and manpower) of data collection, multicenter collaborations are effectively conducted to reduce the acquisition duration and to increase the diversity of the samples.

Unfortunately, one drawback of multicenter studies arises from the inevitable bias resulting from hardware and software differences among MRI scanners. These variances across scanners may reduce the reliability of the MR measures or even conceal the significance of the effect of interest16,17,18,19,20. However, it remains to be investigated whether the individual variances among scanners could be minimized by well-controlling the scanner hardware and software. An alternative way to improve data reproducibility would be retrospective harmonization21,22,23, which is efficient but requires more individual validation regarding different diffusion measurements. In both scenarios, data acquired from the same subjects in multiple scanners are required. Until now, only a few repositories have been available for obtaining diffusion data with scanners of different types24 and magnetic fields25. There is still a lack of diversity of multicenter data from strictly consistent acquisition.

Here, we released a multicenter diffusion dataset that was collected on three traveling subjects in ten centers and with two additional repeated scans in one center. Two features can be highlighted for the dataset. First, in all centers, the same type of scanners and same scan protocol were strictly followed, and the raw data were acquired by the same operating procedure and pre-processed with the same pipeline. Second, in the diffusion protocol, we used a popular multi-shell diffusion scheme, which can be adapted for most diffusion models either for multi-compartment analysis or for precise white matter tractography to resolve crossing fibers. Part of the dataset has been used successfully in one of our previous publications26. Generally, a well-established platform is crucial for the refinement and evaluation of novel methods. With highly equipped MRI scanners utilized in the data collection, this dataset could help in the evaluation of the individual reproducibility of advanced diffusion models for multicenter studies.

In the following sections, we briefly describe the dataset acquisition and processing procedures, technical validation, and sharing and access policy.

## Methods

### Data characteristics

The data were collected at 10 centers from October 2016 to November 2017. Three healthy traveling subjects (one male, 23 years of age, and two females, 26 and 23 years of age) were scanned in nine scanners once and in one scanner (Center 10) three times. All 10 scanners were 3 T MR MAGNETOM Prisma (Siemens, Erlangen, Germany), equipped with max gradient strength of 80 mT/m and slew rate of 200 T/m/s. The software version was Syngo MR VD13D in nine scanners and Syngo MR VE11C in one scanner (Center 09). In all scans, the same type of 64-channel head coil and the same acquisition parameters were used (details below).

The scans were processed by the same operator with a fixed operating procedure. For each subject, anatomical images and diffusion images were acquired. In the anatomical imaging, the field-of-view (FOV) was set according to the head orientation using the auto-align function in the Brain Dot Engine of the scanner, and in the diffusion imaging, the FOV orientation was set parallel to the anterior commissure-posterior commissure line. The FOV center was also aligned with the isocenter of the main magnetic field by moving the scan table.

### Anatomical image

T1-weighted anatomical images were acquired using a 3D magnetization-prepared two rapid acquisition gradient echo (MP2RAGE) sequence27. The imaging parameters were as follows: repetition time (TR)/echo time (TE) = 5 s/2.9 ms, inversion time (TI) = 700, 2500 ms, FOV = 211 × 256 × 256 mm3, voxel size = 1.2 × 1 × 1 mm3, bandwidth = 240 Hz/Px, generalized auto-calibrating partial parallel acquisition (GRAPPA) factor = 3, and the acquisition time was 8 minutes and 22 seconds.

### Diffusion-weighted image (DWI)

DWIs were obtained using a simultaneous multi-slice (SMS) spin-echo echo planar imaging (EPI) prototype sequence28. The imaging parameters were as follows: TR/TE = 5.4 s/71 ms, FOV = 220 × 220 mm2, slice number = 93, voxel size = 1.5 × 1.5 × 1.5 mm3, bandwidth = 1712 Hz/Px, GRAPPA factor = 2, and SMS factor = 3, with reversed phase-encoding (PE) directions along anteroposterior (AP) and posteroanterior (PA) separately. The diffusion duration and diffusion time were 15.9 ms and 34.4 ms, respectively, for the monopolar diffusion gradients. The diffusion scheme, containing 30 vectors with uniform angular coverage on each shell (b-values = 1000, 2000, and 3000 s/mm², non-colinear between any two shells), was generated from a multi-shell vector sampling tool29. Six non-diffusion frames were equally distributed in the scheme. The total acquisition time was 19 minutes and 04 seconds.

### Computational pre-processing

Practically, the DWIs acquired by MRI scanners are sensitive to noise, field inhomogeneity and head motion, which can cause image imperfections and require additional pre-processing for correction. In the released dataset, the raw images, stored in the Digital Imaging and Communications in Medicine (DICOM) format, were converted to the Neuroimaging Informatics Technology Initiative (NIfTI) format for pre-processing. The DWIs were pre-processed using a common pipeline, which included denoising using the MRtrix330 (version 0.3.15-500-g382393bb), Gibbs-ring removal31, and distortion and motion correction using the FSL suite (version 5.0.11). Image distortion was estimated by the TOPUP tool from the non-diffusion images of PE directions to generate a field map function32. Both distortion and motion were jointly corrected with the EDDY tool33. In addition, the diffusion vectors were also rotated accordingly based on the EDDY correction. Finally, the images along the AP and PA directions were combined for subsequent analysis.

## Data Records

### Data privacy

Data collection was conducted with approval from the institutional review board of Xuanwu Hospital, Capital Medical University, Beijing, China, and all volunteers had signed the informed consent forms beforehand. Subject’s facial features had been removed from the images using the Freesurfer (version v1.379.2.73).

### Distribution for use

The dataset has been organized in the Brain Imaging Data Structure (BIDS) standard34 and is publicly available in figshare35. The multicenter scans of each subject are encoded as multiple sessions by labels identifying the center ID and repetition number. Each session folder contains the subfolders “anat” and “dwi” for anatomical and DWI images, respectively. The images are stored in a compressed NIfTI format, and the sidecar JSON files are listed together with the relevant images. In addition, a file with the suffix “eddy_parameters” produced by EDDY for each scan has also been provided under the folder “derivatives”.

## Technical Validation

### Quality control for diffusion data

The quality of the pre-processed images was evaluated. For each DW scan, we measured the signal-to-noise ratio (SNR) and ghost-to-signal ratio (GSR) from non-diffusion images. The background noise of all DWIs was also computed and compared. In addition, head motion was also estimated.

• SNR:

The SNR was measured following the National Electrical Manufacturers Association (NEMA) standard on two non-diffusion images36. The mean values of two images within a white matter region of interest (ROI) at the genu of corpus callosum (GCC) were termed the signal. The noise was calculated from the standard deviation (SD) of the difference image within the same region divided by a correction factor of $$\sqrt{2}$$. In each scan, six non-diffusion images produced five SNR measures from every two adjacent images.

• GSR:

The GSR is a measure of the Nyquist ghost artifact, which describes the signal leakage with a shift at 1/2 image size along the PE direction on image. In the non-diffusion images, a rectangular ROI at the center of the image of size 10 × 40 was selected as the signal region (ROI-s). Then, an ROI-g was defined as a pair of ghost regions by shifting the ROI-s half the image size up or down along the PE direction. Finally, an ROI-n was defined as two noise regions selected from the background with the same size as the ROI shifted left or right along the readout (RO) direction. These ROIs are outlined in blue in Fig. 1a. The mean intensities were computed within these defined ROIs. The GSR was calculated using the absolute signal difference between ROI-g and ROI-n and divided by the mean signal of ROI-s.

The EDDY tool can estimate the head motion and eddy current-induced field when pre-processing the DWIs. The translation and rotation parameters relative to the first DWI frame were obtained as listed in *.eddy_parameters, together with 10 components interpreting the quadratic eddy current-induced field. In addition, two summarized volumetric movements were also generated as the root mean square (RMS) of the voxel displacement for each volume: one was calculated relative to the first frame, and the other was calculated between adjacent frames (as listed in *.eddy_restricted_movement_rms).

• Noise on DWI:

The background noise of the DWIs was compared across centers. Four square ROIs of size 5 × 5 at the corners were drawn in the background, avoiding the Nyquist ghost region. The noise was calculated as the SD of all selected voxels divided by a correction factor of 0.66, accounting for the Rayleigh distribution on the image, as suggested in the NEMA standard37.

### Results of quality metrics

Figure 1 presents the quality metrics of the non-diffusion images. For the five SNRs and six GSRs calculated from each scan, their minimum, median, and maximum values are shown.

Figure 2 presents the RMS motion metrics between adjacent DWI frames, including the AP and PA images.

Figure 3 demonstrates the image noise evaluated from all DWI frames, which are reordered by b-values.

### Visualization and reproducibility of diffusion models

Beyond the quality metrics on the DWIs shown above, the reproducibility of diffusion outcomes generated by diffusion models or algorithms must also be considered. Here, we conducted a post-processing procedure using MRtrix3 for diffusion fiber tractography as an example. Since the framework of the fiber tracking algorithm was complicated, an intermediate measure reflecting voxel-wise fiber orientation together with the final tractography was selected for visualization. The intra-subject results were compared across all centers.

• Fiber orientation distribution (FOD):

The FOD represents the continuous distribution of underlying fibers for each voxel by the spherical deconvolution of the diffusion signal profile. To process the multi-shell diffusion data, multi-shell multi-tissue constrained spherical deconvolution10 was utilized to estimate the FOD with a maximum harmonic order of six. Figure 4 illustrates the FODs in an ROI containing crossing fibers that are mixed from the forceps minor, the anterior thalamic radiation, and the corticopontine tract.

• Track density imaging (TDI):

The anatomically constrained tractography algorithm11 was chosen for the multi-shell data to generate streamlines from the FOD with step size of 0.15 mm, length limitation from 3 to 250 mm and cut-off FOD amplitude of 0.06. Additionally, spherical deconvolution-informed filtering of tractograms was used to improve the accuracy of the fiber tractography38. After that, one million streamlines were reconstructed in the whole brain. To present the tractography, the TDI statistically measures the concentration of fiber streamlines within voxels. Figure 5 illustrates the TDI maps from all centers.

## Usage Notes

The data from eight centers [01–07 and 10] in the current dataset had been analyzed in our previous publications26, where they were denoted by A-H as follows: [01, D], [02, E], [03, A], [04, B], [05, G], [06, F], [07, C], and [10, H]. The data from centers 08 and 09 are newly added here. In addition, it should also be noted that we used an earlier version of the FSL suite (5.0.10) for pre-processing, and the Gibbs-ring removal was not used in our previous work26.

We also welcome any cooperation with us to fully explore this dataset.

## Code availability

The codes for the pre-processing steps above were assembled in the released Dataset. We also shared a package of code for running the pre-processing pipeline and the technical validation, and can be accessed in figshare39.

## References

1. 1.

Kodiweera, C., Alexander, A. L., Harezlak, J., McAllister, T. W. & Wu, Y. C. Age effects and sex differences in human brain white matter of young to middle-aged adults: a DTI, NODDI, and q-space study. Neuroimage 128, 180–192 (2016).

2. 2.

Avants, B. B., Cook, P. A., Ungar, L., Gee, J. C. & Grossman, M. Dementia induces correlated reductions in white matter integrity and cortical thickness: a multivariate neuroimaging study with sparse canonical correlation analysis. Neuroimage 50, 1004–1016 (2010).

3. 3.

Faria, A. V. et al. Atlas-based analysis of neurodevelopment from infancy to adulthood using diffusion tensor imaging and applications for automated abnormality detection. Neuroimage 52, 415–428 (2010).

4. 4.

Zhang, H., Schneider, T., Wheeler-Kingshott, C. A. & Alexander, D. C. NODDI: practical in vivo neurite orientation dispersion and density imaging of the human brain. Neuroimage 61, 1000–1016 (2012).

5. 5.

Assaf, Y. & Basser, P. J. Composite hindered and restricted model of diffusion (CHARMED) MR imaging of the human brain. Neuroimage 27, 48–58 (2005).

6. 6.

Jespersen, S. N. et al. Neurite density from magnetic resonance diffusion measurements at ultrahigh field: comparison with light microscopy and electron microscopy. Neuroimage 49, 205–216 (2010).

7. 7.

Ferizi, U. et al. White matter compartment models for in vivo diffusion MRI at 300mT/m. Neuroimage 118, 468–483 (2015).

8. 8.

Fan, Q. et al. Investigating the capability to resolve complex white matter structures with high b-value diffusion magnetic resonance imaging on the MGH-USC Connectom scanner. Brain Connect. 4, 718–726 (2014).

9. 9.

Maier-Hein, K. H. et al. The challenge of mapping the human connectome based on diffusion tractography. Nat. Commun. 8 (2017).

10. 10.

Jeurissen, B., Tournier, J. D., Dhollander, T., Connelly, A. & Sijbers, J. Multi-tissue constrained spherical deconvolution for improved analysis of multi-shell diffusion MRI data. Neuroimage 103, 411–426 (2014).

11. 11.

Smith, R. E., Tournier, J. D., Calamante, F. & Connelly, A. Anatomically-constrained tractography: Improved diffusion MRI streamlines tractography through effective use of anatomical information. Neuroimage 62, 1924–1938 (2012).

12. 12.

Van Essen, D. C. et al. The human connectome project: a data acquisition perspective. Neuroimage 62, 2222–2231 (2012).

13. 13.

Casey, B. J. et al. The adolescent brain cognitive development (ABCD) study: imaging acquisition across 21 sites. Dev. Cogn. Neurosci. 32, 43–54 (2018).

14. 14.

Alexander, L. M. et al. An open resource for transdiagnostic research in pediatric mental health and learning disorders. Sci. Data 4, 1–26 (2017).

15. 15.

Weiner, M. W. et al. The Alzheimer’s disease neuroimaging initiative 3: continued innovation for clinical trial improvement. Alzheimer’s Dement. 13, 561–571 (2017).

16. 16.

Fox, R. J. et al. A validation study of multicenter diffusion tensor imaging: reliability of fractional anisotropy and diffusivity values. Am. J. Neuroradiol. 33, 695–700 (2012).

17. 17.

Sasaki, M. et al. Variability in absolute apparent diffusion coefficient values across different platforms may be substantial: a multivendor, multi-institutional comparison study. Radiology 249, 624–630 (2008).

18. 18.

Seo, Y., Wang, Z. J., Morriss, M. C. & Rollins, N. K. Minimum SNR and acquisition for bias-free estimation of fractional anisotropy in diffusion tensor imaging - a comparison of two analytical techniques and field strengths. Magn. Reson. Imaging 30, 1123–1133 (2012).

19. 19.

Teipel, S. J. et al. Multicenter stability of diffusion tensor imaging measures: a European clinical and physical phantom study. Psychiatry Res.: Neuroimaging 194, 363–371 (2011).

20. 20.

Zhou, X. et al. Quantitative quality assurance in a multicenter HARDI clinical trial at 3T. Magn. Reson. Imaging 35, 81–90 (2017).

21. 21.

Tax, C. M. et al. Cross-scanner and cross-protocol diffusion MRI data harmonisation: a benchmark database and evaluation of algorithms. Neuroimage 195, 285–299 (2019).

22. 22.

Karayumak, S. C. et al. Retrospective harmonization of multi-site diffusion MRI data acquired with different acquisition parameters. Neuroimage 184, 180–200 (2019).

23. 23.

Pohl, K. M. et al. Harmonizing DTI measurements across scanners to examine the development of white matter microstructure in 803 adolescents of the NCANDA study. Neuroimage 130, 194–213 (2016).

24. 24.

Ning, L. et al. Muti-shell diffusion MRI harmonisation and enhancement challenge (MUSHAC): progress and results. in Computational Diffusion MRI 217–224 (Springer, Cham, 2019).

25. 25.

Sotiropoulos, S. N. et al. Fusion in diffusion MRI for improved fibre orientation estimation: an application to the 3T and 7T data of the human connectome project. Neuroimage 134, 396–409 (2016).

26. 26.

Tong, Q. et al. Reproducibility of multi-shell diffusion tractography on traveling subjects: a multicenter study prospective. Magn. Reson. Imaging 59, 1–9 (2019).

27. 27.

O’Brien, K., Krueger, G., Lazeyras, F., Gruetter, R. & Roche, A. A simple method to denoise MP2RAGE. in Proc. 21th International Society for Magnetic Resonance in Medicine 0269 (Int. Soc. Magn. Reson. Med., 2013).

28. 28.

Setsompop, K. et al. Blipped-controlled aliasing in parallel imaging for simultaneous multislice echo planar imaging with reduced g-factor penalty. Magn. Reson. Med. 67, 1210–1224 (2012).

29. 29.

Caruyer, E., Lenglet, C., Sapiro, G. & Deriche, R. Design of multishell sampling schemes with uniform coverage in diffusion MRI. Magn. Reson. Med. 69, 1534–1540 (2013).

30. 30.

Veraart, J. et al. Denoising of diffusion MRI using random matrix theory. Neuroimage 142, 394–406 (2016).

31. 31.

Kellner, E., Dhital, B., Kiselev, V. G. & Reisert, M. Gibbs-ringing artifact removal based on local subvoxel-shifts. Magn. Reson. Med. 76, 1574–1581 (2016).

32. 32.

Andersson, J. L. R., Skare, S. & Ashburner, J. How to correct susceptibility distortions in spin-echo echo-planar images: application to diffusion tensor imaging. Neuroimage 20, 870–888 (2003).

33. 33.

Andersson, J. L. R. R. & Sotiropoulos, S. N. An integrated approach to correction for off-resonance effects and subject movement in diffusion MR imaging. Neuroimage 125, 1063–1078 (2016).

34. 34.

Gorgolewski, K. J. et al. The brain imaging data structure, a format for organizing and describing outputs of neuroimaging experiments. Sci. Data 3, 1–9 (2016).

35. 35.

Tong, Q. et al. Multicenter dataset of multishell diffusion MRI in healthy traveling adults with identical setting. figshare https://doi.org/10.6084/m9.figshare.8851955.v6 (2020).

36. 36.

National Electrical Manufacturers Association. Determination of signal-to-noise ratio (SNR) in diagnostic magnetic resonance imaging. Report No. MS 1-2008 (R2014) (National Electrical Manufacturers Association, 2008).

37. 37.

Henkelman, R. M. Measurement of signal intensities in the presence of noise in MR images. Medical physics 12, 232–233 (1985).

38. 38.

Smith, R. E., Tournier, J. D., Calamante, F. & Connelly, A. SIFT: spherical-deconvolution informed filtering of tractograms. Neuroimage 67, 298–312 (2013).

39. 39.

Tong, Q. & He, H. Processing code for multishell dMRI dataset. figshare https://doi.org/10.6084/m9.figshare.8856887.v2 (2020).

## Acknowledgements

This work was supported by National Key R&D Program of China (2017YFC0909200), National Natural Science Foundation of China (81871428), Shanghai Key Laboratory of Psychotic Disorders (13dz2260500, K18-515007-007), Major Scientific Project of Zhejiang Lab (2018DG0ZX01), Fundamental Research Funds for the Central Universities (2019QNA5026, 2019XZZX001-01-08), Zhejiang University Education Foundation Global Partnership Fund, Zhejiang Province Laboratory Work Research Project (YB201730), Beijing Municipal Science and Technology Project of Brain Cognition and Brain Medicine (Z171100000117001), Beijing Municipal Administration of Hospitals Clinical Medicine Development of Special Funding Support (ZYLX201609), Beijing Talents Foundation (2016000021223TD07), Capacity Building for Sci-Tech Innovation - Fundamental Scientific Research Funds (19530050157, 19530050184), and Beijing Brain Initiative of Beijing Municipal Science & Technology Commission.

## Author information

Authors

### Contributions

Consortium Leadership: K.L., J.Z., P.L. & H.H. Protocol Development: Q.T., T.G., T.Q. & Y.S. Data collection: Q.T., T.G., C.L., P.L. & Q.D. Data Analysis: Q.T. Preparation of manuscript: Q.T., H.H. & J.Z.

### Corresponding authors

Correspondence to Hongjian He or Peipeng Liang.

## Ethics declarations

### Competing Interests

Dr. Tianyi Qian and Dr. Yi Sun are employed by Siemens. The remaining authors declare no competing financial interests.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

## Rights and permissions

The Creative Commons Public Domain Dedication waiver http://creativecommons.org/publicdomain/zero/1.0/ applies to the metadata files associated with this article.

Reprints and Permissions