Introduction

Magnetic resonance imaging (MRI) is a non-invasive medical imaging modality that can comprehensively visualize tissues and organs, offers superior soft-tissue contrast relative to alternative imaging modalities, and can demonstrate subtle pathologies1. MRI utilises a strong magnetic field, radiofrequency pulses, and sophisticated computational algorithms to generate diagnostic images of many body regions, including the brain, spine, organs, and joints. A notable feature of MRI is its absence of ionizing radiation, avoiding radiation-related risks2. Low-field MRI provides opportunities to develop systems that are more compact, cost-effective, and portable than current clinical MRI scanners (\(\ge\)1.5T field strengths)3. Although low-field MRI images suffer from reduced signal-to-noise ratio (SNR) compared to their high-field counterparts, low-field MRI is of potential interest for a number of imaging applications, including musculoskeletal and neuroimaging. Point-of-care (POC) MRI for use in emergency department (ED) or intensive care unit (ICU) settings could also serve as a viable alternative in remote or economically challenged regions of the world4,5,6,7,8.

The recent advent of portable low-field MRI scanners, such as the 64mT Hyperfine Swoop, holds significant promise for reducing inequality of access to MRI while retaining acceptable diagnostic quality9,10,11,12,13,14,15. However, the limited spatial resolution and SNR of low-field MRI preclude the use of traditional image analysis tools, such as the FMRIB automated segmentation tool, FSL-FAST16,17. As a result, there has been growing interest in determining whether novel methods can translate images acquired with low-field POC MRI scanners into images comparable to those obtained with high-field scanners18,19. This approach, known as image-to-image translation, holds potential to improve the diagnostic value of images acquired using low-field scanners. Recently, deep learning-based (artificial intelligence) approaches have shown significant promise for medical image synthesis20,21,22,23,24. Several state-of-the-art methods previously used for natural image-to-image translation have been adapted to perform low-field to high-field MRI translation9,17,25,26. These include generative adversarial networks (GANs), CycleGAN, U-Net, multi-scale fusion networks, conditional generative adversarial networks (cGAN), and Pix2Pix27,28,29,30,31.

Based on U-Net29, Iglesias et al.9 proposed a state-of-the-art synthetic super-resolution method called SynthSR. The method employs a network to generate 1 mm isotropic T1-weighted structural images from clinical MRI scans of varying orientation, resolution (including low-field), and contrast. This innovative approach can potentially advance quantitative neuroimaging in both clinical care and research settings32. Subsequently, Iglesias et al.33 provided a proof-of-concept (SynthSeg) for applying the SynthSR method to quantitative brain morphometry analysis on low-field MRI data. The results demonstrated that portable low-field MRI can be enhanced with SynthSR to yield brain morphometric measurements that correlate with those obtained from high-resolution images. More recently, Billot et al.34 proposed SynthSeg\(^+\), a version of the Iglesias et al.33 method that is robust to any MRI resolution and contrast, and subsequently investigated its performance on clinical scans35. Similarly, Laguna et al.17 introduced an image-to-image translation architecture inspired by CycleGAN28, which integrates denoising, super-resolution, and domain adaptation networks to address the resolution and signal-to-noise limitations of portable low-field MRI.

The aforementioned methodologies motivate the current investigation into the performance of advanced image-to-image translation models for low-field MRI, with a particular focus on their ability to maintain diagnostic integrity and preserve essential medical information without introducing image artifacts. Specifically, we investigated the effectiveness of GANs, CycleGAN, U-Net, cGAN, and our own model (LoHiResGAN) in generating synthetic 3T MR images from 64mT images. Brain morphometry was compared between the synthetic 3T images, the paired 3T images, and the original 64mT images. Our findings indicate a high level of agreement between 3T and synthetic 3T brain morphometry measures, which is more consistent than the agreement between measures made using 3T and 64mT images.

Materials and methods

Data collection

Institutional ethics and institutional review board (IRB) approvals were obtained from the Monash University Human Research Ethics Committee, and written informed consent was acquired from all participants involved in the study. All experiments were performed in accordance with relevant guidelines and regulations. The study, conducted between October 2022 and June 2023, involved 92 healthy individuals (mean age 44; range 18–81; SD = 17; 42 males). Each participant was scanned at Monash Biomedical Imaging using both Hyperfine Swoop (64mT) and Siemens Biograph mMR (3T) imaging systems. While the Hyperfine Swoop is designed to function in unshielded environments, leveraging its proprietary electromagnetic interference (EMI) removal technique, we encountered an EMI warning during our operations5; consequently, we repositioned the scanner to a location where such warnings no longer occurred. One recruited participant, a 60-year-old female, had a history of cerebral haemorrhage and subsequent development of a cerebrospinal fluid (CSF)-filled cavity; this participant was excluded from model training but was subsequently used to test the model’s performance as an out-of-distribution sample. Table 1 provides the scanning parameters, including voxel resolution, matrix size, and scan duration, for both T1- and T2-weighted scans of both systems.

Table 1 Details of the scanning parameters for 64mT and 3T MRIs.

In choosing the imaging sequences for this study, various factors were taken into consideration. For the Siemens Biograph mMR (3T) imaging system, we opted for a 2D T2-weighted TSE sequence over the alternative 3D T2 SPACE sequence. This decision was made on several grounds. First, the 2D T2-weighted TSE sequence offers faster scanning times than its 3D counterparts, ensuring efficiency and minimizing patient discomfort during imaging. Moreover, the 2D T2-weighted sequence is a cornerstone of many clinical protocols, and its results are well-established in the medical community36; leveraging this sequence therefore ensured that our findings had immediate clinical relevance. Furthermore, the field of view for the 2D T2-weighted acquisition was matched to the 64mT acquisition, ensuring consistency between the two imaging systems within our study.

Data pre-processing

The SynthSeg\(^+\) method34 was used to resample the datasets to 1 mm\(^3\) isotropic resolution, and FSL-FAST was used for bias field correction without referencing an external atlas for spatial information16. To prepare the data for the deep learning methods, the 64mT and 3T scans were co-registered for paired training using FMRIB’s linear image registration tool (FLIRT)37. Finally, the dataset was randomly divided into training (n = 37), validation (n = 5), and testing (n = 50) sets to ensure the network was trained, validated, and tested on data from different participants. Where applicable, the relevant checklist of good machine learning practices (GMLPs) was also followed38. To prepare inputs for the SynthSR32 method, we used FLIRT to co-register the T1-w and T2-w images without resampling them to 1 mm\(^3\) isotropic resolution. Once the SynthSR T1-w image was obtained, it was resampled and co-registered with the 3T (T1-w) images for further comparison.
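As an illustration of this pre-processing step, the following is a minimal sketch that drives the FSL command-line tools from Python. The file names, output prefix, and the choice of a 6-degrees-of-freedom rigid registration are assumptions for illustration, not the exact parameters used in this study.

```python
import subprocess

def preprocess_pair(lowfield: str, highfield: str, out_prefix: str) -> None:
    """Co-register a 64mT scan to its paired 3T scan with FSL FLIRT,
    then bias-field correct with FSL FAST (file names are illustrative)."""
    # Rigid-body registration of the 64mT image to the 3T image
    # (6 DOF is an assumption; the study does not state the DOF used).
    subprocess.run([
        "flirt", "-in", lowfield, "-ref", highfield,
        "-out", f"{out_prefix}_reg.nii.gz",
        "-omat", f"{out_prefix}_reg.mat", "-dof", "6",
    ], check=True)
    # Bias-field correction; -B writes the bias-corrected ("restored") image.
    subprocess.run(["fast", "-B", f"{out_prefix}_reg.nii.gz"], check=True)

preprocess_pair("sub01_64mT_T1w.nii.gz", "sub01_3T_T1w.nii.gz", "sub01")
```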

Model architecture

Our proposed method (LoHiResGAN, named for the low-field to high-field translation task and the use of ResNet components in a GAN architecture) was inspired by the cGAN and Pix2Pix models, with ResNet downsample and upsample blocks used in place of the standard U-Net39. Table 2 provides a detailed breakdown of the architectural components and their functions within the proposed LoHiResGAN model, highlighting the structure of both the generator and the discriminator. To empirically evaluate the effectiveness of the ResNet components in LoHiResGAN, we conducted experiments comparing its performance with architectures devoid of these components. Our findings supported the incorporation of ResNet, showing that its components contributed significantly to improved translation of low-field MRI images to high-field MRI images, thereby validating this design choice.

Table 2 Summary of the Architecture of the LoHiResGAN Model: A ResNet-based Generative Adversarial Network (GAN) for Efficient Translation of Low-Field to High-Field MRI Images.
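To make the architectural idea concrete, the sketch below outlines a Pix2Pix-style generator with a ResNet bottleneck and a PatchGAN-style conditional discriminator in TensorFlow/Keras. The input shape, filter counts, number of residual blocks, and normalization choices are illustrative assumptions and are not taken from Table 2.

```python
import tensorflow as tf
from tensorflow.keras import layers

def res_block(x, filters):
    """Residual block: two 3x3 convolutions with an identity skip connection."""
    skip = x
    x = layers.Conv2D(filters, 3, padding="same")(x)
    x = layers.BatchNormalization()(x)
    x = layers.ReLU()(x)
    x = layers.Conv2D(filters, 3, padding="same")(x)
    x = layers.BatchNormalization()(x)
    return layers.ReLU()(layers.Add()([skip, x]))

def build_generator(shape=(256, 256, 1), n_res=6):
    """Encoder -> ResNet bottleneck -> decoder (Pix2Pix-style, ResNet core)."""
    inp = layers.Input(shape)
    x = layers.Conv2D(64, 7, padding="same", activation="relu")(inp)
    for f in (128, 256):                      # downsampling path
        x = layers.Conv2D(f, 3, strides=2, padding="same", activation="relu")(x)
    for _ in range(n_res):                    # residual bottleneck
        x = res_block(x, 256)
    for f in (128, 64):                       # upsampling path
        x = layers.Conv2DTranspose(f, 3, strides=2, padding="same",
                                   activation="relu")(x)
    out = layers.Conv2D(1, 7, padding="same", activation="tanh")(x)
    return tf.keras.Model(inp, out, name="generator")

def build_discriminator(shape=(256, 256, 1)):
    """PatchGAN-style conditional discriminator on (input, target) pairs."""
    inp, tgt = layers.Input(shape), layers.Input(shape)
    x = layers.Concatenate()([inp, tgt])
    for f in (64, 128, 256):
        x = layers.Conv2D(f, 4, strides=2, padding="same")(x)
        x = layers.LeakyReLU(0.2)(x)
    out = layers.Conv2D(1, 4, padding="same")(x)  # per-patch logits
    return tf.keras.Model([inp, tgt], out, name="discriminator")
```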

As this study focused on translating low-field MRI to high-field MRI containing different domain information, we hypothesized that incorporating the structural similarity index measure (SSIM) as an additional loss could provide more information for the overall loss calculation. While mean absolute error (MAE) and binary cross-entropy (BCE) losses measure different aspects of the generated output, they do not consider the perceptual similarity between the generated and target images. In contrast, the SSIM loss captures the similarity between two images based on luminance, contrast, and structure, and has been shown to align more closely with human perception of image quality than traditional pixel-wise error metrics such as MAE. By incorporating SSIM as an additional loss term, the overall loss function considers not only the accuracy of the generated image but also its structural and textural similarity to the target image, which may improve the perceptual quality of the generated images. To test the efficacy of this approach, we conducted experiments comparing results with and without the SSIM loss. Our findings indicated that integrating SSIM into the loss function indeed favored the production of images with enhanced perceptual quality, validating our hypothesis and underscoring its potential utility in cross-domain image translation tasks.

$$\begin{aligned} \mathscr {L}_{LoHiResGAN}(G,D) = \mathbb {E}_{x,y}[\log D(x,y)] + \mathbb {E}_{x,z}[\log (1-D(x,G(x,z)))]. \end{aligned}$$
(1)
$$\begin{aligned} \mathscr {L}_{L1}(G) = \mathbb {E}_{x,y,z}[||y - G(x,z)||_{1}]. \end{aligned}$$
(2)
$$\begin{aligned} \mathscr {L}_{SSIM}(G) = \mathbb {E}_{x,y,z}[SSIM(y, G(x,z))]. \end{aligned}$$
(3)
$$\begin{aligned} \mathscr {T} = \mathop {\min }\limits _G \mathop {\max }\limits _D \mathscr {L}_{LoHiResGAN}(G,D) + \lambda _1 \mathscr {L}_{L1}(G) + \lambda _2 \mathscr {L}_{SSIM}(G). \end{aligned}$$
(4)

where x is the input image, y is the target image, z is the noise vector, D is the discriminator, and G is the generator. \(\lambda _1=100\) (as per the original Pix2Pix40) and \(\lambda _2=1\) are the weighting factors for the \(L_1\) loss and the SSIM loss, respectively. \(\mathscr {T}\) is the total objective function to be optimized during training.

In the context of the LoHiResGAN network, Eq. (1) represents the adversarial loss function that the generator, G, aims to minimize against an adversarial discriminator, D, which seeks to maximize it. This function comprises two components: one estimating the likelihood of the discriminator correctly identifying real image pairs and the other estimating the likelihood of identifying fake pairs generated by G. Equation (2) denotes the \(L_1\) loss function, which measures the expected absolute difference between the target image, y, and the image produced by the generator, G(x, z); this term drives the generated images closer to the target images. Equation (3) describes the structural similarity index measure (SSIM) loss for the generator, G, which calculates the expected structural similarity between the target and generated images, considering structural information, luminance, and contrast. Finally, Eq. (4) encapsulates the overall objective function that LoHiResGAN seeks to optimize: a weighted combination of the adversarial loss, the \(L_1\) loss, and the SSIM loss. The ultimate goal is to find the optimal G and D that, respectively, minimize and maximize this objective.
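To make Eqs. (1)-(4) concrete, a minimal TensorFlow sketch of the loss terms is given below. Since a higher SSIM indicates greater similarity, the SSIM term is subtracted in the generator objective so that minimizing the total loss maximizes structural similarity; this sign convention is our assumption about how Eq. (3) enters the minimization in Eq. (4). The weights follow Eq. (4).

```python
import tensorflow as tf

bce = tf.keras.losses.BinaryCrossentropy(from_logits=True)
LAMBDA_L1, LAMBDA_SSIM = 100.0, 1.0  # weighting factors from Eq. (4)

def discriminator_loss(real_logits, fake_logits):
    # Eq. (1): D maximises log D(x, y) + log(1 - D(x, G(x, z))).
    return (bce(tf.ones_like(real_logits), real_logits) +
            bce(tf.zeros_like(fake_logits), fake_logits))

def generator_loss(fake_logits, target, generated):
    adv = bce(tf.ones_like(fake_logits), fake_logits)   # adversarial term
    l1 = tf.reduce_mean(tf.abs(target - generated))     # Eq. (2)
    # Eq. (3): subtracted (an assumed convention) so that minimizing the
    # total loss maximizes the structural similarity of the output.
    ssim = tf.reduce_mean(tf.image.ssim(target, generated, max_val=1.0))
    return adv + LAMBDA_L1 * l1 - LAMBDA_SSIM * ssim
```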

We employed the end-to-end machine learning platform TensorFlow for training, validation, and testing. Several pre-processing steps were applied to prepare the dataset for the different models, including min-max intensity normalization to the desired range and random flipping, rotation, and cropping. The Adam optimizer was selected for its superior performance relative to other optimizers, determined by trial and error. The training hyperparameters were a learning rate of \(2\times 10^{-3}\), \(\beta _1=0.5\), \(\beta _2=0.999\), 350 epochs, batch size 1, and shuffling after every epoch. Model training, testing, and performance analysis were performed on an Ubuntu LTS (ver 20.04.5) operating system with an NVIDIA A40 GPU.
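A sketch of this training configuration, assuming the paired 64mT and 3T volumes have already been loaded as normalized tensors (the variable and function names are hypothetical):

```python
import tensorflow as tf

# Hyperparameters as reported in the text.
LEARNING_RATE, BETA_1, BETA_2 = 2e-3, 0.5, 0.999
EPOCHS, BATCH_SIZE = 350, 1

gen_opt = tf.keras.optimizers.Adam(LEARNING_RATE, beta_1=BETA_1, beta_2=BETA_2)
disc_opt = tf.keras.optimizers.Adam(LEARNING_RATE, beta_1=BETA_1, beta_2=BETA_2)

def make_dataset(low_imgs, high_imgs):
    """low_imgs / high_imgs: arrays of paired 64mT and 3T slices,
    already min-max normalized. Shuffling repeats every epoch."""
    ds = tf.data.Dataset.from_tensor_slices((low_imgs, high_imgs))
    return ds.shuffle(len(low_imgs),
                      reshuffle_each_iteration=True).batch(BATCH_SIZE)
```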

Image quality assessment and statistical evaluation

The performance of the image-to-image translation models was evaluated using quantitative metrics (Eqs. 5–7), namely, normalized root-mean-squared error (NRMSE), structural similarity index measure (SSIM), peak signal-to-noise ratio (PSNR), and the perception-based image quality evaluator (PIQE)41. These metrics provide different perspectives on the quality of the predicted image relative to the 3T images. Post-processing steps (segmentation and statistical analysis) were then employed to conduct a comparative analysis of brain morphometry across the 3T, 64mT, and synthetic 3T MRI scans. In addition, the Sørensen-Dice similarity coefficient (DICE) was used to measure the overlap between the predicted segmentation and the ground truth (Eq. 8), providing a quantitative evaluation of how well the image-to-image translation models preserve the morphological details of the structures of interest in the synthetic 3T MRI scans. The DICE coefficient ranges from 0 to 1, where 1 indicates perfect overlap (i.e., the predicted segmentation is identical to the ground truth) and 0 indicates no overlap.

$$\begin{aligned} NRMSE(I, K) = \frac{\sqrt{\frac{1}{N}\sum _{i=1}^{N}(I_i-K_i)^2}}{\max (I) - \min (I)} \end{aligned}$$
(5)
$$\begin{aligned} SSIM(I, K) = \frac{(2\mu _I\mu _K + c_1)(2\sigma _{IK} + c_2)}{(\mu _I^2 + \mu _K^2 + c_1)(\sigma _I^2 + \sigma _K^2 + c_2)} \end{aligned}$$
(6)
$$\begin{aligned} PSNR(I, K) = 20\log _{10}\left( \frac{MAX_I}{\sqrt{MSE(I,K)}}\right) \end{aligned}$$
(7)
$$\begin{aligned} DICE(A, B) = \frac{2|A \cap B|}{|A|+|B|} \end{aligned}$$
(8)

where I and K are two vectorised, non-negative matrices representing two images of the same size, and N is the number of elements in I or K. \(\mu _I\) and \(\mu _K\) are the means of I and K, respectively; \(\sigma _{I}\) and \(\sigma _{K}\) are the standard deviations of I and K; \(\sigma _{IK}\) is the covariance of I and K. \(c_1=(k_1L)^2\) and \(c_2=(k_2L)^2\) are two variables that stabilize the division when the denominator is weak; L is the dynamic range of the pixel values. For PIQE, see the original article41. A and B are, respectively, the sets of pixels in the predicted segmentation and the ground truth; \(|A \cap B|\) is the cardinality of their intersection, and |A| and |B| are the cardinalities of A and B.
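For reference, Eqs. (5)-(8) can be computed as in the following sketch, using scikit-image for SSIM and PSNR; treating the reference image's dynamic range as the data_range argument is our assumption.

```python
import numpy as np
from skimage.metrics import structural_similarity, peak_signal_noise_ratio

def nrmse(ref, pred):
    """Eq. (5): RMSE normalised by the dynamic range of the reference."""
    rmse = np.sqrt(np.mean((ref - pred) ** 2))
    return rmse / (ref.max() - ref.min())

def ssim(ref, pred):
    """Eq. (6), via scikit-image."""
    return structural_similarity(ref, pred, data_range=ref.max() - ref.min())

def psnr(ref, pred):
    """Eq. (7), via scikit-image."""
    return peak_signal_noise_ratio(ref, pred, data_range=ref.max() - ref.min())

def dice(mask_a, mask_b):
    """Eq. (8): overlap between two binary segmentation masks."""
    a, b = mask_a.astype(bool), mask_b.astype(bool)
    return 2.0 * np.logical_and(a, b).sum() / (a.sum() + b.sum())
```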

In the present investigation, skull-stripping was conducted using the brain extraction tool reported by Jenkinson et al.42. Specifically, T1-weighted images from 3T were employed to generate binary brain masks for subsequent scans. One-way ANOVA tests were then conducted to compare the brain volume measurements of 33 regions between the original 3T, 64mT, SynthSR, and LoHiResGAN scans. The one-way ANOVA test was chosen for its suitability for multiple-group comparisons, its mild assumptions (normality tested by the Shapiro-Wilk test and equal variance by Levene’s test), its capacity for simultaneous group comparison, and its provision for effect size and post-hoc analysis. Finally, the Tukey HSD (honestly significant difference) test was used for post-hoc pairwise comparisons between the means (33 regions) of the different MRI modalities (3T, 64mT, SynthSR, and LoHiResGAN) while controlling the family-wise error rate.
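For a single brain region, this statistical procedure can be sketched as follows with SciPy and statsmodels; the function and variable names are illustrative.

```python
import numpy as np
from scipy import stats
from statsmodels.stats.multicomp import pairwise_tukeyhsd

def compare_region_volumes(vol_3t, vol_64mt, vol_synthsr, vol_lohires):
    """One-way ANOVA with assumption checks, then Tukey HSD post-hoc,
    for one brain region's volumes (one value per participant)."""
    groups = [vol_3t, vol_64mt, vol_synthsr, vol_lohires]
    # Assumption checks: per-group normality, equal variances across groups.
    normal_p = [stats.shapiro(g).pvalue for g in groups]
    levene_p = stats.levene(*groups).pvalue
    f_stat, anova_p = stats.f_oneway(*groups)
    # Tukey HSD controls the family-wise error rate across pairwise tests.
    values = np.concatenate(groups)
    labels = np.repeat(["3T", "64mT", "SynthSR", "LoHiResGAN"],
                       [len(g) for g in groups])
    tukey = pairwise_tukeyhsd(values, labels)
    return normal_p, levene_p, anova_p, tukey.summary()
```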

Results

Image quality metrics

Table 3 compares the overall performance of various image-to-image translation models on two different MRI sequences, T1- and T2-weighted. This comparison is based on four key metrics: NRMSE, PSNR, SSIM, and PIQE. LoHiResGAN displays the lowest NRMSE and PIQE values and the highest PSNR and SSIM values, outperforming other methods across all metrics.

Table 3 Comparison of normalized root-mean-squared error (NRMSE), peak signal-to-noise ratio (PSNR), structural similarity index measure (SSIM), and perception-based image quality evaluator (PIQE) with other state-of-the-art methods for T1- and T2-weighted (mean ± standard deviation).

Qualitative image comparison

Figure 1 shows substantial qualitative disparities between representative 64mT and 3T images in both T1-weighted and T2-weighted modalities. These visual differences, evident in the 64mT, GANs, CycleGAN, U-Net, and cGAN images, are minimized by LoHiResGAN. An absolute difference image is also computed for each method’s output, demonstrating the variation in translation results between methods; this residual signal error relative to 3T is clearly lowest for LoHiResGAN.

Figure 1

Comparison of T1-weighted and T2-weighted images (32-year-old male participant) and their absolute difference (plotted in the range of [0, 1]) with respect to 3T reference images across various state-of-the-art image-to-image translation techniques. From left to right: 64mT, 3T, GANs, CycleGAN, U-Net, cGAN, and LoHiResGAN.

Brain parcellation (by SynthSeg\(^+\)) for different methods in 33 brain regions is shown in Fig. 2. The SynthSeg\(^+\) model is publicly accessible and designed for user-friendly interaction. We used the pre-trained SynthSeg\(^+\) model to evaluate our proposed method’s performance without any additional fine-tuning on our low-field MRI data. Qualitative analysis of segmented masks for 64mT, 3T, SynthSR, and LoHiResGAN brain regions shows large errors in CSF segmentation of the lateral ventricles, Sylvian fissures, and sulci at 64mT. Differences are also evident in the hippocampus, cerebral white matter, cortex, and deep brain structures with erroneous labeling between grey and white matter. Whilst SynthSR improves the visual appearance, there is significant smoothing and no improvement in the segmentation of CSF spaces. The segmentation of LoHiResGAN shows high visual similarity to that of 3T.

Figure 2

Brain regions visualised using orthogonal views (T1-weighted): Sagittal, Coronal, and Axial for 3T (first row), 64mT (second row), SynthSR (third row), and LoHiResGAN (fourth row).

Quantitative regional brain volume

Figure 3 compares brain volume measurements across four datasets: 3T, 64mT, SynthSR, and LoHiResGAN. Each plot corresponds to one of the 33 brain regions analyzed, with the width of each plot indicating the data distribution and the mean and inner quartiles marked by horizontal lines. Relative to the 3T images, the 64mT images demonstrate large errors across multiple brain regions, with differences in mean volume for the cerebral cortex (\(\approx\) 30% underestimate), hippocampus (\(\approx\) 30% underestimate), and CSF (\(\approx\) 30% overestimate), whereas changes of only \(\approx\) 5.6–6.8% have been shown to be clinically significant in patients with hydrocephalus. SynthSR images show marked improvements in quantitative brain volume accuracy in all brain regions compared to the 64mT images. In comparison, LoHiResGAN consistently improves accuracy across all brain regions: it reduces the underestimation observed in regions such as the ventricles, cerebellum cortex, and hippocampus, while also attenuating the overestimation present in white matter (WM) and CSF.
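Regional volumes of this kind can be derived from the SynthSeg\(^+\) label maps by counting voxels per label and scaling by the voxel volume, as in the sketch below; the file name and label indices (shown here as FreeSurfer-style hippocampus labels) are illustrative assumptions.

```python
import numpy as np
import nibabel as nib

def regional_volumes(seg_path, labels):
    """Volume (mm^3) per labelled region in a segmentation image,
    given a mapping of region names to label indices (hypothetical)."""
    seg = nib.load(seg_path)
    data = seg.get_fdata()
    voxel_mm3 = np.prod(seg.header.get_zooms()[:3])  # voxel volume in mm^3
    return {name: float((data == idx).sum() * voxel_mm3)
            for name, idx in labels.items()}

# Illustrative usage with two FreeSurfer-style label indices.
volumes = regional_volumes("sub01_seg.nii.gz",
                           {"left_hippocampus": 17, "right_hippocampus": 53})
```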

Figure 3

Comparative analysis of brain volume measurements (33 regions) across 3T (1, blue), 64mT (2, green), SynthSR (3, red) and LoHiResGAN (4, orange).

The Tukey HSD test of the difference in mean volumes is presented in Table 4. There is a statistically significant difference in mean volumes when comparing 3T versus 64mT and also comparing 3T versus SynthSR measurements. However, there is no statistically significant difference in volumes comparing 3T versus LoHiResGAN measurements, confirming their close similarity. The test provides mean differences, adjusted p-values, and confidence intervals and indicates whether the null hypothesis is rejected for each pairwise comparison.

Table 4 Tukey HSD test results.

Further, we employed a linear regression model to assess the volumetric agreement between imaging techniques, focusing on gray matter (GM), WM, and CSF (Fig. 4). The comparison is made among the four methodologies: 3T, 64mT, SynthSR, and LoHiResGAN. For CSF measurements, the relationship between 3T and 64mT yields an \(R^2\) value of 0.6122, and for SynthSR an \(R^2\) value of 0.8226, indicating moderate to strong linear correspondence; the comparison between 3T and LoHiResGAN yields an \(R^2\) value of 0.9885, indicating an excellent linear association. For GM measurements, the relationship between 3T and 64mT shows an \(R^2\) value of 0.9848, and for SynthSR the \(R^2\) value is 0.9830; the association is stronger still between 3T and LoHiResGAN, with an \(R^2\) value of 0.9966. For WM measurements, a robust linear correlation is observed between 3T and 64mT with an \(R^2\) of 0.9734, and an \(R^2\) of 0.9930 for SynthSR; notably, this relationship peaks when 3T is paired with LoHiResGAN, yielding an \(R^2\) of 0.9989. These findings collectively underscore the precision and reliability of the LoHiResGAN method.
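The per-tissue regressions in Fig. 4 correspond to fits of the following form, sketched here with SciPy; the input arrays (one volume per participant, 50 observations) are placeholders.

```python
import numpy as np
from scipy.stats import linregress

def volume_agreement(ref_volumes, method_volumes):
    """Slope, intercept, and R^2 between reference (3T) volumes and a
    method's volumes, one value per participant (as in Fig. 4)."""
    fit = linregress(ref_volumes, method_volumes)
    return fit.slope, fit.intercept, fit.rvalue ** 2

# Placeholder arrays standing in for 50 per-participant tissue volumes.
slope, intercept, r2 = volume_agreement(np.random.rand(50) * 1e5,
                                        np.random.rand(50) * 1e5)
```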

Figure 4

Comparative linear regression analysis of GM, WM, and CSF measurements among 3T, 64mT, SynthSR, and LoHiResGAN MRIs with 50 observations. Slope equations are as follows: GM measurements: 3T vs. 64mT (\(y = 0.96x + 29748.69\)), 3T vs. SynthSR (\(y = 0.98x + 38928.24\)), 3T vs. LoHiResGAN (\(y = 0.97x + 34891.68\)). WM measurements: 3T vs. 64mT (\(y = 0.88x + 56356.04\)), 3T vs. SynthSR (\(y = 0.92x + 38242.40\)), 3T vs. LoHiResGAN (\(y = 0.99x + 10595.62\)). CSF measurements: 3T vs. 64mT (\(y = 0.65x + 135787.85\)), 3T vs. SynthSR (\(y = 0.78x + 90147.75\)), 3T vs. LoHiResGAN (\(y = 0.96x + 24091.23\)). The \(R^2\) values and the slope equations emphasize the precision and reliability of the LoHiResGAN method.

Quantitative segmentation

For structural overlap of the segmentations, the DICE similarity coefficient between the reference (3T) segmentation and the segmentations generated using 64mT, SynthSR, and LoHiResGAN images is shown in Fig. 5 for each brain region across all subjects. Using LoHiResGAN, the DICE similarity coefficient for synthetic 3T improves over the original 64mT across all brain regions, achieving scores mostly \(>0.9\) (where 1 indicates perfect agreement), with substantial improvement in clinically and scientifically important regions such as the cerebral cortex, hippocampus, and CSF. SynthSR alone shows milder improvement in the DICE similarity coefficient and, critically, performs worse than native 64mT for CSF. The observed variability in DICE scores across brain regions can be attributed to several factors, including the inherent complexity of a region’s anatomy, its contrast with surrounding tissues, and the effectiveness of the segmentation algorithm in delineating structural boundaries. The DICE coefficient is strongly related to a structure’s size, given its sensitivity to errors at the surface of a structure43; as such, smaller structures tend to show lower DICE coefficients, limiting the metric’s use for comparisons between different structures. The quantitative analysis of the mean DICE scores highlights the improved segmentation quality achieved by the synthetic 3T MRI compared to the 64mT MRI.

Figure 5

Comparison of mean DICE scores for synthetic 3T MRI (LoHiResGAN), 64mT MRI, and SynthSR segmentations across different brain regions versus 3T ground truth segmentation (without total ICV). Note, higher DICE scores indicate better agreement with the 3T ground truth segmentation.

In Fig. 6, T1-weighted images provide a detailed visualization of three specific neural regions: the caudate nucleus (first row), brain stem (second row), and globus pallidus (third row). Their 3D renderings, based on data from a 26-year-old male participant, further elucidate their structure. A pronounced disparity is evident in the pallidum region between the 64mT and LoHiResGAN images, whereas the brain stem and caudate show minimal differences across modalities. Importantly, in terms of image quality, the LoHiResGAN images bear a closer resemblance to the original 3T images, underlining their diagnostic potential.

Figure 6

T1-weighted images showcasing the caudate nucleus, brain stem, and globus pallidus in rows one, two, and three, respectively, with their 3D renderings from a 26-year-old male.

In addition to the in-distribution testing, we further explore the performance of the models in out-of-distribution scenarios, specifically focusing on the ability to capture and represent abnormal pathological areas. Preliminary results reveal that the abnormal pathological areas are clearly discernible in both T1-weighted and T2-weighted sequences of the synthetic 3T images. This is showcased in Fig. 7, which demonstrates superior consistency between the synthetic images and the corresponding acquired 3T images.

Figure 7

T1-weighted images (columns 1-3) and T2-weighted images (columns 4-6) using the 64mT MRI scan, 3T MRI scan, and LoHiResGAN, respectively, in a 60-year-old female with focal encephalomalacia. The second row displays the region of interest extracted from the corresponding images.

Discussion

This study presents a new deep learning-based image-to-image translation model, LoHiResGAN, to enhance the quality of low-field (64mT) MRI by generating synthetic high-field (3T) MRI scans from the low-field scans.

Comparative assessment of multiple state-of-the-art deep learning-based methods for T1-weighted and T2-weighted image quality shows that all achieve significantly better SNR and SSIM than the original 64mT scans. GAN-based methods, including LoHiResGAN, outperform the U-Net model on the NRMSE, PSNR, SSIM, and PIQE metrics, particularly for T2-weighted images. This superior performance highlights the potential of deep learning techniques, particularly GAN-based methods like LoHiResGAN, for improving the image quality of low-field MRI.

Brain regions show discrepancies in volume measurements at 64mT when compared to the reference 3T scans. Such discrepancies highlight the challenges of relying solely on low-field MRI scans for clinical diagnosis and planning. For instance, the overestimation of certain measurements in 64mT images may have significant clinical implications, especially in conditions like hydrocephalus. The need for more accurate reconstruction models is evident, emphasizing the potential role of deep learning solutions in improving the image quality of low-field MRI scans. LoHiResGAN, as one such solution, appears promising in addressing some of these challenges. However, it is crucial to acknowledge that no method is infallible: despite the improvements brought about by LoHiResGAN, a spectrum of inconsistencies remains unaddressed, suggesting room for further optimization and refinement. Interestingly, while some imaging methods seem to align in their representation of particular brain regions, detailed examination brings to light potential mislabeling issues, such as the misidentification of grey matter and white matter. Such nuances underscore the challenges in achieving a truly accurate representation of brain anatomy through imaging. It is imperative for future research to focus on these challenges; by refining these methods, we can ensure that they not only improve the visual quality of MRI scans but also provide data that clinicians and researchers can rely upon for accurate interpretations and decisions.

The results highlight the efficacy of LoHiResGAN in improving the DICE similarity coefficient for synthetic 3T MRI, particularly when compared against the original 64mT MRI. This is especially pertinent in regions pivotal for both clinical diagnosis and research, like the cerebral cortex, hippocampus, and CSF. While SynthSR’s capabilities appear more moderate, it is concerning that it sometimes underperforms the native 64mT, raising questions about its utility for certain applications. The variability in DICE scores across regions underscores the multifaceted challenges of MRI segmentation: the anatomy’s inherent complexity, varied tissue contrasts, and the proficiency of the segmentation algorithms all contribute to these discrepancies. Notably, the DICE coefficient’s sensitivity to structural boundaries, especially for smaller anatomical features, underscores the need for caution when employing this metric across diverse structures. Further, evaluations across varying orientations yielded consistent results. Such insights emphasize the importance of refining segmentation tools and methodologies, particularly as we navigate the nuances of different brain structures. The benefits of synthetic 3T MRI show that tools like LoHiResGAN could be valuable when traditional methods are not available, emphasizing the importance of further development in this area.

The out-of-distribution test in a patient with focal encephalomalacia highlights the potential of advanced image-to-image translation techniques to enhance diagnostic accuracy in pathologic cases, despite the model being trained exclusively on data from healthy subjects. Notably, however, the signal of the abnormality in this case remains that of CSF. Further research involving a more extensive and diverse dataset, including patients with various neurological pathologies, is needed to establish clinical utility and accuracy, and ultimately to contribute to the development of more robust image-to-image translation models.

Despite the encouraging findings, our study has limitations. Generalisability is constrained by the relatively small sample size, which consists solely of healthy subjects; this is especially true for patterns not represented in the training data. Nonetheless, for the purposes of volumetry, the sample spans participants with a broad range of ages and parenchymal volumes. A smoothing effect emerges in the generated images and may diminish the ability to resolve small structures. The underlying performance of the image segmentation method itself is not taken into account when conducting volumetric comparisons in our study; this can bias segmentation outcomes, although the automated results are concordant with visual inspection. Future research should address these limitations to further enhance the validity and reliability of the study findings.

Several future directions are worth exploring to further enhance the performance of image-to-image translation models for low-field to high-field MRI translation. Firstly, incorporating larger and more diverse datasets, including individuals with various neurological conditions, will help improve the generalizability and robustness of the model for clinical diagnostic purposes; this would ultimately contribute to better diagnostic accuracy and more targeted treatment planning in clinical settings. Secondly, exploring combinations of multiple image-to-image translation models, or even developing novel models tailored to specific brain regions or conditions, may yield further improvements in image quality and in the accuracy of brain morphometry measurements under different experimental settings. Lastly, integrating advanced generative deep learning techniques such as transformer models, diffusion models, and unsupervised learning approaches could potentially enhance the performance of image-to-image translation models in low-field to high-field MRI translation tasks by capturing more complex patterns and dependencies in the data.

In conclusion, the findings of this study demonstrate substantial, clinically significant limitations in native 64mT data and suggest that image-to-image translation models, such as LoHiResGAN, can substantially improve image quality, with synthetic high-field images approaching 3T quality. If shown to be reproducible across a range of brain pathologies, this will have significant implications for clinical and research settings, particularly in resource-limited settings where access to high-field MRI scanners may be limited. The spectrum of findings emphasizes the importance of considering similarity across different brain regions when evaluating the performance of image translation models. Further investigation into the factors contributing to regional discrepancies will enhance our understanding of the challenges associated with accurate brain structure measurements. Finally, by further exploring and refining these models, we can continue advancing the medical imaging field and contribute towards a more accurate and reliable assessment of brain structures and functions.