Deep learning network for integrated coil inhomogeneity correction and brain extraction of mixed MRI data

Chuang, Kai-Hsiang; Wu, Pei-Huan; Li, Zengmin; Fan, Kang-Hsing; Weng, Jun-Cheng

doi:10.1038/s41598-022-12587-6

Download PDF

Article
Open access
Published: 20 May 2022

Deep learning network for integrated coil inhomogeneity correction and brain extraction of mixed MRI data

Kai-Hsiang Chuang¹,
Pei-Huan Wu²,
Zengmin Li¹,
Kang-Hsing Fan³ &
…
Jun-Cheng Weng^2,4,5

Scientific Reports volume 12, Article number: 8578 (2022) Cite this article

1452 Accesses
1 Citations
4 Altmetric
Metrics details

Subjects

Abstract

Magnetic Resonance Imaging (MRI) has been widely used to acquire structural and functional information about the brain. In a group- or voxel-wise analysis, it is essential to correct the bias field of the radiofrequency coil and to extract the brain for accurate registration to the brain template. Although automatic methods have been developed, manual editing is still required, particularly for echo-planar imaging (EPI) due to its lower spatial resolution and larger geometric distortion. The needs of user interventions slow down data processing and lead to variable results between operators. Deep learning networks have been successfully used for automatic postprocessing. However, most networks are only designed for a specific processing and/or single image contrast (e.g., spin-echo or gradient-echo). This limitation markedly restricts the application and generalization of deep learning tools. To address these limitations, we developed a deep learning network based on the generative adversarial net (GAN) to automatically correct coil inhomogeneity and extract the brain from both spin- and gradient-echo EPI without user intervention. Using various quantitative indices, we show that this method achieved high similarity to the reference target and performed consistently across datasets acquired from rodents. These results highlight the potential of deep networks to integrate different postprocessing methods and adapt to different image contrasts. The use of the same network to process multimodality data would be a critical step toward a fully automatic postprocessing pipeline that could facilitate the analysis of large datasets with high consistency.

Generative models improve fairness of medical classifiers under distribution shifts

Article Open access 10 April 2024

Segment anything in medical images

Article Open access 22 January 2024

Microenvironmental reorganization in brain tumors following radiotherapy and recurrence revealed by hyperplexed immunofluorescence imaging

Article Open access 15 April 2024

Introduction

Advanced MRI techniques, such as diffusion tensor imaging (DTI), arterial spin labeling (ASL) and functional MRI (fMRI), have become essential tools to describe the structural and functional organization of the human brain to diagnose disorders non-invasively in humans and to resolve neural mechanisms and treatment targets using rodent or other animal models. With the construction of brain atlases and templates, information-rich multimodality data obtained from different subjects at different times can be co-registered to a common reference space for voxel-wise and regional analyses to describe the effects of aging, disorders and therapeutics. To achieve accurate co-registration, images must be corrected for head motion, geometric distortion, and coil B1 field inhomogeneity. Then, the brain is extracted by removing signals from the scalp and other tissues before linear/nonlinear transformation can be applied to match the individual brain images to the template. Because inferior inhomogeneity correction and brain extraction can result in dislocated or biased findings¹, methods to improve the intensity uniformity and skull stripping are essential.

To allow automatic data processing, various methods have been developed for coil inhomogeneity correction, such as N4² and brain extraction, such as BET³, MONSTR⁴, 3D PCNN⁵, RATS⁶, and SHERM⁷. However, manual adjustments and editing are still required to obtain optimal results⁸. The need for user intervention results in slow data processing and variable quality from different users, which affects results. Also, these algorithms are typically developed for structural MRI. Performance is degraded when applying to echo-planar imaging (EPI), which is widely used for DTI, ASL and fMRI, due to different contrasts (e.g., T2/T2*-weighted versus T1-weighted), spatial resolutions (e.g., 2–3 mm versus 1 mm) and geometric distortions. Because studies have suggested that direct registering EPI to the brain template would be more accurate⁹, methods that can perform well on EPI are required. In addition, most processing methods have been developed for humans and are typically difficult to apply to animal data due to the different anatomy and resolutions involved.

Deep learning neural networks have become one of the most popular techniques of image processing in recent years¹⁰. The development of deep learning was previously constrained by computing power. With the advance of general-purpose computing on graphics processing units¹¹, an algorithm with a complex deep network topology of several hundred million trainable parameters can be completed within a few days, which allows the application of deep learning networks to various challenging biomedical image processing issues (for review, see¹²), including brain extraction and segmentation¹³, Nyquist ghost removal and motion correction^14,15,16. For example, variants of 3D convolutional networks were initially successfully applied for brain extraction from single contrast T₁-weighted MPRAGE data^17,18,19 and expanded to multiple contrast modalities or even pathological brains using large multicenter multicontrast training data²⁰. Yoganada et al. combined DenseNet and U-Net to segment gray matter/white matter/CSF from brain MR images¹³. They used a 3D framework with many trainable parameters to achieve high accuracy, but the kernel size and filters of the convolution layer were constrained to allow the topology to converge. Similar attempts have been explored recently for the brain extraction of non-human primates²¹ and rodents²². Conventional intensity correction methods are still required for preprocessing before deep learning-based brain extraction. A deep learning network has also been developed to correct coil inhomogeneity²³. However, this method was developed for a rather uniform head or body coil; its performance for images acquired by highly inhomogeneous surfaces or array coils is thus unclear. Overall, current deep-learning-based methods have several limitations. First, they are primarily designed for structural MRI and are thus difficult to apply to EPI data, which suffer from inferior image quality. Second, although skull stripping networks could process multicontrast structural MRI, methods for EPI data are still optimized for a specific contrast and thus difficult to apply to spin-echo (T₂-weighted) and gradient-echo (T₂*-weighted) EPI, which are commonly used for DTI and fMRI, respectively. Third, these methods are all designed for a single postprocessing procedure. Therefore, the data processing pipeline would still be limited by user intervention being required in the remaining processing.

In this study, we address these issues of deep learning networks designed to overcome two limiting steps of the brain MRI processing pipeline: coil inhomogeneity correction and brain extraction. We solved these issues by using generative adversarial nets (GANs)^24,25 with expansion of the 2D GAN model to 3D. Because brain extraction requires coil inhomogeneity correction, we trained GANs to combine these two separate processing together. To manage different image contrasts, particularly spin-echo and gradient-echo EPI, GANs were trained using either a single contrast or multiple contrasts. Results demonstrate high accuracy and consistency compared to manually adjusted automatic methods like N4 and PCNN.

Results

Model training

In the GANs, the generator and discriminator compete with each other in a zero-sum game. Model training is similar to a seesaw battle: if one side is too strong, the loss function tends to oscillate strongly (Fig. 1A). If the vibration exceeds a certain critical point, the system has a high probability of breakdown, as shown in Fig. 1B. When that happens, the quality of the pseudoimage generated will worsen with more iterations. To prevent the model from oscillating, we use a small learning rate for the generator and discriminator ($2\times {10}^{-4}$ and $1\times {10}^{-6}$, respectively) in all the experiments. The Adam optimizer was used in both the discriminator and generator²⁶. This process resulted in a stable trend, as shown in Fig. 1C.

Three types of GAN models were trained by gradient-echo EPI (GE-EPI) only, spin-echo EPI (SE-EPI) only and the mixed dataset. Supplementary Fig. 1 shows example results during training over 5 selected iterations using the GE-EPI-only dataset. Starting from a noisy image, the predicted image became increasingly similar to the correct target image when the GAN was trained with more iterations. With > 2000 iterations, the rough details of the brain were shown. With > 10,000 iterations, the predicted image became similar to the target. The histograms also became increasingly more similar to those of the target images, indicating the method’s effectiveness at removing the bias field of the surface coil. The models trained by the SE-EPI-only (Supplementary Fig. 2) or mixed (Supplementary Figs. 3 and 4) datasets showed similar converging trends. In particular, the model could remove the external tissue and the reference phantom over the rat’s head. The similarity indices of the training data also show progressive improvements with iterations (Supplementary Fig. 5).

Choosing the best model

With more iterations, the images generated by GANs became increasingly similar to the targets, which made it difficult to identify the best model. Because there is no good standard to determine how many iterations are required to train a model, we used a strategy that chose the model that could generate the best validation results. We calculated the similarity indices of the test data on all the models trained at each iteration and selected the one producing the best comprehensive performance as the final model. Figure 2 shows the similarity indices of the testing data for models over iterations. Then, we chose the model with 11,744 iterations for mouse-only, 14,356 iterations for rat-only, 16,539 iterations for mix dataset because they had the best mean cosine angle distance (CAD), Euclidean distance (L2 norm) and mean structural similarity (MSSIM). More specifically, we chose the model that had the highest mean MSSIM score in each experiment because the MSSIM was more consistent with visual inspection.

Testing using data from the same modality

Figure 3 shows representative results after applying GE-EPI or SE-EPI test data to the models trained by GE-EPI-only, SE-EPI-only and mixed datasets. Compared to the target image, the outputs of GANs show preserved tissue contrast but improved uniformity, with histograms similar to those of the target images. The GANs could even perform better than the target image. In Fig. 3B, the target image showed an abrupt intensity drop at the bottom of the brain, while the GAN output uniformly covered the entire brain. The target image in Fig. 3D appeared to be overcorrected for coil inhomogeneity so that the contrast between the gray and white matter was low. Conversely, the GAN output a uniform image with preserved tissue contrast. Interestingly, even when the SE-EPI and GE-EPI data have different image quality and features (e.g., external tissue intensity and phantom object), the performance of the mixed modality-trained model is comparable to that of the single-modality model. This result indicates that the 3D pix2pix network could learn distinct features of the mixed dataset.

The performance of coil inhomogeneity correction was evaluated by comparing the similarity of intensity distribution, quantified using CAD, L2 norm and MSSIM (Table 1). The quality of brain extraction was quantified using the Dice index (Table 2). Overall, the performance of the GAN models was consistently high. Even though the training data exhibited a large deviation in the MSSIM for the GE-EPI only and mixed datasets due to the particular intensity distribution of certain subjects (see “Discussion” below), the performance of the testing data was consistent across subjects with a small standard deviation. The performance of the mixed-modality model was also comparable to that of the single-modality model.

Table 1 Similarity indices of test data for each corresponding model.

Full size table

Table 2 Dice index of GANs and N4 + PCNN outputs.

Full size table

Testing using multimodality data

To evaluate how well the three models could manage data that were either familiar (same type of EPI as the training data) or unfamiliar (e.g., GE-EPI for SE-EPI-only trained), we applied the testing data for the mixed group to all three models and compared their performances (Fig. 4A). As expected, the test data of the same modality had high similarity indices, while the performance degraded markedly with unfamiliar data types. The performance on the test data was marginally inferior to that of the training data, which had similarity indices that were closer to ideal. Interestingly, the model trained by mixed data outperformed the model trained by the familiar but single modality data. For the CAD, the mixed model performed (0.973 ± 0.003; mean ± SEM) much better than the SE-EPI trained model (0.969 ± 0.003, p < 0.001) for the SE-EPI test data. For the GE-EPI test data, the mixed model performed (0.968 ± 0.002) marginally better than the GE-EPI trained model (0.966 ± 0.002, p < 0.05). Similar trends were also shown in the L2 norm and MSSIM. Therefore, training with more diverse data enhanced the capability of the network.

Using the combination of two popular automatic methods (N4 and PCNN) as a benchmark, we compared the Dice indices of the GAN outputs of the test data (Fig. 4B). To achieve the best results of the conventional methods, the parameters of N4 and PCNN were optimized for each individual. For the SE-EPI data, the Dice indices of models trained by the SE-EPI-only (0.955 ± 0.008) or mixed (0.956 ± 0.008) datasets were comparable to those of the N4 + PCNN (0.964 ± 0.010) but not the model trained by the GE-EPI-only data (0.871 ± 0.044; p < 0.0001). Similarly, the GE-EPI-only (0.923 ± 0.024) and mixed (0.923 ± 0.024) dataset trained models performed as well as the N4 + PCNN (0.915 ± 0.032), except for the SE-EPI-only trained model (0.858 ± 0.024; p < 0.0001). In addition, the mixed-trained model generally exhibited smaller variations than N4 + PCNN, indicating more consistent performance.

Discussion

This study addressed two issues of deep-learning-based brain extraction: dependency on coil inhomogeneity and inflexibility for multimodality data. We used a deep learning network (3D pix2pix) to automate two critical and labor-intensive steps—coil inhomogeneity correction and brain extraction—in the brain EPI postprocessing pipeline. Results shows that different types of MRI postprocessing can be combined into one network, which can streamline the data processing pipeline and can reduce operator-dependent variations and bias in different processes. This network model can manage both spin- and gradient-echo EPI that are commonly used in DTI, ASL and fMRI. The capability of processing these major data types could facilitate the data analysis of the most commonly used advanced neuroimaging data. The model performed as well as existing automatic methods (N4 and PCNN) that were individually adjusted and optimized. Particularly, the results of the proposed method are more consistent than those from automatic methods. Although the proposed method was demonstrated using rodent data in this proof-of-concept study, similar network models could be built and trained using human data to streamline the workflow of clinical MRI data processing.

Criteria for stopping the training of a model

There is no common standard about how many iterations are required to train model sufficiently. The original GAN paper mentioned that a global optimum solution exists for an ideal model such that the model iterated countless times will converge to the global optimum. In that case, fake data from the generator infinitely approach real data, and the discriminator can no longer distinguish real and fake data. Because there is no ideal model in reality, the proposed models may reach a local optimum solution^27,28 or undergo mode collapse in training²⁹. When we inspected the similarity indices calculated using the training dataset itself, the performance increased monotonically, and thus, the optimal performance would be achieved with infinite iterations. Therefore, we chose to use test data to select the best model. However, this approach suffers from the fact that the best model may be different for different test datasets. To overcome this issue, a more diverse testing dataset from different MRI scanners may be used to evaluate the overall performance across datasets from different sites.

Performance of coil inhomogeneous correction

By visual inspection of the pseudoimages and their histograms generated by GANs, results were found to be similar to those of the target images, indicating good performance of the GANs in learning the corrected intensity distribution. To quantify the difference in the intensity distribution, we compared the CAD, L2 norm and MSSIM between the pseudo and target images (Table 1). The high similarity of the training data indicated that the model learned the importance features and relationships of the source and target images. Comparing the three datasets, the GE-EPI-only model had better performance than SE-EPI-only, which was likely due to the larger GE-EPI training dataset; thus, the model could learn more features for generating pseudoimages. Also, the image complexity of the SE-EPI data is much higher than that of the GE-EPI because the scalp and muscle signals are stronger in the SE-EPI in addition to the presence of an external phantom. Although the similarity indices of the SE-EPI-only model were inferior, the quality of the pseudoimage remained high, and only a few differences at the edge of the brain tissue could be identified. For the same reason, the similarity indices of the GE-EPI-only model were marginally worse than those of the mixed-trained model. Overall, there are many more distinct features in the mixed modality dataset that must be learned than in single-modality datasets. Under the same topology and hyperparameters, the mixed-trained model could learn complex features and achieved high similarity indices.

Performance of brain extraction

Using the Dice index, the overlap between the ideal and automatically extracted brain masks was evaluated. Overall, the GANs achieved comparable accuracy compared to those of the two existing methods (N4 and PCNN) together. The extraction results of SE-EPI were better than those of GE-EPI, likely due to better image contrast and less blurring.

The two tasks (inhomogeneity correction and brain extraction) can potentially be processed by two network models separately. For example, GAN is used for the first step, and U-Net is used for the second step. When using two models in two stages to map two data distributions, it could suffer from error propagation. The first model may contribute some error and lead to more error in the prediction of the second model. The first model will also require more time to fine-tune the hyperparameters and training, as well as additional storage space for saving weights. Here we demonstrated that GANs can successfully combine these two processing in one network.

Testing GANs using multimodality data

To evaluate how a trained model performs with data that are completely different from the training data, we tested the model trained by a single modality (i.e., GE-EPI-only and SE-EPI-only) dataset with the test data of the mixed model. The outcome was, as expected, not ideal when the data type was unfamiliar. The model trained by the SE-EPI-only data performed better than the model trained by GE-EPI-only, likely due to more features in the SE-EPI data that allow the model to process GE-EPI data but not conversely.

Comparison to other network models

Deep learning has been applied to extract or correct different types of structural information, such as brain tumors, gray/white matter tissue segmentation, skull stripping or EPI distortion correction. Although these methods all belongs to image segmentation, each has its own special focus and challenge. Brain tissue segmentation focuses on classifying voxels of different intensity distributions with less consideration of morphology. Brain extraction focuses on identifying the boundary of the brain while ignoring intensity distributions of different tissue types. Recently, a few studies have used a popular deep learning network, U-net, for automatic brain extraction. U-net has been a popular method for segmentation in medical images. Huang et al. applied 3D U-Net to T1-weighted structural MRI of the human brain¹⁸. Similarly, Pontes-Filho et al. applied a standard U-net on SE-EPI of the mouse brain, where they used a data augmentation strategy to increase the training data using elastic affine transform³⁰. Hsu et al. also applied U-net to mixed T2-weighted structural MRI and GE-EPI datasets of both rats and mice, demonstrating the feasibility of processing mixed data²². The latter two studies used 2D U-net to process slice-by-slice instead of 3D volume. Also, these studies either used data that were acquired with a homogeneity coil or applied inhomogeneity correction as a preprocessing step. Compared to the GAN in this study, we used U-Net as the generator together with PatchGAN as the discriminator. The advantage of such a competing network design is the ability of the discriminator to learn the features of the desired output instead of relying on pixel intensity differences, which improves the generator in creating better overall results without being affected by pixel-level variations.

Outliers

As shown in Table 1 and Fig. 4, most similarity indices of the training data are high except in certain cases. Although the predicted images of these outliers appeared to be nearly identical to their targets, we found that there were low intensity voxels after adjusting the display contrast (Supplementary Fig. 6). All corresponding training targets had the lowest intensity voxel inside the brain instead of the background. When the intensity was normalized to 0–1, the background of these training targets was not zero, as opposed to the “good” training targets, whose background value was zero. This result led to poor similarity indices, particularly for MSSIM because it is sensitive to structure, contrast and brightness differences between two images. This type of data constitutes approximately 2% of the training data.

Limitation

This study suffers from the following limitations. First, the training and test datasets were acquired using similar coils (i.e., single-loop surface coils but different sizes for rats and mice). Because the coil intensity profile depends on the geometry of the coil in relation to the brain, images acquired by a different coil, such as an array coil, will have different intensity inhomogeneity features. The efficacy of the intensity correction under different coils remains to be evaluated. Second, the images have similar pulse sequence parameters, particularly the echo time, and resolutions. Because the echo time and spatial resolutions could affect the contrast and distortion of the EPI, they would affect the feature contents of the data and thus the performance of the trained model. Third, the “gold standard” of coil intensity correction was generated by a popular algorithm, N4. Therefore, the intensity correction by the GAN could only be as good as N4. The performance of N4 depends on the adjusted parameter, and sometimes does not generate a completely uniform profile. Better ways to generate the gold standard, such as measuring the coil B1 profile, is preferable. As the real B1 field depends on the object and EPI distortion, the B1 field acquired using a pulse sequence (e.g., conventional gradient echo) that has different distortion from the EPI sequence will result in a different B1 field. Previous studies that develop this kind of correct methods usually used synthetic data by applying an assumed bias field as weighting factor to a perfect image without bias field³¹ or used N4, as in this study³². Using an assumed bias field may be acceptable for structural MRI but is not suitable for highly distorted EPI. Obtaining a true “gold standard” thus remains a challenge. Fourth, brain anatomy was associated with a specific type of EPI. SE-EPI was obtained from the rat brain, and GE-EPI was obtained only from the mouse brain. There is no SE-EPI data of mouse or GE-EPI data of rat. Because the brain structures of rats and mice are similar, the influence of anatomical differences on processing performance is expected to be small. Future studies that include more diverse datasets will be required to clarify this issue. Fifth, typical multiband EPI acquisition could suffer from slice leakage artifacts that impact the image quality and bias field. The proposed multiband GE-EPI avoided the brain in different slices from overlapping with each other during acquisition and therefore did not suffer from slice leakage in typical multiband EPI on a clinical MRI scanner. However, this artifact may affect the accuracy when expanding this technique to multiband EPI data from clinical scanners. Finally, we did not have sufficient data to divide the dataset into three parts. Data augmentation is a promising technique to overcome issues associated with limited training data³³ and to balance the number of GE-EPI and SE-EPI data in future work.

These limitations are primarily due to the fact that data were obtained from a single site. Although several recent studies have provided their image data in open repositories, they typically only provide raw images but not processed data, such as the brain mask, which are critical for training models and testing new algorithms. Future studies will benefit from open data that share the processed data. To promote this initiative, the imaging data used and analyzed in this study are available from the corresponding author on reasonable request.

Conclusion

In this article, a deep learning model, 3D pix2pix, that is designed to combine automatic coil inhomogeneity correction and brain extraction was developed and validated using several quantized similarity indices. With sufficient training data, the model can effectively combine these two district and operator-dependent image processing steps in an advanced neuroimaging data processing pipeline. Further development and refinement of the model could allow fully automated data processing without user intervention and thus improve the efficiency of processing big data. Although this study only demonstrated the method using rodent EPI data, a similar algorithm should be trained and applied to human data.

Methods

Generative adversarial networks

The GAN architecture primarily has two parts, a discriminator (D) and a generator (G), which play two different roles in the algorithm. The goal of the discriminator is to distinguish whether an input image comes from the generator. Conversely, the goal of the generator is to generate a pseudoimage that can deceive the discriminator. The relationship between discriminator and generator is similar to that of a vaccine and bacteria or a predator and its prey; they have opposite purposes in the algorithm. In an iterative adversarial process, the ability of both sides increases. Therefore, an image generated by the generator will be increasingly similar to the real image. The objective function from the original paper of GANs is shown in (1):

$$\underset{G}{\mathrm{min}}\underset{D}{\mathrm{max}}V\left(D, G\right)= {\mathbb{E}}_{x\sim {p}_{data}\left(x\right)}\left[\mathrm{log}D\left(x\right)\right]+{\mathbb{E}}_{z\sim {p}_{z}\left(z\right)}\left[\mathrm{log}\left(1-D\left(G\left(z\right)\right)\right)\right]$$

(1)

where $x$ is data acquired from the real world, $z$ is random noise for the generator, $D\left(x\right)$ is the output of the discriminator, and $G\left(z\right)$ is the generated fake data. ${p}_{data}$ and ${p}_{z}$ are the distributions of real-world data and random variable z, respectively. subscript $x\sim {p}_{data}$ represents that $x$ belongs to ${p}_{data}$. ${\mathbb{E}}$ is the expected value. The designed output range of the discriminator is between 0 and 1. This objective function is expected to train the discriminator to distinguish every real-world data x from the fake data $G\left(z\right)$.

3D pix2pix

To process image data, we used pix2pix, which is an extended topology of GANs that can convert a picture with a certain style into another style³⁴. pix2pix consists of a PatchGAN classifier as the discriminator and a U-net as the generator³⁵. Different from the traditional classifier, which maps an image onto a single number, the PatchGAN classifier maps an image onto a $\mathrm{M}\times \mathrm{M}$ patch, and every element in this patch has its own receptive field. There are several benefits of the PatchGAN classifier. First, images of different matrix sizes can be verified by the same PatchGAN. Second, this method does not need to process an entire image each time so that a PatchGAN model has fewer weighting parameters and thus will be more efficient. U-net has an autoencoder-like topology. The biggest difference between U-net and autoencoder³⁶ is that U-net has a shortcut between the encoder and decoder. The function of shortcut is to provide location information of a pixel of an image from the encoder block to the decoder block, making the decoder produce a higher quality image.

Particularly, pix2pix is a type of conditional GAN (cGAN)³⁷ that imposes an additional condition on the discriminator and the generator. After training, the output image of the generator will be constrained by the condition, and the discriminator must distinguish the authenticity of the input image and determine whether there is a relationship between the input image and the additional condition. The condition can be a label of class, vector, a portion of data from different modalities, or even a certain type of image. The loss function of cGANs is shown in (2):

$${\mathcal{L}}_{cGAN}\left(G,D\right)={\mathbb{E}}_{c,x}\left[\mathrm{log}D\left(c,x\right)\right]+{\mathbb{E}}_{c}\left[\mathrm{log}\left(1-D\left(c,G\left(c\right)\right)\right)\right]$$

(2)

where $c$ is the condition. In this study, we set the raw image as $c$ and the ideally inhomogeneity corrected and brain extracted image as $x$. There are two input states for the discriminator: the first takes “c” and “x” as input, denoted as “D(c, x)”, where the variable “c” represents the raw image that is affected by intensity bias and the variable “x” represents the image with bias field corrected; and the second takes the raw image “c” and a fake image G(c) as input, denoted as D(c, G(c)). The topology of the generator only requires a single input (“c”). According to the original pix2pix paper³⁴, the noise variable z is not necessary; thus, we did not use random noise as input. pix2pix then mixes the cGAN loss function with traditional loss functions, such as the L1 or L2 distance, to enhance sharpness. In this study, we used the L1 distance as in (3):

$${\mathcal{L}}_{L1}\left(G\right)={\mathbb{E}}_{c,x(L1)}\left[{\Vert x-G\left(c\right)\Vert }_{1}\right]$$

(3)

Therefore, the final loss function is:

$${G}^{*}=arg\underset{G}{\mathrm{min}}\underset{D}{\mathrm{max}}{\mathcal{L}}_{cGAN}\left(G,D\right)+\lambda {\mathcal{L}}_{L1}\left(G\right)$$

(4)

where $\lambda$ is 100, as used in the original pix2pix paper³⁴. To prevent pseudoimages between slices from forming discontinuities, we expanded the original 2D framework of the pix2pix architecture into 3D so that it could properly process the volumetric data. The original pix2pix is based on a 2D framework, which means that the shape of the input/output/hidden layers of the generator/discriminator and all the calculations (e.g., convolution, padding, pooling or stride) are suitable for a two-dimensional image. In this study, we added one more dimension to the input/output/hidden layers and called the “Conv3D” and “Conv3DTranspose” functions from the Keras API to perform the calculation in three dimensions with additional parameters for the third dimension. In addition, we also modified the layer number and hyperparameters of the U-net to have 6 layers of encoders and decoders.

To find a set of hyperparameters to balance the generator and discriminator, we used a search strategy that is similar to the “grid search”. The key hyperparameters to be tuned are learning rates for the generator and discriminator. We used a list of numbers in different orders of magnitude, such as ${2\times 10}^{-3}$, ${1\times 10}^{-3}$, ${5\times 10}^{-4}$, …, ${2\times 10}^{-6}$, and ${1\times 10}^{-6}$. Based on the trends of loss in the early stage of the training process, we can determine whether to stop the process. If the process was stopped, then the next combination of learning rates was evaluated.

Dataset

Three experiments were conducted using either spin-echo EPI (SE-EPI) of the rat brain, gradient-echo EPI (GE-EPI) of the mouse brain, or both (mix). A total of 87 rat brain scans (male Wistar rat, n = 87) and 403 mouse brain scans (male C57BL/6, 373 scans from n = 78 mice; male rTg4510 mouse, n = 30) were used in this study. Each C57BL/6 mouse was scanned with 1 or 2 sessions and in 2 to 3 repeated runs of scans acquired in each session. Only one run was acquired from each rTg4510 mouse and Wistar rat. The rat experiment was approved by the Institutional Animal Care and Use Committee of the Biomedical Sciences Institutes, A*STAR, Singapore. The mouse experiment was approved by the Animal Ethics Committee of the University of Queensland and conducted in compliance with the Queensland Animal Care and Protection Act 2001 and the current National Health and Medical Research Council Australian Code of Practice for the Care and Use of Animals for Scientific Purposes. The study was also carried out in compliance with the ARRIVE guidelines. Rat brain data were obtained from a published study³⁸ and were originally acquired using a volume coil for transmission, a 15-mm single-loop coil for receiving and SE-EPI with TR/TE = 2000/45 ms, 150 or 300 repetitions, and a spatial resolution of 0.4 × 0.4 × 1 mm³ and 0.1 mm slice gap (for details, see³⁸). Mouse brain data were acquired using a volume coil for transmission, a 10-mm loop coil for receiving and multiband GE-EPI with TR/TE = 300/15 ms, 2000 repetitions (10 min), spatial resolution of 0.3 × 0.3 × 0.5 mm³ and 0.1 mm slice gap (for details, see³⁹). All analyses were operated in real-valued data.

The EPI time-series data were motion corrected using FSL mcflirt (https://www.fmrib.ox.ac.uk/fsl) and then averaged to obtain a mean image, which was corrected for coil inhomogeneity using N4 (implemented in ANTs; http://stnava.github.io/ANTs/) with settings that were separately adjusted for rat and mouse data. Then, automatic brain extraction was conducted using PCNN (https://sites.google.com/site/chuanglab/software/3d-pcnn), followed by manual editing. To obtain optimal results from the PCNN, the expected brain size option was particularly adjusted for each data point. The brain masks generated by the optimized PCNN without editing were used for comparison with the GAN results and resulted in an image pair composed of the motion-corrected mean image (source) and the inhomogeneity-corrected and brain-extracted image (target). These pairs were divided into three datasets (SE-EPI-only, GE-EPI-only and mixed), where the latter includes both SE-EPI and GE-EPI datasets. Each dataset was randomly split into training and test data for training and testing a different GAN (Table 3). We wanted to maximize the amount of training data; thus, we only used approximately 10% as testing data (9.8% and 9.6% for the GE-EPI and mixed data, respectively). There were 36 test samples for the GE-only model, 13 test samples for the SE-only model and 43 test samples for the mixed model. We merged the GE-only and SE-only datasets into a mixed dataset, and then split the mixed dataset into a training and testing set. The test samples were chosen randomly, and this study was performed without data augmentation.

Table 3 Summary of the combinations of datasets used in three experiments.

Full size table

Because the input and output intensity of the network model ranged from -1 to 1, the intensity of each dataset was normalized to this range by the maximum and minimum intensities with the following equation:

$$\left\{ {\begin{array}{*{20}l} {\hat{A} = 2 \times \left( {\frac{A - m}{d}} \right)} \hfill \\ {m = \frac{max\left( A \right) + min\left( A \right)}{2}} \hfill \\ {d = max\left( A \right) - min\left( A \right)} \hfill \\ \end{array} } \right.$$

(5)

where $A$ is the original image and $\widehat{A}$ is the normalized image. To present the source, target and GAN outputs as positive values in the results section, their intensities were rescaled to between 0 and 1 by multiplying by 0.5 and adding 0.5.

Implementation

We use an open source API, Keras (http://keras.io/), for deep-learning model building, training, prediction and testing. The program runs on a GPU-based server ESC8000 G4 with GeForce 1080 Ti (Nvidia, USA). Figure 5 shows the topology of the U-Net and PatchGAN classifiers and the parameters of U-Net, including the number of feature maps. Batch normalization was used and is denoted as BN in Fig. 5. The “shortcut” symbol represents the “concatenate” operation. The hidden layer settings are as follows: because the best initial point of the algorithm was unknown, we set the initial status of weight parameters for all layers as in the original pix2pix paper when setting the initializer, which is a normal distribution with a standard deviation of 0.02³⁴. The sizes of both 3D convolution and 3D transpose convolution were 4 × 4 × 4, and both strides were 2 along each dimension. The training process included 200 epochs in total and a batch size of 1. The PatchGAN output size was 4 × 4 × 4.

Three models were trained by the 3 combinations of datasets. In each model, the weighting parameters of the model were initialized as random numbers of normal distribution. We set the epoch to 200 for model training in all experiments (mouse-only: 200 epochs × 367 samples = 73,400 iterations; rat-only: 200 × 74 = 14,800; mix: 200 × 447 = 89,400). This setting means that the training ran through up to 73,400, 14,800 and 89,400 iterations for the three datasets, respectively. The model at each epoch was saved to evaluate how the model improved over the epoch.

Quantitative evaluation

To test the performances of the models trained by the three combinations of datasets, we use other sets of data. In particular, the rTg4510 mouse data were used to test the performance of the model that was trained solely by the C57BL6 mouse data. Both the rat and mouse testing data were applied to all three models to evaluate how well they manage images of different MRI protocols. The GAN output was compared with the reference target image and the automatic brain mask generated by one of the most popular rodent brain extraction methods, the PCNN. We used the following quantized methods to evaluate the similarity of two images: cosine angle distance (CAD)⁴⁰, Euclidean distance (L2 norm)⁴⁰, mean square error, peak signal-to-noise ratio, and mean structural similarity (MSSIM)⁴¹. Because the L2 norm, mean square error, and peak signal-to-noise ratio are linear combinations of each other, only the L2 norm is reported. In addition, the Dice index was used to compare the brain masks⁴². Their definitions are described below:

Assuming that images $A$ and B each have $N$ voxels, the intensity of each voxel in these images can be expressed as linear arrays (6):

$$A=\left[\begin{array}{l}{a}_{1}\\ {a}_{2}\\ \vdots \\ {a}_{N}\end{array}\right], B=\left[\begin{array}{c}{b}_{1}\\ {b}_{2}\\ \vdots \\ {b}_{N}\end{array}\right]$$

(6)

The following formulae were used to calculate the similarity indices.

CAD:

$$CAD\left(A, B\right)=\mathrm{cos}\left(\theta \right)=\frac{A\cdot B}{\Vert A\Vert \times \Vert B\Vert }$$

(7)

where CAD is the cosine angle distance between two images, and the range is [− 1, 1]. The closer the value of CAD is to 1, the more similar two images are.

L2 norm:

$$L2\left(A, B\right)=\sqrt{{\sum }_{i=1}^{N}{\left({a}_{i}-{b}_{i}\right)}^{2}}$$

(8)

The smallest value of the L2 norm is 0. The closer the value of the L2 norm is to 0, the more similar two images are.

MSSIM:

$$\left\{\begin{array}{l}SSIM\left(A, B\right)={\left[l\left(A, B\right)\right]}^{\alpha }{\left[c\left(A, B\right)\right]}^{\beta }{\left[s\left(A, B\right)\right]}^{\gamma }\\ l\left(A, B\right)=\frac{2{\mu }_{A}{\mu }_{B}+{C}_{1}}{{\left({\mu }_{A}\right)}^{2}+{\left({\mu }_{B}\right)}^{2}+{C}_{1}}\\ c\left(A, B\right)=\frac{2{\sigma }_{A}{\sigma }_{B}+{C}_{2}}{{\left({\sigma }_{A}\right)}^{2}+{\left({\sigma }_{B}\right)}^{2}+{C}_{2}}\\ s\left(A, B\right)=\frac{{\sigma }_{AB}+{C}_{3}}{{\sigma }_{A}{\sigma }_{B}+{C}_{3}}\\ {\mu }_{A}={\sum }_{i=1}^{N}{w}_{i}{a}_{i},{ \mu }_{B}={\sum }_{i=1}^{N}{w}_{i}{b}_{i}\\ {\sigma }_{A}=\sqrt{{\sum }_{i=1}^{N}{w}_{i}{\left({a}_{i}-{\mu }_{A}\right)}^{2}}, {\sigma }_{B}=\sqrt{{\sum }_{i=1}^{N}{w}_{i}{\left(b-{\mu }_{B}\right)}^{2}}\\ {\sigma }_{AB}={\sum }_{i=1}^{N}{w}_{i}\left({a}_{i}-{\mu }_{A}\right)\left({b}_{i}-{\mu }_{B}\right)\\ MSSIM\left({A}_{global},{B}_{global}\right)=\frac{1}{M}{\sum }_{i=1}^{M}SSIM\left({A}_{i}, {B}_{i}\right)\end{array}\right.$$

(9)

where ${A}_{global}$ and ${B}_{global}$ are the two images that we want to compare. $A$ and $B$ are local windows of ${A}_{global}$ and ${B}_{global}$, with ${a}_{i}$ and ${b}_{i}$ located inside windows $A$ and $B$, respectively. The SSIM is composed of three factors: luminance ($l$), contrast ($c$) and structure ($s$), which were calculated from local statistics ${\mu }_{A}$, ${\sigma }_{A}$ and ${\sigma }_{AB}$ weighted by a circular-symmetric Gaussian weighting function ${\varvec{w}}=\left\{{w}_{i}|i=1, 2,\dots ,N\right\}$ with a standard deviation of 1.5 samples, normalized to unit sum $\left({\sum }_{i=1}^{N}{w}_{i}=1\right)$. Users can decide which factor is the most important by adjusting their corresponding weights $\alpha$, $\beta$ and $\gamma$. To prevent the denominator and numerator from both being equal to zero, ${C}_{1}$, ${C}_{2}$ and ${C}_{3}$ are small scalars. In this study, $\alpha$, $\beta$, and $\gamma$ were all set to 1; and ${C}_{1}$, ${C}_{2}$, and ${C}_{3}$ were set to ${10}^{-4}$, ${9\times 10}^{-4}$, and ${4.5\times 10}^{-4}$ accordingly. $M$ is the total number of windows throughout the image, and a value of $58\times 10\times 58$ was used. The range of MSSIM is [− 1, 1], with values closer to 1 representing the highest similarity.

Dice index:

$$DI\left(p,\widehat{p}\right)=\frac{2\times \left|p\cap \widehat{p}\right|}{\left|p\right|+\left|\widehat{p}\right|}$$

(10)

where $p$ is the binarized target, and $\widehat{p}$ is the binarized GAN output or PCNN brain mask. An intensity threshold of 0.05 was used to turn the target and GAN output into binary masks with voxel intensity exceeding the threshold set to 1 and 0 otherwise. The range of the Dice index is between 0 and 1, with a larger value representing more overlap between two brain masks.

Statistical analysis

To determine the performance difference between GAN models and methods, between-group comparisons were applied to the above indices using the nonparametric Friedman test (Prism, GraphPad Software, Inc., USA). P < 0.05 with correction for multiple comparisons using Dunn’s method was regarded as significant. Except particularly noted, values are reported as the mean ± standard deviation.

Data availability

The datasets used and analyzed during the current study are available from the corresponding author on reasonable request.

References

Acosta-Cabronero, J., Williams, G. B., Pereira, J. M., Pengas, G. & Nestor, P. J. The impact of skull-stripping and radio-frequency bias correction on grey-matter segmentation for voxel-based morphometry. Neuroimage 39, 1654–1665 (2008).
Article Google Scholar
Tustison, N. J. et al. N4ITK: Improved N3 bias correction. IEEE Trans. Med. Imaging 29, 1310–1320 (2010).
Article Google Scholar
Smith, S. M. Fast robust automated brain extraction. Hum. Brain Mapp. 17, 143–155 (2002).
Article Google Scholar
Roy, S., Butman, J. A., Pham, D. L. & Initiative, A. D. N. Robust skull stripping using multiple MR image contrasts insensitive to pathology. Neuroimage 146, 132–147 (2017).
Article Google Scholar
Chou, N., Wu, J., Bingren, J. B., Qiu, A. & Chuang, K.-H. Robust automatic rodent brain extraction using 3-D pulse-coupled neural networks (PCNN). IEEE Trans. Image Process. 20, 2554–2564 (2011).
Article ADS MathSciNet Google Scholar
Oguz, I., Zhang, H., Rumple, A. & Sonka, M. RATS: Rapid automatic tissue segmentation in rodent brain MRI. J. Neurosci. Methods 221, 175–182 (2014).
Article Google Scholar
Liu, Y., Unsal, H. S., Tao, Y. & Zhang, N. Automatic brain extraction for rodent MRI images. Neuroinformatics. 18, 1–12 (2020).
Feo, R. & Giove, F. Towards an efficient segmentation of small rodents brain: A short critical review. J. Neurosci. Methods 323, 82–89 (2019).
Article Google Scholar
Calhoun, V. D. et al. The impact of T1 versus EPI spatial normalization templates for fMRI data analyses. Hum. Brain Mapp. 38, 5331–5342 (2017).
Article Google Scholar
Wang, H. & Raj, B. On the origin of deep learning. arXiv preprint arXiv:.07800 (2017).
Owens, J. D. et al. A survey of general-purpose computation on graphics hardware. Comput. Graph. Forum 26, 80–113 (2007).
Article Google Scholar
Lundervold, A. S. & Lundervold, A. An overview of deep learning in medical imaging focusing on MRI. Z. Med. Phys. 29, 102–127 (2019).
Article Google Scholar
Yogananda, C. G. B., Wagner, B. C., Murugesan, G. K., A., M. & Maldjian, J. A. A Deep Learning Pipeline for Automatic Skull Stripping and Brain Segmentation. in 16th International Symposium on Biomedical Imaging (ISBI 2019), 727–731 (2019).
Lee, J., Kim, B. & Park, H. MC(2) -Net: Motion correction network for multi-contrast brain MRI. Magn. Reson. Med. 86, 1077–1092. https://doi.org/10.1002/mrm.28719 (2021).
Article PubMed Google Scholar
Lee, J., Han, Y., Ryu, J. K., Park, J. Y. & Ye, J. C. k-Space deep learning for reference-free EPI ghost correction. Magn. Reson. Med. 82, 2299–2313. https://doi.org/10.1002/mrm.27896 (2019).
Article PubMed Google Scholar
Kustner, T. et al. Retrospective correction of motion-affected MR images using deep learning frameworks. Magn. Reson. Med. 82, 1527–1540. https://doi.org/10.1002/mrm.27783 (2019).
Article PubMed Google Scholar
Kleesiek, J. et al. Deep MRI brain extraction: A 3D convolutional neural network for skull stripping. Neuroimage 129, 460–469 (2016).
Article Google Scholar
Hwang, H., Rehman, H. Z. U. & Lee, S. 3D U-Net for skull stripping in brain MRI. Appl. Sci. 9, 569 (2019).
Article Google Scholar
Salehi, S. S. M., Erdogmus, D. & Gholipour, A. Auto-context convolutional neural network (auto-net) for brain extraction in magnetic resonance imaging. IEEE Trans. Med. Imaging 36, 2319–2330 (2017).
Article Google Scholar
Isensee, F. et al. Automated brain extraction of multisequence MRI using artificial neural networks. Hum. Brain Mapp. 40, 4952–4964 (2019).
Article Google Scholar
Zhao, G. et al. Bayesian convolutional neural network based MRI brain extraction on nonhuman primates. Neuroimage 175, 32–44 (2018).
Article Google Scholar
Hsu, L.-M. et al. Automatic skull stripping of rat and mouse brain MRI data using U-Net. Front. Neurosci. (2020).
Venkatesh, V., Sharma, N. & Singh, M. Intensity inhomogeneity correction of MRI images using InhomoNet. Comput. Med. Imag. Graph. 84, 101748 (2020).
Goodfellow, I. et al. Generative adversarial nets. Adv. Neural Inf. Process. Syst. 2, 2672–2680 (2014).
Gui, J., Sun, Z., Wen, Y., Tao, D. & Ye, J. A review on generative adversarial networks: Algorithms, theory, and applications. arXiv preprint arXiv:.06937 (2020).
Kingma, D. P. & Ba, J. Adam: A method for stochastic optimization. arXiv, 1412.6980 (2014).
Heusel, M., Ramsauer, H., Unterthiner, T., Nessler, B. & Hochreiter, S. Gans trained by a two time-scale update rule converge to a local nash equilibrium. Adv. Neural Inf. Process. Syst. 30, 6626–6637 (2017).
Nagarajan, V. & Kolter, J. Z. Gradient descent GAN optimization is locally stable. Adv. Neural Inf. Process. Syst. 30, 5585–5595 (2017).
Srivastava, A., Valkov, L., Russell, C., Gutmann, M. U. & Sutton, C. Veegan: Reducing mode collapse in gans using implicit variational learning. Adv. Neural Inf. Process. Syst. 30, 3308–3318 (2017).
Pontes-Filho, S., Dahl, A. G. & Nichele, S. A deep learning based tool for automatic brain extraction from functional magnetic resonance images in rodents. arXiv preprint arXiv:.01359 (2019).
Goldfryd, T., Gordon, S. & Raviv, T. R. Deep semi-supervised bias field correction of MR images. in IEEE 18th International Symposium on Biomedical Imaging (ISBI). https://doi.org/10.1109/ISBI48211.42021.9433889 (2021).
Dai, X. et al. Intensity non-uniformity correction in MR imaging using residual cycle generative adversarial network. Phys. Med. Biol. 65, 215025. https://doi.org/10.1088/1361-6560/abb31f (2020).
Article PubMed PubMed Central Google Scholar
Shorten, C. & Khoshgoftaar, T. K. A survey on image data augmentation for deep learning. J. Big Data 6, 60 (2019).
Article Google Scholar
Isola, P., Zhu, J.-Y., Zhou, T. & Efros, A. A. Image-to-image translation with conditional adversarial networks. in Proceedings of the IEEE conference on computer vision and pattern recognition, 1125–1134 (2017).
Ronneberger, O., Fischer, P. & Brox, T. U-net: Convolutional networks for biomedical image segmentation. in International Conference on Medical image computing and computer-assisted intervention, 234–241 (2015).
Hinton, G. E. & Salakhutdinov, R. R. Reducing the dimensionality of data with neural networks. Science 313, 504–507 (2006).
Article ADS MathSciNet CAS Google Scholar
Mirza, M. & Osindero, S. Conditional generative adversarial nets. arXiv preprint arXiv:.1784 (2014).
Chuang, K.-H. et al. Evaluation of nuisance removal for functional MRI of rodent brain. Neuroimage 188, 694–709 (2019).
Article Google Scholar
Lee, H.-L., Li, Z., Coulson, E. J. & Chuang, K.-H. Ultrafast fMRI of the rodent brain using simultaneous multi-slice EPI. Neuroimage 195, 48–58 (2019).
Article Google Scholar
Qian, G., Sural, S., Gu, Y. & Pramanik, S. Similarity between Euclidean and cosine angle distance for nearest neighbor queries. in Proceedings of the 2004 ACM symposium on Applied computing, 1232–1237 (2004).
Wang, Z., Bovik, A. C., Sheikh, H. R. & Simoncelli, E. P. Image quality assessment: from error visibility to structural similarity. IEEE Trans. Image Process. 13, 600–612 (2004).
Article ADS Google Scholar
Taghanaki, S. A., Abhishek, K., Cohen, J. P., Cohen-Adad, J. & Hamarneh, G. Deep semantic segmentation of natural and medical images: A review. Artif. Intell. Rev. 54, 1–42 (2020).

Download references

Acknowledgements

This study was supported by the research programs MOST110-2221-E-182-027 and CMRPD1H0421, which were sponsored by the Ministry of Science and Technology, Taipei, Taiwan, and Chang Gung Memorial Hospital at Linkou, Taoyuan, Taiwan, respectively. This study was also supported by Queensland Brain Institute, The University of Queensland; and Australian Research Council Grant number IC170100035. The funders had no role in the conduct of the study nor the collection, analysis, and interpretation of data.

Author information

Authors and Affiliations

Queensland Brain Institute and Centre for Advanced Imaging, University of Queensland, Brisbane, Australia
Kai-Hsiang Chuang & Zengmin Li
Department of Medical Imaging and Radiological Sciences, and Graduate Institute of Artificial Intelligence, Chang Gung University, No. 259, Wenhua 1st Rd., Guishan Dist., Taoyuan, 33302, Taiwan
Pei-Huan Wu & Jun-Cheng Weng
Department of Radiation Oncology, Chang Gung Memorial Hospital at Linkou, Taoyuan, Taiwan
Kang-Hsing Fan
Medical Imaging Research Center, Institute for Radiological Research, Chang Gung University and Chang Gung Memorial Hospital at Linkou, Taoyuan, Taiwan
Jun-Cheng Weng
Department of Psychiatry, Chang Gung Memorial Hospital, Chiayi, Taiwan
Jun-Cheng Weng

Authors

Kai-Hsiang Chuang
View author publications
You can also search for this author in PubMed Google Scholar
Pei-Huan Wu
View author publications
You can also search for this author in PubMed Google Scholar
Zengmin Li
View author publications
You can also search for this author in PubMed Google Scholar
Kang-Hsing Fan
View author publications
You can also search for this author in PubMed Google Scholar
Jun-Cheng Weng
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

J.-C.W., K.-H.C. and P.-H.W. conceived and implemented the experimental design. K.-H.C. and Z. L. developed and performed MRI study. P.-H.W. and J.C.W. performed registrations, image analysis and produced data for figures. K.-H.C., J.C.W., P.-H.W. and K.-H.F. contributed to writing the manuscript.

Corresponding author

Correspondence to Jun-Cheng Weng.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Legends.

Supplementary Figure 1.

Supplementary Figure 2.

Supplementary Figure 3.

Supplementary Figure 4.

Supplementary Figure 5.

Supplementary Figure 6.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Chuang, KH., Wu, PH., Li, Z. et al. Deep learning network for integrated coil inhomogeneity correction and brain extraction of mixed MRI data. Sci Rep 12, 8578 (2022). https://doi.org/10.1038/s41598-022-12587-6

Download citation

Received: 22 October 2021
Accepted: 13 May 2022
Published: 20 May 2022
DOI: https://doi.org/10.1038/s41598-022-12587-6

This article is cited by

DeepN4: Learning N4ITK Bias Field Correction for T1-weighted Images
- Praitayini Kanakaraj
- Tianyuan Yao
- Daniel Moyer
Neuroinformatics (2024)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Model training

Choosing the best model

Testing using data from the same modality

Testing using multimodality data

Discussion

Criteria for stopping the training of a model

Performance of coil inhomogeneous correction

Performance of brain extraction

Testing GANs using multimodality data

Comparison to other network models

Outliers

Limitation

Conclusion

Methods

Generative adversarial networks

3D pix2pix

Dataset

Implementation

Quantitative evaluation

Statistical analysis

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links