## Introduction

The fetal brain undergoes dramatic morphological and architectural changes within a short timeframe. Accurate understanding of key milestones in fetal brain maturation is critical for assessing the range of normal development and long-term cognitive outcomes1. Previous studies have established an approximate spatiotemporal timetable of healthy fetal brain development, outlining the progressive gyrification of the cerebral cortex starting in the mid-second trimester2,3,4,5. Depending on severity, deviations from this pattern have been associated with developmental delays, psychomotor retardation, and failure to thrive6. The link between gestational age and cortical folding lays the foundation for neuroimaging-derived age predictions.

A growing body of neuroscience research has managed to leverage multiple imaging modalities to accurately predict the “brain age” of individuals using machine learning7,8,9. These algorithms learn the relationship between neuroimaging features and corresponding ages, after which they are tested on unseen data. Assuming model accuracy, discrepancies between estimated brain age and actual chronological age might suggest developmental brain pathology10. However, most studies to date have focused primarily on degenerative diseases and trauma in adults11,12,13,14. Fetal brain-based age estimation remains a major research gap and holds profound implications for obstetric prenatal care, delivery planning, and postnatal outcomes9,15,16.

The current method of choice for evaluating fetal brain maturity involves initial ultrasonography (US) of the cerebral cortex17. However, US can be severely limited by technical challenges and patient factors including maternal obesity, suboptimal fetal positioning, and oligohydramnios18. In addition, US-guided gestational dating in the second and third trimesters can err by up to 2 and 4 weeks, respectively19. In utero MRI has emerged as an important adjunct to US, offering detailed resolution of cortical gyration and myelination20. Nevertheless, rapid and ongoing neurodevelopmental changes, low signal-to-noise ratio, poor tissue contrast, and geometric distortion of the small fetal brain embedded within maternal structures pose obstacles to fetal neuroimaging. Fetal motion is also random, spontaneous, and possible in all planes, rendering even fast single-shot sequences challenging21,22. Furthermore, fetal brain MRI protocols, imaging platforms, and operator experience differ widely across institutions, leading to inconsistency in image quality and interpretation23.

Deep learning algorithms offer a powerful means to solve complex tasks such as fetal age estimation from highly variable imaging data12,24,25,26. Recent efforts have employed deep learning techniques on fetal brain MRI to infer gestational age, achieving moderate to high prediction accuracies27,28. However, these studies do not demonstrate a large or diverse enough sample to claim sufficient robustness or scalability28,29. The performance of some of these convolutional neural networks (CNNs) also depends on manual brain segmentation, which can be time-intensive, poorly generalizable, and sensitive to artifacts, particularly in fetal imaging30. To address these problems, we proposed a self-attention framework to improve brain localization, along with the use of input images in multiple planes to maximize image diversity. We developed and tested several fully automated CNN architectures on a large heterogeneous single-center fetal MRI dataset. Finally, we tested the accuracy of age prediction when applied to data from several other centers of excellence in fetal imaging.

## Results

### Stanford cohort

A total of 741 T2-weighted MRI scans corresponding to unique patients (median gestational age 30.6 weeks, range 19–39 weeks) were included. Coefficient of determination (R2) and mean absolute error (MAE) for each model architecture tested are presented in Table 1. For each MRI plane, diminishing performance was seen with more than 3 input slices. Between the two age prediction approaches, averaging the outputs from the global branch and attention-guided local branch generated higher R2 scores and smaller MAE compared with predictions based on global images alone. The highest performing single-plane model was the attention-guided, 3-slice, coronal-view model with an R2 of 0.924 and corresponding MAE of 7.9 days.

Integrating information from the three planes achieved a notable improvement in model regression performance. A visualization of model regression performance is shown in Fig. 1. The concatenated multi-plane network produced the most accurate gestational age predictions out of all models tested, with the 3-slice architecture slightly outperforming the 1-slice model (R2 = 0.945 vs. 0.935; MAE = 6.7 vs. 7.3 days). The agreement between prediction and ground truth for this model was substantial based on Lin’s concordance correlation coefficient (ρc = 0.970; 95% CI 0.961–0.978). The modified Bland–Altman plot shows slight age overestimation up to about 34 weeks, after which the model progressively underestimates gestational age across quantile curves.

### External sites

The attention-guided, multi-plane ResNet-50 models trained on Stanford data were tested on external data obtained from four centers of excellence: Children’s Hospital of Los Angeles (CHLA), Cincinnati Children’s Hospital Medical Center (CCHMC), St. Joseph Hospital and Medical Center (SJH), and Tepecik Training and Research Hospital (TTRH). Without transfer learning, the 1-slice and 3-slice models achieved R2 of 0.690–0.861 and 0.523–0.857 and MAE of 9.2–16.0 days and 10.3–21.0 days, respectively. As shown in Table 2, both models demonstrated notable improvement after fine-tuning (ΔMAE = − 0.7 to − 4.1 days and − 0.5 to − 4.6 days). Combining all datasets, the 1-slice model achieved a higher Lin’s concordance correlation coefficient than the 3-slice model, but the difference was not significant (ρc = 0.920 [0.903–0.934] vs. 0.895 [0.874–0.913]). The most generalizable models were the fine-tuned 1-slice model for CHLA, SJH, and TTRH and the 3-slice model for CCHMC, with R2 of 0.81–0.90, MAE of 8.4–12.9 days, and moderate ρc of 0.90–0.94.

## Discussion

In this study, we present an end-to-end, automated deep learning architecture that accurately predicts gestational age from developmentally normal fetal brain MRI. Our highest-scoring model performed at R2 of 0.945 on the Stanford test set, comparable or superior to published child, adolescent, and adult brain age prediction CNNs8,10,24. Previous works in fetal brain-based age analysis using MRI have primarily been limited to the development of spatiotemporal atlases for comparative age estimation and morphological segmentation31,32,33. Importantly, these methods help characterize fetal brain development and normal variability within the population9. However, most studies are restricted to a relatively small database, narrow age range, or isolated anatomical region (e.g., cortex, ventricles, hippocampus)31,34,35,36. These limitations reduce the generalizability of age-specific templates and reveal an important gap in our understanding of normal fetal brain maturation.

Variability in imaging quality presents another significant challenge for assessing fetal development. Challenges to interpretation include the rapidly changing neurological features in utero as well as the technical complexity of imaging17,21. Fetal MRI is notoriously complicated by the low signal from small fetal organs and relatively noisy background due to spontaneous fetal motion and maternal soft tissues (see Supplementary Fig. S1)37,38. One study showed that a deep learning segmentation model achieves high Dice overlap scores (96.5%) on clean datasets but low performance on images with motion artifact or abnormal fetal orientation (78.8%)30. This discrepancy highlights the importance of leveraging heterogeneous datasets to train and fine-tune deep learning networks. Accordingly, we reviewed all normal fetal MRIs at Stanford from 2004 to 2017 and excluded images only if severe imaging artifacts rendered them nondiagnostic. Our database of 741 images thereby enabled us to capture broad within-institution imaging variability and outnumbers datasets previously used to develop spatiotemporal atlases9,31,32,33,39.

More recent deep learning methods have utilized attention guidance in conjunction with object segmentation to improve noise resiliency40,41. Shi et al.28 built an attention-based deep residual network based on 659 pre-segmented fetal brains, achieving R2 of 0.92 and MAE of 0.77 weeks. Their use of attention activation maps emphasized global and regional features, such as cerebral volume and sulcal contours, within pre-processed segmentations to enhance prediction accuracy. However, this staged deep learning approach relies on the careful delineation of fetal brain masks, a time-intensive process that the authors report as taking 30–40 min per sample. Since age regression depends on accurate object masking, external generalizability may be limited, as any fine-tuning would require manual segmentation by a trained researcher with domain knowledge. In contrast, we employ the attention mechanism to automatically focus on the fetal brain itself, achieving a higher signal-to-noise ratio by excluding unrelated features such as maternal organs and other fetal body parts and by reducing non-uniform MR intensity. Furthermore, both attention-guided masking and age regression are trained simultaneously and recursively, obviating the need for extensive pre-processing and fine-tuning. Our best-performing model was thereby computationally efficient and scalable, completing its regression task within 5 min on a GPU.

The real-world utility of any deep learning model largely depends on its generalization performance. For fetal MRI in particular, standard imaging protocols, quality of imaging, sequences used, and operator experience differ widely across institutions23. Performance losses incurred when transferring models from one institution to another have become a major concern in the machine learning field. In this study, we test multi-center generalizability of our automated deep learning network using a large external database spanning four centers of excellence, two countries, and a wide array of imaging platforms, scanner hardware, and acquisition parameters (Table 3). There were visible differences in image appearance when comparing datasets across different sites due to factors such as resolution, contrast, and signal-to-noise ratio (see Supplementary Fig. S2). Accordingly, our Stanford-trained multi-plane models yielded varying degrees of performance reduction on the external datasets. However, fine-tuning the model with just 20% of the external data enabled the network to adapt to the new cohort, highlighting its potential applicability across institutions and imaging platforms. Meaningful improvements in R2 score, MAE, and age concordance were achieved across institutions after fine-tuning and may continue to be observed using larger validation datasets.

Fetal MRI not only offers insight into prenatal development, but can also guide laboratory work-up, therapeutic interventions, counseling, and delivery planning23. At present, the reported date of last menstrual period and first-trimester US measurements are “gold standard” methods for determining gestational age19. However, inaccurate recall of the last menstrual period, confounding factors (e.g., irregular spotting or ovulation), and US variability in the second and third trimesters have propelled the need for alternative gestational dating approaches42. In our study, fetal brain MRI scans interpreted as normal based on expert consensus were used to develop a convolutional neural network that was highly predictive of gestational age, offering a potential solution for age estimation in the second half of gestation. Our end-to-end approach to assessing the fetal brain also obviates the need for manual feature engineering or segmentation, enabling real-time interpretation. Moving forward, this model may serve as a backbone for evaluating gestational age as well as deviations from normal development, such as underdevelopment, malformation, and other congenital diseases6,9. Furthermore, emerging deep learning techniques in image reconstruction43 offer promise for developing population-based spatiotemporal atlases to better characterize age-based fetal neuroanatomy.

There are several limitations to this study. As a 2D CNN, age predictions are made based on single-slice inputs, potentially limiting the information available to the network. A 3D CNN incorporating multi-slice imaging features may improve model performance but would require a much larger dataset and risk greater background noise. Our approach to enhancing regression accuracy involves Gaussian weighting of the attention heatmap, optimized for images centered on the fetal brain. Extreme position and size variability thereby reduces the accuracy of attention-guided mask inference but not necessarily regression performance, as shown in Supplementary Fig. S3. This may be explained by the inclusion of both local and global branches, incorporating semantic features from the emphasized subregion as well as the entire image, respectively. A drawback of this approach is that the global branch can still introduce unwanted background noise even when the localization procedure performs optimally.

Notably, beyond 34 weeks, our model appears to underestimate gestational age. This trend can be partially attributed to dataset imbalance, with few fetal MRIs performed in the late third trimester, biasing predictions toward younger gestational ages. US and MR imaging studies also indicate that peak gyrification occurs between weeks 29 and 35 and that most of the primary and secondary sulci along with all notable gyri have formed by weeks 34–374,18,44. A decreasing gyrification rate approaching full term may also skew age estimates, as fetal brains appear more homogeneous as they near maturity. Future work can extend the training set to include fetal MRI at age extremes and explore emerging methods such as feature distribution smoothing for imbalanced data with continuous labels45. In terms of generalizability, our model may also benefit from the inclusion of external data in the original training set to reduce over-fitting. Finally, a machine learning model is only as reliable as the quality of its input data. Long-term clinical and developmental outcomes for our cohort are unavailable, so scans used to train and test our model are only “normal” from a neuroanatomical perspective.

## Conclusion

Deep learning has emerged as a powerful approach for interpreting complex image features. We present an attention-guided, multi-view deep learning network that analyzes MRI-based features of the normally developing fetal brain to accurately predict gestational age. We further demonstrate model performance on external sites and the utility of fine-tuning the model for enhanced generalizability. This study identifies opportunities for imaging-driven analytics of in utero human neural development with potential to enhance diagnostic precision in the second and third trimesters.

## Materials and methods

### Stanford data collection and cohort description

We retrospectively reviewed all 1927 fetal brain MRIs performed at Stanford Lucile Packard Children’s Hospital from 2004 to 2017, as described in Supplementary Table S1. 1.5 T and 3 T MRI data were acquired with an 8-channel head coil on Signa HDxt, Signa EXCITE, Optima MR450W, and Discovery MR750W scanners (GE Healthcare). We excluded 572 images containing cerebral malformations, ventriculomegaly, or other acquired or congenital brain lesions, as well as 422 nondiagnostic images with severe motion artifacts or noise preventing adequate interpretation. In total, we compiled a database of 933 fetal brain MRIs interpreted as developmentally normal by expert pediatric neuroradiologists. MRI interpretations were based on visual features and biometry measurements such as brain biparietal diameter and skull occipitofrontal diameter. Of these, 741 studies had single-shot fast spin-echo T2-weighted sequences in all three planes (axial, coronal, and sagittal). The single-shot images, originally in DICOM format, were compressed to JPG files for visualization. The image slices near the middle of the sequence were pre-processed and augmented as the input. Slices were randomly cropped to 224 × 224 and normalized using sample mean and standard deviation. These data were randomly split into training (70%), validation (10%), and test (20%) sets for model input.
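As a rough illustration of the preprocessing described above, the following sketch (assuming slices are NumPy arrays; the helper names are ours, not from the study) crops a slice to 224 × 224, normalizes it with its own sample statistics, and draws a random 70/10/20 split:

```python
import numpy as np

def random_crop(img, size=224, rng=None):
    """Randomly crop a 2D slice to size x size, as used for model input."""
    if rng is None:
        rng = np.random.default_rng()
    h, w = img.shape
    top = rng.integers(0, h - size + 1)
    left = rng.integers(0, w - size + 1)
    return img[top:top + size, left:left + size]

def normalize(img, eps=1e-8):
    """Normalize a slice using its own sample mean and standard deviation."""
    return (img - img.mean()) / (img.std() + eps)

def split_indices(n, rng=None):
    """Random 70/10/20 train/validation/test split of n samples."""
    if rng is None:
        rng = np.random.default_rng(0)
    idx = rng.permutation(n)
    n_train, n_val = int(0.7 * n), int(0.1 * n)
    return idx[:n_train], idx[n_train:n_train + n_val], idx[n_train + n_val:]
```

For the 741-study cohort, such a split yields 518 training, 74 validation, and 149 test samples.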

This study was approved by Stanford University’s Institutional Review Board (IRB). Data collection and analysis were performed in accordance with relevant guidelines and regulations. Written informed consent was obtained from all pregnant women or authorized representatives for imaging of fetuses prior to delivery (IRB protocol #42137).

### Model structure

The model architecture consists of two parallel branches, the global and local branches, as shown in Fig. 2. Both the global and local branches consist of deep residual neural networks that are optimized to predict gestational age based on fetal MRI. ResNet-50, a CNN pre-trained on more than a million images from the 2012 ImageNet database, was used as the backbone deep neural network for age regression46. For each stack of input image slices, we assumed the middle slice to contain the largest fetal brain area. We then tested the effect of the number of image slice inputs on model performance (e.g., 1, 3, 5), incorporating additional slices immediately adjacent to the middle slice. The first convolutional layer of the ResNet-50 model was parameterized to accommodate different numbers of image slices with their corresponding input channels. Pretrained model weights were then applied to subsequent layers of the network. Given input image(s) X, the global branch is first trained using the entire or ‘global’ X. Then, the region of interest is masked using an attention mechanism with Gaussian weighting and trained for age regression on the local branch. Learned features from both branches simultaneously optimize final age prediction. Independent models were trained on axial, coronal, and sagittal images to study the unique semantic features from different planes.
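The text states only that the first convolutional layer was re-parameterized to accept different numbers of slice channels. One common heuristic for this (our assumption, not necessarily the authors' exact method) is to average the pretrained RGB filters and replicate them across the new input channels, rescaling to preserve activation magnitude:

```python
import numpy as np

def adapt_first_conv(pretrained_w, n_slices):
    """Adapt pretrained conv1 weights of shape (out, 3, kh, kw) to n_slices
    input channels by averaging over the RGB filters and replicating them.
    This is a common heuristic; the paper does not specify the exact scheme."""
    mean_w = pretrained_w.mean(axis=1, keepdims=True)  # (out, 1, kh, kw)
    new_w = np.repeat(mean_w, n_slices, axis=1)        # (out, n_slices, kh, kw)
    # Rescale so the expected activation magnitude is preserved.
    return new_w * (3.0 / n_slices)
```

With this rescaling the total weight mass is unchanged, so the downstream pretrained layers see inputs of a familiar scale.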

We compared two approaches for predicting gestational age ypred: global branch predictions (i.e., entire image) without the attention-guided local branch, versus averaged age predictions from both the global branch and local branch (i.e., masked region of interest). The true gestational age ytrue or ‘ground truth’ was determined via the standard-of-care approach of estimating the date of delivery based on an early obstetric ultrasound in the first trimester19. Gestational ages at time of US were recorded directly from the reports, and differences in MRI and US dates were added to obtain ytrue for each patient. In the training phase, the model is optimized by stochastic gradient descent with backpropagation to minimize the mean squared error (MSE) loss between true and predicted ages, $$\left\| {y_{true} - y_{pred} } \right\|_{2}^{2}$$47.
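The ground-truth computation described above amounts to simple date arithmetic: the first-trimester US establishes gestational age at a known date, and the interval between the US and the MRI is added. A minimal sketch (the function name and example dates are ours):

```python
from datetime import date

def gestational_age_at_mri(ga_at_us_weeks, us_date, mri_date):
    """Ground-truth gestational age at the MRI: the GA recorded at the
    first-trimester US plus the elapsed time to the MRI, in weeks."""
    elapsed_days = (mri_date - us_date).days
    return ga_at_us_weeks + elapsed_days / 7.0

# Hypothetical example: GA of 12.0 weeks at a US on 2015-03-01,
# MRI performed 133 days later on 2015-07-12.
ga = gestational_age_at_mri(12.0, date(2015, 3, 1), date(2015, 7, 12))  # 31.0
```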

Computational analysis of fetal MR imaging is extremely challenging due to the random position and rotation of fetal brains across patients. Additionally, noise unrelated to the fetus (such as the maternal placenta and organs) may negatively affect predictive performance. These considerations motivated the use of attention-guided mask inference, which provides spatially variant maps that highlight regions of interest and contribute to accurate object recognition48.

As previously described in Guan et al.49 and Zhou et al.50, the attention heatmap is extracted from the last convolutional layer in the global branch. Given an initial input image X representing the whole image slice, $$f_{k} \left( {x,y} \right)$$ represents the activation of spatial location $$\left( {x,y} \right)$$ in the kth channel of the output of the last convolutional layer, where $$k \in \left\{ {1, \ldots ,K} \right\}$$ and K is the total number of feature map channels ($$K = 512$$ in ResNet-18, $$K = 2048$$ in ResNet-50). The attention heatmap values $$H_{g}$$ are computed by taking the maximum absolute activation across channels:

$$H_{g} \left( {x,y} \right) = \mathop {\max }\limits_{k} \left| {f_{k} \left( {x,y} \right)} \right|, \quad k \in \left\{ {1, \ldots ,K} \right\}$$

After up-sampling $$H_{g}$$ to match the resolution of the input images, we apply the truncated ReLU activation function to normalize the heatmap $$H_{g}$$ to the range [0, 1], where larger values represent a higher probability of detecting fetal brain tissue. High-value areas are subsequently given more attention by the prediction model. Furthermore, with the prior knowledge that the fetal brain usually lies near the center of the image, we multiply the heatmap by a 2D Gaussian mask to re-weight it. The resulting heatmaps highlight the region of interest (i.e., fetal brain). Examples of heatmaps are shown in Fig. 3.
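The heatmap pipeline described above (channel-wise absolute maximum, truncated-ReLU normalization to [0, 1], and Gaussian center re-weighting) can be sketched in NumPy as follows. The Gaussian width and the truncation threshold are our assumptions; the paper does not specify them:

```python
import numpy as np

def attention_heatmap(feature_maps):
    """H_g(x, y) = max_k |f_k(x, y)| over the K feature-map channels.
    feature_maps has shape (K, H, W)."""
    return np.abs(feature_maps).max(axis=0)

def truncated_relu_normalize(h, tau=None):
    """Clip at a threshold, then scale to [0, 1]; one plausible reading of
    the 'truncated ReLU' normalization in the text."""
    if tau is None:
        tau = h.max()
    return np.clip(h, 0.0, tau) / (tau + 1e-8)

def gaussian_mask(shape, sigma_frac=0.25):
    """Centered 2D Gaussian prior encoding the assumption that the fetal
    brain tends to lie near the image center (sigma_frac is assumed)."""
    h, w = shape
    y = np.arange(h) - (h - 1) / 2.0
    x = np.arange(w) - (w - 1) / 2.0
    yy, xx = np.meshgrid(y, x, indexing="ij")
    sy, sx = sigma_frac * h, sigma_frac * w
    return np.exp(-(yy**2 / (2 * sy**2) + xx**2 / (2 * sx**2)))
```

In use, `truncated_relu_normalize(attention_heatmap(f)) * gaussian_mask(f.shape[1:])` (after up-sampling to the input resolution) yields the re-weighted heatmap applied to the input image.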

Heatmap weights are multiplied with the input image to obtain a masked region of the fetal brain, suppressing background noise in the original scan. The re-weighted image is then fed to the local branch for age prediction based on regional features. Since we automatically extract the heatmap from the global branch and the normalization operations are differentiable, the entire model framework can be trained end to end for adaptive attention map weighting and brain age estimation.

### Multi-plane learning approach

A multi-plane learning approach was employed to capitalize on complementary information contained in different MRI dimensions. Separately from the single-plane architectures, we trained a multi-plane model by minimizing the total MSE loss involving axial, coronal, and sagittal planes. Network weights are thereby optimized based on features from all MRI views simultaneously. After convergence, prediction outputs from each plane are then averaged for a final estimation of gestational age.
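Under the setup described above, the multi-plane objective is simply the sum of per-plane MSE losses, and the final estimate averages the three per-plane outputs. A minimal sketch (function names are ours):

```python
import numpy as np

def multiplane_loss(preds_by_plane, y_true):
    """Total MSE loss summed over the axial, coronal, and sagittal
    branch predictions, each an array of per-sample GA estimates."""
    return sum(np.mean((p - y_true) ** 2) for p in preds_by_plane)

def multiplane_predict(preds_by_plane):
    """Final gestational age estimate: average of the per-plane outputs."""
    return np.mean(np.stack(preds_by_plane), axis=0)
```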

### Training and evaluation

All network architectures were implemented with the PyTorch framework51. We trained the models using the Adam optimizer with a learning rate of 1 × 10⁻⁴ and a batch size of 50 for 2000 iterations. Training was conducted on an NVIDIA TITAN Xp GPU. High-scoring models were defined as those with strong correlation and concordance between true and predicted gestational age. Correlative strength was evaluated for all models trained and tested on Stanford fetal imaging data using R2 and MAE. Concordance between predicted and true gestational ages was determined using Lin’s concordance correlation coefficient, with strength of agreement assessed by McBride’s criteria as follows: poor, < 0.90; moderate, 0.90–0.95; substantial, 0.95–0.99; almost perfect, > 0.9952,53. Statistical results were visually confirmed by local piecewise regression analysis using a window size of 15 points, 95% overlap between windows, and Gaussian smoothing54.
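The three evaluation metrics used throughout (MAE, R2, and Lin's concordance correlation coefficient) follow standard formulas and can each be computed in a few lines; the population form of Lin's ρc is shown:

```python
import numpy as np

def mae(y_true, y_pred):
    """Mean absolute error, in the same units as the labels (days here)."""
    return np.mean(np.abs(y_true - y_pred))

def r2(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    ss_res = np.sum((y_true - y_pred) ** 2)
    ss_tot = np.sum((y_true - y_true.mean()) ** 2)
    return 1.0 - ss_res / ss_tot

def lins_ccc(y_true, y_pred):
    """Lin's concordance correlation coefficient (population form),
    penalizing both poor correlation and systematic bias."""
    mu_t, mu_p = y_true.mean(), y_pred.mean()
    var_t, var_p = y_true.var(), y_pred.var()
    cov = np.mean((y_true - mu_t) * (y_pred - mu_p))
    return 2.0 * cov / (var_t + var_p + (mu_t - mu_p) ** 2)
```

Unlike Pearson's r, Lin's ρc drops below 1 for predictions that are perfectly correlated but shifted or scaled, which is why it is used here to grade agreement against McBride's criteria.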

### Validation with external sites

External MRI data were obtained from four additional centers of excellence: Children’s Hospital of Los Angeles, Cincinnati Children’s Hospital Medical Center, St. Joseph Hospital and Medical Center, and Tepecik Training and Research Hospital in İzmir, Turkey. MR imaging across sites varied widely in terms of scanning platform, sequence types, and technical settings, as shown in Table 3. To test generalizability, the attention-guided multi-plane model (i.e., the highest-scoring network tested on Stanford data) was used, and the 1-slice and 3-slice architectures were compared across external institutions. After deploying the same data curation methods used for Stanford data, the external datasets consisted of 156, 64, 25, and 189 fetal MRI samples for CHLA, CCHMC, SJH, and TTRH, respectively (Supplementary Fig. S2). The Stanford-trained model was first tested directly on these unseen external samples without any transfer learning. We then fine-tuned the model with 20% of each dataset using the Adam optimizer with a learning rate of 1 × 10⁻⁵ and a batch size of 5. For SJH, we used a learning rate of 1 × 10⁻⁶, as only 5 samples were available for fine-tuning. We employed early stopping at 5 epochs to avoid overfitting. Performance with and without fine-tuning was compared on the remaining 80% of each dataset.
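"Early stopping at 5 epochs" is stated without further detail; one plausible reading (our assumption) is a patience of 5 epochs without validation improvement, which a small helper can implement:

```python
class EarlyStopper:
    """Stop fine-tuning once validation loss has failed to improve for
    `patience` consecutive epochs. This patience-based reading of
    'early stopping at 5 epochs' is an assumption on our part."""

    def __init__(self, patience=5):
        self.patience = patience
        self.best = float("inf")
        self.bad_epochs = 0

    def step(self, val_loss):
        """Record one epoch's validation loss; return True to stop."""
        if val_loss < self.best:
            self.best = val_loss
            self.bad_epochs = 0
        else:
            self.bad_epochs += 1
        return self.bad_epochs >= self.patience
```

In a fine-tuning loop, `if stopper.step(val_loss): break` after each validation pass halts training before the small external splits are overfit.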