Introduction

Cardiovascular disease is a leading cause of death and disability worldwide and has been one of the top ten most important drivers of increasing global disease burden over the last three decades1. Echocardiography is the most commonly used imaging modality in the assessment of cardiac structure, function, and disease2,3. The two main modalities of echocardiography are transthoracic echocardiography (TTE) and transesophageal echocardiography (TEE). TTE is used as a screening tool in asymptomatic patients and as the initial diagnostic tool for many cardiovascular disease states, including ischemic heart disease, valvular heart disease, rhythm disorders, and heart failure. TEE is utilized in the workup and management of complex cardiac pathology, such as sequelae of acute myocardial ischemia or acute aortic disease3, and is particularly valuable as a monitoring and diagnostic tool in the management of cardiac surgery patients2,4. As the standard of care, intraoperative TEE is performed during all major cardiac surgeries, especially those requiring an open sternotomy and cardiopulmonary bypass (CPB), to help make diagnoses, guide surgical decision-making, and evaluate hemodynamic states in real time.

Given its importance in cardiovascular disease management, echocardiography imaging has become an important target for artificial intelligence (AI). Prior echocardiography-based AI research has focused on TTE videos, with recent work showing that machine learning algorithms are able to classify standardized TTE views5,6,7,8, recognize cardiac structures9, and estimate left ventricular ejection fraction10. Additional work has demonstrated the ability to accurately diagnose the etiology of left ventricular hypertrophy11, extract phenotypic information such as age and sex9, and predict clinical outcomes such as postoperative right ventricular failure after the implantation of a left ventricular assist device12.

Previous groups have also shown that machine learning models can be trained on TEE data to perform focused image segmentation tasks and automatically calculate measurements such as the mitral annular plane systolic excursion; however, such TEE-based approaches have been limited to small, highly curated data sets13,14,15. The application of AI and machine learning to TEE images acquired during the course of standard clinical care remains relatively unexplored. TEE imaging data are highly variable due to the dynamic environment of the cardiac surgery operating room, which results in the acquisition of varying image sequences, non-standard views, and missing views. Without an automated preprocessing and view classification pipeline for clinically acquired TEE videos, deep learning tasks on unstructured intraoperative and intraprocedural TEE data remain challenging.

However, given the vitally important role that TEE imaging plays in the evaluation of complex cardiovascular disease states and in the perioperative management of high-risk cardiac surgery patients, there is great potential value to be extracted from TEE images with advanced deep learning methodologies. Therefore, the purpose of the present study was to train a deep learning-based TEE view classification model that could be used to create structure for intraoperative and intraprocedural TEE imaging data and thereby facilitate downstream TEE-based deep learning tasks. More specifically, the aim of the present study was to train a convolutional neural network (CNN) to accurately classify standardized TEE views using labeled intraoperative and intraprocedural TEE videos.

Methods

Cohort selection and data processing

We obtained TEE image data for randomly selected adult patients who underwent an intraoperative or intraprocedural TEE exam at Cedars-Sinai Medical Center (CSMC) between 2016 and 2021. This yielded 2967 TEE videos, including intraoperative echocardiography images from open (via sternotomy) cardiothoracic surgical operations and intraprocedural echocardiography images from transcatheter procedures for structural heart disease. We also obtained TEE image data from randomly selected adult patients who underwent an intraoperative TEE exam during open cardiothoracic surgery at Stanford University Medical Center (SUMC), yielding an additional 465 TEE videos for an external test set.

The Institutional Review Boards at Cedars-Sinai Medical Center and Stanford University Medical Center both granted ethical approval for this study. Because our study was a retrospective analysis of data already collected as part of the clinical standard of care, both boards granted a waiver of informed consent. All study methods were performed in accordance with the guidelines and regulations outlined by both Institutional Review Boards.

TEE image data were converted from Digital Imaging and Communications in Medicine (DICOM) format to AVI videos. Prior to labeling, model training, and analysis, an automated preprocessing workflow was applied to remove patient-identifying information and eliminate unintended human labels. Each video was then cropped and masked to remove text, ECG and respirometer traces, and other information outside of the scanning sector. The resulting square images were either 600 × 600 or 768 × 768 pixels, depending on the ultrasound machine, and were down-sampled by cubic interpolation using OpenCV into standardized 112 × 112 pixel videos.
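
As an illustrative sketch of the masking and down-sampling step (not the exact pipeline code, which handles vendor-specific layouts), the core operation can be written with OpenCV as follows; the `sector_mask` argument and function name are hypothetical:

```python
import cv2
import numpy as np

def preprocess_frame(frame: np.ndarray, sector_mask: np.ndarray,
                     size: int = 112) -> np.ndarray:
    """Zero out pixels outside the scanning sector, then down-sample the
    square frame (600 x 600 or 768 x 768 pixels, depending on the machine)
    to size x size pixels by cubic interpolation.

    `sector_mask` is a hypothetical 8-bit, single-channel binary mask with
    the same height and width as `frame`; deriving it is vendor-specific.
    """
    masked = cv2.bitwise_and(frame, frame, mask=sector_mask)
    return cv2.resize(masked, (size, size), interpolation=cv2.INTER_CUBIC)
```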

All training, validation, and test images were labeled by a board-certified echocardiographer. Expert consensus echocardiography guidelines identify twenty-eight standardized TEE views for a complete intraoperative multi-plane TEE exam16. For our multi-category deep learning view classification model, we chose the eight most consistently acquired and most clinically useful TEE views in the intraoperative assessment of cardiac surgery patients: the Mid-Esophageal (ME) 2-Chamber View, ME 4-Chamber View, ME Aortic Valve (AV) Short Axis (SAX) View, ME Bicaval View, ME Left Atrial Appendage View, ME Long Axis View, Trans-Gastric (TG) LV SAX View, and Aortic View.

Four of our eight chosen views (the ME 2-Chamber, ME 4-Chamber, ME AV SAX, and ME Long Axis) represent pooled categories, each combining two of the twenty-eight standardized views. More specifically, the “ME 2-Chamber View” class includes ME 2-chamber and ME mitral commissural views; the “ME 4-Chamber View” class includes ME 4-chamber and ME 5-chamber views; the “ME AV SAX View” class includes ME AV SAX and ME right ventricular (RV) inflow-outflow views; and the “ME Long Axis View” class includes ME long axis and ME AV long axis views. We also chose to generalize two categories (the TG LV SAX and Aortic Views). TEE videos that did not fall into any of the eight chosen view classes were labeled as “Other.”
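
For illustration, this pooling scheme can be expressed as a simple lookup from labeled view names to the eight training classes; the string keys below are shorthand, not the exact labels used in our data:

```python
# Illustrative mapping from labeled standardized views (shorthand names)
# to the eight pooled/generalized training classes. Videos matching no
# entry are assigned the "Other" label.
VIEW_CLASS_MAP = {
    "ME 2-chamber": "ME 2-Chamber View",
    "ME mitral commissural": "ME 2-Chamber View",
    "ME 4-chamber": "ME 4-Chamber View",
    "ME 5-chamber": "ME 4-Chamber View",
    "ME AV SAX": "ME AV SAX View",
    "ME RV inflow-outflow": "ME AV SAX View",
    "ME long axis": "ME Long Axis View",
    "ME AV long axis": "ME Long Axis View",
    "ME bicaval": "ME Bicaval View",
    "ME left atrial appendage": "ME Left Atrial Appendage View",
    # Generalized classes: any LV short-axis level; any dedicated aortic view.
    "TG LV SAX (basal, mid-papillary, or apical)": "TG LV SAX View",
    "aorta (any level or axis)": "Aortic View",
}

def pooled_label(raw_view: str) -> str:
    return VIEW_CLASS_MAP.get(raw_view, "Other")
```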

AI model design and testing

We trained a CNN to classify eight standardized TEE views. Our training and validation sets contained 2464 unique videos (split 4:1), representing 2036 patients. The model was tested on 503 randomly selected videos from CSMC and 465 randomly selected videos from SUMC, none of which were seen during model training. We trained the CNN with residual connections and spatiotemporal convolutions using the R2 + 1D architecture17,18. We chose R2 + 1D spatiotemporal convolutions based on our prior work with TTE videos, in which we tested multiple model architectures with variable integration of temporal convolutions and found decomposed R2 + 1D spatiotemporal convolutions to have the best balance of computational complexity and model performance10. The model architecture and its tradeoffs are described in detail in the original architecture papers17,18.
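
As a minimal sketch (the exact training code is available in our repository), the backbone can be instantiated from the torchvision implementation of R2 + 1D; we assume here that the “Other” category is modeled as a ninth output class:

```python
import torch
from torchvision.models.video import r2plus1d_18

# R2+1D video ResNet with randomly initialized weights. We assume nine
# output classes (the eight standardized views plus "Other"); adjust
# num_classes if "Other" is handled differently.
model = r2plus1d_18(num_classes=9)

# Inputs are batches of clips shaped (N, C, T, H, W): here, a single
# 16-frame clip of 112 x 112 pixel, 3-channel video.
clip = torch.randn(1, 3, 16, 112, 112)
logits = model(clip)  # shape: (1, 9)
```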

Model weights were randomly initialized. The model was trained to minimize the cross-entropy between the predicted view and the labeled view. We used an Adam optimizer19, a learning rate of 0.001, and a batch size of 44. We employed early stopping, ceasing training once performance on the validation set stopped improving; our final model trained for nine epochs. The model was trained on 32-frame sub-clips of videos in the training set, sampled with a temporal stride of two, yielding a final model input length of 16 frames. The choice of 16 frames is based on hyperparameter sweeps in prior work balancing model performance and computational efficiency10. The starting frames of these sub-clips within their parent clips were randomized during training as a form of data augmentation. All model training was done using the Python library PyTorch. Our code is available online at https://github.com/echonet/tee-view-classifier.
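
A minimal sketch of this training configuration follows, with the sub-clip sampling written out explicitly; data loading and the early-stopping logic are omitted, and the helper names are illustrative:

```python
import numpy as np
import torch
import torch.nn as nn
from torchvision.models.video import r2plus1d_18

model = r2plus1d_18(num_classes=9)  # the R2+1D network from the sketch above
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)  # Adam, lr = 0.001
criterion = nn.CrossEntropyLoss()  # cross-entropy between predicted and labeled views

def sample_subclip(video: np.ndarray, length: int = 32, stride: int = 2) -> np.ndarray:
    """Take a 32-frame window starting at a random frame (data augmentation),
    then keep every second frame (temporal stride of two), yielding the
    16-frame model input. `video` is shaped (C, T, H, W)."""
    start = np.random.randint(0, max(1, video.shape[1] - length + 1))
    return video[:, start:start + length:stride]

def train_step(clips: torch.Tensor, labels: torch.Tensor) -> float:
    """One optimization step on a batch of clips (we used a batch size of 44)."""
    optimizer.zero_grad()
    loss = criterion(model(clips), labels)
    loss.backward()
    optimizer.step()
    return loss.item()
```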

Statistical analysis

Model performance was assessed on an internal hold-out test set from CSMC and an external test set from SUMC, neither of which was seen during model training. Performance was quantified by the area under the receiver operating characteristic curve (AUROC). Two-sided 95% confidence intervals were computed for each estimate using 1000 bootstrapped samples. Unsupervised t-Distributed Stochastic Neighbor Embedding (t-SNE) was used for clustering analysis20. All statistical analyses were performed in Python.
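
A sketch of the bootstrap procedure is shown below, assuming one-hot labels and per-class predicted probabilities; the function name and array formats are illustrative:

```python
import numpy as np
from sklearn.metrics import roc_auc_score

def bootstrap_auc_ci(y_true, y_score, n_boot=1000, alpha=0.05, seed=0):
    """Two-sided 95% confidence interval for the micro-averaged AUROC,
    estimated from 1000 bootstrap resamples of the test set.

    y_true: (N, K) one-hot view labels; y_score: (N, K) predicted probabilities.
    """
    rng = np.random.default_rng(seed)
    n = len(y_true)
    aucs = []
    for _ in range(n_boot):
        idx = rng.integers(0, n, size=n)  # resample videos with replacement
        aucs.append(roc_auc_score(y_true[idx], y_score[idx], average="micro"))
    low, high = np.percentile(aucs, [100 * alpha / 2, 100 * (1 - alpha / 2)])
    return low, high
```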

Results

Patient characteristics and surgery or procedure types represented in our training, validation, and test data sets are shown in Table 1. Our data sets included a broad spectrum of anatomic variation, clinical pathology, and imaging indications, reflecting the cardiac open surgical and transcatheter procedural populations seen at CSMC and SUMC. The images also included a wide range of technical variation, including differences in spatial and temporal resolution, field of view depth and sector width, gain, image quality, and use of color flow Doppler (Fig. 1). The most frequently represented views included the ME 4-Chamber View, the ME Long Axis View, the TG Left Ventricular Short Axis View, and the ME Aortic Valve Short Axis View (Table 2).

Table 1 Clinical characteristics and surgery or procedure types represented in the training, validation, and internal test data sets.
Figure 1

Sample training images used for the deep-learning view classification task. Images are 2-dimensional still frames sampled from the video data used in model training. Eight standard TEE views were chosen: the ME 2-Chamber View, ME 4-Chamber View, ME AV SAX View, ME Bicaval View, ME LAA View, ME Long Axis View, TG LV SAX View, and Aortic View. TEE, transesophageal echocardiography; ME, mid-esophageal; AV, aortic valve; SAX, short axis; LAA, left atrial appendage; TG, trans-gastric; LV, left ventricular.

Table 2 Number of TEE videos labeled for model training and validation, per view class.

Our view classification model achieved an overall micro-averaged area under the receiver operating characteristic curve (AUC) of 0.919 on the hold-out CSMC test set of TEE videos (Fig. 2 and Table 3). Our model showed particularly good performance for the Trans-Gastric Left Ventricular Short Axis View (AUC = 0.971), the Mid-Esophageal Long Axis View (AUC = 0.954), the Mid-Esophageal Aortic Valve Short Axis View (AUC = 0.946), and the Mid-Esophageal 4-Chamber View (AUC = 0.939). Model performance also generalized well externally, achieving a micro-averaged AUC of 0.872 when tested on the 465 never-before-seen TEE videos from SUMC. Our model had similar performance for the Trans-Gastric Left Ventricular Short Axis View (AUC = 0.957), the Mid-Esophageal Long Axis View (AUC = 0.905), the Mid-Esophageal Aortic Valve Short Axis View (AUC = 0.898), and the Mid-Esophageal 4-Chamber View (AUC = 0.902) in the SUMC data set.

Figure 2

View classification model performance on the internal (CSMC) hold-out test set and the external (SUMC) test set. (a) AUCs for each view class, demonstrating high accuracy (AUCs ranging from 0.816 to 0.957). No AUC could be calculated for the ME Left Atrial Appendage View in the randomly selected SUMC test set due to low sampling. (b) Confusion matrices showing model performance, with views labeled by a board-certified echocardiographer along the vertical axis and views predicted by the deep learning model along the horizontal axis. Numerical values in the matrices and the color intensity of the heatmaps represent the number of images with the indicated ground-truth and model-predicted labels. AUC, area under the receiver operating characteristic curve; CSMC, Cedars-Sinai Medical Center; SUMC, Stanford University Medical Center; ME, mid-esophageal; AV, aortic valve; SAX, short axis; TG, trans-gastric; LV, left ventricular.

Table 3 View classification model performance on the internal (CSMC) hold-out test set and the external (SUMC) test set.

Clustering analysis suggests our AI model identifies a meaningful embedding space representing the various TEE views from heterogeneous video input, and that this embedding generalizes across the two institutions (Fig. 3). Model performance was similar for standard black-and-white 2D B-mode TEE videos (micro-averaged AUC = 0.902) and for videos incorporating color flow Doppler information (micro-averaged AUC = 0.877) (Fig. 4); these analyses were performed on a combination of randomly selected internal and external test videos due to the overall low prevalence of color flow Doppler videos in our data sets.
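
The clustering analysis can be sketched as follows, assuming embeddings are taken from the network's pooled features just before the final classification layer; the `clips` placeholder stands in for the preprocessed test videos:

```python
import torch
import torch.nn as nn
from sklearn.manifold import TSNE
from torchvision.models.video import r2plus1d_18

# Trained view classifier (trained weights would be loaded here); replacing the
# classification head with the identity exposes the 512-dim penultimate features.
encoder = r2plus1d_18(num_classes=9)
encoder.fc = nn.Identity()
encoder.eval()

# `clips` is assumed to be a list of preprocessed (3, 16, 112, 112) tensors.
clips = [torch.randn(3, 16, 112, 112) for _ in range(100)]

with torch.no_grad():
    feats = torch.cat([encoder(c.unsqueeze(0)) for c in clips], dim=0)

# Project the embeddings to two dimensions; points can then be scatter-plotted,
# colored by labeled view and by site (CSMC vs. SUMC), as in Fig. 3.
coords = TSNE(n_components=2, perplexity=30, random_state=0).fit_transform(feats.numpy())
```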

Figure 3

Clustering analysis showing the ability to distinguish among standard TEE views. t-SNE clustering analysis of input images demonstrates that meaningful representations of standard TEE views cluster appropriately together; that is, images are sorted into groups that reflect the standard TEE view classes. The embedding representation is consistent across CSMC and SUMC, suggesting robustness and generalizability of the approach. TEE, transesophageal echocardiography; t-SNE, t-Distributed Stochastic Neighbor Embedding; CSMC, Cedars-Sinai Medical Center; SUMC, Stanford University Medical Center.

Figure 4

Micro-averaged receiver operating characteristic curves for model predictions on the subset of test videos with color flow Doppler versus the subset without color flow Doppler. This evaluation was performed using a combination of the internal and external test sets due to the low prevalence of color flow Doppler videos in our data sets.

Discussion

Our deep learning model was able to classify the eight most commonly used intraoperative and intraprocedural TEE views with high accuracy across a wide range of clinical and echocardiographic characteristics. Our videos included patients undergoing many different types of open cardiac surgery and transcatheter procedures, representing a highly diverse mix of anatomic pathology and differences in practice patterns across two major cardiology and cardiac surgery institutions. Images also varied with respect to resolution, sizing and focus of the field of view, and the use of color flow Doppler. Model performance was consistently high across this range of findings in both the internal hold-out and external test data sets, demonstrating the generalizability of our view classifier in real-world clinical contexts.

Our study represents the first application of a machine learning strategy to TEE video data acquired during the course of standard clinical care for open cardiac surgeries and transcatheter procedures. In prior work, the application of AI strategies to TEE has included focused image segmentation tasks and the automation of specific quantitative measurements. For example, groups such as Carnahan et al.13 and Andreassen et al.14 demonstrated the ability to identify the mitral valve apparatus from highly curated three- and four-dimensional mid-esophageal-level TEE acquisitions with the mitral valve centered in the images. Tasken et al. automatically quantified the mitral annular plane systolic excursion (MAPSE) using a pipeline that included a view classification task prior to quantification, highlighting the utility and necessity of TEE view classification for downstream machine learning tasks. Thalappillil et al.21 and Li et al.22 used quantitative measurements derived from TEE videos, rather than the TEE image data itself, as the input variables or output labels in their machine learning algorithms. Our group is the first to apply machine learning techniques to clinically acquired intraoperative and intraprocedural TEE image data and the first to add structure to the data contained within these comprehensive clinical TEE exams.

Aside from the limited work that has been done with TEE videos, the large majority of prior AI-driven, echocardiography image-based studies have focused on TTE videos. For example, it has been demonstrated that machine learning algorithms trained on TTE videos are able to classify standard TTE views5,6,7,8, identify cardiac structures, estimate cardiac function, make accurate diagnoses, identify phenotypic information that is otherwise not easily recognized by a human observer, and predict clinical outcomes9,10,11,12,23,24. The major advantage of working with TTE videos over TEE videos for machine learning tasks is that the TTE clinical workflow inherently creates structure for TTE data. Compared to the clinical workflow for TEE imaging, the imaging pipeline for TTE video acquisition, interpretation, and reporting is more standardized and consistent across studies and includes many image annotations and quantitative measurements. The ability to leverage these integrated annotations and measurements has reduced the need for laborious post hoc image annotation and has facilitated the swift adoption of machine learning for TTE data9.

In contrast to the structured nature of TTE data, intraoperative and intraprocedural TEE data are fundamentally more varied and relatively unstructured. The cardiac surgery operating rooms and structural heart disease procedural suites where TEE clinical exams are performed are highly dynamic environments. As a result, the TEE exams performed in these settings often vary in their acquisition sequences and inconsistently include image annotations or quantitative measurements. Moreover, intraoperative and intraprocedural TEE exams are subject to significant variation within and across studies, as they are acquired over the course of significant changes in clinical conditions, including changes in cardiac loading, on- versus off-cardiopulmonary bypass (CPB) status, pre- versus post-surgical intervention, pharmacologic interventions, external cardiac pacing, and the use of other mechanical circulatory support devices. The application of AI-driven strategies to intraoperative and intraprocedural TEE imaging has primarily been limited by this relatively unstructured nature of TEE data. Our present study represents the first attempt at creating structure for clinically acquired intraoperative and intraprocedural TEE data sets with a machine learning-based view classification algorithm. Additionally, our choice of a video-based model rather than a still image-based model helps to regularize much of the challenging natural variation that occurs with TEE.

Even though multiple AI-driven TTE view classification studies have been conducted in the past5,6,7, the ability to directly apply these tools to TEE data is limited. While TTE and TEE are both ultrasound imaging modalities that capture cardiac structure and function, TTE views and TEE views are not entirely analogous16,25. TTE images are acquired from the anterior (trans-thoracic) and left-lateral aspect of the patient, while TEE images are acquired from the posterior (trans-esophageal) aspect of the patient. Additionally, there are differences in probe manipulation and ultrasound beam rotation between the two modalities. The relationship between TTE and TEE images is therefore more nuanced than a simple one-to-one vertical flip. As a result, the advantages that allowed for the accelerated application of machine learning strategies to structured TTE data could not be automatically applied to TEE data.

The major limitation of our study is the class imbalance present in our data sets. Guidelines established by the American Society of Echocardiography and the Society of Cardiovascular Anesthesiologists identify twenty-eight different TEE views necessary to complete a comprehensive intraoperative multi-plane TEE exam16. In actual clinical practice, individual patient factors, anatomic variations and pathology, and time constraints in the cardiac surgery operating rooms and structural heart procedural suites can preclude the acquisition of all twenty-eight views. Oftentimes an intraoperative TEE exam will include varying sequences, non-standard views, and multiple missing views. This clinical reality was reflected in our random selection of TEE videos from both CSMC and SUMC. Many of the twenty-eight standardized TEE views were inconsistently captured and did not yield enough examples for adequate model training, validation, and testing. The most frequently captured views across all randomly selected TEE studies were the ME 4-Chamber View, the ME Long Axis View, the TG Left Ventricular Short Axis View, and the ME Aortic Valve Short Axis View, which is consistent with real-world clinical settings.

While our view classification model performed well across all labeled views, it showed particularly good performance for the views with the most training data and the views with the most visually distinct anatomic features (namely, the ME 4-Chamber View, the ME Long Axis View, the TG Left Ventricular Short Axis View, and the ME Aortic Valve Short Axis View). With respect to clinical significance and plausibility, it is not surprising that the views with the most training data are also the views with the most distinct features. The goal of intraoperative echocardiography is to support real-time surgical and procedural decision-making, which requires the efficient acquisition of a complementary set of images that comprehensively captures cardiac structure and function4. To this end, the highest-yield approach is to focus on a limited set of views that illustrate the relationships among as many significant structures as possible. Collectively, the ME 4-Chamber View, the ME Long Axis View, the TG Left Ventricular Short Axis View, and the ME Aortic Valve Short Axis View efficiently capture the large majority of the information needed by intraoperative and intraprocedural physicians. These were therefore the most frequently encountered views in our random selection of TEE videos and ultimately the highest-performing classes in our view classification model. Our model was least accurate for the ME Left Atrial Appendage View due to an inadequate number of examples of this view in our random selection of TEE videos. In the future, we will continue to update our view classification model, with a particular focus on increasing the number of labeled examples for rarer views.

To maximize the number of examples per view for model training and to build a view classifier reflective of real-world clinical contexts, four of our eight labels (the ME 2-Chamber, ME 4-Chamber, ME AV SAX, and ME Long Axis) represent pooled categories. Each pooled category combines two standardized views that vary only slightly with regard to omniplane angle, field of view depth, or sector width, but otherwise capture many of the same key structures and anatomic relationships (Fig. 5). The “ME 2-Chamber View” class included ME 2-chamber and ME mitral commissural videos; the “ME 4-Chamber View” class included ME 4-chamber and ME 5-chamber videos; the “ME AV SAX View” class included ME AV SAX and ME right ventricular (RV) inflow-outflow videos; and the “ME Long Axis View” class included ME long axis and ME AV long axis videos. We also chose to generalize two categories (the TG LV SAX and Aortic Views) in order to increase the sample sizes of these classes (Fig. 5). Any image of the left ventricle in short axis, regardless of level (basal, mid-papillary, or apical), was classified as “Trans-Gastric Left Ventricular Short Axis View.” Similarly, any dedicated image of the aorta, regardless of level or axis orientation, was classified as “Aortic View.” Variation in patient anatomy and dynamic clinical needs often leads to the acquisition of TEE images that do not completely fit the criteria for a specific view class; it is not uncommon to acquire an image that cannot be precisely categorized as a single view but instead falls between two views. Combining categories with overlapping anatomic features for our view classification model therefore mirrors real-world clinical practice.

Figure 5

Representation of variation contained within classified views. Six of the eight TEE view class labels represent pooled or generalized categories, reflecting the high degree of anatomical and visual overlap that occurs among the twenty-eight standardized TEE views recommended by the American Society of Echocardiography and the Society of Cardiovascular Anesthesiologists. The images are 2-dimensional still frames sampled from the video data used in model training. (A) The “ME 2-Chamber View” class included ME 2-chamber (left) and ME mitral commissural (right) videos. (B) The “ME 4-Chamber View” class included ME 4-chamber (left) and ME 5-chamber (right) videos. (C) The “ME AV SAX View” class included ME AV SAX (left) and ME RV inflow-outflow (right) videos. (D) The “ME Long Axis View” class included ME long axis (left) and ME AV long axis (right) videos. (E) The “TG LV SAX View” class included short axis videos of the LV at all levels, such as the mid-papillary (left) and basal (right) levels. (F) The “Aortic View” class included videos of the aorta at all levels, such as the descending thoracic SAX (left) and upper esophageal aortic arch LAX (right) levels. TEE, transesophageal echocardiography; ME, mid-esophageal; AV, aortic valve; SAX, short axis; LAX, long axis; RV, right ventricular; LV, left ventricular; TG, trans-gastric.

In the present study, we demonstrate that our deep learning model can accurately classify standardized TEE views, which will facilitate further downstream deep learning analyses for intraoperative and intraprocedural TEE imaging. Previous work has already shown that intraoperative TEE imaging actively informs surgical decision-making26,27 and is associated with improved clinical outcomes after cardiac surgery28,29. The development of AI-driven models based on intraoperative TEE images has the potential to further enhance the value of echocardiography in the perioperative and periprocedural period by improving the ability to diagnose cardiac surgical diseases and complications, diagnose the underlying etiology of varied hemodynamic states, and predict clinical outcomes in the immediate and long-term postoperative periods.

Conclusion

In summary, we show that an intraoperative and intraprocedural TEE-based deep learning model can accurately identify standardized TEE views, the first step in the AI interpretation of TEE images. Our study represents an important first step toward the automated evaluation of intraoperative and intraprocedural echocardiography imaging and the leveraging of deep learning strategies to advance patient care.