Introduction

Pancreatic cancer is one of the most aggressive tumor types: it accounts for only 3% of all cancers in the United States but 7% of all cancer-related deaths1. Radiation therapy, along with chemotherapy, plays a vital role in local tumor control for locally advanced pancreatic cancer2. Radiation treatment planning for pancreatic cancer is often complex, with tight dose constraints3, because the pancreas is surrounded by highly radiosensitive, serial organs at risk (OARs) (duodenum, stomach, and small bowel) that require maximum dose constraints. OAR delineation in pancreatic and liver cancer is also time consuming4. At our cancer center, pancreas radiation treatment requires delineation of 8 OARs: stomach, duodenum, large bowel, small bowel, liver, spleen, left kidney, and right kidney. The average time spent on OAR delineation has been shown to exceed 20 minutes5. Reproducibility is a major challenge for upper abdominal OAR delineation: experts often produce conflicting OAR delineations for the same patient, especially at the gastroesophageal junction6. Delineation of bowel structures (duodenum, large bowel, and small bowel) is also susceptible to interobserver variability5,7. Margins reserved for motion management8 and poor soft tissue contrast at the small/large bowel border9 make establishing the ground truth for bowel structures difficult. In clinical practice, normal tissues are often not contoured on slices more than approximately 1.0 cm beyond the superior and inferior extent of the planning target volume (PTV). This is generally acceptable for normal tissues with maximum dose objectives, where the whole volume is not needed for dose optimization10, but the practice introduces interobserver variability and further complicates establishing the ground truth.

Deep learning-based tools have achieved expert-level performance when trained with large datasets11,12,13,14,15. They have also been shown to reduce contouring inconsistency in clinical trials and to provide more accurate dose metrics16. Among deep learning-driven approaches, U-Net-derived models dominate abdominal organ segmentation tasks17,18 for organs with abundant public datasets (liver, spleen, and kidney). For the serial OARs (duodenum, stomach, and small bowel) in pancreatic cancer treatment, a few U-Net-based models have been developed on private datasets and achieved better results than alternative approaches such as fully convolutional network-based models19. Wang et al. explored a multi-planar fusion approach with 2D U-Nets predicting on axial, sagittal, and coronal views9. Liu et al. utilized a 3D self-attention U-Net to segment the OARs in pancreatic radiotherapy20 and achieved state-of-the-art performance. These specialized U-Net models from large academic institutions required extensive research expertise to develop. In addition, they required at least 80 sets of complete patient contours for training and validation alone. Because of the aforementioned inconsistencies in clinical contours, extensive curation by experts is required before contours qualify for deep learning training. This expensive, time-consuming process21 hinders the development and adoption of deep learning models outside of large academic institutions.

Recently, the self-configuring nnU-Net framework22 has shown promising results in abdominal organ segmentation, winning two of the five tasks in the CHAOS challenge18. The framework systematically configures U-Nets on the basis of the voxel spacing distribution, median shape, and intensity distribution of the training CT images. It is also highly data-efficient owing to robust data augmentation. This framework was therefore chosen as our candidate for automating upper-abdominal OAR segmentation.

In summary, upper abdominal OAR contouring is time-consuming and susceptible to interobserver variability. Deep learning-based auto-segmentation provides a fast and consistent alternative to manual contouring. However, the existing literature suggests that specialized U-Nets and large datasets are essential to a robust deep learning auto-segmentation tool. These requirements confine the development of auto-segmentation tools to large academic centers with research expertise. In this study, we proposed using the streamlined nnU-Net framework to customize three-dimensional (3D) U-Nets that delineate eight OARs (stomach, duodenum, large bowel, small bowel, liver, spleen, left kidney, and right kidney) simultaneously on contrast-enhanced and non-contrast-enhanced CT images. We hypothesized that with a small but consistent training set, the standard U-Net architecture could yield clinically deployable models for upper-abdominal OAR segmentation. This study demonstrated the clinical utility of the automatically generated segmentations through a robust evaluation comprising multi-observer rating of individual contours on 75 abdominal CT scans as well as quantitative evaluation on 30 CT scans. Our approach provides an easy-to-implement, data-efficient alternative for automating the clinical workflow of pancreatic radiation treatment, including adaptive radiation therapy. Compared with the existing literature, our method used the least amount of data while achieving clinically acceptable qualitative results and competitive quantitative results. In addition, we examined the organ-by-organ segmentation performance gain as the number of patients in the training dataset increased, to provide insight into the amount of data required to train robust upper abdominal segmentation models for clinics interested in developing their own tools. We will release the entire training and testing dataset on The Cancer Imaging Archive (TCIA) as an additional resource for future abdominal organ auto-segmentation development.

Materials and methods

Imaging data

Seventy patients with pancreatic cancer treated at The University of Texas MD Anderson Cancer Center from 2017 to 2020 were selected under an institutional review board (IRB)-approved protocol. CT images were acquired with the breath-hold technique on Philips Brilliance Big Bore (Philips Healthcare, Best, The Netherlands) CT simulators. CT scans had pixel sizes ranging from 0.98 to 1.04 mm and slice thicknesses from 1 to 2.5 mm. Patients were scanned from 5 cm above the diaphragm to the iliac crest with intravenous contrast injection. The clinical OAR contours included the duodenum, small bowel, large bowel, stomach, liver, spleen, left kidney, and right kidney.

Data curation and manual segmentation

The duodenum, small bowel, and large bowel were manually delineated under physician supervision to increase consistency in normal tissue definition for these organs. To provide sufficient contextual information for the 3D U-Net models, bowel structures were extended along the z-axis and contoured throughout the entire scan. Stomach contours were trimmed to eliminate motion management margins. Liver, spleen and kidney contours were edited to ensure anatomical accuracy. All ground truth contours were reviewed and approved by a radiation oncologist. Forty sets of contours were randomly selected for training and validation. The remaining thirty sets of contours were reserved as the held-out test set.

Data preprocessing

To segment all OARs simultaneously, the organ labels were compiled into a single segmentation map. Where organ borders overlapped, labels were assigned in the priority order duodenum, small bowel, stomach, large bowel, liver, spleen, and kidneys: organs with the most stringent dose constraints were prioritized and overwrote organs with less stringent dose constraints. All images were resampled to a 0.98 mm × 0.98 mm pixel size and a 2.5 mm slice thickness.
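
As an illustration of this compilation step, the following is a minimal sketch assuming per-organ binary masks stored as NumPy arrays; the label IDs and function name are hypothetical, and the resampling step is omitted.

```python
import numpy as np

# Illustrative label IDs (not our clinical values). The list is ordered from
# lowest to highest priority: organs painted later overwrite earlier ones
# wherever contours overlap, so the most dose-critical organs win.
PRIORITY = ["right_kidney", "left_kidney", "spleen", "liver",
            "large_bowel", "stomach", "small_bowel", "duodenum"]
LABEL_ID = {name: i + 1 for i, name in enumerate(PRIORITY)}  # 0 = background

def compile_label_map(masks: dict) -> np.ndarray:
    """Merge per-organ binary masks into one multi-label segmentation map."""
    reference = next(iter(masks.values()))
    label_map = np.zeros(reference.shape, dtype=np.uint8)
    for name in PRIORITY:  # paint lowest-priority organs first
        label_map[masks[name] > 0] = LABEL_ID[name]
    return label_map
```

Painting the masks in ascending priority order guarantees that, at any overlapping voxel, the label of the organ with the most stringent dose constraint is retained.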

Model training

The self-configuring nnU-Net framework22 was employed to customize 3D U-Nets for our dataset. 3D patches of image-label pairs were used for training, with a patch size of 192 × 192 × 48 and a batch size of 2. The network depth was dynamically optimized by the nnU-Net framework to ensure sufficient depth to fully exploit the large patch size. The resulting U-Net architecture generated by the nnU-Net framework is shown in Fig. 1.
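
For context, training under the nnU-Net framework requires no custom architecture code. The following is a sketch of the workflow assuming nnU-Net v1's command-line interface; the task ID, task name (Task501_AbdominalOAR), and paths are illustrative.

```bash
# Fingerprint the dataset and auto-configure preprocessing and architecture
nnUNet_plan_and_preprocess -t 501 --verify_dataset_integrity

# Train one 3D full-resolution U-Net per cross-validation fold (0-4)
for FOLD in 0 1 2 3 4; do
    nnUNet_train 3d_fullres nnUNetTrainerV2 Task501_AbdominalOAR $FOLD
done

# Predict; by default nnU-Net ensembles all trained folds
nnUNet_predict -i /path/to/test_images -o /path/to/predictions \
    -t Task501_AbdominalOAR -m 3d_fullres
```

The planning step derives the patch size, resampling targets, and network depth from the dataset fingerprint, corresponding to the configuration shown in Fig. 1.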

Figure 1

U-Net architecture customized by the nnU-Net framework based on the training dataset.

The loss function was the sum of a Dice similarity coefficient (DSC) loss and a cross-entropy loss. Training and testing were performed on NVIDIA Tesla V100 GPUs with 32 GB of VRAM, and training was stopped after 1000 epochs. To fully exploit the small dataset, five-fold cross-validation was used within the 40-patient dataset: in each fold, 32 patients were used for training and eight for validation (an 80-20 split). Five 3D U-Net models were trained, and the final prediction was produced by an ensemble of all five cross-validation models. Training time for the U-Net ensemble was 36 hours when the individual models were trained in parallel; inference with the ensemble took 8 minutes per patient on average.
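
As a simplified illustration of this training objective, the following PyTorch sketch combines a soft Dice loss with cross-entropy; nnU-Net's actual implementation adds details such as deep supervision, so this approximates rather than reproduces the framework's loss.

```python
import torch
import torch.nn.functional as F

def dice_ce_loss(logits: torch.Tensor, target: torch.Tensor,
                 eps: float = 1e-5) -> torch.Tensor:
    """Soft Dice + cross-entropy loss (simplified sketch).

    logits: (B, C, D, H, W) raw network outputs for C classes
    target: (B, D, H, W) integer labels, 0 = background
    """
    ce = F.cross_entropy(logits, target)

    probs = torch.softmax(logits, dim=1)
    one_hot = F.one_hot(target, num_classes=logits.shape[1])
    one_hot = one_hot.permute(0, 4, 1, 2, 3).float()  # -> (B, C, D, H, W)

    dims = (0, 2, 3, 4)  # aggregate over batch and spatial dims per class
    intersection = (probs * one_hot).sum(dims)
    denominator = probs.sum(dims) + one_hot.sum(dims)
    dice_per_class = (2 * intersection + eps) / (denominator + eps)

    # Minimizing (1 - Dice) alongside cross-entropy balances region overlap
    # against voxel-wise classification accuracy.
    return ce + (1.0 - dice_per_class.mean())
```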

To evaluate performance gains as the training dataset grew, additional model ensembles were trained on escalating numbers of patients. Subsets of 10, 15, 20, 25, 30, and 35 patients were randomly selected. The training-validation split for each subset was also 80–20, identical to that of the final model ensemble. These six additional 3D U-Net ensembles were trained under the nnU-Net framework with identical training procedures.
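
A minimal sketch of the subset selection is shown below, assuming independently drawn subsets; the function name and fixed seed are illustrative.

```python
import random

def escalating_subsets(patient_ids, sizes=(10, 15, 20, 25, 30, 35), seed=42):
    """Draw a random patient subset for each escalating size.

    Each subset is then split 80-20 into training and validation,
    mirroring the procedure described above.
    """
    rng = random.Random(seed)  # fixed seed for reproducibility (illustrative)
    return {n: rng.sample(list(patient_ids), n) for n in sizes}
```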

Quantitative evaluation

The model ensembles trained on the various training-set sizes were evaluated on the held-out test set of thirty patients. Performance was quantified by the 3D DSC, 95% Hausdorff distance (HD95), and mean surface distance (MSD) between the predicted and ground-truth contours.
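
These metrics can be computed per organ from binary masks, for example with the medpy library, as sketched below; equating medpy's average symmetric surface distance (assd) with the MSD reported here is an assumption.

```python
import numpy as np
from medpy.metric.binary import dc, hd95, assd

def evaluate_organ(pred: np.ndarray, gt: np.ndarray,
                   spacing=(2.5, 0.98, 0.98)) -> dict:
    """DSC, 95% Hausdorff distance, and mean surface distance for one organ.

    pred, gt: binary masks ordered (z, y, x)
    spacing: voxel size in mm, matching the resampled grid used here
    """
    return {
        "DSC": dc(pred, gt),
        "HD95_mm": hd95(pred, gt, voxelspacing=spacing),
        "MSD_mm": assd(pred, gt, voxelspacing=spacing),  # symmetric variant
    }
```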

Qualitative evaluation

An additional 75 patients simulated under the breath-hold protocol were randomly selected from the clinical database as an independent qualitative test set. For patients who are suitable for CT imaging with a contrast agent, our center acquires two non-contrast-enhanced and three to four contrast-enhanced CT images during simulation. For each patient, one contrast-enhanced and one non-contrast-enhanced CT image were randomly selected, resulting in a total of 150 CT images. The automatically generated contours on both contrast-enhanced and non-contrast-enhanced images were visually evaluated and scored on the five-point Likert scale shown in Table 1 by five radiation oncologists from three institutions and two countries. Each image was scored once by one radiation oncologist, and each organ was scored individually.

Table 1 Likert scale used by physicians to evaluate contours generated on contrast-enhanced and non-contrast-enhanced CT images.

Ethical approval

CT images used in this study were acquired during routine treatment. This study was approved by the University of Texas MD Anderson Cancer Center Institutional Review Board (IRB 4), which included a waiver of informed consent, and all methods were performed in accordance with the relevant guidelines and regulations.

Results

Quantitative evaluation

A summary of the quantitative evaluation (n = 30) is provided in Table 2. All automatically generated contours had a mean DSC of 0.80 or higher when compared with the ground-truth contours. Solid organs (liver, spleen, and kidneys) achieved mean DSC values ranging from 0.96 to 0.97. Radiosensitive hollow organs (small bowel, large bowel, and stomach) achieved mean DSC values ranging from 0.89 to 0.92, and the duodenum achieved a mean DSC of 0.80. For the distance metrics, the solid organs had mean HD95 values ranging from 2.21 to 2.51 mm and mean MSDs ranging from 0.61 to 1.07 mm; the radiosensitive hollow organs had mean HD95 values ranging from 4.77 to 7.77 mm and mean MSDs ranging from 1.23 to 1.99 mm. The duodenum had a mean HD95 of 12.34 mm and a mean MSD of 1.68 mm.

Table 2 Mean Dice similarity coefficient (DSC), 95% Hausdorff distance (HD95), and mean surface distance (MSD) between ground truth and prediction results from our tool on contrast-enhanced CT images.

DSC boxplots for all organs are shown in Fig. 2. Auto-segmentation performance was more variable for hollow organs than for solid organs. Outliers in the small bowel and large bowel auto-segmentations were often caused by misidentification of the small/large bowel in inferior regions of the CT scans outside of the treatment fields. Low-DSC duodenum cases were often caused by disagreements at the stomach/duodenum and duodenum/jejunum borders.

Figure 2

Box-and-whisker plots of the Dice similarity coefficient (DSC) between ground-truth and automatically generated contours on contrast-enhanced CT images. The central line represents the median value, the borders of the box represent the 25th and 75th percentiles, and outliers are represented by diamond markers.

To determine whether 40 patients were sufficient for optimal model performance, the mean DSC for each organ was also examined as a function of the number of training patients; the results are plotted in Fig. 3. The mean DSC increased as the size of the training dataset increased, and the mean DSCs of all organs tended to converge as the number of patients approached 40.

Figure 3

Mean DSC values between automatically generated and ground-truth contours increased as the number of patients in the training dataset increased. The shaded band represents the corresponding standard deviation of the individual DSC values.

Qualitative evaluation

The results of the physicians’ qualitative evaluations are shown in Table 3. Among the non-contrast-enhanced CT images, 85.3% of the duodenum contours, 92.0% of the small bowel contours, 93.3% of the stomach contours, and more than 95% of the other organ contours received a score of 3 or greater, indicating that these contours required only minor edits from physicians. More than 50% of the duodenum, small bowel, large bowel, and stomach contours, as well as more than 85% of the spleen and kidney contours, received a score of 4 or above.

Table 3 Qualitative scores for contours generated on contrast-enhanced and non-contrast-enhanced CT images of 75 randomly selected patients.

Contour scores were slightly higher for auto-segmentations on contrast-enhanced CTs: 89.3% of the duodenum contours, 94.7% of the small bowel contours, and more than 95% of the other organ contours were scored 3 or greater. More than 60% of the duodenum, small bowel, large bowel, and stomach contours and more than 90% of the spleen and kidney contours scored 4 or greater. Examples of automatically generated duodenum, stomach, and small bowel contours scored as 3, 4, and 5 are shown in Fig. 4.

Figure 4

Representative contours of organs scored on the Likert scale as 5, 4, and 3 (top to bottom) by physicians. The ground-truth contours are shown in purple and the automatically generated contours in cyan in all images. The arrow indicates a segment of under-contoured duodenum that required minor edits.

Discussion

We have developed a deep learning-based tool for accurate and robust upper-abdominal OAR auto-segmentation. Our tool simultaneously segments the duodenum, large bowel, small bowel, stomach, liver, spleen, and kidneys. It performed well in both quantitative and qualitative assessments, which were conducted on randomly selected held-out test patients (30 and 75 patients, respectively). The qualitative assessment was conducted by five radiation oncologists from three different institutions. The tool achieved performance acceptable for clinical deployment even though it was trained and validated with only 40 patients. Based on these results, we have implemented this auto-contouring system clinically at MD Anderson Cancer Center. In the future, we will offer it as part of the Radiation Planning Assistant23 (rpa.mdanderson.org) to make it available to radiation oncology clinics in low- and middle-income countries.

Deep learning-based auto-segmentation approaches typically require large, high-quality segmented datasets to achieve optimal performance24. In clinical scenarios, the number of high-quality labeled images is limited25, and creating contours suitable for deep learning training requires significant time and expertise21,26. A number of self-supervised deep learning approaches that generate artificial data have been proposed27,28,29, but they require technical expertise available only at large academic centers. Our findings offer an affordable, easy-to-implement approach to creating auto-segmentation tools when public datasets are not available. The self-adaptive nnU-Net framework provided a standardized platform that allowed us to customize 3D U-Net ensembles maximizing the performance of the U-Net architecture. The qualitative evaluation provides evidence of our tool's clinical utility. Automatically generated contours that received a Likert score of 3 or above required only minor edits, and physicians deemed these contours beneficial to their segmentation workflow. Among the 75 independent test patients, over 90% of the automatically generated contours received a Likert score of 3 or greater for most organs. Even for organs with poor soft tissue boundaries, such as the duodenum, 89.7% of contours required only minor edits for clinical use. These results show that with a dataset of 40 patients, a standard 3D U-Net architecture can deliver automatically generated contours suitable for clinical deployment.

The clinical context of segmentation errors differentiated acceptable contours (Likert ≥ 4) from contours that needed minor edits (Likert = 3). Small contour errors can have significant clinical relevance. For the duodenum contour scored as a 3 in Fig. 4, the tool under-contoured a portion of the duodenum, as shown by the arrow. This error was critical to patient safety because the affected segment of the duodenum was medially located and close to the treatment target. Although most of the duodenum was properly contoured, the generated contour was scored a 3 instead of a 4; the edit required from physicians, however, was marginal. In other cases, physicians were less concerned about absolute anatomical accuracy. For example, interobserver variability can be significant at the border of the stomach and duodenum, where the anatomical landmarks distinguishing the two are subtle and a clear border is often lacking. Although the generated contour in Fig. 5 deviated drastically from the ground truth, it was scored a 4 and deemed acceptable for treatment planning because the duodenum and stomach are often optimized with the same maximum dose constraint (Dmax < 28 Gy).

Figure 5

Representative ground-truth (left) and automatically generated (right) contours of a patient’s duodenum and stomach. The contours differed significantly, but because the duodenum and stomach are often optimized using the same dose constraint (i.e., Dmax < 28 Gy), the automatically generated contours were scored a 4 and deemed acceptable for treatment planning.

Individual stylistic preferences differentiated use-as-is contours (Likert = 5) from acceptable contours (Likert = 4). These preferences were most prominent at the junction of the duodenum and jejunum (contoured as part of the small bowel). The superior border of the fourth part of the duodenum has no visible boundary features on CT images. In Fig. 6, the automatically generated contour was scored a 4: the ground-truth duodenum contour extended more superiorly than the automatically generated contour in the region indicated by the arrow. The varying cranial ends of the duodenum contours were deemed stylistic differences, as the physicians were uncertain about the anatomical ground truth in this region. Since the duodenum and small bowel are often optimized with the same maximum dose constraint (Dmax < 28 Gy), the physicians decided that these differences had limited impact on treatment planning.

Figure 6

Representative ground-truth (left) and automatically generated (right) contours of a patient’s duodenum and small bowel (jejunum). The ground truth is ambiguous at the transition from the duodenum to the small bowel (jejunum), and the deviation from the ground truth was deemed a stylistic difference.

Our quantitative results are comparable to those of state-of-the-art models trained with datasets of 80 patients or more for most organs. The DSC scores of our tool for the small bowel, large bowel, stomach, spleen, liver, and kidney contours were within 0.01 of the current 3D state-of-the-art model (Liu et al.), as shown in Table 4. The MSDs were also comparable to or smaller than those of the 3D state-of-the-art model, as shown in Table 5. Our tool, however, was trained and validated with a much smaller dataset of 40 patients, suggesting that our approach is more data-efficient than the state-of-the-art approach. As the data curation process is known to be time-consuming and expensive, this efficiency should allow easier development and adoption in the clinic.

Table 4 Dice similarity coefficient comparison between our tool and other state-of-the-art upper-abdominal auto-segmentation models.
Table 5 Mean surface distance comparisons between our tool and other state-of-the-art upper-abdominal auto-segmentation models.

Studies have suggested that 3D models demand too many parameters and require a large training dataset30 to converge. Previous state-of-the-art approaches, such as the organ-attention 2D deep networks with reverse connections by Wang et al., were developed to segment 2D slices along the axial, sagittal, and coronal views to reduce the number of trainable parameters9. Our tool outperformed this 2D-based multi-planar fusion approach in DSC for the duodenum, small bowel, and large bowel, as shown in Table 4, and achieved lower MSDs for the small bowel, large bowel, stomach, and liver, as shown in Table 5. For structures that span the z-axis, 3D models are better equipped than 2D-based multi-planar fusion models because they capture anatomical context along the z-axis. Since only 40 patients were used for training and validation, our tool's 3D approach also appears more data-efficient than the 2D multi-planar fusion approach.

The progression of model performance with increasing patient numbers (Fig. 3) clarifies why our quantitative results were comparable to those of state-of-the-art models. For challenging hollow structures such as the stomach and duodenum, the 3D U-Net models initially gained performance as the patient number increased, and the DSC curves began to converge as the training set approached 25 patients. Similar trends were observed for the large bowel and small bowel. While the mean DSCs converged, the standard deviations for the stomach, large bowel, and small bowel continued to decrease, indicating that predictions were less variable with a larger training/validation dataset. For solid organs such as the spleen, liver, and kidneys, DSC values were above 0.90 even with only 10 patients. These data provide insight for clinics or individuals interested in developing their own 3D U-Net models for upper-abdominal organ segmentation; when creating auto-segmentation tools with a limited annotation budget, our findings may serve as a guideline for budget allocation.

Our tool was developed and tested on ground-truth labels delineated according to our institution's implementation of the RTOG guidelines. While five radiation oncologists from three institutions conducted the qualitative evaluation, the test patients all came from the same institution. In our experience31, model performance might suffer when test patients come from other institutions with different imaging protocols and acquisition and reconstruction parameters. In that case, a small training sample might not be sufficient to guarantee strong performance across varying patient cohorts. Further evaluation is needed to assess the model ensemble's performance on different patient populations.

For future work, automatic quality assurance of the generated contours, i.e., flagging clinically unusable contours, would be a crucial addition to our automation tool. In addition, our center utilizes a CT-on-rails image-guided system for pancreatic radiation treatment. While our tool exhibited robust qualitative results on non-contrast-enhanced CT images, future work will include dose accumulation studies using the automatically generated contours, which could pave the way for adaptive radiation therapy in pancreatic radiation treatment.

Conclusions

We proposed a simple but effective approach for developing a deep learning-based model for upper-abdominal OAR segmentation. Using only 40 patients, we trained an nnU-Net ensemble whose automatically generated contours were clinically acceptable on both contrast-enhanced and non-contrast-enhanced CT images. The results of the presented analysis led to the clinical deployment of this tool.