Deep learning-based prediction of post-pancreaticoduodenectomy pancreatic fistula

Lee, Woohyung; Park, Hyo Jung; Lee, Hack-Jin; Song, Ki Byung; Hwang, Dae Wook; Lee, Jae Hoon; Lim, Kyongmook; Ko, Yousun; Kim, Hyoung Jung; Kim, Kyung Won; Kim, Song Cheol

doi:10.1038/s41598-024-51777-2

Published: 01 March 2024

Scientific Reports volume 14, Article number: 5089 (2024)

Abstract

Postoperative pancreatic fistula is a life-threatening complication with an unmet need for accurate prediction. This study aimed to develop preoperative artificial intelligence-based prediction models. Patients who underwent pancreaticoduodenectomy were enrolled and stratified into model development and validation sets according to whether surgery was performed between 2016 and 2017 or in 2018, respectively. Machine learning (ML) models based on clinical and body composition data and deep learning (DL) models based on computed tomographic data were developed and combined by ensemble voting, and the final models were selected and compared with an earlier model. Among the 1333 participants (training, n = 881; test, n = 452), postoperative pancreatic fistula occurred in 421 (47.8%) and 134 (31.8%) participants and clinically relevant postoperative pancreatic fistula occurred in 59 (6.7%) and 27 (6.0%) participants in the training and test datasets, respectively. In the test dataset, the area under the receiver operating characteristic curve [AUC (95% confidence interval)] of the selected preoperative model for predicting all and clinically relevant postoperative pancreatic fistula was 0.75 (0.71–0.80) and 0.68 (0.58–0.78), respectively. The ensemble model showed better predictive performance than the individual ML and DL models.

Introduction

Postoperative pancreatic fistula (POPF), a major complication of pancreatectomy, occurs in 10–40% of patients and increases morbidity and mortality despite several preventive measures^1,2. Several risk prediction models have been developed to identify high-risk patients in the perioperative period, and the risk factors in these models include body mass index (BMI), pancreatic softness, and pancreatic duct size^3,4,5,6,7. These models are characterized by simplicity and convenience for quick bedside use. Meanwhile, additional factors related to POPF have been reported more recently. For example, anthropomorphic features, including the proportions of subcutaneous fat and skeletal muscle, which could reflect fatty pancreas or obesity, are potentially associated with POPF⁸. Several perioperative factors, including diabetes, neoadjuvant chemotherapy, pancreatic steatosis, and remnant pancreatic volume, have also been associated with POPF⁹. Earlier models focused solely on the essential factors for simplicity, and few reports have included the newly identified risk factors. A comprehensive model incorporating both classical and newly discovered factors is required.

Recently, machine learning (ML) has enabled comprehensive modeling with a large number of variables. Moreover, deep learning (DL) models have facilitated the analytical processing of imaging-based data¹⁰, which can enable various applications. Several studies have investigated ML or DL models for POPF prediction established using perioperative clinical and computed tomography (CT)-based data^11,12,13. However, they were limited by small sample sizes or higher event rates. Most studies applied a specific ML model without comparing it with other ML models, and some did not include a comparison with conventional models. This study aimed to establish prediction models for POPF and clinically relevant POPF (CR-POPF) using either preoperative-only or perioperative data.

Results

Participant characteristics

In the study cohort of 1333 patients, the mean age was 63.4 years, the mean BMI was 23.7 kg/m², and 59.8% were men. The most common surgical indication was pancreatic ductal adenocarcinoma (PDAC; n = 531, 39.8%), followed by distal bile duct cancer (n = 291, 21.8%), ampullary cancer (n = 200, 15.0%), borderline malignant disease (n = 193, 14.5%), other benign disease (n = 65, 4.8%), and duodenal cancer (n = 53, 4.0%). The mean preoperatively measured pancreatic duct size was 3.8 mm, and 63.3% of the patients were classified intraoperatively as having a soft pancreas. The mean operative time was 330.9 min (Table 1, Fig. 1). The characteristics of participants in the training (n = 881) and test (n = 452) datasets are presented in Supplementary Table 1.

Table 1 Characteristics of the study population.

Associated clinical factors for POPF

POPF and CR-POPF were diagnosed in 555 (41.6%) and 86 (6.4%) participants, respectively. POPF of any grade occurred in 421 (47.8%) and 134 (31.8%) participants, whereas CR-POPF occurred in 59 (6.7%) and 27 (6.0%) participants, in the training and test datasets, respectively. All preoperative and perioperative clinical factors were included in the multivariable analysis. Compared with participants without CR-POPF, those with CR-POPF more frequently had non-PDAC etiology (HR 2.025, 95% CI 1.165–3.519, p = 0.012), a smaller pancreatic duct size (HR 0.841, 95% CI 0.721–0.980, p = 0.027), and male sex (HR 1.806, 95% CI 1.103–2.957, p = 0.019; Supplementary Table 2). The results of the univariate analyses are shown in Supplementary Tables 3 and 4.

Body composition factors for POPF

The univariate analyses of the association between the body composition characteristics and the occurrence of POPF and CR-POPF are shown in Supplementary Tables 5 and 6, respectively. Participants with POPF showed a higher visceral adipose tissue index (VATI; 43.2 vs. 37.8), subcutaneous adipose tissue index (SATI; 52.1 vs. 48.1), and skeletal muscle index (SMI; 48.4 vs. 46.2) than those without POPF. Myosteatosis was more frequent in patients without POPF than in those with POPF (24.8% vs. 19.6%). Similar trends were observed for CR-POPF: patients with CR-POPF had higher VATI (48.3 vs. 39.5), SATI (53.5 vs. 49.5), and SMI (48.1 vs. 47.1) than those without CR-POPF, and myosteatosis was more frequent in patients without CR-POPF than in those with CR-POPF (22.8% vs. 20.9%).

Preoperative prediction model for POPF and CR-POPF

In the five ML models established using preoperative clinical data, the most commonly selected factors for POPF occurrence were non-PDAC etiology, small pancreatic duct size, low glucose level, high hemoglobin, and high VATI (Fig. 2). Four DL models based on preoperative CT were developed, and gradient-guided class attention maps showed the areas on which the models focused (Fig. 3). The finally selected model was the soft voting-based ensemble model composed of two ML models (ANN and logistic regression) and one DL model (Inception Net). The AUCs of the ensemble model in the training, validation, and test datasets were 0.969, 0.779, and 0.750, respectively. Sensitivity and specificity are reported in Supplementary Table 7. The Roberts model was not included in the ensemble model. The ensemble model showed better predictive performance than the individual ML and DL models.

In the preoperative CR-POPF model, the ML models frequently selected non-PDAC etiology, high VATI, absence of diabetes, and smaller pancreatic duct size as important factors for predicting CR-POPF. The selected hard voting-based ensemble model comprised three ML models (ANN, TabNet, and random forest) and two DL models (ResNet and ResNeXt); the Roberts model was not included. The AUCs of the ensemble model in the training, validation, and test datasets were 0.936, 0.915, and 0.682, respectively, and the ensemble model showed better predictive performance than the individual ML and DL models (Table 2).

Table 2 Area under the curve values of prediction models for postoperative pancreatic fistula.

Comparison between the conventional and the developed models

The predictive performance of the Roberts model and that of the preoperative ensemble model were compared for preoperative prediction. For POPF, the preoperative ensemble model showed better performance (AUC, 0.750 vs. 0.637; p < 0.001); for CR-POPF, however, the predictive performance of the preoperative ensemble and Roberts models was comparable (AUC, 0.682 vs. 0.635; p = 0.42) (Table 2, Fig. 4).

Changing AUC pattern according to the CR-POPF incidence

A low proportion of CR-POPF cases could affect model development because of potential bias toward the majority class and a negative impact on the model’s ability to learn. In this study, the CR-POPF incidence was lower than that in other institutions; therefore, we investigated how model performance changed when the proportion of events relative to controls was adjusted from 6.5% to 50%. The preoperative ensemble model for CR-POPF showed stable AUCs regardless of the CR-POPF proportion, whereas the AUC of the Roberts model decreased at a CR-POPF proportion of approximately 30% (Fig. 5).
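The incidence adjustment can be illustrated with a short sketch. The text does not state how the ratio was changed, so random undersampling of controls is an assumption here; the function name `downsample_controls` is hypothetical, and the example counts (27 events, 425 controls) are borrowed from the reported test dataset purely for illustration.

```python
import random


def downsample_controls(labels, target_event_ratio, seed=0):
    """Keep every event and a random subset of controls so that events
    make up roughly `target_event_ratio` of the returned indices.

    Assumption: undersampling of controls; the study may have used a
    different resampling scheme.
    """
    if not 0 < target_event_ratio < 1:
        raise ValueError("target_event_ratio must be between 0 and 1")
    events = [i for i, y in enumerate(labels) if y == 1]
    controls = [i for i, y in enumerate(labels) if y == 0]
    # Number of controls needed so that events / (events + controls) = ratio.
    n_controls = round(len(events) * (1 - target_event_ratio) / target_event_ratio)
    n_controls = min(n_controls, len(controls))
    rng = random.Random(seed)
    kept_controls = rng.sample(controls, n_controls)
    return sorted(events + kept_controls)


# Illustrative labels: 1 = CR-POPF, 0 = no CR-POPF.
labels = [1] * 27 + [0] * 425
subset_30 = downsample_controls(labels, 0.30)
event_ratio_30 = sum(labels[i] for i in subset_30) / len(subset_30)
print(len(subset_30), round(event_ratio_30, 2))
```

Repeating the evaluation on subsets drawn at several target ratios would give a curve of AUC against CR-POPF proportion comparable in spirit to Fig. 5.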

Postoperative prediction models and the alternative fistula risk score

An all-inclusive prediction model was developed using preoperative, intraoperative, and postoperative variables. To predict POPF, the ML models selected non-PDAC etiology, soft pancreatic texture, high drain amylase level on postoperative day 1, and the absence of vascular resection as the top features. The AUCs of the ensemble model in the training, validation, and test datasets were 0.936, 0.832, and 0.787, respectively. The ensemble model showed a higher AUC than the alternative fistula risk score (a-FRS)³ in predicting POPF (0.787 vs. 0.696; p < 0.001). There was no difference in CR-POPF prediction performance between the comprehensive ensemble and a-FRS models (0.685 vs. 0.667; p = 0.59; Supplementary Table 8).

Discussion

In this study, we developed AI-based models for predicting all POPF and CR-POPF in a large sample of 1333 patients undergoing PD. The preoperative ensemble model for POPF outperformed the conventional model and the individual ML and DL models. The preoperative ensemble model for CR-POPF showed performance comparable to that of the Roberts model, but outperformed it after adjustment of the CR-POPF incidence. The postoperative ensemble model for POPF showed better predictive performance than the a-FRS model.

Previous studies reported that 10–40% of the patients who undergo PD experience CR-POPF². Many POPF prediction models have been published that include common factors such as small pancreatic duct, soft pancreatic texture, and high BMI⁹; they are commonly used at the bedside because they are simple, comprise only two or three variables, and show good performance.

However, recent studies have reported novel risk factors for POPF. Pathologic studies showed that a fatty pancreas is associated with POPF, whereas an atrophied and fibrotic pancreas has a protective role^4,8. Recent improvements in CT technology have also identified novel potential factors for predicting POPF: Shi et al. showed that a higher pancreatic parenchymal-to-portal venous iodine concentration ratio measured on dual-energy CT was associated with less histologic fibrosis and a greater risk of POPF¹⁴. Moreover, anthropomorphic studies showed that CT-based body composition data may help predict adverse postoperative outcomes, such as POPF and poor survival^15,16,17,18,19,20. Prior studies^16,17 have consistently suggested an impact of high visceral obesity on POPF incidence, whereas the impact of skeletal muscle mass on POPF incidence remains controversial: several studies^17,18 have shown a protective effect, whereas others^15,19,20 have failed to identify such an effect. Studies reporting the impact of myosteatosis are limited; a study with 139 participants¹⁵ showed that patients with lower SMD more frequently developed CR-POPF than those with higher SMD. In our study, high VATI was associated with the incidence of both all POPF and CR-POPF, which matches the results of prior studies.

The diversification of the pancreatic surgery environment has increasingly necessitated the development of a comprehensive prediction model that includes various factors. A recent meta-analysis revealed other POPF risk factors, including male sex, blood transfusion, vascular resection, and neoadjuvant chemotherapy⁹. In this study, non-PDAC etiology, small pancreatic duct size, low glucose level, high hemoglobin, and high VATI were risk factors for POPF occurrence in the representative ML models. The pancreatic parenchyma in patients with non-PDAC etiology is characterized by soft and abundant tissue, typically with a non-dilated pancreatic duct, which is an iconic risk factor. High VATI, indicative of high visceral obesity, is also a recognized risk factor. Most of these risk factors align with findings from previous studies^9,18. Two factors may be confusing. A high hemoglobin level does not seem to be a standalone risk factor; it could be associated with male sex and high visceral fat, both of which are known risk factors^4,9. A low preoperative glucose level may be related to a soft and fatty pancreas, as shown in a previous meta-analysis⁹.

An important feature of ML and DL models is the integration of big data, and AI is a suitable tool for this research task. Several studies have evaluated ML and DL prediction models. Kambakamba et al. reported an ML model developed using texture analysis data from 110 patients, comprising matched POPF and non-POPF groups of 55 patients each, and showed that ML-based texture analysis could predict fibrotic change of the pancreatic parenchyma (AUC, 0.84) and POPF (AUC, 0.95)²¹. The authors adjusted the control group sample for efficient training of the AI model; however, in real-world practice, the incidence of CR-POPF is lower than in that experimental setting. Han et al. reported an ML model using 38 clinical variables from 1769 patients, in which the CR-POPF incidence was 12.5% and the AUC of the ML model was 0.74²². Shen et al. reported various ML models using clinical and radiomics data from 2421 patients, in which the CR-POPF rate was 12.5% and the ML model had an AUC of 0.83²³. Mu et al. developed a DL model using CT-based data from 583 patients with a CR-POPF rate of 13.6% (AUC 0.85, with better performance than the FRS)¹³. Recently, ML-based models using preoperative factors have been reported. Ganjouei et al. developed an ML model (AUC 0.72) based on clinical factors that was suitable for quick use. They selected the XGBoost model among several ML models; however, a comparison with prior models was not performed²⁴. Other studies reported ML models based on preoperative clinical factors and radiologic data from CT scans. However, they included a small number of patients, which carries a risk of overfitting, and the radiologic data were processed manually^25,26.

In this study, ML, DL, and ensemble models were each applied for a specific purpose. As in previous studies, ML models were suitable for modeling a collection of various clinical and body composition data. The DL models were developed using raw CT data, without pancreatic segmentation or complex manual feature-selection processes, whereas previous studies extracted radiomic data for texture analysis, a labor-intensive task that requires substantial human resources^21,27. Preoperative and comprehensive prediction models are provided so that a suitable model can be used in various clinical settings.

Moreover, we provided POPF and CR-POPF models separately because the CR-POPF rate was 6.4%, which is lower than that in the published data. Class imbalance could have affected the model’s learning capacity, and several solutions have been introduced, such as semi-supervised learning, data augmentation, resampling, and ensemble modeling^28,29,30. In this study, an ensemble method was applied after individual ML modeling. Furthermore, we adjusted the ratio of CR-POPF and found that the performance of the preoperative ensemble model for CR-POPF was stable when the ratio of CR-POPF increased to less than 20%, and the model consistently outperformed the Roberts model, except at a CR-POPF incidence of 20–25%. The Roberts model showed decreased performance at high CR-POPF incidence (> 30%), indicating that BMI and pancreatic duct size were insufficient risk factors at high CR-POPF incidence. In contrast, the ensemble model demonstrated consistent performance across diverse CR-POPF rates owing to its incorporation of various risk factors during the modeling process.

However, our comprehensive model showed predictive performance similar to that of the conventional a-FRS model. The comprehensive CR-POPF model comprised logistic regression, ResNeXt, and the a-FRS model. A crucial portion of the comprehensive CR-POPF model may already have been covered by the a-FRS model, which includes pancreatic texture, pancreatic duct size, and BMI, all well-known CR-POPF risk factors. The additional logistic regression ML model included risk factors such as high drain amylase on POD1, high VATI, and absence of diabetes; however, these factors did not provide incremental value for the final model. Therefore, the predictive performances of the ensemble and a-FRS models were comparable.

This study has several limitations. The decision to use three years of input data was driven by the availability of well-structured input data, an adequate number of patients for model establishment, and a recent decrease in the incidence of CR-POPF. The model development and validation processes were performed using data from a single center. There may be discrepancies in postoperative management because multiple surgeons participated in this study; however, we standardized the critical pathway after surgery and unified the surgical procedures to minimize such discrepancies. Stringent internal validation was performed because data for external validation were unavailable. The CR-POPF incidence was lower than that in other centers, which may be related to the unified procedures based on cumulative experience and the high volume of surgeries³¹. This low incidence may hinder determination of the statistical significance of several factors, although we minimized this shortcoming by performing temporal validation. Despite the abundance of samples and the use of an ensemble model, overfitting remains a potential limitation given the division into multiple datasets and the use of ML models.

The preoperative ensemble model for POPF provides better predictive performance than the conventional model in preoperative clinical settings. Furthermore, the developed ensemble model showed stable performance for predicting postoperative pancreatic fistula compared with the prior model regardless of the incidence of CR-POPF. This preoperative model could be useful for identifying high-risk patients in clinical studies of pancreatectomy and could help clinicians decide on immediate postoperative management in any suspicious situation.

Methods

Study population

This study was reported in line with the STROBE and STROCSS³² criteria. The Institutional Review Board of Asan Medical Center approved the experimental protocol of this retrospective study and waived the need for informed consent (IRB No: 2021-0559). All methods were performed in accordance with good clinical practice guidelines and adhered to the principles outlined in the Declaration of Helsinki. The study was registered at cris.nih.go.kr (KCT0008156). Patients who underwent pancreaticoduodenectomy (PD) for periampullary diseases from 2016 to 2018 were enrolled. Exclusion criteria were: (a) incomplete details according to risk scores for POPF (Roberts model⁷); (b) absence of contrast-enhanced CT images during the 30 days before surgery; and (c) suboptimal CT quality due to severe artifact. Among the 1333 participants, data of those who underwent surgery from 2016 to 2017 (881 patients) and in 2018 (452 patients) were used as the training and test datasets, respectively (Fig. 1).
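The temporal split described above can be sketched in a few lines. This is only an illustration of splitting by surgery year; the `Patient` record, its field names, and the function `temporal_split` are hypothetical and are not taken from the study code.

```python
from dataclasses import dataclass


@dataclass
class Patient:
    patient_id: str    # hypothetical identifier
    surgery_year: int  # calendar year of pancreaticoduodenectomy


def temporal_split(patients, test_year=2018):
    """Assign patients operated on before `test_year` to the training set
    and those operated on in `test_year` to the test set."""
    train = [p for p in patients if p.surgery_year < test_year]
    test = [p for p in patients if p.surgery_year == test_year]
    return train, test


# Toy cohort for illustration only.
cohort = [
    Patient("a", 2016),
    Patient("b", 2017),
    Patient("c", 2018),
    Patient("d", 2016),
]
train_set, test_set = temporal_split(cohort)
print([p.patient_id for p in train_set], [p.patient_id for p in test_set])
```

Splitting by calendar period rather than at random keeps later patients entirely unseen during training, which is the idea behind temporal validation.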

Study endpoints

The primary endpoints were the occurrence of all POPF and CR-POPF. POPF was defined according to the International Study Group in Pancreatic Surgery definition³³, and grades B and C POPF were classified as CR-POPF.

Data collection

Data on patients’ demographics, pre- and perioperative clinical data, preoperative CT images, intraoperative findings, and pathologic diagnosis were collected. Various CT scanners and image acquisition techniques were used. Details of CT acquisition are provided in the Supplementary Method and Supplementary Table 9; portal venous phase (PVP) CT images were used in the analysis. For body composition assessment, a single axial CT image at the level of the lower endplate of the 3rd lumbar vertebra was used^34,35 to measure the cross-sectional areas of total abdominal wall muscle, subcutaneous adipose tissue, and visceral adipose tissue with artificial intelligence software (AID-UTM, iAID inc, Seoul, Republic of Korea)³⁶. The body composition parameters were normalized by dividing by the height squared (cm²/m²) and then reported as indices, including SMI, SATI, and VATI. Skeletal muscle density (SMD), which represents the degree of myosteatosis, was quantified as the mean HU of the skeletal muscle area (cutoff: 41 and 33 HU for non-overweight and overweight patients, respectively)³⁷. Details of body composition analysis are provided in the Supplementary Method.
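The index normalization and the SMD cutoffs can be written out as a minimal sketch. Two points are assumptions that the text does not state: that overweight means BMI of at least 25 kg/m², and that an SMD below the cutoff is taken as myosteatosis. The function names are hypothetical.

```python
def body_composition_index(area_cm2: float, height_m: float) -> float:
    """Normalize a cross-sectional area (cm²) by height squared (m²).

    Applies equally to SMI, SATI, and VATI, which differ only in the
    tissue area supplied.
    """
    if height_m <= 0:
        raise ValueError("height_m must be positive")
    return area_cm2 / height_m ** 2


def has_myosteatosis(mean_smd_hu: float, bmi: float,
                     overweight_bmi: float = 25.0) -> bool:
    """Classify myosteatosis from the mean skeletal muscle density (HU).

    Cutoffs from the text: 41 HU (non-overweight) and 33 HU (overweight).
    Assumptions: overweight is BMI >= `overweight_bmi`, and values below
    the cutoff indicate myosteatosis.
    """
    cutoff = 33.0 if bmi >= overweight_bmi else 41.0
    return mean_smd_hu < cutoff


# Illustrative values only, not patient data.
smi = body_composition_index(area_cm2=150.0, height_m=1.70)
print(round(smi, 1))
print(has_myosteatosis(mean_smd_hu=35.0, bmi=23.0))
print(has_myosteatosis(mean_smd_hu=35.0, bmi=27.0))
```

The same SMD of 35 HU is classified differently in the two calls because the cutoff depends on the BMI group.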

Surgical techniques and postoperative care

All surgical procedures were performed by experienced pancreatic surgeons using described operative procedures. Briefly, the pancreas was divided at the left side of the superior mesenteric vein, and pancreatic texture and pancreatic duct size were assessed intraoperatively by the attending surgeon. After Roux limb formation, end-to-side pancreaticojejunostomy (PJ) was performed. Non-absorbable monofilament was used for outer-layer anastomosis with interrupted or continuous sutures. All surgeons performed duct-to-mucosa PJ anastomosis with an internal plastic stent, which was selected according to the size of the pancreatic duct. At surgery completion, two or three drains were placed adjacent to the PJ anastomosis and on the right side of the superior mesenteric arterial resection margin.

Postoperatively, serum and peripancreatic drain fluid amylase levels were routinely measured on postoperative days 1, 3, and 5; a contrast-enhanced CT scan was performed on day 5 to detect any complications. The peripancreatic drains were removed on postoperative days 3–5 if there was no evidence of leakage. In cases with biochemical leakage, no additional treatment was performed. In cases of leakage or suspected infective complications, the peripancreatic drains were left in situ, and antibiotics were administered at the discretion of the attending physician. Percutaneous or endoscopic drainage was performed according to the location of the fluid collection. Reoperation was performed in patients with uncontrolled infection or unstable vital signs despite proper drainage and antibiotic use.

Model development for POPF prediction

A schematic representation of the developed models is provided in Fig. 1, and details of model development are provided in the Supplementary Method. Using the training dataset, we developed models for predicting POPF and CR-POPF using preoperative data. We developed five ML models (artificial neural network [ANN], tabular network [TabNet], logistic regression, random forest, and gradient boosting) utilizing the clinical information and body composition data. To train the DL models, the training dataset was divided into training and validation subsets, and four DL models (ResNet, DenseNet, ResNeXt, and Inception Net) were created utilizing preoperative CT data. Ensemble voting was used to combine the developed ML models, DL models, and the prior model (Roberts model⁷) with soft or hard voting, and the model with the highest accuracy in the validation subset was chosen. Finally, a single preoperative comprehensive model was selected, and its predictive performance was evaluated using the separate test dataset. The code used in this work is available in the GitHub repository (https://github.com/nolife119/POPF_ensemble).
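The difference between soft and hard voting can be shown with a minimal sketch. It assumes each model outputs a predicted probability of POPF per patient and that a 0.5 threshold is used; the function names, threshold, and toy probabilities are illustrative and are not taken from the study repository.

```python
def soft_vote(model_probs, threshold=0.5):
    """Soft voting: average the predicted probabilities across models,
    then threshold the mean to obtain a label per patient."""
    n_models = len(model_probs)
    n_patients = len(model_probs[0])
    mean_probs = [
        sum(probs[i] for probs in model_probs) / n_models
        for i in range(n_patients)
    ]
    labels = [1 if p >= threshold else 0 for p in mean_probs]
    return mean_probs, labels


def hard_vote(model_probs, threshold=0.5):
    """Hard voting: threshold each model separately, then take the
    majority label per patient."""
    n_models = len(model_probs)
    n_patients = len(model_probs[0])
    labels = []
    for i in range(n_patients):
        votes = sum(1 for probs in model_probs if probs[i] >= threshold)
        labels.append(1 if votes * 2 > n_models else 0)
    return labels


# Toy probabilities from three hypothetical models for two patients.
model_a = [0.90, 0.45]
model_b = [0.40, 0.45]
model_c = [0.40, 0.90]
models = [model_a, model_b, model_c]

mean_probs, soft_labels = soft_vote(models)
hard_labels = hard_vote(models)
print(soft_labels, hard_labels)
```

In this toy case one confident model pulls the averaged probability above the threshold, while the majority of individual votes stays negative, so the two schemes disagree. That is why the choice between them is worth making on a validation subset.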

Statistical analysis

The sample size was calculated based on the area under the receiver operating characteristic curve (AUC). Based on previous studies, we hypothesized that the AUC would be 0.750. The proportion of the sample with POPF was 6–7%. The two-sided significance level (α) was set at 5%, and the statistical power (1 − β) was set at 95%. The final number of subjects required for this study was 782. The chi-square test was used to compare categorical data, and the independent t-test was used to compare continuous data. Binary logistic regression analyses were used to evaluate the association between the variables and the occurrence of POPF and CR-POPF. The predictive performance of the selected models was assessed by receiver operating characteristic (ROC) curve analysis, and the AUC with its confidence interval (CI) was calculated. The sensitivity, specificity, and F1 score were obtained at each model’s cutoff value showing the highest accuracy in the validation subset. Analyses were performed using SAS version 9.4 (SAS Institute, Cary, NC, USA). A two-sided p < 0.05 was considered statistically significant.
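The AUC and the accuracy-based cutoff can be computed as in the sketch below. The study used SAS and does not state how the confidence interval was obtained, so the percentile bootstrap is an assumption here; all function names and the toy data are illustrative.

```python
import random


def auc_score(labels, scores):
    """AUC as the probability that a random positive case scores higher
    than a random negative case (ties count as one half)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("both classes are required")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def bootstrap_auc_ci(labels, scores, n_boot=500, alpha=0.05, seed=0):
    """Percentile bootstrap CI for the AUC (an assumed method)."""
    rng = random.Random(seed)
    n = len(labels)
    aucs = []
    while len(aucs) < n_boot:
        idx = [rng.randrange(n) for _ in range(n)]
        ys = [labels[i] for i in idx]
        if 0 < sum(ys) < n:  # skip resamples containing a single class
            aucs.append(auc_score(ys, [scores[i] for i in idx]))
    aucs.sort()
    lower = aucs[int(alpha / 2 * n_boot)]
    upper = aucs[min(n_boot - 1, int((1 - alpha / 2) * n_boot))]
    return lower, upper


def best_accuracy_cutoff(labels, scores):
    """Return the score cutoff (predict 1 if score >= cutoff) with the
    highest accuracy, keeping the lowest cutoff on ties."""
    best_cut, best_acc = None, -1.0
    for cut in sorted(set(scores)):
        preds = [1 if s >= cut else 0 for s in scores]
        acc = sum(p == y for p, y in zip(preds, labels)) / len(labels)
        if acc > best_acc:
            best_cut, best_acc = cut, acc
    return best_cut, best_acc


# Toy validation data, not study data.
labels = [0, 0, 1, 0, 1, 1]
scores = [0.10, 0.30, 0.35, 0.40, 0.80, 0.90]

auc = auc_score(labels, scores)
ci_low, ci_high = bootstrap_auc_ci(labels, scores)
cutoff, accuracy = best_accuracy_cutoff(labels, scores)
print(round(auc, 3), cutoff, round(accuracy, 3))
```

In practice the cutoff would be chosen on the validation subset and then applied unchanged to the test dataset, so that sensitivity, specificity, and F1 score are not tuned on the data used for final evaluation.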

Data availability

The code used in this work is available in the GitHub repository: https://github.com/nolife119/POPF_ensemble.

References

Smits, F. J. et al. Management of severe pancreatic fistula after pancreatoduodenectomy. JAMA Surg. 152, 540–548. https://doi.org/10.1001/jamasurg.2016.5708 (2017).
van Dongen, J. C. et al. Fistula risk score for auditing pancreatoduodenectomy: The auditing FRS. Ann. Surg. https://doi.org/10.1097/SLA.0000000000005532 (2022).
Mungroop, T. H. et al. Alternative fistula risk score for pancreatoduodenectomy (a-FRS): Design and international external validation. Ann. Surg. 269, 937–943. https://doi.org/10.1097/sla.0000000000002620 (2019).
Gaujoux, S. et al. Fatty pancreas and increased body mass index are risk factors of pancreatic fistula after pancreaticoduodenectomy. Surgery 148, 15–23. https://doi.org/10.1016/j.surg.2009.12.005 (2010).
Yamamoto, Y. et al. A preoperative predictive scoring system for postoperative pancreatic fistula after pancreaticoduodenectomy. World J. Surg. 35, 2747–2755. https://doi.org/10.1007/s00268-011-1253-x (2011).
Callery, M. P., Pratt, W. B., Kent, T. S., Chaikof, E. L. & Vollmer, C. M. Jr. A prospectively validated clinical risk score accurately predicts pancreatic fistula after pancreatoduodenectomy. J. Am. Coll. Surg. 216, 1–14. https://doi.org/10.1016/j.jamcollsurg.2012.09.002 (2013).
Roberts, K. J. et al. A preoperative predictive score of pancreatic fistula following pancreatoduodenectomy. HPB (Oxford) 16, 620–628. https://doi.org/10.1111/hpb.12186 (2014).
Box, E. W. et al. Preoperative anthropomorphic radiographic measurements can predict postoperative pancreatic fistula formation following pancreatoduodenectomy. Am. J. Surg. 222, 133–138. https://doi.org/10.1016/j.amjsurg.2020.10.023 (2021).
Zhang, B. et al. Risk factors of clinically relevant postoperative pancreatic fistula after pancreaticoduodenectomy: A systematic review and meta-analysis. Medicine (Baltimore) 101, e29757. https://doi.org/10.1097/MD.0000000000029757 (2022).
Gulshan, V. et al. Development and validation of a deep learning algorithm for detection of diabetic retinopathy in retinal fundus photographs. JAMA 316, 2402–2410. https://doi.org/10.1001/jama.2016.17216 (2016).
Chen, J. Y. et al. Risk scoring system and predictor for clinically relevant pancreatic fistula after pancreaticoduodenectomy. World J. Gastroenterol. 21, 5926–5933. https://doi.org/10.3748/wjg.v21.i19.5926 (2015).
Skawran, S. M. et al. Can magnetic resonance imaging radiomics of the pancreas predict postoperative pancreatic fistula?. Eur. J. Radiol. 140, 109733. https://doi.org/10.1016/j.ejrad.2021.109733 (2021).
Mu, W. et al. Prediction of clinically relevant pancreatico-enteric anastomotic fistulas after pancreatoduodenectomy using deep learning of preoperative computed tomography. Theranostics 10, 9779–9788. https://doi.org/10.7150/thno.49671 (2020).
Shi, H. Y. et al. Dual-energy CT iodine concentration to evaluate postoperative pancreatic fistula after pancreatoduodenectomy. Radiology 304, 65–72. https://doi.org/10.1148/radiol.212173 (2022).
Linder, N. et al. Power of computed-tomography-defined sarcopenia for prediction of morbidity after pancreaticoduodenectomy. BMC Med. Imaging 19, 32. https://doi.org/10.1186/s12880-019-0332-6 (2019).
Jang, M. et al. Predictive value of sarcopenia and visceral obesity for postoperative pancreatic fistula after pancreaticoduodenectomy analyzed on clinically acquired CT and MRI. Eur. Radiol. 29, 2417–2425. https://doi.org/10.1007/s00330-018-5790-7 (2019).
Pecorelli, N. et al. Effect of sarcopenia and visceral obesity on mortality and pancreatic fistula following pancreatic cancer surgery. Br. J. Surg. 103, 434–442. https://doi.org/10.1002/bjs.10063 (2016).
Nishida, Y. et al. Preoperative sarcopenia strongly influences the risk of postoperative pancreatic fistula formation after pancreaticoduodenectomy. J. Gastrointest. Surg. 20, 1586–1594. https://doi.org/10.1007/s11605-016-3146-7 (2016).
Van Rijssen, L. B. et al. Skeletal muscle quality is associated with worse survival after pancreatoduodenectomy for periampullary, nonpancreatic cancer. Ann. Surg. Oncol. 24, 272–280. https://doi.org/10.1245/s10434-016-5495-6 (2017).
Pierobon, E. S. et al. The prognostic value of low muscle mass in pancreatic cancer patients: A systematic review and meta-analysis. J. Clin. Med. https://doi.org/10.3390/jcm10143033 (2021).
Kambakamba, P. et al. The potential of machine learning to predict postoperative pancreatic fistula based on preoperative, non-contrast-enhanced CT: A proof-of-principle study. Surgery 167, 448–454. https://doi.org/10.1016/j.surg.2019.09.019 (2020).
Han, I. W. et al. Risk prediction platform for pancreatic fistula after pancreatoduodenectomy using artificial intelligence. World J. Gastroenterol. 26, 4453–4464. https://doi.org/10.3748/wjg.v26.i30.4453 (2020).
Shen, Z. et al. Machine learning algorithms as early diagnostic tools for pancreatic fistula following pancreaticoduodenectomy and guide drain removal: A retrospective cohort study. Int. J. Surg. 102, 106638. https://doi.org/10.1016/j.ijsu.2022.106638 (2022).
Article PubMed Google Scholar
Ashraf Ganjouei, A. et al. A machine learning approach to predict postoperative pancreatic fistula after pancreaticoduodenectomy using only preoperatively known data. Ann. Surg. Oncol. 30, 7738–7747. https://doi.org/10.1245/s10434-023-14041-x (2023).
Article PubMed Google Scholar
Matsui, H. et al. A novel prediction model of pancreatic fistula after pancreaticoduodenectomy using only preoperative markers. BMC Surg. 23, 310. https://doi.org/10.1186/s12893-023-02213-1 (2023).
Article PubMed PubMed Central Google Scholar
Capretti, G. et al. A machine learning risk model based on preoperative computed tomography scan to predict postoperative outcomes after pancreatoduodenectomy. Updates Surg. 74, 235–243. https://doi.org/10.1007/s13304-021-01174-5 (2022).
Article PubMed Google Scholar
Bhasker, N. et al. Prediction of clinically relevant postoperative pancreatic fistula using radiomic features and preoperative data. Sci. Rep. 13, 7506. https://doi.org/10.1038/s41598-023-34168-x (2023).
Article ADS CAS PubMed PubMed Central Google Scholar
Boo, Y. & Choi, Y. Comparison of mortality prediction models for road traffic accidents: An ensemble technique for imbalanced data. BMC Public Health 22, 1476. https://doi.org/10.1186/s12889-022-13719-3 (2022).
Article PubMed PubMed Central Google Scholar
Bugnon, L. A., Yones, C., Milone, D. H. & Stegmayer, G. Deep neural architectures for highly imbalanced data in bioinformatics. IEEE Trans. Neural Netw. Learn Syst. 31, 2857–2867. https://doi.org/10.1109/TNNLS.2019.2914471 (2020).
Article PubMed Google Scholar
Chen, Z., Duan, J., Kang, L. & Qiu, G. Class-imbalanced deep learning via a class-balanced ensemble. IEEE Trans. Neural Netw. Learn Syst. 33, 5626–5640. https://doi.org/10.1109/TNNLS.2021.3071122 (2022).
Article PubMed Google Scholar
Shin, S. H. et al. Chronologic changes in clinical and survival features of pancreatic ductal adenocarcinoma since 2000: A single-center experience with 2,029 patients. Surgery 164, 432–442. https://doi.org/10.1016/j.surg.2018.04.017 (2018).
Article PubMed Google Scholar
Mathew, G., Agha, R., STROCSS Group. STROCSS 2021: Strengthening the reporting of cohort, cross-sectional and case-control studies in surgery. Int. J. Surg. 96, 106165. https://doi.org/10.1016/j.ijsu.2021.106165 (2021).
Article PubMed Google Scholar
Bassi, C. et al. The 2016 update of the International Study Group (ISGPS) definition and grading of postoperative pancreatic fistula: 11 years after. Surgery 161, 584–591. https://doi.org/10.1016/j.surg.2016.11.014 (2017).
Article PubMed Google Scholar
Kazemi-Bajestani, S. M., Mazurak, V. C. & Baracos, V. Computed tomography-defined muscle and fat wasting are associated with cancer clinical outcomes. Semin. Cell Dev. Biol. 54, 2–10. https://doi.org/10.1016/j.semcdb.2015.09.001 (2016).
Article PubMed Google Scholar
Tewari, N., Awad, S., Macdonald, I. A. & Lobo, D. N. A comparison of three methods to assess body composition. Nutrition 47, 1–5. https://doi.org/10.1016/j.nut.2017.09.005 (2018).
Article PubMed Google Scholar
Prado, C. M. et al. Prevalence and clinical implications of sarcopenic obesity in patients with solid tumours of the respiratory and gastrointestinal tracts: A population-based study. Lancet Oncol. 9, 629–635. https://doi.org/10.1016/s1470-2045(08)70153-0 (2008).
Article PubMed Google Scholar
Martin, L. et al. Cancer cachexia in the age of obesity: Skeletal muscle depletion is a powerful prognostic factor, independent of body mass index. J. Clin. Oncol. 31, 1539–1547. https://doi.org/10.1200/jco.2012.45.2722 (2013).
Article PubMed Google Scholar

Download references

Funding

This study was supported by grants from the Korea Health Technology R&D Project through the Korea Health Industry Development Institute (KHIDI), funded by the Ministry of Health and Welfare, Republic of Korea (HI18C2383) and the National Research Foundation of Korea (NRF), funded by the Korean government (MSIT No. 2021R1C1C1010138).

Author information

These authors contributed equally: Woohyung Lee and Hyo Jung Park.
These authors jointly supervised this work: Song Cheol Kim and Kyung Won Kim.

Authors and Affiliations

Division of Hepatobiliary and Pancreatic Surgery, Department of Surgery, Asan Medical Center, Brain Korea21 Project, University of Ulsan College of Medicine, 88, Olympic-ro 43-gil, Songpa-gu, Seoul, 05505, Republic of Korea
Woohyung Lee, Hack-Jin Lee, Ki Byung Song, Dae Wook Hwang, Jae Hoon Lee, Kyongmook Lim & Song Cheol Kim
Department of Radiology and Research Institute of Radiology, Asan Medical Center, University of Ulsan College of Medicine, 88, Olympic-ro 43-gil, Songpa-gu, Seoul, 05505, Republic of Korea
Hyo Jung Park, Hyoung Jung Kim & Kyung Won Kim
R&D Team, DoAI Inc., Seongnam-si, Gyeonggi-do, Republic of Korea
Hack-Jin Lee & Kyongmook Lim
Department of Convergence Medicine and Radiology, Research Institute of Radiology and Institute of Biomedical Engineering, Asan Medical Center, University of Ulsan College of Medicine, Seoul, Republic of Korea
Yousun Ko

Contributions

Concept and design: S.C.K., H.J.K., and W.H.L. Acquisition of data: K.B.S., D.W.H., and J.H.L. Analysis or interpretation of data: H.J.P., Y.K., H.J.L., and K.M.L. Drafting of the manuscript: W.H.L., H.J.P., and H.J.L. Critical revision of the manuscript for important intellectual content: all authors.

Corresponding authors

Correspondence to Kyung Won Kim or Song Cheol Kim.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Lee, W., Park, H.J., Lee, HJ. et al. Deep learning-based prediction of post-pancreaticoduodenectomy pancreatic fistula. Sci Rep 14, 5089 (2024). https://doi.org/10.1038/s41598-024-51777-2

Received: 04 July 2023
Accepted: 09 January 2024
Published: 01 March 2024
DOI: https://doi.org/10.1038/s41598-024-51777-2
