A machine-learning approach to human ex vivo lung perfusion predicts transplantation outcomes and promotes organ utilization

Sage, Andrew T.; Donahoe, Laura L.; Shamandy, Alaa A.; Mousavi, S. Hossein; Chao, Bonnie T.; Zhou, Xuanzi; Valero, Jerome; Balachandran, Sharaniyaa; Ali, Aadil; Martinu, Tereza; Tomlinson, George; Del Sorbo, Lorenzo; Yeung, Jonathan C.; Liu, Mingyao; Cypel, Marcelo; Wang, Bo; Keshavjee, Shaf

doi:10.1038/s41467-023-40468-7

Download PDF

Article
Open access
Published: 09 August 2023

A machine-learning approach to human ex vivo lung perfusion predicts transplantation outcomes and promotes organ utilization

Andrew T. Sage ORCID: orcid.org/0000-0003-2517-9062^1,2,3,4,
Laura L. Donahoe^2,3,
Alaa A. Shamandy^5,6,
S. Hossein Mousavi ORCID: orcid.org/0000-0002-4891-3138^1,2,
Bonnie T. Chao^1,2,
Xuanzi Zhou^1,2,
Jerome Valero^1,2,
Sharaniyaa Balachandran²,
Aadil Ali^1,2,
Tereza Martinu^1,2,4,
George Tomlinson⁷,
Lorenzo Del Sorbo^1,8,
Jonathan C. Yeung ORCID: orcid.org/0000-0001-5759-3028^1,2,3,
Mingyao Liu^1,2,3,4,
Marcelo Cypel ORCID: orcid.org/0000-0001-5652-1938^1,2,3,4,
Bo Wang^5,6,9,10^na1 &
…
Shaf Keshavjee ORCID: orcid.org/0000-0003-4547-8094^1,2,3,4^na1

Nature Communications volume 14, Article number: 4810 (2023) Cite this article

4079 Accesses
4 Citations
55 Altmetric
Metrics details

Subjects

Abstract

Ex vivo lung perfusion (EVLP) is a data-intensive platform used for the assessment of isolated lungs outside the body for transplantation; however, the integration of artificial intelligence to rapidly interpret the large constellation of clinical data generated during ex vivo assessment remains an unmet need. We developed a machine-learning model, termed InsighTx, to predict post-transplant outcomes using n = 725 EVLP cases. InsighTx model AUROC (area under the receiver operating characteristic curve) was 79 ± 3%, 75 ± 4%, and 85 ± 3% in training and independent test datasets, respectively. Excellent performance was observed in predicting unsuitable lungs for transplantation (AUROC: 90 ± 4%) and transplants with good outcomes (AUROC: 80 ± 4%). In a retrospective and blinded implementation study by EVLP specialists at our institution, InsighTx increased the likelihood of transplanting suitable donor lungs [odds ratio=13; 95% CI:4-45] and decreased the likelihood of transplanting unsuitable donor lungs [odds ratio=0.4; 95%CI:0.16–0.98]. Herein, we provide strong rationale for the adoption of machine-learning algorithms to optimize EVLP assessments and show that InsighTx could potentially lead to a safe increase in transplantation rates.

Human ex vivo lung perfusion: a novel model to study human lung diseases

Article Open access 12 January 2021

Pulmonary function as a continuum of risk: critical care utilization and survival after allogeneic hematopoietic stem cell transplantation - a multicenter cohort study

Article 16 March 2024

Implementation of an experimental isolated lung perfusion model on surgically resected human lobes

Article Open access 21 August 2019

Introduction

Precision medicine for isolated organs has been enabled by the development of ex vivo perfusion systems for the lung^1,2,3,4,5, liver^6,7, heart^8,9, kidney^10,11,12, and pancreas¹³. For surgeons, these platforms represent a pragmatic approach to assess the suitability of marginal (non standard) donor organs for transplantation^14,15. Ex vivo lung perfusion (EVLP) is an established ex vivo assessment technology that aids in the recovery of donor lungs that otherwise would have been discarded^1,2,3,4,5,16, providing a critical source of viable lungs for patients in need of a transplant. While global lung transplant volumes have increased with EVLP integration, they are still significantly outpaced by the number of people added to the waitlist each year—a problem compounded by the recent pandemic¹⁷. Although use of EVLP is a possible solution to the organ shortage problem¹⁸, it is limited by the lack of standardized acceptance criteria regarding when to use an organ for transplant^19,20. Moreover, EVLP decision-making is largely subjective and involves many measurements performed during ex vivo perfusion which can be daunting for inexperienced EVLP programs^19,20.

During EVLP, lungs are maintained in a normothermic (37 °C) environment, perfused with an acellular perfusate solution, and ventilated using an ICU-grade lung protective ventilator^1,2,3,4,5. At present, lung monitoring includes physiological (i.e., gas exchange, compliance, airway pressure), biochemical (i.e., glucose and lactate levels, pH, acid-base chemistry), imaging (i.e., radiographic images, bronchoscopy), and biological measurements (i.e., cytokines and chemokines)^1,2,3,4,5. In a previous study, we developed the ‘Toronto Lung Score’ based on interleukin-6 (IL-6) and IL-8 protein levels in EVLP perfusate, and used it to profile lung inflammation²¹. Additional studies have demonstrated the association between the severity of specific evaluation parameters during EVLP and patient outcomes;^22,23,24,25 however, these studies failed to holistically evaluate the breadth of potential data derived from EVLP.

While artificial intelligence (AI) and machine learning (ML) have had a significant impact on clinical decision-making in other areas of medicine, they have not yet been thoroughly investigated for use during ex vivo organ perfusion. EVLP is particularly well-suited for ML approaches because the ex vivo data are: (i) restricted to an isolated organ and free of confounding signals from other organ systems; (ii) collected longitudinally for several hours, providing a potential trajectory of improvement or deterioration in organ quality, and (iii) derived from numerous different monitoring systems generating a high volume of data. However, evidence that an AI-guided approach to EVLP decision-making could meaningfully impact organ utilization and post-transplant outcomes has not been demonstrated to date.

To develop a comprehensive approach to surgical decision-making by leveraging organ assessment data generated during EVLP, we evaluated eXtreme Gradient Boosting (XGBoost)²⁶, a decision-tree based ML technique, using clinical EVLP data collected in our center over the past decade. Our ML model, termed InsighTx, uses donor features and all possible assessments made during EVLP to predict suitable lungs for transplantation and patient outcome—the duration of post-transplant mechanical ventilation for the recipient. Important recipient features were then added to donor lung predictions using the InsighTx model to demonstrate an approach that personalizes transplant predictions. We further investigated whether the InsighTx model would impact clinical decision-making during EVLP in a retrospective, real-world evaluation study. This paper summarizes the development of the InsighTx algorithm using the largest collection of clinical EVLP data to date (Fig. 1), and provides evidence that an AI-guided approach could potentially lead to a safe increase in the number of transplants performed following ex vivo assessment.

Results

EVLP cohort characteristics

From 2008 to 2022, there were a total of n = 725 eligible clinical EVLP cases that were included in InsighTx model development and validation. There were n = 504 EVLP cases performed from 2008 to November 2019 that were used as a development dataset. Consecutive EVLP cases conducted between December 2019 to December 2020 (n = 97) and December 2020 to August 2022 (n = 124) were used as validation cohorts 1 and 2 respectively (Table 1). There were no significant differences in donor age, sex, BMI or type (Table 1); however, the proportion of donation after circulatory death (DCD) compared to donation after brain death (DBD) donors increased in the validation cohorts; median warm ischemic time was 65 min [IQR: 50–80 min]. Transplant rates and post-transplant outcomes significantly varied (Table 1). The rate of transplantation following EVLP was the highest in Test Dataset 1 (66%) and lowest in Test Dataset 2 (49%). While the incidence of Primary Graph Dysfunction (PGD) Grade 3 at 72 h was consistent in this study, we observed that the proportion of patients extubated in less than 72 h was highest in Test Dataset 1 (49%) and lowest in Test Dataset 2 (30%) (Table 1). Although extubation times varied, the median time spent in the ICU was similar across the datasets (Table 1). Of all donor lungs evaluated on EVLP, 38% resulted in transplantation and extubation in less than 72 h post-transplant, 22% were transplanted but associated with prolonged ventilation, and 40% were deemed unsuitable for transplant. These prevalence rates were used as the reference baseline for the area under the precision-recall curve (AUPRC) of EVLP and transplant outcomes.

Table 1 Clinical EVLP case characteristics for InsighTx model development

Full size table

InsighTx model development and performance

The AUROC for the overall InsighTx model was 79 ± 3%, 75 ± 4%, 85 ± 3% in the training and test sets, respectively (Table 2 and Supplementary Fig. 1). Importantly, discrimination was high for identifying donor lungs on EVLP that resulted in a time to extubation less than 72 h (AUROC: 80 ± 4% (training), 76 ± 6% (test dataset 1), 83 ± 4% (test dataset 2)) and for identifying lungs that were unsuitable for transplantation (AUROC: 90 ± 4% (training), 88 ± 4% (test dataset 1), 95 ± 2% (test dataset 2)). Although the prediction of prolonged time to extubation in transplant recipients was modest in test dataset 1 compared to the training dataset (AUROC: 67 ± 6% (training) vs. 62 ± 9% (test dataset 1)), the model performed well in test dataset 2 (AUROC: 76 ± 6%) (Table 2). Importantly, the precision (positive predictive value) of the model to identify any unsuitable donor lung (i.e., declined for transplant or extubated ≥72 h) was 81% and model precision for suitable donor lungs (i.e., extubated <72 h) was similar at 72%. Furthermore, the AUPRC showed a marked improvement of the InsighTx model to predict EVLP outcomes compared to baseline AUPRC values (prevalence of the respective endpoints) (Supplementary Fig. 1). For patients extubated <72 h (baseline AUPRC 38%), the InsighTx model had an AUPRC of 67 ± 6% in the training dataset, 74 ± 8% in test dataset 1, and 64 ± 10% in test dataset 2. Similar AUPRC results were observed in patients that required prolonged ventilation post-transplant: 40 ± 7% (training), 31 ± 11% (test dataset 1), and 42 ± 11% (test dataset 2) for the InsighTx model vs. 22% for the baseline AUPRC. Notably, the improvement in AUPRC was the strongest for lungs deemed unsuitable for transplant (InsighTx: 86 ± 5% (training), 81 ± 7% (test dataset 1), and 96 ± 2% (test dataset 2) vs. 40% baseline AUPRC).

Table 2 AUROC performance of the InsighTx model to predict EVLP and Tx outcomes

Full size table

We further investigated the relationship between the InsighTx model and PGD Grade 3 at 72 h. For donor lungs that were predicted to have a time to extubation <72 h using the InsighTx model, the negative predictive value (NPV) for PGD Grade 3 at 72 h post-transplant was 88% [95% CI: 84–91%, p < 0.001, n = 430].

A central characteristic of the XGBoost algorithm is the ability to determine the relative importance of the input variables. Only donor type and PEEP (positive end-expiratory pressure) had SHAP (shapley additive explanations) importance values of 0 and were therefore not used by the model for outcome prediction; all other input features were required by the model (SHAP > 0). We observed unique combinations of the donor and EVLP parameters that underlie the prediction of each clinical endpoint (Table 3).

Table 3 Ranked EVLP features for endpoint prediction

Full size table

InsighTx model and recipient features

We investigated whether the inclusion of key recipient features increased the performance of the InsighTx model and the prediction of post-transplant time to extubation. To do this, we employed a sequential modeling approach where the InsighTx results were combined with recipient age, sex, BMI, status, and indication for transplant to generate a secondary, updated probability of post-transplant outcome. As might be expected, the addition of recipient features increased the AUROC for the InsighTx model to discriminate which EVLP cases would result in short or prolonged time to extubation in transplant patients (Supplementary Table 1). A significant increase of 10% in the AUROC was observed compared to a recipient-only model and a similar trend of +6% in AUROC was observed versus the InsighTx model alone (Supplementary Table 1).

InsighTx implementation analysis

Our analysis showed that the InsighTx model demonstrated good net benefit for transplant suitability and post-transplant extubation <72 h decisions over a wide range of threshold probabilities (Supplementary Fig. 2). As expected, we noted that transplant ‘all’ or ‘none’ approaches were beneficial at the lowest and highest threshold probabilities, respectively (Supplementary Fig. 2), which likely reflects historical transplant decisions based on recipient urgency.

Lastly, we sought to investigate whether the results of the InsighTx model would have a meaningful impact on surgical decision-making during EVLP. A summary of the donor and recipient characteristics for this subset of EVLP cases are provided in Supplementary Table 2.

Overall, we observed that InsighTx model use encouraged a theoretical increase of 7% in the decision to proceed to transplant for lungs more likely to produce good outcomes and a 4% decrease in the decision to proceed to transplant for lungs that were unsuitable (Supplementary Table 3). Interestingly, we observed a net decrease of 13% for the utilization of lungs that resulted in the need for prolonged ventilation, with no change in the lung assessment score (Supplementary Table 3). Most notably, for lungs that were historically declined but predicted to be suitable by InsighTx, there was a 13% increase in decision to proceed to transplant when the ML based decision-aid was available (Supplementary Table 3).

Using a mixed effects logistic regression model, we observed a clinically meaningful impact of InsighTx on surgical decision-making. For lungs that were actually transplanted and had extubation <72 h or which were not transplanted but had a high probability of extubation <72 h on InsighTx, having the InsighTx model available for decision-making resulted in a 13-fold increase [95% CI: 4–45] in the odds of a favorable transplant decision and an improvement of +0.95 [95% CI: 0.4–1.51] in lung suitability assessments (i.e., the impression of lung suitability may have increased from 8 to 9 (out of 10) for a given assessor) (Table 4). Moreover, the opposite was true for unsuitable donor lungs (i.e., decreased odds of transplant and less favorable impression of the organ) (Table 4). When respondents were grouped by EVLP experience level (i.e., number of clinical EVLP cases performed; experience threshold of 100 cases), we observed a consistent effect of the model on decision making (Supplementary Table 4). Notably, those with less EVLP experience tended to have a lower baseline rate of transplantation (Supplementary Table 4).

Table 4 Summary of the impact of InsighTx on clinical decision-making

Full size table

Discussion

In the present study, we observed that a ML approach to organ assessment predicts EVLP and post-transplant outcomes. The InsighTx model was developed using the largest collection of clinical EVLP cases to date and has learned from decisions made by an experienced EVLP program. The model performed extremely well in the prediction of three possible outcomes following EVLP, with an AUROC of 79%, 75%, and 85% in the training and two test datasets, respectively. Furthermore, we demonstrated that the addition of recipient features to InsighTx predictions can be used to further fine-tune model performance. Most importantly, we show that the model represents a surgical decision-aid that could potentially lead to a safe increase in transplant volume at our institution.

An important observation from this study was that InsighTx performance was maintained in all three datasets, spanning over a decade of clinical EVLP practice, even though the prevalence of key clinical outcomes (such as post-transplant extubation <72 h) varied in the cohorts. These results reflect the robust nature of InsighTx to accurately assess the donor lung and predict clinical outcomes, irrespective of different donor populations and time periods. This finding is especially important given that Test Dataset 1 and 2 occurred during the COVID-19 pandemic which impacted lung transplant programs and organ donation rates. Thus, the results herein suggest that the InsighTx model is generalizatable to the evolving landscape of lung transplantation. As with all predictive assays for lung transplantation, future studies that involve periodic validation of clinical accuracy are warranted to continually evalutate the impact of clinical practice evolution.

Studies by our group and others have shown the predictive value of various biomarkers during EVLP^{21,22,23,24,25, 27,28,29,30,31,32}. A study by DiNardo et al. demonstrated that physiological and biochemical features may help to make a decision to transplant²². In addition, numerous other studies have highlighted the predictive role of inflammatory cytokines, including IL-6, IL-8, IL-10, and IL-1β, for the assessment of lung injury^21,23,24,25. As such, the approach taken in the present study attempts to advance all of the available data and research conducted to date towards the development of a comprehensive and unified ML-based EVLP assessment model. It is important to note that traditional cytokine testing approaches operate on timelines that are not practical for clinical EVLP; however, rapid (i.e., <40 min, TORdx LUNG) cytokine testing platforms (Supplementary Table 5) enable the integration of these features with the InsighTx model. In doing so, previous reports on the importance of biological data can be included in the InsighTx model for real-time decision making. At present, these platforms are restricted to inflammatory cytokines but, as technical capabilities expand, other previously reported protein biomarkers can be added to future iterations of the InsighTx model.

Historically, most studies on EVLP biomarker studies have focused on dichotomous endpoints and, therefore, fail to adequately represent the spectrum of outcomes following EVLP. A unique feature of the InsighTx model is the reporting of the likelihood of three possible clinical outcomes following EVLP. This provides surgeons with a comprehensive view of the most probable recipient outcome post-transplant. Notably, the model showed excellent performance in predicting donor lungs that were: (i) likely to result in a short time to extubation post-transplant, or (ii) unsuitable for transplantation. Moreover, the prediction of post-transplant extubation <72 h by InsighTx was strongly predictive of non-PGD Grade 3 at 72 h. The ability of the InsighTx model to discriminate donor lungs that were associated with prolonged ventilation post-transplant was modest, but showed marked improvement over standard practice. An important finding in our study was that InsighTx precision was 81% when prolonged ventilation and declined for transplantation EVLP cases were considered together. These results strongly support the notion of an injured donor lung phenotype identified by InsighTx which can be used to guide clinical decision-making during EVLP.

The objective of this study was to derive a model for an isolated donor lung to help predict outcome for any recipient, irrespective of their pre-transplant condition or status. It is important to note that the final decision to transplant resides with the surgeon, who takes relevant recipient features into account. Using a donor-centric approach, we observed that a ML model based on donor and EVLP features alone actually demonstrated excellent performance. Although the addition of recipient features to InsighTx improved model AUROC, it did not reach statistical significance, which underscores the importance and good performance of the donor-centric approach of the InsighTx model. Nevertheless, we found that the addition of recipient characteristics, as might be expected, can strengthen model discrimination for post-transplant time-to-extubation.

The sequential donor-recipient modeling approach underscores the power of the InsighTx model: one can use it for donor lung assessment as a generalized model for any recipient or InsighTx results can be combined with specific recipient details that will personalize the prediction to a particular patient. Our results offer further support of the role that the recipient contributes to their post-transplant outcome. For example, recipient age, BMI, and pre-transplant status (urgency) were found to be important modifiers of the InsighTx model predicted outcome. Future studies should investigate additional, more complex recipient features using the approach described herein.

As the field of ex vivo organ perfusion continues to expand, targeted therapies and regenerative strategies will be applied during ex vivo preservation to improve organ function¹⁸. Thus, the InsighTx model is well-suited to meet this future state by focusing on the outcome of the organ alone, and will be able to better gauge the impact of any future intervention on a donor lung, thereby ensuring that all donor lungs are well conditioned prior to transplant into any recipient.

Detailed analysis of the InsighTx model revealed a different mix of assessment parameters underlying the various endpoint classifications. While this finding was not unexpected, it is interesting to note the relative importance of various features in relation to lung suitability and patient outcomes. Our findings support previous observations that donor type and PEEP (set to a constant for nearly all cases) were unlikely to be associated with outcome and, thus, provide little predictive value. Biological and biochemical biomarkers were highly ranked for the prediction of post-transplant outcome. In particular, we observed that acid-base chemistry was extremely important in determining patient outcomes. Features such as pH and base excess are well known biomarkers of metabolic and respiratory acidosis in lung physiology;³³ however, the identification and weighting of these markers in EVLP by the InsighTx model is novel and further underscores the value of an AI-based approach to evaluate and understand the significance of ex vivo assessments.

One of the key findings in the present study was the real-world evaluation of the use of the InsighTx model on surgical decision-making. While there have been reports of predictive ML algorithms in thoracic surgery³⁴, this is the first such study to show that the use of an AI-based decision-aid during EVLP could theoretically change and improve lung transplant decisions. The results of this study suggest that the impact of ML on transplantation rates could be dramatic and that an overall increase in transplant activity at the program level is plausible. Of note, the effects of the ML model were different based on the predicted post-transplant outcome. For lungs that were associated with poor outcomes, there was a large decrease in the tendency to transplant. This decrease was offset by an even larger increase in the decision to transplant lungs that were historically declined, but predicted by the InsighTx model to have good post-transplant outcomes. These results also demonstrated that experienced EVLP personnel would be more likely to transplant additional donor lungs on EVLP; however, the net gain or decrease in transplantation rates were similar regardless of experience level. Thus, these findings suggest that overall donor lung utilization rates could appropriately and safely increase with InsighTx model implementation for all centers and would likely be of greater benefit to those with less EVLP experience. It is important to note the limitation that this analysis was derived from retrospective adjudication and reflects the views of the participants at our center. Thus, external validation followed by a prospective, multicentre trial is warranted to fully study and understand the broader impact of InsighTx on surgical decision-making and validate our findings that using the InsighTx AI model during EVLP can safely increase transplantation rates.

Although machine learning models can be used to accurately predict medical outcomes, careful consideration regarding the scope and ease in which the data are available will directly impact clinical translation³⁵. To that end, the data features used by the InsighTx model are routinely collected and accessible during standard EVLP practice (Summarized in Supplementary Table 5). In the future, the extraction of these data can be automated and directly linked to the InsighTx algorithm, thereby enabling streamlined integration of the model during clinical EVLP in real-time. This approach offers the exciting possibility of leveraging the performance associated with machine learning algorithms while not causing undue burden on clinical EVLP teams.

While the results of this study are promising, there are several limitations to our findings. The model was developed and validated in a cohort of lungs from a single, experienced institution. While this data represents the largest collection of clinical EVLP cases to date, future studies involving large external datasets are needed to confirm our findings. In addition, improvements to expand the breadth of InsighTx biomarkers and data, such as including additional features and/or real-time monitoring and analysis of parameters instead of hourly, is likely to enrich the data quality and strengthen the results of the model. Current efforts are underway to realize this potential.

In conclusion, ex vivo organ perfusion techniques are poised to revolutionize the approach to organ repair, regeneration, and transplantation. While these techniques are being established, a comprehensive and standardized approach to organ assessment is necessary. Using a clinically established ex vivo lung perfusion technique, EVLP, we show that an AI-based ML model is accurate and can safely lead to more transplants. As the number of patients waiting for a transplant continues to grow and outpace the number of available donor organs, the development of novel strategies that maximize the usage of these scarce resources becomes critical. The development of InsighTx to safely identify more viable donor lungs represents a significant step forward for the field of organ perfusion and transplantation by promoting a precision-medicine approach to surgical decision-making.

Methods

Study population

Informed consent was obtained from all participants. Institutional approval for this study was obtained (UHN REB#12-5488-13). All consecutive clinical EVLP cases performed at Toronto General Hospital (University Health Network, Toronto, ON, Canada) from 2008 to 2022 were considered for model development and validation. Model training was performed using consecutive clinical EVLP cases occurring between 2008 and November 2019, whereas Test Datasets 1 and 2 represented consecutive cases conducted between December 2019 and December 2020 and December 2020 and August 2022, respectively. Transplant recipient inclusion criteria included adults with end-stage lung disease referred for first lung transplantation. Exclusion criteria were double lung EVLP assessments that resulted in single lung transplantation.

Data collection and storage

All data were recorded and stored with institutional approval (UHN REB#11-0170-AE). Our EVLP technique has been previously described^1,2,3,4,5. Briefly, lung assessments are made hourly and data are derived from an ICU-grade ventilator, pressure monitors and perfusate samples collected from the EVLP circuit. Additional features were extracted from the donor chart at the time of EVLP. Biochemical and oxygenation data were generated using a blood gas analyzer (RAPIDPoint, Siemens Healthcare, Germany). ∆pO₂ and ∆pCO₂ measurements were calculated as the venous-arterial difference in oxygenation and carbon dioxide partial pressure in perfusate solution, respectively. Protein measurements (i.e., IL-6, IL-8, IL-10, IL-1β) were completed by ELISA (Ella by Protein Simple Inc., San Jose, CA, USA and TORdx LUNG by SQI Diagnostics Inc., Toronto, ON, Canada). A summary of EVLP parameters is provided in Supplementary Table 6 and Supplementary Table 7. Primary Graft Dysfunction (PGD) grades were assigned in accordance with the International Society for Heart and Lung Transplantation working group 2016 definition³⁶.

Data preprocessing

EVLP data were extracted from our Toronto Lung Transplant Program Database and assessed for completeness. Missing data was obtained using the original source documents and records, or accounted for by the XGBoost algorithm during model training and testing. Supplementary Table 5 summarizes parameter source and acquisiton time. For each parameter that was assessed longitudinally during EVLP, the following features were extracted from the data up to four hours: minimum and maximum values, trend during EVLP, and the last recorded value. For EVLP cases that lasted between four and six hours, data was capped after four hours to standardize model predictions. Compliance and protein measurements were normalized to donor lung size using estimated total lung capacity.

InsighTx model development

A comprehensive list of all assessment features used in model development can be found in Supplementary Table 6. The InsighTx model was developed using a class-weighted XGBoost algorithm (v1.4.2) trained to predict the following clinical endpoints: (i) donor lungs on EVLP deemed unsuitable for transplantation and EVLP cases that resulted in transplantation with recipients who were extubated in (ii) <72 h or (iii) ≥72 h post-transplant. EVLP cases from 2008 to 2019 were used to train the model using donor and EVLP features. The development cohort was used to establish the model hyperparameters and randomly partitioned 80:20 for training and testing―five-fold cross-validation was performed where one-fold was used as the internal test set at each of the five iterations (Note: data reported from the development cohort are derived from the results of the internal test sets). Data arising from EVLP cases conducted from 2019 to 2020 and 2020 to 2022 were used as two independent validation cohorts to test the InsighTx model. Each EVLP case was assigned a predicted outcome based on the endpoint with the highest probability (most likely outcome) derived from the InsighTx model. Predicted outcomes were used for model performance analyses and in the implementation study analysis.

InsighTx and recipient model development

A random forest model was used to evaluate the addition of recipient features (age, sex, body mass index (BMI), patient status³⁷, and indication for transplant) to the outcome probabilities of the InsighTx model. Recipient status was recorded at assessment, listing, and transplant admission according to standard procedures at our institution³⁷. All EVLP cases that resulted in bilateral transplantation (n = 368) were included, and five-fold cross validation was performed. Supplementary Table 8 lists the summary statistics for recipient features used in this analysis.

Implementation analysis

To evaluate the effect of InsighTx on clinical decision-making, we conducted a blinded retrospective case review for a subset of n = 20 EVLP cases in this study, with a panel of n = 15 participants comprising surgeons (n = 7), surgical fellows (n = 3), organ perfusion specialists (n = 3), and EVLP assistants (n = 2) at our institution (Fig. 2). Each case was de-identified and presented alongside donor and recipient information. For declined EVLP cases, the details of the intended recipient were used. The study cases included: six cases where the historical outcome matched the InsighTx model prediction (i.e., extubated <72 h or declined for transplant), nine cases that were historically declined for transplant but the InsighTx model predicted that the lungs were likely to produce a good transplant outcome, and five lungs where the InsighTx model correctly predicted the need for prolonged ventilation. For statistical analysis, cases were grouped as either suitable (predicted time to extubation <72 h) or unsuitable (predicted time to extubation ≥72 h or declined for transplant) for transplantation. Assessors were asked to determine the suitability of the lung for transplant (yes or no) based on standard EVLP evaluation parameters alone and their assessment (impression) of the organ on a scale from 0 (poor) to 10 (excellent). The predicted transplant outcome from the InsighTx model was then revealed and respondents were asked to re-answer the transplant suitability and lung assessment questions. This study analysis was reviewed and approved by our institution (UHN REB#19-6251).

Statistical methods

Demographic and clinical data were summarized using descriptive statistics for the development and testing cohorts and compared using Chi-squared, ANOVA, and Kruskal–Wallis tests. The area under the receiver operating characteristic (AUROC) and precision-recall (AUPRC) curves were used to assess the predictive performance of the overall InsighTx model as well as each clinical outcome of interest. Training and Test Dataset p-values were determined using bootstrapping. Briefly, using the Test Dataset sample size, a subset of datapoints were randomly selected from the Training Dataset and the AUROC of the subset was obtained from the model predictions. This was repeated 10,000 times to generate an underlying distribution of the AUROC values. The respective AUROC value from the Test Dataset was then used to determine a cutoff for the distribution, and the proportion of data in the distribution less or greater than the Test Dataset AUROC was used to determine the p-value. Net benefit analysis was conducted using the decision to transplant or time-to-extubation as a binary outcome on all study cases. InsighTx model net benefit was compared to transplant ‘all’ or ‘none’ approaches. To further estimate the effect of the InsighTx model results on decision-making in the retrospective review, a logistic regression model was fitted, with the suitability for transplant as the outcome, fixed effects for use of InsighTx, and EVLP group, and random effects for study case and assessor. All analyses were conducted using Stata (StataCorp, College Station, TX, USA), GraphPad (GraphPad Software, San Diego, CA, USA), SPSS Statistics (IBM Corp, Armonk, NY, USA), Python Programming Language (Python Software (v3.9), Wilmington, DE, USA), or R statistics software (R Foundation for Statistical Computing, Vienna, Austria).

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

All data supporting the findings described in this manuscript are available in the article, Supplementary Information File, and from the corresponding authors upon request. A Source Data file has also been provided. Our study design did not include provisions to share the de-identified individual participant data, given historical concerns from our institution’s Research Ethics Board on the inherent risk of potentially identifying a participant using a combination of de-identified data fields. Thus, individual patient data from this study will not be made available in publicly accessible databases. However, researchers affiliated with accredited research institutions may request access by contacting the corresponding authors (S.K. and B.W.) who will respond within one month of the request. Data transfer and usage restrictions will be in accordance with the data sharing agreement policies and procedures at University Health Network. Source data are provided with this paper.

Code availability

The study design approved by our institution did not include provisions to share source InsighTx code from this study and it is not available in publicly accessible databases. However, researchers affiliated with accredited research institutions may request access by contacting the corresponding authors (S.K. and B.W.) who will respond within one month of the request. Code transfer and usage restrictions will be in accordance with the data and material sharing agreement policies and procedures at University Health Network. A detailed description of the InsighTx model using XGBoost can be found via GitHub (https://github.com/bowang-lab).

References

Cypel, M. et al. Normothermic ex vivo lung perfusion in clinical lung transplantation. N. Engl. J. Med. 364, 1431–1440 (2011).
Article CAS PubMed Google Scholar
Cypel, M. et al. Technique for prolonged normothermic ex vivo lung perfusion. J. Heart Lung Transplant. 27, 1319–1325 (2008).
Article PubMed Google Scholar
Cypel, M. et al. Normothermic ex vivo perfusion prevents lung injury compared to extended cold preservation for transplantation. Am. J. Transplant. 9, 2262–2269 (2009).
Article CAS PubMed Google Scholar
Cypel, M. et al. Experience with the first 50 ex vivo lung perfusions in clinical transplantation. J. Thorac. Cardiovasc. Surg. 144, 1200–1206 (2012).
Article PubMed Google Scholar
Yeung, J. C. et al. Physiologic assessment of the ex vivo donor lung for transplantation. J. Heart Lung Transplant. 31, 1120–1126 (2012).
Article PubMed Google Scholar
Ceresa, C. D. L., Nasralla, D., Pollok, J. M. & Friend, P. J. Machine perfusion of the liver: applications in transplantation and beyond. Nat. Rev. Gastroenterol. Hepatol. 19, 199–209 (2022).
Article PubMed Google Scholar
Boehnert, M. U. et al. Normothermic acellular ex vivo liver perfusion reduces liver and bile duct injury of pig livers retrieved after cardiac death. Am. J. Transplant. 13, 1441–1449 (2013).
Article CAS PubMed Google Scholar
Ardehali, A. et al. Ex-vivo perfusion of donor hearts for human heart transplantation (PROCEED II): a prospective, open-label, multicentre, randomised non-inferiority trial. Lancet 385, 2577–2584 (2015).
Article PubMed Google Scholar
Xin, L. et al. A new multi-mode perfusion system for ex vivo heart perfusion study. J. Med. Syst. 42, 25 (2017).
Article PubMed Google Scholar
Kaths, J. M. et al. Normothermic ex vivo kidney perfusion for the preservation of kidney grafts prior to transplantation. J. Vis. Exp. 101, 52909 (2015).
Google Scholar
Kaths, J. M. et al. Eight-hour continuous normothermic ex vivo kidney perfusion is a safe preservation technique for kidney transplantation: a new opportunity for the storage, assessment, and repair of kidney grafts. Transplantation 100, 1862–1870 (2016).
Article CAS PubMed Google Scholar
Urbanellis, P. et al. Normothermic ex vivo kidney perfusion improves early DCD graft function compared with hypothermic machine perfusion and static cold storage. Transplantation 104, 947–955 (2020).
Article CAS PubMed Google Scholar
Prudhomme, T. et al. Ischemia-reperfusion injuries assessment during pancreas preservation. Int. J. Mol. Sci. 22, 5172 (2021).
Article CAS PubMed PubMed Central Google Scholar
Whitson, B. A. & Black, S. M. Organ assessment and repair centers: the future of transplantation is near. World J. Transplant. 4, 40–42 (2014).
Article PubMed PubMed Central Google Scholar
Keshavjee, S. Human organ repair centers: fact or fiction? JTCVS 3, 164–168 (2020).
Article Google Scholar
Divithotawela, C. et al. Long-term outcomes of lung transplant with ex vivo lung perfusion. JAMA Surg. 154, 1143–1150 (2019).
Article PubMed PubMed Central Google Scholar
Suarez-Pierre, A. et al. Measuring the effect of the COVID-19 pandemic on solid organ transplantation. Am. J. Surg. 224, 437–442 (2022).
Article PubMed Google Scholar
Watanabe, T., Cypel, M. & Keshavjee, S. Ex vivo lung perfusion. J. Thorac. Dis. 13, 6602–6617 (2021).
Article PubMed PubMed Central Google Scholar
Okahara, S. et al. Common criteria for ex vivo lung perfusion have no significant impact on posttransplant outcomes. Ann. Thorac. Surg. 111, 1156–1163 (2021).
Article PubMed Google Scholar
Possoz, J., Neyrinck, A. & Van Raemdonck, D. Ex vivo lung perfusion prior to transplantation: an overview of current clinical practice worldwide. J. Thorac. Dis. 11, 1635–1650 (2019).
Article PubMed PubMed Central Google Scholar
Sage, A. T. et al. Prediction of donor related lung injury in clinical lung transplantation using a validated ex vivo lung perfusion inflammation score. J. Heart Lung Transplant. 40, 687–695 (2021).
Article PubMed Google Scholar
Di Nardo, M. et al. Predicting donor lung acceptance for transplant during ex vivo lung perfusion: the EX vivo lung PerfusIon pREdiction (EXPIRE). Am. J. Transplant. 21, 3704–3713 (2021).
Article PubMed Google Scholar
Ferdinand, J. R. et al. Transcriptional analysis identifies potential novel biomarkers associated with successful ex-vivo perfusion of human donor lungs. Clin. Transplant. 36, e14570 (2021).
Article Google Scholar
Andreasson, A. S. I. et al. The role of interleukin-1β as a predictive biomarker and potential therapeutic target during clinical ex vivo lung perfusion. J. Heart Lung Transplant. 36, 985–995 (2017).
Article PubMed PubMed Central Google Scholar
Andreasson, A. S. et al. Profiling inflammation and tissue injury markers in perfusate and bronchoalveolar lavage fluid during human ex vivo lung perfusion. Eur. J. Cardiothorac. Surg. 51, 577–586 (2017).
PubMed Google Scholar
Chen, T. & Guestrin, C. XGBoost: A Scalable Tree Boosting System. Proc. 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD ‘16). 785–794 (2016).
Machuca, T. N. et al. Protein expression profiling predicts graft performance in clinical ex vivo lung perfusion. Ann. Surg. 261, 591–597 (2015).
Article PubMed Google Scholar
Machuca, T. N. et al. The role of the endothelin-1 pathway as a biomarker for donor lung assessment in clinical ex vivo lung perfusion. J. Heart Lung Transplant. 34, 849–857 (2015).
Article PubMed Google Scholar
Hashimoto, K. et al. Soluble adhesion molecules during ex vivo lung perfusion are associated with posttransplant primary graft dysfunction. Am. J. Transplant. 17, 1396–1404 (2017).
Article CAS PubMed Google Scholar
Hashimoto, K. et al. Higher M30 and high mobility group box 1 protein levels in ex vivo lung perfusate are associated with primary graft dysfunction after human lung transplantation. J. Heart Lung Transplant. 37, 240–249 (2018).
Article Google Scholar
Caldarone, L. et al. Neutrophil extracellular traps in ex vivo lung perfusion perfusate predict the clinical outcome of lung transplant recipients. Eur. Respir. J. 53, 1801736 (2019).
Article CAS PubMed Google Scholar
Kanou, T. et al. Cell-free DNA in human ex vivo lung perfusate as a potential biomarker to predict the risk of primary graft dysfunction in lung transplantation. J. Thorac. Cardiovasc. Surg. 162, 490–499 (2021).
Article PubMed Google Scholar
Abrams, D. et al. Risks and benefits of ultra-lung-protective invasive mechanical ventilation strategies with a focus on extracorporeal support. Am. J. Respir. Crit. Care Med. 205, 873–882 (2022).
Article PubMed Google Scholar
Bellini, V., Valente, M., Del Rio, P. & Bignami, E. Artificial intelligence in thoracic surgery: a narrative review. J. Thorac. Dis. 13, 6963–6975 (2021).
Article PubMed PubMed Central Google Scholar
Schlegel, A. The long road to identify a reliable viability test in liver transplantation. Transplantation 106, 702–704 (2022).
Article PubMed Google Scholar
Snell, G. I. et al. Report of the ISHLT Working Group on Primary Lung Graft Dysfunction, part I: Definition and grading-A 2016 Consensus Group statement of the International Society for Heart and Lung Transplantation. J. Heart Lung Transplant. 36, 1097–1103 (2017).
Article PubMed Google Scholar
Hirji, A. et al. Clinical judgment versus lung allocation score in predicting lung transplant waitlist mortality. Clin. Transplant. 34, e13870 (2020).
Article PubMed Google Scholar

Download references

Acknowledgements

The authors thank the Toronto Lung Transplant Program Biobank and Rasheed Ghany with the Toronto Lung Transplant Program Database team for their efforts in this study. In addition, we would like to acknowledge the participants of the retrospective case review analysis for their time and insight. This study would not be possible without the efforts of the clinical teams past and present, including: organ perfusion specialists, transplant fellows, nurses and students. Most importantly, we would like to thank the generosity of all organ donors.

Author information

These authors jointly supervised this work: Bo Wang, Shaf Keshavjee.

Authors and Affiliations

Latner Thoracic Research Laboratories, Toronto General Hospital Research Institute, University Health Network, Toronto, ON, Canada
Andrew T. Sage, S. Hossein Mousavi, Bonnie T. Chao, Xuanzi Zhou, Jerome Valero, Aadil Ali, Tereza Martinu, Lorenzo Del Sorbo, Jonathan C. Yeung, Mingyao Liu, Marcelo Cypel & Shaf Keshavjee
Toronto Lung Transplant Program, Ajmera Transplant Centre, University Health Network, Toronto, ON, Canada
Andrew T. Sage, Laura L. Donahoe, S. Hossein Mousavi, Bonnie T. Chao, Xuanzi Zhou, Jerome Valero, Sharaniyaa Balachandran, Aadil Ali, Tereza Martinu, Jonathan C. Yeung, Mingyao Liu, Marcelo Cypel & Shaf Keshavjee
Department of Surgery, University of Toronto, Toronto, ON, Canada
Andrew T. Sage, Laura L. Donahoe, Jonathan C. Yeung, Mingyao Liu, Marcelo Cypel & Shaf Keshavjee
Institute of Medical Science, University of Toronto, Toronto, ON, Canada
Andrew T. Sage, Tereza Martinu, Mingyao Liu, Marcelo Cypel & Shaf Keshavjee
Department of Computer Science, University of Toronto, Toronto, ON, Canada
Alaa A. Shamandy & Bo Wang
Peter Munk Cardiac Centre, University Health Network, Toronto, ON, Canada
Alaa A. Shamandy & Bo Wang
Department of Medicine, University Health Network, Toronto, ON, Canada
George Tomlinson
Interdepartmental Division of Critical Care Medicine, Medical and Surgical Intensive Care Unit, University Health Network, Toronto, ON, Canada
Lorenzo Del Sorbo
Department of Laboratory Medicine and Pathobiology, University of Toronto, Toronto, ON, Canada
Bo Wang
Vector Institute, Toronto, ON, Canada
Bo Wang

Authors

Andrew T. Sage
View author publications
You can also search for this author in PubMed Google Scholar
Laura L. Donahoe
View author publications
You can also search for this author in PubMed Google Scholar
Alaa A. Shamandy
View author publications
You can also search for this author in PubMed Google Scholar
S. Hossein Mousavi
View author publications
You can also search for this author in PubMed Google Scholar
Bonnie T. Chao
View author publications
You can also search for this author in PubMed Google Scholar
Xuanzi Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Jerome Valero
View author publications
You can also search for this author in PubMed Google Scholar
Sharaniyaa Balachandran
View author publications
You can also search for this author in PubMed Google Scholar
Aadil Ali
View author publications
You can also search for this author in PubMed Google Scholar
Tereza Martinu
View author publications
You can also search for this author in PubMed Google Scholar
George Tomlinson
View author publications
You can also search for this author in PubMed Google Scholar
Lorenzo Del Sorbo
View author publications
You can also search for this author in PubMed Google Scholar
Jonathan C. Yeung
View author publications
You can also search for this author in PubMed Google Scholar
Mingyao Liu
View author publications
You can also search for this author in PubMed Google Scholar
Marcelo Cypel
View author publications
You can also search for this author in PubMed Google Scholar
Bo Wang
View author publications
You can also search for this author in PubMed Google Scholar
Shaf Keshavjee
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.T.S. and S.K. contributed to and verified all aspects of the work. Additionally: conceptualization: A.T.S., L.L.D., L.D.S., A.A., M.C., T.M., J.C.Y., B.W., M.L., and S.K.; data curation and investigation: A.T.S., L.L.D., A.A.S., H.M., B.T.C., X.Z., S.B., T.M., G.T., L.D.S., J.C.Y., M.L., M.C., B.W., and S.K.; formal analysis: A.T.S., A.A.S., H.M., B.T.C., X.Z., G.T., and B.W.; funding acquisition: J.V., B.W.,and S.K.; methodology: A.T.S., L.L.D., T.M., L.D.S., A.A., M.L., M.C., B.W., and S.K.; project administration: J.V. and S.B.; software: A.A.S., H.M., B.T.C., X.Z., and B.W.; supervision: T.M., L.D.S., M.L., M.C., B.W., and S.K.; validation: A.T.S., L.L.D., T.M., L.D.S., M.L., M.C., B.W., and S.K.; visualization: A.T.S., L.L.D., B.T.C., and X.Z.; manuscript writing: all authors. All authors had full access to all the data.

Corresponding authors

Correspondence to Bo Wang or Shaf Keshavjee.

Ethics declarations

Competing interests

S.K. serves as Chief Medical Officer of Traferox Technologies and receives personal fees from Lung Bioengineering, outside the submitted work. A.T.S., J.V., M.L., M.C., B.W., and S.K. are inventors of patents related to the submitted work. The inventors fully adhere to policies at University Health Network that ensure academic integrity and management of potential conflicts of interest. The remaining authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Clemens Aigner, John Dark, and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information File

Reporting Summary

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Sage, A.T., Donahoe, L.L., Shamandy, A.A. et al. A machine-learning approach to human ex vivo lung perfusion predicts transplantation outcomes and promotes organ utilization. Nat Commun 14, 4810 (2023). https://doi.org/10.1038/s41467-023-40468-7

Download citation

Received: 28 September 2022
Accepted: 26 July 2023
Published: 09 August 2023
DOI: https://doi.org/10.1038/s41467-023-40468-7

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Human ex vivo lung perfusion: a novel model to study human lung diseases

Pulmonary function as a continuum of risk: critical care utilization and survival after allogeneic hematopoietic stem cell transplantation - a multicenter cohort study

Implementation of an experimental isolated lung perfusion model on surgically resected human lobes

Introduction

Results

EVLP cohort characteristics

InsighTx model development and performance

InsighTx model and recipient features

InsighTx implementation analysis

Discussion

Methods

Study population

Data collection and storage

Data preprocessing

InsighTx model development

InsighTx and recipient model development

Implementation analysis

Statistical methods

Reporting summary

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Peer review

Peer review information

Additional information

Supplementary information

Supplementary Information File

Reporting Summary

Source data

Source Data

Rights and permissions

About this article

Cite this article

Share this article

Comments

Search

Quick links