Introduction

Radiation therapy plays an essential role in cancer treatment; however, cardiac toxicities are late adverse events associated with thoracic irradiation1,2,3,4,5,6,7,8,9,10,11. Partial volume radiation-induced heart complications mostly occur in long-surviving breast cancer patients who have undergone surgery1,2; meanwhile, evidence on radiation-related injuries after whole heart radiation has come mostly from patients with Hodgkin lymphoma3. Recently, cardiotoxicity has been recognised as an adverse event that occurs earlier than previously thought, as indicated by studies on patients with lung and oesophageal cancer4,5,6,7,8. Furthermore, some reports have suggested that heart irradiation might even affect short-term survival6,7,8. Among the various cardiac complications such as pericarditis, pericardial effusion (PCE), myocardial infarction, angina, and arrhythmia, PCE is regarded as the most frequent and important toxicity2,4,8,9,10,11. Although it has been suggested that reduced irradiation dose and volume delivered to the heart may help manage the risk of complications, the quantitative evaluation of the risk for dose-volume irradiation to the heart remains unclear. Therefore, it is reasonable to introduce mathematical modelling to analyse the radiation-to-heart dose-volume relationship and predict the normal tissue complication probability (NTCP).

We have previously reported the incidence of symptomatic PCE (S-PCE) in oesophageal cancer patients, showing that the average irradiation dose to the heart (including the pericardium) was the most significant predictor of S-PCE11. As the pericardium appears to be an anatomically uniform structure, we have assumed it to be a parallel organ. This study aimed to investigate the NTCP of PCE using mathematical modelling and examine whether the mean heart dose (MHD) can be used as an indicator of NTCP in S-PCE and PCE in any grade (A-PCE).

Methods

The patient dataset analysed in this study was acquired from radiation treatment records of The Department of Radiation Oncology, Keio University Hospital. This was a retrospective observational study; hence, we used the “opt-out” method to obtain informed consent from patients. The study was approved by the Keio University School of Medicine Ethics Committee (no. 20150137). Patient clinical information was anonymised to protect personal information. All investigations were conducted in accordance with the relevant guidelines and regulations.

Patients were eligible for inclusion in the present study if they met the following criteria: newly diagnosed primary oesophageal cancer patients who received definitive concurrent chemoradiotherapy (CCR) between 2001 and 2014, involvement of the thoracic oesophagus, treated with conventional 2.0 or 1.8 Gy fractionation, total irradiation dose of ≥ 50 Gy, computed tomography (CT) data were available for analysis of the dose-volume of the pericardium, ≥ 6 months follow-up, and pathologically confirmed malignant PCE. Two-dimensional treatment plans were reconstructed as three-dimensional plans using a treatment planning system without modification, the details of which have been previously reported11.

Follow-up chest-abdominal CT scans and gastrointestinal fibrescopy were performed every 3–6 months for 3 years, and half-yearly to annual examinations were performed after 3 years. Adverse events were graded according to the National Cancer Institute-Common Terminology Criteria for Adverse Events (NCI-CTCAE) version 4.012. S-PCE was defined as an effusion of grade 3 or above. The survival and PCE- free periods were determined after the completion of CCR. Heart volume is defined as the volume enclosed by the pericardium including the heart11. In this study, however, the pericardial volume was defined separately as a ring structure consisting of 2 mm of both the outer and inner walls of the heart. The cardiac substructures, including the right and left atria and the right and left ventricles, were also manually contoured on each CT image, referring to the Atlases for Organs at Risk (OARs) in Thoracic Radiation Therapy13 and the heart atlas by Feng et al.14. All contours were reviewed and corrected by a single radiation oncologist (J.F.). Three-dimensional plans were generated in increments of 10 cGy, using a commercially available radiation treatment planning system (XiO®, Elekta AB, Stockholm, Sweden), and calculated with the superposition algorithm appropriate for use with heterogeneous tissues. Calculated dose distribution was acquired by exporting the Digital Imaging and Communications in Medicine in radiation therapy (DICOM RT) file.

Statistical calculation

Univariate logistic regression analysis was calculated with a cut-off p-value < 0.10 to analyse the relationship between PCE and patient and treatment characteristics. Pearson correlations were used to test multicollinearity with an R-squared threshold > 0.70. A multivariable forward stepwise logistic regression model was fit to include all selected significant factors associated with PCE. Variable parameters were included in the final model once the model was significantly better using the likelihood ratio test (p < 0.05, two-sided).

To validate our model internally, all patient and treatment factors and selected dosimetric parameters for the cause of the toxicities were repeated in 1000 bootstrap samples. The optimism of the generated model was assessed by estimating the performance difference between each bootstrap and the original sample in line with the Transparent Reporting of a multivariable prediction model for Individual Prognosis or Diagnosis (TRIPOD) statement15. The area and the adjusted area under the receiver operating characteristic (ROC) curve were compared to quantitatively evaluate the predictive power of the analyses. To perform an external validation, a previously reported model16 was compared to our models.

To compare and validate the MHD-based NTCP model, we used probit regression and compared it to the Lyman–Kutcher–Burman (LKB) NTCP model17. Generalised equivalent uniform dose (gEUD)-based LKB NTCP parameters (n, m, and TD50) were estimated using the maximum likelihood method18. In the probit regression-using MHD-based NTCP model, the m-value and TD50 were estimated at n = 1, with the pericardium considered an organ with a significant volume effect. In this case, TD50 represented MHD, instead of gEUD.

NTCP parameter optimisation from the DICOM RT file was performed using an in-house application. The application was coded with Python, Pydicom, dicompyler-core, and scikit-learn19,20,21,22. Optimum parameters were determined using the gradient descent method.

The probability of pericarditis incidence was given by

$$\begin{array}{c}NTCP=\frac{1}{\sqrt{2\pi }}{\int }_{-\infty }^{t}\mathrm{exp}\left(-\frac{{x}^{2}}{2}\right)dx\end{array}$$
(1)

with

$$\begin{array}{c}t=\frac{\mathrm{gEUD}-{\mathrm{TD}}_{50}}{m\cdot {\mathrm{TD}}_{50}}\end{array}$$
(2)
$$\begin{array}{c}gEUD={\left(\sum_{i}{v}_{i}{D}_{i}^\frac{1}{n}\right)}^{n}\end{array}$$
(3)

where n, m, and TD50 are the parameters of the LKB model, including the dose-volume effect on the organ; m-value represents the steepness of the dose–response curve, n-value represents the volume effect (large volume effect for n close to one; small volume effect for n close to zero), and TD50 represents the dose that corresponds to the 50% probability of complications, given whole-organ irradiation. In the third equation, vi and Di denote fractional volume and dose per ith voxel, respectively.

The following equation was used to calculate the coefficient of determination:

$${R}^{2}=1-\sum_{i}{\left({y}_{i}-f\right)}^{2}/\sum_{i}{\left({y}_{i}-\overline{y }\right)}^{2}$$
(4)

Herein, f denotes the linear function calculated by regression and \(\overline{y }\) is the arithmetic mean of a vertical axis in the figure.

Statistical analyses were performed in SPSS, version 26.0 (SPSS Inc. Chicago Illinois, USA).

Results

Patient background and incidence of pericardial effusion

A total of 297 consecutive oesophageal cancer patients were treated with CCR during the study period. Cases were excluded when lost to follow-up before 6 months (n = 30), death occurring within 6 months (n = 24), no CT data available for dose-volume analysis (n = 13), malignant PCE (n = 1). Ultimately, 229 cases were included in this study. The overall median follow-up period was 37 months (range, 6–178), and that for surviving patients (n = 131) was 48 months (range, 6–178). A-PCE was observed in 100 (43.7%) patients. The timing of PCE onset ranged from 2–75 months, with a median of 7 months. S-PCE developed in 18 (7.9%) patients; the onset was 4–108 months, with a median of 21 months. Other observed new cardiac events were coronary artery disease (n = 8), arrhythmia (n = 6), and heart failure (n = 7). Within that group, six, three, and five patients also developed PCE. Patient demographic and clinical characteristics are summarised in Table 1.

Table 1 Demographic and clinical characteristics of patients included in the study cohort.

Significant predictor identification for PCE by logistic regression analysis

Alcohol use was associated with A-PCE, while hypertension was associated with S-PCE by univariable logistic regression analysis. Meanwhile, most cardiopulmonary dose parameters were significantly associated with both A-PCE and S-PCE. Pearson’s correlation analysis revealed that these cardiac dose-volume parameters were highly correlated in predicting A-PCE and S-PCE. Then we performed a ROC analysis of these parameters to specify the most relevant parameter for PCE. MHD was selected as the significant predictor with the largest area under the curve (AUC) value for S-PCE (AUC = 0.821). While for A-PCE, the AUC of MHD (AUC = 0.713) was smaller than that of heart V40 (AUC = 0.734), we thought MHD could be regarded as a representative parameter considering its clinical usefulness and preferable AUC values. The detailed results of univariable logistic regression analyses and ROC analyses were provided in supplementary data (suppl. Data Tables 1 to 4).

Multivariable logistic regression analysis demonstrated that only the MHD was the best predictor for A-PCE, with an odds ratio of 1.08 per Gy MHD (adjusted AUC after bootstrapping = 0.715). On the other hand, S-PCE was best predicted by MHD and hypertension, with an odds ratio of 1.17 (adjusted AUC after bootstrapping = 0.814). The estimated model and NTCP curve by logistic regression are presented in Table 2a and Fig. 1, respectively. The robustness of choosing MHD in the NTCP model, both for A-PCE and S-PCE, was verified by bootstrap analysis and calibration (suppl. Data Figs. 1 to 4). The previously reported NTCP model16 results are also shown in Table 2a as an external validation, indicating comparable results of discrimination (AUC) and calibration.

Table 2 NTCP models for A-PCE and S-PCE by logistic regression analyses (2a) and by probit regression analyses (2b).
Figure 1
figure 1

NTCP curves for A-PCE (a) and S-PCE (b). The NTCP curves were calculated by the logistic regression analysis derived from MHD. A-PCE, pericardial effusion in any grade; NTCP, normal tissue complicated probability; S-PCE, symptomatic pericardial effusion; MHD, mean heart dose.

NTCP model by probit regression analysis

The NTCP curve for A-PCE derived from the MHD-based model calculated by probit regression and derived from the LKB model are shown in Fig. 2. The NTCP curve derived from the MHD-based model (Fig. 2a) showed a gentler curve (m = 0.75) than that derived from the LKB model (Fig. 2b) (m = 0.47); moreover, the NTCP curve derived from the MHD-based model showed preferable fitting by linear regression (Fig. 2c). The NTCP increased by 1.3% for each 1.0 Gy mean dose increase; the intercept was 5.1% at 0 Gy. When validating the goodness of fit, both the discrimination and the calibration of A-PCE in the MHD-based model were equivalent to that in the LKB model. On the other hand, the NTCP model for S-PCE derived from the MHD-based model calculated by probit regression and derived from the LKB model are shown in Fig. 3. The MHD-based model (Fig. 3a) was of similar shape and yielded comparable estimates as the LKB model (Fig. 3b). The validation of the goodness of fit, discrimination, and the calibration of S-PCE in the MHD-based model were equivalent to that in the LKB model. As a direct comparison of the MHD and LKB models, linear regression analysis and calibration results indicated excellent correspondence in both A-PCE and S-PCE. The summary of probit regression analysis was shown in Table 2b. The distribution of calibration plots was provided in supplementary data (suppl. data Figs. 5 to 8).

Figure 2
figure 2

NTCP curves for A-PCE calculated by the MHD-based model (a and c) and LKB model (b). The x-axis indicates the MHD, which corresponds to the mean dose in the MHD-based model. Filled circles indicate average dose points for gEUD at 10 Gy intervals. NTCP curve by the MHD-based model also showed preferable fitting by linear regression (c). A-PCE, pericardial effusion in any grade; LKB model, Lyman–Kutcher–Burman model; MHD, mean heart dose; NTCP, normal tissue complicated probability; No. of Pts, number of patients; gEUD, generalised equivalent uniform dose.

Figure 3
figure 3

NTCP curves for S-PCE calculated by the (a) MHD-based model and (b) LKB model. The x-axis indicates gEUD, which corresponds to the mean dose in the MHD-based model. Filled circles indicate average dose points for gEUD at 10 Gy intervals. gEUD, generalised equivalent uniform dose; LKB model, Lyman–Kutcher–Burman model; No. of Pts, number of patients; NTCP, normal tissue complicated probability; S-PCE, symptomatic pericardial effusion; MHD, mean heart dose.

Assessment of NTCP estimates

NTCP estimates derived from the logistic and probit regression are summarised in Table 3. The NTCP model for S-PCE showed a similar shape at the threshold near 30 Gy (Figs. 1b, 3a,b); however, the NTCP model for A-PCE, while also showing a similar shape, showed a curve that was gentler than that of S-PCE without threshold (Figs. 1a, 2a,b).

Table 3 Summary of the estimated TD and normal tissue complication probability stratified to the applied model and the grade of pericardial effusion.

Discussion

In this study, we investigated the NTCP of PCE using mathematical modelling. MHD was identified as the most significant predictor for S-PCE and a relevant parameter for A-PCE in multivariate analysis. The bootstrap analysis confirmed the robustness of obtaining a preferable AUC by selecting MHD in the NTCP model. The NTCP model derived from MHD was comparable to the widely used LKB model for both A-PCE and S-PCE.

PCE in cancer patients may arise through one of several mechanisms23,24. The lung disease study10 included receipt of adjuvant chemotherapy, history of cardiac disease, and a left-sided tumour, which were identified as independent predictors in the prospective subgroup. Several previous studies could not identify clinical background as a predictor for PCE4,11,16. In the current study, hypertension was selected as the only clinical factor relevant for S-PCE. A history of hypertension may induce cardiac symptoms in patients with A-PCE by exacerbating radiation-induced microvascular damage, raising jugular venous pressure, and edema. Chemotherapy may worsen the radiation-induced cardiotoxicity23, but it is challenging to elucidate the contribution of each therapy when CCR was performed as radical therapy. In our earlier study, chemotherapy was identified as a risk factor for PCE by univariate analysis but was not identified in the multivariate analysis25. Further investigation is needed to elucidate the mechanisms since PCE arises from various processes and factors.

In this study, various dose-volume parameters such as heart, pericardium, atria, and ventricles were widely selected as significant factors. These heart and cardiac substructures are highly correlated; therefore, we selected MHD as a representative of cardiac dose-volume parameters considering the results of ROC and the widely used guidelines26. Several previous studies have reported that the dose-volume of the irradiated pericardium may be a better factor than the dose volume of the whole heart4,16, and the bootstrap analysis of our study suggests the same. Since the dose volume of the whole heart was equivalent to that of the pericardium, we applied MHD to the NTCP model after considering its simplicity in clinical application. Some previous studies have investigated NTCP of cardiac toxicities in oesophageal cancer patients. According to studies by Emami and Burman et al., which proposed the LKB model, TD50 for the development of pericarditis was 48 Gy, with n- and m-value of 0.35 and 0.10, respectively17,27. In the present study, the estimated TD50 and n-values corresponding to S-PCE onset were nearly equivalent to those reported previously17,27. Therefore, the dose-volume associated with the development of pericarditis may correspond to that related to the development of S-PCE.

Meanwhile, several studies had investigated NTCP for A-PCE. Martel et al. reported an n-value of 0.63 for PCE, indicating that the pericardium is a parallel organ. These authors evaluated TD50 as a ‘biodose’, based on the data from a prospective clinical trial using a hypofractionation schedule, showing that TD50 was 50.6 Gy and m-value was 0.13, both of which are estimates comparable to those reported in the present study28. Recently, Beukema et al. reported NTCP for A-PCE using pericardial mean dose by logistic analysis16. Results of our external validation were per the TRIPOD statement, as the discrimination ability and calibration were comparable to their results. In this study, the NTCP curve for A-PCE showed a gentler shape than S-PCE without threshold. In addition, A-PCE showed preferable fitting by a straight line with intercept, indicating that the risk of A-PCE increases linearly with MHD. A previous report on the development of coronary artery disease in breast cancer patients revealed that disease risk increased linearly with the administration of MHD29. Similarly, another breast cancer study reported a predication model for acute cardiac events that concluded that LV-V5 (left ventricular volume receiving 5 Gy) was the predictor30. These results may suggest that even low doses cause cardiotoxicity.

Some previous studies have assumed n to equal one when considering other organs at risk to be parallel organs, for example, lungs, salivary glands, and liver31,32,33. For example, a previous study of radiation pneumonitis by Semenenko and Li compared NTCP estimates of cases with steroid administration with those of all instances showing that the NTCP sigmoid curve for all events followed a gentler progression with a lower threshold of mean total lung dose than that for cases of severe pneumonitis that required treatment with steroids32. These findings correspond to the present findings on NTCP curves for A-PCE and S-PCE.

This study has some limitations, which should be considered when interpreting its findings. First, as this was a retrospective single-centre study, the sample size and the number of symptomatic cardiac events except for PCE was small. Hence, the estimated NTCP was limited to PCE, not to whole cardiac events. Second, the estimated dose volume of cases treated with a two-dimensional plan might be inaccurate due to reconstruction. Since S-PCE was observed relatively frequently in the early period, we included these initial cases in the NTCP analysis. External validation using studies performed on modern irradiation techniques, such as intensity moderated radiation therapy34, may strengthen our model. Third, checking the inter-operator consistency with two oncologists would be beneficial since definition of border of pericardium is not easy. Noticeably, mathematical models have inherent limitations, as they do not consider host factors.

These limitations notwithstanding, the present study has several strengths. A substantial number of cases were treated using conventional fractionated irradiation and cardiac and lung structures were delineated by methods that have been previously reported as atlases and are now included among the standard. We clarify the difference in the NTCP curve for A-PCE and S-PCE. The risk of PCE could be predicted from the MHD, and it can be easily referred in daily radiotherapy planning. We believe that it can contribute to safe radiotherapy. Although the reported n-value was consistent with that shown in a previous study, we could not demonstrate that the pericardium was a parallel organ. Further studies may elucidate the impact of irradiation on a specific cardiac substructure encompassed by the pericardium.

Conclusions

The MHD was identified as a significant predictor for PCE risk assessment with preferable ROC and robustness. The MHD-based NTCP model is simple but comparable to the LKB model for both A-PCE and S-PCE. Estimated NTCP provides clinically useful parameters for predicting PCE. Hypertension may accelerate symptoms in the presence of PCE.