Introduction

Accurate risk stratification in cardiac surgery is necessary to improve decision making prior to surgical and interventional treatment, patient information as to their prognosis and general1 care improvement.

The original European System for Cardiac Operative Risk Evaluation (EuroScore) was derived from a large international registry and enabled the estimation of postoperative mortality from clinical and biological preoperative variables2 EuroScore 2 was later developed to improve risk stratification in specific types of surgery such as aortic valve replacement (AVR) and improve overall calibration3.Ever since, EuroScore 2 has been largely accepted and is widely used. However, several concerns were raised, the score showing poor calibration in higher risk patients in whom it underestimated the risk4.

Apart from left ventricle ejection fraction (LVEF) and New York heart association (NYHA) functional class, EuroScore 2 does not capture heart failure severity, a known prognostic factor in cardiac surgery. Indeed, heart failure-related postoperative mortality is related to other factors than pump failure only, such as vasoplegia and systemic inflammatory response syndrome3,4. In a previous study, heart failure with preserved ejection fraction (HFpEF ) was shown to be an independent risk factor of mortality and post-operative shock5 in cardiac surgery. HFpEF was defined according to the 2016 European society of cardiology6 guidelines as an left ventricle ejection fraction ≥ 50%, symptomatic heart failure with New York heart association (NYHA) class 2 or greater, elevated brain natriuretic peptide (BNP) and relevant echocardiographic findings (left ventricle hypertrophy, left atrium enlargement, or diastolic filling anomaly).

Increased brain natriuretic peptide (BNP) plasma concentration has already been associated with worse outcomes in cardiac surgery7,8,9,10,11,12. However, although BNP is associated with mortality, interpretation must account for patients’ characteristics such as age, gender, morphology and renal function. While the predictive capabilities of BNP would not suffice to characterize patient-specific risk, this study tests the hypothesis that it could improve the accuracy of EuroScore 2.

The aim of this study was to evaluate the additional prognosis value of a model combining preoperative BNP and EuroScore 2 as compared to EuroScore 2 only, regarding in-hospital mortality.

Results

Demographics, types of procedures and outcomes

The scope of the study included 4,980 patients. Patients’ characteristics are presented in Table 1 and types of procedures in Table 2. Mean age was 68.9 years. Mean BMI was 26.7 kg/m2. The population was a standard population of heart surgery patients. 22% of the patients had a NYHA score above 3. 6% had diabetes, 13.2% had extra cardiac arteriopathy. 2.8% had active endocarditis. 81.2% had elective surgery.

Table 1 Baseline characteristics.
Table 2 Types of procedure.

The surgery was an isolated CABG in 45.6%, it was a valvular surgery in 39% of the surgeries.

EuroScore 2 had a median of 1.8 [IQR 1.0–3.42]. Preoperative BNP had a median of 481 ng/L [IQR 250–751].

Median length of stay in-hospital was 11 days [IQR 6–15]. In-hospital mortality rate was 3.7% (163 patients over 4,980 included). Comparing those who died to others, those who died were older (74.4 ± 9.7 vs. 68.7 ± 11, p-value = 0.003) and more often women (38.2 vs. 25.8%, p = 0.0001). Their EuroScore 2 was higher (6.0 [IQR 2.9–14.0] vs. 1.7 [IQR 1.0–3.2], p = 0.0001) with significant differences in NYHA scale, more chronic obstructive pulmonary disease, poor mobility, active endocarditis, critical preoperative state, and higher pulmonary pressure (see Table 1 for intergroup comparisons).

Performance of EuroScore II

ROC analysis showed that EuroScore 2 had good discrimination with an AUROC of 0.82 (95%CI 0.79–0.85; p-value < 0.0001). Calibration measure with goodness-of-fit analysis showed significant differences between observed and predicted mortality (χ2 = 51; p-value < 0.00001). Differences were mostly observed in patients with EuroScore 2 below 2% where risk was overestimated, and above 5% where it was underestimated (see Fig. 1).

Figure 1
figure 1

Calibration plot, comparison between observed mortality and mortality predicted by EuroScore 2. Differences between observed and predicted values are significant (Hosmer–Lemeshow test: χ2 = 49.94, p < 0.0001). i.e. for patients having a theoretical risk of 5%, observed mortality was 18% and for those predicted at 11%, had 30%; conversely, for patients predicted at 2%, observed mortality was null.

Performance of preoperative BNP

ROC analysis showed that preoperative BNP adequately discriminated postoperative mortality with an AUROC of 0.66 (95%CI 0.62–0.70; p < 0.0001). Differences between observed and predicted values (calibration) are significant (Hosmer–Lemeshow test: χ2 = 49.94, p-value < 0.0001.

Performance of a combined model

ROC analysis showed that a combined model of EuroScore 2 and preoperative BNP adequately discriminated postoperative mortality with an AUROC of 0.67 (95%CI 0.63–0.71, p = 0.0001). The combined model did not show a better AUROC than EuroScore 2 alone, nor BNP (DeLong comparison test p < 0.0001 in disfavor of the combined model when compared to EuroScore 2 alone, and p = 0.79 when compared to BNP alone) (see Fig. 2).

Figure 2
figure 2

In-hospital mortality according to BNP, EuroScore 2 and a combined model. EuroScore 2 and preoperative BNP accurately discriminated in-hospital mortality with respective AUROC: 0.82 (95% CI 0.79–0.85), p < 0.0001 and 0.66 (95% CI 0.62–0.70), p-value = 0.0001. Combining EuroScore 2 and preoperative BNP yielded an AUROC of 0.67 (95% CI 0.63–0.71), p = 0.0001, however, it was not superior to neither EuroScore 2 alone nor BNP alone.”

Association with in-hospital mortality

In univariate analysis, EuroScore 2 was associated with mortality with an unadjusted OR of 1.12 (1.10–1.14), p-value < 0.0001. Similarly, BNP was associated with mortality with an unadjusted OR of 1.06 (1.03–1.09), p-value < 0.001 (per 1,000 unit-increase).

In a multivariable analysis, EuroScore 2 remained independently associated with mortality with an adjusted OR of 1.12 (1.10–1.14), p-value < 0.0001. However, BNP was not associated with mortality anymore.

Subgroup analysis with AUROC comparisons of EuroScore 2

We conducted a subgroup analysis of the prediction of in hospital mortality according to BNP and EuroScore 2 stratified on several factors (see Table 3).

Table 3 AUROC comparison, by subgroup.

In patients with a eGFR > 60 mL/min/m2, the AUROC of EuroScore 2 on mortality was higher than for those with eGRF less than 60 mL/min/m2 (respectively, 0.85 (0.81–0.88) vs. 0.65 (0.57–0.73), DeLong p-value < 0.0001). Similarly, in patients with an elective surgery the AUROC of EuroScore 2 was higher compared to the patients with emergency surgery (respectively, 0.80 (0.77–0.84) vs. 0.67 (0.58–0.76), DeLong p-value = 0.016).

Further sensitivity analyses were performed on subgroups, to assess association between BNP and in-hospital mortality, independently from EuroScore 2, using multivariable regression analysis. Overall, none were statistically significant, except in the subgroup of patients with eGFR > 60 mL/min/m2 (see Supplementary Material for details).

Discussion

The study brought forward two main findings: (1) EuroScore 2 accuracy was validated regarding in-hospital mortality with an AUROC of 0.82 but undererstimated risk when above 10% and; (2) preoperative BNP was not independently associated with in-hospital mortality, regardless of EuroScore 2 in multivariable regression analysis.

The results concerning EuroScore 2 alone were consistent with previous publications: EuroScore 2 showed good discrimination with an AUROC of 0.811,13 but limited calibration, trending towards underestimation of overall in-hospital mortality2. This limits the usage of this score as risk below 10% represents more than 90% of patients, notably for the choice between interventional procedures such as percutaneous valve replacement14,15,16,17 and conventional surgery. In patients with a low calculated risk score, additional risk stratification is necessary.

The purpose of this study was to see if an elevated pre-operative BNP could increase the stratification of in-hospital mortality for cardiac surgery, since it is a good predictor of mortality in conditions such as acute exacerbation of chronic obstructive pulmonary disease18, and patients with heart failure19.

BNP being secreted by cardiac ventricular myocytes in response to increased ventricular wall tension, it accounts for variations of heart filling conditions and has been reported more accurate than LVEF estimation regarding adverse outcomes, including mortality2,20. Indeed, BNP accounts for other types of heart failure than systolic dysfunction including right ventricle and diastolic dysfunction21,22. Furthermore, it provides a better understanding of the risk of endothelial and vascular dysfunction as well as the inflammatory respons23,24. As a matter of fact, BNP has been shown associated to postoperative extracorporeal-circulation-related systemic inflammation4.

The study showed that preoperative BNP discriminative performance was moderate with an AUROC of 0.66, albeit statistically significant, as previously documented25. However, in multivariable analysis, BNP was not independently associated with in-hospital mortality, and did not add to EuroScore 2, regarding risk evaluation. More importantly, when combined to EuroScore 2, BNP decreased discrimination assessed by AUROC, with a significantly lower AUROC. This may warrant caution when using this biomarker, on top of EuroScore 2, when assessing patients’ prognosis.

We previously conducted a subgroup analysis of patients with a normal LVEF8. According to the ESC definition, heart failure with preserved ejection fraction is defined by a normal LVEF and elevated BNP. As observed in the present study, an elevated BNP did not improve the prediction of mortality. Other markers such as diastolic dysfunction in echocardiography for example should be used26. Indeed, four elements may qualify for diastolic dysfunction: annular e′ velocity: septal e′ , lateral e′, average E/e′ ratio, left atrium volume index, and peak tricuspid regurgitation velocity.

Our assumption was that BNP may have added to the prognostication of cardiac surgery patients, by potentially identifying heart failure with preserved ejection fraction, however, this hypothesis was ultimately not proven in the present paper.

Limitations

Although single-centred this study brought forward similar mortality rates and preoperative characteristics in our cohort to those previously reported in other cohorts, making external validation plausible but necessary.

BNP was routinely measured, instead of the more recent N-terminal pro-BNP (NT-proBNP), as the included patients were operated between 2012 and 2016. However, it has been documented that results found with BNP could be translated to NT-proBNP, regarding mortality and cardiac events27,28.

We acknowledge the collinearity between age28, eGFR29, LVEF30 and BNP. The sensitivity analysis that we conducted showed that even stratified on age, BMI, LVEF and type of surgery, BNP was not more associated with in-hospital mortality. Interestingly, in the subgroup of patients with eGFR above 60 mL/min/m2, BNP was an independent factor of mortality, albeit this result may require to be taken cautiously due to the type of analysis involved.

Finally, main outcome was in-hospital mortality, hence results may be harder to extrapolate to longer-term mortality, although, both EuroScore 2 and BNP were previously associated with longer-term mortality after cardiac surgery14,31.

Methods

Study population and study design

From the 1st of January 2012 to the 5th of July 2019, the study included all adult patients undergoing cardiac operations with cardiopulmonary bypass (CPB). Exclusion criteria were: age under 18 years and re-interventions during the same hospitalization. Informed consent was obtained from all subjects.

Data were collected prospectively: BNP and variables required for the computation of EuroScore 2 and in-hospital mortality. Data were anonymized as per national regulation and used with the approval of an institutional review board committee. All data are part of the Registry for the Improvement of Postoperative OutcomeS in Cardiac and Thoracic surgEry (RIPOSTE) database (NCT03209674). Patients’ opposition to the use of anonymized data by investigators was systematically sought (i.e. informed consent was obtained from all patients).

EuroScore 2 was computed as described in its original publication32. It included age, gender, New York Heart Association (NYHA) functional class, angina symptoms, insulin-dependent diabetes mellitus, extracardiac arteriopathy, chronic pulmonary dysfunction, neurological or musculoskeletal dysfunction severely affecting mobility, previous cardiac surgery, renal function with creatinine clearance, active endocarditis, critical preoperative state, left ventricle ejection fraction, recent myocardial infarction, pulmonary artery systolic pressure, procedure urgency and weight of the procedure.

Similarly to the EuroScore 2 study, main outcome was in-hospital mortality and was defined as death occurring in the same hospital where the operation took place, before discharge from the hospital2.

BNP was evaluated preoperatively in all patients, sampled in the 48 h preceding surgery. If several BNP levels were available, the most recent prior to surgery was used. BNP was quantified using immunoassay on Architect iSystem automatons (Abbott, Illinois, USA). Since BNP cut-off depends of studies and definitions32,33 we decided to use it as a continuous variable. All methods were carried out in accordance with relevant guidelines and regulations (Declaration of Helsinki).

Data analysis

Descriptive analyses were conducted and expressed, for continuous variables as mean ± standard deviation or median [interquartile range, IQR] when appropriate; and for categorical variables as number of occurrences (percentage). Normality was assessed using a Shapiro–Wilk test.

Association between in-hospital mortality, EuroScore 2 and BNP was assessed using multivariable logistic regression.

Discrimination performance of the two models (EuroScore 2 alone and then the combined model) was assessed by building receiver operating characteristic (ROC) curves and by computing the area under curve (AUROC) with a 95% confidence interval [95%CI]. The AUROC were compared using Delong test33.

Calibration of EuroScore 2 was performed using the Hosmer–Lemeshow goodness-of-fit test, i.e. the same test used in the original validation paper of EuroScore 22. Graphical representation was made by dividing EuroScore 2 into ranges of risk as described previously (< 1%, 1–5%, 5–10% and > 10%).

For ROC analysis, sample size required a minimum of 100 deaths to reach statistical significance34,35. With an expected mortality rate of 5%, based on our past activity and from other published registries, we required a minimum number of inclusions of 2,000 patients.

A multivariable analysis of mortality was performed with EuroScore 2 and BNP, using only these two variables as covariates. For sensitivity, subgroup analyses analysis were then conducted, based on specific subpopulations. They included age (under or above 65 years), BMI (under of above 25), LVEF (under and above 50%), and eGFR (under or above 60 mL/min/m2). Similarly, a subgroup analysis was conducted in patients with a low risk profile (EuroScore 2 under 2%) and a high risk profile (above 5%).

IBM v23.0 (IBM, Armonk, USA) was used for all analyses.

Conclusion

In this cohort, while BNP adequately discriminated in-hospital mortality, it did not add prognostic value to EuroScore 2 regarding postoperative in-hospital mortality after cardiac surgery. EuroScore 2 underestimated mortality in patients with risk above 5%.