Introduction

Colorectal cancer (CRC) is one of the most common cancers worldwide. Surgery is the cornerstone of treatment for CRC. More than half of newly diagnosed CRC patients receive surgical resection of the primary tumor, according to the Taiwan Cancer Registry1. Radical CRC surgery requires an en-bloc removal of the primary tumor with the associated mesentery to ensure adequate surgical margins, which is the most important prognostic factor for local control and survival2,3. Both colectomy and total mesorectal excision are common surgical interventions that general surgeons or colorectal specialists can perform in different levels of hospitals. Much of the literature has demonstrated that higher provider volume (i.e., hospital or surgeon volume) is associated with better surgical outcomes and long-term survival4,5,6,7. Some may argue that provider volume is hardly a proxy for other critical characteristics of providers, such as surgical technique or decision-making of the individual surgeon. Fundamental infrastructures of the hospital, including the quality of clinical care, equipment of the intensive care unit, and multidisciplinary teamwork, are essential considerations. However, it serves as a simple and intuitive indicator for measurement5,8.

Since the association between surgical volume and mortality was first described in the 1970s9, many publications have confirmed a positive volume-outcome relationship. Both surgical and medical therapeutic volumes have been reported to correlate with patient outcomes. For example, Ross et al. wrote that admission to higher-volume hospitals would reduce the 30-day mortality for acute myocardial infarction, heart failure, and pneumonia10. Schrag et al. noted that hospital surgical volume was effective in predicting clinical outcomes in patients who underwent surgery for colon cancer11. Several studies in Taiwan have also discussed volume and outcome. Lin et al. reported physician volume predicting inpatient mortality among ICU patients with pneumonia12. However, not all empirical evidence supports the volume-outcome relationship. Leonard et al. showed a volume effect on circumferential resection margin, R0 resection rate, sphincter preservation rate, and the number of nodes examined after chemoradiotherapy in patients with stage II–III rectal cancer; however, no volume effect was observed for recurrence and overall survival rates6. Yasunaga et al. reported that neither hospital nor surgeon volume was associated with postoperative complications in patients after colectomy13.

Recent studies with more delicate methods by applying advanced statistical and analytical strategies have provided different findings. Some studies argue that volume is not adjusted for confounding variables. For instance, Chen et al. reported high surgeon and hospital surgical volume as a significant contributor to outcomes of patients with breast cancer14. In contrast, Kuo et al. reported no association between surgeon volume and breast cancer recurrence or survival after controlling for patient and provider characteristics using multilevel mixed-effect models15. Panageas et al. demonstrated that when implementing random-effect models or generalized estimating equations in volume-outcome studies to adjust for the clustering effect of surgeons, the strength of statistical significance attenuates. In contrast, conventional statistical methods often result in overly narrow confidence intervals and could lead to biased interpretations16.

In addition, the definitions of surgical volume also affected surgical outcomes. Several studies used different definitions of surgical volume and combinations17,18,19. However, no studies have been published to date comparing the effects of different definitions of surgical volume on patient outcomes. To resolve this issue, we used Taiwan’s National Health Insurance Research Database (NHIRD) to analyze the outcomes of patients who received definitive surgeries for CRC by using different definitions of surgical volume.

Materials and methods

Data sources

Taiwan’s National Health Insurance (NHI) program was established on March 1, 1995. It covers more than 99.9% of Taiwan's population, including outpatient, inpatient, emergency, dental, and traditional Chinese medical services, as well as surgical procedures and prescription medicine. The NHIRD provides nationwide population-based data, making it a valuable resource for population-based health research. To ensure patient confidentiality, data were retrieved and analyzed by on-site analysis at the Health and Wellness Data Science Center via remote connection to the Ministry of Health and Welfare server. All patients with severe diseases, of which cancer is included, are enrolled in the Registry for Catastrophic Illness Patients (RCIP) and receive copayment exemption under the NHI program. The NHIRD contains information on the demographic characteristics of hospitals and physicians, ambulatory care, admissions, procedures, diagnoses, and prescribed medications. The diagnosis coding system is used in line with the International Classification of Diseases (ICD) 9th revision system to classify diagnostic, health services utilization, and death data20.

All personally identifiable information in the NHIRD is encrypted to ensure privacy and confidentiality. The Bureau of National Health Insurance and the National Health Research Institutes have established regulations to safeguard the confidentiality of the data. This study has received approval from the Institutional Review Board at Taipei Veterans General Hospital, affirming its compliance with ethical research standards (2019-07-054BC). All study methods were performed per relevant guidelines and regulations of Taipei Veterans General Hospital in Taiwan. The Taipei Veterans General Hospital ethical committee waived the informed consent form.

Study population

The patients enrolled in this study were newly diagnosed with CRC based on the ICD-9-CM major codes 153–154 and were to be registered in the RCIP from January 1, 2005 to December 31, 2016. Diagnosis of CRC requires pathological proof for enrollment in the RCIP. Those diagnosed with CRC before January 1, 2005 were not enrolled. CRC patients diagnosed under age 20 and those who did not identify sex and the essential information for providers were excluded.

Surgical volume

The volume of CRC definitive surgeries by each surgeon and hospital was calculated. We used three common definitions of surgical volume. Total volume means the total surgical number conducted by the same provider during the study period. Cumulative volume was calculated as the number of operations the surgeon performed before the index procedure. In contrast, annual volume was calculated as the number of times the surgeon had been responsible for surgery during the index year. Figure 1 shows the number of CRC patients a surgeon surgically operated on at different times.

Figure 1
figure 1

The definitions of surgical volume. This surgeon performed on 10 patients during the study period. The darker patient example shows the total, cumulative, and annual volume calculations.

Notably, we divided all patients into quartiles based on the hospital and surgeon volumes, ranging from lowest to highest volume. All patients were stratified into four quartiles: lowest, middle-low, middle-high, and highest.

Variables

The primary endpoint was death within 5 years from the first date of definitive surgery, which is a common indicator for the long-term outcome of cancer surgery15,21,22,23,24. Information on the date and cause of death is contained in the National Cause of Death Data. All patients enrolled in this study were followed until dropout from the NHI program, death from any cause, or the end of the year 2017.

Patient demographics include age, sex, comorbidities, urbanization, and insurance amount. Comorbidities, including hypertension, diabetes mellitus (DM), heart failure, chronic obstructive pulmonary disease (COPD), end-stage renal disease (ESRD), liver cirrhosis, and cerebrovascular accidents (CVA), were identified using ICD-9-CM codes from NHIRD. Patients’ socioeconomic status was categorized by degree of urbanization and monthly income. In addition, we used the physician and facility registries to construct hospital- and surgeon-level variables, including the hospital's level of accreditation, ownership, geographic region, as well as the surgeon's age, sex, and experience.

Statistical analysis

CRC patients’ and providers’ demographic data were compared using Fisher’s exact test or chi-square test for categorical variables and the Mann–Whitney U test for continuous variables. The 5-year survival probability was measured using the Kaplan–Meier method from the time of diagnosis to death or last follow-up. The difference between groups was further estimated by a log-rank test. Univariate and multivariate Cox proportional hazards models were used to identify predictors of 5-year mortality among CRC patients. The surgeon-level random effects were adjusted using a frailty model for Cox regression in the multivariate analysis12,25. Furthermore, model discrimination was estimated using the Akaike information criterion (AIC) and the Bayesian information criterion (BIC), which showed how the different definitions of surgical volume affect the outcomes—the lower the AIC and BIC, the more explanatory the variables and the model are. Using the Harrell's C statistics calculation, we also conducted sensitivity analysis to determine the discriminatory ability of different definitions of hospital and surgeon volumes. Data management and all statistical analysis were performed using SAS 9.4 software (SAS Institute Inc., Cary, NC) and STATA statistical software, version 15.1 (StataCorp, College Station, TX). Statistical significance was defined as a p value of less than 0.05.

Results

Characteristics of the study population

We identified 100,960 newly diagnosed CRC patients who received definitive surgeries between January 1, 2005 and December 31, 2016. Of these, 257 patients did not reveal sex, 29 patients were under the age of 20, and 665 patients did not reveal the provider's basic information (Fig. 2). Therefore, the final cohort consisted of 100,009 patients, including 55,849 (55.8%) males and 44,160 (44.2%) females, of median age of 66 at diagnosis (range 20–105 years). The characteristics of CRC patients, hospitals, and surgeons are listed in Table 1. More than half of the surgeons were aged ≥ 45 years (57.4%), and the median experience of curative surgery was 8.6 (IQR 5.7–11.9) years. Patients were separated into four groups according to the surgeon’s total volume. The median total volume of CRC definitive surgeries was 575 (IQR 218–1,089) for surgeons. Patients aged ≥ 65, those who were male, those with comorbidities, patients having lower income, and those living in rural areas were more likely to undergo resection by lower-volume surgeons.

Figure 2
figure 2

Patient selection flow chart.

Table 1 Baseline characteristics of patients with colorectal cancer receiving definitive surgery.

Comparing different surgical volumes for 5-year death

We divided CRC patients who received definitive surgeries into quartiles according to different definitions of provider volume. The Kaplan–Meier curves show that patients' 5-year survival was significantly better in other models in higher-volume groups (Fig. 3). The total volume-mortality relationship in CRC surgery is shown in Table 2. Model 1 was adjusted for patient-level risk factors, including age, sex, comorbidities, degree of urbanization, and income group. Model 2 was adjusted for variables listed in Model 1 and hospital-level risk factors, including ownership, geographic region, and medical center status. Model 3 was adjusted for patient-level, hospital-level, and surgeon-level characteristics. The univariate Cox proportional hazards analysis showed that hospital and surgeon total volumes were significantly associated with a lower mortality risk within the 5-year observation period, and after adjustment for individual and provider characteristics, surgeon volume, but not hospital volume, remained a significantly predictive factor of death, with a dose–response relationship. The adjusted HRs were 0.76 (95% CI 0.73–0.78; p < 0.001), 0.73 (95% CI 0.70–0.75; p < 0.001) and 0.67 (95% CI 0.64–0.70; p < 0.001) for the middle-low, middle-high, and highest surgeon total volume group, respectively, compared to the lowest quartile surgeon total volume group.

Figure 3
figure 3

Five-year survival between patients receiving definitive surgery for colorectal cancer divided by different definitions of hospital and surgeon volumes.

Table 2 Risk factors for 5-year mortality among patients receiving definitive surgery for colorectal cancer.

We further used two definitions of surgical volume that affected the outcomes: cumulative and annual volume. Compared with other definitions of surgical volume, cumulative volume provided the lowest AIC and BIC for predicting 5-year mortality compared with the total and annual volumes. The details are shown in Table 3. For the cumulative volume, the results show that higher surgeon cumulative volume was associated with a lower risk of 5-year mortality than low-volume surgeon volumes. The adjusted HRs for the middle-low, middle-high, and highest quartile compared with the lowest surgeon cumulative volume were 0.75 (95% CI 0.73–0.78; p < 0.001), 0.68 (95% CI 0.66–0.71; p < 0.001) and 0.59 (95% CI 0.56–0.62; p < 0.001), respectively.

Table 3 Risk factors for 5-year mortality among patients receiving definitive surgery for colorectal cancer.

Sensitivity analysis

The surgeon cumulative volume seems the most suitable for demonstrating the relationship between provider volume and 5-year mortality. The cumulative volume model provided the highest Harrell’s C statistic (0.621), with total and annual volume following. Harrell’s C statistics were 0.618 and 0.620 for total and annual volume, respectively.

Volumes of laparoscopic minimally invasive surgeries

A total of 17,002 (17.0%) CRC patients who underwent laparoscopic colon resection were enrolled across 118 hospitals and 582 surgeries. Laparoscopy provided a lower 5-year mortality rate among all patients than open surgery (HR 0.60, 95% CI 0.58–0.62; p < 0.001). Furthermore, we calculated the number of laparoscopic minimally invasive surgeries for hospitals and surgeries, classifying them into four groups based on their quartiles. Supplementary Fig. 1 illustrates the survival curves for the total volume of minimally invasive surgeries in CRC patients. We observed some differences between the hospital volume and surgeon volume groups, which were statistically significant (all p < 0.001). There was an association between 5-year mortality and surgeon volume in the total volume, but not hospital volume. The patients treated by higher-volume surgeons had a lower 5-year mortality rate. The adjusted HRs of 5-year mortality for low-intermediate, high-intermediate, and high-volume compared with low-volume surgeons were 0.87 (95% CI 0.79–0.97; p = 0.013), 0.84 (95% CI 0.75–0.95; p = 0.007) and 0.90 (95% CI 0.79–1.04; p = 0.159), respectively. However, regardless of hospital and surgeon volume, using different definitions of volume to include cumulative and annual volumes was not associated with 5-year mortality (Supplemental Table 1).

Discussion

The results of our study reveal that higher surgeon volume is independently associated with the lower 5-year mortality rate in CRC patients receiving definitive surgery after adjusting for hospital volume and other potential confounders. When introducing different definitions of provider volume, the inverse association between surgeon volume and 5-year mortality rate remain consistently significant, while hospital volume does not. Surgeon volume is more important than hospital volume in CRC surgeries. Thus, those CRC patients treated by high-volume surgeons at low-volume hospitals have a lower mortality risk than those treated by low-volume surgeons at high-volume hospitals. In addition, our findings demonstrate that “cumulative volume” can predict the 5-year mortality of the study cohort better than total and annual volume.

There is a large body of literature over the past 30 years investigating the volume-outcome relationship in cancer treatment26. However, the definition of provider volume used in these studies varies and therefore a different definition of provider volume could affect the analytical results of patient outcomes. The most common volume definitions are annual volume and cumulative volume17,27,28,29,30. In a volume-outcome study on cervical cancer patients after radiation therapy conducted by Wright et al., mean annualized hospital volume and previous year hospital volume were not associated with survival benefit, while in the sensitivity analysis, current year hospital volume, which was defined as the number of patients treated at a given hospital within the same calendar year, significantly predicted survival outcome19. Derogar et al. reported annual volume as the number of operations within the index year and cumulative volume as the chronological number of operations, and they found the combination of annual and cumulative surgeon volume to be a predictor of long-term survival for esophageal cancer patients, whereas individually these factors were not17. Jeldres et al. reported that both annual and cumulative provider volumes were independent predictors of failure-free survival in patients with localized prostate cancer after definitive radiotherapy18. In the present study, we explored the effect of varying definitions of provider volume on the mortality rate of CRC patients, and the results were consistent among different models. Moreover, we also include AIC, BIC, and Harrell’s C statistics to examine the fitness of varying volume definitions, which was not included in the prior studies. When examining AIC, BIC, and Harrell’s C statistics with different volume definitions, cumulative volume, defined as the number of surgeries the surgeon performs before the index surgery, is the best-fitting model. This number could present as an indicator of the cumulative experiences of surgery. In addition, this finding can also be used as a guide in subspecialty education programs, showing that the cumulative procedures conducted by the surgeon should meet the minimal requirement to be certified as a specialist.

The surgeon's cumulative volume could be seen as a measure of accumulated experience and proficiency. Surgeons who have performed more surgeries are expected to have better skills and outcomes due to their experience. Notably, some medical procedures have a learning curve, with surgeries likely improving over time with practice. In this context, the cumulative volume could assess how the performance of surgery changes as surgeons gain more experience, aligning with the “practice makes perfect” hypothesis26,31. Moreover, cumulative volume might be considered a proxy for the quality of care. Surgeries performed by higher-cumulative-volume surgeons are thought to be more efficient and effective as potentially accruing more experience leads to improved skills, reduced complications, and enhanced patient care.

The positive volume-outcome relationship in CRC patients after surgery has been demonstrated in much of the literature, with or without mutual adjustment for hospital and physician volume. Schrag et al. investigated hospital and surgeon volume on the outcome of rectal cancer patients. The results of their work revealed that surgeon volume was a better predictor of long-term survival than hospital volume7. In the results from a Cochrane systematic review and meta-analysis, the effect of workload on patients’ prognosis after CRC surgery was stronger at the surgeon level than at the hospital level, particularly in the outcome of 5-year overall survival and operative mortality32. Previous studies have explored the positive association of high cumulative and high annual volume surgeons on cancer survival rates for rectal and colorectal cancer patients24,30. When researching different volume definitions, the results of our study confirmed the robustness of the importance of physician volume over hospital volume on the long-term mortality rate in CRC patients after cancer surgery. These findings suggested that enhancing surgeon-specific experiences can improve patients’ outcomes. Thus, from a healthcare policy standpoint, minimal requirements for surgical procedures and specialization may be essential for surgeon training in colorectal cancer surgery. In Taiwan, patients can seek second opinions with great accessibility under the coverage of National Health Insurance. As a result, patients tend to visit medical centers for cancer treatment. From the patients’ aspect, the information on surgeon volume can minimize the information gap and help patients choose an experienced surgeon in the local hospital near where they live, thus improving patient logistics of high-volume centers.

The laparoscopic surgical approach has become the gold standard treatment for most colon cancers, as recommended by the American Society of Colon and Rectal Surgeons33. Our study found no association between the volume of laparoscopic minimally invasive surgeries and the risk of 5-year mortality due to the smaller number of patients included in the analyses during the later years of follow-up. The adoption of the laparoscopic approach for colorectal surgery within the surgical community has been slower and less widespread in Taiwan compared to laparoscopic procedures for other indications. In our study population, we observed a gradual increase in the number of patients with CRC selected for laparoscopy in recent years. The implementation of laparoscopic surgery, like any new surgical technique, is associated with a learning curve and may result in longer operation times, shorter hospital stays, reduced incidence of surgical site infection, increased complications, or a higher frequency of conversion to open surgery34,35,36,37.

The current study had certain limitations. First, several potential confounders, including smoking, alcohol consumption, obesity, and family history, are not available in the data. Secondly, we lacked data on tumor markers and genetic features, which might also impact patients’ long-term survival. Third, potentially curative surgeries might be mixed with palliative operations. Fourth, surgical volume beyond the study period was not collected. Finally, a more extended surgical experience may be an effect modifier for surgeon volume, which needs further studies.

In this nationwide population-based study, we observed that CRC patients treated by high-caseload surgeons had better survival. Additionally, surgeon volume is shown to be a more important predictor of 5-year mortality than hospital volume in CRC patients after cancer surgery. Among the varying definitions of surgeon volume we applied, surgeon cumulative volume could predict patients’ outcomes better than the volumes of the other definitions. Hence, the cumulative volume of the surgeon can serve as an index of surgeon experience and be used as a reference in subspecialty qualification.