Introduction

Colorectal cancer (CRC) is the third most common cancer worldwide and accounts for 10% of all newly diagnosed cancers1. Although the surgical techniques, chemotherapy and molecular-target therapy have dramatically developed, the long-term survival rate of patients with CRC remains low, particularly in patients with advanced stage cancers2. Accurate prediction of prognosis will assist in adopting appropriate therapies and contribute to better management of CRC patients. Currently, the tumour-node-matastasis (TNM) surgical staging system, based on postoperative histopathology of the tumour, is considered to be the golden standard for predicting clinical outcomes of cancer patients3. However, its accuracy has been debated as tumour progression may not be solely determined by the characteristics of the tumour, but the host inflammatory responses as well4,5,6. Therefore, in recent years, great efforts have been made to identify inflammation-related factors for precise prediction of disease prognosis.

Glasgow Prognostic Score (GPS) is such an inflammation-based factor, defined by the combination of the level of serum C-reactive protein (CRP) and albumin, which are indicators of systematic inflammatory response and nutritional status respectively7. The first study on GPS reported by Forrest et al.8 in 2003 showed that it could be an predictor of prognosis for non-small-cell lung cancer patients. Subsequently, growing evidence suggested that GPS was served as an independent prognostic index in a variety of malignant cancers including hepatocellular carcinoma9, esophageal cancer10, gastric cancer11, renal cancer12, and pancreatic cancer13. For patients with CRC, the GPS system was also widely studied, but the results were controversial. Leitch and his colleagues14 reported that GPS was an independent prognostic factor for CRC, while Son et al.15 did not detect significant association between GPS and patients’ survival. This makes it rational to perform systematic evaluation of the correlation between GPS and prognosis of patients with CRC to further clarify its clinical significance.

Results

Characteristics of the enrolled studies

As shown in Fig. 1, a total of 710 records were identified from different databases by initial search. After screening the titles and abstracts, 73 full-text articles were assessed for further eligibility analysis. 48 of them were subsequently excluded as follows: 18 studies without sufficient data; 22 of overlapping populations; 8 of only documented with abstracts, comments or reviews. Finally, 25 original reports published between the year 2006 and 2016 with sample size ranging from 42 to 1000 were enrolled into the meta-analysis14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38. The characteristics of all the studies were summarized in Table 1.

Figure 1
figure 1

Flow diagram of the study selection process.

Table 1 Characteristics of included studies.

Among the included studies, some patients were enrolled multiple times for different purposes in publications. In Leitch and his colleagues’ research14, the patients with TNM stage I,IIand III were overlapped with those in Park’s study38. So we only extracted the data of patients with stage IV from the prior study. Among all the studies, 15 were conducted in Japan, 2 in the United Kingdom, 2 in China, 3 in Korea, 1 in Australia, 1 in Turkey and 1 in Austria. Ten studies only included patients with stage IV, one only included stage II and the remaining included a mixed population. The cut-off values of GPS were diverse. In 12 studies, GPS of 1 was defined as elevation, while in the remaining 13 literatures, GPS of 2 was considered as an increase. Surgery was the main treatment approach in 20 of the 25 included studies. A ‘Korean treatment’ was discussed in one study conducted in Korea, and the remaining four studies focused on chemotherapy.

GPS and survival outcome

Obvious heterogeneity was detected in this meta-analysis (OS: I 2 = 74.3%, P < 0.001; CSS: I 2 = 63.3%, P = 0.002), so the random-effect model was adopted to calculate the pooled HR and 95% CI values. There were 16 studies reporting the relationship between pretreatment GPS and OS in CRC patients. The pooled HR of 2.83 (95% CI: 2.0–4.0, P < 0.001) implied that CRC patients with elevated GPS were expected to have poor OS (Fig. 2A). There were 11 studies presenting the relationship between GPS and CSS. The pooled result showed significant association between elevated GPS and short CSS (HR = 1.94, 95% CI: 1.51–2.49, P < 0.001) (Fig. 2B). Furthermore, subgroup analysis was conducted, stratified by sample size, study of region and cut-off value of GPS. As shown in Table 2, increased GPS was also predicted to have a significant poor prognostic effect on survival in CRC patients in those subgroups.

Figure 2
figure 2

(A) Forest plot of hazard ratio for the association between GPS and OS in patients with CRC; (B) Forest plot of hazard ratio for the association betweenGPS and CSS; (C) Forest plot of hazard ratio for the association between elevated GPS and survival in patients with primary operable disease; (D) Forest plot of hazard ratio for the association between elevated GPS and survival in patients with advanced inoperable diseases.

Table 2 Results of subgroup analysis.

In addition, we explored the influence of GPS on survival in different disease stages. By grouping CRC patients into two subsets, 6 studies included patients with primary operable diseases (stages I,II, III), and 10 studies included patients with advanced inoperable diseases (stage IV). The rest included a mixed population and were excluded as the data was not able to be extracted according to the above two groups. The pooled estimate also showed that elevated GPS was correlated with worse survival, both in primary operable patients (HR = 2.43, 95% CI: 1.42–4.18, P = 0.001) (Fig. 2C) and advanced inoperable patients (HR = 2.08, 95% CI: 1.65–2.61, P < 0.001) (Fig. 2D).

GPS and clinicopathological features

There were 8 studies reporting the relationship between the level of GPS and TNM stage16,17,18, 20, 21, 23, 27, 29. The combined odds ratio (OR) of 1.44 (95% CI: 1.010–2.065, P = 0.044) displayed that patients with elevated GPS was prone to be in advanced TNM stages, with no significant heterogeneity (I 2 = 38.5%, P = 0.123; Fig. 3A). The relationship between GPS and serum carcino-embryonic antigen (CEA) level in CRC patients was presented in 10 studies16,17,18,19, 21, 24, 27, 29, 31, 33. The pooled OR was 2.252 (95% CI: 1.508–3.362, P = 0.000), indicating that elevated GPS was associated with elevated serum CEA, with no obvious heterogeneity (I 2 = 41.9%, P = 0.078; Fig. 3B). There was no significant correlation between increased GPS and lymphatic invasion (positive vs negative) and vascular invasion (positive vs negative) (Table 3).

Figure 3
figure 3

The forest plot between pretreatment GPS and clinicopathological parameters in CRC. (A) TNM stage (III, IV vs 0, I, II); (B) serum CEA.

Table 3 Relationship between pretreatment GPS and lymphatic invasion and venous invasion.

Meta-regression analysis

In order to explore the source of heterogeneity, a meta-regression analysis was performed based on the following variables: sample size, study region, cut-off value of GPS and TNM stage. The result showed that for the 16 studies on OS, sample size was significantly related to heterogeneity (P = 0.001) (Fig. 4). Study region, cut-off value and TNM stage were negatively related to hazards of endpoints (Table 4). For 11 studies on CSS, study of region was significantly related to heterogeneity, while sample size, cut-off value and TNM stage were negatively related to hazards of endpoints (Table 4).

Figure 4
figure 4

Meta-regression plot of HRs of OS against sample size.

Table 4 Results of meta-regression on OS and CSS

Sensitivity analysis

In order to assess whether the results were credible and stable with obvious heterogeneity, sensitivity analysis was carried out by means of omitting each study by turns (Fig. 5A and B). The result indicated that there was no obvious influence of one individual study on the pooled HR.

Figure 5
figure 5

(A) Sensitivity analysis of 16 studies inclued in this meta-analysis for OS. (B) Sensitivity analysis of 11 studies inclued in this meta-analysis for CSS.

Publication bias

A funnel plot and Egger’s test were employed to investigate publication bias. As a result, for the studies of the GPS and OS, Begg’s test showed no publication bias (P = 0.260, Fig. 6A), whereas, Egger’s test revealed statistical significance (P < 0.001). For the studies of the GPS and CSS, substantial publication bias was detected both in the Begg’s test (P = 0.013, Fig. 6B) and Egger’s test (P < 0.001). Consequently, the “trim and fill” analysis was further performed and the recalculated result did not change significantly (OS: HR = 1.381, 95% CI: 1.247–1.529, P < 0.001; CSS: HR = 1.430, 95% CI: 1.279–1.599, P < 0.001), indicating the stability of our pooled results.

Figure 6
figure 6

(A) Funnel plot of 16 included studies in this meta-analysis for OS; (B) Funnel plot of 11 included studies in this meta-analysis for CSS.

Discussion

Inflammation plays an important role in the development and progression of various tumours. Cancer can induce local or systematic inflammation, mediated by the activation of transcription factors and release of cytokines, which can inversely influence tumour activities, including cell proliferation, angiogenesis, cell migration, and invasion39, 40. There are several inflammation-based prognostic systems having been reported in cancers. GPS is one of them, combining serum CRP and albumin which are closely connected with the prognosis of cancer patients respectively41, 42. Initially, GPS was applied to determine the prognosis of patients with inoperative lung cancer8. Thereafter, it was increasingly used to assess the outcome of patients with multiple malignant tumours43,44,45,46,47. Recently, a number of investigations attempted to evaluate the significance of GPS in predicting patients’ survival in CRC14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38, but had the inconsistent results.

The current study, to our best knowledge, is the most comprehensive meta-analysis assessing the correlation between GPS and the prognosis of patients with CRC. A total of 25 studies containing 5660 CRC patients were included. As shown in this meta-analysis, the pooled HR of OS and CSS were 2.83 (95% CI: 2.00–4.00, P < 0.001) and 1.94 (95% CI: 1.51–2.49, P < 0.001) respectively, which suggested that patients with elevated GPS were predisposed to exhibit poor survival outcome. When subgroup analysis was performed stratifying by sample size, study of region and cut-off value of GPS, worse survival was also presented in CRC patients with elevated GPS in all the subgroups, which suggested that our results were reliable.

As staging is currently the most important prognostic indicator for CRC, we grouped patients into two subsets–the primary operable diseases and the advanced inoperable diseases. Then we conducted meta-analysis in these two groups respectively, regarding the information with reference to GPS and survival outcome. Inspiringly, the pooled results were consistent in the two groups, despite their different presentations, diagnosis and treatments, both suggesting that elevated GPS was associated with worse survival in CRC patients, which further validated our conclusion.

Furthermore, an obvious relationship between GPS and clinical parameters, including TNM stage and serum CEA level was observed, which further confirmed that GPS could be a promising predicting index for CRC patients. However, we found no significant correlation between the increased GPS and lymphatic and venous invasion. One possible explanation could be that among the included studies, only 8 studies had reported the connection between GPS and lymphatic invasion, 9 between GPS and venous invasion, resulting in a small sample size for analyzing.

The forest plot revealed heterogeneity in this meta-analysis (I 2 = 74.3%, P < 0.001; I 2 = 63.3%, P = 0.002). Therefore, we performed meta-regression analysis to explore the source. The results indicated that the sample size might contribute to the heterogeneity across 16 studies on OS, while the region of study contributed to heterogeneity across 11 studies on CSS. Among the included 25 studies, only 3 contained with populations more than five hundreds. A small sample size was usually considered as the source of heterogeneity. Of note, the majority of the included studies (19/25) were conducted in Asian medical institutions, and ethnic background and life styles may contribute to the variations in cancer patients’ prognosis, which was consistent with other meta-analysis48. However, the subgroup analysis in terms of the sample size and study region did not alter the overall results.

There are several limitations that should be considered in this meta-analysis. First of all, though the amount of included studies was large, significant interstudy heterogeneity was still observed and could not be eliminated completely. The variables included in our meta-regression analysis partly explained the heterogeneity, other factors might affect the prognosis. Secondly, most of the included studies were retrospective. Thus, further larger scale, well-designed prospective investigations are required in the future.

In conclusion, current evidence from the meta-analysis identifies elevated GPS as a promising prognostic biomarker in CRC. GPS, calculated from two conventional laboratory data, does not demand additional examinations such as imaging techniques or histopathology, and could be easily obtained from peripheral blood test. Additionally, compared to the existing evaluation system such as TNM stage, GPS could be obtained before operation. Pre-operative elevated GPS suggests patients at high risk of recurrence and poor survival, which could help doctors to adopt more careful surgery and more rigorous follow-up.

Materials and Methods

Literature search

We performed electronic literature searches using Pubmed, Embase, Cochrane library, Web of Science, ChinaInfo and Chinese National Knowledge Infrastructure before June 01, 2016. Following key search terms were included in our study: “Glasgow Prognostic Score”, “GPS”, “prognosis”, “colorectal carcinoma”, “colon cancer”. The references of all included studies were screened to identify additional related full articles.

Study selection

A study was considered eligible according to the following criteria: (1) the study was written in English or Chinese; (2) the diagnosis of CRC was confirmed by pathology; (3) correlation between GPS and overall survival (OS)/cancer-specific (CSS) was presented in the article; (4) HRs with its 95% CI were reported in the study. Exclusion criteria was defined as following: non-human CRC studies; studies published in abstracts, letters, reviews, case reports, expert comments, duplicated articles, and studies without HR and 95% CI.

Data extraction

Two investigators (LYY and HXK) independently extracted data from the eligible studies and disagreements were resolved by discussion. The following items were obtained from each study: (1) name of the first author, study region, publication year, number of cases, age, dominating treatment, follow-up period; (2) clinicopathological parameters including TNM stage, lymphatic invasion, venous invasion, lymph node metastasis, and carcinoembryonic antigen (CEA) level; (3) survival data of OS and CSS; (4) the cut-off value of GPS.

Quality assessment

We evaluated the articles identified by the above criteria and implemented quality assessment according to the Newcastle-Ottawa-Scale (NOS)49. Any discrepancy was resolved by consensus.

Statistical analysis

Data analysis was conducted using Stata 12.0 (Stata Corporation, Texas, US). The HRs with its 95% CI was directly obtained from each literature and the pooled HR was used to assess the significance of GPS on OS and CSS of the patients with CRC. If several estimates were reported for the same value, HRs and 95% CI were extracted preferentially from multivariate analysis where available. Otherwise, HRs were extracted from univariate analysis. For analyzing the correlation between GPS and clinicopathological parameters, OR with its 95% CI was combined by Mantel-Haenszel method as the effective value. Heterogeneity among pooled results was assessed using Cochrane Q test and Higgin I 2 statistic. P < 0.05 or/and I 2 > 50% were considered statistically heterogeneous and the random-effect model was applied; otherwise, the fixed-effect model was performed. Publication bias was evaluated using the funnel plot and Egger’s test (values of P > 0.05 indicated lack of publication bias)50. Meta-regression analysis was conducted to explore potential sources of heterogeneity. We also performed a sensitivity analysis in which one study at a time was removed and the rest were analyzed to evaluate whether the results could have been affected markedly by a single study.