Comparison of faecal protein biomarkers' diagnostic accuracy for colorectal advanced neoplasms: a systematic review and meta-analysis

Nasir Kansestani, Atefeh; Zare, Mohammad Erfan; Tong, Qingchao; Zhang, Jun

doi:10.1038/s41598-022-06689-4

Download PDF

Article
Open access
Published: 16 February 2022

Comparison of faecal protein biomarkers' diagnostic accuracy for colorectal advanced neoplasms: a systematic review and meta-analysis

Atefeh Nasir Kansestani¹^na1,
Mohammad Erfan Zare¹^na1,
Qingchao Tong¹ &
…
Jun Zhang¹

Scientific Reports volume 12, Article number: 2623 (2022) Cite this article

1848 Accesses
3 Citations
Metrics details

Subjects

Abstract

Early diagnosis of colorectal advanced neoplasms (ANs), including colorectal cancer (CRC) and advanced adenoma (AA), has a positive effect on the survival rate. As a first attempt, the aim of this meta-analysis was to compare the diagnostic accuracy of faecal protein biomarkers for the detection of colorectal neoplasms with consideration of a wide range of covariates. A systematic literature search was performed up to Jun 10, 2021 on Web of Sciences, Scopus and PubMed. The diagnostic accuracies were calculated using the bivariate/hierarchical random effect model. Biomarkers were determined to be clinically applicable (CA) if they had areas under the curve > 0.70 and positive and negative likelihood ratios > 2 and < 0.5, respectively. A total of 47,059 test results were extracted from 16 immunochemical faecal occult blood test (iFOBT), 26 pyruvate kinase-M2 (PK-M2) and 23 faecal calprotectin (FC) studies. Only iFOBT, PK-M2 and FC for CRC plus iFOBT and PK-M2 for AN were CA. iFOBT had significantly superior accuracy (P = 0.02 versus PK-M2 and P < 0.01 versus FC for CRC; P < 0.01 versus PK-M2 for AN). Regarding covariates, the lateral flow method of PK-M2 measurement increased its accuracy for CRC detection compared to the enzyme-linked immunosorbent assay (P < 0.01). iFOBT is recommended as the most accurate faecal biomarker for CRC and AN diagnosis.

Faecal microRNAs as a non-invasive tool in the diagnosis of colonic adenomas and colorectal cancer: A meta-analysis

Article Open access 01 July 2019

Diagnostic performance of a faecal immunochemical test for patients with low-risk symptoms of colorectal cancer in primary care: an evaluation in the South West of England

Article Open access 19 January 2021

Biomarker measurement in non-invasively sampled colorectal mucus as a novel approach to colorectal cancer detection: screening and triage implications

Article Open access 13 May 2020

Introduction

Colorectal cancer (CRC) is currently the third-most prevalent malignancy and the second leading cause of death among cancerous patients¹. Despite the fulfilment of major efforts such as screening programs, the increasing trend of new cases in recent years indicates that better strategies are required not only for the early diagnosis of CRC but also for other types of colorectal advanced neoplasms (ANs) as important precursors of CRC. AN includes CRC and advanced adenoma (AA). AA is defined as multiple colorectal polyps or individual lumps ≥ 1 cm in size, tubulovillous or villous histology features or high-grade dysplasia. AA patients have a higher risk of developing CRC².

Early diagnosis of AN has a positive correlation with a high survival rate owing to implementation of proper treatments, especially in high-risk groups, including first-degree relatives (FDRs) of individuals with CRC and AA. Guidelines from different authoritative societies recommend AN screening for average-risk individuals by age 50 years or older and 40 or 10 years for high-risk groups. Today, colonoscopy is considered the gold standard for AN diagnosis and screening^2,3. However, colonoscopy is an expensive, invasive and operator skill-dependent technique. In addition, it requires unpleasant bowel preparation and occasionally causes serious complications. Therefore, implementing noninvasive biomarkers for the diagnosis of AN seems to be necessary⁴.

Today, a wide range of faecal biomarkers consisting of stool DNA testing, stool miRNAs, the faecal microbiome and different proteins have been introduced for the diagnosis and screening of AN. However, faecal protein biomarkers have special importance due to their low cost, noninvasiveness and simple sampling procedure attributes⁵. The first introduced faecal biomarker for AN was the guaiac-based faecal occult blood test (gFOBT), and since its introduction, it has saved many human lives, despite its low sensitivity. This method has been replaced by the immunochemical faecal occult blood test (iFOBT), which has much higher sensitivity^2,5. In recent decades, some novel and promising faecal protein biomarkers have been introduced for the diagnosis and screening of CRC and other ANs. For example, pyruvate kinase-M2 (PK-M2) with an overall accuracy of 0.85⁶ and faecal calprotectin (FC) with an overall accuracy of 0.81⁷ have been reported for the detection of CRC in the latest published systematic review and meta-analyses. Nonetheless, there is no comparative systematic review or meta-analysis to find the most accurate faecal protein biomarker. Given the above information, as a first attempt, the aim of this evidence-based meta-analysis was to compare the diagnostic accuracy of clinically available faecal protein biomarkers for the detection of CRC, AA and AN with consideration of a wide range of covariates to find and recommend the most accurate one.

Methods

Search strategy

The search strategy of the present systematic review was carried out based on the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) statement⁸. We performed systematic searches on electronic databases containing Web of Science, Scopus and MEDLINE/PubMed until June 10, 2021, without any language restrictions. Additionally, the Chinese National Knowledge Infrastructure (CNKI) database for Chinese full text articles and the Scientific Information Database (SID) database for Persian full text articles were searched. The following MeSH terms (“Colorectal neoplasms”) AND (“Diagnosis” OR “Early detection of cancer”) were used to search MEDLINE/PubMed, and text words containing (“Colorectal cancer” OR “CRC” OR “Colorectal malignancy” OR “Colorectal tumour” OR “Adenoma” OR “Colorectal neoplasms”) AND (“Faecal biomarker” OR “Laboratory tests” “Diagnostic biomarker” OR “Screening Biomarker”) were used to search other databases, besides MEDLINE/PubMed (Supplemental Table S1). Finally, similar papers which were purposed by Pubmed, as well as google scholar, the reference lists of each selected paper and related systematic and narrative reviews on this topic were assessed to identify missed studies. To exclude duplicate papers, records were imported into EndNote software (Version X9, Thomson Reuters).

Study selection and data extraction

Two reviewers (A.N. K and M.E. Z) independently screened the title and abstract of all obtained records for eligibility and inclusion. The inclusion criteria were as follows: (1) patients for whom a faecal protein biomarker was used to detect CRC, AA or AN; (2) CRC and AA should be confirmed by colonoscopy and pathology reference standards; (3) specific diagnostic information was sufficient to construct a 2 × 2 contingency table; and (4) for each faecal biomarker, at least 4 studies should be found. Exclusion criteria were set as follows: (1) duplicated studies, review articles, editorials, case reports, and clinical guidelines; (2) insufficient data reporting to construct the 2 × 2 contingency table; CRC and colorectal AA were not verified by the aforementioned reference standards.

A custom-made form was utilized for data extraction, including the first author’s name, publication year, country of the study, subjects’ average age, gender, study design, total sample size, true positives, true negatives, false-positives, and false negatives. The results of iFOBT were extracted in those studies, which was accomplished along with other assessed biomarkers. To achieve more reliable results in case–control designed studies, 2 × 2 contingency tables were constructed by comparing the specific characteristics versus not only healthy controls but also other patients, which did not have those specific characteristics. To homogenize different units, mg/L (= μg/mL) was transformed to μg/g by multiplying each value by a factor of 5.

In CRC patients, the percentages of distal and late-stage tumours were extracted. Proximal tumours were defined as those located from the caecum to the transverse colon, and distal tumours were located from the splenic flexure to the rectum. In addition, late-stage tumours were defined as CRC stages III + IV or Dukes’ stages C + D versus 0 + I + II or Dukes’ stages A + B, which were categorized as early-stage tumours⁹. Colorectal adenomas were defined as AA when the following features were present: (1) high-grade dysplasia; (2) tubulovillous or villous components; and (3) multiple adenomas or individual lumps ≥ 1 cm in size. AN included CRC and/or AA.

Quality assessment and publication bias

The methodological quality of each included study was assessed utilizing the Quality Assessment of Diagnostic Accuracy Studies-2 (QUADAS-2) tool. QUADAS-2 evaluates four key domains made up of “patient selection”, “index test”, “reference standard”, and “flow and timing” in two categories, “risk of bias” for all four domains and “applicability” for the first three domains in the diagnostic accuracy studies. Each category was scored as low, high or unclear according to the assessment criteria. All disagreements were resolved by consensus after discussion. Furthermore, to evaluate potential publication bias, the linear regression method was utilized to assess the asymmetry of Deeks’ funnel plot. P < 0.1 for the slope coefficient reveals the presence of publication bias.

Statistical analysis

To construct a 2 × 2 contingency table, true positives, false positives, true negatives and false negatives were calculated for each included study. A standard bivariate method was employed to calculate the summary points, including pooled sensitivity, pooled specificity, pooled positive likelihood ratio (PLR +), pooled negative likelihood ratio (PLR−) and pooled diagnostic odds ratio (PDOR). Using a hierarchical model, summary receiver operating characteristic (HSROC) curves were plotted to determine the area under the curve (AUC) as a global measure of test performance. The overall diagnostic accuracy of each biomarker was interpreted according to AUC, PLR+ and PLR−. The relationship between the AUC value and diagnostic accuracy is described as follows: 0.5–0.70 is interpreted as not acceptable, 0.71 to 0.79 acceptable, 0.80–0.89 good and 0.90–1 excellent. Additionally, based on PLR+ and PLR−, the diagnostic accuracy of each biomarker is divided into four categories. PLR− values < 0.1, 0.1–0.2, 0.2–0.5 and > 0.5 represent substantial, moderate, small and nonmeaningful evidence, respectively, to rule out disease existence. PLR+ values > 10, 5–10, 2–5 and < 2 are considered substantial, moderate, small and not meaningful evidence to rule in disease existence, respectively. The results of LRs were summarized by a scattergram.

In this study, we considered clinically applicable biomarkers if they had AUC > 0.70, PLR+ > 2, and PLR− < 0.5. To compare the diagnostic accuracy of different clinically applicable biomarkers, relative DORs (RDORs) and their P values were computed.

Between-study heterogeneity was evaluated using Higgins’ inconsistency index (I²). I² > 50% implied substantial heterogeneity. To find potential sources of heterogeneity and explore the robustness of the results, when sufficient studies were available, subgroup analysis was performed based on the method of measurement, cut off, study type and QUADAS-2 domains. Additionally, meta-regression analysis was carried out on age and sex covariates for all neoplasms as well as tumour location and stage covariates for CRC. In addition, to illustrate another potential source of heterogeneity, the Spearman correlation coefficient was calculated to determine the threshold effect.

In the present study, calculations were conducted and summarized for reporting considering a 95% confidence interval (95% CI), and reports were defined as statistically significant when P < 0.05 (except publication bias). All statistical analyses were performed by “midas” commands in Stata software (Stata Corporation, College Station, TX, USA, version 12.0), and RevMan 5.3 was employed to draw comparative HSROC plots.

Results

Study selection

Among 2581 initial records, 840 studies were excluded owing to duplication, and 1670 were excluded after screening the title and abstract. In this stage, the most common reasons for exclusion were (1) review articles, editorials, case reports, and clinical guidelines; (2) laboratory biomarkers evaluated on nonfaecal samples (serum and tissue); and (3) nonprotein biomarkers such as molecular biomarkers and microbiome mass. Finally, 71 studies underwent full text assessment. Among these, 22 studies were excluded due to the following reasons: (1) lack of verification by reference standard (colonoscopy and pathology) (n = 13) and (2) insufficient data to construct the 2 × 2 contingency Table (n = 9). Eventually, 49 studies with 47,059 test results were included in the present study (Fig. 1A).

Pursuant to the inclusion criteria, PK-M2 and FC were found to be eligible biomarkers for further assessment.

Sixteen of 49 included studies reported iFOBT data in addition to the other assessed biomarkers, with 13,769 test results^{10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25}. All 16 studies had iFOBT results for the diagnosis of CRC (5610 test results), 10 studies had results for AA (4008 test results) and 11 studies had results for AN (4151 test results). One study evaluated iFOBT with two different commercial kits, so we constructed two separated 2 × 2 contingency tables from this article¹⁶.

From 26 PK-M2 included studies with 12,213 test results^{13,14,15,16,17,18,19,20,21,22,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41}, 25 studies reported the results of CRC detection (5706 test results), 10 studies for AA (3781 test results) and 10 studies for AN (2726 test results). One study assessed PK-M2 by two different methods and therefore built two 2 × 2 contingency tables from this article¹³.

We found 23 studies with 21,077 test results for FC^{10,11,12,19,20,23,24,25,30,42,43,44,45,46,47,48,49,50,51,52,53,54,55}, all of which had information on CRC diagnosis with 9747 test results. The FC results for the detection of AA and AN were extracted from 9 articles with 5665 test results for each condition. There were two studies that evaluated FC by two different methods, so two separate 2 × 2 contingency tables were constructed for each article^25,44.

Two studies evaluated all three biomarkers^19,20, and one study evaluated PK-M2 and FC at the same time for CRC diagnosis³⁰. Table 1 summarizes the main characteristics of the included studies in this review.

Table 1 Characteristics of included studies.

Full size table

Quality assessment and publication bias

The quality of the included studies was assessed using the QUADAS-2 tool, and the results were illustratively summarized for each biomarker (Fig. 1B–D). The quality assessment results of the included studies in the iFOBT group revealed the major risk of bias in the “flow and timing” and “patient selection” categories mainly because all patients were not included in the analysis and case–control study design, respectively (Fig. 1B). Regarding PK-M2 included studies, the major risk of bias occurred in the “patient selection” category because of the case–control study design. Additionally, there were 3 studies with high risk and 9 studies with unclear risk of bias in the “index test” category as a result of a lack of prespecified thresholds and unclear index test interpretation without knowledge of the reference standard result (Fig. 1C). Concerning FC included studies, the greatest risk of bias referred to “flow and timing” and “index test” owing to the aforementioned reasons (Fig. 1D). The included studies for all biomarkers raised no concerns regarding applicability.

Table 2 includes the publication bias analyses of each group. Regarding CRC diagnosis, Deeks’ funnel plot asymmetry test indicated that there was no significant publication bias in the iFOBT, PK-M2 and FC biomarker datasets (Supplemental Fig. S1A–C). In relation to AA detection, significant publication bias in the iFOBT dataset and the absence of publication bias in the PK-M2 and FC datasets were found (Supplemental Fig. S2A–C). Concerning AN diagnosis, analyses indicated no significant publication bias in iFOBT and PK-M2 but indicated significant publication bias in FC datasets (Supplemental Fig. S3A–C).

Table 2 Diagnostic accuracy of faecal biomarkers and their comparisons.

Full size table

Diagnostic accuracy of faecal biomarkers

Table 2 presents the diagnostic accuracy of different faecal biomarkers for the detection of CRC, AA and AN. For CRC diagnosis, all 3 assessed biomarkers were applicable according to their PLR+, PLR− and AUC (> 2, < 0.5, and > 0.70, respectively) (Table 2 and Fig. 2A–D). Figure 2E shows the LR scattergram of CRC clinically applicable biomarkers.

Our results showed that there was no applicable biomarker for the diagnosis of AA individually. Moreover, the analyses showed that iFOBT and PK-M2 were clinically applicable for the detection of AN, whereas FC was not applicable (Table 2 and Fig. 3A–C). Figure 3D presents the LR scattergram of CRC clinically applicable biomarkers.

Comparison of faecal biomarkers diagnostic accuracies

The most useful parameter for comparison of test accuracies between different biomarker groups or subgroups is DOR. Thus, we used individual DORs and their relatives to compare the diagnostic accuracies of clinically applicable biomarkers.

Among CRC clinically applicable biomarkers, the accuracy of iFOBT was significantly higher than that of PK-M2 and FC. The accuracies of PK-M2 and FC were not significantly different (Table 2). In addition, the AUC of iFOBT was the highest among other biomarkers, and based on the LR scattergram, only iFOBT had an upper moderate power of accuracy to both rule in and rule out CRC existence (Fig. 2E).

Among AN clinically applicable biomarkers, the accuracy of iFOBT was significantly higher than that of PK-M2. Furthermore, in comparison to PK-M2, the AUC of iFOBT was higher (Table 2). In line with the LR scattergram, iFOBT had an upper moderate power of accuracy to confirm but not exclude AN existence, whereas PK-M2 had a lower moderate power of accuracy to confirm and exclude AN existence (Fig. 3D).

To determine the effect of biomarker combinations on diagnostic accuracy, the results of double combinations, including iFOBT + PK-M2, iFOBT + FC, PK-M2 + FC, and triple combinations, namely, iFOBT + PK-M2 + FC, were extracted from primary studies if they had these data. The final result was considered positive if at least one of the biomarkers was positive, and negative results were determined if all double or triple combined biomarkers were negative. The iFOBT + PK-M2 data could be extracted from three studies^18,20,21, following iFOBT + FC from three^20,23,25, PK-M2 + FC from one²⁰ and iFOBT + PK-M2 + FC from two^19,20 studies. Our analysis could not find any combined biomarker that significantly increased the diagnostic accuracy compared to individual biomarkers (Supplemental Table S2). Moreover, PK-M2 + FC and iFOBT + PK-M2 + FC had significantly lower accuracy for the diagnosis of AN than individual iFOBTs.

Subgroup analysis

Our results demonstrated substantial heterogeneity among studies in different groups when calculating the pooled sensitivity and specificity (Table 2). Thus, to identify the potential sources of heterogeneity, subgroup analyses were performed.

Each group of studies was separated into 7 subgroups on the basis of the method of measurements (latex agglutination immunoturbidimetry (LAIT) for iFOBT as well as enzyme-linked immunosorbent assay (ELISA) for PK-M2 and FC versus lateral flow), cut-off values (≥ 20 µg/g versus < 20 µg/g for iFOBT, > 4 U/mL versus 4 U/mL for PK-M2 and > 50 µg/g versus 50 µg/g for FC), study type (cohort versus case–control) and 4 domains of the QUADAS-2 “risk of bias” category (low risk versus high or unclear “risk of bias”) (Table 3). A significant difference in a subgroup indicates that it could be considered a source of heterogeneity.

Table 3 Subgroup analysis.

Full size table

For the diagnosis of CRC, the lateral flow method of PK-M2 measurement led to a significant increase in the overall accuracy (Fig. 4). Moreover, in the FC group, the case–control study design and high or unclear “risk of bias” in the “patient selection” domain led to a significant increase in overall accuracy.

Regarding the detection of AA and AN, there was no subgroup to change the overall accuracy. Due to the similar subset of each covariate, subgroup analyses of study type and “patient selection” domain in iFOBT and cut-off value in PK-M2 and FC groups were not feasible for AA diagnosis. Additionally, the study type and “patient selection” domain in iFOBT and the cut-off value in the PK-M2 and FC groups were not executable in the AN group.

Threshold effect and meta-regression analysis

In addition to subgroup analysis, threshold effect and univariate meta-regression analysis were performed to further evaluate causes of heterogeneity.

In diagnostic accuracy studies, one of the most important sources of heterogeneity is the threshold effect. Our analysis showed that the diagnostic threshold effect was not significant as a source of heterogeneity for iFOBT and FC to CRC, AA, and AN diagnosis. Regarding PK-M2, although there was no significant threshold effect in the CRC and AA groups, there was significant heterogeneity in AN detection (P < 0.01) (Table 2).

For univariate meta-regression analysis, we considered some covariates, including the mean age of patients, % male as sex frequency, % distal tumours as CRC tumour site, and % late as CRC tumour stage. Our results demonstrated that none of the aforementioned covariates had sensitivity and specificity heterogeneity. It should be noted that due to the lack of FC biomarker data, analysis of the aforementioned covariates in the AA group as well as the impact of CRC tumour stage on heterogeneity were not feasible (Table 4).

Table 4 Univariate meta-regression.

Full size table

Discussion

For the first time, our present systematic review and meta-analysis summarized and compared the diagnostic performances of all available faecal protein biomarkers, namely, iFOBT, PK-M2 and FC, for screening CRC, AA, and AN. Additionally, and uniquely, we assessed the impact of tumour site, tumour stage, method of measurement and different cut-off values on the performance of these biomarkers.

The overall quality of the included studies for each biomarker was relatively high according to the QUADAS-2 tool. In summary, the range of low-risk studies in the “risk of bias” category for all four domains was 50–95.6%, reflecting moderate to very low risk of bias, and all included studies had no concern regarding “applicability” in all three domains. To evaluate the impact of QUADAS-2 domains on the overall accuracy, subgroup analysis was conducted based on low versus high or unclear risk from the “risk of bias” category. The results showed that despite the impact of different domains on sensitivities and specificities, only the “patient selection” domain in the FC group for CRC detection could significantly affect the overall accuracy, which we have discussed in the fourth following paragraph (Table 3).

The first important aim of our study was to determine the most accurate faecal protein biomarker. Our analyses showed that iFOBT, PK-M2 and FC biomarkers were clinically applicable for CRC, as well as iFOBT and PK-M2 for AN, and there were no biomarkers for AA according to their AUCs or positive and negative LRs. Additionally, the combination of biomarkers could not increase the accuracy for the detection of each condition. The overall accuracy of iFOBT was significantly higher than that of PK-M2 and FC for CRC detection (P = 0.02 and < 0.01, respectively) and significantly higher than that of PK-M2 for AN diagnosis (P < 0.01). Pursuant to our search results, before ours, there was no meta-analysis to compare the accuracy of various faecal biomarkers for the diagnosis of different intestinal neoplasms. Nonetheless, Li et al.⁵⁶, using 4 research papers, conducted a direct comparison between iFOBT and PK-M2 for CRC screening. Despite the small number of studies to achieve convincing results, to confirm our findings, they indicated that iFOBT had significantly higher accuracy than PK-M2. Furthermore, all of our included articles that contained comparison data had higher iFOBT accuracy than PK-M2 and/or FC for the diagnosis of both CRC and AN, except the results of Kim et al.¹³. Kim et al. assessed the accuracy of two different methods of PK-M2 measurement and compared them with iFOBT in CRC and adenoma patients. Their results showed that regardless of the measurement method, PK-M2 accuracy was superior to iFOBT for the diagnosis of CRC and adenoma. The most likely reason for this contradiction is a technical mistake related to measurement equipment, in which a systematic error gave rise to a decrease in the accuracy of iFOBT in Kim’s study. To clarify this issue, the iFOBT accuracy of Kim’s study was tested versus the other studies. The results indicated that the performance of iFOBT in Kim’s study was significantly lower than that in other studies (RDOR = 0.19 (95% CI, 0.04–0.10); P = 0.04), which indicates that systematic error is possible.

Today, the most widely used biomarker for the detection of colorectal neoplasms is FOBT. Two commonly used FOBTs are gFOBT and iFOBT, and it has been proven that iFOBT has superior diagnostic performance^2,57. Our results showed that iFOBT is clinically applicable for CRC diagnosis with upper moderate overall accuracy, in line with its positive and negative LR results. Additionally, it is clinically applicable for AN with upper moderate accuracy only for confirmation, not for exclusion. The overall accuracy of iFOBT in our present study is similar to that of previously published meta-analyses^58,59,60. However, we evaluated more covariates in our research to shed light on the different strengths and limitations of iFOBT implementation. The first unique covariate was the measurement method. Currently, there are two common methods for the measurement of iFOBT, qualitative rapid lateral flow and quantitative latex agglutination immunoturbidimetry, whereas before the present study, there were no data about their overall accuracy differences. According to our findings, there was no difference between these two methods of measurement with different commercial brands for the diagnosis of all three conditions (Table 3). Another covariate was the cut-off to find the optimal iFOBT value. In a previously published meta-analysis, Lee et al.⁵⁸ proposed that a lower 20 μg/g cut-off may increase the sensitivity of iFOBT for the detection of CRC compared to the upper 20 μg/g values. Therefore, we analysed the difference in accuracies between the lower 20 μg/g versus upper 20 μg/g values not only for CRC detection but also for AA and AN. Our results indicated that there were no significant differences among different cut-off values for the detection of CRC, AA and AN (Table 3). Meanwhile, the results of univariate meta-regression analysis showed that age, sex, CRC tumour site and stage could not affect the sensitivity and specificity of iFOBT for the diagnosis of all three conditions (Table 4). The results of the most recently published meta-analysis confirmed our findings in terms of the impact of tumour site on iFOBT performance⁶⁰. However, concerning CRC tumour site, the results of Hirai et al.’s meta-analysis⁵⁹ are not completely consistent with ours. They concluded that the overall accuracy of iFOBT for the proximal colon was significantly lower than that for the distal colon, but it is not convincing given the largely overlapping confidence intervals in the site-specific sensitivities.

PK-M2 is a promising non-organ-specific tumour biomarker, and its concentration is elevated in various types of tumours⁵⁶. For the first time in 2004, Hardt et al.⁴¹ demonstrated that the PK-M2 concentration was elevated in the faeces of CRC patients and could be used as a biomarker. To date, several studies have been conducted on faecal PK-M2 in CRC patients, and the results have shown contradictory accuracies. To determine whether faecal PKM2 could be used as a biomarker for the diagnosis of colorectal neoplasms, a diagnostic accuracy meta-analysis must be carried out. Following two earlier versions^56,61, the latest diagnostic accuracy meta-analysis of PK-M2 for CRC detection was published in 2015, which included 8 studies⁶. Nonetheless, all aforementioned studies included only CRC patients, without evaluating the impact of different covariates on PK-M2 performance. In this study, plus updating the body of evidence using 26 included research articles, we uniquely assessed the diagnostic accuracy of PK-M2 for the detection of AA and AN in addition to CRC. Furthermore, the impact of different covariates on the performance of PK-M2 was evaluated. Our findings indicated that PK-M2 was clinically applicable for the diagnosis of CRC and AN and not for AA, with lower moderate accuracy for both disease confirmation and exclusion given its LR results. These results are compatible with previous meta-analyses regarding the accuracy of PK-M2 for the diagnosis of CRC^6,56,61. To provide new insights into PK-M2 performance, we assessed different covariates in terms of accuracy. One of our important findings was the impact of the PK-M2 measurement method on its performance. Subgroup analysis in the CRC group demonstrated that rapid lateral flow could significantly increase the accuracy of PK-M2 compared to the ELISA method (RDOR = 0.14 (95% CI 0.04–0.48); P < 0.01) (Fig. 4). These findings were similar to the study results of Kim et al.¹³. Moreover, we reanalysed the difference in iFOBT and lateral flow PK-M2 measurement accuracies. The results revealed that when lateral flow PK-M2 measurement was implemented, it eliminated the initial significant difference in iFOBT accuracy for CRC detection (RDOR = 1.79 (95% CI 0.38–8.46); P = 0.43), whereas the accuracy of iFOBT for AN was still significantly superior to that of lateral flow PK-M2 (RDOR = 0.28 (95% CI 0.10–0.81); P = 0.02). The lower accuracy of the ELISA method could be derived from the biostability of tumour PK-M2 in stool samples. There is some evidence that tumour PK-M2 in stool samples could be dramatically affected by sample storage time⁶². By nature, ELISA is a time-consuming method, whereas lateral flow is a rapid technique that is commonly utilized in point-of-care tests (POCTs). Additionally, our results implied that age, sex, cut-off value, CRC tumour site and stage did not affect PK-M2 accuracy (Tables 3, 4).

FC is released in faeces following mucosal neutrophil degradation as a result of intestinal inflammation. The level of FC increases in a wide range of intestinal diseases that are associated with inflammation, including inflammatory bowel disease, CRC and AA⁵⁵. The results of numerous studies indicated a broad range of FC sensitivities for the detection of CRC, from 33 to 100% (Fig. 2C). The latest meta-analysis with 20 included articles regarding the performance of FC for CRC and adenoma diagnosis was performed in 2018⁷. However, this prior paper evaluated all adenomas, not advanced type adenomas, which are clinically important precursors of CRC. Meanwhile, there were no data concerning the impact of measurement technique, type of included studies, and CRC site specificity on FC accuracy. In the present research, in addition to updating the data using 23 included research articles regarding CRC, we assessed the diagnostic accuracy of FC in AA and AN detection for the first time as well as the impact of various covariates on FC performance. Our results are consistent with a previous meta-analysis⁷ indicating that FC has lower moderate accuracy for the diagnosis of CRC based on its LR values. Additionally, we determined that it is not applicable to the detection of AA and AN. Evaluated covariates, including age, sex, method of measurement and CRC tumour site, had no significant effect on FC accuracy (Table 4). Nonetheless, the case–control study design and “patient selection’ domain from the QUADAS-2 “risk of bias” category had a significant impact on FC performance for the diagnosis of CRC (Table 3). These two covariates are relatively similar because a high-risk point is given to case–control studies in the “patient selection’ domain. As mentioned above, FC has low specificity for intestinal disorders; therefore, its overall accuracy declines in cohort study designs that include patients with different intestinal disorders.

One of the most important strengths of this study was the adoption of rigorous inclusion and exclusion criteria in three widely used medical databases without language restriction. Diagnostic accuracy comparison of multiple biomarkers and subgroup analysis by different methods of measurement and cut-off values are another unique strength. In addition, we analysed the impact of the site and the stage of tumours on the biomarker performances in the CRC group, which has not been conducted in previous meta-analyses. Despite the strengths, there are some limitations that should be taken into consideration when interpreting our findings. First, the accuracy of AN detection may be under- or overestimated because it is strongly influenced by the proportion of CRC and AA cases in the study population. Second, AA data were not available to determine site-specific accuracy. Third, the protocol of this study has not been registered on the PROSPERO database.

Conclusion

In summary, our results determined that iFOBT is the most accurate faecal biomarker and is recommended for the diagnosis of CRC and AN, among other clinically applicable types. In addition, the lateral flow method of PK-M2 measurement should be implemented instead of ELISA due to its higher efficacy on PK-M2 performance. There is no clinically applicable faecal biomarker for AA diagnosis as an important precursor of CRC. Large prospective cohort studies are recommended to confirm our findings. Additionally, further research is suggested to find new comprehensive biomarkers.

References

Sung, H. et al. Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries (Cancer J. Clin, CA, 2021). https://doi.org/10.3322/caac.21660.
Book Google Scholar
Loktionov, A. et al. Biomarker measurement in non-invasively sampled colorectal mucus as a novel approach to colorectal cancer detection: screening and triage implications. Br. J. Cancer 123, 252–260 (2020).
CAS PubMed PubMed Central Google Scholar
Ogunwobi, O. O., Mahmood, F. & Akingboye, A. Biomarkers in colorectal cancer: Current research and future prospects. Int. J. Mol. Sci. 21, 1–20 (2020).
Google Scholar
Mármol, I., Sánchez-de-Diego, C., Dieste, A. P., Cerrada, E. & Yoldi, M. J. R. Colorectal carcinoma: A general overview and future perspectives in colorectal cancer. Int. J. Mol. Sci. 18 (2017).
Toma, S. C., Ungureanu, B. S., Patrascu, S., Surlin, V. & Georgescu, I. Colorectal cancer biomarkers: A new trend in early diagnosis. Curr. Heal. Sci. J. 44, 140–146 (2018).
CAS Google Scholar
Uppara, M. et al. A systematic review and meta-analysis of the diagnostic accuracy of pyruvate kinase M2 isoenzymatic assay in diagnosing colorectal cancer. World J. Surg. Oncol. 13, 48 (2015).
PubMed PubMed Central Google Scholar
Ye, X., Huai, J. & Ding, J. Diagnostic accuracy of fecal calprotectin for screening patients with colorectal cancer: A meta-analysis. Turkish J. Gastroenterol. 29, 397–405 (2018).
Google Scholar
McInnes, M. D. F. et al. preferred reporting items for a systematic review and meta-analysis of diagnostic test accuracy studies: The PRISMA-DTA statement. JAMA 319, 388–396 (2018).
PubMed Google Scholar
Yau, T. O., Tang, C. M., Harriss, E. K., Dickins, B. & Polytarchou, C. Faecal microRNAs as a non-invasive tool in the diagnosis of colonic adenomas and colorectal cancer: A meta-analysis. Sci. Rep. 9, 1–13 (2019).
Google Scholar
Turvill, J. et al. Diagnostic accuracy of one or two faecal haemoglobin and calprotectin measurements in patients with suspected colorectal cancer. Scand. J. Gastroenterol. 53, 1526–1534 (2018).
CAS PubMed Google Scholar
Högberg, C., Karling, P., Rutegård, J. & Lilja, M. Diagnosing colorectal cancer and inflammatory bowel disease in primary care: The usefulness of tests for faecal haemoglobin, faecal calprotectin, anaemia and iron deficiency: A prospective study. Scand. J. Gastroenterol. 52, 69–75 (2017).
PubMed Google Scholar
Karl, J. et al. Improved diagnosis of colorectal cancer using a combination of fecal occult blood and novel fecal protein markers. Clin. Gastroenterol. Hepatol. 6, 1122–1128 (2008).
PubMed Google Scholar
Kim, Y. C. et al. The usefulness of a novel screening kit for colorectal cancer using the immunochromatographic fecal tumor M2 pyruvate kinase test. Gut Liver 9, 641–648 (2015).
CAS PubMed Google Scholar
Guan-Fu, W. J., Yi-Hua, L. P. & Xin-Min, S. Evaluation of fecal tumor m2 pyruvate kinase combined with fecal occult blood test as a detecting tool for colorectal cancer. J. Sun Yat Sen Univ. (Med Sci) 27, 350–353 (2006).
Google Scholar
Shastri, Y. M. et al. Comparison of an established simple office-based immunological FOBT with fecal tumor pyruvate kinase type M2 (M2-PK) for colorectal cancer screening: Prospective multicenter study. Am. J. Gastroenterol. 103, 1496–1504 (2008).
ADS CAS PubMed Google Scholar
Mulder, S. A. et al. Tumor pyruvate kinase isoenzyme type M2 and immunochemical fecal occult blood test: Performance in screening for colorectal cancer. Eur. J. Gastroenterol. Hepatol. 19, 878–882 (2007).
CAS PubMed Google Scholar
Vogel, T. et al. Vergleich verschiedener stuhltests zur detektion von neoplasien des kolon. Dtsch. Medizinische Wochenschrift 130, 872–877 (2005).
CAS Google Scholar
Zaccaro, C. et al. Power of screening tests for colorectal cancer enhanced by high levels of M2-PK in addition to FOBT. Intern. Emerg. Med. 12, 333–339 (2017).
PubMed Google Scholar
Rutka, M. et al. Diagnostic accuracy of five different fecal markers for the detection of precancerous and cancerous lesions of the colorectum. Mediat. Inflamm. 2016, 2492081 (2016).
Google Scholar
Parente, F. et al. A combination of faecal tests for the detection of colon cancer: A new strategy for an appropriate selection of referrals to colonoscopy? A prospective multicentre Italian study. Eur. J. Gastroenterol. Hepatol. 24, 1145–1152 (2012).
PubMed Google Scholar
Caviglia, G. P., Cabianca, L., Fagoonee, S. & Gili, F. M. Colorectal cancer detection in an asymptomatic population: Fecal immunochemical test for hemoglobin vs. Fecal M2-type pyruvate kinase. Biochem. Medica 26, 114–120 (2016).
Cho Chi Hyun et al. Evaluation of the Performance of a Fecal Tumor M2-PK Rapid Kit Using Stool Specimens for Detection of Colorectal Tumors. Ann. Clin. Lab. Sci. 154–160 (2016).
Widlak, M. M. et al. Diagnostic accuracy of faecal biomarkers in detecting colorectal cancer and adenoma in symptomatic patients. Aliment. Pharmacol. Ther. 45, 354–363 (2017).
CAS PubMed Google Scholar
Mowat, C. et al. Faecal haemoglobin and faecal calprotectin as indicators of bowel disease in patients presenting to primary care with bowel symptoms. Gut 65, 1463–1469 (2016).
CAS PubMed Google Scholar
Kok, L. et al. Diagnostic accuracy of point-of-care fecal calprotectin and immunochemical occult blood tests for diagnosis of organic bowel disease in primary care: The cost-effectiveness of a decision rule for abdominal complaints in primary care (CEDAR) study. Clin. Chem. 58, 989–998 (2012).
CAS PubMed Google Scholar
Alhadi, S. C. et al. The use of M2-pyruvate kinase as a stool biomarker for detection of colorectal cancer in tertiary teaching hospital: A comparative study. Ann. Coloproctol. 36, 409–414 (2021).
Google Scholar
Rigi, F. et al. Expression of tumor pyruvate kinase M2 isoform in plasma and stool of patients with colorectal cancer or adenomatous polyps. BMC Gastroenterol. 20, (2020).
Dabbous, H. K. et al. Evaluation of fecal M2PK as a diagnostic marker in colorectal cancer. J. Gastrointest. Cancer 50, 442–450 (2019).
CAS PubMed Google Scholar
Sithambaram, S., Hilmi, I. & Goh, K. L. The diagnostic accuracy of the M2 pyruvate kinase Quick stool test-A rapid office based assay test for the detection of colorectal cancer. PLoS One 10, (2015).
Wang, H. P., Wang, Y. Y., Pan, J., Cen, R. & Cai, Y. K. Evaluation of specific fecal protein biochips for the diagnosis of colorectal cancer. World J. Gastroenterol. 20, 1332–1339 (2014).
PubMed PubMed Central Google Scholar
Wei, C. Value of combined detection of tumor type M2 pyruvate kinase, tissue polypeptide specific antigen and carcino-embryonic antigen in diagnosis of colorectal cancer. Int. J. Lab. Med. 35, 2775–2778 (2014).
Google Scholar
Abdullah, M., Rani, A., Simadibrata, M., Fauzi, A. & Syam, A. The value of fecal tumor M2 pyruvate kinase as a diagnostic tool for colorectal cancer screening. Acta Med. Indones 44, 94–99 (2012).
PubMed Google Scholar
Li, Y. & Wang, J. Clinical significance of blood and fecal tumor M2-pyruvate kinase expression in patientswith colorectal cancer. J. South Med. Univ. 31, 2087–2089 (2011).
Google Scholar
Haug, U., Hundt, S. & Brenner, H. Sensitivity and specificity of faecal tumour M2 pyruvate kinase for detection of colorectal adenomas in a large screening study. Br. J. Cancer 99, 133–135 (2008).
CAS PubMed PubMed Central Google Scholar
Haug, U. et al. Tumour M2-PK as a stool marker for colorectal cancer: Comparative analysis in a large sample of unselected older adults vs colorectal cancer patients. Br. J. Cancer 96, 1329–1334 (2007).
CAS PubMed PubMed Central Google Scholar
Koss, K., Maxton, D. & Jankowski, J. A. Z. Faecal dimeric M2 pyruvate kinase in colorectal cancer and polyps correlates with tumour staging and surgical intervention. Color. Dis. 10, 244–248 (2008).
CAS Google Scholar
Zhang, Y., Gu, W., Lin, Z. & Zhiuha, R. Significance of fecal tumor M2 pyruvate kinase for detection of gastrointestinal cancers. Chin. J. Gastroenterol. 12, 465–468 (2007).
Google Scholar
Shastri, Y. M. et al. Prospective multicenter evaluation of fecal tumor pyruvate kinase type M2 (M2-PK) as a screening biomarker for colorectal ecoplasia. Int. J. Cancer 119, 2651–2656 (2006).
CAS PubMed Google Scholar
Tonus, C., Neupert, G. & Sellinger, M. Colorectal cancer screening by non-invasive metabolic biomarker fecal tumor M2-PK. World J. Gastroenterol. 12, 7007–7011 (2006).
CAS PubMed PubMed Central Google Scholar
Naumann, M. et al. Faecal pyruvate Kinase type M2 valid screening parameter for colorectal cancer? Preliminary results from a multicenter comparative study. Dtsch Med Wochenschr 129, 1806–1807 (2004).
CAS PubMed Google Scholar
Hardt, P. D. et al. Faecal tumour M2 pyruvate kinase: A new, sensitive screening tool for colorectal cancer. Br. J. Cancer 91, 980–984 (2004).
CAS PubMed PubMed Central Google Scholar
Khoshbaten, M. et al. Diagnostic value of fecal calprotectin as a screening biomarker for gastrointestinal malignancies. Asian Pacific J. Cancer Prev. 15, 1667–1670 (2014).
Google Scholar
Meucci, G. et al. Diagnostic value of faecal calprotectin in unselected outpatients referred for colonoscopy: A multicenter prospective study. Dig. Liver Dis. 42, 191–195 (2010).
PubMed Google Scholar
Damms, A. & Bischoff, S. C. Validation and clinical significance of a new calprotectin rapid test for the diagnosis of gastrointestinal diseases. Int. J. Colorectal Dis. 23, 985–992 (2008).
CAS PubMed Google Scholar
Hoff, G. et al. Testing for faecal calprotectin (PhiCal) in the Norwegian Colorectal Cancer Prevention trial on flexible sigmoidoscopy screening: Comparison with an immunochemical test for occult blood (FlexSure OBT). Gut 53, 1329–1333 (2004).
CAS PubMed PubMed Central Google Scholar
Carroccio, A. et al. Diagnostic accuracy of fecal calprotectin assay in distinguishing organic causes of chronic diarrhea from irritable bowel syndrome: A prospective study in adults and children. Clin. Chem. 49, 861–867 (2003).
CAS PubMed Google Scholar
Costa, F. et al. Role of faecal calprotectin as non-invasive marker of intestinal inflammation. Dig. Liver Dis. 35, 642–647 (2003).
CAS PubMed Google Scholar
Summerton, C. B., Longlands, M. G., Wiener, K. & Shreeve, D. R. Faecal calprotectin: A marker of inflammation throughout the intestinal tract. Eur. J. Gastroenterol. Hepatol. 14, 841–845 (2002).
CAS PubMed Google Scholar
Tibble, J. A., Sigthorsson, G., Foster, R., Forgacs, I. & Bjarnason, I. Use of surrogate markers of inflammation and Rome criteria to distinguish organic from nonorganic intestinal disease. Gastroenterology 123, 450–460 (2002).
PubMed Google Scholar
Johne, B., Kronborg, O. & Tøn, H. I. A new fecal calprotectin test for colorectal neoplasia: Clinical results and comparison with previous method. Scand. J. Gastroenterol. 36, 291–296 (2001).
CAS PubMed Google Scholar
Kristinsson, J. et al. Screening of first degree relatives of patients operated for colorectal cancer: Evaluation of fecal calprotectin vs Hemoccult II. Digestion 64, 104–110 (2001).
CAS PubMed Google Scholar
Tibble, J. et al. Faecal calprotectin and faecal occult blood tests in the diagnosis of colorectal carcinoma and adenoma. Gut 49, 402–408 (2001).
CAS PubMed PubMed Central Google Scholar
Tibble, J. et al. A simple method for assessing intestinal inflammation in Crohn’s disease. Gut 47, 5006–5513 (2000).
Google Scholar
Røseth, A. G. et al. Faecal calprotectin: A novel test for the diagnosis of colorectal cancer?. Scand. J. Gastroenterol. 28, 1073–1076 (1993).
PubMed Google Scholar
Turvill, J. et al. Faecal calprotectin in patients with suspected colorectal cancer: A diagnostic accuracy study. Br. J. Gen. Pract. 66, e499–e506 (2016).
PubMed PubMed Central Google Scholar
Li, R., Liu, J., Xue, H. & Huang, G. Diagnostic value of fecal tumor M2-pyruvate kinase for CRC screening: A systematic review and meta-analysis. Int. J. Cancer 131, 1837–1845 (2012).
CAS PubMed Google Scholar
Tinmouth, J., Lansdorp-Vogelaar, I. & Allison, J. E. Faecal immunochemical tests versus guaiac faecal occult blood tests: What clinicians and colorectal cancer screening programme organisers need to know. Gut 64, 1327–1337 (2015).
CAS PubMed Google Scholar
Lee, J. K., Liles, E. G., Bent, S., Levin, T. R. & Corley, D. A. Accuracy of fecal immunochemical tests for colorectal cancer: Systematic review and meta-analysis. Ann. Intern. Med. 160, 171–181 (2014).
PubMed PubMed Central Google Scholar
Hirai, H. W. et al. Systematic review with meta-analysis: Faecal occult blood tests show lower colorectal cancer detection rates in the proximal colon in colonoscopy-verified diagnostic studies. Aliment. Pharmacol. Ther. 43, 755–764 (2016).
CAS PubMed Google Scholar
Lu, M., Luo, X., Li, N., Chen, H. & Dai, M. Diagnostic accuracy of fecal occult blood tests for detecting proximal versus distal colorectal neoplasia: A systematic review and meta-analysis. Clin. Epidemiol. 11, 943–954 (2019).
CAS PubMed PubMed Central Google Scholar
Huang, J. X. et al. Tumor M2-pyruvate kinase in stool as a biomarker for diagnosis of colorectal cancer: A meta-analysis. J. Cancer Res. Ther. 10, C225–C228 (2014).
PubMed Google Scholar
Haug, U. et al. Tumor M2 pyruvate kinase as a stool marker for colorectal cancer: Stability at room temperature and implications for application in the screening setting. Clin. Chem. 52, 782–784 (2006).
CAS PubMed Google Scholar

Download references

Acknowledgements

This research received no grant from any financial organizations or funding agency in the public, commercial, or not for profit sectors.

Author information

These authors contributed equally: Atefeh Nasir Kansestani and Mohammad Erfan Zare.

Authors and Affiliations

Department of Clinical Laboratory, School of Medicine, Sir Run Run Shaw Hospital, Zhejiang University, Hangzhou, China
Atefeh Nasir Kansestani, Mohammad Erfan Zare, Qingchao Tong & Jun Zhang

Authors

Atefeh Nasir Kansestani
View author publications
You can also search for this author in PubMed Google Scholar
Mohammad Erfan Zare
View author publications
You can also search for this author in PubMed Google Scholar
Qingchao Tong
View author publications
You can also search for this author in PubMed Google Scholar
Jun Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.N.K., M.E.Z. and J.Z. conceived and designed the project. A.N. K and M.E. Z reviewed literatures, A.N.K., M.E.Z. and Q.T. extracted the data. A.N.K. and M.E.Z. performed statistical analysis and wrote the manuscript. J.Z. reviewed the manuscript and provided suggestions for further development. All performances were conducted under supervision of J.Z.

Corresponding author

Correspondence to Jun Zhang.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Nasir Kansestani, A., Zare, M.E., Tong, Q. et al. Comparison of faecal protein biomarkers' diagnostic accuracy for colorectal advanced neoplasms: a systematic review and meta-analysis. Sci Rep 12, 2623 (2022). https://doi.org/10.1038/s41598-022-06689-4

Download citation

Received: 03 July 2021
Accepted: 03 February 2022
Published: 16 February 2022
DOI: https://doi.org/10.1038/s41598-022-06689-4

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.