Background

Colorectal cancer (CRC) causes substantial morbidity and mortality throughout the world and the highest incidence is seen in developed countries with a Western culture.1 The prognosis of CRC is highly dependent on the stage of disease at diagnosis. The 5-year survival rate ranges from 90% for localised stage CRC to 10% for patients with distant metastases.2 To promote early diagnosis and improve survival rates, many countries have implemented cancer referral guidelines and fast track endoscopy for patients with alarm symptoms indicative of CRC.3,4 For CRC, alarm symptoms are specific symptoms such as rectal bleeding or changes in bowel patterns for individuals ≥40 years. However, non-specific symptoms such as weight loss and tiredness could be important markers of malignant disease and, in general, almost half of cancer patients present non-specific symptoms prior to diagnosis.5 In Denmark, screening for CRC is offered to people in the age of 50–74 years. However, only approximately 25% of all CRC are detected by screening.6 For the rest of the cases and in the group aged 75 years and older, symptoms will still have a significant role in the diagnostic process of CRC.

It has been demonstrated that both specific and non-specific symptoms of CRC are common in the general population and few consult their general practitioner (GP) when experiencing alarm symptoms of CRC.7 Positive predictive values (PPVs) for alarm symptoms of CRC have mostly been investigated for persons consulting their GP with alarm symptoms and little is known about the negative predictive values (NPVs).8,9,10 As campaigns to increase public awareness of alarm symptoms and promote earlier presentation by the patients are progressively carried out worldwide,11 it is important to gather knowledge about the PVs of alarm symptoms in the general population.

The aims of this study were (1) to determine the 1-year PV of specific and non-specific alarm symptoms of CRC in the general population 40 years or above and (2) to describe the proportion of specific and non-specific alarm symptoms reported by patients prior to diagnosis of CRC.

Methods

Study design and population

The study was designed as a nationwide cohort study based on questionnaires and national registries, imbedded in the Danish Symptom Cohort (DaSC).12 This particular study aimed to gather knowledge about the predictive values of specific and non-specific alarm symptoms of CRC in the general population. Other studies have already reported predictive values of gynaecological cancer alarm symptoms13 and upper gastrointestinal cancer.14 Parts of the methods described below have therefore been previously described.7,12,15

From the Danish Civil Registration System (CRS), 100,000 adults aged 20 years or above were randomly selected and invited to participate in a survey. All Danish citizens are registered in the CRS with a unique personal identification number. Prior to the sampling procedure, individuals were excluded if they had declared to the CRS that they did not want research-related inquiries. The individuals received a postal letter explaining the purpose of the study. The questionnaire was designed using the internet-based platform SurveyXact.16 A unique login for a secure webpage was included in the letter. This provided access to a comprehensive web-based questionnaire. To prevent exclusion of people with no access to the internet, participants were offered to complete the survey by telephone interview. When an invited subject was unable to respond due to severe illness or having moved abroad, family or relatives could decline the invitation on behalf of the invited person. The reason for not responding was then registered as illness or moved abroad.

The questionnaire

The methodological framework for developing, pilot testing and field testing the questionnaire is described in detail elsewhere.12 This paper addresses the specific and non-specific symptoms indicative of CRC. These symptoms were selected based on a review of literature, national and international cancer referral guidelines and descriptions of cancer pathways.3,4,17 In total, ten predefined symptoms reported by individuals aged 40 years or above form the base of this paper (Table 1). The four specific symptoms and the age limit were chosen as it is included in the Danish cancer referral guidelines (Supplementary Figure 1). The respondents were asked whether or not they had experienced one or more of the symptoms in the preceding 4 weeks. The respondents were additionally asked whether or not they had contacted their GP regarding the symptom. The wording of the question regarding symptom experience was: “Have you experienced any of the following sensations, symptoms or discomfort within the past 4 weeks?” with the option to select one or more of the predefined symptoms. With regard to GP contact, the question was worded: “Have you contacted your general practitioner concerning the symptom(s) you have experienced within the preceding four weeks, by appointment, telephone or e-mail?” An item concerning when the symptom(s) occurred for the first time was also included. The response categories were: “less than one month ago”, “1–3 months ago”, “3–6 months ago” or “more than six months ago”.

Table 1 Specific and non-specific alarm symptoms of colorectal cancer

Register data

Information on CRC diagnoses in the respondent cohort aged 40 years or above was retrieved from the Danish Cancer Registry (DCR). The DCR contains personal and tumour characteristics for all incident cancer cases in Denmark including date of diagnosis and ICD-10 codes for lesions.18 Only cases diagnosed in a 12-month period after the completion of the questionnaire were included. Furthermore, the cases were excluded if the individual had been diagnosed with the same ICD-10 code in a time period covering 5 years prior to the completion of the questionnaire. The ICD-10 codes used in this study are listed in Table 2.

Table 2 ICD-10 codes used for incident cancer cases

Statistical analyses

PPVs were calculated by dividing the number of symptomatic individuals diagnosed with CRC by the total number of symptomatic individuals in each category. NPVs were calculated by dividing the number of asymptomatic respondents not subsequently diagnosed with CRC by the total number of asymptomatic respondents. The PVs are presented as percentages. PPVs and NPVs for CRC were calculated for each category. Moreover, we calculated the positive likelihood ratios (LR+) and negative likelihood ratios (LR−) as a relative measure of the association between symptom experience and CRC. The PVs and LRs for CRC were calculated for each of the four specific alarm symptoms of CRC and for each of the six non-specific alarm symptoms for the total population (aged ≥40 years) and separately for two age groups 40–74 and≥75 years. For change in bowel patterns (stool texture and frequency), only experience of symptoms for the first time more than 1 month earlier was included. Moreover, the PVs and LRs were calculated for: (1) at least one of the ten alarm symptoms, (2) at least one of the specific alarm symptoms, (3) at least one of the non-specific alarm symptoms, (4) GP contact with at least one of the ten alarm symptoms and (5) GP contact with at least one specific alarm symptom. The PVs and LRs in these five groups were calculated for both the entire study population of respondents aged 40 years or above as well as for subgroups of individuals under and above 75 years of age. The proportions of specific and non-specific alarm symptoms of CRC reported by individuals aged 40 years or above diagnosed with CRC are presented.

All statistical tests used a significance level of P < 0.05. Data analyses were conducted using STATA statistical software 13.1 (StataCorp, College Station, TX, USA). In concordance with the Danish data protection act only summary tables with four or more individuals in each count are reported.

Results

Of the 100,000 randomly selected subjects, 4474 (4.7%) were not eligible because they had either died, could not be reached due to unknown address, were suffering from severe illnesses, had language problems or had moved abroad. A total of 95,253 subjects were eligible for the study, of these 69,060 were aged 40 years or above and 37,455 completed the questionnaire, yielding an overall response rate of 54.2% (Fig. 1). The mean age of the respondents in this age group was 58.4 years (95% confidence interval (CI): 58.3–58.6) compared to 60.8 years (95% CI: 60.6–61.0) for non-respondents. Slightly more respondents were women (52.7%) compared to non-respondents (50.2%). The respondents were more often women married/co-habiting and had a higher income and educational level and were more often affiliated to the labour market (Table 5).

Fig. 1
figure 1

Study cohort

In total, 64 individuals (0.17%) were diagnosed with CRC 1 year after completing the questionnaire. Of these, 41 (64.1%) were males, and 23 (35.9%) were women. The mean age of the individuals diagnosed with CRC was 69.7 years (95% CI: 67.3–72.1).

Due to too few observations, only PVs for nine of the ten predefined symptoms are reported (i.e. PVs for unintended weight loss is not reported). The PPVs of the four specific alarm symptoms for CRC (abdominal pain, change in stool texture, change in stool frequency, blood in stool/rectal bleeding) ranged between 0.2% (95% CI: 0.1–0.4) (change in stool texture) and 0.6% (95% CI: 0.3–1.1) (rectal bleeding) (Table 3). The PPVs of the five non-specific alarm symptoms for CRC (diarrhoea, constipation, abdominal bloating, feeling unwell, tiredness) ranged between 0.1% (95% CI: 0.1–0.3) (constipation and feeling unwell) and 0.3% (95% CI: 0.1–0.5) (diarrhoea) (Table 3). The NPVs were 99.8 for all the specific and non-specific alarm symptoms (Table 3).

Table 3 Number of respondents (n, %), cases of colorectal cancers, numbers (n, %) (total numbers n = 64), PPVs (%) with 95% CI and LR+ and LR− with regard to symptom type (n = 37,455)

The PVs and LRs for CRC among individuals under and above 75 years are given in Table 4 and demonstrate the difference between age groups. Generally, the PPVs were lower and the NPVs higher in the group younger than 75 years compared to those aged 75 years and above. For the individuals aged 75 years or above who had reported at least one specific alarm symptom, a PPV of 1.0% (95% CI: 0.4–2.0) and an NPV of 99.5% (95% CI: 99.1–99.7) for CRC was estimated. Individuals aged 75 years or above reporting at least one of the specific alarm symptoms that lead to a GP contact had a PPV of 1.9% and LR+ of 3.0 for being diagnosed with CRC and the NPVs and LR− for this group was 99.5% (95% CI: 99.2–99.7) and 0.8% (95% CI: 0.6–1.0) respectively (Table 4).

Table 4 Number of respondents (n,%), cases of colorectal cancers, numbers (n, %), PPVs (%) and NPVs (%) with 95% CI and LR+  and LR− with 95% CI with regard to specific and non-specific cancer alarm symptoms and stratified on age group (under and above 75 years of age)

Of the individuals diagnosed with CRC, 43.8% had experienced at least one of the four specific alarm symptoms. Abdominal pain was the most common specific alarm symptom reported by 26.6% of the 64 individuals diagnosed with CRC. In total, 40.6% had reported tiredness, this being the most common non-specific symptom. Of the individuals diagnosed with CRC, 65.6 % had experienced at least one of the specific and non-specific alarm symptoms.

Discussion

Main findings

In this study, we investigated the PPVs, NPVs, LR+ and LR− of specific and non-specific alarm symptoms for CRC in the general population. The overall findings are that although the PPVs are low, the LR+s show a relative association with symptoms (e.g. LR+ 3.4 for rectal bleeding), and although the NPVs were high, the LR− estimates were too uncertain to conclude that individuals without symptoms are in no risk of CRC.

In general, the PPVs were higher among individuals aged 75 years and above compared to those aged 40–74 years. The NPVs were highest in the youngest age groups. The highest PPV was found among individuals aged 75 years and above reporting at least one of the specific alarm symptoms that led to a GP contact. The LR+ was substantially higher for individuals experiencing the specific alarm symptoms (LR+ 3.4% for rectal bleeding) when compared to the non-specific alarm symptoms (LR+ 1.2% for abdominal bloating). The lowest LR− was found for individuals aged 40–74 years reporting at least one specific alarm symptom.

Strengths and limitations

The major strength of this study is the prospective cohort design following a large cohort of the general population, which gives the opportunity to retrieve information about symptom experiences prior to diagnosis. With this study design we minimised the risk of recall bias that is often seen in studies regarding cancer patients’ symptoms prior to diagnosis. The use of register-based diagnoses rather than asking the respondents further reduced the risk of recall bias. The DCR was used to identify cases of cancer. This registry is based on mandatory data from several sources and is a valid source of information on diagnoses.18

A general weakness of questionnaire-based studies is that respondents may not interpret the questions and categories of answers as intended. Prior to the survey, we conducted several rounds of pilot testing and field testing to reduce this possibility.12 Based on the results of the pilot testing, it seems reasonable to assume that the respondents understood the questions as intended.

This study reflects self-reported experience of symptoms within the preceding 4 weeks and subsequent contacts with a GP. Although we asked for symptom experiences and GP contacts within a short time period, some memory decay cannot be ruled out. Another limitation to keep in mind is the fact that willingness to respond to a questionnaire regarding symptom experiences may be associated with the presence of symptoms. It has been demonstrated that individuals with many or severe symptoms and vulnerable personality may be more prone to participate in a survey about symptoms.19,20

The age distribution of the cohort is skewed in the older age groups with regard to respondents and non-respondents (Table 5). However, as we do not know the symptom experience of the non-respondents we cannot know how this difference affects our results.

Table 5 Socio-demographic characteristics for the cohort of 69,060 individuals ≥40 years

We investigated whether non-respondents had a higher number of CRC and found slightly more cases of CRC among non-respondents in the age group ≥40 years (64 cases among the 37,455 (0.17%) respondents compared to 72 cases among the 31,605 non-respondents (0.23%)). The differences in the number of CRCs in the two groups might be affected by a number of factors. Firstly, the non-responders were slightly older than the respondents. Since the incidence of CRC increases with age, this might be an associated factor. Older age might be associated with a higher degree of comorbidity and therefore lower capacity to participate in the survey. Secondly, it could be due to socioeconomic factors. A Danish study from 2008 found that higher incidences of CRC were associated with social disadvantages predominantly amongst men, specifically related to co-habiting status, housing tenure, dwelling size and affiliation to the labour market.21 The non-respondents in our study had lower socioeconomic status compared to respondents. A third possible associated factor could be that people who take their symptoms less seriously are more likely to have CRC.22 As the non-respondents did not wish to participate in the survey on symptoms and healthcare-seeking, one could hypothesise that they were less likely to take symptoms seriously.

The limited number of cases of CRC might affect the reliability of the results. Due to the limited number of cases of CRC in the follow-up period, we did not have sufficient power to evaluate the PVs of combinations of alarm symptoms. It is plausible that experiencing more than one alarm symptom increases the risk of malignancy.23 From previous studies we know that experiencing multiple symptoms increases the probability of seeking medical attention.15 Therefore, the non-specific symptoms of CRC might play an important role in the diagnostic pathway.

Comparison with previous literature

To our knowledge, this is the only population-based study estimating PPVs and NPVs of alarm symptoms of CRC with corresponding LR+ and LR−. A literature study by Fijten et al.,24 which includes nine population-based studies, estimated the PPV of rectal bleeding to be 1 in 1000. A relatively new systematic review investigating the diagnostic value of rectal bleeding found PPVs ranging from 0.01 to 0.21%.25 However, this review was comprised of studies from various settings and all the population-based studies targeted screening of CRC.

Several studies have investigated the diagnostic values of alarm symptoms of CRC in primary care8,23,26 and found PVs highest for rectal bleeding and anaemia. This is in line with the present results where the single symptom with highest PPV was rectal bleeding. The PPVs in the studies from general practice were substantially higher. A review by Astin et al.23 found that PPVs for rectal bleeding as single symptom varied between 2.2 and 15.8%. The findings in the present study support this suggestion, as alarm symptoms presented to the GP had a higher PPV compared to alarm symptoms not reported to the GP. This indicates that deciding to consult the GP with a symptom increases the likelihood of the symptom being caused by a disease.

In a previous study, we have demonstrated that no more than 33.8% of the individuals in the general population who experienced rectal bleeding consulted their GP.7 This in line with the findings from Crosland and Jones27 who found that 41% of people from the general population had consulted their GP when experiencing rectal bleeding. These results combined with results from the present study have clear implications for modelling the impact and cost-effectiveness for public awareness campaigns. Unfortunately, awareness campaigns have not yet shown to improve survival.28 Although screening has been implemented in some countries, most CRC patients are still to be found based on symptoms. Instead of general awareness campaigns, targeted education of older persons, who have the highest incidence, might have an effect on healthcare-seeking and ultimately survival of CRC. It is important that we communicate the importance of healthcare-seeking with the defined alarm symptoms and the fact that low risk is not no risk. Informing patients referred for fast track investigation of CRC about the modest risk of actually having CRC could possibly reduce the anxiety related to investigation for suspected cancer.29

Conclusions

This study supports our hypothesis that PPVs of alarm symptoms of CRC experienced in an unselected general population were lower than PPVs of symptoms presented in hospitals or general practices. The single symptom with highest PPV was rectal bleeding. The fact that rectal bleeding as an initial symptom is associated with a less advanced stage of CRC and increased survival rate9 supports the idea that future campaigns aiming to promote early diagnosis should focus on increasing healthcare-seeking when experiencing rectal bleeding. Individuals aged 75 years and above who reported having contacted the GP with the alarm symptoms had the highest PPV. This finding gives reason to further explore the process of people’s decision to seek healthcare and to target education of older persons, who have the highest incidence of CRC.

The PPVs for the alarm symptoms were low and the NPVs were high. Experiences of specific and non-specific alarm symptoms are frequent. This means that despite requiring further investigation, most patients with an alarm symptom of CRC do not have CRC.