Abstract
Background
Comorbid conditions may delay lung cancer diagnosis by placing demand on general practioners’ time reducing the possibility of prompt cancer investigation (“competing demand conditions”), or by offering a plausible non-cancer explanation for signs/symptoms (“alternative explanation conditions”).
Method
Patients in England born before 1955 and diagnosed with incident lung cancer between 1990 and 2019 were identified in the Clinical Practice Research Datalink and linked hospital admission and cancer registry data. Diagnostic interval was defined as time from first presentation in primary care with a relevant sign/symptom to the diagnosis date. 14 comorbidities were classified as ten “competing demand“ and four “alternative explanation” conditions. Associations with diagnostic interval were investigated using multivariable linear regression models.
Results
Complete data were available for 11870 lung cancer patients. In adjusted analyses diagnostic interval was longer for patients with “alternative explanation” conditions, by 31 and 74 days in patients with one and ≥2 conditions respectively versus those with none. Number of “competing demand” conditions did not remain in the final adjusted regression model for diagnostic interval.
Conclusions
Conditions offering alternative explanations for lung cancer symptoms are associated with increased diagnostic intervals. Clinical guidelines should incorporate the impact of alternative and competing causes upon delayed diagnosis.
Similar content being viewed by others
Introduction
Lung cancer is the leading cause of cancer mortality in the UK [1]. Lung cancer survival rates are relatively poor in the UK compared to other high income countries [2], with an age-standardised 5 year survival estimate of 13% in 2010–14, compared to 17% in France and 18% in Germany [3]. This is partly due to later stage at diagnosis [4]. More than 70% of patients in the UK are diagnosed at stage 3 or 4 [1], and in a comparative study of non-small-cell lung cancer patients aged 66 y and over only 15% of patients in England were diagnosed at Stage 1, compared to 25% in the United States [5] Delay between a patient first noticing a symptom and diagnosis may occur before presentation to primary care, in primary care or in secondary care [6]. Shortening the overall diagnostic interval (DI) i.e., the time between first presentation in primary care with a potential cancer symptom and cancer diagnosis, might limit the proportion of patients diagnosed at a later stage [7, 8] and improve survival [9].
Primary care consultation rates generally increase before cancer diagnosis [10]—this may represent a “missed opportunity” to decrease DI. The number of pre-referral consultations varies by tumour site, and is relatively high for lung cancer, with around 30% of lung cancer patients having multiple consultations as compared to <10% of melanoma or breast cancer patients [11]. Current NHS guidelines for suspected lung cancer recommend cancer pathway referral for a specialist secondary care appointment within two weeks for patients with unexplained haemoptysis or chest x-ray results suggestive of cancer, or chest x-ray within 2 weeks for patients aged over 40 y with either one or two other potential cancer symptoms, depending on the patient’s smoking history [12].
There is evidence that the DI for cancer may be increased by the presence of comorbidities [13, 14]. A recent review found that the presence of pre-existing diseases is associated with increased DIs across a range of cancers including leukaemia, myeloma, oesophageal, colorectal and laryngeal cancers [13]. It has been suggested that comorbidities can be grouped into two categories affecting DI via distinct mechanisms [14]: (1) “competing demands” i.e., conditions that are unrelated to the cancer but place a burden on the general practitioner’s (GP) time and (2) “alternative explanations” i.e., conditions that provide plausible alternative explanations for cancer signs/symptoms in the diagnostic process and delay referral for cancer investigation in secondary care as a result. For lung cancer potential “alternative explanation” comorbidities include common chronic conditions such as asthma and chronic obstructive pulmonary disease (COPD). One qualitative analysis of significant event audits of lung cancer diagnoses from 92 general practices in England found that the presence of asthma or COPD was recognised by GPs to have the potential to mask lung cancer symptoms and result in diagnostic delay [15]. Similarly, in a case series of lung cancer patients in Denmark it was noted that attribution by the GP of lung cancer symptoms to existing disease was associated with prolonged diagnostic delay for some patients [16]. In an analysis of the Surveillance, Epidemiology and End Results programme (SEER) data on lung cancer patients in the US, Nadpara et al found a marked association between presence of comorbidities as assessed by Charlson score and DI [17], and COPD and asthma were both found to be associated with delayed lung cancer referral in a study in Morocco [18]. Pearson et al. found a positive association between comorbidities and secondary care diagnostic interval among English lung cancer patients [19]. An experimental study of different case vignettes of possible lung and colorectal cancer presented to physicians in the UK found that the presence of an alternative explanation decreased the likelihood of a prompt referral for cancer investigation [20]. However, there is limited published evidence on the association between the presence of comorbidities and DI in representative samples of lung cancer patients in the UK.
In the paper, we investigated the associations between comorbidities and DI in a large cohort of patients with lung cancer in the UK. We hypothesised that the presence of one or more comorbidities would increase the DI, and that this effect would be more marked for comorbidities, such as chronic respiratory conditions, which might offer an alternative explanation for lung cancer symptoms. We also aimed to assess whether any associations between DI and comorbidities were moderated by the presenting cancer sign/symptom or the patient’s smoking status, and to investigate the effect of a number of other measures of patient characteristics i.e., body mass index (BMI) and alcohol drinking, and area-based measures of social deprivation.
Methods
This study used routinely collected data from general practice (i.e., primary care) consultations in England from the UK Clinical Practice Research Datalink (CPRD) [21] dataset, and linked data [22] at the individual patient level with the cancer registrations, secondary care admissions and Office for National Statistics data on deaths and social deprivation.
Clinical Practice Research Datalink (CPRD)
The CPRD contains pseudonymised healthcare records from over 20 million people. The data are derived from the longitudinal electronic health records (EHRs) kept at the primary care practice. CPRD is divided into two databases, Gold and Aurum according to the clinical record-keeping software used at the practice—CPRD Gold data was used in this study. Information on diagnoses, symptoms, consultations, referrals, prescriptions, test results and other patient characteristics such as BMI, smoking and alcohol drinking habits is available. Most clinical data within CPRD Gold are recorded using Read codes [23], a structured hierarchical coding system with each code representing a health-related concept (eg. diagnoses or symptoms).
Hospital episodes statistics (HES)
The linked HES data [24] used included medical records from admission and day case procedures but not data from outpatient appointments (where diagnostic information is recorded in <5% of attendances [25]) or emergency department visits without admission. Diagnoses are recorded using ICD-10 codes.
Cancer registry
The cancer registry data is held by the National Cancer Registration and Analysis Service [26]. New cases of cancer are recorded based on information from sources including health care providers and death certificates. Cancer diagnoses are recorded using ICD-10 codes.
Office for National Statistics Death data
Dates of death of the lung cancer patients were obtained from the ONS [27] and were used to identify patients where date of diagnosis was on or after date of death.
Index of multiple deprivation
Data on index of multiple deprivation (IMD) was also available based on the GP practice postcode [28].
Clinical code lists
Clinical code lists used to define cases of lung cancer (including cancers of the trachea and bronchus), symptoms, comorbidities, ethnic group, and other patient characteristics are available in the supplementary information. Code lists were drawn up by review of published code lists [29] and by searching the medical and product dictionaries supplied by CPRD.
Study population and representativeness
Data were available on 800,018 patients, i.e., all patients in CPRD Gold born in 1954 or earlier who were still registered with a GP on their 65th birthday and who were eligible for linkage with the cancer registry, hospital episodes statistics, Office for National Statistics and IMD data (eligibility for linkage was determined by consent to the linkage process by the GP practice and the availability of a valid NHS number [30]). From this sample primary lung cancer cases were identified from 3 data sources, the CPRD primary care data, and the linked hospital episodes statistics and cancer registry data, using lists of Read codes or ICD-10 codes as appropriate. The CPRD data has been shown to be broadly representative of the UK population in terms of age, sex, and ethnicity when compared to the UK census and to the Health Survey for England in terms of BMI [31]. However, given the changing ethnic make-up of the UK over time, and the fact our sample was drawn from those patients born in 1954 or earlier, the proportion of white patients was relatively higher than if a younger age range of patients had been included. In addition, among women the proportion of lung cancer cases diagnosed at age 70 y or over increased substantially over the study period [32], so the study cohort will be less representative of the total population of female lung cancer patients in the earlier time periods.
Variable definitions
Diagnostic interval: DI was defined as the time between date of first presentation in primary care with a clinical problem suggestive of possible lung cancer, to the date of diagnosis (signs/symptoms and other clinical indicators up to 12 months before diagnosis were considered in line with a number of other studies of cancer DI conducted using the CPRD data [14, 33, 34]).
Presenting clinical problem: Signs/symptoms and other clinical problems considered to indicate possible lung cancer were cough, haemoptysis, loss of appetite, fatigue, dyspnoea, weight loss, thrombocytosis, chest infection, finger clubbing and lymphadenopathy—these were based on the National Institute for Health and Care Excellence (NICE) guidelines for suspected cancer recognition and referral [12] and on discussion with a GP (author MC). Due to small numbers of patients first presenting with finger clubbing, lymphadenopathy and appetite loss these were grouped together as “Other”, patients presenting with more than one clinical indicator initially were categorised as “Multiple”.
Date of diagnosis: this was taken to be the first date of an assigned diagnostic code for lung cancer present across the 3 data sources i.e., the CPRD primary care data, the HES data and the Cancer Registry data. Cancer Registry data were only available until the end of 2016, so for patients diagnosed between 2017 and 2019 date of diagnosis was taken as the first date of a diagnostic code for lung cancer in either the HES or the CPRD.
Multimorbidity: Following the classification from Mounce et al. [14] we considered the number of two classes of comorbidity diagnoses for each patient 1) “competing demand” (CD) conditions unrelated to lung cancer that place demand on the physician’s time and might reduce the focus on possible lung cancer symptoms 2) “alternative explanation” (AE) conditions i.e., conditions to which some lung cancer symptoms could reasonably be attributed. For the CD conditions we included coronary heart disease (CHD), depression/anxiety, heart failure, hypertension, chronic kidney disease (CKD), osteoporosis, dementia, serious mental illness (SMI), epilepsy and diabetes. For the AE conditions we included COPD, asthma, chronic fatigue syndrome (CFS), and receipt of a prescription for an Angiotensin-converting enzyme (ACE) inhibitor – the latter was included as cough is a common side effect of ACE inhibitors. As these were mostly chronic, non-resolving conditions, the presence of a diagnostic code at any point in the medical record, excluding the 12 months before diagnosis, was taken to indicate that the patient had the condition, an absence of a code was taken to indicate the absence of the condition (the 12 months prior to cancer diagnosis were excluded as we were interested in the effect of pre-existing comorbidities on DI). As depression/anxiety may resolve, only diagnoses in the 3 years prior to cancer diagnosis were taken to indicate the presence of the condition. For ACE inhibitor prescription, the relevant timeframe was taken to be any point in the 2 years prior to diagnosis.
Other variables: other factors considered included sex, age at diagnosis and year of diagnosis (categorised as shown in Table 1), usual consultation frequency measured as total number of consultations involving contact with a GP from 24 to 12 months before diagnosis, increase in consultation frequency in the year before diagnosis (yes/no) (12 month periods were chosen for comparison of consultation frequency as there is a substantial seasonal variation [35]), presenting clinical problem, IMD quintiles based on the GP practice address, and additional patient characteristics including smoking status (ever smoker versus non-smoker), alcohol drinking status (former, current or non-drinker), and BMI category (underweight or BMI < 18.5 kgm−2, normal weight or BMI of 18.5 to <25 kgm−2, overweight or BMI of 25 to <30 kgm−2, and obese or BMI ≥ 30 kgm−2). For BMI, smoking and alcohol drinking the most recent available measure was used, excluding measures in the 6 months before diagnosis as it was felt these might have been affected by the course of the disease. The patient’s ethnic group was also considered—there was substantial missing data in this variable and very small numbers in any ethnic group except for white (<1% in total) so the variable was grouped as white and other.
Exclusions
The following groups of patients were excluded from analyses: (1) patients with a current registration date <2 y before lung cancer diagnosis (2) patients who did not present with a relevant complaint in the year prior to diagnosis meaning DI could not be calculated (3) patients with a clinical code for chemo- or radio-therapy in the CPRD data prior to the diagnosis date (in these cases it was unclear when the cancer diagnosis was made) (4) patients with a diagnosis date after their death date (5) patients from practices with an up-to-standard date (the date from which data from a practice is considered to be of adequate quality for research) <1 y before their diagnosis date.
Statistical analyses
Our outcome variable was DI in days—we aimed to investigate how this was associated with number of CD and AE conditions allowing for presenting clinical problem, patient characteristics and other variables. The DI distribution was skewed, therefore results of univariate analyses report the median DI in each category. Age, usual consultation frequency and IMD were treated as continuous variables, other predictor variables were categorised as shown in Table 1. Initially, univariate associations between potential predictors and DI were investigated using Mann–Whitney U Test and Kruskal–Wallis test as appropriate for categorical variables and univariate linear regression for continuous variables. Multiple linear regression models were chosen for the multivariable analyses for ease of interpretability, as the data were skewed a number of diagnostic tests and additional analyses were run to assess the effect of non-normality on the results obtained. A series of regression models were produced using the following procedure:
Model 1: All variables with minimal (<5%) missing data (sex, age and calendar year at diagnosis, number of CD and AE comorbidities, IMD, usual consultation frequency, increase in consultation frequency, and presenting symptom) were entered into an initial model, and a final model selected using both backwards and bidirectional stepwise selection based on Akaike Information Criterion (AIC) [36].
Model 2: The effect of adding variables with significant amounts of missing data (ethnicity, BMI category, smoking status, alcohol drinking) to the final selected Model 1 was then investigated.
Model 3: The effect of adding interaction terms to the model between AE comorbidities and smoking, between presenting symptom and smoking and between AE comorbidities and presenting symptom was also investigated—all three interactions were added to Model 2 initially and backwards stepwise selection was used to select those that were significant at p < 0.05.
Model 4: As a final step we repeated the backwards selection procedure used in Model 1 replacing the grouped variables for CD and AE conditions with all the individual comorbidities as separate dummy variables in the initial model. This was in order to identify any individual comorbidities with a particularly marked effect on DI.
Results
Figure 1 shows a flow chart of the process of patient selection. 25875 lung cancer patients were identified in total across the 3 data sources (the CPRD primary care data and the linked HES and cancer registration data). Following exclusions, 11870 of these presented with a relevant sign/symptom allowing calculation of DI, these patients formed our final sample for analysis. Descriptive data on these patients are given in Table 1. Median (IQR) DI across all included patients was 140 (46, 270) days.
Univariate analyses
In univariate analyses, all the potential predictor variables were significantly associated with DI with the exception of ethnic group and increase in consultation rate before diagnosis (Table 1). Presenting sign/symptom was associated with large differences in DI which varied from a median of 37 days in patients presenting with haemoptysis to 173 days in those presenting with chest infection. DI was also substantially higher in patients with more CD conditions (median of 196 days in those with three or more conditions versus 101 days in those with none) and those with more AE conditions (median of 246 days in those with two or more conditions versus 90 days in those with none). DI was also somewhat higher in females, in ever- versus never smokers, in underweight patients, and among former drinkers.
Median DI stratified by number of AE and CD conditions
Figure 2 shows the median DI across number of AE conditions stratified by number of CD conditions. Median DI increased by number of AE conditions irrespective of number of CD conditions. For patients with no AE conditions median DI increased with number of CD conditions, however, for those with one or more AE conditions, DI was similar across strata of CD conditions.
Multivariable analyses
Results of Models 1 and 2 are shown in Table 2. Regression coefficients shown in the table represent the mean change in DI in days for a unit change in the predictor, or for categorical variables the difference in days from the reference category.
Model 1: The backwards stepwise regression procedure resulted in the number of CD conditions being dropped from the initial linear regression model, all other variables were retained. DI was slightly higher for females than males (B of 7.2 (95%CI 3.3, 11.2) days) and increased slightly with age and decreased with more recent calendar year of diagnosis. Presenting clinical problem was strongly associated with DI. Patients presenting with haemoptysis or weight loss were diagnosed substantially more quickly (regression coefficient (B) of −82.6 (95%CI −93.4, −71.8) and −50.5 (−61.9, −39.2) days respectively) compared to patients presenting with cough. DI was greater in those with a higher usual consultation rate (by around 2 days per consultation), and an increase in consultation rate in the year before diagnosis was also associated with a longer DI in the multivariable analysis. DI was strongly associated with the presence of AE conditions, being 30.6 (26.1, 35.0) and 73.8 (67.8, 79.9) days higher among those with one and two or more AE conditions respectively compared to patients with none.
Model 2: When ethnicity was added to Model 1 there was no evidence of an association (p = 0.143). Given the relatively large amount of missing data and the very small numbers in any ethnic group other than white this variable was not included in subsequent analyses. The results of adding smoking status, BMI category and alcohol drinking to the model are shown in Table 2. There was evidence that diagnosis was delayed in underweight patients compared to patients in the normal BMI category (B of 23.4 (12.7, 34.0) days)—this was in contrast to the association of weight loss as a symptom with reduced DI. Diagnosis was slightly faster in non-smokers compared to ever−smokers (B −11.4 (−18.6, −4.2) days). There was also some evidence that DIs were longer for former drinkers (B 6.7(0.1, 13.3) days) and non-drinkers (B 8.2 (1.1, 15.4) days) compared to current drinkers. Adding these variables to the model slightly reduced the association of DI with sex, estimates for other predictors remained similar.
Model 3: Only the interaction between number of AE conditions and first symptom was retained in the backwards selection procedure (p < 0.001). Full results are shown in Table S1 in the supplementary information. Notably, there was an indication that DIs for patients with dyspnoea were longer in those with an AE condition (by 25 days).
Model 4: The comorbidities remaining in the final model are shown in Table 3. Three AE comorbidities remained in the model, COPD, asthma and ACE inhibitor prescription – these were associated with a 59.1, 30.6 and 7.7 day increase in DI respectively. CHD was also associated with a relatively large increase in DI (18.5 days), the other conditions remaining in the model were depression/anxiety, osteoporosis and epilepsy which were all associated with increased DI (by 12.5, 10.3 and 13.6 days, respectively), and diabetes which was associated with a 5.6 day decrease in DI.
Additional analyses
As the distribution of DI was skewed Models 1 and 4 were rerun using the log and square root of DI—this did not improve the fit and so the original models using the untransformed DIs were retained. There was evidence of significant heteroskedasticity (p < 0.001 by Breusch–Pagan test) and so Models 1 and 4 were rerun using a weighted linear regression - the effect estimates and standard errors obtained were essentially unchanged (Table S2 in the supplementary information). Bias-corrected accelerated bootstrap confidence intervals were also constructed for the coefficients in Model 1 as this method is robust to non-normality [37]—the results were almost identical to those from the regression model (Table S3 in the supplementary information). Model 1 was repeated excluding patients with death certificate only diagnoses and those where date of diagnosis and date of death were the same, and the results obtained were very similar (Table S4 in the supplementary information). Model 1 was also repeated with the increase in consultation frequency in the year before diagnosis entered as a continuous variable rather than dichotomised to yes/no (Table S5 in the supplementary information). The increase in DI associated with a unit increase in yearly frequency of consultations in the year before diagnosis was 1.9 (95%CI 1.7, 2.1) days, other coefficients in the model remained similar.
Model 2 was repeated using data with missing values for BMI, smoking status and alcohol drinking imputed by multivariate imputation by chained equations using the MICE package in R [38]. Adding imputed data for missing values had a minimal effect on the regression coefficients. (Table S6 in the supplementary information).
Finally, for the 5884 patients for whom information on date of chest x-ray was available in the primary care data, we repeated Model 1 using the interval from first presentation in primary care with a relevant sign/symptom to chest x-ray (x-ray interval). This interval was included as it was felt it might be an indication of the primary care (as opposed to secondary care) element of the DI. The observed patterns of association between the predictor variables and x-ray interval were similar to those for DI, with the presence of one or two or more AE conditions being associated with a 21.8 and 60.7 day increase in x-ray interval respectively (Table S7 in the supplementary material).
Discussion
Summary
Patients with one or more comorbid conditions that could provide an alternative explanation for their lung cancer symptoms had substantially increased intervals from presentation in primary care to diagnosis, by 31 and 74 days for patients with one and two or more conditions respectively. The number of CD conditions did not remain in the final model selected by the stepwise procedure. More frequent GP consultations were also associated with a longer time from presentation in primary care to diagnosis. As expected, the “red flag” symptom of haemoptysis was associated with a substantially lower DI. DI was longer in smokers and in underweight patients, suggesting an association between other markers of poor health and delayed diagnosis. For individual conditions the greatest increase in DI was associated with COPD (59 days).
Interpretation of results and comparison with literature
While a number of studies have considered overall multimorbidity or the effect of respiratory disorders in relation to lung cancer DI, we believe this is the first study of lung cancer to group comorbidities as AE and CD conditions. In the model where the comorbidities were considered individually, we found significant increases in DI for three of our suggested AE conditions, ie, COPD, asthma and ACE inhibitor prescription. The diagnostic delay for cancer we have observed for patients with COPD and asthma is in line with previous studies [16,17,18]. To our knowledge, this is the first study to show an association between ACE inhibitor prescription and increased DI. The mean increase (around 8 days) was relatively small but was independent of the presence of several common chronic conditions associated with ACE prescription. In the model where comorbidities were considered individually, we also found strong associations between CHD and depression/anxiety and increased DI. Although we considered CHD as a CD condition, both dyspnoea and chest pain are common symptoms of CHD, so it is possible that it may be acting as an AE condition. However, both CHD and depression/anxiety have also been found elsewhere to be positively associated with DI for colorectal cancer [14, 39].
It has been suggested that the difference in the average number of pre-referral consultations for different cancers relates to their typical symptom patterns, with patients with more specific symptoms (e.g., a breast lump) being referred sooner than those with less specific symptoms (e.g., fatigue or dyspnoea) [11, 40, 41]. However, there is relatively little information on the associations between presenting cancer symptom and DI [40]. We found that DI was lowest for patients presenting with haemoptysis, a key “red flag” symptom for cancer, and was also relatively low for weight loss. Similarly, Walter et al. found that haemoptysis was the symptom associated with the shortest DI in 963 patients with suspected lung cancer [42]. A number of other studies of both lung [33, 34, 43] and other cancers [33] have found shorter DI among patients with “alert/alarm” versus “vague/non-alert” symptoms [34, 44]. It seems likely that some cancer symptoms would be more likely than others to be attributed to comorbidities, but few studies have considered the influence of symptoms and AE conditions simultaneously. We found evidence of an interaction between AE conditions and presenting clinical problem on DI, notably an additional increase of 25 days in patients with both dyspnoea and an AE condition. This suggests that symptoms of dyspnoea may be particularly likely to be attributed to an existing comorbidity among patients with an AE condition.
An increase in consultation rate is a risk marker for cancer [10]. Consultation rates rise for patients with cancer around four to six months before diagnosis, with this increase occurring independently of usual consultation frequency [45]. However, high habitual consultation rates have been found to reduce the GP’s suspicion of cancer, particularly in older patients [46]. Multimorbidity is strongly associated with GP consultation rates [47], and has been found to contribute to the rate of GP burnout [48]. In our study, both higher usual consultation frequency and an increase during the year before diagnosis were strongly positively associated with longer DI in the multivariable analyses. The positive association between the increase in consultation rates in the year before diagnosis may reflect an increase in diagnostic investigations undertaken in primary care. Frequently attending patients may have had a referral made for investigation of another serious illness regarded as a more likely explanation for their symptoms by the GP. However, the association with higher usual consultation rate in the pre-diagnostic period is unlikely to be explained in this way. It is also possible that the positive associations between consultation rates and DI may result from a focus on psychological or anxiety-related, rather than physical, explanations for symptoms [49] (which would accord with the delay in diagnosis associated with a depression/anxiety diagnosis in this study).
Smoking is the key risk factor for lung cancer, and the NICE guidelines on lung cancer referral differ depending on the patient’s smoking status [12]. Somewhat counterintuitively we found that DI was longer in smokers. It is possible this may reflect a tendency by GPs to view the effects of smoking (rather than possible cancer) as an “alternative explanation” for symptoms such as breathlessness.. This apparent increase in DI among patients with a history of smoking is concerning, as there is evidence that they may also wait longer before presenting to primary care with lung cancer symptoms because of a fear of blame or stigma [50].
We also found an association between BMI and delayed cancer diagnosis, with DI being around 23 days longer in underweight compared to normal weight patients, although weight loss as a symptom was associated with more rapid diagnosis. While weight loss as a symptom is frequently included in analyses, there seems to be little other information in the literature on the association between pre-diagnostic BMI and DI. The number of underweight patients was relatively small so these results should be treated with caution, but it may be that weight loss as a possible cancer symptom is less easily recognised in patients who are already underweight.
Although the unadjusted median DIs increased over the study period, the adjusted DI, taking account of presenting sign/symptom and other factors, decreased in recent years. Referral guidelines changed over the study period, with a 2 week wait guideline for cancer referral being introduced in 2000 [51]. There have also been a number of public information campaigns over this timeframe resulting in increased awareness of early cancer symptoms [52], including the “Be Clear on Cancer” campaign [53]. Evidence suggests the proportion of patients presenting with different lung cancer symptoms has changed over time, with more patients presenting with cough, and fewer with haemoptysis [54]. Additionally, while the common and red flag symptoms for lung cancer (haemoptysis, cough, dyspnoea etc.) have been well-established for decades, there has been a gradual acceptance of other indicators, such as thrombocytosis over the study period. Access to a GP may also have changed over the study period – the number of GPs per capita in England increased between 1997 and 2009 but then fell between 2009 and 2018 [55].
Complex patterns of association between DI and cancer mortality have been observed, including apparently paradoxical associations between shorter DI and increased mortality, which may be explained by more severe symptoms from aggressive tumours leading to quicker diagnosis (“waiting time paradox”) [34, 56, 57]. Despite the inconsistent nature of observed associations between DI and mortality, there is no plausible mechanism by which increased DI can reduce mortality, and increased tumour size is associated with higher mortality within TNM stage group [58]. Faster diagnosis allows quicker referral to other services including palliative care, and there are other adverse consequences to longer DI, including increased patient anxiety [59].
Strengths and limitations
A key strength of this study is the use of the CPRD and linked secondary care and cancer registration data, which covers a large, broadly representative sample of the English population. As the data are routinely collected the study covers data from groups that may be hard to recruit to traditional research studies such as elderly/frail patients [60], patients living in deprived areas and those with literacy problems, although the preponderance of white patients meant we were not able to conduct a meaningful analysis of the effect of ethnic group. Cancer cases were identified from three sources (NCRAS, HES and CPRD), increasing the completeness of case identification and the accuracy of date of diagnosis determination [61], this is particularly important for studies investigating the effect of comorbidity as missing cancer diagnoses in the CPRD were more likely for older patients with comorbidities [61]. We were also able to identify those cases with a death certificate only diagnosis (where it could be argued that the calculated DI is artificially truncated) and to confirm that excluding these patients from the analysis did not materially affect the results.
However, there are a number of potential issues with the use of routinely collected data for research. Symptom identification in this study was dependent upon the accuracy of recording by the GP. Some clinical problems will have been noted in the free text area of the clinical records which we were unable to access. In addition, it has been suggested that some GPs may only be prompted to record a code for a symptom when the decision to refer has been made [34]. Both these factors could result in an underestimate of DI. It seems probable that this would occur more often for symptoms potentially explicable by existing conditions, so this bias would be likely to result in a reduction in the observed association between AE conditions and DI with the data available in our study, rather than inflating it.
A further issue is the relatively high levels of missing data for smoking status, alcohol drinking and BMI. These data are unlikely to be missing completely at random, with sicker patients and those consulting more frequently tending to have more complete data [62], and a complete case analysis may introduce bias [63]. The quality of data recording in the CPRD will have been affected by the introduction of the Quality and Outcomes Framework in 2004 [64], a scheme that offered financial incentives to GPs to encourage reporting of key data items. As a result, the completeness of recording of variables such as smoking status, BMI and alcohol drinking has increased substantially since that time [31]. We repeated our analysis with imputed data for missing values for smoking, BMI and alcohol drinking using an imputation process that included key recommended variables (measures of healthcare utilisation, health status/comorbidities and socioeconomic status [65]). Our results remained essentially unchanged, suggesting it is unlikely that the missing data would have substantially altered the observed associations.
We based our definition of DI on symptoms occurring up to 12 months before diagnosis. Taking into account the large number of patients in the study, and the lack of access to any text notes in the clinical record, our only option was to determine date of first symptom algorithmically from the date of the first relevant Read code in the EHR in the 12 months preceding diagnosis. However, some patients who go on to develop lung cancer may have had symptoms such as cough or dyspnoea for many years, and this method did not allow us to capture other potential indicators of true first cancer symptoms, such as a change in the nature or severity of cough.
The 12 month cut-off will result in the exclusion of a small number of patients with true DIs longer than a year, conversely, increasing the DI beyond this point would have resulted in more patients with symptoms unrelated to their subsequent cancer being included. We believe 12 months offers a reasonable compromise based on the low predictive value of symptoms recorded before this point for lung cancer [66], and is in line with previous work on cancer DI [14, 33, 34].
We have focussed our analyses on the factors associated with the overall DI from presentation in primary care to diagnosis, on the basis that this is the health system interval most likely to be associated with patient outcomes. However, delays can occur in both primary and secondary care, and may have different predictive factors. Using data on date of chest x-ray in the primary care record as a proxy for primary care interval, we found a similar pattern of associations to using the overall DI, although as we did not have access to the HES diagnostic imaging data there will have been substantial missing information in this variable (we had information on x-ray interval for 49% of the lung cancer patients, while a previous analysis of CPRD using the HES DID data suggests that 85% of patients had a pre-diagnostic x-ray [67]).
Conclusions and Clinical Implications
This is one of only a few studies on how multimorbidity (i.e., AE and CD conditions) might impact on DI for lung cancer patients – substantial increases were found for patients with conditions offering alternative explanations for lung cancer symptoms. These findings highlight the difficulty of recognising cancer symptoms in the face of multimorbidity, and support the recent proposal to roll out a screening programme in the UK to detect lung cancer sooner [68]. Further studies to determine factors predictive of lung cancer development among patients with AE conditions might enhance this programme, by identifying those who would benefit most from regular screening. DI was also longer for patients with higher consultation frequency suggesting competing demand on the GP’s time may impact on timely diagnosis. Our results also suggest that diagnosis may be delayed in lung cancer patients who smoke or are underweight. Clinical guidelines should highlight the potential of these factors to impact on timely cancer diagnosis.
Data availability
The data used in this study are provided under licence from the UK Medicines and Healthcare Products Regulatory Agency.
Code availability
Data preparation code is available on zenodo (https://doi.org/10.5281/zenodo.11549893).
References
Cancer Research UK. Lung Cancer Statistics, https://www.cancerresearchuk.org/health-professional/cancer-statistics/statistics-by-cancer-type/lung-cancer#heading-One Accessed 13th February.
Arnold M, Rutherford MJ, Bardot A, Ferlay J, Andersson TM, Myklebust T, et al. Progress in cancer survival, mortality, and incidence in seven high-income countries 1995-2014 (ICBP SURVMARK-2): a population-based study. Lancet Oncol. 2019;20:1493–505.
European Federation of Pharmaceutical Industries and Associations. SURVIVAL, https://efpia.eu/publications/cancer-comparator-report/survival/ Accessed 16th August.
Walters S, Maringe C, Coleman MP, Peake MD, Butler J, Young N, et al. Lung cancer survival and stage at diagnosis in Australia, Canada, Denmark, Norway, Sweden and the UK: a population-based study, 2004-2007. Thorax. 2013;68:551–64.
Andreano A, Peake MD, Janes SM, Valsecchi MG, Pritchard-Jones K, Hoag JR, et al. The care and outcomes of older persons with lung cancer in England and the United States, 2008-2012. J Thorac Oncol. 2018;13:904–14.
Neal R. Do diagnostic delays in cancer matter? Br J Cancer. 2009;101:S9–S12.
Tørring ML, Murchie P, Hamilton W, Vedsted P, Esteva M, Lautrup M, et al. Evidence of advanced stage colorectal cancer with longer diagnostic intervals: a pooled analysis of seven primary care cohorts comprising 11,720 patients in five countries. Br J Cancer. 2017;117:888–97.
Jain S, Sharma SS, Gupta DK. Effect of patient and diagnostic intervals on the risk of advanced stage in Indian patients with seven types of gastrointestinal cancers: a retrospective cohort study. Cancer Epidemiol. 2024;88:102514.
Tørring ML, Frydenberg M, Hansen RP, Olesen F, Vedsted P. Evidence of increasing mortality with longer diagnostic intervals for five common cancers: a cohort study in primary care. Eur J Cancer. 2013;49:2187–98.
Ewing M, Naredi P, Nemes S, Zhang C, Månsson J. Increased consultation frequency in primary care, a risk marker for cancer: a case-control study. Scand J Prim Health Care. 2016;34:205–12.
Lyratzopoulos G, Wardle J, Rubin G. Rethinking diagnostic delay in cancer: how difficult is the diagnosis? Bmj. 2014;349:g7400.
National Institute for Health and Care Excellence. Suspected cancer: recognition and referral, https://www.nice.org.uk/guidance/ng12/chapter/Recommendations-organised-by-site-of-cancer Accessed 11th February, (2015).
Renzi C, Kaushal A, Emery J, Hamilton W, Neal RD, Rachet B, et al. Comorbid chronic diseases and cancer diagnosis: disease-specific effects and underlying mechanisms. Nat Rev Clin Oncol. 2019;16:746–61.
Mounce LTA, Price S, Valderas JM, Hamilton W. Comorbid conditions delay diagnosis of colorectal cancer: a cohort study using electronic primary care records. Br J Cancer. 2017;116:1536–43.
Mitchell ED, Rubin G, Macleod U. Understanding diagnosis of lung cancer in primary care: qualitative synthesis of significant event audit reports. Br J Gen Pract. 2013;63:e37–46.
Bjerager M, Palshof T, Dahl R, Vedsted P, Olesen F. Delay in diagnosis of lung cancer in general practice. Br J Gen Pract. 2006;56:863–8.
Nadpara P, Madhavan SS, Tworek C. Guideline-concordant timely lung cancer care and prognosis among elderly patients in the United States: a population-based study. Cancer Epidemiol. 2015;39:1136–44.
Erefai O, Soulaymani A, Mokhtari A, Obtel M, Hami H. Diagnostic delay in lung cancer in Morocco: a 4-year retrospective study. Clin Epidemiol Global Health. 2022;16. https://doi.org/10.1016/j.cegh.2022.101105.
Pearson C, Fraser J, Peake M, Valori R, Poirier V, Coupland VH, et al. Establishing population-based surveillance of diagnostic timeliness using linked cancer registry and administrative data for patients with colorectal and lung cancer. Cancer Epidemiol. 2019;61:111–8.
Sirota M, Kostopoulou O, Round T, Samaranayaka S. Prevalence and alternative explanations influence cancer diagnosis: An experimental study with physicians. Health Psychol. 2017;36:477–85.
Medicines and Healthcare Products Regulatory Agency. Clinical Practice Research Datalink. https://cprd.com/ Accessed 7th March 2024.
Medicines and Healthcare Products Regulatory Agency. CPRD linked data, https://cprd.com/cprd-linked-data. Accessed 30th May 2024.
Chisholm J. The Read clinical classification. Br Med J. 1990;300:1092–1092.
NHS Digital. Hospital Episodes Statistics (HES), https://digital.nhs.uk/data-and-information/data-tools-and-services/data-services/hospital-episode-statistics.
Medicines and Healthcare Products Regulatory Agency. Hospital Episode Statistics (HES) Outpatient Care and CPRD primary care data Documentation (set 21), https://cprd.com/sites/default/files/2022-02/Documentation_HES_OP_set21.pdf Accessed 4th October 2023.
Medicines and Healthcare Products Regulatory Agency. Cancer registration data, https://cprd.com/cprd-linked-data#Cancer%20registration%20data. Accessed 30th May 2024.
Medicines and Healthcare Products Regulatory Agency. ONS death registration data and CPRD primary care data Documentation, https://cprd.com/sites/default/files/2022-02/Documentation_Death_set22_v2.6.pdf Accessed 18th June, (2021).
Medicines and Healthcare Products Regulatory Agency. Small area level data based on practice postcode, https://cprd.com/sites/default/files/2022-05/Documentation_SmallAreaData_Practice_set22_v3.4_1.pdf Accessed 18th June, (2022).
Zghebi SS, Reeves D, Grigoroglou C, McMillan B, Ashcroft DM, Parisi R, et al. Clinical code usage in UK general practice: a cohort study exploring 18 conditions over 14 years. BMJ Open. 2022;12:e051456.
Medicines and Healthcare Products Regulatory Agency. Using linked data, https://www.cprd.com/using-linked-data Accessed 8th May, (2024).
Herrett E, Gallagher AM, Bhaskaran K, Forbes H, Mathur R, van Staa T, et al. Data resource profile: clinical practice research datalink (CPRD). Int J Epidemiol. 2015;44:827–36.
Cancer Research UK. Lung cancer incidence statistics, https://www.cancerresearchuk.org/health-professional/cancer-statistics/statistics-by-cancer-type/lung-cancer/incidence#heading-Two Accessed 30th May.
Din NU, Ukoumunne OC, Rubin G, Hamilton W, Carter B, Stapley S, et al. Age and gender variations in cancer diagnostic intervals in 15 cancers: analysis of data from the UK clinical practice research datalink. PLoS One. 2015;10:e0127717.
Redaniel MT, Martin RM, Ridd MJ, Wade J, Jeffreys M. Diagnostic intervals and its association with breast, prostate, lung and colorectal cancer survival in England: historical cohort study using the Clinical Practice Research Datalink. PLoS One. 2015;10:e0126608.
Millwood S, Tomlinson P, Hopwood J. Evaluation of winter pressures on general practice in Manchester: a cross-sectional analysis of nine GP practices. BJGP Open. 2021;5. https://doi.org/10.3399/bjgpopen20X101138.
Cavanaugh JE, Neath AA. The Akaike information criterion: background, derivation, properties, application, interpretation, and refinements. WIREs. Comput Stat. 2019;11:e1460.
Davison A, Hinkley D. Bootstrap Methods and their Application. New York, USA, Cambridge University Press; 1997.
van Buuren S, Groothuis-Oudshoorn K. mice: multivariate imputation by chained equations in R. J Stat Softw. 2011;45:1–67.
Robertson R, Campbell NC, Smith S, Donnan PT, Sullivan F, Duffy R, et al. Factors influencing time from presentation to treatment of colorectal and breast cancer in urban and rural areas. Br J Cancer. 2004;90:1479–85.
Koo MM, Hamilton W, Walter FM, Rubin GP, Lyratzopoulos G. Symptom signatures and diagnostic timeliness in cancer patients: a review of current evidence. Neoplasia. 2018;20:165–74.
Swann R, McPhail S, Witt J, Shand B, Abel GA, Hiom S, et al. Diagnosing cancer in primary care: results from the National Cancer Diagnosis Audit. Br J Gen Pract. 2018;68:e63–e72.
Walter F, Rubin G, Bankhead C, Morris H, Hall N, Mills K, et al. Symptoms and other factors associated with time to diagnosis and stage of lung cancer: a prospective cohort study. Br J Cancer. 2015;112:S6–S13.
Guldbrandt LM, Fenger-Grøn M, Rasmussen TR, Jensen H, Vedsted P. The role of general practice in routes to diagnosis of lung cancer in Denmark: a population-based study of general practice involvement, diagnostic activity and diagnostic intervals. BMC Health Serv Res. 2015;15:21.
Buccheri G, Ferrigno D. Lung cancer: clinical presentation and specialist referral time. Eur Respir J. 2004;24:898–904.
Jensen H, Vedsted P, Møller H. Consultation frequency in general practice before cancer diagnosis in relation to the patient’s usual consultation pattern: a population-based study. Cancer Epidemiol. 2018;55:142–8.
Jensen H, Merrild CH, Møller H, Vedsted P. Association between GPs’ suspicion of cancer and patients’ usual consultation pattern in primary care: a cross-sectional study. Br J Gen Pract. 2019;69:e80–e87.
O’Regan A, O’Doherty J, O’Connor R, Cullen W, Niranjan V, Glynn L, et al. How do multi-morbidity and polypharmacy affect general practice attendance and referral rates? A retrospective analysis of consultations. PLoS One. 2022;17:e0263258.
Pedersen AF, Nørøxe KB, Vedsted P. Influence of patient multimorbidity on GP burnout: a survey and register-based study in Danish general practice. Br J Gen Pract. 2020;70:e95–e101.
Sharabani R, Kagan I, Cojocaru S. Frequent attenders in primary health care: a mixed-methods study of patient and staff perspectives. J Clin Nurs. 2023;32:7135–46.
Tod AM, Craven J, Allmark P. Diagnostic delay in lung cancer: a qualitative study. J Adv Nurs. 2008;61:336–43.
NHS. The NHS Cancer Plan. https://webarchive.nationalarchives.gov.uk/ukgwa/20130107105354/http://www.dh.gov.uk/prod_consum_dh/groups/dh_digitalassets/@dh/@en/documents/digitalasset/dh_4014513.pdf. 2000.
Richards MA. The National Awareness and Early Diagnosis Initiative in England: assembling the evidence. Br J Cancer. 2009;101:S1–4.
Cancer Research UK. Be Clear on Cancer. https://www.cancerresearchuk.org/health-professional/awareness-and-prevention/be-clear-on-cancer Accessed 17th May.
Chowienczyk S, Price S, Hamilton W. Changes in the presenting symptoms of lung cancer from 2000–2017: a serial cross-sectional study of observational records in UK primary care. Br J Gen Pract. 2020;70:e193–9.
Statista. Annual spending on public healthcare as a share of gross domestic product (GDP) in the United Kingdom (UK) from 2000 to 2022. https://www.statista.com/statistics/472984/public-health-spending-share-of-gdp-united-kingdom-uk/ Accessed 21st May.
González-Barcala FJ, García-Prim JM, Alvarez-Dobaño JM, Moldes-Rodríguez M, García-Sanz MT, Pose-Reino A, et al. Effect of delays on survival in patients with lung cancer. Clin Transl Oncol. 2010;12:836–42.
Radzikowska E, Roszkowski-Sliz K, Chabowski M, Glaz P. Influence of delays in diagnosis and treatment on survival in small cell lung cancer patients. Adv Exp Med Biol. 2013;788:355–62.
Rami-Porta R, Bolejack V, Crowley J, Ball D, Kim J, Lyons G, et al. The IASLC lung cancer staging project: proposals for the revisions of the T descriptors in the forthcoming eighth edition of the TNM classification for lung cancer. J Thorac Oncol. 2015;10:990–1003.
Forster AS, Herbert A, Koo MM, Taylor RM, Gibson F, Whelan JS, et al. Associations between diagnostic time intervals and health-related quality of life, clinical anxiety and depression in adolescents and young adults with cancer: cross-sectional analysis of the BRIGHTLIGHT cohort. Br J Cancer. 2022;126:1725–34.
Todd OM, Burton JK, Dodds RM, Hollinghurst J, Lyons RA, Quinn TJ, et al. New Horizons in the use of routine data for ageing research. Age Ageing. 2020;49:716–22.
Arhi CS, Bottle A, Burns EM, Clarke JM, Aylin P, Ziprin P, et al. Comparison of cancer diagnosis recording between the Clinical Practice Research Datalink, Cancer Registry and Hospital Episodes Statistics. Cancer Epidemiol. 2018;57:148–57.
Weiskopf NG, Rusanov A, Weng C. Sick patients have more data: the non-random completeness of electronic health records. AMIA Annu Symp Proc. 2013;2013:1472–7.
Rubin D. Inference and missing data. Biometrika. 1976;63:581–92.
NHS England. Quality and Outcomes Framework (QOF). https://digital.nhs.uk/data-and-information/data-tools-and-services/data-services/general-practice-data-hub/quality-outcomes-framework-qof Accessed 21st May.
Wells BJ, Chagin KM, Nowacki AS, Kattan MW. Strategies for handling missing data in electronic health record derived data. EGEMS (Wash DC). 2013;1:1035.
Prado M, Kessler L, Au M, Burkhardt H, Suchsland M, Kowalski L, et al. Symptoms and signs of lung cancer prior to diagnosis: Comparative study using electronic health records. BMJ Open. 2022. https://doi.org/10.1101/2022.06.01.22275657.
Arendse KD, Walter FM, Pilling M, Zhou Y, Hamilton W, Funston G. Time from presentation to pre-diagnostic chest X-ray in patients with symptomatic lung cancer: a cohort study using electronic patient records from English primary care. Br J Gen Pract. 2021;71:e273–9.
GOV.UK. New lung cancer screening roll out to detect cancer sooner, https://www.gov.uk/government/news/new-lung-cancer-screening-roll-out-to-detect-cancer-sooner. 2023. Accessed 30th May 2024.
Acknowledgements
We would like to acknowledge the advice on using the CPRD data from Dr Louis Gagatsis, and advice on the statistical analyses from Dr Anestis Touloumis.
Funding
This research was funded by the National Institute for Health and Care Research (NIHR) Applied Research Collaboration Kent, Surrey, Sussex (ARC KSS), funding number NIHR200179.
Author information
Authors and Affiliations
Contributions
IR conducted the analyses and wrote the first draft of the manuscript. EF, AM and HvM acquired the data for the study. IR, MC, AM, LF, HvM and EF all contributed to the design and interpretation of the analyses and the drafting of the final manuscript. All authors have approved the final version and agree to be accountable for the work.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Ethics approval
Ethics approval for the study was granted by the MHRA (UK) Independent Scientific Advisory Committee (ISAC-Protocol 19_050). The study used data from the Clinical Practice Research Datalink and linked Hospital Episodes Statistics and Cancer Registry data which are collected from patients as part of their NHS care. The study was performed in accordance with the Declaration of Helsinki.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Rogers, I., Cooper, M., Memon, A. et al. The effect of comorbidities on diagnostic interval for lung cancer in England: a cohort study using electronic health record data. Br J Cancer 131, 1147–1157 (2024). https://doi.org/10.1038/s41416-024-02824-2
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1038/s41416-024-02824-2