Introduction

The World Health Organisation has argued that the management of mouth cancer should be an integral part of national cancer control programmes.1 The prevention of mouth cancer can be classified into primary, secondary and tertiary programmes. While primary prevention describes a process that prevents a disease from developing, secondary prevention programmes generally seek to detect disease while it is localised or 'early'. Tertiary prevention seeks to mitigate the morbidity from established disease and to improve quality of life.

Screening is one method of early disease detection and is defined as 'the application of a test to people who are free of the disease in question, in order to identify those who may have the disease and to distinguish them from those who may not.'2 A screening programme is an on-going process of examination and referral, applied to a defined population at set intervals, and it must be differentiated from case-finding or early detection, which is usually a one-off process of clinical examination aimed at diagnosing specific lesions. Screening is also distinct from epidemiological studies that 'screen' populations to determine the prevalence of a disease or lesions.

Many countries set strict criteria for new screening programmes to be initiated, for example, the UK National Screening Committee now assesses the evidence for a new programme against 20 criteria.3 There are more than 20 screening programmes which have already been implemented in the UK, but only three are for cancer: these are cervical, breast and bowel cancer. As can be inferred from Table 1, the criteria demand a high level of evidence, and many of the implemented programmes may not now satisfy all of the 20 criteria. As indicated in the right column, many of the criteria have not been met or have never been evaluated or addressed for mouth cancer screening.

Table 1 Criteria for the implementation of a screening programme3

Population screening programmes are of three main types. Mass screening describes a process whereby the whole population are screened, but this type of programme is rarely used. Most programmes are selective and target a subset of the population who are felt to be at highest risk, for example the screening of particular age groups of women for cervical cancer, or the over-60s for bowel cancer. The third type is opportunistic screening, where individuals are examined when they attend a healthcare professional for some other, often unrelated, purpose. This may be used, for example, to check blood pressure when attending a general medical practitioner, or could involve a mucosal examination when attending the dentist for a routine check.

Making a case for screening mouth cancer

The management of established lesions of mouth cancer is complex and multidisciplinary, often involving surgery and radiotherapy, and is therefore expensive. By contrast, small, early-stage cancers can often be treated by simple surgery, with less cost, lower morbidity and a better chance of cure. For example, the five-year survival for stage I mouth cancer has been shown to be 96% compared with 57% for late disease at stage IV.4 In addition, many cases of mouth cancer are preceded by a potentially malignant disorder (often a white patch or leukoplakia) providing a further opportunity to prevent disease progression. The mouth is also easy to examine and mouth lesions are relatively easy to detect. This all suggests that mouth cancer screening should be easy to implement and be an effective public health measure to prevent mouth cancer. It seems perplexing therefore that no country in the world has implemented a national mouth cancer screening programme, and at our present state of knowledge it is unlikely to happen.5 The following sections will try to summarise why this is the case.

Methods for the evaluation of a screening programme

Before a test can be used in clinical practice its efficacy must be determined in experimental programmes that enable a calculation of sensitivity and specificity. A screening examination or test is not intended to be diagnostic, so when an individual is screened there are four potential outcomes (Table 2). A perfect screening test would correctly classify everyone in the population who has the disease as positive (TP) and all people without the disease as negative (TN). However, this never occurs, because the tests can never be absolutely accurate and so some individuals are falsely classified as positive (FP) or negative (FN). There is therefore a balance between the sensitivity and specificity of the screening test. The sensitivity is calculated as TP/TP+FN and is the proportion of the population who are correctly classified as having the disease. The specificity is calculated as TN/TN+FP and shows the proportion of the population correctly classified as disease-free. TP + FN is the number of people in the population who have the disease and TN + FP is the number in the population who do not have the disease. There is no ideal value for sensitivity and specificity, while it is desirable to have a low number of false positives, to avoid unnecessary investigations and anxiety, it is also important not to miss disease, so false negatives must also be as low as possible. In general, tests aim to have a sensitivity and specificity in the order of 80% or more.

Table 2 Metrics for a screening programme

Once the sensitivity and specificity have been determined they can be used to form a summary receiver operating characteristic (ROC) curve, which graphically illustrates the ability of the test to distinguish positives from negatives (Fig. 1) The graph plots the true positive rate (sensitivity) against the false positive rate (1 – specificity). The efficacy of the test can be represented as the area under the curve (AUC). The dotted line represents chance that is, the test is no better than a toss of a coin at detecting true positives and true negatives (AUC of 0.5). The higher curve gives an AUC with a higher value. Values of 0.8 or greater are considered to be the minimum for an acceptable test.

Figure 1
figure 1

Summary receiver operating characteristic curve

Screening for mouth cancer

At the present time no test has been shown to be effective for the accurate detection of oral lesions in the context of a screening programme.6 The only properly tested health technology used in experimental and observational studies for the detection of oral potentially malignant lesions or mouth cancer is a conventional oral examination (COE). The results of ten studies are summarised in Table 3.7,8,9,10,11,12,13,14,15,16 In a Cochrane diagnostic test accuracy review, Walsh et al.17 examined the potential of COE, vital rinsing, light-based detection, biomarkers and mouth self-examination (MSE) as methods for the detection of potentially malignant lesions and early mouth cancer in apparently healthy adults. The review only included studies that had calculated sensitivity and specificity, and excluded studies that did not follow-up on subjects assigned as negative by the screening test (making the calculation of false negatives impossible). The review found that the sensitivity of COE was variable, but was always greater than 0.70, and that specificity was consistently greater than 0.90. None of the other technologies tested were sufficiently well evaluated to determine their test accuracy. Overall, Walsh et al. concluded that COE had a satisfactory test performance with sensitivities and specificities similar to those reported for breast and cervical cancer screening programmes. They also highlighted that a number of studies utilised non-medical or dental, healthcare workers as screeners, without any significant change to the test performance results.12,13,14 This shows that trained healthcare workers or dental care professionals are equally able to examine the mouth and detect relevant lesions, and could fully participate in programmes for screening or early detection. This has been confirmed by a number of more recent studies that have evaluated lesion identification by trained health workers.18,19

Table 3 Reports of evaluations of conventional oral examinations, in which sensitivity and specificity of the test have been calculated

These studies have shown that it is possible to detect relevant oral lesions with an acceptable degree of accuracy, and suggest that COE may be a valuable screening test. However, a more important consideration is to determine whether a mouth cancer screening programme using COE, actually works. The primary outcome measure for the clinical efficacy of a screening programme is whether or not it reduces mortality within the population – that is, the programme must be shown to save lives. A Cochrane systematic review of screening programmes found that there has only ever been one properly conducted study that has evaluated the clinical- or cost-efficacy of mouth cancer screening.20 This was a large cluster-randomised controlled trial carried out over a period of 15 years involving four rounds of screening, completed in 1998, 2002, 2004 and 2009.21,22,23,24 The screening programme was carried out in 13 municipalities in Kerala, South India, divided into seven test (screened) clusters (n = 96,517) and six control (not screened) clusters (n = 95,356). Non-medical, but trained, university graduates performed a COE to identify potentially malignant lesions (including leukoplakia, oral submucous fibrosis and lichen planus), and ulcers suspicious of malignancy. The gold standard outcome measure for the study was to determine mortality from mouth cancer in the test and control cohorts.

The Kerala study did not show a significant difference in mouth cancer mortality for the screened group (15.4/100,000 person–years) compared to the control group (17.1/100,000 person–years), with a RR of 0.88 (95% CI 0.69–1.12). However, at the end of the 15-year period, they were able to demonstrate a statistically significant 24% reduction in mortality for high-risk individuals who used tobacco and/or alcohol between the screened group (30/100,000 person–years) and the control group (39.0/100,000).25 Equally, it is important to note that only 19,288 persons completed four rounds of screening – 20% of the eligible population.

Further analysis of the Kerala programme26 found that the overall benefit of the screening programme was 269 life years saved per 100,000 for all individuals and 1,438 lives saved among the high-risk groups. The cost per screening examination was only $6 per person and the incremental cost per life-year saved was US$835 for all individuals and US$156 for those at high-risk. In their systematic review, Brocklehurst et al.20 acknowledged the significant findings of the Kerala study, but also concluded that the trial had a high risk of bias, and concluded that further experimental evidence was needed to recommend population screening programmes. Similar conclusions were reached by Government bodies across the world and none have found the evidence sufficiently robust to recommend screening as a population health approach.5

Should we screen for mouth cancer?

The cost of screening demonstrated by the Kerala studies was within the target set by the WHO Commission on Macroeconomics and Health, who define an intervention as being cost effective when the cost–efficacy ratio is less than a country's gross domestic product per capita.25 However, these values may not apply in a Western setting and to date there have been no randomised controlled trials in any developed country. This is mainly because of the low prevalence of relevant lesions making it very difficult and prohibitively expensive to conduct definitive trials of large scale population screening programmes for mouth cancer in countries such as the UK or USA.

An alternative approach was undertaken by Speight et al.26 who used computer modelling to simulate the costs of mouth cancer screening in a number of healthcare environments. Data to inform the model were taken from published NHS costs and from studies of disease prevalence and test performance. They found that mouth cancer screening in the UK would only be cost-effective if it were targeted at high-risk individuals and carried out opportunistically by GMPs and/or GDPs. In this scenario, the cost per life year saved was £22,850, which is within the costs deemed acceptable by NICE.26,27

These studies have shown that screening programmes have not resulted in a demonstrable reduction in mortality, although there is some evidence that screening of high-risk groups may be effective and cost-effective. At present the evidence is insufficient to persuade a government to sanction a national programme. In addition, there are other valid criticisms of screening for mouth cancer. Most of the studies that have evaluated a COE have used a lesion consistent with an mouth potentially malignant disorder or early mouth cancer as the criteria for a positive test. Most often this has been a white patch (leukoplakia), red patch (erythroplakia) or persistent ulcer, suggestive of cancer. Leukoplakia is the most common potentially malignant lesion, and although they are quite common, with a prevalence of between 1.4 and 22%,28,29 the clinical appearance does not correlate well to histology and only about 5% overall will ever progress to cancer.30,31 This means that about 95% of screen detected lesions may not progress and are therefore not relevant to a test designed to detect lesions with a high likelihood of progressing to mouth cancer. The 'holy grail' is to identify adjunctive clinical tests or molecular methods that may identify the relevant 5%. However, at the present time, no molecular biomarkers or adjunctive tests have been shown to have utility in screening trials.32,33

The management of oral potentially malignant disorders also remains a controversial area, raising doubt about the correct management for screen-detected lesions. Holmstrup et al. showed that even after surgical removal, the risk of malignant change may remain since the visible lesion may only represent a small area in a field of damaged mucosa.34,35 As a result, even though early detection and intervention may remove the detected lesion, a whole field of damaged mucosa remains, that could progress to malignancy. There is also a lack of understanding about the malignant transformation of potentially malignant lesions and its temporal relationship with mouth cancer.30 Some potentially malignant lesions may become increasingly dysplastic or progress to cancer, but others remain static or may even regress. This lack of understanding of the detailed natural history of potentially malignant lesions, and the absence of molecular markers means that finding a test that can identify which lesions will become malignant remains a real challenge.

Finally, if opportunistic screening were an option, there remain questions about whether it would be effective in a dental practice setting. Clearly, dental healthcare workers are already trained to properly examine the mouth, but it has been suggested that patients at the highest risk of mouth cancer are the least likely to be regular attenders at dental practices.36 Further research is needed to undertake a retrospective analysis of cases to determine exactly the patient pathway to secondary care and the frequency and nature of contacts with healthcare professionals. This would help identify the potential nature and timing of appropriate interventions. Screening by non-medical or dental healthcare workers has been shown to be effective and utilising this group in alternative settings may be one of the most effective methods of improving early detection.

Summary and conclusions

As indicated above and illustrated in Table 1, studies into mouth cancer screening have only provided evidence to satisfy five of the 20 criteria required by the UK Screening Committee. The majority of issues have not been evaluated or have been shown not to meet the criteria. In some cases the criteria are partially met, but more research is needed. For example, there is little doubt that mouth cancer is an important health problem, but to satisfy criteria 1, there needs to be more research into the natural history of the disease. In particular, we need to fully understand the biology of potentially malignant lesions and we need biomarkers that can be used to develop a test which is more specific than COE and will identify only those lesions which are most likely to progress to cancer (Criteria 5). For these reasons, mouth cancer screening has not been implemented. In the meantime, oral healthcare workers should remain vigilant when examining the mouth and be alert to the presence of mucosal changes which may be associated with potentially malignant disorders or early mouth cancer. In addition, patients should be instructed about high-risk habits including alcohol and tobacco use.