Main

Testing for high-risk human papillomaviruses (HPV) has been proposed for primary screening for prevention of cervical cancer via the detection of its precursors (Sasieni and Cuzick, 2002; Cuzick et al, 2008; Franceschi et al, 2011; Murphy et al, 2011; Arbyn et al, 2012; Ronco et al, 2012; Wright et al, 2012). HPV testing is recognised as being more sensitive than cytology for the detection of high-grade cervical intraepithelial neoplasia (CIN) grade 2 or more severe (CIN2+) (Cuzick et al, 2006; Mayrand et al, 2007), although with lower specificity. HPV testing also has a higher negative predictive value at 60 months for cervical intraepithelial neoplasia grade 3 or more severe (CIN3+) than cytology at 24 months (van den Akker-van Marie et al, 2003; Dillner et al, 2008). Comparison of HPV vs cytology screening in randomised trials has resulted in considerable evidence supporting the use of HPV-based testing for increased detection of CIN2+ at initial screening and reduced subsequent risks of cervical cancer and its precursors (Naucler et al, 2007; Kitchener et al, 2009; Arbyn et al, 2012; Leinonen et al, 2012; Rijkaart et al, 2012a; Ronco et al, 2014). These findings have resulted in the scheduled adoption of primary HPV-based screening strategies in some countries (Health Council of the Netherlands, 2011; Australian Government, 2014).

The HPV testing for Cervical Cancer Screening Trial (HPV FOCAL) (Ogilvie et al, 2010, 2012) is a Canadian randomised clinical trial (RCT) comparing primary HPV testing with Liquid Based Cytology (LBC) for cervical cancer screening. Women were randomized into one of the three arms: Intervention, Control or Safety (Figure 1) (Ogilvie et al, 2010). Women in the Intervention and Safety arms received primary HPV testing with cytology triage at entry. Women in the Control arm received primary LBC testing at entry and at 24 months. Women in the Safety arm received LBC screening at 24 months and exited the study. Women in the Intervention and Control arms receive co-testing with HPV and LBC at 48 months and exited the study.

Figure 1
figure 1

HPV FOCAL study schematic. Subjects with unsatisfactory cytology were retested and if persistent (3 consecutive) were referred to colposcopy.

The Safety arm was included in HPV FOCAL to provide an early indication of the safety of extending to a 48-month screening interval as used in the Intervention arm because a 24-month interval was the community standard. The trial protocol specified that the Safety and Control arms were to be compared, on cervical intraepithelial neoplasia grade 2 or worse (CIN2+), at specified thresholds of subjects completing the 24-month screening round with a view to potential early stopping of the trial. This manuscript reports rates of disease detection, and resource utilisation of the final protocol specified comparison of the Safety and Control arms when accrual to the Safety arm was complete. Future reports will present results comparing the Intervention and Control arms over 48 months, when that the data is complete as specified in the protocol.

Materials and methods

Over 25 000 British Columbia women between the ages of 25 and 65 were randomised within HPV FOCAL (ISRCTN79347302). All women in HPV FOCAL were participants in the British Columbia Cervical Cancer Screening Program (CCSP) and patients of collaborating family physicians (Ogilvie et al, 2010). Study participants were required to not be pregnant, not have been screened within the previous 12 months, have no history of CIN2+ in the preceding 5 years and provide informed consent (Ogilvie et al, 2012).

Randomisation to the three study arms, Intervention, Control and Safety (Figure 1), was made with equal probability until 1 January 2011, when the planned sample size for the Safety arm (6200) was achieved. Thereafter, randomisation occurred equally to the Intervention and Control arms until the planned sample size for these arms was attained. Simple randomisation with equal probabilities of one-third were used to create a single assignment list to the three trial arms and subjects were allocated sequentially based upon the accession of their specimen at the central laboratory. Subjects and their physicians were blinded to their allocation until 24 months, unless follow-up of an abnormal test was required and they were then informed of the test used. Trial testing in the Control and Safety arms at baseline and at 24 months was as follows:

Control arm

Women received LBC testing at Baseline and those negative for intraepithelial lesion or malignancy (NILM) were recalled for LBC screening again at 24 months. Those with low-grade squamous intraepithelial lesion or worse (LSIL) were recommended for colposcopy. Those with atypical squamous cells undetermined significance (ASCUS) were triaged with an HPV test, and if positive were recommended for colposcopy and if negative were recalled at 12 months and retested with LBC. Those ASCUS or greater (ASCUS) were recommended for colposcopy and those NILM were returned to screening with LBC at 24 months. Screening at 24 months used the same management scheme as at baseline (Figure 1).

Safety arm

Women received HPV testing at Baseline and those testing negative were recalled for exit screening with LBC at 24 months. Those Baseline HPV positive were triaged with LBC and those ASCUS were recommended for colposcopy. Those with cytology triage of NILM were recalled 12 months later for HPV and LBC co-testing and recommended for colposcopy if HPV positive and/or ASCUS; those both LBC and HPV negative were returned to screening and recalled for LBC at 24 months for their final screen. LBC screening at 24 months used the same management scheme as in the Control arm (Figure 1) except that women ASCUS and HPV negative were retested 12 months later through the CCSP using conventional cytology.

If a woman, in any arm, had CIN2+ diagnosed, she exited the trial and future screening was directed by her family physician. If a woman had CIN1 or negative results at colposcopy, she was returned to HPV FOCAL screening within her randomisation group.

Trial participants had samples collected using the ThinPrep (Hologic Ltd, Toronto, ON, Canada) sample-processing protocol and screen and triage tests were performed on the same collected specimen. The HPV testing was performed using Digene Hybrid Capture 2 (HC2) high-risk HPV DNA Test (Qiagen Inc., Toronto, ON, Canada) and women high-risk HPV positive (HPV+) received triage by LBC (Cook et al, 2015).

The data were extracted for the two screening rounds, at entry and 24-months, for all women randomised to the Control and Safety arms prior to 1 January 2011. The rates of referral for colposcopy and CIN detected at Round 1 (Baseline and 12-month follow-up, if required), 24-month screen round (including 36-month follow-up, if required) and cumulatively for the two arms were calculated. Overall and age-specific rates are presented. Cumulative rates of CIN2+ and CIN3+ are reported based upon randomisation arm and on an intention-to-treat basis. Test results are reported as proportions based upon subjects undergoing testing. Significance tests were performed using the χ2 test for count data. Confidence intervals were based on the binomial distribution. Test of significance for odds ratios and confidence intervals was calculated using the asymptotic distribution of the log odds ratio (Breslow and Day, 1980).

The Safety arm was included in HPV FOCAL to monitor the early effect of the extended 4-year interval in the Intervention arm. By design, it was specified that comparison of the Safety and Control arms should be undertaken before accrual was complete to the trial. The trial protocol indicated that CIN2+ detection from LBC testing at 24 months in the Safety arm should be <80% of that observed in the Control arm. A formal group sequential analysis plan using Pocock boundaries (DeMets and Lan, 1995) was used in which the rate of CIN2+ detection at 2 years was compared between the Safety and Control arms at pre-specified numbers of Safety arm subjects completing the 24-month screening round (500, 1000, 1500, 2500 and 6200) using a standardised test statistic specified in the study protocol. The distribution of disease and resource use at the final analysis are presented.

This research was approved by Research Ethics Board of the University of British Columbia, BC Cancer Agency branch, approval number H06-04032.

Results

There were 6213 and 6104 women randomised to the Safety and Control arms prior to 1 January 2011, of which 6203 and 6075, respectively, had a satisfactory baseline screen. Figure 2 provides a consort diagram for subject accrual to the study. Randomisation to the two arms achieved balance by age when coded by decade (P=0.13), self-reported ethnicity in 6 categories (P=0.95), education level in 4 levels (P=0.90) and lifetime sexual partners in 4 categories (P=0.41). The age at randomisation distribution was <30: 8.1%, 30–39: 23.9%, 40–49: 32.6% and 50+: 35.4%.

Figure 2
figure 2

Consort diagram for the HPV FOCAL Safety and Control arms for testing at Baseline and at 24 months.

The results of Round 1 screening are given in Table 1. For baseline testing, the rate of ASCUS in the Control arm was 3.8 per 100 (95% CI=3.3–4.3) and the rate of HPV+/ASCUS in the Safety arm was 2.8 per 100 (95% CI=2.4–3.3). The amount of triage testing was higher in the Safety arm: 8.4% of women required reflex LBC testing in the Safety arm for a positive HPV result compared with 1.0% who required reflex HPV testing in the Control arm for an ASCUS result. Follow-up testing of subjects at 12 months was also more common in the Safety arm (5.2%) than in the Control arm (0.7%). Overall, more colposcopies were performed in Round 1 in the Safety arm (5.6%) than in the Control arm (3.2%). Twelve month follow-up testing resulted in the identification of more disease overall in Round 1 in the Safety arm than in the Control arm, with detection rates of 15.3 (95/6203) per 1000 vs 10.4 (63/6075) per 1000, RR=1.48 (95% CI=1.08–2.03) for CIN2 and 6.9 (43/6203) per 1000 vs 4.6 (28/6075) per 1000 (RR=1.50, 95% CI=0.94–2.42) for CIN3+. These are similar to Round 1 results already reported for the trial where the data from the Intervention arm are included (Ogilvie et al, 2012). The number of colposcopies per CIN2+ identified was higher in the Safety arm, 3.7, (95% CI=3.1–4.4) than in the Control arm 3.1, (95% CI=2.6–3.9), although this difference was not significant (P=0.23).

Table 1 Round 1 test results, referral to colposcopy and pathology findings for the safety and control arms

The results of the 24-month round of screening (including 36-month follow-up results) are given in Table 2. Women who did not have CIN2+ identified in either Safety or Control arms at the first round and who had not withdrawn or become ineligible (e.g., had hysterectomy or emigrated from British Columbia) were eligible for rescreening at 24 months. There were 5874 and 5839 not known to be ineligible for rescreening in the Safety and Control arms at 24 months and 5426 (92.3%) and 5130 (87.9%), respectively, who returned for screening; P<0.001 for difference in testing rates. No CIN2+ was detected at the 36-month follow-up test in the 24-month round. Among those tested, colposcopy was slightly more frequent in the Control arm, 19 per 1000 (95% CI=16–23), than in the Safety arm, 15 per 1000 (95% CI=12–18), with P=0.08 for difference. Less disease was identified in the Safety arm than in the Control arm at the 24-month screening round and the detection rates were 2.0 per 1000 vs 4.7 per 1000, RR=0.43 (95% CI=0.21–0.88), and 0.6 per 1000 vs 2.5 per 1000, RR=0.22 (95% CI=0.06–0.77) for CIN2+ and CIN3+, respectively. The ratio of CIN2+ detection at the 24-month round in the Safety arm compared with the Control arm was 43%, which met the threshold of <80% specified in the HPV FOCAL trial protocol. In both arms, rates were higher in Round 1 than in the 24-month round for CIN2+ and CIN3+: Safety arm RR=7.6 (95% CI=4.1–14.1) and Control arm RR=2.2 (95% CI=1.4–3.6) for CIN2+.

Table 2 Results of LBC testing at the 24-month screen round in the safety and control arms

Table 3 provides summary results on the disease detection by age at entry, screening round and study arm. Generally, the Safety arm had higher rates of detection at the initial round than the Control arm, although confidence intervals, especially for CIN3+, were generally quite wide, and for the 30–34-year-age-group, the observed difference went in the opposite direction. The rates of detected disease declined with increasing age in both arms in Round 1 and in the 24-month screen rounds (Table 3).

Table 3 Number of women randomised, cases of disease and rates of disease detection, with confidence intervals, for Round 1 and Round 1 plus 24-month round by age group at randomisation

To anticipate the relative performance of HPV screening, every 4 years vs LBC screening every 2 years, results from Round 1 testing in the Safety arm can be compared with the sum of Round 1 plus 24-month results from the Control arm. The cumulative rates of disease detection for the combined Round 1 and 24-month screening round results for the Control arm were similar to the Round 1 rates in the Safety arm (Table 3). For all ages combined, the odds ratio of disease detection for first round Safety arm vs both rounds of the Control arm were RR=1.07 (95% CI=0.80–1.43) and RR=1.03 (95% CI=0.67–1.57) for CIN2+ and CIN3+, respectively. Excluding the youngest age-group (25–29 years) did not substantially alter the relationship with RR=1.04 (95% CI=0.73–1.47) and RR=0.92 (95% CI=0.54–1.56) for CIN2+ and CIN3+, respectively.

Table 4 provides age-specific rates of specimen collection, test performance and colposcopy for Round 1 in the Safety arm, and combined Round 1 and 24-month screening round in the Control arm. In Table 4, a screening test and a resulting triage test is counted as one specimen and two tests. Overall, the likelihood of referral to colposcopy was higher in the Safety arm than in the Control arm, RR=1.17 (95% CI=1.00–1.38). In subjects over 30 years of age, the increase in referral to colposcopy was not significant RR=1.05 (95% CI=0.87–1.26), whereas it was significant in those under 30 years of age RR=1.60 (95% CI=1.14–2.24). Numbers of screening specimens and tests performed were higher in the Control arm than in the Safety arm overall, and for each of the age groups. The magnitude of the difference was smallest for the youngest group (age 25–29 years) and increased with age.

Table 4 Mean rates, with 95% Confidence Intervals, of specimen collection, tests and colposcopies performed as a result of Round 1 in the Safety arm and for combined Round 1 and 24-month round testing in the Control arm by age at study entry

Discussion

These results illustrate that higher rates of both CIN2+ and CIN3+ were detected at Round 1 screening in the Safety arm, where screening was HPV based, compared with the Control arm, where screening was LBC based. Cumulative disease detection from the combined Round 1 and 24-month screening round in the Control arm was similar to that of Round 1 in the Safety arm. Disease detection in age subgroups was consistent with the preceding findings, although confidence intervals were wide. Cumulative referral to colposcopy at Round 1 and 24-month screening round for the Control arm was approximately equal to that of Round 1 for the Safety arm, indicating that overall referral to colposcopy would be similar for 2-year LBC screening and every 4-year screening using HPV. The exception to this was the youngest group enrolled (25–29 years) where referral to colposcopy was significantly elevated for the Safety arm. Results presented show the HPV FOCAL trial met the protocol-specified target for CIN2+ detection in the Safety arm vs Control arm (<80%) at 2 years with a relative rate of 43% observed. As anticipated, the need for testing and specimen collection was lower in the Safety arm than in the Control arm, indicating the potential for resource savings in all age groups.

The purpose of the Safety arm was to monitor incident disease and provide an early indication of the effect of the extended screening interval in the Intervention arm. Previous cohort studies have indicated that the negative predictive value of the HPV test provides extended protection from disease development compared with cytology testing (Dillner et al, 2008; Arbyn et al, 2012). Thus screening protocols for HPV-based testing in Canada and elsewhere (Murphy et al, 2011; Australian Government, 2014) propose extended intervals for subsequent testing for those screening HPV negative compared with pre-existing protocols for cytology testing in the same jurisdictions. The inclusion of HPV as a co-test with cytology also led to increased interval recommendations in the United States (Saslow et al, 2012; Moyer and on behalf of the U.S. Preventive Services Task Force, 2012). The HPV FOCAL trial was established to examine the effect of extending the interval in British Columbia where screening with cytology was recommended on a biennial schedule at the time the trial commenced. The results presented align with findings from other trials in that increased rates of prevalent disease (CIN2+ and CIN3+) are observed at the initial screen in the HPV tested arms (Naucler et al, 2007; Kitchener et al, 2009; Ronco et al, 2010; Arbyn et al, 2012; Rijkaart et al, 2012a; Ronco et al, 2014).

One of the major advantages of HPV-based screening is the consistency of automated measurement, whereas cytology requires subjective interpretation. Variation in the sensitivity of cytology between jurisdictions has been shown to be considerable (Nanda et al, 2000) so that the results of any individual study comparing HPV and cytology screening will depend upon the quality of cytology interpretation. British Columbia has been conducting cervical screening for over 50 years (Boyes et al, 1982; Anderson et al, 1988; Morrison et al, 1996) and its screening program has utilised a single laboratory with a well-established quality control system. British Columbia has the lowest rate of invasive cervical cancer in Canada (Canadian Cancer Society’s Advisory Committee on Cancer Statistics, 2013) and Canada is a low incidence jurisdiction globally (Arbyn et al, 2011). In the data presented, the detection rate ratio for cytology compared with HPV testing was 67% for the detection of CIN2+ in the first round. Planned screening with both LBC and HPV at 48 months in the Intervention and Control arms will provide further data on the longer-term effects in this population.

In this trial, cytology triage and follow-up testing at 12 months was used to manage HPV-positive women. Other algorithms are being considered (Rijkaart et al, 2012b; Australian Government, 2014; Wright et al, 2015; Wentzensen et al, 2016) and their utilisation will alter the overall efficacy of HPV-based screening and the demand it places on diagnostic services (Coldman et al, 2015). In the future it is likely that new management strategies of HPV-positives cases will result in improved disease detection and/or decreased resource use. Preventing cervical cancer is dependent not only on identification and treatment of pre-invasive disease but also on its earliest identification. Although two rounds of cytology resulted in similar levels of disease detection, some cases were detected later in the Control arm (24 months) so that the risk of invasive cancer will be lower in those screened with HPV as has been observed in other trials (Arbyn et al, 2012; Ronco et al, 2014). More definitive results of the relative effect of HPV screening at a lower frequency than LBC will be available from future analyses.

This study was conducted in women enrolled in the Cervical Screening Program in British Columbia residing in Vancouver or Victoria, who were patients of participating physicians and who provided informed consent. Participants were well educated (Ogilvie et al, 2010) and may be anticipated to have lower disease rates than the general population. Despite this, in Round 1, women randomised to the Control arm had rates of disease detection at least as high as that reported for the population (BC Cancer Agency, 2014). This may be related to the high rate of annual screening in the BC population (BC Cancer Agency, 2014) with over 30% returning within 18 months of a previous negative screen. Twenty-four month return rates were lower for the Control than Safety arm despite double-blind screening allocation. This arose because of increased returns associated with higher baseline follow-up screening in the Safety arm and may also have been because this was the final study screen for that arm.

Conclusion

A single round of HPV-based screening was found to be as effective at detecting high-grade CIN as two rounds of LBC screening, and it appears that doubling the screening interval posed no risk to enrolled women. LBC testing at 24 months found less disease in those initially screened with HPV than those initially screened with LBC. Colposcopy utilisation for a single round of HPV screening was similar to two rounds of LBC screening.