Persistent infection with oncogenic Human Papillomavirus (HPV) is necessary for cervical carcinogenesis. Although evidence suggests that the vaginal microbiome plays a functional role in the persistence or regression of HPV infections, this has yet to be described in women with cervical intra-epithelial neoplasia (CIN). We hypothesised that increasing microbiome diversity is associated with increasing CIN severity. llumina MiSeq sequencing of 16S rRNA gene amplicons was used to characterise the vaginal microbiota of women with low-grade squamous intra-epithelial lesions (LSIL; n = 52), high-grade (HSIL; n = 92), invasive cervical cancer (ICC; n = 5) and healthy controls (n = 20). Hierarchical clustering analysis revealed an increased prevalence of microbiomes characterised by high-diversity and low levels of Lactobacillus spp. (community state type-CST IV) with increasing disease severity, irrespective of HPV status (Normal = 2/20,10%; LSIL = 11/52,21%; HSIL = 25/92,27%; ICC = 2/5,40%). Increasing disease severity was associated with decreasing relative abundance of Lactobacillus spp. The vaginal microbiome in HSIL was characterised by higher levels of Sneathia sanguinegens (P < 0.01), Anaerococcus tetradius (P < 0.05) and Peptostreptococcus anaerobius (P < 0.05) and lower levels of Lactobacillus jensenii (P < 0.01) compared to LSIL. Our results suggest advancing CIN disease severity is associated with increasing vaginal microbiota diversity and may be involved in regulating viral persistence and disease progression.
Persistent infection with a high-risk oncogenic Human Papillomavirus (HPV) subtypes, most commonly 16 and 18, is a necessary, although not sufficient, condition for development of invasive cervical cancer (ICC) and its precancerous precursor; cervical intra-epithelial neoplasia (CIN)1. Although HPV infection is very common in sexually-active women2, the majority of infections are transient3. Only a small proportion of women infected with the virus goes on to develop clinically significant pre-invasive lesions and, if not treated, invasive malignant disease. Mechanisms of persistence of HPV infection are not well understood.
The vaginal microenvironment plays an important role in reproductive health. Commensal vaginal Lactobacillus spp. are thought to defend against pathogens and sexually transmitted infections4 through maintenance of a hostile pH5, production of species-specific metabolites, bacteriocins and through adherence to mucous and disruption of biofilms6,7,8,9. Next generation sequencing (NGS) based studies have facilitated detailed characterisation of the “healthy” vaginal microbiome and shown that 5 major community-state types (CSTs) exist; CST I, II, III and V are dominated by Lactobacillus crispatus, L. gasseri, L. iners and L. jensenii respectively, whereas CST IV has characteristically low numbers of Lactobacillus spp. and increased diversity of anaerobic bacteria10. Longitudinal studies of the vaginal microbiome using NGS indicates that bacterial community structure is dynamic and hormonally influenced with a propensity to become less stable during menstruation11 and conversely more stable and less diverse during normal pregnancy12,13. The stability and composition of the vaginal microbiome may play an important role in determining host innate immune response and susceptibility to infection. Bacterial vaginosis (BV), a condition characterised by Lactobacillus spp. depletion, overgrowth of anaerobic species, and higher vaginal pH has been associated with increased transmission rates of sexually-transmitted infections14 and human immunodeficiency virus (HIV)15. Conversely, it has recently been reported that viral infection of the cervix during murine pregnancy increases susceptibility of ascending vaginal bacterial infection through sensitisation and priming of the host innate immune system16.
Relatively little is known about the mechanisms associated with clearance or persistence of HPV infection. Along with higher rates of HPV infection, BV has been associated with delayed clearance of the virus and with CIN, suggesting that a diverse, Lactobacillus-depleted microbiome may play a mechanistic role17,18,19,20. A recent study of 68 HPV-discordant monozygotic female Korean twins using NGS showed that HPV-positive twins had lower levels of Lactobacillus spp. and increased counts of Fusobacteria and Sneathia spp. compared to their HPV-negative twins21. Consistent with these findings, analysis of vaginal swabs collected longitudinally for 16 weeks from 32 sexually active women found that a Lactobacillus spp.-depleted, Atopobium spp. enriched (CST IV) community structure is associated with slowest regression of HPV whereas a Lactobacillus gasseri-dominated microbiome (CST II) is associated with the most rapid regression rates for HPV22.
While there is evidence of an association between vaginal microbiome structure and HPV infection, the potential relationship between the vaginal microbiota and CIN disease progression has yet to be investigated. In this study, we characterised the vaginal microbiome structure and diversity in women with pre-invasive cervical intraepithelial neoplasia, invasive cervical cancer, and in healthy controls to assess how CSTs may correlate with disease presence and severity. We hypothesised that increasing microbiome diversity is associated with increasing CIN severity.
We enrolled 169 women into the study who were classified into 4 groups; normal (n = 20), low-grade squamous intraepithelial lesion (LSIL) (n = 52), high-grade squamous intraepithelial lesion (HSIL) (n = 92) and ICC (n = 5). Table 1 shows the characteristics of each group. No difference in the mean age of the population was determined (mean 31, SD 5.08, range 23–45, P = 0.071). No category showed systematic bias with respect to the four disease-groups after adjustment for multiple hypothesis testing. There was equal distribution in the samples collected at the follicular or luteal phase of the cycle and in the rate of women that had intercourse within 48 hours from sample collection. Histology was available in all HSIL and cancer cases (100%), and for 69% (36/52) and 5% (1/20) of LSIL and normal cases, respectively.
The structure of the vaginal microbiome correlated to the disease severity
In total 8 409 192 reads were obtained from 169 samples with an average number of reads per sample of 49 759 and the mean and median read lengths of 513 and 520 bp respectively. To avoid sequencing bias, operational taxonomic units (OTUs) were randomly sub-sampled to the lowest read count of 183, which retained 97% of OTU counts (data not shown) and still provided coverage of >96% for all samples. Following removal of singletons and rare OTUs, a total of 49 taxa were identified in the vaginal microbiome of the study cohort.
Initial assessment of vaginal community structure was performed using principal component analysis (PCA) of species sequence data in the context of disease grade (normal, LSIL, HSIL and ICC) (Fig. 1). Three major clusters were identified which represented samples dominated by either L. crispatus, L. iners or samples depleted of Lactobacillus spp. with higher diversity. The results of the analysis at class level are presented in Supplementary Figure 1.
Hierarchical clustering analysis (HCA) of the sequence data using nearest neighbour linkage at species level (Fig. 2) identified 5 major clusters that exhibited bacterial community structure consistent with previously described vaginal microbiome community state types (CSTs); CST I: L. crispatus-dominated, CST II: L. gasseri-dominated, CST III: L. iners-dominated, CST IV: Lactobacillus-depleted and CST V: L. jensenii-dominated10 (Fig. 2). The results of the analysis at class level are presented in Supplementary Figure 2 & Supplementary Table 1.
The rates and frequency of the different CSTs (I, II, III, IV & V) were compared between CIN disease severities and healthy controls (Table 2; Supplementary Table 3; Fig. 2). CST I was the most frequent CST in our study cohort (70/169, 41% of all patients), followed by CST III (47/169, 28%), CST IV (40/169, 24%), CST V (7/169, 4%) and CST II (5/169, 3%). Higher rates of CST IV (Lactobacillus-depleted, high diversity) were associated with increasing disease severity with CST IV observed twice as frequently in women with LSIL, three times as frequently in women with HSIL and four times as frequently in women with ICC when compared to disease free controls (normal = 2/20, 10%; LSIL = 11/52, 21%; HSIL = 25/92, 27%; cancer = 2/5, 40%). Conversely, frequency of CST I (Lactobacillus crispatus-dominant) was lower with increasing disease severity (normal = 10/20, 50%; LSIL = 22/52, 42%; HSIL = 37/92, 40%; cancer = 1/5, 20%). The number of ICC cases was small for any valid conclusion. Although the results of the analysis did not attain significance given the modest sample size, there appears to be a correlation of CST IV and increasing disease severity.
A similar distribution of CST IV was observed when women with HPV/Atypical squamous cells of undetermined significance (ASCUS) and LSIL changes were analysed as two separate groups (normal = 2/20, 10%; ASCUS = 5/26, 19%; LSIL = 6/26, 23%; HSIL = 25/92, 27%; cancer = 2/5, 40%) (Table 2; Fig. 2; Supplementary Table 3). These analyses are suggestive of association between CST IV and increasing disease severity, and are consistent with the expected direction of effect, but do not attain significance due to modest sample-size.
Both species richness (Fig. 3A) and alpha-diversity (Fig. 3B,C) indices were higher in CST IV, compared to the other CSTs particularly CST I (P < 0.001) and CST III (P < 0.01) (Fig. 3). Consistent with increased rates of CST IV in high grade disease, vaginal microbiota richness and diversity were also found to be higher in women with high-grade disease, compared to low-grade disease, and lowest in normal women but this was not statistically significant (Fig. 4) (Supplementary Table 4).
The structure of the vaginal microbiome correlated to the HPV status and genotype
HPV status was available for 117 women in our cohort. CST IV was most frequently observed in high-risk HPV (HR-HPV) positive as compared to HR-HPV negative women (26/93, 28% versus 5/24, 21%). HPV negative women were most likely to have CST I (13/24, 54%) as opposed to HPV positive women (38/93, 41%). The rates for the other CSTs were comparable between HR-HPV positive and negative subjects (Table 2; Fig. 2).
Of 93 women who were HPV positive, genotyping was available for 62 subjects. The rate of CST IV was higher for women infected with HPV16 (9/31, 29%) when compared to HPV18 (1/5, 20%) or women with other high-risk oncogenic types (5/26, 19%) although did not reach significance, likely due to sample size (Table 2; Fig. 2).
The rate of CST IV was no different for Normal/LSIL HPV negative versus HPV positive individuals (3/20, 15% versus 7/34, 21%), but substantially higher for HSIL or worse (HSIL = 19/58, 33%; ICC = 2/5, 40%), suggesting that the presence of a high diversity Lactobacillus-depleted microbiome may be more strongly correlated to the presence of clinically significant pre- or invasive disease rather than the presence of the virus itself (Table 2; Supplementary Table 3; Fig. 2). Again, the results did not reach statistical significance.
Identification of vaginal microbiota composition markers of CIN disease severity
Linear discriminant analysis (LDA) effect size (LEfSe) modeling was used to identify differences in microbiota composition that may be related to increasing disease severity (Fig. 5). Due to sample size restrictions, we limited our comparison to LSIL versus HSIL patients. In the LSIL group, significant over-representation of Lactobacillus jensenii (P < 0.01) (Fig. 5A,B,F) and Lactobacillus coleohominis (P < 0.05) (Fig. 5B) were observed. In contrast, HSIL samples were found to have significantly higher levels of Peptostreptococcus anaerobius (P < 0.05), and Anaerococcus tetradius (P < 0.05) (Fig. 5A–D). HSIL samples were also found to have significant overrepresentation of Fusobacteria- primarily Sneathia sanguinegens (P < 0.01) (Fig. 5A,B,E; Supplementary Figure 3).
We detected a two-fold increase in the rate of a CST IV vaginal microbiome in those women with LSIL, a three-fold increase in women with HSIL and a four-fold increase in women with invasive cancer compared to controls. Increasing disease severity was also associated with decreasing relative abundance of Lactobacillus spp. A recent longitudinal study by Brotman et al. reported that women with a high diversity, Lactobacillus spp. depleted (CST IV) vaginal microbiome were most likely to become HPV-positive, and to have persistent HPV infection22. Our findings suggest that vaginal microbial diversity is associated not only with HPV infection, but also with advancing CIN severity, but does not attain significance due to modest sample-size.
It is currently unclear if a CST IV microbiome is a causal factor in progression of CIN or a consequence of it. BV, a condition diagnosed using traditional culture techniques, in part by Lactobacillus spp. depletion and increased diversity of potentially pathogenic gram negative bacteria, is associated with significantly higher rates of HPV infection and CIN19,20. We used NGS techniques to further examine this association and identified Sneathia sanguinegens as a biomarker of HSIL, which has previously been shown to associate with HPV infection21. Two other BV-associated bacteria; Peptostreptococcus anaerobius and Anaerococcus tetradius were also found to be markers of HSIL in our cohort. This further suggests that specific anaerobic species may play a role in disease progression, rather than simply signifying presence of HPV infection.
Bacteria are increasingly appreciated as a key player in the initiation and progression of other malignancies including colorectal cancer23,24,25 where Fusobacteria has been identified as a potential pro-carcinogenic bacterial class25,26. Our findings show that this class, and specifically Sneathia sanguinegens, is discriminatory of HSIL suggesting similar mechanisms, likely involving activation of inflammatory pathways, may be involved in the cervix. Approximately one third of premalignant lesions go on to develop invasive cervical disease, if untreated. It is possible that the women in our cohort with CST IV microbiomes are those at highest risk of progression to clinically significant invasive lesions, yet our findings only demonstrate association, not causality, between cervical pre-cancer, persistent HPV infection and the structure of the vaginal microbiota. Whilst women with BV display higher rates of HPV infection14,19 the virus has been shown to induce a pro-inflammatory environment to facilitate integration of viral DNA27,28,29,30,31. Thus HPV infection itself may adversely impact on the host’s immune defences and mucosal metabolism leading to aberration of vaginal microbiota, thus promoting viral persistence and disease progression.
Lactobacillus spp. are classically regarded as ‘protective’, yet our study supports other previous reports which suggest the clinical picture may be dictated by the specific species present, and that the genus as a whole may not be regarded as protective in its entirety. Brotman and colleagues reported that an L. iners dominated vaginal microbiome (CST III) was associated with HPV-infection whereas vaginal microbiomes dominated by L. gasseri (CST II) exhibited the most rapid clearance of HPV infection22. While we did not observe over-representation of CST II or III in our CIN cohorts, the reduced prevalence of L. iners (CST III) concurrent with increased rates of CST IV in CIN, compared to controls may represents a shift from L. iners (CST III) towards a CST IV type microbiome with the acquisition of CIN. Unlike the majority of Lactobacillus spp., L. iners does not produce H2O2, which has been shown to have antibacterial and antiviral properties32,33,34. Consistent with the possibility that H2O2-producing Lactobacillus spp. are protective against CIN progression, we detected higher prevalence of L. jensenii and L. coleohominis, both H2O2-producing lactobacilli, in women with LSIL compared to HSIL, suggests this species may be particularly protective in preventing progression of the dysplastic and ultimately carcinogenic process. Furthermore Lactobacilli spp., have been shown to be cytotoxic when co-cultured with cervical cancer cells in vitro, but not normal cells, independent of lactic acid concentrations, highlighting interactions amongst cervical cells, the microbiota and the mucosal metabolic milieu35.
Environmental and hormonal factors are also known to modulate the vaginal microbiome. Smoking has been previously correlated to HPV persistence and CIN as well as Lactobacillus spp. depletion and dysbiosis36. Although women in our study with high-grade disease were more likely to be smokers, the differences did not reach statistical significance and no correlation between vaginal microbial composition and smoking status was identified. A sub-analysis of smokers showed that the prevalence of CST IV increased with disease severity indicating that a high-diversity microbiome is correlated to disease status rather than smoking as a potential confounder.
Future therapeutic strategies permitting the modulation of the vaginal microbiome with oral or topical regimes to a Lactobacillus spp.-dominant microbiome may be able to promote HPV clearance or even reverse the process of tumourigenesis, reducing the morbidity resulting from these conditions and their treatments37,38. Probiotics have been used in a similar manner to reduced recurrence of BV, through accurate, targeted modification of the bacterial community39.
Further research is required to understand the molecular mechanisms involved in the complex role that bacterial communities can play in the development of cancer. An understanding of the functional properties of the community state types is required in order to complement what we already know about their structure. Further longitudinal studies are needed to investigate the changes and stability of the microbiome during transition from acute HPV infection, to persistent infection through to development of CIN and cancer.
In summary, this is the first study to correlate the structure of the vaginal microbiome with presence of CIN in women of reproductive age. Our findings suggest that the presence and prevalence of specific vaginal microbiome CSTs may be involved in the pathogenesis of CIN and cervical cancer. We have also identified 5 bacterial species that could help to differentiate low- and high-grade disease, and with further research these may improve our understanding of the role of the bacterial microenvironment in HPV persistence, development of CIN and progression to cancer. Although the development of HPV vaccines will be the main prevention strategy for this disease, its implementation in many settings can be challenging due to financial, cultural barriers and lack of infrastructure. Microbiome modulation with pre- and probiotics towards stable Lactobacillus-dominant vaginal community structure that promotes HPV clearance, could represent low-cost future therapeutic strategies.
Our findings may be of future clinical and therapeutic relevance and raise the clinical question as to whether women with a high diversity vaginal microbiome and cervical pathology should be subject to more intense colposcopic surveillance and/or treatment and whether the examination of the vaginal microbiome could be used as a triage tool for this population. Future longitudinal studies should aim to elucidate the causality between HPV infection, CIN, the immune microenvironment and the vaginal microbiome and increase our understanding of the role that vaginal bacteria in the tumour microenvironment.
Study population – Inclusion and Exclusion criteria
Ethical approval was obtained from the National Research Ethics Service Committee London – Fulham (Approval number 13/LO/0126). All experiments were performed in accordance with the approved guidelines. All patients gave informed consent. We included pre-menopausal non-pregnant women, 18–45 years of age who attended the colposcopy and gynaecology clinics at Imperial College NHS Healthcare Trust. Women were included irrespective of their ethnicity, parity, smoking habits, phase in their cycle and use of contraception. The type of contraception and the time of their cycle (follicular or luteal) were documented. Women who were HIV or hepatitis B/C positive, with autoimmune disorders, who received antibiotics or pessaries within 14 days of sampling, or had a previous history of cervical treatment were excluded. Detailed medical and gynaecological history was collected including time since last sexual intercourse and douching practices. Ethnicity was self-reported as Caucasian, Asian or Black. Histology was used to classify patients into groups. If histology was available from both punch biopsies and treatment cones, the most severe lesion was documented. If histology was not available as not clinically indicated (i.e. healthy controls, low-grade lesions under cytological surveillance), cytology was used for classification.
Sample collection and processing
A sterile, disposable speculum was inserted, without lubricant, and a swab was taken from the posterior vaginal fornix using a BBLTM CultureSwabTM containing liquid Amies (Becton Dickinson, Oxford, UK) and stored immediately at −80 °C. Whole-Genomic bacterial DNA was extracted from the swabs using a QiAmp Mini DNA kit (Qiagen, Venlo, Netherlands) as previously described .
HPV DNA test and 16/18 genotyping was performed according to manufacturer’s guidelines using the Abbott RealTime High Risk (HR) HPV assay on Abbott M2000 platform; a clinically validated in vitro polymerase chain reaction (PCR) assay with identification of HPV-16, -18 and 12 other HR HPV subtypes (31, 33, 35, 39, 45, 51, 52, 56, 58, 59, 66, 68)40.
Illumina MiSeq sequencing of 16S rRNA gene amplicons
The V1-V2 hypervariable regions of 16S rRNA genes were amplified by PCR using a forward and reverse fusion primer as previously described12. Sequencing was conducted at Research and Testing Laboratory (Lubbock, TX, USA).
16S rRNA gene sequence analysis
Sequence data were analysed in Mothur using the MiSeq SOP Pipeline41 Sequence reads were quality checked and normalised to the lowest number of reads. Singleton OTUs and OTUs < 10 reads in any sample were collated into OTU_singletons and OTU_rare phylotypes respectively, to maintain normalisation and to minimise artefacts. OTUs were defined using a cut off value of 97% and result data analysed using Vegan package within the R statistical package for assessment of microbial composition and diversity (R Development Core Team 2008). OTU taxonomies (from Phylum to Genus) were determined using the RDP MultiClassifier script to generate the RDP taxonomy42 while species level taxonomies of the OTUs were determined using the USEARCH algorithm combined with the cultured representatives from the RDP database43. Alpha and beta indices were calculated from these datasets with Mothur and R using the Vegan package.
Subjects were analysed in 4 different phenotype subgroups; normal, LSIL, HSIL and ICC. We included women with HPV changes or ASCUS in the LSIL category and further analysed them separately. Furthermore, we compared HR-HPV positive versus negative women, irrespective of the disease status and also assessed separately women positive for HPV16 versus HPV18 versus other high-risk (negative for HPV16 and HPV18) oncogenic subtypes. Finally, we used data from both the disease and HPV status and compared normal/LSIL HR-HPV negative women, normal/LSIL HR-HPV positive women, HSIL HR-HPV positive and ICC patients.
Analysis of statistical differences between the vaginal microbiome of patient groups was performed using the Statistical Analysis of Metagenomic Profiles (STAMP) package44. Data were subjected to multivariate analysis using PCA and HCA by nearest neighbour linkage with a clustering density threshold of 0.75.
To assess potential ascertainment bias of selected clinical characteristics with respect to four phenotype categories of interest, we performed fisher’s exact test for each of the following characteristics (age, ethnicity, parity, smoking, menstrual cycle, contraception and time since last intercourse). To analyse the importance of CSTs with respect to specific phenotype categories, we tested whether CSTs are significantly over or under-represented in any category. For this purpose we created a CST indicator variable, whereby CST = 1 for samples that could be assigned to the given CST and CST = 0 for all other samples. We performed a fisher’s exact test on the corresponding contingency table. Analyses were performed using R and false discovery rate adjustment (Benjamin & Hochberg) was applied to correct p-values. P-values and q-values < 0.05 were considered significant.
Accession code: Public access to sequence data and accompanying metadata can be obtained at the European Nucleotide Archive’s (ENA) Sequence Read Archive (SRA) (accession number PRJEB7756).
How to cite this article: Mitra, A. et al. Cervical intraepithelial neoplasia disease progression is associated with increased vaginal microbiome diversity. Sci. Rep. 5, 16865; doi: 10.1038/srep16865 (2015).
We thank all the participants of the study. Our work was supported by the British Society of Colposcopy Cervical Pathology (Jordan/Singer Award), the Imperial College Healthcare Charity, Genesis Research Trust and the Imperial Healthcare NHS Trust NIHR Biomedical Research Centre (P45272). DAM is supported by a Career Development Award from the Medical Research Council (MR/L009226/1).
About this article
Nature Reviews Microbiology (2017)