Abstract
Human papillomavirus (HPV) types differ by geographic location and the ethnicity of the human host, which may have implications for carcinogenicity. HPV35 is one of the least frequently identified high-risk types in North America and Europe but was the most common high-risk HPV (hrHPV) infection in a cohort in rural Zimbabwe. Whole genome analysis is limited for HPV35; no such studies have been performed in Zimbabwe. Of 648 women in the initial cohort in Zimbabwe, 19 (19/648, 2.9%) tested positive for HPV35, and eight samples were successfully sequenced for HPV35. The maximum number of sequence variants for the whole genome was 58 nucleotides (0.7%) compared to the prototype (58/7879). The maximum number of sequence variants in E6 and E7 was 3 (3/450, 0.7%) 2 (2/300, 0.7%), respectively. These are the first HPV35 whole genome sequences from Zimbabwe, and these data further lend support to the carcinogenicity of HPV35 despite limited sequence heterogeneity. Further studies to determine carcinogenic effects and impact of HPV vaccinations are warranted, especially in sub-Saharan Africa.
Similar content being viewed by others
Introduction
Human Papillomaviruses (HPV) are circular double-stranded DNA viruses known to infect humans. The genome is approximately 7,900 base pairs and is characterized by early genes (E1, 2, 4–7), late genes (L1-2), long control region (LCR), and a variable non-coding region (between E5 and L2)1. More than 200 HPV types have been identified, categorized by genetic structure and mucosal tropism2. The Alphapapillomavirus genus primarily infects human mucosal surfaces, and the types therein are classified into clades and low-risk, intermediate-risk (or possible, probable carcinogenic) and high-risk types, depending on their carcinogenic potential3.
Most cervical cancers are caused by HPV16 and HPV18, together comprising ~70% of cervical cancers4. Human papillomavirus 16 (HPV16) is classified in clade alpha-9, along with other carcinogenic types including HPV31, 33, 35, 52, and 585. Among these types, HPV35 is one of the least frequently identified in North America and Europe but is commonly identified in studies from sub-Saharan Africa6,7. For example, studies in Mozambique found HPV35 to be the most common hrHPV type among hrHPV-positive women and women with cervical neoplasia8, as well as the fourth most frequently detected type in cervical cancers9. Similarly, work from Malawi found HPV35 to be the third most frequently detected hrHPV type in cervical cancers10. Furthermore, HPV35 is the most common hrHPV among women in rural Zimbabwe (19/648, 2.9%)11, and is the fifth most frequently identified hrHPV type in cervical cancers in Zimbabwe12,13. While specific genetic variants have been associated with carcinogenesis in other hrHPV types, HPV35 sequence data is relatively limited1,2,14,15,16. The present study describes 8 complete HPV35 whole genome sequences from self-collected cervical swab specimens obtained from women in rural Zimbabwe.
Results
HPV35 whole genome sequences were obtained from previously collected cervicovaginal exfoliated cells from a study of hrHPV distribution in rural Zimbabwe11. HPV35 represented the largest proportion of HPV infections in that cohort; 2.9% of women (19/648) provided self-collected cervical swabs testing positive for HPV35. Of these 19 samples, mixed infections were excluded in order to evaluate for HPV35 variant associations with cytologic testing, and only available samples were extracted and sequenced (N = 9). Whole HPV35 genomes were successfully sequenced from 8 samples.
Whole genome sequence analysis
Figure 1 shows the complete phylogenetic tree for the eight Zimbabwean samples when compared to the HPV35H reference, as well as representative A1 and A2 sublineage sequences from different geographic locations. Six samples are best classified as sublineage A1, while two are consistent with sublineage A2. The maximum number of nucleotide sequence variants compared to the prototype was 58 (58/7879; 0.7%), and the remainder of the sequences varied by 0.5% or less.
Of these successfully sequenced, HPV35–positive women, 3 (3/8, 37.5%) were HIV seropositive. There were no significant differences in HIV seropositivity or duration of antiretroviral therapy and HPV35 variants or carcinogenicity. Cytologic testing with liquid-based Pap smears was conducted concurrently with VIAC. Cytology results were obtained for 37.5% (3/8) of the HPV35 samples sequenced, including 1 negative for intraepithelial lesions or malignancy (NILM) and 2 high-grade intraepithelial lesions (HSIL; ZM435 and ZM621). VIAC was considered “precancer” for the two HSIL cases, as well as “precancer” for the sample considered negative by Pap cytology analysis. Both A2 samples were associated with HSIL.
E6 gene
E6 gene variants are described in Table 1. The variants 127T > C, 136T > C, and 341T > C, have previously been identified14,16. A newly identified synonymous variant, 193G > A, was found in one sample tested, ZM392. The maximum number of nucleotide sequence variants compared to the E6 prototype was 3 (3/450, 0.7%).
E7 gene
E7 gene variants are provided in Table 2. These variants were 675T > C and 682T > C. The maximum number of nucleotide sequence variants compared to the E7 prototype was 2 (2/300, 0.7%). Two samples, ZM435 and ZM621, had the same previously described synonymous variant (675T > C) associated with sublineage A22,14. A newly described 682T > C variant was also found in sample ZM435, as well as ZM464. This non-synonymous variant results in a Gly41Cys amino acid change.
LCR polymorphism
LCR variants are shown in Table 3. These variants were 7156C > A, 7215T > G, 7261A > C, 7394A > C, 7535T > C, 7894C > A, and 7902G > A. A previously described 16-bp insertion was identified in 8 of 8 sequences15,16. The maximum number of nucleotide sequence variants compared to the prototype was 4 (4/770, 0.5%). Variants 7261A > C, 7394A > C, and 7535T > C have been previously described14,15,16 and occurred frequently in these sequences. Newly described variants include 7156C > A, 7215T > G, 7894C > A, and 7902G > A.
Discussion
This is one of the few studies examining genetic polymorphisms throughout the whole genome of HPV35, the most common hrHPV type identified in a community-based study conducted in rural Zimbabwe from 2016–201711. Importantly, HPV35 is frequently identified in women with cervical dysplasia and cervical cancers in numerous studies in Southern sub-Saharan Africa8,9,10,11,12,13,15,16,17,18,19. Consistent with the literature, the data presented here demonstrates that HPV35 genetic variation is low relative to the variation observed in other hrHPV types2,14,15,16,20,21,22,23. In this study, the maximum nucleotide sequence variation compared to the prototype whole genome was 0.7%, and the remainder of the whole genome sequences varied by 0.5% or less. The limited variation observed in HPV35 may represent recent divergence from the most recent common ancestor of the HPV16, HPV31, and HPV35 clade2.
Consistent with the low level of genetic variation, HPV35 is divided into two sublineages, A1 and A2, rather than separate variant lineages, which are defined as requiring greater than 1.0% difference between complete genomes2,24. Other work suggests that HPV35 sequences may also fall into two evolutionary categories based on the presence or absence of the 16-bp insertion in the LCR15,16. This insertion is absent in the HPV35 reference sequence isolated in the United States but is present in all of the rural Zimbabwean whole genome sequences in this study.
While the association of the LCR insertion with the risk of viral persistence or dysplasia has not yet been investigated, several studies have evaluated the association of HPV35 sublineage with the risk of cervical intraepithelial neoplasia (CIN). For example, work from Costa Rica suggests that the A1 sublineage may be associated with higher risk of persistence and the development of CIN3+ than the A2 sublineage25. In contrast, a study from the United States identified no difference between HPV35 sublineages in the risk of CIN2/326. Of the whole genomes sequenced herein, the two samples associated with HSIL by cytology were in the A2 sublineage. Future large scale studies including HPV whole genome sequencing will be required to determine whether the risk of dysplasia and cervical cancer are associated with HPV35 sublineage, and/or individual viral variants, including single nucleotide changes and insertion/deletions.
While HPV35 single nucleotide sequence variants have been previously reported, particularly in the E6, E7, and LCR genomic regions, the attribution of increased dysplasia or cervical cancer risk to particular variants has not been described2,14,15,16. In this study both previously described and newly described single nucleotide variants were identified. The newly described variants in these Zimbabwean viruses included E6 (193G > A), LCR (7156C > A, 7215T > G, 7894C > A, and 7902G > A), and a non-synonymous variant E7 (682T > C; Gly41Cys). Gagnon et al. suggest that HPV35 persistence may associated with the presence of non-synonymous changes in E7, though given the small number of sequences and participants there was insufficient power to investigate individual variants14. The study presented here was not designed to evaluate persistence, and future studies will be necessary to determine the role of this and other variants in persistent infection. Amino acid changes in HPV35 E7 could impact interactions with the retinoblastoma protein (Rb)14. Similarly, E6 amino acid changes may alter regulation of p53, whereas nucleotide changes in the LCR may change transcription factor binding. Once risk-associated individual variants are identified, functional experiments investigating their mechanism of action will be required.
Conclusions
This study presents analysis of whole genome HPV35 sequences from self-collected cervical swab specimens obtained from women in rural Zimbabwe. There was an overall low level of nucleotide variation from the prototype HPV35 whole genome sequence. Newly described synonymous and non-synonymous single nucleotide variants were identified in E6 (193G > A), E7 (682T > C; Gly41Cys), and the LCR (7156C > A, 7215T > G, 7894C > A, and 7902G > A). Further studies are needed to establish whether these variants are associated with the development of cervical dysplasia and cervical cancers given the prevalence of HPV35 in Southern Africa and uncertain coverage by available HPV vaccines.
Methods
Sample collection and participant information
This study was nested under a previously reported cohort in rural Zimbabwe. Samples from the Chidamoyo study were selected if they tested positive for HPV35 by the Anyplex II HPV28 (Seegene Inc., Seoul, Korea) genotyping assay. Detailed study design and sample collection data were previously described.
HPV typing results and cytologic diagnoses of cervical lesion and characteristics of study subjects were obtained from the prior study. The protocol for the present study was approved by the institutional review boards at the University of Zimbabwe, Stanford University, Medical Research Council of Zimbabwe, and the Research Council of Zimbabwe.
Sequencing methods
Viral nucleic acids were extracted from 200 µL of thin-prep samples using The EZ1 Virus Mini Kit v2.0 (Qiagen, Germantown, MD) and eluted in 60 µL. The whole region of HPV35 from each patient was amplified using primer pairs AF/AR and WF/DR (Table 1). Each reaction included LongAmp Hot Start Taq 2X Master Mix (New England Biolabs, Ipswich, MA), 400 nM of each primer, 3% DMSO and 5 µL nucleic acid. The reactions were run in a DNA Engine PTC-200 (Bio-Rad, Hercules, CA), with cycling conditions: 94 °C for 2 min; 40 cycles of 94 °C for 10 sec, 55 °C for 30 sec, and 65 °C for 5 min; final extension for 10 min at 65 °C. PCR products were sequenced by bi-directional dideoxynucleotide termination sequencing using AF, AR, WF, DR, EF, GF, KF, CR, MF, PF, and QR primers (Elim Biopharmaceuticals, Inc.; Hayward, CA). Primer sequences can be found in Table 4. The HPV35H (GenBank HQ537708) was used as the reference sequence. The individual sequences were aligned to the reference sequence using NCBI blast and the consensus sequences were generated from the assembly. The consensus sequences were manually curated by referring to the raw chromatograms where the base calls from two or more Sanger sequences showed conflict. Frame shifting insertion and deletions at nucleotide positions covered by a single sequence were discarded.
Construction of phylogenetic tree
The consensus sequences and a subset of HPV35 A1 and A2 sublineage sequences2 were subjected to multiple sequence alignment with default parameters using Clustal X (version 2.0)27. The phylogenetic tree was calculated from the multiple sequence alignment using the Neighbor-Joining algorithm in Clustal X, and the phylogenetic tree was then generated with Mega X28.
Ethical approval and informed consent
Ethical approval was granted by Stanford University (#37975), University of Zimbabwe (JREC 221/16), the Medical Research Council of Zimbabwe (MRCZ/A/2128), and the Research Council of Zimbabwe (No. 02921). In addition, the Provincial and District Medical Officers were notified, as well as headmen and villages during community meetings, after sensitization via training of community health workers prior to data collection. Women were informed that their participation was voluntary, they could withdraw at any time, that we would offer to test for HIV but they could refuse this testing or refuse to be notified of their result, and that all information regarding their HIV and HPV status would be kept confidential. Most women wanted to know their HIV and HPV results. Eligible women were interviewed verbally by trained research data collectors on the research team using an electronic questionnaire to collect information on sociodemographic and reproductive information. Inclusion took place after individual informed consent signed electronically or with a thumbprint on a paper copy if illiterate, and an additional paper copy was given to the participation. Informed consent (signature or witnessed thumbprint) was obtained from all participants prior to enrolment. The methods were carried out in accordance with the relevant guidelines and regulations.
Data availability
The whole genome sequences described in this study are available in GenBank (Study Number, GenBank Accession Number): ZM241, MN829875; ZM301, MN829876; ZM464, MN829877; ZM392, MN829878; ZM358, MN829879; ZM435, MN829880; ZM621, MN829881; ZM262, MN829882.
References
de Sanjose, S., Alemany, L., Castellsague, X. & Bosch, F. X. Human papillomavirus vaccines and vaccine implementation. Womens Health 4, 595–604, https://doi.org/10.2217/17455057.4.6.595 (2008).
Chen, Z. et al. Evolution and taxonomic classification of human papillomavirus 16 (HPV16)-related variant genomes: HPV31, HPV33, HPV35, HPV52, HPV58 and HPV67. PLoS One 6, e20183, https://doi.org/10.1371/journal.pone.0020183 (2011).
Schiffman, M., Clifford, G. & Buonaguro, F. M. Classification of weakly carcinogenic human papillomavirus types: addressing the limits of epidemiology at the borderline. Infect. Agent. Cancer 4, 8, https://doi.org/10.1186/1750-9378-4-8 (2009).
Ferlay, J. et al. Cancer incidence and mortality worldwide: sources, methods and major patterns in GLOBOCAN 2012. Int. J. Cancer 136, E359–386, https://doi.org/10.1002/ijc.29210 (2015).
Viarisio, D., Gissmann, L. & Tommasino, M. Human papillomaviruses and carcinogenesis: well-established and novel models. Curr. Opin. virology 26, 56–62, https://doi.org/10.1016/j.coviro.2017.07.014 (2017).
De Vuyst, H. et al. The burden of human papillomavirus infections and related diseases in sub-saharan Africa. Vaccine 31(Suppl 5), F32–46, https://doi.org/10.1016/j.vaccine.2012.07.092 (2013).
Clifford, G. & Franceschi, S. HPV in sub-Saharan. Africa. Papillomavirus Rep. 16, 322–326, https://doi.org/10.1179/095741905x49089 (2013).
Castellsague, X. et al. Human papillomavirus genotypes in rural Mozambique. Lancet 358, 1429–1430, https://doi.org/10.1016/S0140-6736(01)06523-0 (2001).
Naucler, P., Da Costa, F. M., Ljungberg, O., Bugalho, A. & Dillner, J. Human papillomavirus genotypes in cervical cancers in Mozambique. J. Gen. Virol. 85, 2189–2190, https://doi.org/10.1099/vir.0.80001-0 (2004).
Howitt, B. E. et al. Comprehensive Human Papillomavirus Genotyping in Cervical Squamous Cell Carcinomas and Its Relevance to Cervical Cancer Prevention in Malawian Women. J. Glob. Oncol. 3, 227–234, https://doi.org/10.1200/JGO.2015.001909 (2017).
Fitzpatrick, M. B. et al. hrHPV prevalence and type distribution in rural Zimbabwe: A community-based self-collection study using near-point-of-care GeneXpert HPV testing. Int. J. Infect. diseases: IJID: Off. Publ. Int. Soc. Infect. Dis. 82, 21–29, https://doi.org/10.1016/j.ijid.2019.02.022 (2019).
Kuguyo, O. et al. Cervical cancer in Zimbabwe: a situation analysis. Pan Afr. Med. J. 27, 215, https://doi.org/10.11604/pamj.2017.27.215.12994 (2017).
Bruni, L. et al. Human Papillomavirus and Related Diseases in Zimbabwe (2019).
Gagnon, S. et al. Viral polymorphism in human papillomavirus types 33 and 35 and persistent and transient infection in the genital tract of women. J. Infect. Dis. 190, 1575–1585, https://doi.org/10.1086/424854 (2004).
Calleja-Macias, I. E. et al. Genomic diversity of human papillomavirus-16, 18, 31, and 35 isolates in a Mexican population and relationship to European, African, and Native American variants. Virology 319, 315–323, https://doi.org/10.1016/j.virol.2003.11.009 (2004).
Calleja-Macias, I. E. et al. Worldwide genomic diversity of the high-risk human papillomavirus types 31, 35, 52, and 58, four close relatives of human papillomavirus type 16. J. Virol. 79, 13630–13640, https://doi.org/10.1128/JVI.79.21.13630-13640.2005 (2005).
Adler, D. H. et al. Cervical dysplasia and high-risk human papillomavirus infections among HIV-infected and HIV-uninfected adolescent females in South Africa. Infect. Dis. Obstet. Gynecol. 2014, 498048, https://doi.org/10.1155/2014/498048 (2014).
Bigoni, J. et al. Cervical cancer screening in sub-Saharan Africa: a randomized trial of VIA versus cytology for triage of HPV-positive women. Int. J. Cancer 137, 127–134, https://doi.org/10.1002/ijc.29353 (2015).
Castellsague, X. et al. Vaccine-related HPV genotypes in women with and without cervical cancer in Mozambique: burden and potential for prevention. Int. J. Cancer 122, 1901–1904, https://doi.org/10.1002/ijc.23292 (2008).
Chen, Z. et al. Diversifying selection in human papillomavirus type 16 lineages based on complete genome analyses. J. Virol. 79, 7014–7023, https://doi.org/10.1128/jvi.79.11.7014-7023.2005 (2005).
Chan, P. K. et al. Geographical distribution and oncogenic risk association of human papillomavirus type 58 E6 and E7 sequence variations. Int. J. Cancer 132, 2528–2536, https://doi.org/10.1002/ijc.27932 (2013).
Liu, Y., Pan, Y., Gao, W., Ke, Y. & Lu, Z. Whole-Genome Analysis of Human Papillomavirus Types 16, 18, and 58 Isolated from Cervical Precancer and Cancer Samples in Chinese Women. Sci. Rep. 7, 263, https://doi.org/10.1038/s41598-017-00364-9 (2017).
Basto, D. L. et al. Genetic diversity of human papillomavirus types 35, 45 and 58 in cervical cancer in Brazil. Arch. Virol. 162, 2855–2860, https://doi.org/10.1007/s00705-017-3439-5 (2017).
Burk, R. D., Harari, A. & Chen, Z. Human papillomavirus genome variants. Virology 445, 232–243, https://doi.org/10.1016/j.virol.2013.07.018 (2013).
Schiffman, M. et al. A population-based prospective study of carcinogenic human papillomavirus variant lineages, viral persistence, and cervical neoplasia. Cancer Res. 70, 3159–3169, https://doi.org/10.1158/0008-5472.Can-09-4179 (2010).
Xi, L. F. et al. Lineages of oncogenic human papillomavirus types other than type 16 and 18 and risk for cervical intraepithelial neoplasia. J Natl Cancer Inst 106, https://doi.org/10.1093/jnci/dju270 (2014).
Larkin, M. A. et al. Clustal W and Clustal X version 2.0. Bioinformatics 23, 2947–2948, https://doi.org/10.1093/bioinformatics/btm404 (2007).
Kumar, S. et al. Molecular Evolutionary Genetics Analysis across Computing Platforms. Mol. Biol. Evol. 35, 1547–1549, https://doi.org/10.1093/molbev/msy096 (2018).
Acknowledgements
We thank Chidamoyo Christian Hospital for their cooperation and assistance with all aspects of this study, the study coordinator, Edwell Mereki, data collectors: Semya Mereki, Christine Momemebere, Nancy Momemebere, laboratory technician: Oliver Sakawaya, hospital administrator: Major Mereki and nursing and physician staff support. We also thank Fiona Mutisi, as well as Bhavini Suraiya Varyani and Vinie Kouamou for their support with logistics and laboratory assistance. We thank cytotechnologists Rama Arumilli (Stanford) and Raymond Chibvongodze (CIMAS) for assistance with interpretation of ThinPrep cervical cytology slides. We thank Laurel Stell for statistics and data management support. This study was funded under a National Institutes of Health Fogarty Global Health Equity Fellowship training grant for MBF under TW0009338 R25 and the Stanford Pathology Department Mentored Trainee Grant awarded to M.B.F. and B.A.P.
Author information
Authors and Affiliations
Contributions
M.B.F., Z.H., R.S.D.M., M.K.S., J.W., and B.A.P. conceived and planned the study. M.B.F. and B.A.P. obtained I.R.B. approvals. M.B.F., Z.H., J.D., J.W., C.H.H., M.K.S., and B.A.P. collected the study data, and M.B.F., Z.H., J.W., M.K.S., D.A.K., and B.A.P. contributed to the interpretation of the results. M.B.F. and B.A.P. took the lead in writing the manuscript. All authors provided critical feedback and helped shape the research, analysis, and manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Fitzpatrick, M.B., Hahn, Z., Mandishora, R.S.D. et al. Whole-Genome Analysis of Cervical Human Papillomavirus Type 35 from rural Zimbabwean Women. Sci Rep 10, 7001 (2020). https://doi.org/10.1038/s41598-020-63882-z
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-020-63882-z
This article is cited by
-
High-risk human papillomavirus genotype distribution among women with gynecology complaints in northwest Ethiopia
Infectious Agents and Cancer (2023)
-
Genetic diversity in L1 ORF of human papillomavirus in women with cervical cancer with and without human immunodeficiency virus in Botswana and Kenya
BMC Infectious Diseases (2022)
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.