Introduction

Human papillomavirus (HPV) is globally the most common sexually transmitted infection1, and it is strongly associated with cervical, anogenital and oropharyngeal cancers2,3. HPV is one of the main causes of mortality among women in underdeveloped countries1,4,5. Although the prevalence of HPV has already been evaluated in some specific groups and regions, there are no data on HPV prevalence in young general populations of the different regions of Brazil6.

The prevalence and types of circulating HPV vary widely both among different populations and among age groups within populations. All regions of the world have shown an overall decline in prevalence according to age, except Latin America and the Caribbean, where the prevalence increases later in life, presenting a bi-modal distribution1. In addition, the prevalence and type-specific HPV frequency can change according to race, with a higher incidence in indigenous7 and black populations8.

The introduction of HPV vaccination is an opportunity to prevent infection and associated lesions, thus changing the patterns of mortality by cervical cancer. Brazil introduced an HPV immunization program using a quadrivalent vaccine in 2014 for children 9 to 14 years old and is currently adopting a 2-dose vaccination schedule (0–6 months)9. Although vaccine efficacy in decrease cervical intraepithelial neoplasia has been demonstrated in clinical trials10,11,12, measuring the impact of vaccination on HPV prevalence in young populations is important to evaluate the effectiveness of immunization against HPV in different scenarios under pressure of the vaccines. The impact of national HPV vaccination programs is being evaluated with different strategies, including data obtained from residual cervical screening13,14,15,16,17,18, including women 25 years or older, that are dependent on screening rates and the population profile of each specific country. Few studies have had the opportunity to use data from national surveys19 and evaluate young populations.

This study reports prevalence data on type-specific genital HPV in a nationwide sample of adolescents and young adults who use the Public Health System in Brazil. These data allow comparison between different geographical regions and provide important baseline estimates to evaluate the impact of the HPV vaccination program.

Methods

Study design and population

In this cross-sectional study, we evaluated the prevalence of HPV types in a convenience sample of sexually active women and men, aged 16 to 25 years from public primary health care units of the 26 state capitals and the Federal District of Brazil, from September 2016 to November 2017. Participants were recruited through different approaches as individuals who came to the unit for any reason, community health agents extended personal invitations, and nurses in school-based health promotion programs. The study sample of 7935 was calculated to detect a prevalence of at least 30% with an 80% power and to detect a difference of at least 5.2% between the regions of the country. Males were included (M:F ratio of 1:6) to allow subsequent assessment of herd immunity as a result of the HPV vaccination program in future years. This research was approved by the Moinhos de Vento Hospital human subjects research board, and written informed consent was obtained from all participants (Approval No. 1607032). We excluded pregnant women, those who had undergone hysterectomy or trachelectomy, and participants who had ever had cervical intraepithelial neoplasia grade 2 or higher. The study protocol was published previously20.

Data collection

All participants were interviewed by primary care professionals trained specifically for the study using structured questionnaires on risk factors for HPV infection and demographic and behavioral data. Socioeconomic status distribution is a composite score calculated based on the number of household assets, degree of education of the household head and presence of monthly paid housekeeper21. Race/skin color was self-reported. We also asked about current and past cigarette smoking, educational level, marital status, number of sex partners in the last five years and last 12 months, age at first sexual intercourse, and condom use.

For women, the cervical sample was obtained using the Qiagen HC2 DNA collection device according to the manufacturer’s instructions. Men were instructed to self-collect samples from the entire surface of the scrotum, glans penis/coronal sulcus and penile shaft using a saline-wetted Dacron swab, under supervision of a health professional. The cervical and genital swabs were placed in a tube containing 1 mL of Digene Specimen Transport Medium (STM), stored at controlled room temperature (15–25 °C) and shipped to a central laboratory weekly where the samples were aliquoted and stored at −80 °C until processing.

HPV genotyping test

DNA was extracted from 0.5 mL of specimen transport medium (STM) using magnetic beads for isolation and purification on a robotic system (MagNA Pure LC 2.0; Roche) according to the manufacturer’s extraction instructions and the DNA concentration in the extract was determined using the NanoDrop 2000 (Thermo Scientific™).

HPV genotyping was performed using the Roche Linear Array® (LA) genotyping test which amplifies a 450 bp fragment in the L1 gene. Per reaction, 25 μl working master mix was combined with DNA extract (between 100–150 ng) diluted in 25 μl of ultrapure DNase/RNase-Free water. Polymerase chain reaction cycling conditions and hybridization were performed as recommended by the manufacturer. Probes could detect 37 types of HPV simultaneously22. The assay incorporated β-globin as an internal control for sample amplification. To ensure reproducibility of LA, an automated AutoBlot instrument (Fujirebio) was used for the hybridization and wash steps23. As the HPV 52 probe cross-reacts with types 33, 35 and 58, additional analyses were performed, if needed, using a type-specific real-time PCR assay24,25. Samples negative for β-globin and HPV (n = 312, 14.68% in men; n = 4, 0.07% in women) were considered inadequate and excluded from analysis. Six hundred eighty-seven (8.93%) men and women were considered nonresponders because they used an incorrect sampling technique or sample DNA was less than 5 ng/µl. We chose not to genotype these samples to avoid potential bias in the results. From the analyzed samples, eight (0.13%) were considered inadequate after genotyping.

HPV results were grouped as follows: positive for any HPV type, positive for 13 high-risk HPV (16, 18, 31, 33, 35, 39, 45, 51, 52, 56, 58, 59, 68), positive for other HPV (6, 11, 26, 40, 42, 53, 54, 55, 61, 62, 64, 66, 67, 69, 70, 71, 72, 73, 81, 82 [IS39], 83, 84, 89 [CP6108]), positive for vaccine types (quadrivalent: 6, 11, 16, 18; nonavalent: 6, 11, 16, 18, 31, 33, 45, 52, 58), and multiple types (more than one HPV type).

Statistical analysis

The variable number of partners in the past year was categorized as ≤1 or >1 and the variable number of sexual partners in the last five years was categorized as ≤1, 2–3 and ≥4, according to the variable distribution. Social class was categorized according with the Associação Brasileira de Empresas de Pesquisa (ABEP – Brazilian Association of Survey Companies b) who uses a system based on the purchasing power and the level of education of the families to identify social and economic status, ranging from the highest (Class A) to the lowest (Class E)26.

We used the response rate definition from the American Association for Public Opinion Research, which defines the response rate as the number of participants divided by the number of persons that were ever contacted and screened as eligible27.

Sampling weights were applied to the data from our convenience sample in each capital to estimate HPV prevalence among the male and female population aged 16–25 years according to the Brazilian Census28. SAS software (Statistical Analysis System, SAS Institute Inc., Cary, N.C.), version 9·4, was used to conduct the analyses, and a 2-sided confidence level of 5% was determined to be statistically significant. Data with a normal distribution (Anderson-Darling test) were analyzed with Student’s t test or ANOVA, and the Chi square test was used for categorical variables.

The association between demographic and behavioral characteristics and the prevalence of HPV (overall and HR-HPV) was evaluated through Poisson regression with robust variance for binary outcomes. Variables with P < 0.20 in univariate analysis were included hierarchically (sociodemographic, socioeconomic, and behavioral characteristics, consecutively) in a multivariate model. The multivariate model was controlled by other variables included in the model, using the Hosmer-Lemeshow procedure29.

For each region, a univariate model was used to evaluate the prevalence ratio of characteristics in relation to HPV. Additionally, a multivariate model (adjusted for all included variables) was used to evaluate the prevalence ratio among each of the five regions of Brazil, using the South region as reference (region with lowest HPV prevalence). Interaction terms between variables were examined to evaluate effect modification. The vaccine status was investigated, and 810 participants who reported having been vaccinated were excluded from the analysis.

Patient and public involvement

No patients were involved in setting the research question or the outcome measures, nor were they involved in developing plans for recruitment, design, or implementation of the study. No patients were asked to advise on interpretation or writing up of results. The results from the present study will be disseminated through institutional websites, written communication, events and conferences, networks, and social media.

Ethical approval

All procedures performed in studies involving human participants were in accordance with the ethical standards of the Moinhos de Vento Hospital research board (Approval No. 1607032) and with the 1964 Helsinki declaration and its later amendments or comparable ethical standards.

Results

Characteristics of participants

Of the 7694 recruited participants, 6388 (83.0%) provided valid samples for HPV analysis (5268 women and 1120 men). There were no significant differences in race/skin color, educational level or family income in the population that was able to provide a valid sample (Supplement 1). Higher social class, younger age and the Southeast region had higher levels of inadequate sample (Supplement 1). The overall response rate27, defined as the number of eligible people who agreed to participate in the study, was 99.1%.

The majority declared themselves as brown/pardo (n = 3671/ 57.0%), 3817 were not engaged in a formal relationship (63.5%), 3402 were socioeconomic status C (55.5%), and 3571 were attending or had finished secondary school (55.9%) (Table 1). Although half used condoms at some point during their lives, less than 40% reported condom use in the last intercourse encounter. The mean age at first sexual intercourse was 15.3 years (95% CI 15.2 to 15.4), and the majority (79.0%) of women had only one partner in the past year. The age of first intercourse was lower in men (14.9 (95% CI 14.8 to 15.1)) than in women (15.5 (95% CI 15.4 to 15.5)), and men had more partners in the last year (44.6%) than women (20.9%).

Table 1 Sociodemographic and Sexual Behavior Characteristics of the Study Population. POP-Brazil Study, 2017.

HPV prevalence and genotype

From examined samples, 3447 were positive for any HPV type (53.6%; 95%CI 51.4 to 55.8), with no differences between women (2851, 54.6%; 95%CI 52.5 to 56.6) and men (596, 51.8%; 95%CI 47.0 to 56.7). When stratified by age group (16–21 and 22–25), there was a significant difference only among women (P < 0.001), with a higher prevalence of HPV infection in the 16–21 year old group (1652, 57.9%; 95%CI 58.8 to 64.2) than in the 22–25 year old group (1199, 42.1%; 95%CI 44.2 to 50.2). Detection of both overall HPV and HR-HPV prevalence varied by age, being highest among 18–19-year-olds and decreasing in older ages. The prevalence of HR-HPV types was 35.2% (2358; 95%CI 33.1 to 37.2), with significant differences between sexes, with a higher frequency in women (2030, 38.6%) than men (328, 29.2%) (Fig. 1).

Figure 1
figure 1

Prevalence of Any and High-Risk Human Papillomavirus (HPV) According to Sex. POP-Brazil Study, 2017. High-risk HPV types are defined as HPV types 16, 18, 31, 33, 35, 39, 45, 51, 52, 56, 58, 59, and 68.

Quadrivalent HPV types were detected in 1002 (14.8%) specimens [27.9% (95%CI 25.5 to 30.4) of those with any HPV detected], with no significant differences between men and women (data not shown). Nonavalent HPV types were detected in 1842 (27.7%; 95%CI 25.8 to 29.6) specimens being 51.6% (95%CI 48.7 to 54.6) of all HPV positive samples.

Although most individuals who tested positive for HPV had only one HPV type (1411, 42.2%; 95%CI 39.3 to 45.1), multiple types were detected in 2036 (31.0%; 95%CI 29.0 to 32.9) participants: 883 (26.7%; 95%CI 24.1 to 29.3) had 2 types, 484 (12.2%; 95%CI 10.5 to 14.0) had 3 types, and 669 (18.8%; 95%CI 16.6 to 21.1) had 4 types or more. There was a statistically significant difference in the proportion of multiple types between sexes (33.0% in women and 27.5% in men, P < 0.02).

The most prevalent HPV types were HPV 52 (531, 7.8%; 95%CI 6.7 to 8.9), HPV 16 (547, 7.5%; 95%CI 6.5 to 8.4), HPV 62 (399, 6.8%; 95%CI 5.6 to 7.9), HPV 89 (331, 6.3%; 95%CI 5.0 to 7.5), and HPV 61 (355, 6.0%; 95%CI 4.9 to 7.2). Overall, 247 (3.8%; 95%CI 2.9 to 4.6) participants were positive for HPV 18, and 757 (10.73%; 95%CI 9.5 to 11.9) were positive for both HPV 16 and 18. The prevalence of HPV 6 was 5.6% (353; 95%CI 4.7 to 6.4), and 74 participants (1.4%; 95%CI 0.9 to 1.9) were positive for HPV 11. The pattern of HPV infection differed widely between sexes. While HPV 16 (479, 8.9%; 95%CI 7.7 to 10.1) and 52 (452, 8.8%; 95%CI 7.6 to 10.0) were the most prevalent HR-HPV types in women, HPV 59 (71, 6.5%; 95%CI 4.2 to 8.9) and 52 (79, 6.0%; 95%CI 3.7 to 8.2) were the most frequent in men (Fig. 2).

Figure 2
figure 2

Most Frequent High- and Low-Risk HPV Types in Men and Women in the POP-Brazil Study. The lines indicate 95% confidence intervals.

Determinants of HPV detection

Detection of any HPV detection was associated with female gender, brown/pardo color, socioeconomic status D-E, being single or dating, current smoking and having two or more partners in the past year. Associations for detection of HPV-HR types were similar to those for detection of any HPV types, except brown/pardo color and former smoker were not significant for HPV-HR types (Table 2).

Table 2 Characteristics Associated with any type and High-Risk Genital HPV Prevalence: a Multivariate Model. POP-Brazil Study, 2017.

Regional differences in HPV detection

The prevalence of any HPV differed significantly between regions with higher detection in the Northeast (919, 58.1%; 95%CI 54.8 to 61.3), followed by the Midwest (654, 56.7%; 95%CI 51.9 to 61.5), North (749, 53.6%; 95%CI 49.5 to 57.6), Southeast (617, 49.9%; 95%CI 45.3 to 54.5), and South (508, 49.7%; 95%CI 45.0 to 54.3).

The geographical regions also varied in the socio-behavioral characteristics that associated with HPV detection (Table 3). Dating increased the rate of HPV detection in the Southeast, Midwest, and Northeast regions. Current smoking and being a former smoker were associated with HPV prevalence in the Midwest, Northeast and North, and the number of sexual partners in the past year was associated with HPV prevalence in all regions except the South. Only in this region was socioeconomic status A associated with HPV infection. When we adjusted a model for sex, race/skin-color, marital status, social class, smoke, condom use, age at first intercourse and number of partners in the last year, Northeast (1.17; 95% CI 1.03–1.34) and Midwest (1.16; 95% CI 1.01–1.34) haver significantly higher prevalences when comparing to South region.

Table 3 Characteristics Associated with HPV Prevalence in Each Geographic Region of Brazil. POP-Brazil Study, 2017.

Discussion

This study with over 6000 participants is the first to provide nationwide data on genital HPV prevalence in Brazil. Furthermore, to our knowledge, this is one of only a few large studies enrolling women and men simultaneously, allowing a direct comparison of HPV rates between genders. We found a high prevalence of any HPV (53.5%) and HR-HPV (35.2%) in a nationwide unvaccinated sample of 16–25-year-olds. While the overall HPV prevalence is similar between sexes, women have higher rates of HR-HPV and multiple infection. The type-specific detection of HPV infection varied widely between sexes. The geographical regions of Brazil differed in socio-behavioral characteristics that were associated with HPV detection.

Approximately half of the studied women had HPV detected. Previous small studies in the Brazilian population have shown HPV rates from as low as 6.7%30 up to 62.3%31, depending on the population evaluated and method of detection. Compared to global rates32, the prevalence in this study is higher than the ones reported for the same age range in South America (18.1%)6 and similar to the US HPV detection rate (53.8%)18 for 20–24 year old.

The majority of studies on HPV prevalence are focused on sex-specific populations, preventing a direct comparison between genders. In the present study, the prevalence of overall genital HPV in men was similar to women and higher than previous Brazilian studies33 except for the HIM Study (men aged 18–70 years), which found higher rates of HPV (72.3%)34. In the United States (NHANES) and England (Natsal-3) surveys have also compared the prevalence of HPV between sexes. Although the overall prevalence of HPV was higher among men in the US, this varied by age, with younger women (age 14–24 years) having higher prevalence of any type and HR-HPV35, similar to our results. Although we used similar genotyping methods, there were differences compared to our study. In the United States, the number of sexual partners was associated with an increase in HR-HPV prevalence in men35. In contrast, our study found that the number of partners increased the HR-HPV prevalence in women (from 1.6, 95%CI 1.1 to 2.1 to 1.7; 95%CI 1.2 to 2.3) but only in those who initiated sexual activity after age 15 (data not shown).

Although men and women share the main HPV types, the prevalence varies between sexes. In men, the most prevalent HR type was HPV 59, an oncogenic type related to HPV 1836. In women, the most prevalent HR type was HPV 16. These data emphasize the importance of screening considering that approximately 68.2% of invasive cervical cancers are attributed to HPV 16 or 1832.

Brazil covers a large portion of the South American continent and has high socioeconomic and cultural diversity. Therefore, it was reasonable to assume that the Brazilian population might have different risk factors associated with HPV infection from region to region. The majority of studies available were performed in women in the Southeast region and no reports were found that included all geographic regions of Brazil33. Although there are differences in prevalence according to region, race/color, social class and educational level were generally not significantly associated with HPV detection. Similar results have been reported in previous research studies in American and British populations37,38. Studies showing an effect of race and social class on HPV prevalence are often not comparable due to differences in the included population39,40,41, which is generally limited to specific groups. Behavioral characteristics, such as smoking and number of partners, may explain differences in prevalence rates according to region. Current and prior cigarette smoking was associated with HPV prevalence in all but the South and Southeast regions, possibly due to the higher prevalence of smokers in these regions (20.2% and 16.6%, respectively). Cigarette smoking contributes to cervical cancer32, and it has been associated with HPV prevalence42,43, incidence44, and persistence45.

This study has some limitations that must be noted. This multicentric community-based study was not population-based; instead, it was performed in convenience primary health care units spread throughout Brazil. We were able to weight our estimates for the age structure, but all people included in the survey were sexually active and therefore likely to overestimate the true population prevalence. Furthermore, we do not have data to weight for differences in sexual behavior between people living in capitals and other regions of the state, adding uncertainty around estimates extrapolated to the wider population. Although we have a large sample size, when we presented the prevalence stratified by sex or region, some estimates may be unstable due to the small sample size in specific categories. Additionally, males had a higher rate of invalid samples because they had lower levels of extracted DNA, similar to other studies46.

In summary, we report a high prevalence of any HPV type and HR-HPV types in a nationwide sample of men and women aged 16–25 years who were unvaccinated for HPV, with a higher prevalence of HR-HPV types observed in women. There were some differences in HPV prevalence across regions in Brazil that might be partially explained by behavioral variables. Our data provide information to be used in HPV prevention policies and constitute a baseline against which the impact of the HPV immunization program in Brazil can be measured in future years.