Introduction

Most women have a vaginal microbiota (VMB) that consists predominantly of lactobacilli1. The most common type of vaginal dysbiosis is bacterial vaginosis (BV), characterized by a decrease in lactobacilli and increase in fastidious anaerobes2. Other types of bacterial dysbiosis, vulvovaginal candidiasis, and Trichomonas vaginalis (TV) are also common. These conditions are associated with vaginal inflammation, thereby increasing the risk of HIV acquisition3. BV is also associated with pelvic inflammatory disease, infertility, and adverse pregnancy outcomes2.

The majority of women seeking care for vaginal symptoms receive antibiotic or antifungal treatment empirically or syndromically without any diagnostic testing4. In some specialized clinics, women might be offered limited diagnostic testing, such as vaginal pH determination and/or wet mount microscopy. In research settings, BV is typically diagnosed by the Amsel criteria or Nugent scoring5,6, with the latter currently being considered the gold standard: Gram-stained vaginal smears are scored based on microscopic visualization of three bacterial morphotypes with a score of 0–3 considered normal, 4–6 intermediate, and 7–10 BV regardless of symptoms. In the last 15 years, molecular methods have become more widely available, and have been applied to the VMB, although mostly in descriptive studies to date1.

Evidence is mounting that ‘microbiological-BV’ (defined here as the absence of lactobacilli-domination by Nugent scoring or by molecular methods regardless of symptoms) can cause long-term adverse outcomes in the presence or absence of vaginal symptoms2. BV is notoriously difficult to treat7,8,9. About 60–80% of patients are cured after a course of oral or vaginal metronidazole or clindamycin, but recurrence rates are high7. Therapies to restore and maintain an optimal lactobacilli-dominated VMB after antibiotic treatment are not standard practice, but some clinicians in Europe and North America recommend twice weekly 0.75% metronidazole vaginal gel for 4–6 months to lower the risk of BV recurrence7. This recommendation was tested in a randomized controlled trial in the USA, which showed a statistically significant reduction in BV recurrence (34% by Nugent scoring)10. In addition, two African trials evaluating oral (2 g monthly) and vaginal metronidazole (five nights every three months) for BV prevention also showed significant incidence reductions (45% and 10%, respectively)11,12.

Probiotic lactobacilli may be able to restore and maintain a lactobacilli-dominated VMB, may be better able to prevent or disrupt BV-associated biofilms than antibiotics, and can likely be used safely for long periods without the risk of causing antimicrobial resistance13. Lactobacilli-containing vaginal probiotic clinical trials to date have shown mixed results, but eight of the 12 trials showed sufficiently promising efficacy for BV prevention to warrant further investigation13,14,15,16,17,18,19,20,21,22,23,24. Some trials used commercially available probiotic strains (mostly derived from the gut or fermented foods) and others vaginal strains isolated from healthy women25, but efficacy signals were similar for the two probiotic strain categories13,14,15,16,17,18,19,20,21,22,23,24. None of the trials reported major safety concerns or colonization beyond the dosing period. Because ‘natural’ strains do not seem to outperform commercially available strains, and for pragmatic reasons, we chose to evaluate two vaginal probiotics that are currently on the market. Our aim was to assess their safety and impact on the VMB in much more detail than previous trials had done, and to develop data analytic methods to enable use of high-dimensional 16S rRNA gene sequencing data for this purpose. The two probiotics that we evaluated were Ecologic Femi+ vaginal capsule (EF+; Winclove Probiotics, Amsterdam, The Netherlands) and Gynophilus LP vaginal tablet (GynLP; Biose, Arpajon-sur-Cère, France). EF+ contains multiple lyophilized bacteria (Bifidobacterium bifidum W28, Lactobacillus acidophilus W70, L. helveticus W74, L. brevis W63, L. plantarum W21, and L. salivarius W24) in a total dose of 1.5 × 109 colony forming units (CFU). GynLP contains 1.6 × 109 CFU of Lcr Regenerans, a culture of the L. rhamnosus 35 (Lcr35) strain. The tablet disintegrates in the vagina after forming a gel to release Lcr35. Gynophilus (the same active ingredient as GynLP but a different formulation) had shown promise in preventing BV recurrence in a previous trial, but EF+ had not previously been studied for this indication15.

Methods

From June 2015 until February 2016, we conducted a randomized clinical trial in Kigali, Rwanda, to evaluate intermittent use of the above-mentioned vaginal probiotics as well as oral metronidazole (Tricozole, Laboratory & Allied ltd, Nairobi, Kenya) to prevent the recurrence of microbiological-BV after metronidazole treatment. The trial was a pilot trial with a modest sample size (N = 68) at the request of the funder.

Eligibility

Women were eligible for screening if they were aged 18–45, in good overall physical and mental health, and at high urogenital infection risk defined as having had more than one sex partner in the last 12 months or having been treated for a sexually transmitted infection and/or BV in the last 12 months. They were eligible for enrollment if they were confirmed HIV-negative and non-pregnant, were diagnosed with BV (Nugent score 7–10 and/or by modified Amsel criteria, defined as two of the following three criteria positive: vaginal pH > 4.5, positive whiff test, or ≥20% clue cells) and/or TV (by wet mount and/or culture), and were cured after seven days of oral metronidazole treatment (500 mg twice per day)5. Cure was defined as no BV by modified Amsel criteria and no TV by wet mount6. We did not use Nugent scores and TV culture as tests-of-cure to allow for same day enrollment but results became available after enrollment. At enrollment, 51/68 women were BV-negative by modified Amsel and Nugent criteria (score 0–6), 17/68 women were BV-negative by modified Amsel but BV-positive by Nugent criteria (score 7–10), and all women were TV-negative by both wet mount and culture and free of symptomatic vulvo-vaginal candidiasis, urinary tract infection, syphilis, and clinician-observed genital abnormalities or vaginal discharge. We did not exclude women with gonorrhea and/or chlamydia because the local testing turn-around time was slow. Positive herpes simplex type 2 serology was not a reason for exclusion. Additional exclusion criteria applied but these were rare (Fig. 1, Supplement 1).

Figure 1
figure 1

Flow diagram of numbers of women seen, study procedures, and samples collected. Abbreviations: BV, bacterial vaginosis; D7, day 7 visit; DNA, desoxyribonucleic acid; M1/2/6, month 1/2/6 visit; qPCR, quantitative polymerase chain reaction; RU, Rinda Ubuzima; STI, sexually transmitted infection; TV, Trichomonas vaginalis; UTI, urinary tract infection. *Totals to 110 reasons among 102 women because there could be more than one reason per woman. No speculum exam performed; molecular testing of self-sampled vaginal swabs. Reasons: outside of metronidazole treatment window (n = 5), enrollment target already met (n = 4), has a mental disorder (n = 1), did not complete screening procedures and was subsequently lost to follow-up (n = 1), withdrew consent during the screening visit because she thought the reimbursement was too low (n = 1). §Three women in each randomization group were selected for self-sampling every other day during the first month of follow-up. Reasons: moved away from Kigali (n = 2), lost interest because symptoms resolved (n = 1), was verbally harassed by partner and sister about study participation (n = 1). ||These were all physician-collected. See Supplement 1 for information about samples that should have been but were not collected or were lost during processing.

Randomization groups and visit schedule

Women were randomized to four groups (17 women per group) within three days of completing oral metronidazole treatment: (1) behavioral counseling only (control group); (2) counseling plus 500 mg oral metronidazole twice weekly for two months; (3) counseling plus EF+ vaginal capsule once per day for the first five days followed by thrice weekly for a total of two months; and 4) counseling plus GynLP once every four days for two months. Participants applied the first dose of their intervention under direct observation at the enrollment visit, and returned to the clinic after seven days (D7), one month (M1), two months (M2; cessation of product use), and six months (M6). They were allowed to cease vaginal product use temporarily during menstruation. Product adherence was assessed by review of diary cards, used and unused products, and by a self-rating adherence scale. Symptomatic BV, vulvovaginal candidiasis, and urinary tract infections, and laboratory-confirmed sexually transmitted infections, diagnosed during follow-up were treated using standard oral therapies, and women were urged to continue their interventions during treatment. Visit procedures are summarized in Fig. 1 and described in Supplement 1.

Diagnostic testing

Women were tested for BV, TV, and vulvovaginal candidiasis at each visit. BV was diagnosed by Gram stain Nugent scoring and modified Amsel criteria; the vaginal pH was measured by pressing a pH paper strip (pH range 3.6–6.1 with 0.3 increments; Machery-Nagel, Düren, Germany) against the vaginal wall. TV was diagnosed when motile trichomonads were observed on wet mount and by InPouch culture (Biomed Diagnostics, White City, OR, USA). Vulvovaginal candidiasis was diagnosed when budding yeasts and/or (pseudo)hyphae were observed on wet mount. All other diagnostic tests (Supplement 1) were only done at screening, M6, and when judged clinically necessary by the physician, with the exception of pregnancy and urinalysis tests, which were repeated at enrollment prior to randomization.

16S rRNA gene sequencing and qPCR

We collected vaginal swabs in duplicate but DNA was extracted from one swab per woman only at each time point (639 samples in total): screening (176 screened women and 170 samples), enrollment and four follow-up visits (68 randomized women and 338 samples), and self-sampled swabs from 12 women (three per randomization group) who had been asked to self-sample at home every Monday, Wednesday, and Friday during the first month (12 women and 131 samples) (Fig. 1). Briefly, DNA was extracted using a combination of lysozyme lysis, Qiagen DNeasy Blood and Tissue kit (Qiagen, Manchester, UK), and bead-beating procedures (Supplement 1). Purified DNA underwent two PCR rounds: amplification of the 16S rRNA gene V3-V4 region and dual-index barcoding allowing multiplexing of up to 384 samples. Samples were sequenced on an Illumina HiSeq2500 instrument (Illumina, San Diego, CA, USA), run in rapid mode, 2 × 300 bp using a 250PE and 50PE kit. All samples collected from the same participant were included in the same run. Negative and positive controls (ZymoBiomics Microbial Community DNA standard; Zymo Research Corp, Irvine, USA) were incorporated throughout. The panbacterial 16S rRNA gene copy concentrations of physician-collected samples of enrolled women with valid sequencing results (N = 393) were determined by BactQuant qPCR assay as previously described26.

Molecular data processing

Processing of sequencing reads, taxonomic assignments, and rarefaction are described in Supplement 1. This yielded 401 unique amplicon sequence variants (ASVs) in 629 samples, mapping to species (n = 255; 63.6%), genus (n = 116; 28.9%), or higher taxonomic level (n = 30; 7.5%). Concentrations in cells/μl per ASV per sample were estimated by multiplying the ASV-specific copy-normalized relative abundance by the sample-specific 16S rRNA gene copies concentration (explained in more detail in Supplement 1)27,28. Heatmaps of the 20 most common ASVs by sample are shown in Fig. S1A for relative abundances and Fig. S1B for estimated concentrations. VMB data reduction was required for molecular efficacy analyses, and was done in three different ways. First, the richness (number of ASVs) and Simpson diversity index (1-D) were calculated for each sample. Second, each ASV was assigned to one of four ‘bacterial groups’ based on the published literature (Supplement 2): (1) lactobacilli (all species combined, but with subcategorization into EF+ strains, the GynLP strain, and ‘natural lactobacilli’ in some analyses); (2) BV-anaerobes (fastidious anaerobes that have consistently been associated with BV); (3) pathobionts (bacteria that are considered more pathogenic than BV-anaerobes and that often co-occur with lactobacilli instead of BV-anaerobes, such as streptococci, staphylococci, enterococci, and Enterobacteriaceae)29; and (4) ‘other bacteria’ (a rest group, consisting mostly of skin bacteria and Bifidobacterium species). Within each sample, read counts of ASVs belonging to the same bacterial group were summed. This resulted in four relative abundances (one for each bacterial group) per sample, which sum to 1.0 for each sample. Third, we classified samples into eight mutually exclusive VMB types (each sample was assigned to only one VMB type): (1) L. iners-dominated (>75% lactobacilli of which L. iners was the most common; n = 247 samples); (2) L. crispatus-dominated (>75% lactobacilli of which L. crispatus was the most common; n = 17); (3) other lactobacilli-dominated (>75% lactobacilli of which L. jensenii and L. gasseri were the most common; n = 28); (4) lactobacilli plus BV-anaerobes (25–75% lactobacilli; n = 86); (5) polybacterial with ≥10% Gardnerella vaginalis (<25% lactobacilli and 10–50% G. vaginalis; n = 138), (6) polybacterial with <10% G. vaginalis (<25% lactobacilli and <10% G. vaginalis; n = 23), (7) G. vaginalis-dominated (<25% lactobacilli and >50% G. vaginalis; n = 41); and (8) ≥20% pathobionts (n = 49). The samples assigned to VMB types 1–7 had a maximum of 0–18.2% pathobionts per VMB type. To improve statistical power and to facilitate visualizations, the eight VMB types were condensed further into four ‘pooled VMB types’ as follows: (1) lactobacilli-dominated (original VMB types 1–3 combined; n = 292), (2) lactobacilli plus BV-anaerobes (original VMB type 4; n = 86); (3) BV-like (original VMB types 5–7 combined; n = 202); and ≥20% pathobionts (original VMB type 8; n = 49).

Downstream statistical analysis

Data were analyzed using STATA v13.0 (StataCorp, College Station, TX, USA). The trial safety endpoints were self-reported solicited and unsolicited (serious) adverse events and social harms, and clinician-observed speculum exam findings. Adverse events were coded using the Medical Dictionary for Regulatory Activities (medDRA v19.1, McLean, VA, USA). The primary preliminary efficacy endpoints were incident BV by Nugent score 7–10 or by modified Amsel criteria (binary variables). The secondary preliminary efficacy endpoints were the VMB composition variables derived from the sequencing data: the four bacterial group relative abundances and estimated concentrations (continuous variables) and VMB type or pooled VMB type (categorical variables). For cross-sectional analyses, we used Fisher’s exact test for binary and categorical variables, and Kruskal-Wallis and Mann Whitney U tests for continuous variables. For longitudinal analyses, we used incidence rates and incidence rate ratios (which only included samples collected at study visits; Fig. 1), and mixed effects models (which included all samples, including those that were self-collected in between study visits, but Nugent scores and estimated concentrations were not available for self-collected samples; Fig. 1). Most analyses were conducted on the intent-to-treat (ITT) population (N = 68), but the BV by Nugent score and modified Amsel criteria incidence analyses were conducted on the modified ITT population (which was limited to women who were BV-negative by both modified Amsel and Nugent criteria at the time of randomization; n = 51). We compared each intervention group to the control group for one VMB endpoint at a time at screening, enrollment, and longitudinally over time. All mixed effects models included one VMB endpoint as the outcome, participant identification number as the random effect, and randomization group (an indicator variable with the control group as the reference group) as the main fixed effect, and adjusted for covariates that were associated with at least one VMB endpoint in mixed effects models at p < 0.05 (Supplement 1).

Ethical statement

All participants provided written informed consent. The study was conducted in accordance with the Declaration of Helsinki, sponsored by the University of Liverpool, approved by the Rwanda National Ethics Committee and the University of Liverpool Research Ethics Subcommittee for Physical Interventions, and first posted on ClinicalTrials.gov (NCT02459665) on 2 June 2015.

Results

Participant disposition

Of the 68 randomized women, only four did not complete the trial (Fig. 1), resulting in 29.93 person-years of data. The median age was 31 (interquartile range (IQR) 27–35) (Tables 1 and S1). The majority of women (92.6%) had exchanged sex for money or goods, and had had a median of five (IQR 2–18) sex partners, in the past month. All but three women used condoms, but mostly inconsistently. Two-thirds of the women (61.8%) were using hormonal contraception, and four women became pregnant during the trial. Short course metronidazole/tinidazole use for other indications during the intervention period was evenly distributed among randomization groups (Table S2: n = 1–3 per group; p = 0.688), as was short course use of other oral antibiotics (Table S2: n = 2–4 per group; p = 0.781). Furthermore, these other antibiotics did not impact lactobacilli and BV-anaerobes estimated concentrations significantly (Table S2, Fig. S2). No antifungals were used. Most women adhered with their study product >90% of the time, but this percentage was non-significantly lower in the GynLP group (68.8%) than in the metronidazole (82.4%; Fisher’s exact p = 0.438) and EF+ groups (88.2%, Fisher’s exact p = 0.225; Table S3).

Table 1 Baseline characteristics.

VMB compositions at baseline

By design, all 68 randomized women at screening had BV (by Nugent and/or modified Amsel criteria) and/or TV (by wet mount and/or culture): 56 women (82.4%) had BV alone, 10 women (14.7%) had BV and TV, and two women (2.9%) had TV alone. Therefore, as expected, most women had dysbiotic VMB types by 16S rRNA gene sequencing at screening: polybacterial with ≥10% G. vaginalis (28/67; 41.8%), lactobacilli plus BV-anaerobes (12/67; 17.9%), polybacterial with <10% G. vaginalis (8/67; 11.9%), G. vaginalis-dominated (8/67; 11.9%), and ≥20% pathobionts (1/67; 1.5%). However, 10/67 women (14.9%) had a L. iners-dominated VMB type, of whom six were TV-negative, and these women might not have needed metronidazole treatment. Also by design, all randomized women were BV-negative (by modified Amsel criteria) and TV-negative (by wet mount and culture) at the time of randomization. However, almost half of the women (30/67; 44.8%) were not lactobacilli-dominated by sequencing at the time of randomization, and none of them were L. crispatus-dominated (which is considered the most optimal VMB state): their VMB types were L. iners-dominated (35/67; 52.2%), dominated by other lactobacilli (2/67; 3.0%), lactobacilli plus BV-anaerobes (18/67; 26.9%), ≥20% pathobionts (6/67; 9.0%), G. vaginalis-dominated (6/67; 6.0%), and polybacterial with ≥10% G. vaginalis (2/67; 3.0%) (Table 1).

At randomization, the estimated mean total bacterial concentration ranged from 5.54–6.34 log10 cells/μl in the four randomization groups, and these ranges were 5.22–5.97 for lactobacilli, 3.36–5.62 for BV-anaerobes, 1.34–2.36 for pathobionts, and 0.57–2.20 for ‘other bacteria’ (Table S4; relative abundance data in Table S5). The mean richness ranged from 7.5–17.8, and the mean Simpson diversity from 0.15–0.38. Randomization did not completely balance the baseline VMB compositions of the four groups: the estimated concentrations of total bacteria and BV-anaerobes were higher in the GynLP group than the control group (Mann Whitney U test p < 0.05), and the estimated BV-anaerobes concentration (p < 0.01) and Simpson diversity (p < 0.05) were lower in the EF+ group than the control group (Table 1).

Variables that were associated with at least one VMB endpoint in unadjusted mixed effects models (Table S6) included currently using hormonal contraception or being pregnant (associated with a higher estimated pathobionts concentration), above-average sexual risk taking based on reported numbers of partners and condom use (higher estimated pathobionts concentration), aged 30 years or older (lower estimated BV-anaerobes and pathobionts concentrations), and managing menses with a sanitary pad compared to other methods (higher Nugent score). Reporting current urogenital symptoms was also associated with VMB composition but this likely represents reverse causality. We could not exclude women with ongoing chlamydia and/or gonorrhea infections at randomization because the turn-around time of diagnostic testing was slow, but the VMB compositions after metronidazole treatment of women with and without ongoing chlamydia and/or gonorrhea infection were similar (Fig. S2).

Safety

Two serious adverse events occurred but these were judged not to be related to study participation: one woman in the oral metronidazole group was hospitalized for typhoid fever and one woman in the EF+ group for malaria during pregnancy. Both events occurred after the intervention period and both women recovered completely. Two women reported non-serious social harms that were judged related to study participation. One woman in the control group was beaten by her partner because she engaged in self-sampling; she withdrew from self-sampling but continued participation in the study. One woman in the GynLP group suffered verbal harassment from her partner and her sister for taking part in the study and elected to withdraw.

During the intervention period, urogenital symptoms were reported by 13.4% of participants (genital itching, burning, and pain during sex but no vaginal discharge) with no differences between groups, and only two speculum exam and no bimanual exam findings were reported by the physician (Table 2). After product cessation, urogenital symptom reporting was similar compared to the intervention period (10.8%), but the number of speculum exam findings increased (32.8%), probably reflecting the high urogenital infection incidence in this cohort. A total of 41 adverse events were spontaneously reported between enrollment and M6. All of them were judged definitely not or unlikely to be related to trial participation, and they were evenly distributed among groups.

Table 2 Safety endpoints.

Preliminary efficacy: microscopy endpoints

In modified ITT analyses, the BV incidence rate by Nugent score during the intervention period was 10.18 per person-year at risk (PY) in the control group, and lower in the metronidazole (1.41/PY; p = 0.004), EF+ (3.58/PY; p = 0.043), and GynLP groups (5.36/PY; p = 0.220) (Table 3). Mean Nugent scores during the intervention period were highest in the control group, lowest in the metronidazole group, and in-between in the two vaginal probiotics groups (Fig. 2A). By the end of the intervention period, many women had developed microbiological-BV without symptoms. In line with standard practice, they were not treated, but they were also no longer included in the ‘person-years at risk’ denominator because they had already developed the endpoint. BV incidence rates were therefore much lower (1.26/PY overall) after cessation of the intervention, and similar between groups. The results for BV incidence by modified Amsel criteria were similar to those for Nugent scores (Table 3), and no vulvovaginal candidiasis was diagnosed after randomization.

Table 3 Preliminary efficacy – microscopy endpoints.
Figure 2
figure 2

Preliminary efficacy by Nugent score, alpha diversity, and bacterial group estimated concentration. Abbreviations: D7, Day 7 visit; EF+, Ecologic Femi+; Enr, enrollment visit; est conc, estimated concentration; GynLP, Gynophilus LP; ITT, intention to treat; M1/2/6, month 1/2/6 visit; Scr, screening visit; VMB, vaginal microbiota. Changes in VMB outcomes over time per randomization group. See Supplement Table S4 for 95% confidence intervals. (A) Mean Nugent scores over time, only including women (n = 51) who were BV-negative by modified Amsel and Nugent criteria (modified ITT analysis). (B) Mean alpha diversity over time. (C) Mean estimated bacterial cell concentration over time. (D) Difference in mean estimated bacterial cell concentration with enrollment, over time. (E) Mean estimated lactobacilli concentration over time. (F) Difference in mean estimated lactobacilli concentration with enrollment, over time. (G) Mean estimated BV-anaerobes concentration over time. (H) Difference in mean estimated BV-anaerobes concentration with enrollment, over time. (I) Mean estimated pathobionts concentration over time. (J) Difference in mean estimated pathobionts concentration with enrollment, over time. (K) Mean estimated other bacteria concentration over time. (L) Difference in mean estimated other bacteria concentration with enrollment, over time.

Preliminary efficacy: sequencing endpoints

We took the baseline VMB composition imbalances into account by not only showing bacterial group means at each visit but also mean differences with enrollment (Fig. 2 and Table S4 for estimated concentrations, and Fig. S3 and Table S5 for relative abundances), and by using mixed effects models including participant identification number as the random effect to determine if any changes were statistically significant (Table 4). Immediately after BV treatment completion, the VMBs of most women gradually worsened (lactobacilli declined and BV-anaerobes expanded), probably due to the high-risk nature of the cohort. The mean estimated lactobacilli concentration declined to a low of 3.86 log10 cells/μl at M2 in the control group, but less so in the intervention groups (ranging from 4.60–5.58 log10 cells/μl at follow-up visits). In unadjusted mixed effects models using data from the intervention period only, metronidazole users had a higher estimated lactobacilli concentration (p = 0.043) and relative abundance (p = 0.006) than controls, EF+ users had a higher relative abundance (p = 0.014) but not estimated concentration than controls, and GynLP users had trends in the same directions that were not statistically significant. The expansion of BV-anaerobes was significantly lower in oral metronidazole users (relative abundance; p = 0.023), and in EF+ users (estimated concentration; p = 0.041), compared to controls. Mean estimated pathobionts concentrations were low in all groups throughout, ranging from 1.61–3.35 log10 cells/μl at follow-up visits. Mixed effects models did not identify any significant associations between randomization groups and estimated pathobionts concentrations or relative abundances, but showed trends (0.05 < p < 0.1) towards lower pathobionts relative abundances in the two vaginal probiotics groups compared to controls (Table 4). Mean estimated concentrations of ‘other bacteria’ were also low throughout, but highest in the EF+ group at the D7 and M1 visits (EF+ contains a Bifidobacterium strain). This was significant in unadjusted mixed effects models for relative abundances (p = 0.023) but not estimated concentrations.

Table 4 Preliminary efficacy – mixed effects models.

The proportions of women in each randomization group at each visit having a particular VMB type corresponded with the estimated concentration and relative abundance data, and additionally showed that – in lactobacilli-dominated women – the L. iners-dominated VMB type was far more common than the L. crispatus-dominated and other lactobacilli-dominated VMB types throughout (Fig. S4). Among women with dysbiosis, the lactobacilli plus BV-anaerobes and polybacterial with ≥10% G. vaginalis VMB types continued to be the most common dysbiosis types during follow-up. In unadjusted mixed effects models using intervention period data only, metronidazole users and EF+ users, each compared to controls, were significantly less likely to have dysbiotic VMB types (p = 0.012 and p = 0.029, respectively) (Table 4).

The associations in unadjusted mixed effects models persisted after adjustment for hormonal contraception use/pregnancy, sexual risk taking, and age, except for the association with estimated Lactobacillus concentration among metronidazole users. Metronidazole users compared to controls had a significantly higher Lactobacillus relative abundance (p = 0.014), a significantly lower BV-anaerobes relative abundance (p = 0.049), and were significantly less likely to have BV by Nugent scoring (p = 0.031) or by VMB types (the two polybacterial and G. vaginalis-dominated VMB-types combined; p = 0.017) (Table 4). EF+ users compared to controls had a significantly higher Lactobacillus relative abundance (p = 0.025), a significantly lower estimated BV-anaerobes concentration (p = 0.046), a significantly higher relative abundance of ‘other bacteria’ (p = 0.009), and were significantly less likely to have BV-like VMB types (p = 0.027).

Detection of probiotic strains

During the intervention period, relevant probiotic strains were detected in 39% of samples from EF+ users and 20% of samples from GynLP users (all swabs combined, including self-sampled swabs). The detection percentages were 58% and 31%, respectively, in sensitivity analyses using non-rarefied data. Some of the EF+ strains cannot be differentiated from naturally occurring strains, and EF+-like strains were therefore detected (at low levels) in all groups at most time points (Fig. 3, Table S4). However, the mean estimated concentrations were highest in the EF+ group during the intervention period (mean estimated concentrations 0.48–1.92 log10 cells/μl per visit for all women combined, and 3.62–4.28 log10 cells/μl per visit for women who did have EF+ strains detected using rarefied data). The GynLP strain was only detected in the GynLP group during the intervention period (mean estimated concentrations 0.25–1.05 log10 cells/μl per visit for all women combined, and 3.72–4.55 log10 cells/μl per visit for women who did have GynLP detected using rarefied data). Inter- and intra-individual differences between participants were high: the highest estimated vaginal probiotic concentration detected in an individual EF+ user was 5.51 log10 cells/μl, and in an individual GynLP user was 6.17 log10 cells/μl. During the intervention period, the mean relative abundances of the probiotic strains were 0.03 in both EF+ and GynLP users, and 0.08 and 0.15, respectively, if only samples in which any strains were detected were included.

Figure 3
figure 3

Detection of probiotic strains during the trial. Abbreviations: D7, Day 7 visit; EF+, Ecologic Femi+; Enr, enrollment visit; GynLP, Gynophilus LP; M1/2/6, month 1/2/6 visit; Scr, screening visit. Mean relative abundance (A) and mean estimated concentration (B) of Lactobacillus species over time in the EF+ group; mean relative abundance (C) and mean estimated concentration (D) of Lactobacillus species over time in the GynLP group. The length of bars in (A,C) depicts total relative abundance of all Lactobacillus species combined.

VMB transitions

The stacked graph and alluvial diagrams (Fig. S4) show that transitions from one VMB type to another were common. As expected, most transitions during the intervention period were from Lactobacillus-dominated states to dysbiotic states, but the reverse also occurred. Transitions between VMB types were common in all randomization groups. The percentage of actual transitions divided by potential transitions between D7 and M6 were 29/66 (43.9%) in the control group, 32/64 (50.0%) in the metronidazole group, 28/67 (41.8%) in the EF+ group and 24/56 (42.9%) in the GynLP group (Fisher’s exact p = 0.792).

Incidence of sexually transmitted and urinary tract infections

As expected, the incidences of HIV (n = 2), herpes simplex type 2 (n = 1), syphilis (n = 1), gonorrhea (n = 5), chlamydia (n = 6), TV (n = 5) and urinary tract infection (n = 7) were too low to determine differences between randomization groups (Table S7).

Discussion

This trial showed that all three interventions were safe. Our preliminary efficacy results confirm that intermittent use of metronidazole reduces BV recurrence10,11,12, and suggest that intermittent use of lactobacilli-containing vaginal probiotics may also reduce BV recurrence. These findings are important because some women and clinicians may prefer safe and efficacious vaginal probiotics to metronidazole since they are not expected to negatively affect other body niche microbiomes or cause antimicrobial resistance.

Our trial was funded as a pilot study and therefore had a modest sample size. Despite this, many of the preliminary efficacy associations for intermittent metronidazole and EF+ use reached statistical significance. While this was not the case for intermittent GynLP use, all of the trends in this group were in the same directions. We believe that the GynLP group suffered a few disadvantages compared to the other randomization groups, which may explain the lack of statistical significance. Randomization imbalances commonly occur in small trials30, and in our trial, this led to women in the GynLP group being more dysbiotic at baseline than controls and EF+ users. We ameliorated this disadvantage in our analysis strategy, but we may not have been able to eradicate it. In addition, GynLP users were less adherent on average than metronidazole and EF+ users. Differences in adherence using triangulated data did not reach statistical significance, but probiotic strain detection rates (58% for EF+ samples and 31% for GynLP samples using non-rarefied data) support this claim. Biose has since simplified the dosing regimen of next generation Gynophilus products to two fixed days a week to boost adherence. Finally, GynLP dosing (once every four days) was similar to metronidazole dosing (twice weekly) but less frequent than EF+ dosing (once per day for the first five days followed by thrice weekly for the remainder of the intervention period).

Probiotic detection rates were 58% for EF+ samples and 31% for GynLP samples, and inter- and intra-individual variabilities were high (with estimated probiotic concentrations ranging from zero to 6.17 log10 cells/μl). This detection variability is consistent with most other vaginal probiotic studies that used sampling at non-daily intervals and molecular assessment methods21,31,32,33. A major drawback of all of those studies, including ours, is that product use was not directly observed but self-reported, and precise information about the time period between last product insertion and sample collection was lacking. The average total bacterial load of a healthy vagina is currently not known34, but we estimate it to be in the order of 2 × 1010 bacteria (Supplement 1). The vaginal probiotic strain(s) in our trial were both applied at about 1.5 × 109 CFU per dose, which would be about 7.5% of the total vaginal bacterial load after application if all probiotic bacteria were to remain in the vagina. We detected mean relative abundances of 7.7% for EF+ strains and 15.1% for GynLP when only samples with any relevant probiotic strains detected during product use were included (thereby eliminating any potential non-adherence). A recently published study of Gynophilus Slow Release tablet (which is almost identical to GynLP) in which women self-sampled every day showed that mean estimated vaginal concentrations of Lcr35 by qPCR were between 104 and 106 CFU/μl in women who used the tablet once every four or five days35. Using our estimated concentration data, we detected a similar concentration range in samples with any GynLP detected. This consistency is reassuring, but the question remains whether probiotic concentrations of this order of magnitude optimally prevent BV recurrence in the long-term. Furthermore, all studies referenced in this paragraph, including ours, have shown that probiotic strains do not persist in the vagina after dosing has ceased. The second question then is whether the colonization capacity of probiotic bacteria should be improved. Our data suggest that probiotic lactobacilli may boost ‘natural’ lactobacilli indirectly, which may be sufficient. Indirect effects may include increased localized lactic acid production, modulation of cervicovaginal mucosal immune responses, and/or inhibition of biofilm formation, by probiotic bacteria3,36,37.

Additional limitations of our study include lack of proof that the probiotic contents (strains and doses) were in agreement with the product labels, the high urogenital infection risk of this cohort (which makes prevention more challenging), and our inability to fully control for potential confounders. However, the mixed effects models were controlled for some of the best known VMB determinants (hormonal contraception, pregnancy, sexual risk taking, and age)1,38,39,40. We were not able to exclude women with gonorrhea and/or chlamydia infection at the time of randomization due to the slow laboratory turn-around time, but the VMB compositions of women with and without infection were similar, and we therefore think that this did not negatively affect our results.

With the development of better genomic and culturing methods, we are now on the cusp of a new era in vaginal probiotic research. Past vaginal probiotic studies have shown mixed results13,14,15,16,17,18,19,20,21,22,23,24, but almost all of these studies used imprecise VMB assessments based on clinical symptoms and microscopy. The addition of sequencing methods showed that many more women than previously thought are not lactobacilli-dominated after standard antibiotic BV treatment, that host responses to antibiotic and probiotic treatment are highly variable, and that it is possible to differentiate between probiotic strains and ‘natural’ lactobacilli. Furthermore, others have shown that quantifying relative abundance data in the same manner as we have done in this study correlates well with species-specific quantitative PCRs of non-minority species27,28. This then allows for microbiota data reduction into quantitative variables that can be analyzed in mixed effects models that adjust for repeated measures and confounding.

We conclude that lactobacilli-based vaginal probiotics warrant further development to improve long-term beneficial effects. We recommend that future trials incorporate quantitative molecular methods, optimize dosing and timing of product insertion versus sample collection, and enroll women with various urogenital risk profiles. Ideally, these trials would also evaluate the effects of interventions on vaginal biofilm formation, and – eventually – the impact on pregnancy complications, HIV epidemics, and other adverse outcomes.