Introduction

There is accumulating evidence that dysfunction of the bladder sensory nerves has a primary role in pathological conditions such as overactive bladder (OAB)1,2,3 that is characterized by urgency, urinary frequency, nocturia and urgency incontinence4. Due to the lack of an established, objective and reliable clinical assessment tool, it is currently difficult to detect alterations of afferent human bladder pathways. Consequently, the pathologic mechanisms behind OAB frequently remain unknown. Methods used so far for the investigation of sensory innervation, such as current perception threshold (CPT) assessment and urodynamic investigation, largely rely on subjective or semi-objective criteria5,6. A more objective and qualitative assessment tool of afferent nerve function are sensory evoked potentials (SEP)7. While feasibility of bladder SEP recordings was already demonstrated in healthy subjects8,9,10,11,12,13 and patients14,15, previous studies of our group reported that bladder stimulation at a relatively slow frequency of 0.5 Hz/pulse width = 1 ms, led to more reproducible cortical SEPs in female and male healthy subjects in contrast to a faster frequency of 3 Hz/pulse width = 0.2 ms8,11. This could be explained by the fact that bladder afferents comprise only myelinated Aδ-fibers and unmyelinated C-fibers, while faster conducting fibers are missing16,17. Nevertheless, it is difficult to draw a conclusion about the frequency effect in view of the different pulse widths requiring further studies. To ensure SEP reproducibility, averages of at least 500 stimuli and a repetition of at least two runs have been recommended for measurements of the upper and lower limbs7. As peripheral nerve stimulation can be applied with rates as high as 3 Hz due to their fast conducting nature, the time required to obtain an average of 500 stimulations amounts to a few minutes and is thus not critical in terms of keeping stimulation conditions constant.

Considering the indirect control of electrode placement via intravesical catheters, potential changes of bladder volume with time and the longer stimulation durations (up to six-fold increase) due to lower stimulation frequencies, bladder SEP measurements are more challenging. This study was performed in order to refine the methodology and to optimize the settings such as frequency and repetition number of stimulation in order to achieve a faster and reliable evaluation of viscero-sensory afferent bladder pathways. This is important for an implementation into clinical diagnostics and to minimize measurement bias through changes that occur over time. We aimed to evaluate the impact of different stimulation frequencies below 3 Hz at a constant pulse width of 1 ms, number of stimuli/runs and low-pass filters (200 Hz versus 70 Hz as previously used in our publications8,11) on SEPs from the bladder. We hypothesized that SEP waveform, responder rate (RR), reliability and signal-to-noise ratio (SNR) can be improved with optimized measurement parameters. Post-processing using a 70 Hz versus 200 Hz low-pass filter was expected to lead to smoother but corresponding SEP waveforms despite inessentially decreased amplitudes and increased latencies. The SEP findings from the bladder dome (BD) and trigone (TG) were compared to pudendal SEPs as a clinical reference standard.

Material and Methods

This study was approved by the local ethics committee (Kantonale Ethikkommission Zürich; KEK-ZH-Nr. 2013-0518), registered at clinicaltrials.gov (Identifier: NCT02272309, date of registration: 22/10/2014), and performed in accordance with the Declaration of Helsinki. Study data were collected and managed using REDCap (Research Electronic Data Capture) electronic data capture tools18. All subjects provided written informed consent prior to inclusion.

Subjects

Healthy volunteers were recruited through postings and advertisements on internet platforms and were invited for the screening visit to the Neuro-Urology, Spinal Cord Injury Centre at Balgrist University Hospital between October 2015 and June 2017. Study subjects had to be between 18 and 40 years old as well as in good mental and physical condition. This was defined as the absence of any lower urinary tract (LUT) symptom according to International Continence Society (ICS) terminology4, any urological or neurological diseases, urinary tract infection (UTI), previous surgery for urological or neurological reasons, and any regular medication intake (except contraceptive). The screening of the subjects included a complete medical history, vital signs, physical examinations (including examination of urogenital sensation, bulbocavernosus reflex, anal reflex, anal sphincter tone, and anal squeeze response), free uroflowmetry, post-void residual, Montreal-Cognitive-Assessment (MoCA), Hospital Anxiety and Depression Scale (HADS), International Prostate Symptom Score (IPSS) and a 3-day bladder diary (BLD)19. All included subjects fulfilled predefined cut-offs (Table 1).

Table 1 Baseline characteristics (n = 40, 20 females).

In addition, standardized urological questionnaires including the International Consultation on Incontinence Modular Questionnaire modules (ICIQ-FLUTS, ICIQ-MLUTS) and the Overactive Bladder Questionnaire short-form (Swiss German OAB) were completed19.

Study design

As shown in the protocol paper of the SENSORYII project comprising 5 LUT stimulation sites19, a power analysis yielded a total inclusion number of 10 subjects per stimulation site and gender group. Following study inclusion, the volunteers were invited for two identical visits (interval of 29.4 ± 8.5 days) whereby the daytime was held constant for the measurement visits (0 – 3 h). Subjects were randomly assigned to one of the five groups receiving stimulation either at one of the bladder (BD, TG) or urethral (proximal, membranous - males only, distal) stimulation sites (allocation ratio: 1:1:1:1:1, Supplementary Figure S1). In view of the clinical need for afferent nerve function assessments of the bladder, we here focus on the bladder SEPs. At the beginning of each visit, UTI and pregnancy were excluded. All subjects were told to refrain from the consumption of caffeine and cigarettes three hours prior to the measurement and alcohol one day prior to the measurement. A follow-up interview was conducted after each measurement visit to assess wellbeing and potential adverse events of the volunteers. The last follow-up was performed in August 2017.

Electrical stimulation

A custom-made 14 French stimulation catheter (Unisensor AG, Attikon, Switzerland) was inserted transurethrally into the bladder using non-anaesthetic lubricant gel. The catheter included radiopaque platinum electrodes to precisely position the catheter under fluoroscopic guidance19. By use of the catheter, the bladder was filled with 60 mL of contrast medium (Ultravist® 150TM, Bayer AG, Switzerland) at the beginning of each bladder SEP measurement to ensure constant starting volumes. During the assessments, subjects were lying quietly in the supine position with eyes closed. Constant current stimulation was generated using a neurophysiological stimulator (Dantec Keypoint Focus, Neurolite AG, Belp, Switzerland) and applied via the stimulation catheter. Repetitive square wave stimuli of different stimulation frequencies (0.5 Hz, 1.1 Hz, 1.6 Hz; pulse width = 1 ms) were applied at the predefined stimulation site. Following a repeated-measures, randomized controlled factorial design, the order of the stimulation frequencies was pseudorandomly allocated using a computer-generated randomization list stratified on gender. The research team performed sequence generation and randomisation and was not formally blinded to group allocation. Five consecutive runs of 100 stimuli each (run1–run5) were applied per frequency. This resulted in stimulation cycle durations of 16.7 min, 7.6 min, and 5.2 min for 0.5 Hz, 1.1 Hz, and 1.6 Hz, respectively. To ensure a stable catheter position during measurements, the catheter was taped to the upper leg and/or penis. At the beginning of each assessment, CPTs were determined at least two times as described previously20. After a one-off pain threshold (PT) assessment, stimulation intensities were individually adjusted aiming at a strong but non-painful stimulation. At the end of each stimulation, the bladder was emptied. Following bladder stimulations, three measurements of somatosensory evoked potentials (SSEPs) were performed at a frequency of 3.1 Hz in a random order: tibial nerve (pulse width = 0.2 ms); pudendal nerve (pulse width = 0.2 ms); pudendal nerve (pulse width = 1.0 ms). Similar to bladder SEP measurements, five runs of 100 stimuli were applied (duration: 2.7 min) per stimulation cycle. For our analysis, pudendal SEPs with 1 ms pulse width were selected as comparator to bladder SEPs due to the same pulse width.

Recording

A surface electrode system comprising of a cap-based extended international 10–20 montage21 (Easy Cap, Easy Cap GmbH, Herrsching, Germany) was used. Electrooculogram and Electrocardiogram were also recorded. Continuous recordings of SEPs were performed from surface electrodes (Ag/AgCl) at Cz referenced to Fz using BrainVision Recorder (BrainProducts, Gilching, Germany). The ground electrode was placed at AFz position. Electrode impedances were kept below 20 kΩ. Amplification of the scalp electrode signals was performed using BrainAmp amplifier (Brain Products, Gilching, Germany). Sampling frequency was 5000 Hz and the applied analogue filter between 0.016 and 1000 Hz.

Data processing and analyses

BrainVision Analyzer2 (BrainProducts, Gilching, Germany) was used for SEP processing. Filtering involved the application of 0.5 Hz-70 Hz band-pass (infinite impulse response filters; Butterworth zero-phase shift filter; 24 dB/Oct) plus 50 Hz Notch filter. This was followed by ocular correction22 and semiautomatic artefact rejection ( ± 100 µV) including visual inspection. To assure that no relevant bladder SEP component was lost, explorative analyses were performed using a 200 Hz instead of 70 Hz low-pass filter.

Segmentation and Averaging

Bladder SEP recordings were divided into 700 ms segments (100 ms pre-stimulus, 600 ms post-stimulus), while SSEP recordings were broken down into 300 ms segments (100 ms pre-stimulus, 200 ms post-stimulus). The segments of each individual run and of concatenated runs (run1 to run2 (run1_2), run1 to run3 (run1_3), run1 to run4 (run1_4), (run1 to run5 (run1_5)) were averaged and analyzed. We aimed for SEP averages consisting of at least 70% artefact free segments for each stimulation frequency, individual run and visit. For the direct filter comparison (i.e. 200 Hz vs. 70 Hz low-pass), only bladder SEP and pudendal SEP data sets retaining 100% of the segments after raw data inspection in both filter variants were included. Additionally, the averages of odd and even segments were evaluated for each run.

In order to exclude effects of different experimental durations, separate analyses were conducted on the averaged segments of the first 300 seconds of each stimulation cycle in order to directly compare bladder stimulation frequencies, independent from different stimulation durations. Exploratory analyses were performed using baseline correction (pre-stimulus time interval -53 ms to -3 ms) prior to averaging.

Peak detection and outcome measures

An SEP was regarded ‘stable’ if the averaged SEP signal of the odd and even segments overlapped in shape and timing with respect to the SEP components. Peak detection was performed manually. The markers were set using the following criteria: a) overlapping waveform between odd and even SEP averages and b) identifiable P1, N1, P2 bladder SEP components, respectively P40, N50, P65, N85 SSEP components. On a subject level, we were careful to choose individually corresponding SEP components across runs, frequencies and visits. Markers were individually set for the two filter versions on every single run and the combined runs. Subsequently, P1, N1, and P2, respectively P40, N50, P65 and N85 peak latencies and peak-to-peak amplitudes (P1N1 and P2N1 respectively P40N50 and P65N85) were extracted. Responder rate reflects the percentage of recordings that resulted in a stable SEP with identifiable components and clear marker placement. Signal-to-noise ratio of the Cz-Fz channel was calculated from the quotient of the average signal power and the average noise power for all locations, visits, frequencies, and runs, respectively. Relative stimulation intensity was calculated by dividing absolute stimulation intensity by CPT. Produced volume represents the emptied volume after a stimulation cycle minus the starting volume of 60 mL.

Statistics

Preprocessing and statistical analyses were performed using BrainVision Analyzer 2 (BrainProducts, Gilching, Germany), RStudio (Version 1.1.453, Boston, MA, U.S.A.) and the Stata statistics software package 14.2 (StataCorp. 2015. Stata Statistical Software: Release 14. College Station, TX: StataCorp LP.). Continuous variables are presented with means and standard deviations (SD) (or median and range where appropriate). Normal distribution was tested using Shapiro-Wilk test and by visual inspection of histogram and qq-plots. For all statistical analyses, a significance level of p < 0.05 was used. Unpaired Welch’s t-test or Mann-Whitney-U tests were performed to check for gender and location differences. In order to compare baseline corrected SEP curves among the different stimulation frequencies, runs and visits including all subjects (even if marker setting was not possible), paired t-tests were calculated for each data point of the whole segment (t-curves). In graphical representations, significant t-values (two-tailed) are highlighted.

Wilcoxon signed rank tests were used to compare the two filter variants regarding SEP latency and amplitude measures. For SNR analysis, Friedman's test respectively Wilcoxon signed rank test (p-values Bonferroni corrected) were used to compare different frequencies, runs and visits.

Linear mixed modelling (LMM)

For the amplitude analysis, we considered the absolute peak-to-peak amplitudes for P1N1 and P2N1. For the latency analysis, those for P1, N1 and P2 were considered. We a priori defined the default settings of 0.5 Hz stimulation and the aggregation of 100 stimuli as comparator to the other frequencies and individual runs. We defined indicator variables for 1.1 Hz and 1.6 Hz, and eight indicator variates for other data aggregations (separate runs: second to fifth, and combined runs (run1_2), (run1_3), (run1_4) and (run1_5)). The following subjects’ characteristics were entered as additional independent variables: (age, female gender, body height, body weight, urine production volume and absolute stimulation intensity). Using this modelling set-up, we examined the effect of various measurement settings, data aggregations (runs) and subjects’ characteristics on amplitude and latency. We performed stratified analyses for visits (first vs. second), and for location (TG vs. BD). Analyses were run on complete cases only, excluding measurements with missing values. To account for repeated measurements between subjects, we introduced an indicator variate for study subject as a random factor to the model. Sensitivity analyses were performed by excluding one subject with extreme latency values and by introducing the relative instead of the absolute stimulation intensity to the model. LMMs of pudendal SEP data were performed similarly (without independent variable urine production volume) on complete cases. Due to the datamatrix, p-values should only be applied indicatively.

Reliability

Reliability across visits was analyzed using intraclass correlation coefficient (ICC, single measures, two-way random effect, and absolute agreement). The ICC values were characterized according to Cicchetti (1994)23 and consequently considered as “poor” (less than 0.40) “fair” (0.40 – 0.59), “good” (0.60 – 0.74) or “excellent” (0.75 – 1.00).

Results

Group descriptives

For the analysis of bladder SEPs, forty healthy controls were included (Supplementary Figure S1). Descriptive statistics are displayed in Table 1. All subjects tolerated the procedures well and no UTI was reported. However, 27 out of 40 subjects reported temporary self-limiting dysuria after 48 out of 80 measurements. Two subjects reported temporary self-limiting hematuria after 2 out of 80 measurements. All described symptoms were mild and expected due to irritations caused by the catheter, did not require medical consultation or treatment, and were continously declining over a short period of time, i.e. no more than 3 days.

Stimulation parameters and bladder volume are listed in Supplementary Table S1 stratified for gender, location and frequency.

The impact of filter parameters

Typical and stable SEP components (P1, N1, P2 for bladder SEPs; P40, N50, P65, N85 for pudendal SEPs respectively) were detectable for both low-pass filter variants (Supplementary Figure S2). The 200 Hz and 70 Hz low-pass filter variant revealed similar SEP curve shapes along the entire SEP segments. Based on the marker positions, Wilcoxon-signed rank tests showed no filter-specific differences for the latencies of the bladder SEP and pudendal SEP components. Analysis of peak-to-peak amplitudes revealed no significant differences for bladder SEPs but smaller amplitudes for pudendal SEPs (P40N50: V = 0, p = 0.004; P65N85: V = 0, p = 0.004) preprocessed with 70 Hz compared to 200 Hz low-pass filter. Subjectively, manual marker setting was easier using 70 Hz low-pass filter due to the smoother SEP curve compared to the 200 Hz. Consequently, and considering the integrity of all relevant SEP components, the following results are presented for the 70 Hz low-pass filter variant.

Number of segments

For the analyses of the effect of stimulation frequencies, number of runs, and visits, on the SEPs, the averages contained at least 70 % artefact free segments, except for five bladder SEP datasets which showed a lower number of valid segments in the first run (minimum of 46 %). Regarding pudendal SEPs, two datasets containing single runs with a minimum of 40 % of valid segments were included. Considering that the respective datasets showed stable SEPs, they were included in the main analysis.

The impact of stimulation frequency on SEP outcome

Typical P1, N1, P2 components were found with a 100 % RR for all three stimulation frequencies and both locations with larger peak-to-peak amplitudes when using slower stimulation frequencies (Fig. 1a,b).

Figure 1
figure 1

SEP group averages across two visits and a constant number of 500 stimuli. SEPs were recorded from Cz-Fz during stimulation at the BD (a, n = 20 subjects) and TG (b, n = 20 subjects) with the three different stimulation frequencies: 0.5 Hz (black), 1.1 Hz (dark grey) and 1.6 Hz (light grey, all without baseline correction). T-curves of differences between the stimulation frequencies (baseline corrected; 0.5 Hz-1.1 Hz: dark grey dashed line; 0.5 Hz-1.6 Hz: light grey dashed line; 1.1Hz-1.6 Hz: light grey dotted dashed line) are shown for BD (c) and TG (d). T-values above 2.021 and below -2.021 were considered as significant.

T-curves of the SEP differences between the frequencies are illustrated in Fig. 1 for BD (c) and TG (d). Significant differences between the frequencies were prominent around the N1 and P2 peak. For the N1 peak we observed differences between all frequency comparisons. For the P2 peak, significant differences were observed only between 0.5 Hz and 1.1 Hz as well as 0.5 Hz and 1.6 Hz. Slow stimulation frequency consistently presented with a negative shift during the baseline phase (e.g. 0.5 Hz SEP curves in Fig. 1a,b).

Linear mixed model revealed smaller P2N1 peak-to-peak amplitudes during stimulation with 1.1 Hz (estimates between −3.5 µV and −4.5 µV, Tables 2a) and 1.6 Hz (estimates between -5.1 µV and -5.9 µV, Table 2a) compared to 0.5 Hz. Latencies of the N1 peak were similar between 0.5 Hz and 1.1 Hz as well as 0.5 Hz and 1.6 Hz (Table 2b).

Table 2 Results of the linear mixed effect models (LMM) showing the influence of different variables on P2N1 amplitude (a) and N1 latency (b).

Results of the LMM for P1N1 amplitude as well as P1 and P2 latencies are shown in supplementary Table S2–S4. Median peak latencies for the location BD across visits and the three frequencies (500 stimuli) were 65.2 ms (47.0 to 84.2 ms) for P1, 117.5 ms (97.6 to 150.4 ms) for N1 and 250.2 ms (209.4 to 298.6 ms) for P2. Median latencies for the location TG were 57.5 ms (48.2 to 97.6 ms) for P1, 118.2 ms (94.6 to 178.4 ms) for N1 and 252.5 ms (216.2 to 304.2 ms) for P2. Similar LMM results were shown when excluding one subject with extreme values or when introducing the relative rather than the absolute stimulation intensity to the model.

Across both locations, SNR of the whole SEP curve (Cz-Fz, baseline corrected) of the first visit was significantly higher for lower frequencies (0.5 Hz: 0.14 (0.01–1.47); 1.1 Hz: 0.07 (0.01–0.59); 1.6 Hz: 0.04 (0.00–0.48); Friedman-test chi-squared: 31.65, df = 2, p < 0.001; 0.5Hz-1.1 Hz: V = 779, p < 0.001; 0.5Hz-1.6 Hz: V = 764, p < 0.001, 1.1Hz-1.6 Hz: V = 540, p = 0.25; Bonferroni corrected) compared to higher stimulation frequencies. Separate analyses over a fixed stimulation duration of 300 s showed similar frequency effects compared to a fixed number of stimuli as can be seen in Fig. 2.

Figure 2
figure 2

Baseline corrected SEP group averages across two visits after a constant stimulation duration of 300 seconds. SEPs were recorded from Cz-Fz during stimulation at the BD (a, n = 20 subjects) and TG (b, n = 20 subjects) with the three different stimulation frequencies: 0.5 Hz (black), 1.1 Hz (dark grey) and 1.6 Hz (light grey). T-curves of differences between the stimulation frequencies (baseline corrected, 0.5Hz-1.1 Hz: dark grey dashed line; 0.5Hz-1.6 Hz: light grey dashed line; 1.1Hz-1.6 Hz: light grey dotted dashed line) are shown for BD (c) and TG (d). T-values above 2.021 and below -2.021 were considered as significant.

Impact of number of stimuli on SEP outcome

Considering each of the five runs (each consisting of 100 stimuli) separately, decreasing amplitudes were observed from run 1 to 5 accompanied by decreasing RR for all locations and the three frequencies (Fig. 3). Sections with significant differences between runs are shown in Fig. 3d–f. While there were significant differences around the N1 and P2 peak between run1 and all other individual runs for the location BD, only the differences between run1, and run4 and run5, respectively, reached significance for the location TG. Pudendal SEPs showed significant reductions between run1 and run4, as well as run1 and run5, around 55 ms.

Figure 3
figure 3

SEP group averages of the individual runs across two visits recorded from Cz-Fz during stimulation with 0.5 Hz at the BD (a, n = 20 subjects) respectively TG (b, n = 20 subjects) and 3.1 Hz at the pudendal nerve (c, n = 40 subjects). Run1, run2, run3, run4, and run5 are indicated in black, red, blue, green and orange, respectively (without baseline correction). T-curves of differences between baseline corrected individual runs 2–5 to run 1 (run2-run1: red dashed, run3-run1: blue dashed, run4-run1: green dashed, run5-run1: orange dashed) are shown for BD (d), TG (e) and pudendal nerve (f). T-values above 2.021 (for pudendus: 1.990) and below -2.021 (for pudendus: -1.990) were considered as significant. Significant differences between runs are highlighted in grey. Responder rate is displayed in % across the individual runs (open symbols) as well as the combined runs (filled symbols, consider that some symbols of the different frequencies overlay) of the runs for BD (g) and TG (h) and pudendal nerve (i) with 0.5 Hz indicated by black circles, 1.1 Hz, dark grey squares, 1.6 Hz, light grey up-pointing triangles, and 3.1 Hz, black down-pointing triangles, respectively.

The analysis of the individual marker positions supported the findings of decreasing amplitudes from run1 to run5 (see estimates in Table 2a). For N1 latencies, estimates of differences were between -2.4 ms and 1.1 ms (Table 2b). For the pudendal nerve stimulation, LMM results are reported in Supplementary Table S5–S7.

Responder rate of bladder SEPs across runs is shown in Fig. 3g,h. By combination of the first two runs, RR can be maximized for all frequencies of the two locations except for the frequency 1.6 Hz during TG stimulation where the combination of three runs is necessary to reach a RR of 100%. These results were shown for both measurement visits. Responder rate of pudendal nerve stimulation could also be increased by combination of at least two runs; however, the highest RR reached was 97.5 % (Fig. 3i).

The SNR of the entire SEP curve of the first visit was significantly different between runs across both bladder locations (Friedman chi-squared = 92.23, df = 4, p = <0.001) and for pudendal SEP measurements (Friedman chi-squared = 16.24, df = 4, p = 0.003) with earlier runs showing greater SNR. See Fig. 4 for SNR changes across runs for the location BD during stimulation with 0.5 Hz.

Figure 4
figure 4

Signal-to-noise ratio (SNR) of baseline corrected data across two visits plotted for the individual (a) and combined runs (b; i.e. 1_2: averaged segments across run1 and run2) when stimulating at the BD with 0.5 Hz (n = 20 subjects).

Changes of SEP outcome measures across visits

Between the two visits, group averages showed good agreement of SEP waveform for the two bladder locations and the pudendal nerve (Fig. 5). While significant differences were reported for pudendal SEPs before P40 and between P40/N50, bladder SEPs showed significantly greater amplitudes during the second visit around the P2 peak, except for TG during 1.1 Hz stimulation.

Figure 5
figure 5

Baseline corrected SEP group averages of 200 stimuli recorded at Cz-Fz during 0.5 Hz, 1.1 Hz, and 1.6 Hz stimulation of the BD (ac), TG (d,e,f) and pudendal nerve (g, exception: 500 stimuli). Visit one and two are indicated in light blue, respectively violet. The dashed grey lines constitute the corresponding t-values (visit2-visit1; baseline corrected). T-values above 2.086 (for pudendal SEP: 2.021) and below -2.086 (for pudendal SEP: -2.021) were considered as significant. Significant differences between visits are highlighted in grey.

When taking visit as an independent variable in the LMM, latencies of the second visit were slightly shorter for both locations (N1 - estimate BD: -2.0 ms, N1 - estimate TG: -1.1 ms, P40 – estimate pudendus: -0.3 ms) while amplitudes were greater (P2N1 - estimate BD: 1.3µV, P2N1 - estimate TG: 0.8 µV, P40N50 – estimate pudendus: 0.1 µV). Across all five runs, there was no significant difference in SNR between visits for all frequencies of the two bladder locations (p > 0.114) and the pudendal nerve (p = 0.685). The ICC values for latency and amplitude measures are given in Table 3 for bladder SEP and pudendal SEP measurements.

Table 3 Reliability of SEP components across visits.

Discussion

In this cohort of young healthy subjects we could successfully record cortical SEPs during bladder stimulation with three different frequencies (0.5 Hz, 1.1 Hz, 1.6 Hz – 1 ms pulse width) and evaluate the impact of several measurement settings, i.e. frequency, number of stimuli and data acquisition, on SEP outcome. Bladder SEPs with three clearly identifiable peaks, including a prominent negative deflection (N1) between two less positive deflections (P1 and P2), were observed with a 100% RR during stimulation with all applied stimulation frequencies across 500 stimuli. For both locations, significantly greater amplitudes were measured during stimulation with 0.5 Hz compared to 1.1 Hz and 1.6 Hz. This indicates that the choice of the stimulation frequency is crucial. This significant difference in amplitude values was shown based on the marker positions and on t-curves, respectively, as well as by SNR that include all of the measurements and information about the whole length of the segment. Lower stimulation frequencies might lead to larger amplitudes because of a better susceptibility of the slow fibers in the bladder to these frequencies due to longer refractory periods. N1 latency across 500 stimuli was found to be longer (15–25 ms) than in a few previous studies with healthy subjects12,24,25, but comparable to a study measuring SEPs in healthy young men11. Most of the previous studies performed the stimulation with 0.5–3 Hz. Nevertheless, the heterogeneous study populations and inconsistencies of measurement settings are likely the main reasons that hampered a meaningful frequency comparison. For the present study we decided to keep the pulse width constant at 1 ms for a proper comparison of the frequencies. Based on the knowledge of bladder fiber constitution and results from previous studies8,11, stimulation frequency 0.5 Hz and two frequencies below 3 Hz were selected.

An electrical phenomenon clearly visible in the SEP curves measured during stimulation with slow frequencies such as 0.5 Hz (Fig. 1) is a negative shift in the electroencephalography baseline. This observation is called contingent negative variation and can be attributed to attention and arousal function26.

Depending on the stimulation frequency, the application of 500 stimuli led to quite different stimulation durations (16.7 min for 0.5 Hz, 7.6 min for 1.1 Hz and 5.2 min for 1.6 Hz). Nevertheless, the comparison of the SEP curves averaged over a constant stimulation duration of 300 seconds showed similar results for the frequency comparison (Fig. 2). Consequently, we can exclude that the SEP differences observed between the stimulation frequencies (Fig. 1) resulted from varying duration.

Our data showed a gradual decrease in amplitudes, RR and SNR across runs (Figs. 3 and 4). Nevertheless, RR can be maximized by combination of at least two runs compared to the individual runs for all frequencies and the two bladder locations. The results suggest that the total number of stimuli or runs, respectively, can be reduced from 500 stimuli (5 runs) to 200 stimuli (2 runs) in order to achieve reliable bladder SEPs and at the same time minimize acquisition time. Until now it was unclear, how many stimuli are needed to get reliable bladder SEPs. Various numbers of applied stimuli (up to 1000 stimuli) and runs were reported previously for bladder SEP measurements8,10,11.

We assume that habituation or rapidly changing bladder volumes may lead to dislocation of the stimulation catheter or afferent inhibition, causing a decrease in amplitudes. The size of the SNR is important since bladder SEPs with bigger SNR are better detectable and thereby marker setting of P1, N1, P2 is easier. Although we are not bothered by increasing bladder volume over time during pudendal nerve stimulation, a significant, albeit smaller decrease in amplitudes was found across runs. This might be due to habituation even if the stimulation duration was much shorter.

Investigating SEP latencies and amplitudes, minimal differences between visits were found. Considering inter-individual variance and pudendal SSEPs these differences are negligible.

ICC analysis revealed good to excellent reliability for latencies of both bladder locations and for all three stimulation frequencies except for BD when stimulating with 1.6 Hz (Table 3). For the location BD, stimulation with 1.6 Hz showed the lowest ICC values, while the values were quite comparable for 0.5 Hz and 1.1 Hz. For the location TG, stimulation with 0.5 Hz showed better reliability compared to 1.1 Hz and 1.6 Hz for latency and amplitude values. Reliability is slightly better for the location TG compared to BD. Although the positioning of the electrodes is more difficult in the bladder compared to the skin, the ICC values of pudendal SEP latencies were comparable to N1 and P2 reliability of the lower stimulation frequencies for both bladder locations. In line with a previous study, reliability was higher for the latencies compared to the peak-to-peak amplitudes with better ICC for N1 latencies compared to P1 and P28. The amplitudes are known to be more prone to changes compared to latencies since changing electrode impedances or varying relaxation degree of the participants can additionally influence it27. Compared to previous studies, reliability across visits was comparable or higher for all bladder SEP latencies and peak-to-peak amplitudes8,28. In addition to having a constant starting bladder volume of 60 mL, we compared the radiographs to be sure to place the catheter exactly at the same position at the two visits. Another important modification was to focus on setting the markers on the same SEP components across visits (see Fig. 3 in Knüpfer et al.11).

Previously, different low-pass filters were used for the preprocessing of bladder SEPs8,11. Our bladder SEP data do not confirm results from other fields reporting smaller amplitudes and longer latencies by decreasing low-pass filter29,30,31. However, the lack of difference in our study could be explained by the rather small difference between the two filters or to a certain part by the use of zero-phase shift filters in Vision Analyzer. Our choice of the 70 Hz low-pass filter rather affects high frequency components of the signal, which can explain the slightly smaller peak-to-peak amplitudes with the 70 Hz filter for the pudendal SEPs.

With respect to bladder SEP latencies and amplitudes, RR, reliability and SNR, our data suggest a stimulation frequency of 0.5 Hz to be the preferred technique for stimulation of the afferents of the BD and TG. These results were supported by analyses of the manually set markers and of the whole curve shape (t-curves). For the decision of the minimally required number of stimuli we have to consider the strong habituation effect and that this measurement should be applicable in patients where it will likely be more difficult to record reproducible SEPs. Although we observed the largest amplitudes and highest SNR during the first run, we may have a better RR when more stimuli are averaged so that according to the present results we would suggest a minimum of 2 runs of 100 stimuli. This is supported by the fact that a 100% RR was reached after 2 runs of stimulating with 0.5 Hz in both bladder locations, while these data showed good to excellent reliability across visits for the SEP latencies. N1 latency seems to be the robust marker with excellent reliability.

Our study provides a systematic bladder SEP evaluation in both gender groups while comparing different stimulation frequencies at two locations of the bladder. This data from healthy subjects was analyzed based on manual marker positions and on the whole SEP curve by means of standardized statistical testing (t-curves). It provides additional information on variability and potential confounding factors. The measurement of cortical SEPs elicited by bladder electrical stimulation may have the potential to serve as a neurophysiological biomarker for afferent nerve fiber function in patients with LUT symptoms such as OAB. By analysing latencies and peak-to-peak amplitudes of the SEP components one can obtain information on nerve fiber integrity and conduction velocity7. The application of bladder SEPs might be a useful amendment to findings from complementary investigations (i.e. history, neurological examination, urodynamic examination)32 as well as a surrogate marker and outcome measure for established and approved therapies targeting afferent bladder pathways. However, one should be aware of the rather high between-subject variability of bladder SEPs compared to the low variability within-subjects. Further studies testing this optimized setup in different neurological patient groups and age groups are needed and will help to define the clinical application field of this assessment tool. Additionally, consecutive investigations have to show if our bladder SEP findings hold true also for urethral stimulation sites, considering their distinct variations in afferent innervation and gender specific anatomy.

Limitations

Currently, marker setting does not work automatically and consequently has to be performed manually, which is a subjective and very time-consuming task. Furthermore, the fact that different subjects were included per location complicates the direct comparison of BD and TG. Nevertheless, doing too many subsequent measurements in the bladder could influence the results due to irritations of the mucosa as well as attentional changes and decrease of compliance of the volunteers. Correspondingly, the observed decrease in amplitudes and RR with increasing number of stimuli (runs) may indicate habituation, which may be attributed to changes in attention. This may be avoided by introduction of a short random delay between single stimuli. This would have to be tested in further studies.

Conclusion

The results of the present randomized study indicate that cortical potentials can be recorded in young healthy volunteers with high reliability and low repetition rates when stimulating the bladder with low frequencies. This potentially makes this investigation a clinically useful test for the investigation of bladder afferents. Our data clearly show that the choice of the stimulation parameters is very relevant for implementation of bladder SEPs into daily clinical practice. Based on the current results, we would recommend a stimulation frequency 0.5 Hz, because of best reproducibility, largest amplitudes and best SNR. N1 latency seems to be the most robust and reliable bladder SEP marker. The number of electrical stimuli can be reduced to 200 (2 runs of 100 stimuli) to achieve robust responses in reasonable acquisition time (400 seconds). This may constitute a good compromise between the duration of a stimulation cycle and peak-to-peak amplitudes of the SEP.