Validation of an algorithm for sound-based voided volume estimation

Jung, Gyoohwan; Ryu, Hoyoung; Lee, Jeong Woo; Jeong, Seong Jin; Margolis, Eric; Grover, Neel; Lee, Sangchul

doi:10.1038/s41598-023-50499-1

Article
Open access
Published: 02 January 2024

Gyoohwan Jung¹,
Hoyoung Ryu²,
Jeong Woo Lee³,
Seong Jin Jeong⁴,⁵,
Eric Margolis⁶,
Neel Grover⁷ &
Sangchul Lee⁴,⁵

Scientific Reports volume 14, Article number: 138 (2024)

Abstract

A voiding diary is commonly used in clinical practice to monitor urinary tract health. However, manual recording and the use of a measuring cup can cause significant inaccuracy and inconvenience. Recently, sound-based voided volume estimation algorithms such as proudP have shown potential to accurately measure the volume of a patient's urination while overcoming these inconveniences. To validate the sound-based voided volume estimation algorithm, we chose bodyweight change after urination as the reference value. A total of 508 subjects from the United States and Korea were enrolled. In all, 584 data points with matching bodyweight change data and urination sound data were collected, and fivefold cross validation was performed in order to evaluate the model on all data in the dataset. The mean voided volume estimated by the algorithm was 202.6 mL (SD: ± 114.8) while the mean bodyweight change after urination was 208.0 g (SD: ± 121.5), and there was a strong linear correlation with high statistical significance (Pearson’s correlation coefficient = 0.92, p-value < 0.001). A paired-samples equivalence test using two one-sided tests (TOST) showed equivalence with the bodyweight change data within a 10 mL margin. Additionally, a Bland–Altman plot shows a mean difference of − 5.5 mL with limits of agreement (LoA) of (− 98.0, 87.1). The results support the high performance of the algorithm across large population data from multi-site clinical trials.

Introduction

Daily tracking of voiding parameters provides important information regarding patients’ urinary health¹. In clinical practice, a voiding diary kept by the patient is a useful tool that physicians recommend and use to assess a patient's urinary health². Diary entries are typically made at home by the patient, who has to manually record the voided volume (VV) by reading the marking on a measuring cup³. The inconvenience of these multiple manual steps can contribute to poor compliance⁴. Additionally, there is a high risk of inaccuracy caused by the lack of standard measuring cups and by human error during manual reading and recording⁵.

In the past, several investigators have attempted to demonstrate the performance of sound-based estimation methods that might solve these challenges⁶–¹³. Among those sound-based estimation algorithms, proudP by Soundable Health, Inc (San Jose, CA, USA) is the only commercialized one and the most active in clinical research.

However, choosing an appropriate reference standard has proved challenging, because a commercial uroflowmeter usually requires subjects to urinate into a designated device, whereas acoustic estimation analyzes the sound of urine hitting the water surface in a toilet bowl. Conventional ultrasound bladder volume scanner results¹⁴ or bodyweight changes before and after voiding¹⁵ have been used as the reference standard. In fact, in clinical practice, urine weight is commonly used to estimate VV. For example, gravimetric uroflowmeters, one of the commercially available types¹⁶, convert urine weight into volume. As a patient urinates into a specific beaker, a weight transducer in the uroflowmeter detects the change in receptacle weight and converts it into the VV¹⁷.

Therefore, in this study, we validate the algorithm for the VV estimation by comparing with the VV converted from bodyweight changes due to urination. The conversion was based on the assumption that urine specific gravity is 1. The error from the conversion would have little effect on the clinical practice¹⁵.
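
The weight-to-volume conversion described above amounts to dividing the bodyweight change by the urine density. The following is a minimal Python sketch of that arithmetic, not the study's software: the function name, the optional `specific_gravity` parameter, and the example weights are illustrative assumptions, and the default of 1 mirrors the assumption stated in the text.

```python
def weight_change_to_volume_ml(pre_void_g: float, post_void_g: float,
                               specific_gravity: float = 1.0) -> float:
    """Convert a bodyweight drop (grams) into an estimated voided volume (mL).

    Assumes urine density equals specific_gravity * 1 g/mL, i.e. water density
    is approximated as 1 g/mL. The default value mirrors the study's assumption.
    """
    if specific_gravity <= 0:
        raise ValueError("specific_gravity must be positive")
    weight_change_g = pre_void_g - post_void_g
    return weight_change_g / specific_gravity


# Illustrative values only (not study data): a 210 g drop in bodyweight.
volume_sg_1 = weight_change_to_volume_ml(70_210.0, 70_000.0)

# Same drop for a hypothetical concentrated sample (specific gravity 1.03).
volume_sg_103 = weight_change_to_volume_ml(70_210.0, 70_000.0,
                                           specific_gravity=1.03)

# Relative overestimate introduced by assuming a specific gravity of 1.
relative_error = (volume_sg_1 - volume_sg_103) / volume_sg_103
```

With these illustrative numbers the overestimate equals the deviation of the specific gravity from 1 (about 3%), which gives a sense of the size of the conversion error.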

Materials and methods

Ethics statement

This study was approved by the local Institutional Review Board of Seoul National University Bundang Hospital (IRB No. B-2012-654-305) and Western Institutional Review Board Copernicus Group (IRB No. 20215311). All data used for analysis were anonymized. We obtained informed consent from all patients enrolled in the study. Personal identifiers were completely removed and the data were analyzed anonymously. All methods were performed in accordance with relevant guidelines and regulations.

Study population and definitions

Healthy volunteers or patients who were aged over 18 years and able to provide informed consent to participate were eligible. Data collection ran from September 27, 2021 to October 27, 2022 (Study 1), and from October 27, 2021 to July 14, 2022 (Study 2). Data were collected in the bathrooms at the hospital or clinic. Participants were allowed to provide multiple voiding sounds, and each void was registered as an independent event regardless of who provided it.

Exclusion criteria were retracted consent, the lack of matching data for either voiding sound or bodyweight change, and a bodyweight change outside the range supported by the sound-based algorithm (under 10 g or over 1 kg). Additionally, recordings that failed to follow the study instructions were excluded, for example: poor, incomplete, or interrupted recording of the voiding sound; voiding onto anything other than the water in a toilet bowl; and changes in conditions that can affect bodyweight, such as consumption or excretion of food, or the addition or removal of items carried by the subject.
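
The exclusion criteria above can be read as a simple per-record filter. The sketch below is one way to express them in Python; the `VoidRecord` class and all of its field names are hypothetical (the study does not describe its data schema), and only the 10 g and 1 kg bounds come from the text.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class VoidRecord:
    # Hypothetical schema, introduced only for illustration.
    consent_retracted: bool
    has_sound: bool
    weight_change_g: Optional[float]  # None when bodyweight data are missing
    followed_instructions: bool       # False for poor or interrupted recordings, etc.


MIN_CHANGE_G = 10.0    # changes under 10 g are excluded
MAX_CHANGE_G = 1000.0  # changes over 1 kg are excluded


def is_eligible(rec: VoidRecord) -> bool:
    """Return True when a record passes every exclusion criterion."""
    if rec.consent_retracted or not rec.has_sound or rec.weight_change_g is None:
        return False
    if not (MIN_CHANGE_G <= rec.weight_change_g <= MAX_CHANGE_G):
        return False
    return rec.followed_instructions


# Illustrative records only (not study data).
records = [
    VoidRecord(False, True, 210.0, True),   # kept
    VoidRecord(False, True, 5.0, True),     # excluded: under 10 g
    VoidRecord(False, False, 180.0, True),  # excluded: no matching sound
    VoidRecord(False, True, 300.0, False),  # excluded: instructions not followed
]
included = [r for r in records if is_eligible(r)]
```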

Procedure

Written informed consent was obtained from all enrolled subjects prior to data collection. Subjects were asked to complete a questionnaire with basic demographic questions including medical history. Immediately after the pre-void bodyweight was measured, subjects recorded the voiding sound using an iOS mobile application developed solely for data collection, which was immediately followed by the post-void bodyweight measurement.

Data collection

Urination recording was conducted using an iOS mobile application developed solely for data collection (Fig. 1). The application was installed on iPhone XR and iPhone 12 devices (Apple Inc., Cupertino, CA, USA).

Subjects were weighed with a CAS HB-150, a high-resolution weight scale with a readability of 10 g and a minimum and maximum capacity of 500 g and 150 kg, respectively.

Because we did not control water intake or the timing or interval of urination, all of which can affect the voided volume, the voided volume varied greatly even within individuals; each urination was therefore regarded as independent.

Voided volume prediction model and evaluation

Fivefold cross validation was performed in order to evaluate the model on all data in the dataset. A urine sound waveform is transformed into a mel-spectrogram, which is then fed as input to a 2D-CNN model for training. In addition, a frequency masking method that masks up to 25% of the mel-spectrogram in the frequency domain is applied during pre-processing to mitigate overfitting caused by the small training set. The output of the model is the voided volume. A schematic diagram of the voided volume estimation is shown in Supplementary Fig. S1.
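
Two of the steps above, frequency masking and the fivefold split, can be sketched in a few lines of Python. This is an illustration under stated assumptions rather than the authors' pipeline: the function names are hypothetical, the spectrogram is a plain list of rows to keep the example dependency-free, the masked band is filled with zeros (the fill value is not specified in the text), and the split is made per void because each void is treated as independent.

```python
import random


def frequency_mask(mel_spec, max_fraction=0.25, rng=None):
    """Return a copy of mel_spec with one random band of mel bins zeroed.

    mel_spec is a list of rows, one row per mel bin (frequency axis first).
    The band width is drawn uniformly from 0..floor(max_fraction * n_mels).
    """
    if rng is None:
        rng = random.Random()
    n_mels = len(mel_spec)
    max_width = int(max_fraction * n_mels)
    width = rng.randint(0, max_width)
    start = rng.randint(0, n_mels - width)
    masked = [row[:] for row in mel_spec]  # copy so the input stays untouched
    for i in range(start, start + width):
        masked[i] = [0.0] * len(masked[i])
    return masked


def five_fold_splits(n_samples, n_folds=5, seed=0):
    """Yield (train_indices, test_indices); every sample is tested exactly once."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    folds = [indices[i::n_folds] for i in range(n_folds)]
    for k in range(n_folds):
        test = folds[k]
        train = [idx for j, fold in enumerate(folds) if j != k for idx in fold]
        yield train, test


# Illustrative input: a dummy spectrogram with 40 mel bins and 4 time frames.
dummy_spec = [[1.0] * 4 for _ in range(40)]
augmented = frequency_mask(dummy_spec, rng=random.Random(1))

# 584 data points, as in the study; every index lands in one test fold.
splits = list(five_fold_splits(584))
```

Pooling the test-fold predictions from all five splits yields one out-of-sample estimate per data point, which is how cross validation allows the model to be evaluated on all data in the dataset.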

Statistical analysis

A paired-samples t-test for equivalence was used to evaluate the statistical significance of any differences between the VV calculated from the bodyweight change after urination and the VV estimated using the iOS collection application. The two measurements are considered statistically equivalent if the 95% confidence interval of the mean difference lies within the pre-defined equivalence margin.

To show equivalence, H0 and H1 are set as below.

H0: |VV_pred − VV_bodyweight change| ≥ δ
H1: |VV_pred − VV_bodyweight change| < δ

As the null hypothesis (H0) consists of two one-sided hypotheses (difference ≥ + δ or difference ≤ − δ), the 'two one-sided tests (TOST)' method is used for equivalence testing. The p-value for the test as a whole is defined as the maximum of the two one-sided p-values. If the 95% CI of the difference lies within the equivalence margin (− δ, + δ), the two measurements are considered equivalent¹⁸. The statistical analysis and calculations were performed using the Python™ v3.6.9 programming language (Python Software Foundation, Beaverton, OR, USA) with its scientific computing package SciPy v1.5.4, and R version 4.3.1.
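
The TOST procedure described above can be sketched as follows. This is a simplified illustration, not the authors' SciPy or R code: the function name `tost_paired` is hypothetical, and the p-values use a standard normal approximation to the t-distribution (reasonable only for large samples) so that the example needs nothing beyond the Python standard library. The input values are synthetic.

```python
from math import sqrt
from statistics import NormalDist, mean, stdev


def tost_paired(pred, ref, margin):
    """Two one-sided tests for equivalence of paired measurements.

    Returns (mean_difference, p_lower, p_upper, p_max). Equivalence within
    +/- margin is supported when p_max is below the chosen alpha.
    Uses a normal approximation, so it is only suitable for large samples.
    """
    diffs = [p - r for p, r in zip(pred, ref)]
    n = len(diffs)
    d_bar = mean(diffs)
    se = stdev(diffs) / sqrt(n)  # standard error of the mean difference
    z = NormalDist()
    # One-sided null "mean difference <= -margin": small p rejects it.
    p_lower = 1.0 - z.cdf((d_bar + margin) / se)
    # One-sided null "mean difference >= +margin": small p rejects it.
    p_upper = z.cdf((d_bar - margin) / se)
    return d_bar, p_lower, p_upper, max(p_lower, p_upper)


# Synthetic example: estimates within a few mL of the reference.
reference = [200.0] * 300
close_estimates = [200.0 + (i % 3) for i in range(300)]
_, _, _, p_equivalent = tost_paired(close_estimates, reference, margin=10.0)

# Synthetic example: estimates biased by about 20 mL, outside the margin.
biased_estimates = [220.0 + (i % 3) for i in range(300)]
_, _, _, p_biased = tost_paired(biased_estimates, reference, margin=10.0)
```

Taking the maximum of the two one-sided p-values means that both one-sided null hypotheses must be rejected before equivalence is claimed.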

Results

A total of 527 subjects volunteered for this study, including 300 subjects from Study 1 and 227 subjects from Study 2. After excluding 19 participants who voluntarily decided to discontinue their participation, a total of 508 subjects were enrolled in the study.

A total of 663 data points were collected from the 508 enrolled subjects. After excluding 79 data points that did not meet the inclusion criteria, a total of 584 data points were included in the final analysis. A detailed description of the excluded data points is given in Table 1.

Table 1 Summary of data collection.

The mean age across the included data points was 60.61 years (SD: ± 15.24). The mean age by phone model is shown in Table 2. The mean VV obtained using the iOS collection application was 202.6 mL (SD: ± 114.8), while the mean bodyweight change after urination was 208.0 g (SD: ± 121.5) (Table 3; Fig. 2). The statistical analysis shows a strong linear correlation between the two measurements (Pearson’s correlation coefficient = 0.92, p-value < 0.001) (Fig. 3).
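
For readers who want to reproduce this kind of comparison on their own paired data, Pearson's correlation coefficient can be computed directly from its definition. The sketch below is illustrative only: the function name is hypothetical and the input values are made up, not taken from the study.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson's correlation coefficient between two equal-length sequences."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sums of cross-products and squares of deviations from the means.
    s_xy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    s_xx = sum((a - mean_x) ** 2 for a in x)
    s_yy = sum((b - mean_y) ** 2 for b in y)
    return s_xy / sqrt(s_xx * s_yy)


# Made-up paired values: reference volume (mL) and an estimate with small errors.
reference_ml = [100.0, 200.0, 300.0, 400.0]
estimated_ml = [110.0, 190.0, 310.0, 390.0]
r = pearson_r(reference_ml, estimated_ml)
```

A high correlation shows that the two measurements move together, but it does not by itself show agreement, which is why the equivalence test and the Bland–Altman analysis are reported as well.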

Table 2 Summary of distribution of ages of data set.

Table 3 Summary of set.

Because the scale used in this study to measure bodyweight change has a resolution of 10 g, 10 mL was set as the equivalence margin in the following analyses. As shown in Fig. 4 and Table 4, the 95% CI of the mean difference (− 8.8 mL, − 2.2 mL) is within the equivalence margin (− 10 mL, + 10 mL), and the maximum p-value of the TOST results (0.0103002) is smaller than 0.05. Therefore, the results demonstrate statistical equivalence between the two measurements. Additionally, we analyzed the data with a Bland–Altman plot, which shows the distribution of differences between the two measurements relative to the limits of agreement (LoA). The mean difference was − 5.5 mL with LoA (− 98.0, 87.1) (Fig. 5).
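
The limits of agreement in a Bland–Altman analysis are conventionally the mean difference plus or minus 1.96 standard deviations of the differences. The sketch below shows that calculation; the function names are hypothetical, and the summary-statistic example uses the rounded mean difference (− 5.5 mL) and the SD of the differences (47.2 mL) quoted in the Discussion, so it can only approximate the reported limits.

```python
from statistics import mean, stdev


def bland_altman(pred, ref, z=1.96):
    """Return (bias, lower_limit, upper_limit) for paired measurements."""
    diffs = [p - r for p, r in zip(pred, ref)]
    bias = mean(diffs)
    sd = stdev(diffs)
    return bias, bias - z * sd, bias + z * sd


def limits_from_summary(bias, sd, z=1.96):
    """Limits of agreement from an already computed bias and SD."""
    return bias - z * sd, bias + z * sd


# Rounded summary values quoted in this paper (mL).
lower, upper = limits_from_summary(-5.5, 47.2)
```

With these rounded inputs the limits come out at roughly − 98.0 and 87.0 mL, close to the reported LoA of (− 98.0, 87.1); the small gap is presumably due to rounding of the published summary values.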

Table 4 Summary of two one-sided-tests (TOST) results.

Discussion

The results of this study support the use of a sound-based voided volume estimation algorithm for accurately and conveniently collecting VV as a mobile voiding diary.

In this study, the Bland–Altman plot shows a mean difference of − 5.5 mL with limits of agreement of (− 98.0, 87.1), which is highly acceptable when compared to reference data. Previous studies support that this level of difference between the two measurements is highly acceptable in clinical practice. For example, C. Palnaes Hansen and P. Klarskov assessed the distribution of differences between the data in a voiding diary manually recorded by patients and the actual volume of urine collected for 24 h and recorded by a nurse¹⁹. The Bland–Altman plot for average urine volume during 24 h shows a limit of agreement of about 70 mL, which is similar to the result for the voided volume of each void in this study. In another study, D. R. Small et al. evaluated the measurement error of a portable bladder scanner, whose use in the clinic to estimate the post-void residual is well established. The average difference was 16.7 mL (SD: 50.2 mL), much greater than the average difference of − 5.5 mL (SD: 47.2 mL) in this study²⁰. The proudP VV estimation algorithm is based on the same AI architecture as used in this paper, but is trained on a larger scale with more diverse data. Therefore, the proudP application is expected to provide an accurate VV estimation while significantly enhancing user convenience by enabling mobile app-based, at-home measurements.

This study has a few limitations. First, because the sound of urination into a commercial uroflowmeter differs from the sound of urine hitting the water surface in a toilet bowl, we chose the bodyweight change before and after urination, measured with a high-resolution weight scale, as the reference value instead of measuring the volume of urine directly. Accordingly, it was important to prevent any other actions that could affect bodyweight between the pre-void and post-void measurements, and additional effort was needed to control this and to check compliance. Second, if the assumptions made when converting urine weight to volume differ from the actual values, additional errors may occur in individual results. However, load cell and spinning disk uroflowmeters, which calculate VV by assuming that the density of urine is approximately 1 g/mL, are already widely used in clinical practice²¹. Third, the training and performance evaluation of the AI model are based on data collected from the limited number of clinic toilets in this clinical trial. Therefore, we cannot guarantee the same performance when the measuring environment changes. In terms of the measurement device, although it is difficult to generalize because there are only two combinations of environment and phone model, there was no significant difference in the mean difference and only a slight difference in the LoA, as shown in Supplementary Table S1 and Fig. S1. However, it is difficult to identify the exact cause with the current data, and we intend to verify it in future work. Finally, when urine falls on the toilet walls or into a urinal without water, totally different sounds are produced, and this model cannot guarantee high accuracy for these sounds. Therefore, we limit the use of this deep learning (DL) model to urination onto water, which male users can easily check as they stand in front of a toilet.

Conclusions

The validation in this study demonstrates that the sound-based voided volume estimation algorithm provides highly accurate estimates of voided volume when compared to the bodyweight change before and after urination across large population data from multi-site clinical trials. Additionally, it will enhance patient convenience, as it eliminates the need for manual recording of voiding activities, the associated potential errors, and the inconvenience of carrying and using a voiding beaker. The ability to track daily voiding activities simply by using a sound-based mobile app will likely improve patient compliance as well.

Data availability

The datasets analysed during the current study are available from the corresponding author on reasonable request.

Abbreviations

TR: Training set
TE: Test set
VV: Voided volume
SD: Standard deviation
TOST: Two one-sided tests
LoA: Limit of agreement
df: Degrees of freedom

References

1. Gacci, M. et al. European association of urology guidelines on male urinary incontinence. Eur. Urol. 82, 387–398 (2022).
2. Homma, Y. et al. Assessment of overactive bladder symptoms: Comparison of 3-day bladder diary and the overactive bladder symptoms score. Urology 77, 60–64 (2011).
3. van Brummen, H. J., Heintz, A. P. M. & van der Vaart, C. H. The association between overactive bladder symptoms and objective parameters from bladder diary and filling cystometry. Neurourol. Urodyn. 23, 38–42 (2004).
4. Golomb, J., Lindner, A., Siegel, Y. & Korczak, D. Variability and circadian changes in home uroflowmetry in patients with benign prostatic hyperplasia compared to normal controls. J. Urol. 147, 1044–1047 (1992).
5. Summers, S. J. et al. Male voiding behavior: Insight from 19,824 at-home uroflow profiles. J. Urol. 205, 1126–1132 (2021).
6. Zvarova, K. et al. Recording urinary flow and lower urinary tract symptoms using sonouroflowmetry. Can. J. Urol. 18, 5689–5694 (2011).
7. Krhut, J. et al. Comparison between uroflowmetry and sonouroflowmetry in recording of urinary flow in healthy men. Int. J. Urol. 22, 761–765 (2015).
8. Arjona, L., Enrique Diez, L., Bahillo Martinez, A. & Arruza Echevarria, A. UroSound: A smartwatch-based platform to perform non-intrusive sound-based uroflowmetry. IEEE J. Biomed. Health Inform. 1, 1 (2022).
9. Lee, Y. J., Kim, M. M., Song, S. H. & Lee, S. A novel mobile acoustic uroflowmetry: Comparison with contemporary uroflowmetry. Int. Neurourol. J. 25, 150–156 (2021).
10. Choo, M. S., Ryu, H. Y. & Lee, S. Development of an automatic interpretation algorithm for uroflowmetry results: Application of artificial intelligence. Int. Neurourol. J. 26, 69–77 (2022).
11. El Helou, E. et al. Mobile sonouroflowmetry using voiding sound and volume. Sci. Rep. 11, 11250 (2021).
12. Dawidek, M. T., Singla, R., Spooner, L., Ho, L. & Nguan, C. Clinical validation of an audio-based uroflowmetry application in adult males. Can. Urol. Assoc. J. 16, E120–E125 (2022).
13. Lee, H. J. et al. Development and validation of a deep learning system for sound-based prediction of urinary flow. Eur. Urol. Focus (2022).
14. Kim, H. et al. Validation of mobile application measuring voided volume: a pilot prospective study. J. Urol. 207, E482–E483 (2022).
15. Takai, S., Matsukawa, Y., Hashizume, N. & Gotoh, M. A small pilot study to evaluate the accuracy and feasibility of a novel automated voiding diary device for recording urine output measurements. Neurourol. Urodyn. 40, 272–277 (2021).
16. Chun, K., Kim, S. J. & Cho, S. T. Noninvasive medical tools for evaluating voiding pattern in real life. Int. Neurourol. J. 21, S10–16 (2017).
17. Laborie UROCAP™ IV Standard model owner’s manual (2021).
18. Mara, C. A. & Cribbie, R. A. Paired-samples tests of equivalence. Commun. Stat. Simul. C 41, 1928–1943 (2012).
19. Palnaes Hansen, C. & Klarskov, P. The accuracy of the frequency-volume chart: comparison of self-reported and measured volumes. Br. J. Urol. 81, 709–711 (1998).
20. Small, D. R., Watson, A. & McConnachie, A. A quantitative comparison of four current portable ultrasound bladder scanners. Br. J. Med. Surg. Urol. 1, 35–40 (2008).
21. Gammie, A. et al. International continence society guidelines on urodynamic equipment performance. Neurourol. Urodyn. 33, 370–379 (2014).

Acknowledgements

The authors declare that they did not receive institutional, private and/or corporate financial support.

Funding

This work was supported by the Korea Medical Device Development Fund grant funded by the Korea government (the Ministry of Science and ICT, the Ministry of Trade, Industry and Energy, the Ministry of Health and Welfare, the Ministry of Food and Drug Safety) (Project Number: 1711138269, RS-2020-KD000141) (NTIS, RS-2020-KD000141). This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (No. NRF-2020R1F1A1072702).

Author information

Authors and Affiliations

Department of Urology, Hanyang University College of Medicine, 222, Wangsimni-ro, Seongdong-gu, Seoul, Korea
Gyoohwan Jung
Department of Urology, Ewha Womans University College of Medicine, 52, Ewhayeodae-gil, Seodaemun-gu, Seoul, Korea
Hoyoung Ryu
Department of Urology, Kyung Hee University Medical Center, Kyung Hee University College of Medicine, 7-13, Kyungheedae-ro 6-gil, Dongdaemun-gu, Seoul, Korea
Jeong Woo Lee
Department of Urology, Seoul National University Bundang Hospital, 173-82, Gumi-ro, Bundang-gu, Seongnam-si, Gyeonggi-do, Korea, 13620
Seong Jin Jeong & Sangchul Lee
Department of Urology, Seoul National University College of Medicine, 103 Daehakro, Seoul, 03080, Korea
Seong Jin Jeong & Sangchul Lee
Hackensack Meridian School of Medicine, 340, Kingland St., Nutley, NJ, 07110, USA
Eric Margolis
New Jersey Urology, Englewood, NJ, USA
Neel Grover

Contributions

G.J.: data collection, data analysis, manuscript writing. H.R.: data collection, project development, manuscript editing. J.W.L.: data collection, data management, data analysis. S.J.J.: data management, data analysis. E.M.: data collection, data management, data analysis. N.G.: data collection, data management, data analysis. S.L.: project development, data collection, manuscript editing. All authors contributed to and approved the final manuscript. All authors consented to the publication of this study.

Corresponding author

Correspondence to Sangchul Lee.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Jung, G., Ryu, H., Lee, J.W. et al. Validation of an algorithm for sound-based voided volume estimation. Sci Rep 14, 138 (2024). https://doi.org/10.1038/s41598-023-50499-1

Received: 27 February 2023
Accepted: 20 December 2023
Published: 02 January 2024
DOI: https://doi.org/10.1038/s41598-023-50499-1
