Cough sound-based estimation of vital capacity via cough peak flow using artificial neural network analysis

Umayahara, Yasutaka; Soh, Zu; Furui, Akira; Sekikawa, Kiyokazu; Imura, Takeshi; Otsuka, Akira; Tsuji, Toshio

doi:10.1038/s41598-023-35544-3

Download PDF

Article
Open access
Published: 25 May 2023

Cough sound-based estimation of vital capacity via cough peak flow using artificial neural network analysis

Yasutaka Umayahara¹,
Zu Soh²,
Akira Furui²,
Kiyokazu Sekikawa³,
Takeshi Imura¹,
Akira Otsuka¹ &
…
Toshio Tsuji²

Scientific Reports volume 13, Article number: 8461 (2023) Cite this article

596 Accesses
Metrics details

Subjects

Abstract

This study presents a novel approach for estimating vital capacity using cough sounds and proposes a neural network-based model that utilizes the reference vital capacity computed using the lambda-mu-sigma method, a conventional approach, and the cough peak flow computed based on the cough sound pressure level as inputs. Additionally, a simplified cough sound input model is developed, with the cough sound pressure level used directly as the input instead of the computed cough peak flow. A total of 56 samples of cough sounds and vital capacities were collected from 31 young and 25 elderly participants. Model performance was evaluated using squared errors, and statistical tests including the Friedman and Holm tests were conducted to compare the squared errors of the different models. The proposed model achieved a significantly smaller squared error (0.052 L², p < 0.001) than the other models. Subsequently, the proposed model and the cough sound-based estimation model were used to detect whether a participant’s vital capacity was lower than the typical lower limit. The proposed model demonstrated a significantly higher area under the receiver operating characteristic curve (0.831, p < 0.001) than the other models. These results highlight the effectiveness of the proposed model for screening decreased vital capacity.

Area under the expiratory flow-volume curve: predicted values by artificial neural networks

Article Open access 06 October 2020

A novel automatic cough frequency monitoring system combining a triaxial accelerometer and a stretchable strain sensor

Article Open access 11 May 2021

A noval pulmonary function evaluation method based on ResNet50 + SVR model and cough

Article Open access 12 December 2023

Introduction

Vital capacity is a fundamental parameter used to properly interpret lung function in clinical practice. The loss of functioning lung parenchyma contributes to decreased vital capacity in many nonobstructive lung disorders¹. Moreover, vital capacity provides prognostic information and is associated with increased mortality in the elderly population². Conventionally, a spirometer is generally used (Fig. 1a) to measure vital capacity; however, this approach is expensive and inconvenient because these devices must be used in hospital settings. Thus, home-based respiratory function monitoring has attracted considerable attention^3,4. Moreover, this vital capacity measurement method has been improved and uses a smartphone connected to a device such as a flow sensor held in the mouth by the subject^5,6. However, a flow sensor is required for measurement and requires a mouthpiece and a filter to prevent infection, which is costly. Thus, home respiratory function monitoring would be easier and cheaper if vital capacity could be estimated without requiring a device that touches the subject’s mouth.

Vital capacity estimation has been studied for a long time. In a study published in 1948, Baldwin⁷ developed a multiple regression equation for predicting vital capacity, which depended on the individual’s characteristics, such as gender, age, and height. In 2006, the World Health Organization (WHO)⁸ deprecated the use of regression curves to predict references for biological measurements and recommended using the lambda-mu-sigma (LMS) method. This method allows simultaneous modelling of the skewness (lambda), which models the departure of the variables from normality using a Box‒Cox transformation, the mean (mu), and the coefficient of variation (sigma), for the analysis of its recently published growth standard. In 2012, the Global Lung Function Initiative announced the global lung function 2012 equations derived using the LMS method⁹. In this way, methods for calculating the reference value of vital capacity have been established and used all over the world^10,11,12. The common point of each method is that aspects of the subject’s physical attributes, such as gender, age, and height, are used as explanatory variables. However, the vital capacity estimated using the LMS method (VC_LMS) does not represent actual vital capacity for each individual and is merely a reference value based on the age and height of the individual¹³.

Previous studies have shown that vital capacity is related to cough strength, such as cough peak flow, which can be measured by a spirometer or peak flow metre^14,15. The cough peak flow is an index of airway clearance ability and is related to the cough sound pressure level^16,17, which can be easily measured by various microphones, such as condenser microphones, microphones in headsets¹⁷ and those built into smartphones ¹⁵. Moreover, the cough peak flow can be estimated via the cough sound pressure level^17,18, which is referred to as the cough peak flow computed via cough sounds in this study. We hypothesize that the actual vital capacity of each individual could be estimated by using physical functions such as the cough peak flow computed via cough sound, which are related to both vital capacity and the subject’s physical attributes. If vital capacity can be estimated using cough sounds, flow sensors, mouthpieces, and filters would be unnecessary. More importantly, abnormal decreases in vital capacity could be detected by comparing the estimated vital capacity and the lower limit of the normal vital capacity⁹, which can be calculated by using the LMS method because pulmonary function varies with age, height, sex and ethnicity⁹.

Therefore, the purpose of this study was to estimate vital capacity using cough sounds (Fig. 1b). We employed an artificial neural network to estimate the vital capacity using the cough peak flow computed via cough sounds and VC_LMS as inputs. Because it is well known that vital capacity changes nonlinearly with age and height, we hypothesized that the artificial neural network, which uses nonlinear transformations¹⁹ to estimate vital capacity, could be advantageous. The estimated vital capacity was then used to detect the decrease in vital capacity below the lower limit of the normal vital capacity.

Materials and methods

Participants and inclusion criteria

Table 1 shows the participants’ characteristics. A total of fifty-six participants were included. Twenty-five elderly (10 male and 15 female) and 31 young (19 male and 12 female) adults participated in the experiment. The elderly participants, aged 70 to 91, lived at home where they had been receiving routine healthcare services through private arrangements. The following were exclusion criteria: a history of lung disease, institutionalization, terminal illness, unstable acute or chronic disease, a score of less than 23 on the Mini-Mental State Examination²⁰, inability to give informed consent, inability to walk independently or use of a cane, and neuromusculoskeletal impairment. The young group consisted of self-reported healthy participants with no previous cardiovascular or pulmonary diseases. Participants who failed to manoeuvre the respiratory function test were excluded.

Table 1 Characteristics of the participants.

Full size table

Ethical approval

This study was conducted in accordance with the amended Declaration of Helsinki. The Hiroshima Cosmopolitan University Institutional Review Board (No. 20200305) approved the protocol, and written informed consent was obtained from all participants.

Pulmonary function testing

Pulmonary function tests, as shown in Fig. 1a, were performed using a spirometer (Autospiro AS-507; Minato Medical Science Co., Ltd., Osaka, Japan) with the participants in a sitting position according to ATS/ERS guidelines²¹. Vital capacity was determined as the largest value from at least three acceptable manoeuvres. This measured vital capacity was utilized as the estimation target for the proposed model, as explained in subsequent sections. In addition, before the respiratory function test and cough sound measurement, an interview was conducted to check for respiratory symptoms such as acute upper respiratory tract infection or changes in physical condition. Moreover, the order of the respiratory function test and cough test was randomly assigned to minimize the effects of bias due to the measurement order, and the interval between the two tests was one week.

Cough sound measurement system

Figures 1b,c show the cough sound measurement system. The experiment was performed as previously described¹⁷ using an in-ear microphone to measure cough sounds. A previous study on cough peak flow estimation via cough sound measurements reported that an in-ear microphone is suitable for measuring cough sounds due to the constant distance between the mouth of the sound source and the microphone¹⁷. The electret condenser microphone (in-ear microphone, ECM-TL3; Sony Corporation, Japan) was attached to the right ear canal. The measured sound signals were digitized using a 16-bit analogue-to-digital converter (PowerLab16/35, AD Instruments, Inc., Dunedin, New Zealand) at a 100 kHz sampling rate set by analysis software (LabChart version 8, AD Instruments, Inc.), and stored on a personal computer. The digitized cough sound signal was band-pass filtered between 140 and 2000 Hz to minimize artefacts caused by heart sounds and muscle interference (see Fig. 1c).

Cough sound measurement protocols

Following thorough instructions on the coughing method provided to participants, three trials of maximal voluntary coughing were performed during each 20-s measurement period. Adequate rest periods were provided between each trial to minimize the potential impact of fatigue.

Feature extraction

A respiratory physiotherapist with expertise in respiratory diseases extracted a 5-s segment of cough sound from each 20-s measurement period. The selected 5-s segment included the maximum cough sound in all cases. The sound pressure level, measured in dB, was subsequently determined using the following equation:

$$L_{P} \left( t \right) = 20\log_{10} \left\{ {\frac{{V_{r} \left( t \right)}}{{P_{0} V_{s} }}} \right\},$$

(1)

where ${V}_{r}\left(t\right)$ represents the measured voltage value, t is the discrete measurement time in the cough sound period, ${P}_{0}=20$ µ Pa is the reference sound pressure, and ${V}_{s}={10}^{\left(S/20\right)}$ is the voltage output per Pa. Here, $S=-35.0\mathrm{ dB }(0\mathrm{ dB }= 1\mathrm{ V}/1\mathrm{ Pa})$ is the sensitivity of the in-ear microphone. The maximum sound pressure level was calculated for each acceptable trial as follows:

$$L_{p,}^{\left( i \right)} = \max \left( {L_{P} \left( t \right)} \right) ,$$

(2)

where the superscript (i) indicates the trial. Finally, the cough sound pressure level $SPL$ was determined based on at least three acceptable trials as follows:

$$SPL = \max \left( {L_{{p,{\text{max}}}}^{\left( i \right)} } \right),$$

(3)

Cough peak flow is a cough strength parameter that can be estimated via cough sounds, namely, CPS^16,17. Specifically, it is calculated based on the cough sound pressure level and participant age by using the following Equation¹⁸:

$$CPS = \left( {a_{1} + a_{2}\,age} \right) \left( {e^{\beta SPL} - 1} \right),$$

(4)

where ${a}_{1}, {a}_{2}$ and $\beta$ are constant parameters determined based on a nonlinear optimization scheme¹⁸.

Proposed model

A previous study reported that vital capacity is related to cough strength¹⁵. We hypothesized that the measured vital capacity can be estimated by correcting the VC_LMS value, which reflects the height and age of a subject, using the cough peak flow computed via cough sounds. Here, VC_LMS in Liter can be estimated based on the previous literature¹², as shown in Eqs. (5) and (6):

$${\text{Male}}:\,VC_{LMS} = {\text{exp}}\left( { - 8.8317 + 2.1043\,{\text{ln}}\left( h \right) - 0.1382\,{\text{ln}}\left( a \right) + m - s} \right),$$

(5)

$${\text{Female}}:\,VC_{LMS} = {\text{exp}}\left( { - 8.0707 + 1.9399\,{\text{ln}}\left( h \right) - 0.1678\,{\text{ln}}\left( a \right) + m - s} \right),$$

(6)

where h represents the participant’s height, a represents their age and m-s is the age-specific contribution from the spline function⁹.

To examine the linear relationships between the measured vital capacity and cough peak flow computed via cough sounds, partial correlation analysis was carried out. We then proposed the use of a neural network-based model to estimate the measured vital capacity from VC_LMS and the cough peak flow computed via cough sounds (see Fig. 1c and Eq. 4). To estimate the measured vital capacity, a three-layer feedforward perceptron was employed, which was composed of an input layer using an identity activation function, a hidden layer using a hyperbolic tangent function, and an output layer using an identity activation function. The number of units in the input layer was equivalent to the dimension of the input vector ${\varvec{I}}={\left[CPS, {VC}_{\mathrm{LMS}}\right]}^{T}\in {\mathbb{R}}^{2}$. The output layer, which was used to estimate the vital capacity, was configured with a single unit. The number of units in the hidden layer was set as a hyperparameter, denoted by H. Accordingly, the models included a total of 4H + 1 weight and bias parameters, which were trained using an error backpropagation algorithm. The objective of the training process was to minimize the root mean squared error, which was calculated as follows:

$$\sqrt {\left( {\frac{1}{N}\sum\nolimits_{i}^{N} {\left( {\widehat{{VC_{i} }} - VC_{i} } \right)^{2} } } \right)} ,$$

(7)

where ${\widehat{VC}}_{i}$ and $V{C}_{i}$ represent the estimated and measured vital capacity from observation i, respectively, and N is the total number of observations.

To determine the optimal number of units in the hidden layer, a nested cross-validation method was utilized²². The outer loop of this method involved training the model based on N − 1 observations and evaluating its accuracy based on the remaining observation. Moreover, the inner loop divided the N − 1 observations into two datasets, with one half of the data used to train the model using different unit numbers (H = 1, 2, 3) and the other half utilized to assess the estimation accuracy. This process was repeated for all possible combinations of training and test sets, and the optimal value of H was selected based on the highest accuracy achieved. The analyses were conducted using IBM Statistical Package for Neural Networks (SPSS) version 26.

Verification of the estimation accuracy

To validate the efficacy of the proposed model, we compared its estimation accuracy with two different methods. First, the VC_LMS and the vital capacity estimated by the proposed model, NNVC_CPS, were compared to verify the combined effectiveness of employing the neural network-based model and the cough peak flow computed via cough sounds. To evaluate the effectiveness of converting the cough sound pressure level to the cough peak flow computed via cough sounds with Eq. (4), the input vector was modified from ${\varvec{I}}={\left[CPS, {VC}_{\mathrm{LMS}}\right]}^{T}$ to ${\varvec{I}}={\left[SPL, {VC}_{\mathrm{LMS}}\right]}^{T}$. Hereafter, the vital capacity estimated based on the SPL and VC_LMS inputs is referred to as NNVC_SPL (see Fig. 1c). The estimation accuracy was evaluated using the mean square error $\frac{1}{N}\sum_{i}^{N}{\left(\widehat{V{C}_{i}}-V{C}_{i}\right)}^{2}$, where ${\widehat{VC}}_{i}$ and $V{C}_{i}$ represent the estimated and measured vital capacity from observation i, respectively, and N is the total number of observations, and the Spearman’s rank correlation coefficient between the measured vital capacity and the estimated vital capacities (VC_LMS, NNVC_SPL, and NNVC_CPS). In addition, the absolute reliability of the model was investigated using regression analysis and the Bland‒Altman analysis method to detect systematic errors, such as fixed and proportional bias^23,24. The Wilcoxon signed-rank test and, the Friedman and Holm tests^25,26 were used for the comparison, and p < 0.05 was considered significant.

Detecting abnormal decreases in vital capacity

The LLN, which can be calculated using the LMS method, represents the lower limit of the normal vital capacity, and a vital capacity less than this limit is diagnosed as respiratory dysfunction. Therefore, we detected abnormal vital capacity when the estimated vital capacities (NNVC_CPS and NNVC_SPL) were less than the LNN and verified the discrimination accuracy using the area under the receiver operating characteristic²⁷ curve (AUC). The AUCs resulting from NNVC_CPS and NNVC_SPL were compared using the DeLong test, where p < 0.05 was considered significant.

Statistical analyses other than the neural network analysis were performed with IBM SPSS version 26 and EZR (Saitama Medical Center, Jichi Medical University, Saitama, Japan)²⁸, which is a graphical user interface for R (the R Foundation for Statistical Computing, Vienna, Austria).

Results

Relationships between vital capacity and the body structure parameters and cough peak flow

Table 2 shows that the partial correlations between the vital capacity and cough peak flow, age, and height were 0.286 (p = 0.038), − 0.718 (p < 0.001), and 0.683 (p < 0.001), respectively. Because there was no significant partial correlation between vital capacity and weight, we excluded this parameter from the input for the neural network-based model for estimating vital capacity.

Table 2 Partial correlation analysis results, n = 56.

Full size table

Vital capacity estimation accuracy

Leave-one-out cross-validation analysis showed that the root mean squared errors of the NNVC_SPL and NNVC_CPS against the measured vital capacity were 0.165 L and 0.112 L, respectively. Figure 2 shows the relationships between the measured vital capacity and the estimated vital capacities, indicating correlation coefficients of 0.924 (p < 0.001) for VC_LMS, 0.909 (p < 0.001) for NNVC_SPL, and 0.944 (p < 0.001) for NNVC_CPS. Figure 3 shows the corresponding Bland‒Altman plots. Neither NNVC_SPL nor NNVC_CPS showed systematic errors, but VC_LMS showed a fixed bias (one sample t test; p < 0.001) and a proportional bias (r = − 0.414; p = 0.002). Furthermore, the Friedman and Holm tests showed significant differences in the squared error between VC_LMS and NNVC_SPL, VC_LMS and NNVC_CPS, NNVC_SPL and NNVC_CPS (median 0.308 L² vs. 0.100 L; p = 0.001, 0.308 L² vs. 0.052 L; p < 0.001, 0.100 L² vs. 0.052 L²; p = 0.037, respectively) (see Fig. 4a). In young participants, the Friedman test showed no significant differences in the squared error (p = 0.198) (see Fig. 4b); however, among elderly participants, the Freidman and Holm tests showed significant differences in the squared error between the VC_LMS and NNVC_SPL, VC_LMS and NNVC_CPS (0.548 L² vs. 0.110 L²; p < 0.001, 0.548 L² vs. 0.034 L²; p < 0.001, respectively) (see Fig. 4c). Figure 5 demonstrates the results of comparing the squared error between generations. The Wilcoxon signed-rank test showed significant differences in the squared error of VC_LMS between young and elderly participants (0.130 L² vs. 0.548 L²; p < 0.001) (see Fig. 5a); however, there were no significant differences in the squared error for NNVC_SPL between generations (see Fig. 5b). Although there was no significant difference in the NNVC_CPS between generations, the squared error among the elderly participants was approximately 40% lower than that of the young participants (see Fig. 5c).

Detection accuracy of abnormal decreases in vital capacity

The DeLong test showed a significant difference in the AUC between the NNVC_SPL and NNVC_CPS (0.578 vs. 0.831; p = 0.002, respectively) (see Fig. 6). The true positive and false negative rates of the NNVC_CPS were 0.731 and 0.269, respectively.

Discussion

This study aimed to develop a simple vital capacity evaluation method. To the best of our knowledge, this was the first study that estimated vital capacity based on cough sound. The proposed method demonstrated that an accurate vital capacity can be estimated for different individuals by using VC_LMS and the cough peak flow computed via cough sounds. In addition, we found that an abnormal decrease in vital capacity, which is associated with respiratory dysfunction, can be detected using the proposed vital capacity estimation method, with an AUC of 0.831.

First, to determine the input to the neural network-based model, the relationships between the vital capacity and different physical attributes and the cough peak flow were analysed using partial correlations. The results showed that cough peak flow, age, and height were significantly correlated with vital capacity. These relationships were consistent with those found in previous studies^14,15. Height and age were used as independent variables for calculating the reference value of the vital capacity (VC_LMS) via the LMS method. The LMS method has the advantage of reflecting age-dependent changes in respiratory function because a nonlinearly smooth fit of the vital capacity over the entire age range can be predicted⁹. Thus, VC_LMS includes information about both age and height. In addition, a previous study reported that vital capacity is related to cough peak flow¹⁵, which can be estimated via the cough sound pressure level, and its estimated value is CPS¹⁷. For these reasons, we hypothesized that vital capacity can be estimated by correcting the VC_LMS value using the cough peak flow computed via cough sounds or the cough sound pressure level. Thus, two neural network-based models were constructed: the first model uses VC_LMS and the cough peak flow as inputs, and the other model uses VC_LMS and the cough sound pressure level as inputs.

The experimental results showed that NNVC_CPS led to the best estimation accuracy among the three methods, and no systematic error was observed (see Fig. 3c). Furthermore, Eq. (4) incorporates an age factor, indicating that an equation that uses the cough peak flow computed via cough sounds as an input is less susceptible to the effects of aging. While the cough peak flow computed via cough sounds and cough sound pressure level are related, as shown in Eq. (4), they differ in that the age factor is included in the formula for computing the cough peak flow computed via cough sounds (Eq. 4) but not in the cough sound pressure level formula (Eq. 3). Previous studies reported that vocal fold function, a crucial factor in coughing, is negatively impacted by aging^29,30. Thus, it is plausible that NNVC_CPS could effectively suppress the effects of aging on vocal cord function and enable the detection of decreased vital capacity. In spirometer measurements, if the difference in the vital capacity between the largest and second largest manoeuvre exceeds 0.150 L, the measurement is considered a failure, and additional trials should be performed²¹. In this study, the mean relative difference between the measured vital capacity and NNVC_CPS was 0.008 L ml (95% CI − 0.082 to 0.098), which is lower than the standard value for additional trials. A recent investigation employing dynamic chest radiography estimated the forced vital capacity, yielding a correlation coefficient of 0.86 (95% CI 0.79 to 0.90) between the measured and estimated values³¹. Similarly, a recent study examining forced vital capacity estimation via vocal analysis in patients with amyotrophic lateral sclerosis reported a correlation coefficient of 0.8, with a mean absolute error of 0.54 L³². Our study focused on measuring slow vital capacity, which involves slow expiration, while previous studies measured forced vital capacity, which involves fast expiration with effort. However, despite the differences in the measurement methods and participant attributes, the estimation accuracy of our proposed model is expected to be better than or at least equivalent to that of methods proposed in prior investigations. Therefore, the proposed method could have sufficient accuracy and be useful in screening tests.

We also attempted to detect abnormal decreases in vital capacity using the estimated NNVC_CPS. The efficacy of this approach was confirmed, with a high AUC of 0.831 (see Fig. 6). In a previous study that discriminates restrictive impairment of lung function, spirometry values were used to calculate the difference between lung age and actual age. This method showed an AUC of 0.891³³, which is slightly higher than the proposed model. Nonetheless, the proposed model significantly outperforms previous studies in terms of ease of measurement.

It should be noted that the false-negative rate was high, and there was a possibility of missing respiratory function decline. This suggests the effect of cases with loud cough sounds but low lung capacity. Respiratory muscle strength and cough strength have been shown to be positively correlated^15,34. Thus, respiratory muscle strength could affect the cough sound pressure level, which was used to calculate the estimated cough peak flow computed via cough sounds. The estimation accuracy could be improved further to reduce the false-negative rate, such as by adding variables related to respiratory muscle strength to the input layer in the neural network.

The findings of this study suggest that the vital capacity of individual participants can be estimated with a neural network analysis approach using VC_LMS and the cough peak flow computed via cough sounds as inputs. Unlike linear models, the neural network-based model can handle nonlinear changes and was suggested to be promising for respiratory monitoring in a previous study³⁵. However, the participants of this study were limited to young and elderly people without underlying diseases based on self-report. The cough peak flow computed via cough sounds used in this study, which reflects cough force, is calculated based on cough sounds. Because the cough sound may be affected by the accumulation of secretions such as sputum, narrowing of the airway due to some diseases, or inadequate closure of the glottis, it is unclear to what extent the accuracy of vital capacity estimation may be affected. Therefore, it is necessary to clarify the effects of secretions and diseases on the accuracy of vital capacity estimation in future studies. In addition, although this study estimated only vital capacity, it is necessary to estimate measures that reflect obstructive ventilation disorders, such as forced vital capacity, one-second volume, and peak flow, to construct a more comprehensive respiratory function estimation system. Moreover, to apply the proposed method to patients in home environments, it would be more convenient to measure cough sounds with a smartphone. However, it has been found that the measurement accuracy of smartphones is lower than that of in-ear microphones¹⁷. Thus, additional studies are needed to implement the proposed method for estimating vital capacity on smartphones. In addition, it is essential to improve the proposed method in the future so that the error between the measured vital capacity and the estimated vital capacity is minimized; then, the same cut-off reference value could be applied. Nonetheless, it should be noted that the proposed method is presented as a screening method, and a conclusive diagnosis must be based on a thorough examination at a medical institution.

Data availability

The data that support the findings of this study are available in the main text and from the corresponding authors upon reasonable request.

References

Birnkrant, D. J. et al. Diagnosis and management of Duchenne muscular dystrophy, part 2: Respiratory, cardiac, bone health, and orthopaedic management. Lancet Neurol. 17, 347–361. https://doi.org/10.1016/S1474-4422(18)30025-5 (2018).
Article PubMed PubMed Central Google Scholar
Pedone, C. et al. Prognostic significance of surrogate measures for forced vital capacity in an elderly population. J. Am. Med. Dir. Assoc. 11, 598–604. https://doi.org/10.1016/j.jamda.2009.12.003 (2010).
Article PubMed Google Scholar
Gerke, S., Shachar, C., Chai, P. R. & Cohen, I. G. Regulatory, safety, and privacy concerns of home monitoring technologies during COVID-19. Nat. Med. 26, 1176–1182. https://doi.org/10.1038/s41591-020-0994-1 (2020).
Article CAS PubMed PubMed Central Google Scholar
Nakshbandi, G., Moor, C. C. & Wijsenbeek, M. S. Home monitoring for patients with ILD and the COVID-19 pandemic. Lancet Respir. Med. 8, 1172–1174. https://doi.org/10.1016/S2213-2600(20)30452-5 (2020).
Article CAS PubMed PubMed Central Google Scholar
Jankowski, P. et al. The use of a mobile spirometry with a feedback quality assessment in primary care setting—A nationwide cross-sectional feasibility study. Respir Med 184, 106472. https://doi.org/10.1016/j.rmed.2021.106472 (2021).
Article PubMed Google Scholar
Trivedy, S., Goyal, M., Mohapatra, P. R. & Mukherjee, A. Design and development of smartphone-enabled spirometer with a disease classification system using convolutional neural network. IEEE Trans. Instrum. Meas. 69, 7125–7135. https://doi.org/10.1109/TIM.2020.2977793 (2020).
Article ADS Google Scholar
Baldwin, E. D., Cournand, A. & Richards, D. W. Jr. Pulmonary insufficiency; physiological classification, clinical methods of analysis, standard values in normal subjects. Medicine 27, 243–278 (1948).
Article CAS PubMed Google Scholar
World Health Organization. WHO Child Growth Standards: Length Height-for-Age, Weight-for-Age, Weight-for-Length, Weight-for-Height and Body Mass Index-for-Age: Methods and Development (World Health Organization, 2006).
Google Scholar
Quanjer, P. H. et al. Multi-ethnic reference values for spirometry for the 3–95-yr age range: The global lung function 2012 equations. Eur. Respir. J. 40, 1324–1343. https://doi.org/10.1183/09031936.00080312 (2012).
Article PubMed PubMed Central Google Scholar
European Respiratory, S. The Global Lung Function Initiative https://www.ersnet.org/science-and-research/ongoing-clinical-research-collaborations/the-global-lung-function-initiative/ (2012).
Jo, B. S. et al. Reference values for spirometry derived using lambda, mu, sigma (LMS) method in Korean adults: In comparison with previous references. J. Korean Med. Sci. 33, e16. https://doi.org/10.3346/jkms.2018.33.e16 (2018).
Article PubMed Google Scholar
Kubota, M. et al. Reference values for spirometry, including vital capacity, in Japanese adults calculated with the LMS method and compared with previous values. Respir. Investig. 52, 242–250. https://doi.org/10.1016/j.resinv.2014.03.003 (2014).
Article PubMed Google Scholar
Stanojevic, S. et al. Reference ranges for spirometry across all ages: a new approach. Am. J. Respir. Crit. Care Med. 177, 253–260. https://doi.org/10.1164/rccm.200708-1248OC (2008).
Article PubMed Google Scholar
Kang, S. W. & Bach, J. R. Maximum insufflation capacity: Vital capacity and cough flows in neuromuscular disease. Am. J. Phys. Med. Rehabil. 79, 222–227. https://doi.org/10.1097/00002060-200005000-00002 (2000).
Article CAS PubMed Google Scholar
Kang, S. W. et al. Relationship between inspiratory muscle strength and cough capacity in cervical spinal cord injured patients. Spinal Cord 44, 242–248. https://doi.org/10.1038/sj.sc.3101835 (2006).
Article ADS CAS PubMed Google Scholar
Umayahara, Y. et al. in 2017 IEEE/SICE International Symposium on System Integration (SII). 936–941.
Umayahara, Y. et al. Estimation of cough peak flow using cough sounds. Sensors (Basel) 18, 2381. https://doi.org/10.3390/s18072381 (2018).
Article ADS PubMed Google Scholar
Umayahara, Y. et al. A mobile cough strength evaluation device using cough sounds. Sensors (Basel) 18, 3810. https://doi.org/10.3390/s18113810 (2018).
Article ADS PubMed Google Scholar
Hornik, K., Stinchcombe, M. & White, H. Multilayer feedforward networks are universal approximators. Neural Netw. 2, 359–366. https://doi.org/10.1016/0893-6080(89)90020-8 (1989).
Article MATH Google Scholar
Folstein, M. F., Folstein, S. E. & McHugh, P. R. Mini-mental state. A practical method for grading the cognitive state of patients for the clinician. J. Psychiatr. Res. 12, 189–198. https://doi.org/10.1016/0022-3956(75)90026-6 (1975).
Article CAS PubMed Google Scholar
Miller, M. R. et al. Standardisation of spirometry. Eur. Respir. J. 26, 319–338. https://doi.org/10.1183/09031936.05.00034805 (2005).
Article CAS PubMed Google Scholar
Cawley, G. C. & Talbot, N. L. On over-fitting in model selection and subsequent selection bias in performance evaluation. J. Mach. Learn. Res. 11, 2079–2107 (2010).
MathSciNet MATH Google Scholar
Bland, J. M. & Altman, D. G. Statistical methods for assessing agreement between two methods of clinical measurement. Lancet (London, England) 1, 307–310 (1986).
Article CAS PubMed Google Scholar
Bland, J. M. & Altman, D. G. Comparing methods of measurement: Why plotting difference against standard method is misleading. Lancet (London, England) 346, 1085–1087. https://doi.org/10.1016/s0140-6736(95)91748-9 (1995).
Article CAS PubMed Google Scholar
Holm, S. A simple sequentially rejective multiple test procedure. Scand. J. Stat. 6, 65–70 (1979).
MathSciNet MATH Google Scholar
Chan, A. O. et al. Prevalence of colorectal neoplasm among patients with newly diagnosed coronary artery disease. JAMA 298, 1412–1419. https://doi.org/10.1001/jama.298.12.1412 (2007).
Article CAS PubMed Google Scholar
van Erkel, A. R. & Pattynama, P. M. Receiver operating characteristic (ROC) analysis: Basic principles and applications in radiology. Eur. J. Radiol. 27, 88–94. https://doi.org/10.1016/s0720-048x(97)00157-5 (1998).
Article PubMed Google Scholar
Kanda, Y. Investigation of the freely available easy-to-use software ‘EZR’ for medical statcistics. Bone Marrow Transpl. 48, 452–458. https://doi.org/10.1038/bmt.2012.244 (2013).
Article CAS Google Scholar
Goy, H., Fernandes, D. N., Pichora-Fuller, M. K. & van Lieshout, P. Normative voice data for younger and older adults. J. Voice Off. J. Voice Found. 27, 545–555. https://doi.org/10.1016/j.jvoice.2013.03.002 (2013).
Article Google Scholar
Nishio, M. & Niimi, S. Changes in speaking fundamental frequency characteristics with aging. Jpn. J. Logop. Phoniatr. 46, 136–144. https://doi.org/10.5112/jjlp.46.136 (2005).
Article Google Scholar
Ueyama, M. et al. Prediction of forced vital capacity with dynamic chest radiography in interstitial lung disease. Eur. J. Radiol. 142, 109866. https://doi.org/10.1016/j.ejrad.2021.109866 (2021).
Article PubMed Google Scholar
Stegmann, G. M. et al. Estimation of forced vital capacity using speech acoustics in patients with ALS. Amyotroph. Lateral Scler. Front. Degener. 22, 14–21. https://doi.org/10.1080/21678421.2020.1866013 (2021).
Article Google Scholar
Toda, R. et al. Validation of “lung age” measured by spirometry and handy electronic FEV1/FEV6 meter in pulmonary diseases. Intern. Med. 48, 513–521. https://doi.org/10.2169/internalmedicine.48.1781 (2009).
Article PubMed Google Scholar
Bahat, G. et al. Relation between hand grip strength, respiratory muscle strength and spirometric measures in male nursing home residents. Aging Male Off. J. Int. Soc. Study Aging Male 17, 136–140. https://doi.org/10.3109/13685538.2014.936001 (2014).
Article Google Scholar
Blanco-Almazan, D., Groenendaal, W., Catthoor, F. & Jane, R. Chest movement and respiratory volume both contribute to thoracic bioimpedance during loaded breathing. Sci. Rep. 9, 20232. https://doi.org/10.1038/s41598-019-56588-4 (2019).
Article ADS CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We would like to thank the volunteers who participated in our study. This work was supported by JSPS KAKENHI, Grant Numbers JP16K16475 and JP19K06878.

Author information

Authors and Affiliations

Graduate School of Health Sciences, Hiroshima Cosmopolitan University, 3-2-1 Otsukahigashi, Asaminami-ku, Hiroshima, Japan
Yasutaka Umayahara, Takeshi Imura & Akira Otsuka
Graduate School of Advanced Science and Engineering, Hiroshima University, 1-4-1 Kagamiyama, Higashi-Hiroshima, Hiroshima, Japan
Zu Soh, Akira Furui & Toshio Tsuji
Graduate School of Biomedical and Health Sciences, Hiroshima University, 1-2-3 Kasumi, Minami-ku, Hiroshima, Japan
Kiyokazu Sekikawa

Authors

Yasutaka Umayahara
View author publications
You can also search for this author in PubMed Google Scholar
Zu Soh
View author publications
You can also search for this author in PubMed Google Scholar
Akira Furui
View author publications
You can also search for this author in PubMed Google Scholar
Kiyokazu Sekikawa
View author publications
You can also search for this author in PubMed Google Scholar
Takeshi Imura
View author publications
You can also search for this author in PubMed Google Scholar
Akira Otsuka
View author publications
You can also search for this author in PubMed Google Scholar
Toshio Tsuji
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

All authors contributed to the concept and design of the study; Y.U. and T.T. conceived the idea of the study; Y.U., A.O. and K.S. performed the experiments and data acquisition; Y.U. and T.I. analysed the data; Y.U. prepared the figures and the first manuscript draft; Y.U., T.T., Z.S. and A.F. developed the statistical analysis plan; Y.U. and T.I. conducted statistical analyses. T.T., Z.S., A.F. and K.S. contributed to the interpretation of the results. Y.U. drafted the original manuscript. T.T. supervised the study. All authors reviewed the manuscript draft and revised it critically for intellectual content. All authors approved the final version of the manuscript to be published.

Corresponding authors

Correspondence to Yasutaka Umayahara or Toshio Tsuji.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Umayahara, Y., Soh, Z., Furui, A. et al. Cough sound-based estimation of vital capacity via cough peak flow using artificial neural network analysis. Sci Rep 13, 8461 (2023). https://doi.org/10.1038/s41598-023-35544-3

Download citation

Received: 28 December 2022
Accepted: 19 May 2023
Published: 25 May 2023
DOI: https://doi.org/10.1038/s41598-023-35544-3

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.