Cancer classification using machine learning and HRV analysis: preliminary evidence from a pilot study

Vigier, Marta; Vigier, Benjamin; Andritsch, Elisabeth; Schwerdtfeger, Andreas R.

doi:10.1038/s41598-021-01779-1

Download PDF

Article
Open access
Published: 16 November 2021

Cancer classification using machine learning and HRV analysis: preliminary evidence from a pilot study

Marta Vigier ORCID: orcid.org/0000-0002-2307-4411^1,2,
Benjamin Vigier³,
Elisabeth Andritsch¹ &
…
Andreas R. Schwerdtfeger^2,4

Scientific Reports volume 11, Article number: 22292 (2021) Cite this article

3528 Accesses
6 Citations
5 Altmetric
Metrics details

Subjects

Abstract

Most cancer patients exhibit autonomic dysfunction with attenuated heart rate variability (HRV) levels compared to healthy controls. This research aimed to create and evaluate a machine learning (ML) model enabling discrimination between cancer patients and healthy controls based on 5-min-ECG recordings. We selected 12 HRV features based on previous research and compared the results between cancer patients and healthy individuals using Wilcoxon sum-rank test. Recursive Feature Elimination (RFE) identified the top five features, averaged over 5 min and employed them as input to three different ML. Next, we created an ensemble model based on a stacking method that aggregated the predictions from all three base classifiers. All HRV features were significantly different between the two groups. SDNN, RMSSD, pNN50%, HRV triangular index, and SD1 were selected by RFE and used as an input to three different ML. All three base-classifiers performed above chance level, RF being the most efficient with a testing accuracy of 83%. The ensemble model showed a classification accuracy of 86% and an AUC of 0.95. The results obtained by ML algorithms suggest HRV parameters could be a reliable input for differentiating between cancer patients and healthy controls. Results should be interpreted in light of some limitations that call for replication studies with larger sample sizes.

Self-supervised learning for human activity recognition using 700,000 person-days of wearable data

Article Open access 12 April 2024

Hang Yuan, Shing Chan, … Aiden Doherty

An overview of clinical decision support systems: benefits, risks, and strategies for success

Article Open access 06 February 2020

Reed T. Sutton, David Pincock, … Karen I. Kroeker

A systems identification approach using Bayes factors to deconstruct the brain bases of emotion regulation

Article 22 March 2024

Ke Bo, Thomas E. Kraynak, … Tor D. Wager

Introduction

Worldwide, cancer still is the second most prevalent cause of mortality¹. Several studies have shown that three basic biological mechanisms are involved in tumourigenesis: oxidative stress, inflammation and excessive sympathetic activity^2,3,4. Oxidative stress leads to both DNA damage, the primary cause of tumourigenesis, and uncontrolled cell proliferation⁵. Inflammation enhances cancer cell resistance to stress and apoptosis. Furthermore, inflammation contributes to angiogenesis and metastasis and promotes tumourigenesis in the early stages of oncogenesis^6,7⁠ and disease progression in its later stages⁸. Additionally, the inflammatory microenvironment, which plays a role in fighting and eliminating tumours, may also facilitate tumour growth and the production of free radicals to further induce oxidative stress. Finally, metastasis development is under the control of the sympathetic nervous system by stimulating cancer cell migratory capacity⁸.

One common factor influencing all three mechanisms is the vagus nerve as a major constituent of the parasympathetic nervous system, indexed by heart rate variability (HRV). Several studies indicate a bidirectional link between the vagus nerve and cancer. For example, it has been reported that the vagus nerve may exercise a neuromodulatory influence on cancer by slowing tumour development and progression^5,9. Specifically, these authors inferred that vagal influences might reduce oxidative stress, modulate inflammation, and inhibit sympathetic activity. Notably, the information about tumourigenic activity related to tumour-associated proinflammatory cytokines is transferred to the brain by the vagus nerve^10,11. Studies involving patients with vagotomy confirmed the role of the vagus nerve in cancer onset, showing an increased risk of developing lung or colorectal cancer after the surgery^12,13. On the other hand, Strous et al.¹⁴⁠ suggested that the same cancer-related mechanisms accompanying the development and progression of a malignant tumour may cause vagal dysfunction and decreased HRV. Although the origin of the relationship between the vagus nerve and cancer is unclear, the lower HRV was unanimously reported in early and advanced cancer patients compared to healthy individuals^15,16.

Vagus nerve activity can be quantified by recording an electrocardiogram (ECG) and analysing beat-to-beat fluctuations in heart rate. HRV parameters can be described by linear (time- and frequency-domain) and non-linear measures¹⁷.

Time-domain measures of HRV quantify the amount of variance in the RR-intervals, which represents the period between successive heartbeats. Time-domain statistics are analysed through parameters such as the standard deviation of RR-intervals in a defined time period, the root mean square of successive differences (RMSSD) indicating short-term fluctuations in successive RR-intervals, the number or proportion of different pairs of successive RR-intervals that differ by more than a fixed time interval¹⁸.

Frequency-domain measures quantify the distribution of absolute or relative power into four frequency bands established by the Task Force of the European Society of Cardiology and the North American Society of Pacing and Electrophysiology (1996): ultra-low-frequency (ULF ≤ 0.003 Hz), very-low-frequency (VLF: 0.0033–0.04 Hz), low-frequency (LF: 0.04–0.15 Hz), and high-frequency (HF: 0.15–0.4)¹⁹. Frequency-domain measures are analysed by power spectral density computation using several parametric or nonparametric methods in different frequency bands of interest²⁰.

Finally, non-linear measures of HRV quantify the unpredictability of fluctuations in a time series, as the HRV signals are non-linear and non-stationary by nature¹⁸. Moreover, HRV parameters, when examined using chaos theory and non-linear system theory, suggest the non-linear mode of interaction between the mechanisms involved in cardiovascular regulation¹⁸.

The differences between healthy individuals and cancer patients in time and frequency domain measures of HRV have often been reported, ultimately showing a vagal impairment in the latter group^16,21. A lower HRV in cancer patients indicates autonomic dysfunction, which most cases exhibit^16,21,22. This cancer-related alteration is characterised by a sympathovagal imbalance with highly active SNS and impaired PNS functioning²³. Although not often examined in cancer, the changes in non-linear HRV measures were reported as early signs of several diseases²⁴. Therefore, we explored the combination of several linear and non-linear HRV features to classify cancer vs non-cancer.

Machine Learning (ML) established an essential role in healthcare and medical research^25,26. In cardiovascular research, ML has been successfully used in automated ECG analysis for arrhythmia detection and classification, ischemia detection, left and right ventricular hypertrophy, bilateral ventricular hypertrophy, and diabetes^27,28,29,30. Several studies classified disorder-affected individuals vs healthy controls based on HRV and machine learning algorithms. For example, Aggarwal et al.³¹ classified healthy vs diabetic rats based on diabetes-related changes in HRV, using an artificial neural network (ANN) and support vector machine (SVM). Other researchers created a neural network that automatically classified diabetic and healthy individuals based on disease-related HRV alterations³².

To the best of our knowledge, only two studies applied machine learning to predict or classify cancer based on HRV analysis. Shukla and Aggrawal³³ extracted HRV indices from ECG recordings of 104 lung cancer patients and 30 healthy individuals. The authors found reduced HRV in cancer patients compared to healthy people. Further, the authors reported that the decrease in HRV levels was related to the severity of the disease. They predicted and classified lung cancer stages using ANN and SVM with 93.09% and 100% accuracy. In a recent study, Shukla, and Aggrawal³⁴ analysed the 5-min electrocardiogram of 114 breast cancer patients and 13 age-matched healthy individuals. The authors used a Lavenberg–Marquardt algorithm-based artificial neural network and a support vector machine that classified two groups based on spectral HRV features with a maximum accuracy of 54.2% and 100%, respectively.

This research proposes a new methodology in cancer classification based on HRV and ML, which is different from the methods discussed above. To classify cancer vs healthy individuals, we used an ensemble model with the stacking method. Specifically, we aimed to examine the capacity of three different ML algorithms to recognise patterns in HRV to classify ECGs. Next, we aggregated the predictions from these algorithms to build a meta-classifier to improve the robustness and classification accuracy. In addition, we applied the Recursive Feature Elimination method to select the most relevant HRV features. We also addressed the issue of class imbalance.

We recruited patients with tumours most consistently related to vagal dysfunction (i.e., breast cancer, prostate cancer, colorectal cancer, lung cancer, and pancreatic cancer) and within different stages of cancer development. Our sample contains five of the six most common types of cancer in Austria³⁵. Thus, patients suffering from these types of cancer are often present in general oncology units, similar to where we collected our data. Importantly, research reports reduced HRV and a decreased survival rate in different types and stages of disease^16,36,37. Also, no clear agreement exists among researchers regarding the direction of influence between HRV and cancer and the timeline of observed cancer-related changes in HRV. Therefore, we included patients from various cancer stages to account for the possible differences in patient’s autonomic activity.

HRV measures were recorded from cancer patients and healthy individuals. A subset of time, frequency domain and non-linear features averaged over 5 min was used as input to the machine learning models created to distinguish cancer patients from healthy controls based on HRV analysis.

In particular, we investigated the performance of three ML classifiers: Random Forest (RF), Linear Discriminant Analysis (LDA), and Naive Bayes (NB). Next, we created one ensemble model based on the combination of those models using the stacking method and eXtreme Gradient Boosting (XGB) from the Caret package³⁸ (Version 6.0.88) in R³⁹ (Version 4.1.0).

Our study makes a new contribution to the existing research. Its foremost advantage is to employ an ensemble stacking algorithm that allows to reduce the uncertainties of predictions and to improve the robustness of classification.

Methods

Participants

The data set comprised two cohorts, 77 cancer patients (see Table 1) and 57 healthy controls. ECG, age, gender, and medical variables were recorded. The cancer group was age and sex-matched with the control group. Exclusion criteria included diabetes, cardiovascular pathologies, pregnancy, and psychiatric disorders. Both cancer patients and healthy individuals were excluded from the study if they were suffering from or taking any medication related to psychiatric or cardiac conditions, diabetes, and pregnancy. The average age of participants was 50 years old, and around 60% of the participants were women. Most cancer patients (42.86%) were diagnosed with breast cancer, 2.6% with prostate cancer, 3.9% with lung cancer, 37.67% with colorectal cancer, and 12.97% with pancreatic cancer.

Table 1 Cancer patients characteristics.

Full size table

Thirty-two patients were diagnosed with the first cancer stage, seven with the second, twelve with the third, and twenty-six with the fourth stage.

When we recorded the ECGs, none of the patients were receiving any cancer treatment. Moreover, we asked participants to avoid caffeine, nicotine and alcohol consumption and refrain from physical exercise in the two hours preceding the ECG recording.

The ethics committee of the Medical University of Graz approved all the study protocols. The participants gave written informed consent before taking part in the study. The procedures used in this study adhere to the tenets of the Declaration of Helsinki.

HRV analysis and feature selection

An ECG was recorded in a sitting position for 5 min, applying a chest lead.

In cancer patients, an ECG Holter monitoring was applied (Schiller Holter MedilogAR). Three Ag/AgCl electrodes were placed on the distal end of the right clavicle, lower left rib cage chest, and lower abdomen.

Control group recordings were obtained using eMotion Faros 180°, Mega Electronics Ltd. Both devices are medically certified, highly accurate, and widely used in research.

For both groups, the signal was sampled at a rate of 1,000 Hz. HRV was analysed offline via Kubios HRV Premium software (version 3.3.1)⁴⁰ following International Guidelines¹⁹. The software is device-independent, ensuring similar treatment of ECG signal and artifacts⁴¹. “Before R-wave time instant extraction, the R-wave is interpolated at 2000 Hz to improve the time resolution of the detection”⁴¹.

Visual artefact correction was performed on the raw IBI series, and if needed, an automatic correction algorithm was applied. 6.7% of the recordings were affected by artefacts. According to Kubios guidelines⁴⁰, we included no recording containing 5% or more artefacts in the analysis. Time-domain HRV features were calculated directly from the time series of RRIs. Frequency-domain analysis was fast Fourier transform (FFT) based, and the non-linear parameters were computed from the detrended RR interval data.

We decided to select twelve HRV features for two reasons. First, several studies reported impaired autonomic activity and significantly decreased vagal functioning based on some time and frequency HRV features in cancer patients^36,37,42. As a consequence, it seemed important to focus on the HRV features most consistently associated with cancer-related alternations. Second, disease-related changes in non-linear HRV indices were used to detect some disorders at their early stages^{43,44,45,46,47}. Hence, they might improve cancer classification in the early stages. Based on prior knowledge and clinical plausibility, we selected twelve HRV features as described in Table 2. We compared the healthy sample with cancer patients by applying the Wilcoxon sum-rank test.

Table 2 HRV features.

Full size table

Next, we used the Recursive Feature Elimination (RFE) method to obtain optimal performance for the classifiers and chose the best combination of features from our prior knowledge-based choice. RFE is a wrapper-type feature selection algorithm, that applies a backward selection process to find the most advantageous sequence of features. The first step for RFE is to construct a model based on all features and estimate the importance of each feature in the model. Then, it rank-orders the features and eliminates those with the lowest importance iteratively based on model evaluation metrics⁴⁸. In this research, we used the random forest algorithm wrapped by RFE and applied it to select HRV features.

Model development

Model development was carried out with five HRV features selected by RFE and the outcome was a dichotomous variable with two levels: patient and control. We standardised the data using the following options from Caret’s pre-process function: BoxCox, center and scale. The BoxCox method allows for correcting the skewness of the data. Centering involves subtracting mean from values, while scaling enables to divide values by standard deviation.

Our model development consisted of two steps. First, we chose three base models that incorporated various underlying algorithms. Second, we created an ensemble based on the stacking method to improve the accuracy and robustness of classification. Stacking is a well-established and powerful ensemble machine learning technique. In the stacked ensemble model, a meta-classifier is trained using the predictions of the base classifiers to make an optimal combination of the predictions. The stacking method integrates base learners' strengths to attain more robust performance, reduce estimation uncertainties and improve prediction accuracy⁴⁹.

In the present research, HRV features were supplied into the base models, producing the predictions used by the meta-classifier, which classified the input data into “cancer “or “patient” categories.

We applied the three following machine learning algorithms to classify cancer and healthy individuals: Linear Discriminant Analysis (LDA), Naïve Bayes (Nb), and Random Forest (RF). All the ML methods are included in the Caret R package.

LDA is a classification method that searches for a linear combination of variables that best divides two classes. Nb classifier is a probabilistic machine learning model based on the Bayes theorem used for the classification task. RF is a machine learning classification algorithm that builds a decision tree model⁴⁸.

We implemented tenfold cross-validation with five repeats to assess three diverse classification models available in the Caret package using 60% of the dataset for training. This method randomly splits the dataset into ten segments where one segment represents the validation set, and the remaining nine parts are employed to build the training set. This operation is repeated ten times. Each time one part is removed, and thus we obtain a different part of the data for validation. The average result of the ten parts consists of the final prediction result⁴⁸. We used 40% of the entire dataset as an independent testing set for the final model evaluation.

The training set contained 35 control and 47 cancer individuals. To deal with the class imbalance, we employed an up-sampling technique offered by the Caret package. This method randomly replicates the instances in the minority class by sampling with replacement to have the same size.

We used the default tuning hyper-parameter optimisation approach in the R package Caret, a grid search assessing three different sets of values for the hyper-parameters and selecting the best performing values for the final model.

To evaluate if any improvement in the performance of base-classifiers was possible, we created an ensemble model based on the stacking method. The stacked meta-model comprised the predictions of all three base classifiers. We applied Extreme Gradient Boosting as an ensemble algorithm to create the stacked model.

We used the following R packages to build the stacked ensemble model: Caret (Version 6.0.8)³⁸ and caretEnsemble⁵⁰ (Version 2.0.1).

We evaluated the models’ performance using metrics including accuracy, sensitivity (true-positive rate), specificity (true negative rate), Cohen’s Kappa coefficient and ROC. A confusion matrix was created for the stacked ensemble model.

Ethics approval

Approval was obtained from the ethics committee of Medical University of Graz. The procedures used in this study adhere to the tenets of the Declaration of Helsinki.

Results

HRV analysis

HRV variables were compared between cancer and healthy individuals using Wilcoxon sum-rank test. We observed significantly lower values (and for most of the measures, considerably lower SDs) in cancer patients as compared to healthy controls. The results for both groups are summarised in Table 3.

Table 3 HRV features comparison between cancer and control individuals.

Full size table

Feature selection

We ran the recursive feature elimination algorithm using 12 prior knowledge-based selected features and individuals' health status (healthy/cancer) as an outcome variable. The five most important features were selected and all other measures were eliminated.

Table 4 shows the results of RFE with the five most important features. Figure 1 represents density plots for the five top HRV features selected by RFE. None of these features shows a significant overlap suggesting that all might constitute relevant input for classification.

Table 4 Results of recursive features elimination algorithm applied to 12 prior knowledge-based selected HRV features.

Full size table

Model performance

Performance of base-classifiers

The results shown in Table 5, Figs. 2 and 3 indicate the performance metrics of three base classifiers on the training set with ten-fold cross-validation. They suggest that the algorithm of Naive Bayes performed the worst on the classification task, showing an accuracy of 79%. The algorithm of Linear Discriminant Analysis performed slightly better, with an accuracy of 80%. The algorithm of random forest performed best, with 85% of accuracy.

Table 5 Prediction performance of base classifiers and stacked ensemble.

Full size table

The correlation between predictions of models was moderate, ranging from 0.35 to 0.51 (see Fig. 4). Thus suggesting that every classifier might perform better at learning different data patterns. In addition, all three base classifiers displayed promising performances while classifying cancer vs healthy individuals based on HRV analysis. Thus, we integrated all models to build a meta-classifier and analyse if the classification accuracy improved.

Performance of meta-classifier

In a final step, after training and evaluating models created using the individual ML algorithms, we devised a stacked model aggregating the predictions of the base classifiers. This meta-model was trained using Extreme Gradient Boosting. The stacked model produced an accuracy of 0.929 and Kappa of 0.859 with ROC = 0.956, sensitivity of 0.846, and specificity of 0.924, thus outperforming all base learners (see Table 5). On the test set (unseen data), the model had an accuracy of 0.865 and a Kappa of 0.719 (see Fig. 5), sensitivity of 0.773 and specificity of 0.933.

The confusion matrix for the meta-classifier presents the performance of this model on the testing set (Fig. 5). The outcome variable has two values (patient and control). The column with control values shows that 17 control data points were correctly classified by the model, whereas two were incorrectly classified as belonging to the opposite class (patients). In the case of patients, the model correctly recognised 28 cases and wrongly assigned five patients to the control class.

The ensemble model achieved an area under the receiver operating characteristics (ROC) curve (AUC) of 0.945 (95% CI 0.8916–0.9993), see Fig. 6.

Discussion

This pilot study evaluated the possibility of machine learning-based discrimination between cancer patients and healthy controls based on five-minute-ECG recordings. In the first step, comparing HRV parameters (linear and non-linear features) between cancer patients and healthy controls suggested significant differences in all HRV parameters constituting an input for the machine learning in this research. These findings align with several studies^16,21 documenting decreased levels of HRV in cancer patients compared to healthy populations. Specifically, a systematic review²² including 19 studies conducted in 2018 concludes that decreased HRV illustrates a disease-related autonomic dysfunction in cancer patients.

Importantly, the performance of three well-established machine learning algorithms achieved satisfactory results ranging from 79 to 85% accuracy. The RF performed best compared to the other ML algorithms in this study. The differences in accuracy between the base classifiers were, however, modest. There was a slight variation in accuracy between the best (RF) and worst (NB) performing algorithms (0.059).

To improve the robustness and accuracy of the classification model, we created a stacked learner that included the predictions of all three individual ML models. The accuracy of the stacked model was 8% higher compared to the RF model. Therefore, we conclude that in this study, a slight improvement in accuracy was achieved by stacking multiple classification models.

On the testing set, the meta-learner's performance was still satisfactory. However, the model’s sensitivity was pretty moderate (77%). Thus, one should exert caution while excluding cancer based on this classification model. At the same time, the meta-classifier performed very well in terms of specificity (93%). It could be, therefore, speculated that our model might be accurate while used to confirm a suspected cancer diagnosis. The model’s high specificity might imply low chances of getting positive results in non-cancer individuals. Considering this model’s moderate sensitivity, it might be used as a complementary tool in oncology and applied when, based on other tests, there is a suspicion of cancer.

Both sensitivity and specificity of the meta-classifier are greater than 70%, which seems to be a reasonable trade-off⁵¹. Nevertheless, at this stage of the research, it is uncertain if the results might be generalised for different algorithms beyond the ML used in this analysis. Our meta-classifier did not notably outperform the results from the best base classifier. Notwithstanding, using an ensemble model based on different ML algorithms may prevent reduced performance and prediction uncertainty.

Despite the moderate improvement in classification accuracy of the ensemble model, it should be noted that the performance of our model proved satisfactory as compared with previous research, showing 86% accuracy, 93% specificity and 77% sensitivity while classifying cancer vs healthy individuals on unseen data.

Only two studies employed machine learning and HRV analysis to classify healthy individuals vs cancer patients. Shukla and Aggrawal³³ predicted and classified lung cancer stages using ANN and SVM with 93.09% and 100% accuracy. The same authors³⁴ applied a Lavenberg–Marquardt algorithm-based artificial neural network (ANN) and support vector machine (SVM) to classify cancer vs healthy individuals based on spectral features of HRV, with maximum accuracy of 54.2% and 100%, respectively.

Some differences between these studies and the present work are worth mentioning. First, the studies are different in terms of statistical analysis. We created a stacked ensemble model while the authors of the research mentioned above applied single ML algorithms.

Moreover, our study did not focus on classifying between different cancer stages. Further, we decided to recruit patients with different cancer types most consistently related to vagal dysfunction (i.e., breast cancer, prostate cancer, colorectal cancer, lung cancer, and pancreatic cancer).

In contrast, previous research selected one type of cancer. Although similar in sample size with our research, both studies applied ML on highly imbalanced data without accounting for it. Finally, we applied the Recursive Feature Elimination method of feature selection.

Therefore, the findings of this pilot study could constitute a preliminary framework for developing cancer classifications techniques based on HRV analysis and ML. It should be noted, though, that further research is necessary to evaluate the algorithm’s sensitivity for different cancer stages. In this study, most of the patients were ascribed to stage 1 and stage 4, respectively. Thus, more research, including patients with different cancer stages, is needed.

While the findings of this early pilot study illustrate a satisfactory performance of machine learning algorithms to classify cancer vs healthy individuals from HRV measures, several limitations should be noted.

First, the sample size was relatively small due to clinical routines and the time needed for acquiring HRV data, and the recording time was not standardised. The ECG was recorded at the same location for each patient cohort; however, the measurements took place at a different time of the day for all the participants. Hence, circadian rhythms could have increased variance in the measures. Future research should aim to standardise the time of recordings to control for circadian rhythms. Also, sleep alterations can cause a change in HRV features⁵². Thus, future research should evaluate the cancer patients' sleep patterns via standard questionnaires.

Second, we used two different devices to record cancer patients’ and healthy individuals’ ECG, which might have introduced some differences. However, in both devices, the signal was sampled at the same rate, avoiding differences in the accuracy of HRV estimation. Both devices are medically certified, highly accurate, and widely used in research. Moreover, we based our analysis on HRV values calculated in and exported from Kubios software, assuring similar treatment of ECG signal and artifacts⁴¹.

Third, in this study, we focused on certain cancers most consistently associated with HRV dysfunction^2,42,53,54. Future studies should aim for larger and more diverse samples of cancer patients and healthy controls, which might help to probe the robustness of cancer detection based on HRV analysis.

Fourth, it should be highlighted that although cancer patients and healthy controls were age and gender-matched, other potential confounds, like physical activity, smoking, body weight, etc.⁵⁵ could have contributed to the differences between groups. As far as such differences are representative for the group of cancer patients and reflect common risk factors for cancer (e.g., a higher number of smokers, lower levels of physical activity, obesity)⁵⁶, the classification based on ML remains valid.

Fifth, despite a diverse selection of ML algorithms, there is a possibility that better results could have been achieved with other ML algorithms.

Sixth, although we used the default hyper-parameters provided by the Caret package, tuning might improve classification performance.

Additionally, the choice of the algorithm applied in the stacking ensemble might impact the results. Future work should focus on exploring different stacking algorithms (i.e., generalised linear model). Finally, although a comparison of different diseases (e.g., diabetes, cardiovascular pathologies, mental disorders) associated with aberrations in HRV was beyond the scope of this research (see, e.g.⁵⁷ for a review of HRV in severe clinical conditions reflecting brain–heart interaction), future research should aim to identify distinct patterns of HRV for each of these conditions and attempt a multi-class classification based on HRV analysis and machine learning techniques.

Conclusion

In this pilot study, we demonstrated preliminary results illustrating machine learning-based cancer classification vs healthy individuals, using linear and non-linear HRV measures. In line with the previous research on HRV in cancer patients^2,4,16,58 the findings confirm significant differences in autonomic function compared to healthy controls. Furthermore, we demonstrated that ML algorithms could classify healthy vs cancer individuals based on ECG with acceptable accuracy, sensitivity, and specificity compared to previous studies. We also found that we could create a meta-classifier that performed slightly better than underlying base-classifiers by using a stacking method. Random Forest led to the highest predictive accuracy, and Naïve Bayes performed worst compared to the other algorithms. The stacked model accuracy was about 8% higher than the best individual model. Ensemble model performance on unseen data was satisfactory (accuracy of 86%), showing a moderate sensitivity (77%) and high specificity (93%). Notably, the moderate sensitivity of this model suggests that ruling out cancer based on this classification method should be done with caution. At the same time, the meta-classifier performed very well in terms of specificity, suggesting high performance when aiming to confirm a suspected cancer diagnosis. Future studies may shed light on how ML and HRV analysis could be of practical value as a supplementary tool in oncology.

Abbreviations

AUC:: Area under the curve
ECG:: Electrocardiogram
FFT:: Fast Fourier transform
RFE:: Recursive features elimination
RF:: Random forest
HRV:: Heart rate variability
IBI:: Interbeat interval
ML:: Machine learning
XGB:: EXtreme gradient boosting
NB:: Naïve Bayes
LDA:: Linear discrimination analysis

References

Wang, H. et al. Global, regional, and national life expectancy, all-cause mortality, and cause-specific mortality for 249 causes of death, 1980–2015: A systematic analysis for the Global Burden of Disease Study 2015. Lancet 388, 1459–1544 (2016).
Google Scholar
Couck, M. D., Brummelen, D. V., Schallier, D., Grève, J. D. & Gidron, Y. The relationship between vagal nerve activity and clinical outcomes in prostate and non-small cell lung cancer patients. Oncol. Rep. 30, 2435–2441 (2013).
PubMed Google Scholar
Reuter, S., Gupta, S. C., Chaturvedi, M. M. & Aggarwal, B. B. Oxidative stress, inflammation, and cancer: How are they linked?. Free Radic. Biol. Med. 49, 1603–1616 (2010).
CAS PubMed PubMed Central Google Scholar
De Couck, M., Caers, R., Spiegel, D. & Gidron, Y. The role of the vagus nerve in cancer prognosis: A systematic and a comprehensive review. J. Oncol. 2018, 1–11 (2018).
Google Scholar
De Couck, M. et al. You may need the vagus nerve to understand pathophysiology and to treat diseases. Clin. Sci. 122, 323–328 (2012).
Google Scholar
Grivennikov, S. I. & Karin, M. Inflammation and oncogenesis: A vicious connection. Curr. Opin. Genet. Dev. 20, 65–71 (2010).
CAS PubMed Google Scholar
Mouton, C. et al. The relationship between heart rate variability and time-course of carcinoembryonic antigen in colorectal cancer. Auton. Neurosci. Basic Clin. 166, 96–99 (2012).
CAS Google Scholar
Yuan, Y., Jiang, Y. C., Sun, C. K. & Chen, Q. M. Role of the tumor microenvironment in tumor progression and the clinical applications (review). Oncol. Rep. 35, 2499–2515 (2016).
CAS PubMed Google Scholar
Gidron, Y., Deschepper, R., De Couck, M., Thayer, J. & Velkeniers, B. The Vagus Nerve can predict and possibly modulate non-communicable chronic diseases: introducing a neuroimmunological paradigm to public health. J. Clin. Med. 7, 371 (2018).
CAS PubMed Central Google Scholar
Gidron, Y., Perry, H. & Glennie, M. Does the vagus nerve inform the brain about preclinical tumours and modulate them?. Lancet Oncol. 6, 245–248 (2005).
PubMed Google Scholar
Ondicova, K. & Mravec, B. Role of nervous system in cancer aetiopathogenesis. Lancet Oncol. 11, 596–601 (2010).
PubMed Google Scholar
Ekbom, A., Lundegårdh, G., McLaughlin, J. K. & Nyrén, O. Relation of vagotomy to subsequent risk of lung cancer: Population based cohort study. Br. Med. J. 316, 518–519 (1998).
CAS Google Scholar
Hall, R. Increased risk of cancer mortality after vagotomy for peptic ulcer: A preliminary analysis. Eur. J. Cancer Prev. 1, 35–37 (1991).
ADS PubMed Google Scholar
Strous, M. T. A. et al. Is pre-operative heart rate variability a prognostic indicator for overall survival and cancer recurrence in patients with primary colorectal cancer?. PLoS ONE 15, e0237244 (2020).
CAS PubMed PubMed Central Google Scholar
Bijoor, S. N., Banerjee, S. & Subbalakshmi, N. K. Influence of cancer severity and functional status of cancer on cardiac parasympathetic indicators. Asian J. Pharm. Clin. Res. 10, 312–316 (2017).
Google Scholar
De Couck, M. & Gidron, Y. Norms of vagal nerve activity, indexed by heart rate variability, in cancer patients. Cancer Epidemiol. 37, 737–741 (2013).
PubMed Google Scholar
Francesco, B. et al. Linear and nonlinear heart rate variability indexes in clinical practice. Comput. Math. Methods Med. 2012, 1–5 (2012).
MathSciNet MATH Google Scholar
Shaffer, F. & Ginsberg, J. P. An overview of heart rate variability metrics and norms. Front. Public Health. 5, 258 (2017).
PubMed PubMed Central Google Scholar
Malik, M. et al. Heart rate variability: Standards of measurement, physiological interpretation, and clinical use. Circulation 93, 1043–1065 (1996).
Google Scholar
Laborde, S., Mosley, E. & Thayer, J. F. Heart rate variability and cardiac vagal tone in psychophysiological research: Recommendations for experiment planning, data analysis, and data reporting. Front. Psychol. 8, 213 (2017).
PubMed PubMed Central Google Scholar
Coumbe, B. G. T. & Groarke, J. D. Cardiovascular autonomic dysfunction in patients with cancer. Curr. Cardiol. Rep. 20, 1–8 (2018).
Google Scholar
Kloter, E., Barrueto, K., Klein, S. D., Scholkmann, F. & Wolf, U. Heart rate variability as a prognostic factor for cancer survival: A systematic review. Front. Physiol. 9, 623 (2018).
PubMed PubMed Central Google Scholar
Lakoski, S. G., Jones, L. W., Krone, R. J., Stein, P. K. & Scott, J. M. Autonomic dysfunction in early breast cancer: Incidence, clinical importance, and underlying mechanisms. Am. Heart J. 170, 231–241 (2015).
PubMed PubMed Central Google Scholar
Shi, B. et al. Nonlinear heart rate variability biomarkers for gastric cancer severity: A pilot study. Sci. Rep. 9, 1–9 (2019).
ADS Google Scholar
Amisha, M. P., Pathania, M. & Rathaur, V. Overview of artificial intelligence in medicine. J. Fam. Med. Prim. Care 8, 2328 (2019).
CAS Google Scholar
Davenport, T. & Kalakota, R. The potential for artificial intelligence in healthcare. Futur. Healthc. J. 6, 94–98 (2019).
Google Scholar
Ledezma, C. A., Zhou, X., Rodríguez, B., Tan, P. J. & Díaz-Zuccarini, V. A modeling and machine learning approach to ECG feature engineering for the detection of ischemia using pseudo-ECG. PLoS ONE 14, e0220294 (2019).
CAS PubMed PubMed Central Google Scholar
Chen, T. M., Huang, C. H., Shih, E. S. C., Hu, Y. F. & Hwang, M. J. Detection and classification of cardiac arrhythmias by a challenge-best deep learning neural network model. Science 23, 100886 (2020).
Google Scholar
Ghorbani, A. et al. Deep learning interpretation of echocardiograms. NPJ Digit. Med. 3, 1–10 (2020).
Google Scholar
Seyd, A. P. T., Joseph, P. K. & Jacob, J. Automated diagnosis of diabetes using heart rate variability signals. J. Med. Syst. 36, 1935–1941 (2012).
Google Scholar
Aggarwal, Y., Das, J., Mazumder, P. M., Kumar, R. & Sinha, R. K. Heart rate variability features from nonlinear cardiac dynamics in identification of diabetes using artificial neural network and support vector machine. Biocybern. Biomed. Eng. 40, 1002–1009 (2020).
Google Scholar
Swapna, G., Soman, K. P. & Vinayakumar, R. Automated detection of diabetes using CNN and CNN-LSTM network and heart rate signals. Procedia Comput. Sci. 132, 1253–1262 (2018).
Google Scholar
Shukla, R. S. & Aggarwal, Y. Nonlinear heart rate variability based artificial intelligence in lung cancer prediction. J. Appl. Biomed. 16, 145–155 (2018).
Google Scholar
Shukla, R. S. & Aggarwal, Y. Fourier transform and autoregressive HRV features in prediction and classification of breast cancer. IETE J. Res. https://doi.org/10.1080/03772063.2021.1878063 (2021).
Article Google Scholar
The Global Cancer Observatory. Austria Fact Sheet 2020 https://gco.iarc.fr/today/data/factsheets/populations/40-austria-fact-sheets.pdf (2021).
Guo, Y. et al. Prognostic value of heart rate variability in patients with cancer. J. Clin. Neurophysiol. 32, 516 (2015).
PubMed PubMed Central Google Scholar
Kim, D. et al. Heart rate variability and length of survival in hospice cancer patients. J. Korean Med. Sci. 25, 1140–1145 (2010).
PubMed PubMed Central Google Scholar
Kuhn, M. caret: Classification and Regression Training. R package version 6.0–88 (2021).
R Core Team. R: A Language and Environment for Statistical Computing. (2021).
Tarvainen, M. P., Niskanen, J. P., Lipponen, J. A., Ranta-aho, P. O. & Karjalainen, P. A. Kubios HRV - Heart rate variability analysis software. Comput. Methods Programs Biomed. 113, 210–220 (2014).
PubMed Google Scholar
Tarvainen, M., Lipponen, J., Niskanen, J. & Ranta-Aho, P. Kubios HRV. (2017).
Couck, M. D., Maréchal, R., Moorthamers, S., Van Laethem, J. L. & Gidron, Y. Vagal nerve activity predicts overall survival in metastatic pancreatic cancer, mediated by inflammation. Cancer Epidemiol. 40, 47–51 (2016).
PubMed Google Scholar
Acharya, R. U., Lim, C. M. & Joseph, P. Heart rate variability analysis using correlation dimension and detrended fluctuation analysis. ITBM-RBM 23, 333–339 (2002).
Google Scholar
De Souza, N. M., Vanderlei, L. C. M. & Garner, D. M. Risk evaluation of diabetes mellitus by relation of chaotic globals to HRV. Complexity 20, 84–92 (2015).
Google Scholar
Mohebbi, M., Ghassemian, H. & Asl, B. M. Structures of the recurrence plot of heart rate variability signal as a tool for predicting the onset of paroxysmal atrial fibrillation. J. Med. Signals Sens. 1, 113–121 (2011).
PubMed PubMed Central Google Scholar
Roy, B. & Ghatak, S. Nonlinear methods to assess changes in heart rate variability in type 2 diabetic patients. Arq. Bras. Cardiol. 101, 317–326 (2013).
PubMed Google Scholar
Schlenker, J. et al. Recurrence plot of heart rate variability signal in patients with vasovagal syncopes. Biomed. Signal Process. Control 25, 1–11 (2016).
Google Scholar
Kuhn, M. & Johnson, K. Applied predictive modelling: Applied Predictive Modeling (Springer, 2013).
MATH Google Scholar
Boehmke, B. & Greenwell, B. Hands-On Machine Learning with R Hands-On Machine Learning with R (Chapman and Hall/CRC, 2019).
MATH Google Scholar
Deane-Mayer, Z. A. & Knowles, J. E. caretEnsemble: Ensembles of Caret Models. R package version 2.0.1 (2019).
Lütkenhöner, B. & Basel, T. Predictive modeling for diagnostic tests with high specificity, but low sensitivity: A study of the glycerol test in patients with suspected Menière’s disease. PLoS ONE 8, e79315 (2013).
ADS PubMed PubMed Central Google Scholar
Tobaldini, E. et al. Heart rate variability in normal and pathological sleep. Front. Physiol. 4, 1–11 (2013).
Google Scholar
Arab, C. et al. Heart rate variability measure in breast cancer patients and survivors: A systematic review. Psychoneuroendocrinology 68, 57–68 (2016).
PubMed Google Scholar
Kim, K., Chae, J. & Lee, S. The role of heart rate variability in advanced non-small-cell lung cancer patients. J. Palliat. Care 31, 103–108 (2015).
PubMed Google Scholar
Quintana, D. S. & Heathers, J. A. J. Considerations in the assessment of heart rate variability in biobehavioral research. Front. Psychol. 5, 805 (2014).
PubMed PubMed Central Google Scholar
Stein, C. J. & Colditz, G. A. Modifiable risk factors for cancer. Br. J. Cancer 90, 299–303 (2004).
CAS PubMed PubMed Central Google Scholar
Schwerdtfeger, A. R. et al. Heart rate variability (HRV): From brain death to resonance breathing at 6 breaths per minute. Clin. Neurophysiol. 131, 676–693 (2020).
PubMed Google Scholar
Gidron, Y., De Couck, M. & De Greve, J. If you have an active vagus nerve, cancer stage may no longer be important. J. Biol. Regul. Homeost. Agents 28, 195–201 (2014).
CAS PubMed Google Scholar

Download references

Acknowledgements

The authors are grateful to Claudia Traunmueller for providing the data of the healthy control group.

Author information

Authors and Affiliations

Division of Oncology, Medical University of Graz, Auenbruggerplatz 15, 8036, Graz, Austria
Marta Vigier & Elisabeth Andritsch
Institute of Psychology, University of Graz, Graz, Austria
Marta Vigier & Andreas R. Schwerdtfeger
Independent Researcher, Graz, Austria
Benjamin Vigier
BioTechMed Graz, Graz, Austria
Andreas R. Schwerdtfeger

Authors

Marta Vigier
View author publications
You can also search for this author in PubMed Google Scholar
Benjamin Vigier
View author publications
You can also search for this author in PubMed Google Scholar
Elisabeth Andritsch
View author publications
You can also search for this author in PubMed Google Scholar
Andreas R. Schwerdtfeger
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Conceptualisation: B.V., M.V., Study design and data analysis: M.V., Drafting of the manuscript: M.V., A.S., Writing & editing: M.V., B.V., A.S., Resources: E.A., Supervision: A.S.

Corresponding author

Correspondence to Marta Vigier.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Vigier, M., Vigier, B., Andritsch, E. et al. Cancer classification using machine learning and HRV analysis: preliminary evidence from a pilot study. Sci Rep 11, 22292 (2021). https://doi.org/10.1038/s41598-021-01779-1

Download citation

Received: 12 February 2021
Accepted: 01 November 2021
Published: 16 November 2021
DOI: https://doi.org/10.1038/s41598-021-01779-1

This article is cited by

Molecular imaging analysis in cancer using deep learning: a review
- Sheetal Prusty
- Lingraj Dora
- Ajith Abraham
Research on Biomedical Engineering (2023)
Potential and limitations of machine meta-learning (ensemble) methods for predicting COVID-19 mortality in a large inhospital Brazilian dataset
- Bruno Barbosa Miranda de Paiva
- Polianna Delfino Pereira
- Milena Soriano Marcolino
Scientific Reports (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Self-supervised learning for human activity recognition using 700,000 person-days of wearable data

An overview of clinical decision support systems: benefits, risks, and strategies for success

A systems identification approach using Bayes factors to deconstruct the brain bases of emotion regulation

Introduction

Methods

Participants

HRV analysis and feature selection

Model development

Ethics approval

Results

HRV analysis

Feature selection

Model performance

Performance of base-classifiers

Performance of meta-classifier

Discussion

Conclusion

Abbreviations

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Molecular imaging analysis in cancer using deep learning: a review

Potential and limitations of machine meta-learning (ensemble) methods for predicting COVID-19 mortality in a large inhospital Brazilian dataset

Comments

Search

Quick links