Introduction

The use of medical literature to guide clinical practice as part of evidence-based medicine can reduce medical error-related deaths in the US, which the Institute of Medicine (IOM) has estimated at over 98,000 annually1,2. Assessing the diagnostic accuracy of medical decision-making aids and tools is an important step toward this goal of improving patient safety and healthcare provision3,4. Standard metrics, including sensitivity, specificity, negative predictive value (NPV), and positive predictive value (PPV), measure the predictive utility of medical decision-making tools5,6,7,8,9. The sensitivity and specificity of diagnostic tests, such as the chest x-ray for pneumothorax, are well established for common illnesses. However, given the myriad of conditions and diagnostic tools available, clinicians often face challenges in selecting the most appropriate sequence of tests for specific, time-sensitive clinical situations.

Shannon entropy, a measure of uncertainty, is a core concept in information theory and machine learning, particularly in decision tree modeling10. Numerous research-based biological and clinical solutions, including diagnostic accuracy evaluation, have been developed on the principle of Shannon entropy11,12,13,14,15,16,17. However, no diagnostic metric that specifically measures the reduction of diagnostic uncertainty has been extensively applied and explored, even though such uncertainty often leads to decision paralysis, the "shotgun" diagnostic approach, over-testing, delayed diagnosis, and patient harm11.

Shannon entropy, defined by Eq. (1), offers a solution:

$$H(x) = - \sum_{i \in x} p_{i} \log_{2}(p_{i}),$$
(1)

where \(p_{i}\) denotes the probability of each possible outcome of the event, and \(p_{i} \log_{2}(p_{i})\) is taken to be zero when \(p_{i} = 0\), justified by the fact that \(p_{i} \log_{2}(p_{i}) \to 0\) as \(p_{i} \to 0^{+}\). Shannon entropy is maximized for a uniform distribution. For binary events in particular, as in this study, the entropy H(x) is highest when both probabilities equal 0.5, that is, when there is the most uncertainty, and is lowest (zero) when the outcome is certain, that is, when the outcome probabilities are one and zero, respectively. This corresponds to the clinical setting, where a patient's diagnostic uncertainty (entropy) is maximal when they enter the hospital before any testing or diagnostic evaluation. Various diagnostic tools subsequently reduce this clinical uncertainty, ideally to a definitive diagnosis.
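As a minimal illustration of Eq. (1) (not code from this study), the short Python snippet below computes the entropy of a binary event and shows that it peaks at \(p_{i}\) = 0.5 and vanishes when the outcome is certain:

```python
import numpy as np

def shannon_entropy(probabilities):
    """Shannon entropy in bits of a discrete distribution, per Eq. (1).

    Terms with p = 0 contribute zero, matching the limit p*log2(p) -> 0.
    """
    p = np.asarray(probabilities, dtype=float)
    p = p[p > 0]  # drop zero-probability outcomes
    return float(-np.sum(p * np.log2(p)))

# For a binary outcome, entropy peaks at p = 0.5 (maximal uncertainty)
# and falls to zero as the outcome becomes certain.
for p in (0.0, 0.1, 0.5, 0.9, 1.0):
    print(f"p = {p:.1f}  ->  H = {shannon_entropy([p, 1 - p]):.3f} bits")
```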

In emergency medicine, removing entropy with testing and imaging tools can clarify the patient's presentation and optimize medical decision-making in time-sensitive settings. Quantifying entropy removal can elucidate the utility and the appropriate sequence of diagnostic tools for reducing uncertainty in those settings, where urgent, lethal pathology must be excluded first. In this study, we aim to characterize the utility and validity of Shannon entropy removal by reanalyzing the performance of 623 clinical decision support tools in a publicly available database18 and comparing it with traditional validity metrics, including sensitivity, specificity, PPV/NPV, Youden’s index, and the diagnostic odds ratio.

Materials and methods

IRB statement

This study is exempt from IRB review at Massachusetts General Hospital and Harvard Medical School because the research involves collecting and studying existing data from publicly available sources, and subjects cannot be identified directly or through identifiers linked to the subjects.

Data compilation

Diagnostic metrics (true and false positives and negatives) were compiled from an established online database of diagnostic accuracy, “Get the Diagnosis”, comprising 533 studies of 623 decision-making tools for 267 diagnoses18. Data collection was performed from November 17, 2022 through January 22, 2023. PubMed was used when studies cited in the online database could not be accessed directly; concomitant diagnostic tools were also explored separately for elements included in the database where applicable (for example, if studies evaluating the diagnostic accuracy of mammography for breast cancer screening were included in the database, data were also compiled for low-dose computerized tomography (CT) scans for breast cancer screening; see Data availability statement for details). These data were used to calculate sensitivities, specificities, NPVs, and PPVs. In addition, the data were used to generate decision tree representations of each decision-making tool, from which Shannon entropy and entropy removal were calculated (see “Decision tree representation” and Fig. 1 in addition to “Entropy calculation”).
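For reference, these standard metrics follow directly from the 2 × 2 table counts; a minimal sketch of the calculations, using hypothetical counts rather than values from the database, is shown below:

```python
def diagnostic_metrics(tp, fp, fn, tn):
    """Standard 2 x 2 table metrics reported alongside entropy removal."""
    return {
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "ppv": tp / (tp + fp),
        "npv": tn / (tn + fn),
        "youden_index": tp / (tp + fn) + tn / (tn + fp) - 1,
        "diagnostic_odds_ratio": (tp * tn) / (fp * fn),
    }

# Hypothetical counts for illustration only (not values from the database).
for metric, value in diagnostic_metrics(tp=90, fp=10, fn=5, tn=95).items():
    print(f"{metric}: {value:.3f}")
```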

Figure 1

Decision tree representation of 2 × 2 diagnostic table. Diagnostic variables (TP/FP/FN/TN) are utilized to represent a 2 × 2 table and its corresponding medical decision-making tool as a decision tree for entropy analysis.

In this study, patient-derived datasets were systematically bootstrapped using the decision tree data previously reported in the literature (see “Machine learning modeling and analysis”). This data was specifically derived from the “Step-By-Step Approach to Febrile Infants” and the “Pediatric Emergency Care Applied Research Network (PECARN) Pediatric Head Injury/Trauma Algorithm”19,20,21.

Similar methods have been used in other studies to generate health data for evaluating healthcare solutions from datasets such as HES, A&E, and MIMIC22,23. In each case, synthetic datasets that preserved the statistical properties of the original real data were generated24. This was accomplished by using the decision tree data provided in the original and validation papers of the respective studies and synthesizing a binary dataset of the relevant features of each algorithm (e.g., leukocyturia, age less than 21 days, loss of consciousness) coded as binary values (0 for absence, 1 for presence). The original data were thus recreated with each patient reduced to the characteristics relevant to the respective study, expressed as a set of binary values suitable for machine learning modeling.
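The sketch below illustrates this synthesis step under entirely hypothetical assumptions (the feature names, branch counts, and outcome column are illustrative and are not taken from the Step-By-Step or PECARN publications): reported aggregate branch counts are expanded into one row per synthetic patient.

```python
import pandas as pd

# Hypothetical branch counts of the kind reported in an original or validation
# paper: each entry lists the binary feature values defining a branch, a binary
# outcome label, and the number of patients who followed that branch.
# All names and counts below are illustrative only.
branches = [
    ({"ill_appearing": 1, "age_under_21_days": 0, "leukocyturia": 0, "outcome": 1}, 35),
    ({"ill_appearing": 0, "age_under_21_days": 1, "leukocyturia": 0, "outcome": 1}, 52),
    ({"ill_appearing": 0, "age_under_21_days": 0, "leukocyturia": 1, "outcome": 0}, 80),
    ({"ill_appearing": 0, "age_under_21_days": 0, "leukocyturia": 0, "outcome": 0}, 833),
]

# Expand the aggregate counts into one row per synthetic patient, preserving
# the statistical structure of the reported data.
rows = [dict(features) for features, count in branches for _ in range(count)]
synthetic = pd.DataFrame(rows)

print(synthetic.shape)            # (1000, 4)
print(synthetic.mean().round(3))  # marginal prevalence of each binary column
```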

Decision tree representation

Decision trees consist of parent nodes that split into child nodes. Using the diagnostic metrics from 2 × 2 diagnostic tables, these nodes and decision splits can be generated to produce a decision tree representation of each medical decision-making tool, as in Fig. 1, where N is the sample size of the study, TP is the number of true positives, FP is the number of false positives, FN is the number of false negatives, and TN is the number of true negatives.

Entropy calculation

Using diagnostic metrics (N, TP, FP, etc.), Shannon entropy was calculated for the parent node and child nodes as shown in Eqs. (2) through (4), with \(n_{positive}\) and \(n_{negative}\) representing the number of positive and negative tests, respectively:

$$entropy_{parent\ node} = \left[ \frac{FP + TN}{N} \times \left( \log_{2}(N) - \log_{2}(FP + TN) \right) \right] + \left[ \frac{TP + FN}{N} \times \left( \log_{2}(N) - \log_{2}(TP + FN) \right) \right]$$
(2)
$$entropy_{child\ node\ 1} = \left[ \frac{TP}{n_{positive}} \times \left( \log_{2}(n_{positive}) - \log_{2}(TP) \right) \right] + \left[ \frac{FP}{n_{positive}} \times \left( \log_{2}(n_{positive}) - \log_{2}(FP) \right) \right]$$
(3)
$$entropy_{child\ node\ 2} = \left[ \frac{FN}{n_{negative}} \times \left( \log_{2}(n_{negative}) - \log_{2}(FN) \right) \right] + \left[ \frac{TN}{n_{negative}} \times \left( \log_{2}(n_{negative}) - \log_{2}(TN) \right) \right]$$
(4)

Entropy removal was calculated by Eq. (5), where entropy removal equals the difference between the entropy of the parent node (the total entropy of the system) and the weighted average entropy of the child nodes (weighted by \(n_{positive}\) and \(n_{negative}\), respectively):

$$\begin{aligned} entropy\ removal & = \left[ \frac{FP + TN}{N} \times \left( \log_{2}(N) - \log_{2}(FP + TN) \right) + \frac{TP + FN}{N} \times \left( \log_{2}(N) - \log_{2}(TP + FN) \right) \right] \\ & \quad - \frac{n_{positive}}{N} \left[ \frac{TP}{n_{positive}} \times \left( \log_{2}(n_{positive}) - \log_{2}(TP) \right) + \frac{FP}{n_{positive}} \times \left( \log_{2}(n_{positive}) - \log_{2}(FP) \right) \right] \\ & \quad - \frac{n_{negative}}{N} \left[ \frac{FN}{n_{negative}} \times \left( \log_{2}(n_{negative}) - \log_{2}(FN) \right) + \frac{TN}{n_{negative}} \times \left( \log_{2}(n_{negative}) - \log_{2}(TN) \right) \right] \end{aligned}$$
(5)
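The following Python sketch implements Eqs. (2) through (5) for a single 2 × 2 table, using hypothetical counts; entropy removal is also expressed as a percentage of the parent-node entropy, as in the percentages reported in the Discussion:

```python
from math import log2

def entropy_removal(tp, fp, fn, tn):
    """Entropy removal of a diagnostic test from its 2 x 2 table, per Eqs. (2)-(5)."""
    n = tp + fp + fn + tn
    n_pos, n_neg = tp + fp, fn + tn  # number of positive and negative tests

    def node_entropy(counts, total):
        # Entropy of a node from its outcome counts; terms with a zero count
        # contribute zero, consistent with the convention for Eq. (1).
        return sum((c / total) * (log2(total) - log2(c)) for c in counts if c > 0)

    parent = node_entropy([tp + fn, fp + tn], n)                 # Eq. (2)
    children = ((n_pos / n) * node_entropy([tp, fp], n_pos)      # Eq. (3), weighted
                + (n_neg / n) * node_entropy([fn, tn], n_neg))   # Eq. (4), weighted
    return parent, parent - children                             # Eq. (5)

# Hypothetical counts for illustration only.
parent, removed = entropy_removal(tp=90, fp=10, fn=5, tn=95)
print(f"removed {removed:.3f} of {parent:.3f} bits "
      f"({100 * removed / parent:.1f}% of the initial uncertainty)")
```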

Data provided in the validation study by Gomez et al. were utilized to generate a patient dataset for analysis of the Step-By-Step Approach to Febrile Infants19. Data provided in the original study by Kuppermann et al. were similarly utilized to generate two separate patient datasets for analysis of the PECARN Pediatric Head Injury/Trauma Algorithm: one for patients less than 2 years of age and another for patients 2 years of age or older20.

Machine learning modeling and analysis

The Python matplotlib and scikit-learn packages were used in this study to generate, evaluate the performance of, and visualize the machine learning models developed from the synthetic patient datasets25,26 (see Data availability statement for more details regarding the machine learning models developed). Decision tree-based diagnostic algorithms lend themselves to Shannon entropy analysis of the decision-making tool in its entirety and of its constituent steps/nodes, allowing each feature in the algorithm to be evaluated. A decision tree was produced for each patient dataset, and these decision trees were subsequently analyzed for the entropy removal and feature importance of each step within the algorithm. In the context of machine learning, feature importance is defined as the relative importance of each feature when making a prediction and is calculated as the decrease in entropy weighted by the probability of reaching that node, as shown below in Eqs. (6) and (7):

$$ni_{j} = w_{j} C_{j} - w_{left(j)} C_{left(j)} - w_{right(j)} C_{right(j)}$$
(6)

(where \(ni_{j}\) is the importance of node j, \(w_{j}\) is the weighted number of samples reaching node j, \(C_{j}\) is the impurity value of node j, and left(j) and right(j) are the child nodes resulting from the left and right splits on node j, respectively).

$$fi_{i} = \frac{\sum_{j:\ node\ j\ splits\ on\ feature\ i} ni_{j}}{\sum_{k \in all\ nodes} ni_{k}}$$
(7)

(where \(fi_{i}\) is the importance of feature i, \(ni_{j}\) is the importance of node j, and \(ni_{k}\) is the importance of node k).
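In scikit-learn, these quantities correspond to a decision tree grown with the entropy criterion and its feature_importances_ attribute. The sketch below fits such a tree to a hypothetical binary dataset (not the study's data) and prints the per-feature importances:

```python
import numpy as np
from sklearn.tree import DecisionTreeClassifier, export_text

feature_names = ["feature_a", "feature_b", "feature_c"]  # illustrative names only

# Hypothetical binary dataset of the form described above (not the study's data).
rng = np.random.default_rng(0)
X = rng.integers(0, 2, size=(1000, len(feature_names)))
y = ((X[:, 0] == 1) | ((X[:, 1] == 1) & (rng.random(1000) < 0.7))).astype(int)

# criterion="entropy" makes each split maximize information gain (entropy
# removal); feature_importances_ then corresponds to Eqs. (6) and (7).
tree = DecisionTreeClassifier(criterion="entropy", max_depth=3, random_state=0)
tree.fit(X, y)

for name, importance in zip(feature_names, tree.feature_importances_):
    print(f"{name}: {importance:.3f}")
print(export_text(tree, feature_names=feature_names))
```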

Results

Entropy removals

Entropy removal was calculated in addition to sensitivity, specificity, NPV, and PPV as well as diagnostic odds ratio and Youden’s index for 533 studies to evaluate the 623 medical decision-making tools.

Entropy removal displayed significant but weak positive correlations with sensitivity and NPV and significant moderate positive correlations with specificity and PPV (p < 0.001). Entropy removal exhibited significant strong positive correlations with comprehensive clinical diagnostic metrics, namely Youden’s index and the logged diagnostic odds ratio (p < 0.001). Z-score tests for differences between correlations revealed that the correlations of Youden’s index and the logged diagnostic odds ratio with entropy removal differed significantly from the correlations of the other explored diagnostic metrics with entropy removal (p < 0.001). Figures 2 and 3 illustrate the correlations between the different diagnostic metrics and entropy removal. Tables 1 and 2 provide the Pearson and Spearman correlation coefficients for entropy removal and the different diagnostic metrics, respectively. Tables 3 and 4 provide example comparisons of the diagnostic accuracy metrics of tests evaluating for pneumothorax and thoracic aortic dissection, respectively. Table 5 displays the results of the entropy removal analysis of decision tree-based clinical algorithms and their constituent steps.

Figure 2

Scatterplot of removed entropy and tool sensitivity and specificity. 623 medical decision-making tools were analyzed. (A) Sensitivity exhibits a 0.46 Pearson correlation and 0.55 Spearman correlation with entropy removal (p < 0.001). (B) Specificity exhibits a 0.61 Pearson correlation and 0.74 Spearman correlation with entropy removal (p < 0.001).

Figure 3

Scatterplot of removed entropy and tool positive predictive value and negative predictive value. 623 medical decision-making tools were analyzed. (A) Positive predictive value exhibits a 0.60 Pearson correlation and 0.71 Spearman correlation with entropy removal (p < 0.001). (B) Negative predictive value exhibits a 0.41 Pearson correlation and 0.46 Spearman correlation with entropy removal (p < 0.001).

Table 1 Pearson correlation coefficients of diagnostic metrics and entropy removal.
Table 2 Spearman correlation coefficients of diagnostic metrics and entropy removal.
Table 3 Comparison of diagnostic metrics of tests for pneumothorax.
Table 4 Comparison of diagnostic metrics for thoracic aortic dissection evaluation.
Table 5 Results of machine learning analysis for medical decision-making algorithms.

The diagnostic metrics of different diagnostic tools that assess patients for the same pathology could be compared directly, as in Tables 3 and 4.

Bootstrapping and stepwise entropy calculation

Decision tree machine learning analysis of the generated patient datasets yielded the exact decision trees of the original algorithms, supporting the validity of the clinical algorithms. Table 5 shows the results of machine learning analysis.

Discussion

Our study demonstrates the potential utility of quantifying the entropy removal of medical diagnostic decision-making tools. Diagnostic tools that are 100% sensitive and 100% specific (i.e., definitively diagnostic) also have an entropy removal of 100%, as all entropy (uncertainty) regarding a particular pathology is completely removed. When diagnostic tools are less than 100% sensitive and/or specific, our entropy removal calculations provide further insight into how much diagnostic value the tool provides. In other words, entropy removal may be used as a “meta-metric” to assess existing clinical diagnostic metrics. The strong positive correlations of entropy removal with established comprehensive measures of diagnostic accuracy (Youden’s index and logged diagnostic odds ratio) may support its validity, while its distinctive advantages support its novelty. Entropy removal provided insight into the diagnostic value of medical decision-making tools beyond the limitations of Youden’s index and the diagnostic odds ratio (which include the omission of disease prevalence from their calculation, as well as inherent limitations of their respective formulas), demonstrating clinical utility with particular potential in emergency medicine, where rapid exclusion of critical diagnoses in time-limited emergencies is essential. This utility of entropy removal in assessing the diagnostic value of medical decision-making tools can also be seen when comparing different tools that evaluate for the same pathology.

Traditional measures of test quality, such as sensitivity and specificity, are not as easily used as entropy removal for comparing diagnostic strategies. For example, evaluation for thoracic aortic dissection via helical CT scan has a sensitivity of 97.64% and a specificity of 98.90%, whereas evaluation by MRI has a slightly lower sensitivity (93.33%) but a higher specificity (99.30%). Entropy removal calculation reveals that helical CT removes 87.23% of all entropy with respect to thoracic aortic dissection while MRI removes 81.36%, indicating the superior overall diagnostic value of helical CT in assessing for thoracic aortic dissection. This demonstrates the ability of entropy removal to provide clarification and stratification that sensitivity and specificity do not offer. The same advantage can be seen in the comparison between chest x-ray (CXR) and low-dose CT for lung cancer screening: CXR has the greater sensitivity (88.89% versus 73.38% for low-dose CT) and low-dose CT the greater specificity (97.00% versus 92.60% for CXR), yet CXR has the greater entropy removal (32.03% versus 28.20%). The superior imaging test for pneumothorax can likewise be identified by entropy removal calculations: chest ultrasound yields the superior sensitivity (95.12% versus 55.83% for supine AP chest x-ray), while supine AP chest x-ray provides the greater specificity (100% versus 98.87% for ultrasound); nonetheless, chest ultrasound has the greater entropy removal (82.38% versus 40.94%). Entropy removal thus has the potential to provide an evidence-based foundation for the dynamic evaluation of patients, as it can serve as a basis for deciding which tests or tools to perform, and in what order, to most effectively eliminate uncertainty regarding a patient’s acute presentation in time-restricted settings.

Quantifying the entropy removal capability of medical diagnostics also opens the door to further exploration in healthcare innovation, such as quantifying the impact of clinical guidelines by analyzing and comparing the diagnostic value of decision-making tools and tests. Entropy removal also has potential use in the financial analysis of healthcare costs, as metrics such as entropy removal per cost can be calculated to evaluate healthcare cost efficiency, for example, the percent entropy removed per US dollar (USD) by a diagnostic tool. Using publicly available Medicare costs27, a chest x-ray screening for lung cancer was found to remove 1.28% entropy per USD while a low-dose CT scan screening for lung cancer removed 0.27% entropy per USD. Similarly, a chest ultrasound evaluating for pneumothorax removes 3.30% entropy per USD while a CXR evaluating for pneumothorax removes 1.64% entropy per USD. As a final example, an ultrasound of the abdomen evaluating for nephrolithiasis removes 0.13% entropy per USD and a CT scan of the abdomen evaluating for nephrolithiasis removes 0.11% entropy per USD.
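Such a cost-efficiency metric is a simple ratio of entropy removed to cost; the sketch below illustrates the calculation using the entropy-removal percentages quoted above together with round placeholder costs (not the actual Medicare figures used in the study):

```python
def entropy_removed_per_usd(percent_entropy_removed, cost_usd):
    """Percent of diagnostic entropy removed per US dollar spent."""
    return percent_entropy_removed / cost_usd

# Entropy-removal percentages are those quoted above for lung cancer screening;
# the costs are placeholder values, not the Medicare figures cited in the study.
tests = {
    "CXR, lung cancer screening":         (32.03, 30.0),
    "Low-dose CT, lung cancer screening": (28.20, 120.0),
}
for name, (removed_pct, cost_usd) in tests.items():
    print(f"{name}: {entropy_removed_per_usd(removed_pct, cost_usd):.2f}% per USD")
```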

With respect to entropy removal per USD in the examples above, chest x-ray would be the more cost-effective choice for lung cancer screening compared with low-dose CT, and chest ultrasound the more cost-effective choice for pneumothorax evaluation compared with chest x-ray. With regard to nephrolithiasis assessment, a CT scan removes marginally more entropy than ultrasound but has inferior cost-effectiveness (as measured by entropy removal per USD) compared with ultrasound. All of these results have the potential to inform medical decision-making in various contexts, providing an alternative means of cost-effectiveness analysis for assessing the efficiency of healthcare systems.

Furthermore, entropy removal can be used to evaluate the diagnostic quality of entire departments or systems. For example, a 2021 study evaluated the diagnostic performance of attending and resident radiologists in identifying COVID-19 on chest x-rays, finding that attending radiologists diagnosed COVID-19 with a sensitivity of 78.98% and a specificity of 80.45%, compared with 75.09% and 57.89%, respectively, for resident radiologists28. Entropy removal calculations can further evaluate the diagnostic quality of each subgroup, revealing that attending radiologists removed 24.55% of clinical uncertainty regarding COVID-19 via chest x-rays while residents removed only 7.55%.

Shannon entropy, proposed as a big data metric29, can be used to evaluate diagnostic quality not just for specific pathologies but across entire hospitals or health networks. Its utility extends to other research applications in which true and false positive and negative counts are reported. Beyond individual groups, entropy removal can gauge the performance of whole departments and networks, serving as an indicator of healthcare innovation and quality. It can also highlight healthcare disparities by comparing diagnostic efficiency across regions and patient groups.

The generation of synthetic patient datasets from medical decision-making algorithms, and the subsequent analysis of these algorithms by decision tree machine learning, also showed potential utility, albeit with limitations (see below). The resulting machine learning decision trees and calculated metrics were in line with the medical decision-making algorithms used in practice, and the results of this analysis can be understood to support and further validate these current clinical guidelines. The results also quantified the effectiveness of the individual constituent steps of the algorithms, providing measurable insight into the most clinically relevant information for patient assessment within the algorithms. If more data are provided in the literature for the development and validation of medical decision-making algorithms, deeper analysis can be performed to evaluate these diagnostic tools more thoroughly.

The limitations of this study include the fact that the findings are based on statistical and mathematical modeling and will require further application to clinical practice. While the application of Shannon entropy to medical diagnostics, as in this study, is a limited implementation of established information theory and machine learning concepts to publicly available data, the need for clinical validation remains. For example, differences between entropy removal and established metrics do not necessarily mean that such differences are clinically meaningful or accurately reflect the performance of the decision support tools unless prospective testing is performed. While this was outside the scope of this study, further investigation and validation of these phenomena is warranted. Furthermore, the stepwise evaluation of algorithms described in the latter portions of this paper used bootstrapped (resampled) data, which is highly internally consistent but also requires prospective and external validation. Larger datasets from healthcare electronic medical records could provide valuable insight using our entropic approach.