Introduction

Coronavirus disease 2019 (COVID-19) was initially detected in Wuhan, China and is caused by a novel RNA virus belonging to the Coronaviridae family. It is believed to have been transmitted to humans from bats via an intermediate mammalian host before achieving human-to-human transmission. Such a zoonotic origin is consistent with similar coronavirus outbreaks1,2,3,4. Coronaviridae is a family of nonsegmented, enveloped, positive-sense, single-stranded ribonucleic acid viruses. Six species of coronavirus had previously been identified as pathogenic in humans: four of these cause mild respiratory illnesses, whereas the other two species, severe acute respiratory syndrome coronavirus (SARS-CoV) and Middle East respiratory syndrome coronavirus (MERS-CoV), have led to epidemics with significant rates of mortality5.

The clinical diagnosis of COVID-19 depends on a range of symptoms, including fever in 98% of cases, dry cough (75%), fatigue (45%), muscle aches (45%), difficulty breathing (55%), and acute respiratory distress syndrome (ARDS) (20%). Severe cases may progress to multiorgan dysfunction and even death (2.5%)4,6,7,8. The disease may be classified as (i) mild type: moderate clinical symptoms with a normal chest X-ray; (ii) typical type: fever, respiratory, and other clinical findings indicating signs of pneumonia; (iii) severe type: respiratory distress signs (respiratory rate \(\ge 30\) breaths per minute and/or blood oxygen saturation of less than 93%); (iv) critical type: dysfunction of respiration necessitating mechanical ventilation, shock, and organ damage requiring monitoring and treatment in the intensive care unit (ICU)9.

Due to the wide variations in clinical presentation and progression rate for COVID-19, laboratory confirmation of SARS-CoV-2 infection is essential to initiate appropriate early treatment and to prevent further spread of the disease10,11,12,13,14,15. The current reference standard for this purpose is real-time reverse transcription polymerase chain reaction (PCR) of viral RNA. The PCR test, according to current guidelines, is run on samples from nasopharyngeal and/or throat swabs. While PCR is the gold standard in diagnosing patients with COVID-19 infection, the sensitivity of a single PCR is suboptimal and depends on the timing of the test, sampling sites and sampling techniques7,8,10,11,12.

Chest radiography is helpful for first-line evaluation of patients with a high pre-test probability of overt COVID-19 pneumonia, clinical follow up, and for the evaluation of potential complications. Chest radiography can detect areas of ground glass density, also observed on chest computed tomography (CT), which may often have a correlation with the severity of the disease, and may be intermixed with reticular pattern12,13,14,15,16,17.

Based on the recent clinical research, COVID-19 radiological forms are variable in severity using plain radiography or CT, ranging from a normal chest (albeit rarely), to patchy involvement of one or both lungs in mild or moderate cases, to diffuse infiltration (white lung) in severe cases. This is an important issue as mild or moderate cases can be managed by medical treatment or non-invasive ventilation, while severe cases with bilateral lung infection urgently need mechanical ventilation to support respiration as patients develop ARDS. Given the paucity of mechanical ventilation units, patient selection for ventilation plays a crucial role in saving lives. We propose a methodology that utilizes the current state of machine learning and artificial intelligence (AI) to assist physicians by providing an objective metric that can differentiate severe cases from mild/moderate cases and potentially even predict mortality.

There are few preliminary studies and case reports discussing the role of AI on plain radiography and CT for early diagnosis of patients with COVID-19. AI can be used in conjunction with radiologists to improve the results of detection of COVID-19. AI can be a powerful aid in delineating and quantifying lesions in X-ray images and in tracking longitudinal changes between exams, which is crucial for precision medicine. In essence, AI is another means of analyzing data that clinicians can draw on to inform their judgment in issues of triage, diagnosis (in combination with PCR tests and epidemiological risk), prognosis, and selection between therapeutic alternatives in patients exhibiting COVID-19 symptoms. Plain radiography involves a low radiation dose compared to CT and is better suited for routine monitoring and follow up compared to a CT scan. AI may be capable of detecting subtle changes visible on either chest X-ray or CT in the lung, and can improve efficiency by decreasing the amount of time to return test results. This is necessary for screening the general population during the current COVID-19 pandemic and in the epicenters of any future outbreaks. Computer assisted detection alleviates the burden on radiologists and clinicians and facilitates rapid triage. Also, AI can be used for the differentiation of previous lung injury unrelated to COVID-19 from advanced lung dysfunction due to COVID-19, and assist in patient selection for ventilation18,19,20,21,22,23,24,25.

CAD systems for assessing lung function in COVID-19 are limited in the literature. Sun et al.26 developed an approach using deep transfer learning to detect signs of COVID-19-related pneumonia in chest X-ray images. Hassanien et al.27 developed an automatic segmentation method for lung areas affected by COVID-19 that employed an Otsu-derived algorithm for multi-level thresholding of X-ray images and a support vector machine for the prediction task. Apostolopoulos et al.28 studied the efficacy of deep learning convolutional neural networks (CNN) for deriving characteristic COVID-19 biomarkers from chest X-rays. Wang et al.29 presented a novel CNN, named COVID-Net, tailored to the detection of COVID-19-related changes in chest X-rays. The deep learning network of Hammoudi et al.30, on the other hand, was designed to automatically detect if a chest X-ray image indicates healthy lungs or evidence of pneumonia (bacterial or viral). Combined with prior information regarding the likelihood the patient has been exposed to the virus, an automatic diagnosis of viral pneumonia has a high true positive rate for detection of COVID-19.

Currently, the primary challenge is to apply different AI-based approaches to determine the severity of chest infection in COVID-19 patients given that X-ray images vary enormously in image quality due to the wide range of X-ray machines in use across the world. To overcome this challenge, we develop a new CAD system that operates on extracted X-ray image markers that are invariant under rotation, scaling, and translation.

Materials and methods

Patient data

The proposed approach is tested and validated on data from a publicly available archive of COVID-19 positive cases31, data from the COVID-19 Open Research Dataset Challenge (CORD-19)32, and data from the University of Louisville, USA and Mansoura University, Egypt. The research protocol was approved by the institutional review boards (IRBs) at the University of Louisville and Mansoura University, all methods were performed in accordance with the relevant guidelines and regulations, and informed consent was obtained from the patients. For patients who passed away, informed consent was obtained from a legal guardian or next of kin. These databases include 200 subjects who tested positive for COVID-19: 100 patients who eventually died from the infection and 100 patients who ultimately recovered. The databases comprise a heterogeneous collection of digital X-ray images, which was the primary motivation to develop a rotation-, scale-, and translation-invariant MGRF model from which we extract the proposed imaging markers to grade the severity of lung infection in COVID-19 patients. We performed our experiments on the merged datasets from all three databases to overcome the issue of class imbalance, as the number of subjects in the two classes (high and low severity) differed within each individual database. The deceased cases had been confirmed based on the following radiology protocol33: (i) bilateral and predominantly peripheral opacifications and/or consolidations were rated as typical for a COVID-19 infection; (ii) a distribution pattern with opacifications and/or consolidations limited to one pulmonary lobe, consistent with lobar pneumonia, was rated as non-typical for a COVID-19 infection. All changes that could not be classified as typical or non-typical were rated as indeterminate. In a subgroup of these patients with indeterminate findings, soft criteria for a possible COVID-19 infection were defined as the unilateral presence of predominantly peripheral opacifications and/or consolidations.

Proposed computer aided diagnostic (CAD) system

The proposed CAD system to detect the severity of lung infection is shown in Fig. 1. The CAD system consists of three major steps: (i) preprocessing steps to improve contrast of the X-ray images and identify the region of interest in order to enhance diagnostic accuracy of subsequent steps; (ii) modeling the appearance of infected chest tissue using a new Markov–Gibbs random field (MGRF) constructed to be invariant under rotation, translation, and change of scale; and (iii) a neural network (NN)-based fusion and diagnostic system to determine whether the grade of lung infection is low or high.

Figure 1
figure 1

The pipeline of the proposed CAD system for COVID-19 diagnosis and grading.

Data preprocessing

To improve the accuracy of the proposed approach, we manually segmented the lung region from the original X-ray image, Fig. 2a, as demonstrated in Fig. 2b. The second step is to enhance lung tissue contrast, for which we use regional dynamic histogram equalization (RDHE)34. Proper analysis of the type of noise present in the chest X-ray image may help to select proper denoising methods, which preserve the important texture information while reducing the noise35,36. The RDHE approach divides the image into blocks \(x\) rows high by \(y\) columns wide. Then, dynamic histogram equalization is applied within each block to adaptively enhance the contrast. Therefore, the image histogram is remapped block by block, and pixel values are adjusted relative to the other pixels in their \(x\times y\) neighborhood. The contrast-enhanced X-ray image resulting from the RDHE approach is illustrated in Fig. 2c.
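The block-wise remapping can be sketched as follows. This is a minimal illustration assuming a grayscale 8-bit image, not the authors' exact RDHE implementation; the function names are ours:

```python
import numpy as np

def equalize_block(block, n_levels=256):
    """Remap gray levels inside one block using its own CDF
    (ordinary histogram equalization, restricted to the block)."""
    hist = np.bincount(block.ravel(), minlength=n_levels)
    cdf = np.cumsum(hist).astype(np.float64)
    cdf /= cdf[-1]  # normalize the cumulative histogram to [0, 1]
    lut = np.round(cdf * (n_levels - 1)).astype(block.dtype)
    return lut[block]

def regional_histogram_equalization(img, block_rows, block_cols):
    """Split the image into blocks and equalize each block independently,
    so that contrast is adapted to each local neighborhood."""
    out = img.copy()
    h, w = img.shape
    for r0 in range(0, h, block_rows):
        for c0 in range(0, w, block_cols):
            blk = img[r0:r0 + block_rows, c0:c0 + block_cols]
            out[r0:r0 + block_rows, c0:c0 + block_cols] = equalize_block(blk)
    return out
```

Because each block is remapped against only its own histogram, dark and bright regions of the lung receive independent contrast stretches, which is the behavior RDHE exploits.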

Figure 2
figure 2

Illustration of the preprocessing steps in the proposed CAD system: (a) original X-ray image, (b) roughly segmented lung region, (c) enhanced contrast of lung region, and (d) extracted candidate abnormal tissues.

The third preprocessing step is to identify and mask off the healthy lung tissues from the infected tissues. This step narrows the search space to focus only on the abnormal tissues and serves to increase the diagnostic accuracy of the CAD system. To achieve this step, we use our previously published methodology37 that considers both the spatial interaction between nearby image pixels and the intensity distribution of those pixels within the lung region of interest. We follow the conventional description of the MGRF model in terms of independent signals (images) and interdependent region labels (segmentations); yet we focus on more accurate model identification37. Each image segment corresponds to a single dominant mode of the empirical distribution (i.e. histogram) of gray levels. To identify the dominant modes, each image histogram is considered to be sampled from a linear combination of discrete Gaussians (LCDG) distribution37. We fit an initial LCDG model to the empirical distribution using a modified expectation-maximization (EM) algorithm37. Free parameters of the LCDG to be optimized are the number of discrete Gaussian components and their respective weights (positive and negative), shape, and scale parameters. Then, the initial LCDG-based segmentation is iteratively refined using the MGRF model with its analytically estimated potentials37. Figure 2d shows the extracted pathological tissues using our proposed algorithm. Additional details can be found in El-Baz et al.37.
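The full LCDG model uses both positive and negative discrete Gaussian components and an iterative MGRF refinement37, which is beyond a short sketch. As a simplified, hypothetical stand-in, the separation of dominant histogram modes can be illustrated with a plain two-component Gaussian mixture fitted by expectation-maximization:

```python
import numpy as np

def fit_two_mode_gmm(pixels, n_iter=100):
    """Simplified stand-in for the LCDG fit: a two-component Gaussian
    mixture estimated with EM, one component per dominant histogram mode."""
    x = np.asarray(pixels, dtype=np.float64).ravel()
    # initialize the two modes from the data quartiles
    mu = np.array([np.quantile(x, 0.25), np.quantile(x, 0.75)])
    sigma = np.full(2, x.std() + 1e-6)
    w = np.array([0.5, 0.5])
    for _ in range(n_iter):
        # E-step: posterior responsibility of each mode for each pixel
        pdf = np.stack([
            w[k] / (np.sqrt(2 * np.pi) * sigma[k])
            * np.exp(-0.5 * ((x - mu[k]) / sigma[k]) ** 2)
            for k in range(2)
        ])
        resp = pdf / np.maximum(pdf.sum(axis=0), 1e-300)
        # M-step: re-estimate weights, means, and standard deviations
        nk = resp.sum(axis=1)
        w = nk / nk.sum()
        mu = (resp * x).sum(axis=1) / nk
        sigma = np.sqrt((resp * (x - mu[:, None]) ** 2).sum(axis=1) / nk) + 1e-6
    return w, mu, sigma

def classify_pixels(pixels, w, mu, sigma):
    """Assign each pixel to the mixture mode with the highest responsibility."""
    x = np.asarray(pixels, dtype=np.float64).ravel()
    pdf = np.stack([
        w[k] / (np.sqrt(2 * np.pi) * sigma[k])
        * np.exp(-0.5 * ((x - mu[k]) / sigma[k]) ** 2)
        for k in range(2)
    ])
    return np.argmax(pdf, axis=0)
```

In the published method37, an initial labeling of this kind is then refined iteratively using the MGRF model with analytically estimated potentials.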

Rotation, scale, and translation invariant MGRF model

We constructed the proposed MGRF model so that an image need not be aligned with any particular frame of reference before it is used to grade the severity of lung infection (low vs. high). To model the appearance of the infected lung regions, we consider the X-ray images as samples from a piecewise stationary MGRF with a central-symmetric system of pixel-pixel interactions. Let \(\mathbf {n}_{\nu }\) denote a set of central-symmetric pixel neighborhoods indexed by \(\nu \in \{1, \ldots , N\}\). Each \(\mathbf {n}_{\nu }\) is a set of coordinate offsets \((\xi , \eta )\) specified by a semi-open interval of interpixel distances \((d_{\nu ,\min }, d_{\nu ,\max }]\) such that the \(\mathbf {n}_{\nu }\)-neighborhood of pixel \((x, y)\) comprises all pixels \((x^\prime , y^\prime )\) such that \(d_{\nu ,\min } < \sqrt{(x - x^\prime )^2 + (y - y^\prime )^2} \le d_{\nu ,\max }\). A neighborhood system corresponding to \(d_{\nu ,\min } = \nu - {1\over 2}\) and \(d_{\nu ,\max } = \nu + {1\over 2}\), \(\nu \in \{1, 2, 3\}\), is illustrated in Fig. 3. Associated with the neighborhood system is a set of \(N + 1\) Gibbs potential functions of gray values and gray value co-occurrences, \(V_0:Q \rightarrow \mathbb {R}\) and \(V_\nu :Q\times Q \rightarrow \mathbb {R}\), \(\nu \in \{1, \ldots , N\}\), where Q is the range of pixel gray levels, e.g. \(Q = \{0, \ldots , 255\}\) in the case of 8-bit images.
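The neighborhood shells can be enumerated directly from the distance interval. A small sketch (the function name is illustrative; the offset ordering is arbitrary):

```python
import numpy as np

def shell_offsets(d_min, d_max):
    """All coordinate offsets (xi, eta) with d_min < sqrt(xi^2 + eta^2) <= d_max,
    i.e. the central-symmetric neighborhood n_nu for one radius shell."""
    r = int(np.ceil(d_max))
    offsets = []
    for xi in range(-r, r + 1):
        for eta in range(-r, r + 1):
            d = np.hypot(xi, eta)  # Euclidean interpixel distance
            if d_min < d <= d_max:
                offsets.append((xi, eta))
    return offsets
```

For the shells of Fig. 3, `shell_offsets(0.5, 1.5)` yields the 8-connected neighborhood (distances 1 and \(\sqrt{2}\)), and each subsequent shell adds the ring of offsets between the next pair of radii. Because the set depends only on distance, the resulting statistics are invariant to image rotation and translation.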

Figure 3
figure 3

Illustration of rotation and translation invariant central-symmetric neighborhood sets for the three different radii \((\nu - 0.5, \nu + 0.5]\); \(\nu = 1, \ldots , 3\).

For a given image/label map pair \((\mathbf {g}_t, \mathbf {m}_t)\) from our training set S, \(t \in \{1, \ldots , T\}\), let \(\mathbf {R}_t = \{(x,y) \mid m_t(x,y) = \mathsf {ob}\}\) denote the subset of the pixel lattice supporting the infected lung region. Denote the set of \(\mathbf {n}_\nu\)-neighboring pixels restricted to \(\mathbf {R}_t\) by

$$\begin{aligned} \mathbf {C}_{\nu ,t} = \left\{ (x, y, x^\prime , y^\prime ) \, \big |\, (x,y)\in \mathbf {R}_t \wedge (x^\prime ,y^\prime ) \in \mathbf {R}_t \wedge (x-x^\prime , y-y^\prime ) \in \mathbf {n}_\nu \right\} . \end{aligned}$$

Finally let \(f_{0,t}\) and \(f_{\nu ,t}\), \(\nu \in \{1, \ldots , N\}\) denote empirical probability distributions (i.e., relative frequency histograms) of gray values and gray value co-occurrences in the training infected region from the X-ray image \(\mathbf {g}_t\),

$$\begin{aligned}&f_{0,t}(q) = \vert \mathbf {R}_t\vert ^{-1} \left| \{(x,y)\in \mathbf {R}_t \mid \mathbf {g}_t(x,y) = q\} \right| ; \end{aligned}$$
(1)
$$\begin{aligned}&f_{\nu ,t}(q,q^\prime ) = \vert \mathbf {C}_{\nu ,t}\vert ^{-1} \left| \{(x,y,x^\prime ,y^\prime ) \in \mathbf {C}_{\nu ,t} \mid \mathbf {g}_t(x,y) = q \wedge \mathbf {g}_t(x^\prime ,y^\prime ) = q^\prime \} \right| . \end{aligned}$$
(2)
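Under these definitions, the empirical distributions in Eqs. 1 and 2 reduce to normalized counts over the masked region. A sketch assuming an 8-bit image and a boolean mask of the infected region (helper and argument names are ours):

```python
import numpy as np

def empirical_distributions(img, mask, offsets, n_levels=256):
    """Empirical gray-level distribution f0 (Eq. 1) and co-occurrence
    distribution f_nu (Eq. 2), restricted to pixels where mask is True."""
    h, w = img.shape
    ys, xs = np.nonzero(mask)
    # Eq. 1: normalized gray-level histogram over the region R_t
    f0 = np.bincount(img[ys, xs].ravel(), minlength=n_levels).astype(np.float64)
    f0 /= f0.sum()
    # Eq. 2: normalized co-occurrence counts over the clique family C_{nu,t}
    f_nu = np.zeros((n_levels, n_levels), dtype=np.float64)
    for dx, dy in offsets:
        x2, y2 = xs + dx, ys + dy
        valid = (x2 >= 0) & (x2 < w) & (y2 >= 0) & (y2 < h)
        # both endpoints of the pair must lie inside the region
        valid[valid] = mask[y2[valid], x2[valid]]
        q = img[ys[valid], xs[valid]]
        qp = img[y2[valid], x2[valid]]
        np.add.at(f_nu, (q, qp), 1.0)  # unbuffered accumulation of pair counts
    f_nu /= max(f_nu.sum(), 1.0)
    return f0, f_nu
```

One such pair `(f0, f_nu)` is computed per image and per radius shell; averaging them over the training set gives the quantities used in the potential estimates below.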

The joint probability of object pixels in image \(\mathbf {g}_t\) according to the MGRF model is given by the Gibbs distribution

$$\begin{aligned} \begin{array}{lll} P_t &{} = &{} Z_{t}^{-1} \exp \left( \sum \limits _{(x,y)\in \mathbf {R}_t} \left( V_0\left( \mathbf {g}_t(x,y)\right) + \sum \limits _{\nu =1}^{N} \sum \limits _{(\xi ,\eta )\in \mathbf {n}_{\nu }} V_{\nu }\left( \mathbf {g}_t(x,y),\mathbf {g}_t(x+\xi ,y+\eta )\right) \right) \right) \\ &{} = &{} Z_{t}^{-1} \exp \left( \vert \mathbf {R}_t\vert \left( \mathbf {V}_{0,t}^\mathsf {T}\mathbf {F}_{0,t} + \sum \limits _{\nu =1}^{N}\rho _{\nu ,t}\mathbf {V}_{\nu ,t}^\mathsf {T} \mathbf {F}_{\nu ,t} \right) \right) , \end{array} \end{aligned}$$
(3)

where \(\rho _{\nu ,t}= \vert \mathbf {C}_{\nu ,t}\vert / \vert \mathbf {R}_t\vert\) is an average cardinality of \(\mathbf {n}_{\nu }\) over the sublattice \(\mathbf {R}_t\).

Assuming that lungs with the same pathology exhibit similar morphology in X-ray images, we may approximate the previous expressions by their averages over the training set S: \(|\mathbf {R}_t| \approx R_\mathsf {ob}\) and \(|\mathbf {C}_{\nu ,t}| \approx C_{\nu ,\mathsf {ob}}\). Here \(R_\mathsf {ob} = {1\over T}\sum _{t=1}^{T}|\mathbf {R}_t|\) and \(C_{\nu ,\mathsf {ob}} = {1\over T}\sum _{t=1}^{T}|\mathbf {C}_{\nu ,t}|\). If we assume further that the observations in S are statistically independent (e.g., each X-ray is taken from a different patient), the expression for the joint probability of object pixels may be likewise simplified38:

$$\begin{aligned} P_\mathbf {S}=\frac{1}{Z} \exp \left( TR_\mathsf {ob} \left( \mathbf {V}_{0}^\mathsf {T} \mathbf {F}_{0}+ \sum \limits _{\nu =1}^{N}\rho _{\nu }\mathbf {V}_{\nu }^\mathsf {T} \mathbf {F}_{\nu } \right) \right) . \end{aligned}$$

Here, \(\rho _\nu = C_{\nu ,\mathsf {ob}} / R_\mathsf {ob}\), and the probability vectors \(\mathbf {F}_{0}\) and \(\mathbf {F}_{\nu }\) are the averages of the relative frequency histograms and normalized gray level co-occurrence matrices, respectively, over all objects in the training set. The problem of zero empirical probabilities, which can arise when a relatively small volume of training data is available to identify the MGRF model, is dealt with using pseudocounts. Eqs. 1 and 2 are then modified as follows

$$\begin{aligned}&f_{0,t}(q) = \frac{\left| \{(x,y)\in \mathbf {R}_t \mid \mathbf {g}_t(x,y) = q\} \right| + \varepsilon }{\vert \mathbf {R}_t\vert + Q\varepsilon } \end{aligned}$$
(4)
$$\begin{aligned}&f_{\nu ,t}(q,q^\prime ) = \frac{\left| \{(x,y,x^\prime ,y^\prime ) \in \mathbf {C}_{\nu ,t} \mid \mathbf {g}_t(x,y) = q \wedge \mathbf {g}_t(x^\prime ,y^\prime ) = q^\prime \} \right| + \varepsilon }{\vert \mathbf {C}_{\nu ,t}\vert + Q^2\varepsilon }. \end{aligned}$$
(5)

The Bayesian quadratic loss estimate suggests using the offset \(\varepsilon = 1\), while a more conservative approach38 suggests using \(\varepsilon = 1/Q\) in Eq. 4 and \(\varepsilon = 1/Q^2\) in Eq. 5.
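The smoothing in Eqs. 4 and 5 amounts to adding the same offset to every bin before normalizing: the gray-level histogram denominator gains \(Q\varepsilon\) and the co-occurrence denominator \(Q^2\varepsilon\), which falls out automatically when the offset is added per bin (the helper name is ours):

```python
import numpy as np

def smooth_histogram(counts, epsilon):
    """Pseudocount smoothing (Eqs. 4-5): add epsilon to every bin before
    normalizing, so no empirical probability is exactly zero. Works for both
    1-D histograms (Q bins) and 2-D co-occurrence matrices (Q^2 bins)."""
    counts = np.asarray(counts, dtype=np.float64)
    return (counts + epsilon) / (counts.sum() + counts.size * epsilon)
```

With `epsilon = 1` this is the Bayesian quadratic loss estimate; passing `epsilon = 1 / counts.size` gives the more conservative variant38.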

Using the same analytical approach as in Ref.38, the Gibbs potential functions are approximated using the centered, training-set average, normalized histograms and co-occurrence matrices:

$$\begin{aligned} \begin{array}{llll} V_{0}(q) &{} = &{} \left( f_{0}(q)-\frac{1}{Q} \right) ; \\ V_{\nu }(q,q^{\prime }) &{} = &{} \left( f_{\nu }(q,q^{\prime })-\frac{1}{Q^{2}} \right) . \end{array} \end{aligned}$$
(6)

Using the above estimated potentials, we can calculate the Gibbs energy of the infected lung region \(\mathbf {b}\) in an X-ray image \(\mathbf {g}\) as follows:

$$\begin{aligned} E(\mathbf {g},\mathbf {b}) = \mathbf {V}_{0}^\mathsf {T} \mathbf {F}_{0}(\mathbf {g}, \mathbf {b}) + \sum _{\nu \in \mathbf {N}^\prime }\mathbf {V}_{\nu }^\mathsf {T} \mathbf {F}_{\nu }(\mathbf {g}, \mathbf {b}). \end{aligned}$$
(7)

Here, \(\mathbf {N}^\prime\) is a selected top-rank index subset of the neighborhoods, and the empirical probability distributions \(\mathbf {F}_0\) and \(\mathbf {F}_{\nu }\) are calculated over the object pixels \(\mathbf {b}\) of \(\mathbf {g}\).
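Given training-set average distributions, Eqs. 6 and 7 reduce to centering against the uniform distribution followed by dot products. A minimal numerical sketch (function names are illustrative):

```python
import numpy as np

def gibbs_potentials(f0_avg, f_nu_avg):
    """Analytical potential estimates (Eq. 6): training-set average histograms
    centered on the uniform distribution (1/Q and 1/Q^2 per bin)."""
    Q = f0_avg.size
    V0 = f0_avg - 1.0 / Q
    V_nu = [f - 1.0 / Q ** 2 for f in f_nu_avg]
    return V0, V_nu

def gibbs_energy(V0, V_nu, f0, f_nu):
    """Gibbs energy of a candidate region (Eq. 7): dot products of the learned
    potentials with the region's empirical distributions, summed over radii."""
    e = float(np.dot(V0, f0))
    for Vn, fn in zip(V_nu, f_nu):
        e += float(np.sum(Vn * fn))
    return e
```

A test image whose empirical distributions match the "severe" training averages thus yields a high energy, while a dissimilar image yields an energy near zero, which is the basis of the marker.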

To summarize, the whole training approach is as follows:

  1. Read all infected regions from the training data having class “severe” lung infection.

  2. Calculate the co-occurrences of the image signal at the various radii (\(\nu _1\), \(\nu _2\), and \(\nu _3\)).

  3. Normalize the gray level and co-occurrence frequencies (\(f_{0}(q)\) and \(f_{\nu }(q,q^\prime )\)).

  4. Estimate the Gibbs potentials (\(V_{0}(q)\) and \(V_{\nu }(q,q^\prime )\)) using Eq. 6.

  5. Use Eq. 7 to calculate the Gibbs energy \(E(\mathbf {g},\mathbf {b})\) for the training subjects.

NN-based fusion and diagnostic system

A new NN system is developed that fuses the diagnostic results from the three Gibbs energies estimated at the three different radii. The proposed NN-based model consists of four blocks, as illustrated in Fig. 4. Three of them are fed with the three cumulative distribution functions (CDFs) of the estimated Gibbs energies; the outputs of these three blocks are then fused in the last block to produce the final decision for the input X-ray image. We use a backpropagation approach to train the proposed NN-based diagnostic system as follows:

  1. Randomly initialize the weights of the proposed NN.

  2. Compute the output of each neuron in the hidden and output layers.

  3. Update the weights of the proposed NN using the batch-mode backpropagation approach.

  4. Repeat steps 2 and 3 until there are no significant changes in the NN weights.
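The training loop above can be sketched for a single hidden layer with sigmoid units and a cross-entropy loss. This is a minimal stand-in for steps 1-4, not the authors' exact four-block architecture or hyper-parameters:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def train_mlp(X, y, n_hidden=8, lr=0.5, n_epochs=2000, seed=0):
    """Batch-mode backpropagation for one hidden sigmoid layer.
    Returns a prediction function (True = positive class)."""
    rng = np.random.default_rng(seed)
    n_in = X.shape[1]
    # step 1: random weight initialization
    W1 = rng.normal(0, 0.5, (n_in, n_hidden)); b1 = np.zeros(n_hidden)
    W2 = rng.normal(0, 0.5, (n_hidden, 1));    b2 = np.zeros(1)
    y = y.reshape(-1, 1).astype(np.float64)
    for _ in range(n_epochs):
        # step 2: forward pass through hidden and output layers
        h = sigmoid(X @ W1 + b1)
        out = sigmoid(h @ W2 + b2)
        # step 3: batch gradients (sigmoid output with cross-entropy loss)
        d_out = (out - y) / len(X)
        d_h = (d_out @ W2.T) * h * (1 - h)
        W2 -= lr * (h.T @ d_out); b2 -= lr * d_out.sum(axis=0)
        W1 -= lr * (X.T @ d_h);   b1 -= lr * d_h.sum(axis=0)
        # step 4 (convergence check) is replaced here by a fixed epoch budget
    return lambda Xq: (sigmoid(sigmoid(Xq @ W1 + b1) @ W2 + b2) > 0.5).ravel()
```

In the full system, one such network is trained per radius on the CDF features and a fourth network fuses their outputs.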

Figure 4
figure 4

The proposed NN-based fusion diagnostic system.

In order to tune the hyper-parameters of our proposed NN system, a hyper-parameter estimation approach is used. The parameters to be estimated are the number of bins used to calculate the CDF, the number of hidden layers in the NN model, the number of neurons per hidden layer, and the activation function used to calculate the output of each neuron. We ran several experiments using random values for these parameters to estimate their optimal values on the training data. All results demonstrated in the “Experimental results” section were obtained using the following settings: to cover the full range of energy values, the number of CDF bins is 175; NN 1, NN 2, and the fusion NN each have one hidden layer, while NN 3 has none (searching from 0 to 10); the number of neurons per hidden layer is 50, 20, and 2 for NN 1, NN 2, and the fusion NN, respectively (searching from 1 to 200); and finally, the sigmoid activation function was selected after also considering the tangent and softmax activation functions.

Experimental results

Figure 5
figure 5

Illustration of the estimated Gibbs energy for two cases: a high severity case (upper row) and a low severity case (lower row). (a) Equalized X-ray image, (b) estimated energy at \(\nu _1\), (c) estimated energy at \(\nu _2\), and (d) estimated energy at \(\nu _3\).

Figure 6
figure 6

Estimated CDFs at three different radii for two different subjects at two different severity levels.

Figure 7
figure 7

Estimated average CDFs for low and high lung infection severity at three different radii.

Table 1 Diagnostic accuracy of the proposed CAD system. Feature selection had a significant impact on classifier performance with Friedman test \(\chi ^2 = 32.7\), 3 d.f., \(p = 3\times 10^{-7}\). V: Wilcoxon signed rank statistic of performance compared to complete system; p: Associated Bonferroni-corrected p-value.

To highlight the innovation in our approach, we demonstrate the Gibbs energy calculated at the three radii as a color map fused over the X-ray images. It is clear from Fig. 5 that the Gibbs energy in cases of high-severity COVID-19 pneumonia is high compared with the Gibbs energy for low-severity COVID-19 pneumonia. Since the collected X-ray images have different resolutions, we use the CDF as a new scale-invariant representation of the estimated Gibbs energy, which makes it suitable for all data collection protocols, as shown in Fig. 6. To highlight the advantage of the proposed Gibbs energy as a new discriminatory image marker, we calculate the average CDFs, with the standard deviation at each point, for both classes (high severity vs. low severity). As is clear from Fig. 7, the CDFs are rather distinctive, which allows for straightforward classification by the proposed NN-based classifier. The output of the CAD system was an assessment of the severity of pneumonia in COVID-19 patients with two levels: a low severity of infection (“low”) or a high severity of infection (“high”), as shown in Fig. 4. This was compared to the ground truth of the 200 clinical cases collected, 100 of which were from patients who died of COVID-19 and 100 of which recovered. Accurate system outputs include an assessment of “low” in a case that recovered and an output of “high” in a case that died. To confirm the accuracy of the proposed NN classification and fusion system, leave-one-subject-out (LOSO), tenfold, fourfold, and twofold cross-validation approaches are performed on our datasets, as demonstrated in Table 1. We use the following objective metrics to measure the accuracy of the proposed NN-based fusion system: (i) sensitivity, (ii) specificity, (iii) accuracy, and (iv) Dice similarity coefficient (DSC).
As demonstrated in Table 1, the proposed system has achieved \(100\%\) accuracy with the LOSO validation test and \(98.00\% \pm 2.00\%\) for a twofold validation test (real-life scenario), all of which confirm the high accuracy of the proposed CAD system.
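The scale-invariant CDF representation can be computed by thresholding the energy values at a fixed number of levels; because each feature is a fraction of pixels rather than a raw count, it does not depend on image resolution. A sketch assuming the 175-bin setting reported above (the uniform bin placement here is a simplifying assumption):

```python
import numpy as np

def energy_cdf_features(energies, n_bins=175, lo=None, hi=None):
    """Scale-invariant CDF feature vector for a set of pixelwise Gibbs
    energies: the fraction of values at or below each of n_bins thresholds."""
    e = np.asarray(energies, dtype=np.float64).ravel()
    lo = e.min() if lo is None else lo
    hi = e.max() if hi is None else hi
    thresholds = np.linspace(lo, hi, n_bins)  # uniformly spaced energy levels
    return np.array([(e <= t).mean() for t in thresholds])
```

Fixing `lo` and `hi` across the whole dataset keeps the feature vectors of different subjects comparable; the three vectors (one per radius) feed NN 1 through NN 3.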

To highlight the contribution of the Gibbs energy at each radius, we construct an NN-based classifier using the estimated Gibbs energy at each radius separately. As is clear from Table 1, the NN classifier based on the estimated Gibbs energy at \(\nu _3\) demonstrates the highest accuracy compared with the classification accuracies based on the estimated Gibbs energy at \(\nu _2\) and \(\nu _1\). Also, it is worth mentioning that fusing the three estimated Gibbs energies using the NN-based classification system achieves higher accuracy than classification based on any single estimated Gibbs energy. Finally, to highlight the accuracy of the proposed NN-based fusion system, we compare its accuracy with support vector machine (SVM), random forest, naive Bayes, K-nearest neighbors (KNN), and decision tree classifiers. Table 2 clearly shows that the NN-based classification and fusion system has achieved the highest accuracy compared with the other approaches.

Table 2 Diagnostic accuracy using different classification systems. Feature selection had a significant impact on classifier performance with Friedman test \(\chi ^2 = 35.3\), 5 d.f., \(p = 1.3\times 10^{-6}\). V: Wilcoxon signed rank statistic of performance compared to complete system; p: Associated Bonferroni-corrected p-value.

Statistical significance of the choice of feature set or classifier architecture on diagnostic performance was assessed using Friedman rank sum tests39,40,41,42. Post hoc comparison of our proposed NN-based classifier with the alternatives was done using Wilcoxon signed rank tests with Bonferroni correction to the estimated \(p\)-values. Feature selection had a significant impact on classifier performance (Table 1), with Friedman test \(\chi ^2 = 32.7\), 3 d.f., \(p = 3\times 10^{-7}\). The fused feature set was shown to be preferable to \(\nu _1\) (Wilcoxon \(V = 118\), Bonferroni corrected \(p = 0.0033\)), \(\nu _2\) (\(V = 136\), \(p = 0.0014\)), and \(\nu _3\) (\(V = 136\), \(p = 0.0014\)). Different classifier architectures also produced significantly different results (Table 2), with \(\chi ^2 = 35.3\), 5 d.f., \(p = 1.3\times 10^{-6}\). The proposed NN-based classifier outperformed SVM (\(V = 136\), \(p = 0.0024\)), random forest (\(V = 118\), \(p = 0.0054\)), naive Bayes classifier (\(V = 136\), \(p = 0.0024\)), KNN (\(V = 127.5\), \(p = 0.0114\)), and the decision tree (\(V = 127.5\), \(p = 0.0114\)).

Discussion and conclusion

ARDS is the most common and severe pulmonary complication in COVID-19 patients. It is an acute hypoxemic respiratory failure that requires oxygen and ventilation therapy, including intubation and invasive ventilation. Clinically, patients may have dyspnea, tachypnea (respiratory rate \(\ge\) 30 breaths per minute), decreased peripheral oxygen saturation \(\text {SpO}_2 \le 93\%\), poor oxygenation with a ratio of the partial pressure of arterial oxygen to fraction of inspired oxygen \(\text {PaO}_2 / \text {FiO}_2 < 300\) mmHg, or lung infiltrates greater than 50% within 48 h9. ARDS occurred in 20% of hospitalized patients and 61% of ICU patients in Zhongnan Hospital in Wuhan3,4. ARDS occurs when capillaries in the lung leak fluid into the alveoli, thereby impairing gas exchange in the lung and reducing oxygen uptake into the systemic arterial circulation. The consequent decrease in blood oxygen levels can be directly life-threatening, leading to multi-organ failure. Respiratory support in COVID-19 may use invasive or non-invasive methods to force oxygen into the airways under pressure. Invasive ventilation uses an endotracheal tube to feed oxygen directly into the lungs. Non-invasive methods employ such devices as continuous positive airway pressure (CPAP) and oxygen hoods; there is no use of an internal tube, and they are used in the management of less severe cases.

Despite being vital for supporting respiration in patients with ARDS, ventilators are in short supply in hospitals. According to Imperial College London, approximately 30% of patients diagnosed with COVID-19 are recommended for hospital admission, with a significant fraction of those patients also requiring respiratory support. As the pandemic spread across the world, many countries stopped exporting ventilators43,44. The paucity of ventilators is even more acute in underdeveloped and developing countries in South America, Asia, and Africa.

High-pressure ventilation may cause lung injury, also called barotrauma or ventilator-induced lung injury (VILI). Even non-invasive ventilation carries some risk, as stress and strain associated with high tidal volumes may cause patient self-induced lung injury (P-SILI). The additional inflammation due to VILI or P-SILI may lead to aggravation of pulmonary edema and worsening of the very respiratory distress that ventilation was intended to treat. There is also the risk of heart failure, hypervolemia, and multi organ dysfunction, alone or in combination45. Unfortunately, COVID-19 patients who are admitted to the ICU and require mechanical ventilation show strikingly high rates of mortality, ranging from 50 to 97% early in the pandemic46,47,48,49,50,51. A more recent study from Emory University showed lower but still dramatic mortality rates of 36% in ICU patients requiring mechanical ventilation and 30% in all COVID-19 patients admitted to the ICU52.

Accurate and rapid diagnosis of COVID-19 pneumonia severity is very challenging for radiologists as the disease has rapidly spread across the globe. Based on the results demonstrated in this study, AI systems, especially those based on deep learning, are promising tools to assist initial screening by radiologists. Such systems could decrease workload, improve diagnostic accuracy, and enable appropriate treatment and ventilation management of COVID-19 patients. In a pandemic such as the one we now face, medical resources are seriously strained and must be used as efficiently as possible. Rapid diagnosis and accurate prognosis are essential. The AI-based method shows great potential to quantify disease severity and could be used to inform treatment decision-making in patients with COVID-19. AI in concert with thoracic imaging and other clinical information (epidemiology, PCR, clinical symptoms, and laboratory indicators) can effectively improve clinical outcomes53. AI can increase the utility of chest X-ray imaging beyond first-line diagnostic imaging and into the areas of risk stratification, monitoring of clinical course, and selection between management approaches, such as invasive vs. non-invasive ventilation, for COVID-19 patients. Multimodal data, be they clinical, epidemiological, or potentially molecular, can be fused with imaging data in an AI framework to build systems to detect and treat COVID-19 and potentially to contain its spread54. Moreover, we plan to work on X-ray scans/data collected at different time points to evaluate the progression of the infection/pneumonia over the treatment course.

In conclusion, the results herein demonstrate the feasibility of using AI with chest X-ray imaging data to determine the severity of lung involvement in cases of COVID-19. Severity of pneumonia on chest X-ray correlated highly with mortality in this study, and thus this CAD system can potentially also be used to predict mortality in COVID-19 patients.