Evaluation of retrieval accuracy and visual similarity in content-based image retrieval of chest CT for obstructive lung disease

Choe, Jooae; Choi, Hye Young; Lee, Sang Min; Oh, Sang Young; Hwang, Hye Jeon; Kim, Namkug; Yun, Jihye; Lee, Jae Seung; Oh, Yeon-Mok; Yu, Donghoon; Kim, Byeongsoo; Seo, Joon Beom

doi:10.1038/s41598-024-54954-5

Article
Open access
Published: 26 February 2024

Scientific Reports volume 14, Article number: 4587 (2024)

Abstract

The aim of our study was to assess the performance of content-based image retrieval (CBIR) for similar chest computed tomography (CT) in obstructive lung disease. This retrospective study included patients with obstructive lung disease who underwent volumetric chest CT scans. The CBIR database included 600 chest CT scans from 541 patients. To assess the system performance, follow-up chest CT scans of 50 patients were evaluated as query cases; in these patients, the stability of the CT findings between baseline and follow-up chest CT was confirmed by thoracic radiologists. The CBIR system retrieved the top five similar CT scans for each query case from the database by quantifying and comparing emphysema extent and size, airway wall thickness, and peripheral pulmonary vasculature, and ranked them in descending order of similarity. The rates of retrieving the paired CT scan of each query CT within the top 1–5 retrievals were assessed. Two expert chest radiologists evaluated the visual similarities between the query and retrieved CT scans using a five-point scale grading system. The rates of retrieving the paired CT scans of the query CTs were 60.0% (30/50) and 68.0% (34/50) for top-three and top-five retrievals, respectively. Radiologists rated 64.8% (95% confidence interval 58.8–70.4) of the retrieved CT scans with a visual similarity score of four or five, and at least one case scored five points in 74% (74/100) of all query cases. The proposed CBIR system for obstructive lung disease integrating quantitative CT measures demonstrated potential for retrieving chest CT scans with similar imaging phenotypes. Further refinement and validation in this field would be valuable.

Introduction

Because patients with chronic obstructive pulmonary disease (COPD) show substantial heterogeneity in physiological and imaging characteristics, therapy response, and disease course, including disease progression and mortality, there have been continuous efforts over the past decade to characterize and define subtypes of COPD with distinct patterns of emphysema and airway disease¹. Although not all patients benefit, there are subgroups in which such efforts may afford a marked clinical benefit^2,3,4. Chest computed tomography (CT) imaging has revolutionized the diagnostic approach for COPD by defining phenotypes. Recognizing these different phenotypes in COPD and reclassifying COPD patient severity based on imaging measurements of the pathologies responsible for symptoms and progression may serve as an initial step towards personalized or, at least, optimized therapies⁵. Chest CT allows emphysema classification and quantification, as well as quantification of other abnormalities, including bronchial wall thickening, air trapping reflecting small airway disease, bronchiectasis, and pulmonary vessels^6,7,8. Quantitative CT measures in patients with COPD have demonstrated their relationship to lung function parameters, clinical symptoms, exacerbation rates, and mortality^9,10,11,12,13,14,15. The commonly recognized clinical phenotypes include chronic bronchitis, emphysema, COPD-asthma, and frequent exacerbators. Patients in each phenotype group share clinical characteristics and, importantly, have similar responses to existing treatments¹⁶. For instance, patients with chronic bronchitis are good responders and the only candidates for phosphodiesterase-4 inhibitors, while those with the overlapping COPD-asthma phenotype show better responses to inhaled corticosteroids¹⁷. Therefore, in the initial evaluation of patients with COPD, the objective identification of those with similar clinical characteristics and determination of their clinical outcomes might inform clinical decision-making.

Content-based image retrieval (CBIR) is an image search engine with tools for classifying, indexing, and retrieving images with similar appearances from a database. CBIR matches the visual contents of the query image, an input, with those in the archive; the closeness in visual similarity in terms of image feature vectors provides a basis for identifying images with similar appearances^18,19. To apply CBIR in obstructive lung diseases, the automated quantitative measures for emphysema, airway, and vessels extracted from chest CT can be integrated into the classification and measurement of the similarity index of the CBIR system to find patients who share a similar disease phenotype, which might aid in diagnostic evaluation and decision-making in patients with COPD. In this study, we developed a fully automated CBIR system for obstructive lung diseases by incorporating quantitative measures. We assessed the performance of this system in retrieving chest CT images similar to the query CT images in patients with obstructive lung diseases and visually assessed the similarities between the retrieved and query CT scans.

Materials and methods

Study cohorts

All subjects were selected from the Korean Obstructive Lung Disease (KOLD) cohort studies, which were prospective longitudinal studies of patients with obstructive lung disease from the pulmonary clinics of 17 centers in South Korea. The KOLD cohort study included patients from May 2005 to October 2013. The details of the KOLD cohort study have been published previously²⁰ and are also described in the Supplementary material. Briefly, the study cohort enrolled patients aged > 18 years with chronic respiratory symptoms and airflow limitation, bronchial hyper-responsiveness, or both.

After enrollment in the KOLD cohort, 541 patients who underwent volumetric chest CT (3D CT scans with sub-millimeter near isotropic resolution) at full inspiration and pulmonary function tests (PFT) were included in the present study and formed the database for the CBIR system. The database contained a total of 600 volumetric CT scans. Among these patients, 50 had both initial and follow-up chest CT scans without significant interval changes in findings between the two scans on visual assessment; their follow-up CT scans were classified as the query dataset (Fig. 1). The stability of the CT findings between initial and follow-up CT scans was reviewed and confirmed by a thoracic radiologist (S.M.L. with 16 years of experience in thoracic imaging). Of the remaining 500 scans, 18 came from nine patients (two scans per patient, taken at different time points; these were not included in the query dataset) and 482 came from 482 patients (one scan per patient). The present study was approved by the Institutional Review Board (No. 2017-1067) of Asan Medical Center and by the Institutional Review Boards of the other 16 participating hospitals. Written informed consent was obtained from all patients. All methods were performed in accordance with the relevant guidelines and regulations.

CT acquisition and quantitative analysis

We performed volumetric chest CT scans at full inspiration for all patients on 64- or 16-multidetector CT scanners (Somatom Sensation 16 or Definition AS, Siemens Healthineers, Erlangen, Germany; Philips Brilliance 40 or 64, Philips Medical System, Best, the Netherlands). The scan parameters were as follows: tube voltage, 140 kV; tube current, 100–135 effective mA without dose modulation; slice thickness, 0.6–0.75 mm; and reconstruction intervals, 0.5 mm. All images were analyzed using fully automated segmentation software (Aview, Coreline Soft, Seoul, Korea). Fully automated quantification was performed for the following four feature categories: emphysema volume, emphysema size, airway wall thickness, and numbers of peripheral pulmonary vessels. The quantification of emphysema on CT was based on lung densitometry by determining the relative area of the lungs below -950 Hounsfield units (HU) on inspiration CT (i.e., %LAA-950 or emphysema index [EI])²¹. The size variation of emphysema was assessed using the D-slope value by applying a three-dimensional size-based emphysema clustering technique²². For the calculation of D-slope, the diameter of the emphysema cluster was plotted against the cumulative number of lesions (number of each lesion diameter) on a log–log scale. The slope (D-slope) of this linear relationship was calculated, with a steeper slope (increase in absolute D value) indicating a smaller emphysema size.
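The two emphysema measures described above can be illustrated with a minimal Python sketch. It is not the Aview implementation: the function names are hypothetical, the input is assumed to be a flat list of HU values for already segmented lung voxels, and the cumulative count at diameter d is assumed to mean the number of clusters with diameter of at least d.

```python
import math


def emphysema_index(hu_values, threshold=-950):
    """Percentage of lung voxels with attenuation below `threshold` HU."""
    values = list(hu_values)
    if not values:
        raise ValueError("no lung voxels supplied")
    below = sum(1 for hu in values if hu < threshold)
    return 100.0 * below / len(values)


def least_squares_slope(xs, ys):
    """Slope of the ordinary least-squares line through the points (xs, ys)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sxx


def d_slope(cluster_diameters):
    """Log-log slope of cumulative cluster count versus cluster diameter.

    Assumption: the cumulative count at diameter d is the number of
    clusters whose diameter is >= d.
    """
    clusters = list(cluster_diameters)
    diameters = sorted(set(clusters))
    if len(diameters) < 2:
        raise ValueError("need at least two distinct diameters")
    log_d = [math.log10(d) for d in diameters]
    log_n = [math.log10(sum(1 for c in clusters if c >= d)) for d in diameters]
    return least_squares_slope(log_d, log_n)
```

The sketch returns the signed slope, which is negative when small clusters outnumber large ones; the D-slope values reported in this article appear to be absolute values.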

A standardized measure for airway wall thickness was analyzed for each patient by obtaining the Pi10, the square root of the wall area of the theoretical airway with an internal perimeter of 10 mm²³. We obtained the Pi10 value by plotting the obtainable values of the internal perimeter and the square root of the wall area of the 3rd (lobar level)–8th branches of the bronchi in the whole lungs. The software automatically detected the airway lumens, magnified the images tenfold, and detected the inner and outer boundaries of airway walls using the integral-based half-band method. The details of the airway measurement algorithms for the integral-based half-band method have been described previously²⁴. The pulmonary vascular morphology was assessed by measuring the total number of aggregate blood vessels < 5 mm² in cross-section (VN_<5 mm) in the lung surface area 12 mm distant from the pleural surface²⁵. EI and VN_<5 mm were analyzed for the whole lung, the right and left lungs, and each of the five lobes. Pi10 and D-slope were analyzed for the whole lung and the right and left lungs.
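As a rough illustration of the Pi10 definition above, the following Python sketch fits a least-squares line to the square root of wall area against internal perimeter and evaluates it at 10 mm. The function name and inputs are hypothetical, and the airway segmentation and boundary detection steps are assumed to have been done already.

```python
def pi10(internal_perimeters_mm, sqrt_wall_areas_mm):
    """Predicted sqrt(wall area) for an airway with a 10 mm internal perimeter.

    Fits an ordinary least-squares line through the measured airways
    and evaluates it at an internal perimeter of 10 mm.
    """
    xs = list(internal_perimeters_mm)
    ys = list(sqrt_wall_areas_mm)
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need at least two paired airway measurements")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept + slope * 10.0
```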

Development of the CBIR for obstructive lung diseases

Feature extraction was performed in all CT scans in the database through the aforementioned process. The results were indexed in the CBIR system. We normalized each quantitative feature by dividing it by its 95th percentile. The similarities between chest CT scans were compared by measuring the cosine distances among the four feature vectors; the most similar images had the smallest distance between their feature vectors and vice versa. Finally, the CBIR system retrieved the top five similar images from the database for the given query CT scan, in descending order of calculated similarity to the query image, and displayed the retrievals in the dedicated in-house user interface (Figs. 2 and 3). In addition to our primary features, we also explored the potential of incorporating the air trapping index (ATI) to assess small airway disease by comparing densities using co-registration. However, the ATI did not improve the retrieval performance of similar cases, and adding this variable would render the CBIR system inapplicable to patients without expiratory CT scans; it was therefore not included in the final CBIR system proposed in this study. The details and results of our experimentation with the ATI are elaborated in the Supplementary material.
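The retrieval step can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the in-house system: each scan is assumed to be represented by one vector of four features, the 95th percentile is assumed to be computed per feature over the database with linear interpolation, and the names `percentile`, `cosine_distance`, and `retrieve` are hypothetical.

```python
import math


def percentile(values, q):
    """q-th percentile (0-100) using linear interpolation between ranks."""
    ordered = sorted(values)
    pos = (len(ordered) - 1) * q / 100.0
    lo = math.floor(pos)
    hi = math.ceil(pos)
    return ordered[lo] + (ordered[hi] - ordered[lo]) * (pos - lo)


def cosine_distance(u, v):
    """1 minus the cosine similarity of two non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return 1.0 - dot / (norm_u * norm_v)


def retrieve(query, database, k=5):
    """Return the k (scan_id, distance) pairs closest to `query`.

    `database` maps a scan id to its feature vector. Features are
    scaled by their database-wide 95th percentile before comparison.
    """
    n_features = len(query)
    scales = [
        percentile([vec[i] for vec in database.values()], 95)
        for i in range(n_features)
    ]

    def normalize(vec):
        return [x / s for x, s in zip(vec, scales)]

    q = normalize(query)
    ranked = sorted(
        (cosine_distance(q, normalize(vec)), scan_id)
        for scan_id, vec in database.items()
    )
    return [(scan_id, dist) for dist, scan_id in ranked[:k]]
```

Because cosine distance ignores vector magnitude, scaling each feature first matters: without it, a feature with a large numeric range such as EI would dominate the direction of every vector.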

Assessment of the retrieval accuracy of the CBIR system

To assess the accuracy of the CBIR in retrieving similar CT images, we defined similar CT images as pairs of chest CT images from the same patient without change in CT findings, showing the stability of the parenchymal abnormalities. The query cases consisted of 50 follow-up CT scans from 50 patients, scanned approximately 3 years after the initial CT scans (within a range of 36–38 months); the stability of the findings was confirmed by an expert thoracic radiologist. The database therefore included the baseline CT scan paired with each query case. For a given query CT scan, the top five similar CT scans were retrieved from the database according to the calculated similarities. To assess the retrieval accuracy of the CBIR, we assessed the rates of retrieving the paired CT scan of each query CT among the top 1–5 retrieved CT scans. To evaluate the effect of each feature on retrieval accuracy, the median value of each feature in the 50 query cases was used as a threshold to divide the cases into the top 50% and bottom 50% of values.
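The retrieval-accuracy measure described above amounts to a top-k hit rate. A minimal Python sketch follows; the function name, the dictionary layout, and the scan identifiers are hypothetical.

```python
def top_k_hit_rates(retrievals, paired_baseline, max_k=5):
    """Fraction of queries whose paired baseline scan is in the top k.

    `retrievals` maps a query id to its ranked list of retrieved scan ids;
    `paired_baseline` maps a query id to the id of its baseline scan.
    Returns a dict {k: hit rate} for k = 1..max_k.
    """
    if not retrievals:
        raise ValueError("no query cases supplied")
    rates = {}
    for k in range(1, max_k + 1):
        hits = sum(
            1
            for query_id, ranked in retrievals.items()
            if paired_baseline[query_id] in ranked[:k]
        )
        rates[k] = hits / len(retrievals)
    return rates
```

For example, a query whose baseline scan is ranked third counts as a miss for k = 1 and k = 2 and as a hit for k = 3 to 5.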

Reader assessments of the visual similarity of the retrieved images

Two experienced thoracic radiologists (H.J.H. and S.Y.O., with 15 and 12 years of experience, respectively) independently assessed the visual similarities between the five retrieved CT scans and the query CT. The query and retrieved CT scans were displayed and compared individually and side-by-side using the dedicated in-house interface (Coreline Soft, Seoul, Korea). The five retrieved CT scans were displayed in random order next to the query CT (not in the order of the calculated similarity). The readers were blinded to the patients’ clinical information including age, smoking history, and PFT results. The similarities were subjectively graded using a five-point scoring system according to the emphysema amount, size, and distribution including craniocaudal distribution and the predominant pattern among centrilobular, panlobular and paraseptal emphysema, as well as the degree of bronchial wall thickening. The similarity scores were defined as follows: score 5 (all features are similar), score 4 (two or three similar features), score 3 (one similar feature), score 2 (no features are similar) and score 1 (difficult to evaluate due to little emphysema and bronchial wall thickening). The radiologist who initially evaluated the CT scans for study inclusion did not participate in the visual assessment of feature similarities.

Statistical analysis

Data are expressed as numeric values with percentages, means with standard deviation, or medians with interquartile range (IQR). The retrieval accuracy was evaluated by grouping the patients according to the median value of each feature and comparing the groups using Fisher's exact tests. For the similarity scores of the two readers, the mean and confidence interval of similarity scores for each reader and for the pooled data of the two readers were calculated using generalized estimating equations with logit and identity link functions. Interreader agreement of the similarity scores of the two readers was evaluated using the weighted κ statistic and the intraclass correlation coefficient (ICC). The statistical analyses were performed using IBM SPSS Statistics for Windows, version 23.0 (IBM Corp.) and Stata software, version 16.0 (Stata). For all tests, P < 0.05 indicated statistical significance.
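For readers unfamiliar with the weighted κ statistic, the following Python sketch computes it for two readers on the five-point scale. The weighting scheme is an assumption: linear disagreement weights are used here, and the article does not state whether linear or quadratic weights were applied. The function name is hypothetical.

```python
def weighted_kappa(ratings_a, ratings_b, categories=(1, 2, 3, 4, 5)):
    """Cohen's weighted kappa with linear disagreement weights (assumed)."""
    n = len(ratings_a)
    if n == 0 or n != len(ratings_b):
        raise ValueError("ratings must be non-empty and of equal length")
    index = {c: i for i, c in enumerate(categories)}
    k = len(categories)

    # Observed joint proportions of (reader A score, reader B score).
    observed = [[0.0] * k for _ in range(k)]
    for a, b in zip(ratings_a, ratings_b):
        observed[index[a]][index[b]] += 1.0 / n

    row = [sum(observed[i]) for i in range(k)]
    col = [sum(observed[i][j] for i in range(k)) for j in range(k)]

    observed_disagreement = 0.0
    expected_disagreement = 0.0
    for i in range(k):
        for j in range(k):
            weight = abs(i - j) / (k - 1)
            observed_disagreement += weight * observed[i][j]
            expected_disagreement += weight * row[i] * col[j]

    if expected_disagreement == 0.0:
        raise ValueError("kappa is undefined when both readers use one category")
    return 1.0 - observed_disagreement / expected_disagreement
```

A value of 1 indicates perfect agreement, 0 indicates agreement no better than chance, and negative values indicate systematic disagreement.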

Results

Patient characteristics

The CBIR database included 541 patients (mean age 66.9 ± 8.5 years; 500 men and 41 women) (Table 1). The mean age of the 50 query cases was 67.9 ± 7.0 years (45 men and 5 women), and the median interval between the initial and follow-up CT examinations was 36 months (range, 34.6–38.0 months).

Table 1 Clinical characteristics of the study patients.

Among quantitative CT parameters, the mean values for each index were 12.2 ± 13.1 for EI, 5.1 ± 1.5 for D-slope, 4.0 ± 0.8 for Pi10, and 0.6 ± 0.1 for peripheral vessel volume in all patients (Table 2). The baseline and follow-up paired CT scans, which belonged to the database and the query cases, respectively, showed no significant differences in any of the five quantitative parameters (all P > 0.05; Table 2). The mean values of each index in the query CTs were 11.8 ± 12.2 for EI, 4.9 ± 1.1 for D-slope, 4.3 ± 0.9 for Pi10, and 0.6 ± 0.1 for peripheral vessel volume.

Table 2 Clinical characteristics and CT parameters of study patients.

Retrieval accuracy of the CBIR system

The rates of retrieving the paired CT scans of the query CTs in the top 1–3 and top 1–5 images were 60.0% (30/50) and 68.0% (34/50), respectively (Table 3 and Fig. 4). The thresholds of each feature were 7.4% for EI, 4.7 for D-slope, 4.3 for Pi10, and 0.62 for VN_<5 mm. Regarding the severity of emphysema, the rate of retrieving the paired CT scan in the top-three retrievals was numerically higher in patients with EI > 7.4% than in those with EI ≤ 7.4%, although the difference was not statistically significant (retrieval rates, 72.0% [18/25] vs. 48.0% [12/25]; P = 0.09; Fig. 4). No significant difference was observed in retrieval accuracies for D-slope, Pi10, and VN_<5 mm (all P > 0.05).

Table 3 Retrieval accuracy of the content-based image retrieval (CBIR) system.

Reader assessments of the visual similarity of the image retrievals

Among the retrieved cases, 64.8% (53.2% for reader 1 and 76.4% for reader 2) showed similarity scores of 4 or 5 as rated by two radiologists (Table 4 and Fig. 5). At least one case scored 5 points in 74% (74/100; 76% [38/50] for reader 1 and 72% [36/50] for reader 2) of all 50 query cases, and at least one case scored four or more points in 98% (49/50) and 100% (50/50) of all 50 query cases rated by the two radiologists, respectively. The mean similarity score of the top 1–5 retrieved CT scans rated by the two radiologists was 3.8 ± 0.05 (3.62 ± 0.06 for reader 1 and 3.99 ± 0.05 for reader 2). The interreader agreement for the similarity scores was moderate (weighted κ = 0.52, 95% confidence interval [CI] = 0.43–0.61; ICC = 0.68, 95% CI = 0.51–0.79).

Table 4 Similarity scores of the top 1–5 retrieved computed tomography (CT) images.

Discussion

This study developed a CBIR system for patients with COPD that incorporated quantitative CT features of the lungs, airway, and pulmonary vessels. This CBIR system showed good performance and retrieved the paired CT scan of the query CT within the top 1–5 retrievals in 34 of 50 queries (68.0%). Among the retrieved cases, 64.8% (324/500) showed visual similarity scores of 4 or 5, and at least one case scored 5 points in 74% (74/100) of all query cases. The mean similarity score of the top 1–5 retrieved CT scans rated by the two radiologists was 3.8 ± 0.05.

Few studies have used CBIR as a diagnostic tool to interpret chest CT scans. Moreover, no dedicated CBIR system has yet been described for the evaluation of obstructive lung diseases^19,26,27. Oosawa et al. and Aisen et al. developed CBIR systems for various respiratory diseases including emphysema and other disease categories such as infectious diseases and pneumothorax^26,27. Although both studies included emphysema, the CBIR systems were not developed for the evaluation of obstructive lung diseases but rather to aid in the differential diagnosis of various respiratory diseases on chest CT. Moreover, the numbers of each disease in the databases and test cases were too small to evaluate the performance of CBIR. In addition, the criteria for assessing the visual similarities between query and retrieved CT scans were subjective and not precisely evaluated. A major challenge of CBIR as a diagnostic tool is identifying relevant and effective features and distance measures that match clinician requirements based on a particular application. Thus, CBIR research has aimed to reduce the semantic gap between image feature representation and human visual understanding to achieve better retrieval accuracy in terms of clinical relevance. In our study, by applying automated quantitative CT measures including emphysema, bronchial wall thickening and pulmonary vessels as features for CBIR, patients with obstructive lung diseases and similar phenotypes can be objectively identified in the database and easily visualized on our CBIR system.

Similar image retrieval can support clinical decision-making by presenting patients with similar imaging characteristics and known clinical outcomes, including treatment response or prognosis, thereby serving as a second opinion in the management of patients with obstructive lung diseases. During the past decade, the Global Initiative for Chronic Obstructive Lung Disease (GOLD) therapeutic strategy acknowledged the limitations of using spirometry alone to guide therapy by assessing the disease severity²⁸. To address the complexity of COPD, identification of clinical phenotypes as clusters of patients with similar clinical characteristics, prognosis and/or therapeutic needs has emerged as an important approach to guide personalised medicine. Although many COPD subtypes have been proposed, there is still no consensus about how many subtypes there are and how they should be objectively and reproducibly classified³. CBIR can be another approach to precision medicine for individualized therapy. Evaluating similar cases offers a slightly different perspective than directly knowing which cluster a case belongs to based on cluster analysis (e.g., k-means clustering). Specifically, it can provide more granular details that are especially pertinent to patients who lie on the boundaries of distinct clusters. If the database is expansive and the features underpinning the similarity between cases are meticulously curated, CBIR can retrieve and showcase individual cases with even closer characteristics within the same cluster group, surpassing the depth provided by standalone cluster analysis. CBIR stands out, especially when compared with other decision-support tools, for its intensive human-AI interaction. Depending on the interface, retrieval accuracy, and the presented linked information, CBIR can foster increased trust in and utility of AI-based decision-support tools, making it a valuable human-centered tool²⁹.
When a clinician is deciding on follow-up strategies or determining which pharmacological treatment to prescribe to a certain patient, the treatment effects and clinical outcomes of similar patients retrieved from the CBIR system can be displayed and easily reviewed to help judge whether the disease is reversible, to assess whether the disease will be refractory to standard therapy or benefit from biologics, and to estimate the risk of recurrent episodes of exacerbation. Incorporating clinical variables such as age, sex, smoking, lung function and comorbidities into the calculation of the similarity index may expand the potential of CBIR and enable more meaningful retrievals of similar cases strongly associated with clinical outcomes. Furthermore, those similarity indices might help to discover new important phenotypes in patients with COPD.

To build the CBIR system for obstructive lung diseases, we selected four quantitative features covering emphysema (extent and size), bronchial wall thickening, and peripheral pulmonary vasculature to estimate the similarity distances between cases. Quantitative CT evaluation has been validated as a tool for the assessment of the presence and severity of emphysema, expiratory airflow obstruction, and airway wall thickening^30,31,32,33. These measures included EI or LAA₋₉₅₀ for emphysema extent and severity and Pi10 for airway wall thickness, which were well validated in multiple studies and showed strong associations with spirometric results and survival^34,35,36. The D-slope, which is a measure representing the distribution of emphysema hole-size, was significantly correlated with clinical parameters such as FEV1, diffusion capacity, exercise capacity, and quality of life³⁷. Regarding pulmonary vessels, pulmonary vascular remodeling in smokers is characterized by distal pruning of the blood vessels, which can be automatically identified, segmented, and quantified, including measures such as total blood vessel volume or numbers and the aggregate vessel volume for vessels < 5 mm²^25,38. The present study used VN_<5 mm (number of vessels with area < 5 mm² in the theoretical lung surface area at a depth of 12 mm from the pleural surface) as a measure of pulmonary vascular alteration, which was significantly correlated with FEV1, the FEV1 to FVC ratio, and the extent of emphysema²⁵. Our CBIR system showed good performance, achieving a 68.0% (34 of 50) rate of retrieving the paired CT images from the same patients. While the selected features used in our CBIR system have been verified in various studies and also showed good performance in our study, further investigations are needed regarding methods to identify features that accurately reflect the clinical course and retrieve clinically meaningful similar cases.

Although our CBIR system successfully retrieved the paired CT scan of a query CT, it is also relevant to confirm whether radiologists interpret the retrieved CT scans as similar images. The mean similarity scores of the top 1–5 retrieved CT scans rated by the two radiologists were 3.62 and 3.99, respectively, which are close to 4, indicating that on average two or three features were similar. For the top-one and top-two retrievals, the visual similarity score was 4 or higher in 82% and 73% of total query cases, respectively. Moreover, for the overall scores of the two readers, the similarity scores were also consistent with the rank of retrievals based on the similarities given by the CBIR system. Visual similarity is subjective, and the assessment of interobserver variability in the visual similarity scores showed moderate agreement between the readers. Therefore, the CBIR system produced reasonable results for retrieving similar cases in patients with obstructive lung diseases that radiologists could also agree on.

Our study has several limitations. First, we did not incorporate all quantitative CT features; rather, we included five well-established features for whole lung volume. As recent CBIR research is shifting toward the use of deep neural networks, incorporating large-scale quantitative feature datasets and clinical features by applying unsupervised learning methods might further improve the CBIR performance¹⁸. Second, as the data were not available, we could not link the patients’ characteristics to clinical outcomes such as treatment response or survival. Third, our dataset is derived from the Korean Obstructive Lung Disease (KOLD) cohort, which ran from May 2005 to October 2013. This period predates the standardized CT protocols proposed by SPIROMICS. During this time, the tube voltage for the standardized protocol of the registry was set to 140 kV, while SPIROMICS recommended 120 kV³⁹. Using 140 kV might not be optimal; however, the single threshold of -950 HU remains a widely accepted measure for emphysema quantification and has been widely employed in prior research^40,41. In addition, previous studies demonstrated that different tube voltages, such as 120 kV and 140 kV, had minimal impact on the CT number, air densities and/or CT numbers less than 0 HU compared with the effect of vendors^42,43. Therefore, analyzing emphysema at 140 kV with -950 HU is unlikely to have had a significant effect. Different CT protocols can introduce variability that could affect the sensitivity of emphysema quantification. However, given that all scans in the KOLD cohort adhered to a consistent 140 kV using standardized protocols, the variability in measurements might be minimal, ensuring a consistent assessment of feature similarity across the patients in the datasets.
Finally, as this was a pilot study, we did not evaluate the clinical impact of CBIR for the diagnosis of obstructive lung diseases, including whether it can aid in the diagnosis of COPD phenotype and improve the clinical consequences by changing the diagnosis. In fact, evaluating the performance of the CBIR is challenging due to a lack of sufficiently validated evaluation metrics. Consequently, clinical validation that assesses its impact on diagnosis and treatment becomes paramount. Therefore, what is needed for the future use of this method is a multicenter collaborative approach to build an easily accessible cloud-based lung image library or quantitative imaging database for COPD with clinical information and outcomes (treatment effect, exacerbation, and death), together with evaluation of the clinical impact of CBIR in the diagnosis and management of obstructive pulmonary disease. Such a collective effort would not only enhance the efficiency and accuracy of CBIR but also make it more applicable in clinical settings. Furthermore, promoting data availability and fostering collaborations would help advance research in this domain.

In conclusion, the proposed CBIR system for obstructive lung diseases integrating quantitative CT measures demonstrated potential for retrieving chest CT scans with similar phenotypic imaging characteristics. Applying CBIR in obstructive lung diseases, with further linking of the clinical outcomes of similar cases, may aid in the assessment of those patients to establish a treatment plan and predict prognosis.

Data availability

The datasets generated or analyzed during the study are available from the corresponding author on reasonable request.

References

Han, M. K. et al. Chronic obstructive pulmonary disease phenotypes: The future of COPD. Am. J. Respir. Crit. Care Med. 182, 598–604 (2010).
Burgel, P.-R. et al. Clinical COPD phenotypes: A novel approach using principal component and cluster analyses. Eur. Respir. J. 36, 531–539 (2010).
Castaldi, P. J. et al. Machine learning characterization of COPD subtypes: Insights from the COPDGene study. Chest 157, 1147–1157 (2020).
Weatherall, M. et al. Distinct clinical phenotypes of airways disease defined by cluster analysis. Eur. Respir. J. 34, 812–818 (2009).
Bhatt, S. P. et al. Imaging advances in chronic obstructive pulmonary disease. Insights from the genetic epidemiology of chronic obstructive pulmonary disease (COPDGene) study. Am. J. Respir. Crit. Care Med. 199, 286–301 (2019).
Lynch, D. A. et al. CT-definable subtypes of chronic obstructive pulmonary disease: A statement of the Fleischner society. Radiology 277, 192–205 (2015).
Hackx, M., Bankier, A. A. & Gevenois, P. A. Chronic obstructive pulmonary disease: CT quantification of airways disease. Radiology 265, 34–48 (2012).
Ash, S. Y. et al. Pruning of the pulmonary vasculature in asthma. The Severe Asthma Research Program (SARP) cohort. Am. J. Respir. Crit. Care Med. 198, 39–50 (2018).
Gawlitza, J. et al. Predicting pulmonary function testing from quantified computed tomography using machine learning algorithms in patients with COPD. Diagnostics (Basel) 9, 33 (2019).
Han, M. K. et al. Chronic obstructive pulmonary disease exacerbations in the COPDGene study: Associated radiologic phenotypes. Radiology 261, 274–282 (2011).
Grydeland, T. B. et al. Quantitative computed tomography measures of emphysema and airway wall thickness are related to respiratory symptoms. Am. J. Respir. Crit. Care Med. 181, 353–359 (2010).
Schroeder, J. D. Relationships between airflow obstruction and quantitative CT measurements of emphysema, air trapping, and airways in subjects with and without chronic obstructive pulmonary disease. AJR W460–W470 (2013). https://doi.org/10.2214/AJR.12.10102
Article PubMed Google Scholar
Nambu, A. et al. Quantitative computed tomography measurements to evaluate airway disease in chronic obstructive pulmonary disease: Relationship to physiological measurements, clinical index and visual assessment of airway disease. Eur. J. Radiol. 85, 2144–2151 (2016).
Haruna, A. et al. CT scan findings of emphysema predict mortality in COPD. Chest 138, 635–640 (2010).
Johannessen, A. et al. Mortality by level of emphysema and airway wall thickness. Am. J. Respir. Crit. Care Med. 187, 602–608 (2013).
Miravitlles, M., Soler-Cataluña, J. J., Calle, M. & Soriano, J. B. Treatment of COPD by clinical phenotypes: Putting old evidence into clinical practice. Eur. Respir. J. 41, 1252–1256 (2013).
Rennard, S. I., Calverley, P. M., Goehring, U. M., Bredenbröker, D. & Martinez, F. J. Reduction of exacerbations by the PDE4 inhibitor roflumilast–the importance of defining different subsets of patients with COPD. Respir. Res. 12, 18 (2011).
Latif, A. et al. Content-based image retrieval and feature extraction: A comprehensive review. Math. Probl. Eng. 2019, 9658350 (2019).
Choe, J. et al. Content-based image retrieval by using deep learning for interstitial lung disease diagnosis with chest CT. Radiology 302, 187–197 (2022).
Park, T. S. et al. Study design and outcomes of Korean obstructive lung disease (KOLD) cohort study. Tuberc. Respir. Dis. (Seoul) 76, 169–174 (2014).
Heussel, C. P. et al. Fully automatic quantitative assessment of emphysema in computed tomography: Comparison with pulmonary function testing and normal values. Eur. Radiol. 19, 2391–2402 (2009).
Oh, S. Y. et al. Size variation and collapse of emphysema holes at inspiration and expiration CT scan: Evaluation with modified length scale method and image co-registration. Int. J. Chron. Obstruct. Pulmon. Dis. 12, 2043–2057 (2017).
Grydeland, T. B. et al. Quantitative computed tomography: Emphysema and airway wall thickness by sex, age and smoking. Eur. Respir. J. 34, 858–865 (2009).
Cho, Y. H. et al. Comparison of a new integral-based half-band method for CT measurement of peripheral airways in COPD with a conventional full-width half-maximum method using both phantom and clinical CT images. J. Comput. Assist. Tomogr. 39, 428–436 (2015).
Cho, Y. H. et al. Quantitative assessment of pulmonary vascular alterations in chronic obstructive lung disease: Associations with pulmonary function test and survival in the KOLD cohort. Eur. J. Radiol. 108, 276–282 (2018).
Aisen, A. M. et al. Automated storage and retrieval of thin-section CT images to assist diagnosis: System description and preliminary assessment. Radiology 228, 265–270 (2003).
Oosawa, A. et al. Development of a CT image case database and content-based image retrieval system for non-cancerous respiratory diseases: Method and preliminary assessment. Respir. Investig. 57, 490–498 (2019).
Sidhaye, V. K., Nishida, K. & Martinez, F. J. Precision medicine in COPD: Where are we and where do we need to go?. Eur. Respir. Rev. 27, 180022 (2018).
Cai, C. J. et al. Human-centered tools for coping with imperfect algorithms during medical decision-making (2019).
Madani, A., Zanen, J., Maertelaer, V. D. & Gevenois, P. A. Pulmonary emphysema: Objective quantification at multi-detector row CT—comparison with macroscopic and microscopic morphometry. Radiology 238, 1036–1043 (2006).
Nakano, Y. et al. The prediction of small airway dimensions using computed tomography. Am. J. Respir. Crit. Care Med. 171, 142–146 (2005).
Mets, O. M. et al. Diagnosis of chronic obstructive pulmonary disease in lung cancer screening computed tomography scans: Independent contribution of emphysema, air trapping and bronchial wall thickening. Respir. Res. 14, 59 (2013).
Hartley, R. A. et al. Relationship between lung function and quantitative computed tomographic parameters of airway remodeling, air trapping, and emphysema in patients with asthma and chronic obstructive pulmonary disease: A single-center study. J. Allergy Clin. Immunol. 137, 1413-1422.e1412 (2016).
Schroeder, J. D. et al. Relationships between airflow obstruction and quantitative CT measurements of emphysema, air trapping, and airways in subjects with and without chronic obstructive pulmonary disease. AJR Am. J. Roentgenol. 201, W460–W470 (2013).
Lynch, D. A. et al. CT-based visual classification of emphysema: Association with mortality in the COPDGene study. Radiology 288, 859–866 (2018).
Kim, E. Y. et al. Detailed analysis of the density change on chest CT of COPD using non-rigid registration of inspiration/expiration CT scans. Eur. Radiol. 25, 541–549 (2015).
Hwang, J. et al. A size-based emphysema severity index: Robust to the breath-hold-level variations and correlated with clinical parameters. Int. J. Chron. Obstruct. Pulmon. Dis. 11, 1835–1841 (2016).
Estépar, R. S. J. et al. Computed tomographic measures of pulmonary vascular morphology in smokers and their clinical implications. Am. J. Respir. Crit. Care Med. 188, 231–239 (2013).
Sieren, J. P. et al. SPIROMICS protocol for multicenter quantitative computed tomography to phenotype the lungs. Am. J. Respir. Crit. Care Med. 194, 794–806 (2016).
Fernandes, L. et al. Small airway imaging phenotypes in biomass- and tobacco smoke-exposed patients with COPD. ERJ Open Res. 3, 00124 (2017).
Crossley, D., Renton, M., Khan, M., Low, E. V. & Turner, A. M. CT densitometry in emphysema: A systematic review of its clinical utility. Int. J. Chron. Obstruct. Pulmon. Dis. 13, 547–563 (2018).
Cropp, R. J., Seslija, P., Tso, D. & Thakur, Y. Scanner and kVp dependence of measured CT numbers in the ACR CT phantom. J. Appl. Clin. Med. Phys. 14, 4417 (2013).
Afifi, M. B., Abdelrazek, A., Deiab, N. A., Abd El-Hafez, A. I. & El-Farrash, A. H. The effects of CT x-ray tube voltage and current variations on the relative electron density (RED) and CT number conversion curves. J. Radiat. Res. Appl. Sci. 13, 1–11 (2020).

Funding

This work was supported by the Korea Medical Device Development Fund grant funded by the Korea government (the Ministry of Science and ICT, the Ministry of Trade, Industry and Energy, the Ministry of Health & Welfare, Republic of Korea, the Ministry of Food and Drug Safety) (Project Number: NTIS 1711138474).

Author information

Authors and Affiliations

Department of Radiology and Research Institute of Radiology, Asan Medical Center, University of Ulsan College of Medicine, 86 Asanbyeongwon-Gil, Songpa-Gu, 05505, Seoul, Korea
Jooae Choe, Hye Young Choi, Sang Min Lee, Sang Young Oh, Hye Jeon Hwang, Namkug Kim & Joon Beom Seo
Department of Radiology, Kyung Hee University Hospital at Gangdong, College of Medicine, Kyung Hee University, Seoul, Korea
Hye Young Choi
Department of Convergence Medicine, Biomedical Engineering Research Center, Asan Medical Center, University of Ulsan College of Medicine, Seoul, Korea
Namkug Kim & Jihye Yun
Department of Pulmonary and Critical Care Medicine, Asan Medical Center, University of Ulsan College of Medicine, Seoul, Korea
Jae Seung Lee & Yeon-Mok Oh
Coreline Soft, Co., Ltd., Seoul, Korea
Donghoon Yu & Byeongsoo Kim

Authors

Jooae Choe
Hye Young Choi
Sang Min Lee
Sang Young Oh
Hye Jeon Hwang
Namkug Kim
Jihye Yun
Jae Seung Lee
Yeon-Mok Oh
Donghoon Yu
Byeongsoo Kim
Joon Beom Seo

Contributions

S.M.L. is the guarantor of the paper and takes responsibility for the integrity of the work as a whole. J.C., H.Y.C. and S.M.L. take responsibility for the data analysis. S.M.L. and J.B.S. contributed to the study design. S.Y.O. and H.J.H. contributed to the radiologic evaluation of the study subjects. N.K., J.Y., D.Y. and B.K. contributed to the development of the software. J.C., S.M.L. and J.B.S. contributed to the interpretation of the results. J.C. and H.Y.C. drafted the initial manuscript. J.C., S.M.L., J.S.L., Y.O. and J.B.S. contributed to review and editing. All authors discussed the results and reviewed the manuscript.

Corresponding author

Correspondence to Sang Min Lee.

Ethics declarations

Competing interests

S.M.L. holds stock/stock options in Coreline Soft, Co., Ltd., Korea. N.K. holds a patent on the method for the automatic classifier of lung diseases (Patent No. KR-10-0998630), has received royalties from Coreline Soft, Co., Ltd., and holds stock/stock options in Coreline Soft, Co., Ltd., Korea. D.Y. and B.K. are employees of Coreline Soft, Co., Ltd., Korea. J.B.S. holds a patent on the method for the automatic classifier of lung diseases (Patent No. KR-10-0998630), has received royalties from Coreline Soft, Co., Ltd., and holds stock/stock options in Coreline Soft, Co., Ltd., Korea.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Choe, J., Choi, H.Y., Lee, S.M. et al. Evaluation of retrieval accuracy and visual similarity in content-based image retrieval of chest CT for obstructive lung disease. Sci Rep 14, 4587 (2024). https://doi.org/10.1038/s41598-024-54954-5

Received: 17 June 2023
Accepted: 19 February 2024
Published: 26 February 2024
DOI: https://doi.org/10.1038/s41598-024-54954-5
