Quantitative vessel tortuosity: A potential CT imaging biomarker for distinguishing lung granulomas from adenocarcinomas

Alilou, Mehdi; Orooji, Mahdi; Beig, Niha; Prasanna, Prateek; Rajiah, Prabhakar; Donatelli, Christopher; Velcheti, Vamsidhar; Rakshit, Sagar; Yang, Michael; Jacono, Frank; Gilkeson, Robert; Linden, Philip; Madabhushi, Anant

doi:10.1038/s41598-018-33473-0

Download PDF

Article
Open access
Published: 16 October 2018

Quantitative vessel tortuosity: A potential CT imaging biomarker for distinguishing lung granulomas from adenocarcinomas

Mehdi Alilou¹,
Mahdi Orooji¹,
Niha Beig ORCID: orcid.org/0000-0002-1150-5886¹,
Prateek Prasanna¹,
Prabhakar Rajiah²,
Christopher Donatelli²,
Vamsidhar Velcheti³,
Sagar Rakshit³,
Michael Yang²,
Frank Jacono⁴,
Robert Gilkeson²,
Philip Linden² &
…
Anant Madabhushi¹

Scientific Reports volume 8, Article number: 15290 (2018) Cite this article

3796 Accesses
23 Citations
7 Altmetric
Metrics details

Subjects

An Author Correction to this article was published on 29 October 2019

This article has been updated

Abstract

Adenocarcinomas and active granulomas can both have a spiculated appearance on computed tomography (CT) and both are often fluorodeoxyglucose (FDG) avid on positron emission tomography (PET) scan, making them difficult to distinguish. Consequently, patients with benign granulomas are often subjected to invasive surgical biopsies or resections. In this study, quantitative vessel tortuosity (QVT), a novel CT imaging biomarker to distinguish between benign granulomas and adenocarcinomas on routine non-contrast lung CT scans is introduced. Our study comprised of CT scans of 290 patients from two different institutions, one cohort for training (N = 145) and the other (N = 145) for independent validation. In conjunction with a machine learning classifier, the top informative and stable QVT features yielded an area under receiver operating characteristic curve (ROC AUC) of 0.85 in the independent validation set. On the same cohort, the corresponding AUCs for two human experts including a radiologist and a pulmonologist were found to be 0.61 and 0.60, respectively. QVT features also outperformed well known shape and textural radiomic features which had a maximum AUC of 0.73 (p-value = 0.002), as well as features learned using a convolutional neural network AUC = 0.76 (p-value = 0.028). Our results suggest that QVT features could potentially serve as a non-invasive imaging biomarker to distinguish granulomas from adenocarcinomas on non-contrast CT scans.

Radiomics as a non-invasive adjunct to Chest CT in distinguishing benign and malignant lung nodules

Article Open access 04 November 2023

Deep learning for the detection of benign and malignant pulmonary nodules in non-screening chest CT scans

Article Open access 27 October 2023

A diagnostic classification of lung nodules using multiple-scale residual network

Article Open access 13 July 2023

Introduction

Histoplasmosis is among the most common endemic fungal infection in the United States¹. On chest CT, active fungal infection induced granulomas and malignant nodules may both have a suspicious spiculated appearance. Moreover, if the fungal infection is active, both lesions may appear ‘hot’ on a PET scan. On screening CT scans, the great majority of pulmonary nodules are benign (95%) and a significant proportion represent granulomas caused by a fungal infection². A number of these patients identified with lung nodules end up having to undergo a surgical intervention in the form of a biopsy, bronchoscopy or a surgical wedge resection for histopathologic confirmation of presence or absence of a malignancy³. Even if no biopsy is recommended, the majority of these patients will have to undergo repeat CT scans for continued assessment of the nodule, thereby exposing them to unnecessary and potentially harmful radiation. Over 1 million people in the US are annually subjected to a CT guided or bronchoscopic biopsy and over 60,000⁴ are subjected to a surgical wedge resection for pathologic confirmation of a pulmonary nodule found on a CT scan⁵. Approximately 30% of pulmonary nodules identified as suspicious on a CT scan and that are subsequently biopsied or resected will finally be confirmed on histopathology to be benign.

Recognizing that there is an unmet need for decision support tools for analysis and interpretation of chest CT scans, a number of groups have been developing “radiomic” approaches for computerized analysis of textural and shape based attributes of the lesion^6,7,8,9. The primary assumption behind these approaches is that there are subtle microarchitectural differences within the nodule, as also shape differences (e.g. malignant nodules are more speculated and benign nodules smooth) that can be captured via corresponding radiomic measurements on chest CT scans. In¹⁰, Liu et al., showed that radiological image traits (quantitative features) were useful in predicting malignancy in lung nodules. The group studied a cohort of 172 patients who had low-dose CT images, with 102 and 70 patients grouped into training and validation cohorts, respectively. A set of 24 radiological traits were systematically scored and a linear classifier was built to identify the possible associations with malignancy. The best feature set including short axis, contour, concavity, and texture yielded an AUC of 0.88 in predicting malignancy in primary nodules in a validation cohort of 70 patients. In¹¹ the authors found an association between shape (e.g. surface area, volume and surface to volume ratio), textural and intensity features extracted from CT scans with underlying gene-expression profiles of lung cancer patients. Way et al.¹², showed via leave-one out cross validation approach that a combination of intra-tumoral texture and shape features could distinguish 44 malignant and 52 benign nodules with an AUC of 0.83. In an extension of this study¹³, the same group also incorporated surface features to complement the shape and texture features to improve classification area under the receiver operating characteristic curve (AUC) from 0.82 to 0.85. In another study, McWiliams et al., in¹⁴ attempted to predict the likelihood of malignancy associated with a nodule based off semi quantitative handcrafted nodule features such as the nodule size, spiculation, location and the number of detected nodules along with clinical variables including age, gender, body mass index and family history of lung cancer. This method yielded an AUC of 0.9 for predicting the presence of malignant nodules.

In addition to the previously mentioned approaches, recently deep learning models¹⁵ and feature embedding tools¹⁶ have been proposed for automatically learning the most discriminating features from nodules and their immediate surroundings. For example, authors in¹⁷ presented a deep learning architecture based on the stacked denoising auto-encoder for the differentiation of distinctive types of breast lesions and lung nodules on ultrasound and CT images. They showed that the model outperformed two intensity and texture based CADx methods previously presented in¹⁸. Setio et al.¹⁹, presented a novel computer aided detection system for pulmonary nodules using multi-view convolutional networks, in which discriminative features were automatically learned from the training data. On 888 scans, their method reached detection sensitivities of 85.4% and 90.1% at 1 and 4 false positives, respectively, per scan. In²⁰, the authors proposed a method to find associations between deep convolutional features and multiple human-defined semantic features of CT pulmonary nodules. In²¹, the authors proposed a multi-scale convolutional neural network (CNN) architecture for nodule classification. Their method achieved an 86.84% classification accuracy and outperformed a number of well established textural descriptors. In²², deep learning was employed for detecting active lesions exclusively from CT images. The resulting precision and recall (0.79 ± 0.18, 0.57 ± 0.18), was found to be worse compared to using a combined PET/CT exam (0.93 ± 0.13, 0.68 ± 0.27). In²³, Teramoto et al., proposed an improved FP-reduction method for the detection of pulmonary nodules in PET/CT images by means of a CNN. False positives were significantly decreased from 72.8 to 4.9 FPs/case. However, the sensitivity decreased from 97.2% to 90.1%. In the LUNA16²⁴ challenge involving pulmonary nodule detection, the leading solutions employed CNNs. Finally, Ciompi et al.²⁵, presented a deep learning based method for detection of different pulmonary nodules types including solid, calcified, part-solid, non-solid, perifissural and spiculated. Apart from nodule detection, there is a growing interest in the use of deep learning models for diagnosis of lung nodules on CT scans¹⁷. However, it remains to be seen how the quality of the annotations and the number of training exemplars will ultimately affect the performance of these networks.

It is well established that tumor vasculature is on account of angiogenesis, the process by which tumors grow new blood vessels for nutrient and oxygen supply, and metastasis. While benign tumors are also associated with vasculature, recent findings suggest differences in vascular morphology for benign and malignant tumors²⁶. Malignant lesions tend to affect regional changes by modulating to vessel shape and tortuosity. Contortions in the vessel tortuosity appear during the tumor development process and affect initially healthy vessels which spread beyond the confines of the tumor margins. For instance Shelton et al.²⁷, in a study involving quantitative morphologic analysis of tumor vessels in mice using Acoustic Angiography showed that vascular tortuosity was significantly more pronounced in tumors compared to normal controls. In a pioneering study²⁸, Bullitt et al., investigated the ability of computer extracted features of vessel morphology to predict the presence of a benign or malignant brain tumor on magnetic resonance angiography (MRA) scans. The same group in²⁹ found that the quantitative measurements of vessel morphology from MRA scans could also provide useful insights into brain tumor development and response to therapy. The findings from^28,29 and²⁷ therefore beg the following two questions. Firstly, whether there are significant differences in vessel tortuosity between lung granulomas and adenocarcinomas. Secondly, whether computerized features of vessel tortuosity can be extracted from routine clinical non-contrast CT scans and whether these measurements can enable discrimination of granulomas and adenocarcinomas.

In this study, we present quantitative vessel tortuosity (QVT), a novel CT imaging biomarker to distinguish between benign granulomas and adenocarcinomas on routine non-contrast lung CT scans. QVT is based off the idea that benign lesions like granulomas tend to have a less tortuous vasculature compared to adenocarcinomas.

Our study comprised of CT scans of 290 patients from two different institutions, one cohort for training (N = 145) and the other (N = 145) for independent validation. All patients had previously undergone surgical wedge resection based off suspicious findings on radiology and hence histopathologically confirmed diagnosis was available for all lesions. A 3D volume of interest was manually defined around the nodule of interest and the associated vasculature was segmented using a 3D region growing segmentation method. A set of 35 QVT features capturing the torsion, curvature and branching statistics of the vessels associated to the nodules were extracted from non-contrast diagnostic CT images. The most discriminating of the 35 QVT tortuosity features were established via feature selection and unsupervised cluster analysis. The features so identified were further pruned based off their stability and reproducibility in the RIDER dataset, a cohort of same day test-retest lung CT scans. The stable and discriminating features thus identified, were used to train machine learning classifiers designated to predict the risk of malignancy to each nodule in the validation set. We perform an exhaustive and rigorous evaluation of our approach with extensive human-machine comparison studies involving two different human readers. Additionally, tortuosity was also quantitatively compared on the validation set against the performance of the state of the art texture and shape features as well as deep features which were automatically learned from nodule regions. The sensitivity of the approach was evaluated as a function of slice thicknesses of CT scans that varied from 1 to 5 mm. The workflow of the current study is presented in Fig. 1. As may be appreciated from Fig. 1, in step 1, a 3D volume of interest was manually defined around the nodule of interest. In step 2, the associated vasculature was segmented using a 3D region growing algorithm, following which center lines of the vessels are extracted by a fast marching algorithm. In step 3, a set of 35 QVT features were extracted. Step 4 includes data analysis in which the stable and discriminating QVT features were used to train machine learning classifiers to predict the risk of nodule malignancy in the validation set. Unsupervised cluster analysis was then employed to find the association of clinical outcome with the QVT features. These features were also evaluated via human-machine comparison studies and were also compared against the predictive performance of texture and shape radiomics as well as deep features.

Results

Identifying the most discriminating QVT features and cluster analysis

The top 12 most predictive QVT features were identified using the minimum redundancy-maximum relevance (mRMR) feature selection algorithm³⁰. A comparison of feature selection strategies including mRMR, least absolute shrinkage and selection operator (LASSO) and principal component analysis (PCA) is provided in Table S5 of the Supplemental Material section. Since the nodule vasculature is comprised of several vessel branches, where each branch is in turn comprised of several points in a 3D space, these QVT features capture first order statistics associated with the average curvature values of the branches. Other QVT features include first order statistics of the maximum curvature values of the branches, volume of the vasculature and the ratio of vasculature volume to the 3D volume of the interest. Additional features include the histogram of curvature and torsion values associated with 3D points on the vessel branches. A detailed description of the features can be found on Table S1 in the Supplementary File.

Figure 2(a) illustrates a feature expression heat map of the most discriminating QVT features for the granulomas and adenocarcinomas in D₁. The QVT feature expression is illustrated on the Y-axis and the X-axis corresponds to the different patients in training set (D₁). As may be observed from Fig. 2(a), a number of the QVT features showed statistically significant differential expression between the adenocarcinomas and granulomas for the patients in D₁. The p-values computed under the null hypothesis that there was no significant difference between the 12 QVT features between adenocarcinomas and granulomas were found to be <0.05. The actual p-values for this comparison for the 12 QVT features for the studies in D₁ are listed in Table S1 of the Supplemental Material section.

Unsupervised clustering of the patients in D₁ was carried out to find a potential association between the predicted number of clusters and the clinical data (i.e, the histopathologic diagnosis of the lung nodule). A χ² test performed on clustering results revealed a significant association between patients cluster labels and the histopathologic diagnosis of these patients in D₁ with p-value = 3.5e⁻⁷. Additionally, unsupervised ensemble clustering³¹ of the patients described with the top 12 QVT features, yielded two dominant patient clusters. As may be appreciated from Fig. 2(b), evaluation of the two dominant clusters identified via unsupervised ensemble clustering revealed that roughly 75% of the cases within each cluster correspond to a single category (i.e. adenocarcinomas or granulomas).

Figure 3 illustrates CT scans corresponding to 2 adenocarcinomas and 2 granulomas. Additionally the figure also illustrates the 3D renderings of nodule vasculature (in red) for the two cases. The corresponding QVT feature vector for each nodule is illustrated in the form of a bar graph in the bottom left of each individual panel. The columns of the bar graph represent the most informative QVT features and the height reflects the corresponding quantitative feature expression. As may be appreciated in Fig. 3, the first 4 QVT features appear to significantly over-express in the case of the adenocarcinomas (Fig. 3(a,c)) and appear to mostly under-express for the granulomas (Fig. 3(b,d)).

Stability analysis of the QVT features identified as most discriminating

Among the 12 QVT features identified via feature selection, first order statistics pertaining to the maximum curvature value of the branches and the ratio of the vasculature volume to the 3D volume of interest were identified as being most stable. These features had an average Intraclass Correlation Coefficient (ICC) = 0.83 ± 0.09, suggesting a high degree of reproducibility for the same day test-retest cases within the RIDER datset. The ICC values obtained for all 35 QVT features on the RIDER dataset are illustrated in Fig. 2(c).

Evaluating the ability of the QVT features to distinguish adenocarcinomas from granulomas

The tortuosity based classifier (C_QVT) yielded an AUC of 0.85 for the cases in validation set (D₂). The corresponding AUC for D₁ was 0.94 ± 0.02. Figure 4(a) shows the AUC values for each individual feature from among the top 12 selected features for both D₁ and D₂. Figure 5(b) illustrates the ROC curves for C_QVT on both D₁ (red) and D₂ (blue) for the support vector machine (SVM) classifier trained with QVT features. The SVM classifier outperformed the k-nearest neighbors (KNN) and Naive Bayes classifiers. Figure 5(a) represents the 3D cluster plot of adenocarcinomas (red dots) and granulomas (blue dots) within D₁ in a 3D space involving the three most discriminating and stable QVT features identified on D₁. As may be appreciated in Fig. 5(a), two classes of nodules within D₁ appear to be clearly separable.

The impact of slice thickness on the performance of the C_QVT classifier was also evaluated. The distribution of the cases with differing slice thickness in both D₁ and D₂ is shown in Fig. 4(b). Table 1 illustrates the performance of C_QVT and C_rad classifiers in terms of AUC as a function of the slice thickness parameter on D₂. C_rad corresponds to a classifier trained with established radiomic features pertaining to nodule texture and shape on D₁. From Table 1, it can be seen that, the AUC values for the QVT features tends to drop slightly with increasing slice thickness.

Table 1 A breakdown of the AUC performance of the C_QVT and C_rad classifiers as a function of CT slice thickness for the cases in D₂.

Full size table

Comparing QVT with state of the art radiomic features

C_QVT outperformed the classifier trained with well-known radiomic features (C_rad) on D₂. C_rad yielded an AUC of 0.82 ± 0.05 and 0.73 on D₁ and D₂ respectively. Figure 5(d) illustrates the ROC curve of the C_QVT classifier on D₁ and D₂ respectively. Furthermore, in Fig. 5(a,b), it can be seen that there is an improved separation between nodules in the reduced space of QVT features, compared to the nodule derived texture and shape based radiomic features.

Comparison with deep learning networks

Using the same training and testing sets as in the previous experiments, with LeNet (C_LeNet), we achieved a training accuracy of 0.99. The weights after 100 epochs were locked down and used on the test set. The testing AUC was computed to be 0.76. At the operating point, the corresponding accuracy was computed to be 0.737. The performance metric across the training runs are shown in section I of the Supplemental Material. Additionally, the training set accuracy for VGG-19 CNN, trained on the ImageNet database, was 0.94, while the test set accuracy was 0.62. Table 2 shows the AUC values of C_QVT, C_rad and deep learning classifier (C_LeNet and C_VGGNet) in both the training and validation sets.

Table 2 AUC values for C_QVT, C_rad as well as deep learning classifiers (C_LeNet and C_VGGNet) obtained from both the training D₁ and validation D₂ cohorts.

Full size table

Comparison with human experts

While C_QVT yielded an AUC of 0.85 on D₂, on the same cohort the AUC for the two human readers was 0.61 and 0.60. These results are summarized in Table S4 of the Supplementary File.

Statistical analysis between patient and CT specific parameters with histopathologic diagnosis of the nodule

Statistical significance test results between patient parameters and clinical outcome for both the D₁ and D₂ cohorts is shown in Table 3. Similarly, Table 4 illustrates the results of significance testing between CT parameters and histopathologic diagnosis of the nodule. The presence of a statistically significant difference was indicated by p < 0.01. ‘Smoking status’ and ‘Age’ were the only patient parameters that were found to be significantly different between adenocarninoma and granulomas in D₁. While in D₂ ‘Age’ was found to be significantly different, no significant differences were identified between ‘Slice Thickness’, ‘Nodule Size’ and ‘Voxel Size’ between the adenocarninomas and granulomas in D₁ or D₂. The effect of scanner variability and voxel size as well as smoking history on the QVT features can be found on section H of the Supplemental Material.

Table 3 Statistical significance testing between patients parameters and disease outcome for both training and validation cohorts. The p-values were computed using Students t test for continuous variable and Fishers exact test for categorical data. Statistically significant difference was indicated by p < 0.01.

Full size table

Table 4 Statistical significance testing between CT parameters and disease outcome for both training and validation cohorts. The p-values were computed using two tailed and unpaired Students t test. Statistically significant difference was indicated by p < 0.01.

Full size table

Comparing nodule adjacent QVT with normal lung QVT

The QVT differences between the healthy and non-healthy regions were compared. The nodule-adjacent QVT values were significantly different (p < 0.05) between the adenocarcinomas and the granulomas. Interestingly, there was no significant difference between the QVT values of the healthy regions taken from patients with adenocarcinoma (Healthy_A) and healthy regions taken from patients with granulomas (Healthy_G). Also QVT was found to be significantly different between the healthy and non-healthy regions. The p-value of the t-test significance analysis between adenecarcinomas and Healthy_A was 0.017 while it was 0.0016 between granulomas and Healthy_G. Figure 6(a) illustrates a nodule (center panel), a 3D rendering of nodule associated vasculature (left panel) as well as 3D rendering of vasculature in a corresponding healthy region (right panel). Figure 6(b) shows the top ranked QVT feature values of the non-healthy regions including adenocarcinomas and granulomas and the healthy regions, Healthy_A and Healthy_G.

Association of the nodule location and diagnostic class

A frequency plot for the nodule position for both adenocarcinomas and granulomas is shown in Fig. 7. In the 2D transverse plane we identified whether a nodule was located within central, upper or lower lung regions (Fig. 7a). Additionally, in the sagittal plane (z-axis), we determined whether a nodule was located in the apical, mid or basal lung regions (Fig. 7b). Performing χ² test³² between nodule position and diagnostic class revealed that there was no significant association between the nodule position (in both 2D and along z-axis) and its diagnostic class. The corresponding p-values are shown in the Table 5. Additional details can be found in Section F.1 of the Supplementary Material.

Table 5 The p-values corresponding to the χ² test between the diagnostic class of a nodule and its relative position in 2D and 3D (along z-axis). p-value < 0.01 was considered as corresponding to a significant association between the spatial location of a nodule and its diagnostic class.

Full size table

Impact of spatial location of a nodule on its corresponding QVT feature measurements

As described in Section F.2 of the Supplementary Material, a set of control regions were defined to determine whether a nodule was located in the a) apical or lower lung portions, b) Gravity dependent or non-dependent portions of the lung and c) peripheral or central regions. Applying, statistical significance testing to the QVT measurements of the nodules belonging to the aforementioned regions revealed that QVT features were not significantly different between apical and lower lung regions. QVT features were however found to be significantly different for nodules located in the dependent and non-dependent lung regions. The statistically significantly different features included the mean torsion of the vessel branches for adenocarcinomas, and the ratio of vasculature volume to its bounding box for granulomas. Finally, 5 QVT features (branching count of the vasculature, 1 feature of curvature histogram, 3 features of torsion histogram) were significantly different between adenocarcinomas of central and peripheral regions. For the granulomas within the same regions, 3 QVT features (mean and standard deviation values of the torsion of the vessel branches, 1 feature of torsion histogram) were found to be significantly different. A more detailed description of these findings is in Section F.2 of the Supplementary Material.

The impact of smoking status on QVT measurements of healthy lung

In this section, we studied the effect of smoking status on the QVT measurements on healthy lung regions. A preliminary analysis was conducted. We selected 20 cases including 10 adenocarcinomas and 10 granulomas. Within each group, five were non-smokers and five were smokers and had at least 50 pack years. For each case we extracted the QVT features from healthy regions according to the method described in the Section entitled “Comparing nodule adjacent QVT with normal lung QVT”. Three QVT features were found to be significantly different between smokers and non-smokers, both within patients with adenocarcinomas and granulomas. These features (all of which had a p < 0.01) included mean, std and max values of the averaged curvature of the vasculature branches. The first row of Fig. 8 illustrates that the QVT features of adenocarcinomas were different between smokers and non-smokers. Similarly, the second row shows the difference between smokers and non-smokers with granulomas.

Discussion

The five-year survival rate following diagnosis of lung cancer in the U.S. is roughly 16%³³. In contrast, the survival rate is above 50% when the disease is diagnosed when it is still localized³⁴. Unfortunately, only 15% of lung cancers are diagnosed at an early stage³⁵. While substantial evidence exists that CT enables early detection of lung cancers and hence can be effective in decreasing the mortality associated with the disease. Radiologist interpretation of lung CT scans is subject to inter-reader variability³⁶. Additionally radiologist interpretations tend to result in a large number of lung nodules as being identified as either indeterminate or suspicious. For example, in a recent study³⁷, investigators from eight Veteran centers across the U.S. screened more than 2000 Veterans over two years using criteria from the national lung screening trial (NLST). Among the 2106 Veterans screened, a total of 1257 (59.7%) had nodules, of which 1184 (56.2%) required tracking. Nearly all of the positive results were negative for cancer, producing a false-positive rate of 97.5% from radiologist based CT interpretations of the CT exams. Consequently, there has been substantial recent interest in the application of radiomic based approaches for non-invasive characterization of lung nodules on CT scans. However most of these approaches have relied on textural^38,39 and intensity or shape⁴⁰ based attributes which may not necessarily provide sufficient discriminability between tumor confounding benign pathologies such as granulomas.

Recent findings in pre-clinical and clinical studies suggest that tumor vasculature might have different morphologies in benign versus malignant conditions²⁷. Some studies^28,29 have suggested that there are significant differences in tortuosity and convolutedness of tumor vasculature between benign and malignant lesions. Motivated by these findings, in this study we presented QVT, a novel CT imaging biomarker to distinguish between benign granulomas and adenocarcinomas on routine non-contrast lung CT scans. QVT represents a new set of mathematically derived measurements that are based off the idea that benign lesions like granulomas tend to have a less tortuous vasculature compared to adenocarcinomas. To the best of our knowledge, this is the first attempt to quantitatively capture and evaluate the role of vessel tortuosity to discriminate between granulomas and adenocarcinomas on lung CT images. The robustness of the QVT was examined via the independent validation and also ensuring that the training and validation cohorts comprised of cases from two different sites. We also performed an exhaustive and rigorous evaluation of our approach with extensive human-machine comparison studies involving two different human readers. It was demonstrated that, QVT performed (AUC = 0.85) substantially better compared to two human expert readers (AUC = 0.61, 0.60). Additionally, our experiments also revealed that QVT features yielded a better classification AUC performance compared to well-known nodule shape and texture radiomic features as well as deep features learned with a convolutional neural network. A recent study by He et al.⁴¹, have suggested that radiomic features tend to be affected by the choice of scanner, reconstruction kernel, and slice thickness. Differences in scanner and CT parameters may affect the reproducibility of radiomic features. In this work, we chose to explicitly evaluate the stability of QVT features using the RIDER test set and found that from among the initial set of 35 QVT features, 4 were highly stable (ICC > 0.7) while 6 were found to have moderate stability (ICC > 0.4). Highly stable features included first order statistics of the maximum curvature values of the branches and the ratio of vasculature volume to the 3D volume of the interest. The moderately stable features corresponded to the first order statistics of the torsion values of the branches. The majority of the features corresponding to the histogram of curvature and torsion measurements were found to not be stable. The segmentation quality of the vasculature could also be another potential source of feature instability. Future work will be necessary to study the effect of segmentation errors between repeated scans on feature instability. The sensitivity of the approach was also evaluated in terms of slice thickness of the CT scans. The AUC values for the QVT features tended to drop slightly with increasing slice thickness. This may be due to the loss of spatial resolution with a corresponding increase in slice thicknesses⁴¹. Even though the performance tapered off towards the higher slice thickness, C_QVT yielded better AUC compared to shape and texture radiomic features.

A number of studies have looked at the problem of distinguishing benign from malignant lesions on CT scans^10,13,42,43, but only a few of these studies were aware of have explicitly looked at the problem of distinguishing granulomas from adenocarcinomas^40,44. For instance, Shah et al.⁴⁵, achieved AUC values between 0.68 and 0.92 with 48 malignant and 33 benign nodules, but the benign nodules did not exclusively comprise granulomas, arguably the most challenging benign tumor confounder on CT and PET scans. Dennie et al.⁴⁶, employed Haralick-related texture features on 55 nodules to discriminate granulomas from primary lung cancer (including adenocarcinoma and squamous cell cancer). Although the approach reported an AUC = 90.2%, it was not validated on an independent test set. The role of shape radiomic features in distinguishing adenocarcinomas and granulomas was previously studied by us in⁴⁰. On a cohort comprising 149 cases, the shape based classifier yielded an AUC of 0.72.

One common attribute associated with the majority of previous radiomic related approaches for lung nodule characterization is the fact that they involve features pertaining to the nodule region alone and not associate nodular structures. However, a growing body of evidence suggests that features pertaining to peri-nodular regions and tumor adjoining areas may be critical in characterizing disease presence⁴⁷, aggressiveness⁴⁸ and treatment response⁴⁹. Braman et al., in⁴⁹ showed that radiomic features of the peri-tumoral region in breast magnetic resonance imaging (MRI) scans were predictive of pathologic complete response (pCR) in breast cancers, the peri-tumoral radiomic features being more predictive of pCR compared to the intra-tumoral features. Prasanna et al.⁴⁸, showed that texture features of the peri-tumoral habitat in brain tumors on MRI was prognostic of long versus short term survival. However, these approaches were primarily focused on radiomic based approaches pertaining to texture and heterogeneity characterization within the tumor habitat. Our novel approach goes beyond characterization of texture patterns of the tumor habitat, focusing instead on the morphology of the nodule vasculature. While previous studies have noted differences in vasculature morphology between benign and malignant presentations^27,28, QVT represents a radiomic based formulation to quantitatively capture and describe vessel tortuosity. The demonstrated QVT differences among normal vs abnormal lung as well as among adenocarcinomas and granulomas suggest that QVT could potentially be considered as an imaging biomarker. While there is certainly promise for QVT to be considered as a potentially predictive imaging marker to distinguish between granulomas and adenocarcinomas, the bar for validating it as a biomarker is higher. In other words, additional criteria^50,51,52 need to be met for QVT to be established and validated as a biomarker.

Our study did have its limitations. As mentioned earlier, the segmentation quality of the vasculature could be a potential source of feature instability. Further work is needed to study the effect of segmentation errors on possible instability of the QVT features. Additional validation of the method with larger datasets from multiple external sites is another future direction. Further, our approach was focused on granulomas and adenocarcinomas alone. Even though granulomas represent the most common confounder of adenocarcinomas, to showcase the utility of QVT as an imaging biomarker that is relevant for the lung cancer screening population, we would need to evaluate its ability to distinguish between other benign and malignant presentations such as lymphoma, fibroma and hamartoma. Another limitation of this study is that we focused solely on a single reader identified nodule for each study, i.e. the nodule that had been excised or biopsied. An obvious extension would be to analyze the vasculature for multiple nodules in the case of presentations with multi-nodular disease. Furthermore, additional analyses of QVT differences in normal and abnormal lung as well as the evaluation of the impact of nodule position and smoking history on QVT have been performed on small sub-sets. Therefore, as a future direction, an extensive validation and confirmation of the initial findings might need to be performed on additional datasets. In spite of the aforementioned limitations, our initial findings suggest that QVT could potentially serve as a non-invasive imaging biomarker to distinguish granulomas from adenocarcinomas on non-contrast CT scans.

Methods

Dataset

In this retrospective study, we collected patients from two sites. Inclusion criteria for these cases were the existence of a diagnostic or a screening CT exam and the presence of a radiographically confounding nodule, one that had previously triggered a surgical intervention, either in the form of a bronchoscopic or CT guided biopsy or a surgical wedge resection. Additionally the inclusion criteria included the histopathologic confirmation of whether the nodule was malignant or benign. Only granulomas and adenocarcinomas were included in this study. All scans acquired for this study were collected as part of an Institutional Review Board-approved, health insurance portability and accountability act of 1996 (HIPAA)-compliant protocol. The inclusion and exclusion criteria flowchart is shown in Fig. S3 of the Supplementary Material. All scans were in digital imaging and communications in medicine (DICOM) format and de-identified to remove patient header information. Need for an informed consent was waived. Histology was confirmed by an thoracic pathologist based off visual interrogation of the biopsied or surgically resected specimen. A total of 290 patients were included in this study. A set of 125 cases including 64 adenocarcinomas and 61 granulomas were acquired from site 1. Whereas, the cases from site 2 comprised of 165 cases including 81 adenocarcinomas and 84 granulomas. All cases were bundled together and split into training (D₁ = 145) and validation (D₂ = 145) cohorts in a blinded way with the only caveat that the training set have a 50-50 split of adenocarcinomas and granulomas. Details of the datasets are provided in Table 6. CT scans were acquired on both Philips and Siemens scanners. The scans were acquired with a tube current between 96 and 426 mAs and voltage of either 100 or 120 kVp.

Table 6 Details of the distribution of the granulomas and adenocarcinomas within D₁ and D₂.

Full size table

Nodule detection and Segmentation of Vasculature

The nodule of interest was manually identified on the CT scan by an expert cardiothoracic radiologist with 20 years of experience. A region of interest (ROI) was manually placed on the nodule of interest by the same radiologist across all contiguous slices on which the nodule was visible via a hand annotation tool in 3D-Slicer software⁵³. Each scan included one nodule of interest. The impact of nodule’s location on the predictions via the QVT classifier were also determined. This involved identifying whether there was an association between nodule position and its diagnostic class. Additional details on these correlative studies are presented in section F of the Supplemental Material.

Segmentation of vasculature associated with a nodule of interest

In the first step, lung regions are isolated from the surrounding anatomy using a multi-threshold based algorithm previously presented in^54,55. In the second step, a region growing algorithm is employed for the segmentation of the nodule vasculature⁵⁶. The center of gravity of the segmented nodules is used as the initial seed points for the region growing algorithm⁵⁷. Within the nodule volume, seed points were initialized at random locations. Based off the intensity similarity of the seed points and surrounding pixels, an initial region iteratively grows to incorporate nodule and vasculature related voxels. A fast marching algorithm⁵⁸ is then employed to identify the center lines of the 3D segmented vasculature⁵⁹. Figure 9 illustrates the process of nodule detection and segmentation of vasculature.

Sensitivity of QVT features to minor vasculature segmentation errors

This section presents the sensitivity of QVT features to minor changes in the vasculature segmentation. We identified the parameter S which controls the growing of the vasculature volume during the segmentation process. This parameter corresponds to the mean difference value in Hounsfield units between a growing volume and the candidate voxels in the neighborhood of the vasculature. The values of S were identified empirically S ∈ {−150, −100, −50, 0, 50}. For each of the values in S, we generated multiple segmentations for each nodule and its surrounding vasculature on the CT scan. Figure 10 (b,c,d,e,f) illustrates 5 segmentations of a nodule and its corresponding vasculature, see Fig. 10(a). Segmentations were generated based on the values of S. Then the QVT features were calculated for all 5 versions of segmentations. The QVT features were calculated for all of the training set examples. We denote the QVT features corresponding to each of the S values as QVT₋₁₅₀, QVT₋₁₀₀, QVT₋₅₀, QVT₀ and QVT₅₀ respectively. To measure sensitivity of QVT features to slight changes in vessel segmentation, we computed the correlation between QVT pairs which were denoted by Corr(QVT_s1, QVT_s2). Some features (13,14, 15, 16) were highly stable and correlated across changes in vessel segmentation. However, some features (17, 18, 19, 20) were found to show more abrupt changes on account of minor perturbations in vessel segmentation. Consequently, these features were not highly correlated between different segmentations. The AUC values of the C_QVT classifier for each QVT_S were computed.

As shown in Table 7, the C_QVT classifier yielded an AUC = 0.79 ± 0.06,0.82 ± 0.12,0.87 ± 0.11,0.75 ± 0.14 and 0.69 ± 18, respectively for the different values of S. A more detailed description of these findings is provided in section G of the Supplemental Material.

Table 7 AUC values of the C_QVT classifier as a function of parameter S.

Full size table

Feature extraction

We extracted 35 QVT features from the nodule vasculature in each CT scan. The extracted QVT features are based on three important properties of the vessels: (I) tortuosity, (II) curvature and (III) branching statistics and the volume measurements of the vasculature. A detailed description of the QVT features is presented on Table S1 of the Supplementary Document. The QVT features attempt to quantify convolutedness and tortuosity of the nodule vasculature. Following segmentation of the nodule vasculature from the surroundings, the fast marching algorithm⁵⁸ is employed to extract the skeleton of the vasculature. The nodule vasculature is assumed to be comprised of several vessel branches where each branch in turn is comprised of several points in a 3D space. The QVT features were measured for points, vessel branches and the entire vasculature. The torsion of a branch is defined as the ratio of the Euclidean distance between the starting and end points of a branch to the length of the branch. The torsion of a whole vasculature, is then calculated by computing the first order statistics of the torsion measurements associated with the branches of a vasculature.

The curvature is computed of points, branches and the entire vasculature. The curvature of a point which is located on the center line of a vessel in a 3D space is defined as the inverse of the radius of an osculating circle fitted to that point and also its left and right neighbors. The statistical moments of the curvature measurements of the points associated with the branches were computed to describe the curvature of a branch and similarly the curvature of the entire vasculature. In addition to torsion and curvature, the branching statistics of the vasculature were computed as well. These measurements refer to the number of small vessel branches in the vasculature. Feature extraction algorithms were implemented in MATLAB 2014b platform (Mathworks, Natick, MA).

Data analysis

A data analysis pipeline was designed to evaluate the discriminability of the QVT features. The data analysis pipeline comprised of the following steps: a) Identifying the most discriminating QVT features and cluster analysis, b) stability analysis of the QVT features identified as most discriminating, c) Evaluating the ability of the QVT features to distinguish adenocarcinomas from granulomas, d) Comparing QVT with the state of the art radiomic features, e) Quantitative comparison of the discriminability of the QVT features compared to the corresponding diagnosis of two expert human readers on the validation set f) Statistical analysis between patients and CT specific parameters with histopathologic diagnosis of the nodule. The following sections provide the details pertaining to these various individual steps.

Identifying the most discriminating QVT features and cluster analysis

The mRMR³⁰ feature selection algorithm was employed to identify the most predictive of the 35 QVT features from among the cases in D₁. The AUC value for each of the most predictive QVT features was calculated.

An unsupervised cluster analysis based off the mRMR identified subset of QVT features was also performed on the cases in D₁. The purpose of unsupervised clustering was to provide an alternative evaluation of the QVT features in addition to supervised classification. We also sought to discover patient clusters that showed corroboration between the QVT features and clinical parameters. A k-means clustering⁶⁰ algorithm was applied to the QVT and clinical parameters for all patients in D₁. To achieve this, unsupervised clustering of the patients into two clusters was carried out based off the top 12 QVT features identified via mRMR. Then, potential association of the predicted cluster numbers with the clinical data (i.e, histopathologic diagnosis of the nodules) is determined by the χ² test which is used to determine whether a significant association between the expected and the observed frequencies in one or more categories exists⁶¹. To identify the most stable and dominant clusters of the patients we also performed unsupervised ensemble clustering on the QVT features for the patients in D₁. This approach involved clustering the patients several times with different clustering parameters such as cluster number and distance metric. Next, we counted the co-occurrence of the pairs of samples, falling into the same cluster. Then the emerging clusters were compared with the corresponding clinical information for the patients.

Stability analysis of the QVT features identified as most discriminating

Stability is an important consideration in assessing the performance of radiomic features⁶². One specific attribute desireable in radiomic features is that the feature expression should either not change (or minimally change) for test-retest scans acquired within a short interval duration⁶³. Since images obtained across different CT scanners, institutions, sites, vendors and acquisition parameters (e.g. reconstruction slice thickness, contrast enhancement and convolutional kernel) tend to vary in appearance, it is critical to assess the ability of radiomic features to deal with these variations. Additionally image variations can also result temporally in a single site due to scanning position and patient specific attributes. Unstable radiomic features, even ones identified as highly predictive on a training set, could yield a markedly inferior classification performance on an independent test set. To assess the stability of the QVT features, we used the independent reference imaging database to evaluate response (RIDER)⁶⁴ lung cancer dataset which consists of same-day repeated test and re-test CT scans for 31 patients⁶⁵. The intraclass correlation coefficient (ICC) was used to assess the stability of the top QVT features identified via the mRMR feature selection approach. ICC varies between −1 to 1, where ICC = 1 corresponds to a highly reproducible feature and ICC = 0 corresponds to a feature which is not highly reproducible and hence unstable.

Evaluating the ability of the QVT features to distinguish adenocarcinomas from granulomas

The set of QVT features identified as stable and discriminating were used to train supervised machine learning algorithms to predict the risk of malignancy of nodules on CT scans. The classifiers included a SVM⁶⁶, Naive Bayes⁶⁷ and KNN⁶⁸ classifier. All classifiers were trained using the instances in the learning set (D₁) and then validated on the independent test set (D₂). For each of these classifiers, training was performed using a 3-fold cross validation re-sampling technique on D₁. The best performing classifier (C_QVT) was selected based on the computed AUC value for discriminating adenocarcinomas and granulomas in D₁. The best performing classifier was then locked down and C_QVT was evaluated in terms of classification performance on the independent validation set (D₂).

Comparison with state of the art radiomic features

The performance of C_QVT was compared with established radiomic features pertaining to nodule texture and shape (C_rad). In this regard, a total of 669 well-known radiomic features including 645 2D texture and intensity, along with 24 3D shape features were extracted from the segmented nodule on the CT scan. All features were extracted in 3D. The texture features comprised local binary patterns, gradient, Gabor filter features, Laws-Laplacian pyramidal features, Law and Haralick features^69,70. Shape features aimed at capturing irregularities and spiculations of nodule shape. Additionally these features also included measurements relating to geometrical properties of the nodule including size, compactness, eccentricity, elongation, convexity and sphericity. The description of these comparative radiomic features can be found in Tables S2 and S3 of the Supplementary Information file.

Comparison with deep learning networks

The performance of C_QVT was compared against two deep learning classifiers (C_LeNet, C_VGGNet). Two deep networks including a simple LeNet architecture and a VGG-19 CNN, in a transfer learning set up, were used for this purpose. The LeNet architecture comprised two sets of convolutional, rectified linear unit (ReLU) activation, and pooling layers, followed by a fully-connected layer, activation, another fully-connected, and finally a softmax classifier. For LeNet architecture,we used a simple patch-based classification approach, the softmax classifier returns the probability of each patch belonging to the two classes of interest. The model was trained over 100 epochs after which the weights were locked down. The performance metric across the training runs are shown in section I of the Supplemental Material. The learned weights were then evaluated on the independent validation set of 145 studies, and the predicted probabilities were utilized to generate the receiver operating characteristic curve. The VGG-19 CNN which was previously trained on the ImageNet database, was used as a feature extractor. The extracted features were then used to build a classifier on the training set, using a Random Forest classifier, and then evaluated on the test set.

Comparison with human experts

The classification performance of C_QVT was compared against the nodule diagnosis of two human experts on the cases in D₂. A board certified attending radiologist with 10 years of experience in thoracic radiology and a pulmonologist with 4 years of experience in reading chest CT scans served as Readers 1 and 2 respectively. Both readers were blinded to the true histopathologic diagnosis of the 145 cases which comprised D₂. Each reader was asked to assign a score between 1 to 5 to each nodule, with 1 referring to a high confidence that the nodule is “benign”, 2 referring to a diagnosis of “mostly benign”, 3 being “not sure”, 4 being “mostly malignant”, and 5 being “malignant”. To evaluate the performance of the experts, the classifier probability output was compared to diagnostic ground truth determined from the pathology reports. Based off the interpretations of the individual human readers, a ROC curve was obtained and the corresponding AUC was calculated.

Statistical analysis between patient and CT specific parameters with histopathologic diagnosis of the nodule

Statistical significance test was carried out between patient and CT specific parameters against histopathologic diagnosis of the nodule. The p-values were computed using the two tailed and unpaired Student t-test for continuous variables and Fishers exact test for categorical data. The threshold for determining statistical significance was considered by p < 0.01. The purpose of t-test analysis was to demonstrate that no significant difference existed in values for a parameter (nodule size for example) between the adenocarcinoma and granuloma groups in D₁. Similarly, the purpose of the Fisher’s exact test was to show that there was no significant association between a categorical parameter (gender for example) and disease outcome. Patient and CT parameters included ‘Gender’, ‘Smoking status’, ‘Ethnicity’, ‘Age’, ‘Nodule size’ and ‘Slice thickness’.

Comparing nodule adjacent QVT with normal lung QVT

The QVT features from both non-healthy (nodule and its surrounding vasculature) and healthy regions (non-tumor) of the lung were extracted from both adenocarcinoma and granuloma cases. For non-healthy regions, we considered a healthy region with the same size corresponding to the opposite lung. The position of a healthy region on the opposite lung was determined according to the line of symmetry between the left and right lungs (see Fig. 6a). The healthy regions obtained from the normal lung opposite to the adenocarcinomas are referred to as Healthy_A and similarly healthy regions obtained from the normal lung opposite to the granulomas are referred to as Healthy_G. Next, QVT features were extracted from both non-healthy and healthy regions and the top ranked QVT feature was compared between these two regions using the paired student t-test. Non-healthy regions included adenocarcinomas and granulomas while healthy regions included the Healthy_A and Healthy_G.

Change history

29 October 2019
An amendment to this paper has been published and can be accessed via a link at the top of the paper.

References

Baddley, J. W. et al. Geographic distribution of endemic fungal infections among older persons, united states. Emerg Infect Dis 17, 1664–9 (2011).
PubMed PubMed Central Google Scholar
Mukhopadhyay, S. & Gal, A. A. Granulomatous lung disease: an approach to the differential diagnosis. Arch. pathology & laboratory medicine 134, 667–690 (2010).
Google Scholar
Henschke, C. et al. Survival of patients with stage I lung cancer detected on CT screening. New Engl. J. Medicine 2006, 1763–1771 (2006).
Google Scholar
Boskovic, T. et al. Pneumothorax after transbronchial needle biopsy. J. thoracic disease 6, S427 (2014).
Google Scholar
Rusu, M. et al. Co-registration of pre-operative CT with ex vivo surgically excised ground glass nodules to define spatial extent of invasive adenocarcinoma on in vivo imaging: a proof-of-concept study. Eur. Radiol. 27, 1–9 (2017).
Google Scholar
Gillies, R. J., Kinahan, P. E. & Hricak, H. Radiomics: images are more than pictures, they are data. Radiol. 278, 563–577 (2015).
Google Scholar
Lambin, P. et al. Radiomics: extracting more information from medical images using advanced feature analysis. Eur. journal cancer 48, 441–446 (2012).
Google Scholar
Parmar, C. et al. Radiomic feature clusters and prognostic signatures specific for lung and head & neck cancer. Sci. reports 5, 11044 (2015).
ADS Google Scholar
Thawani, R. et al. Radiomics and radiogenomics in lung cancer: A review for the clinician. Lung Cancer 115, 34–41 (2018).
PubMed Google Scholar
Liu, Y. et al. Radiological image traits predictive of cancer status in pulmonary nodules. Clin. Cancer Res. 23, 1442–1449 (2017).
PubMed Google Scholar
Aerts, H. J. et al. Decoding tumour phenotype by noninvasive imaging using a quantitative radiomics approach. Nat. communications 5, 4006 (2014).
ADS CAS Google Scholar
Way, T. W. et al. Computer-aided diagnosis of pulmonary nodules on CT scans: Segmentation and classification using 3D active contours. Med. Phys. 33, 2323–2337 (2006).
PubMed PubMed Central Google Scholar
Way, T. W. et al. Computer-aided diagnosis of pulmonary nodules on CT scans: Improvement of classification performance with nodule surface features. Med. physics 36, 3086–3098 (2009).
ADS Google Scholar
McWilliams, A. et al. Probability of cancer in pulmonary nodules detected on first screening CT. New Engl. J. Medicine 369, 910–919 (2013).
CAS Google Scholar
LeCun, Y., Bengio, Y. & Hinton, G. Deep learning. Nature 521, 436–444 (2015).
ADS CAS Google Scholar
Jia, Y. et al. Caffe: Convolutional architecture for fast feature embedding. In Proceedings of the 22nd ACM international conference on Multimedia, 675–678 (ACM, 2014).
Cheng, J. Z. et al. Computer-aided diagnosis with deep learning architecture: applications to breast lesions in US images and pulmonary nodules in CT scans. Scientific reports 6, 24454 (2016).
ADS CAS PubMed PubMed Central Google Scholar
Yang, M. et al. Robust Texture Analysis Using Multi-Resolution Gray-Scale Invariant Features for Breast Sonographic Tumor Diagnosis. IEEE Trans. Med. Imag. 32, 2262–2273 (2013).
Google Scholar
Setio, A. A. A. et al. Pulmonary nodule detection in CT images: false positive reduction using multi-view convolutional networks. IEEE transactions on medical imaging 35, 1160–1169 (2016).
PubMed Google Scholar
Chen, S. et al. Bridging computational features toward multiple semantic features with multi-task regression: A study of CT pulmonary nodules. In International Conference on Medical Image Computing and Computer-Assisted Intervention–MICCAI, 53–60 (Springer, 2016).
Shen, W., Zhou, M., Yang, F., Yang, C. & Tian, J. Multi-scale convolutional neural networks for lung nodule classification. In International Conference on Information Processing in Medical Imaging, 588–599 (Springer, 2015).
Pawelczyk, K. et al. Towards Detecting High-Uptake Lesions from Lung CT Scans Using Deep Learning. In International Conference on Image Analysis and Processing, 310–320 (Springer, 2017).
Teramoto, A., Fujita, H., Yamamuro, O. & Tamaki, T. Automated detection of pulmonary nodules in PET/CT images: Ensemble false-positive reduction using a convolutional neural network technique. Medical physics 43, 2821–2827 (2016).
ADS PubMed Google Scholar
Setio, A. A. A. et al. Validation, comparison, and combination of algorithms for automatic detection of pulmonary nodules in computed tomography images: the LUNA16 challenge. Medical image analysis 42, 1–13 (2017).
PubMed Google Scholar
Ciompi, F. et al. Towards automatic pulmonary nodule management in lung cancer screening with deep learning. Scientific reports 7, 46479 (2017).
ADS CAS PubMed PubMed Central Google Scholar
Roberts, T., Hasleton, P. S., Musgrove, C., Swindell, R. & Lawson, R. Vascular invasion in non-small cell lung carcinoma. J. clinical pathology 45, 591–593 (1992).
CAS Google Scholar
Shelton, S. E. et al. Quantification of microvascular tortuosity during tumor evolution using acoustic angiography. Ultrasound medicine & biology 41, 1896–1904 (2015).
Google Scholar
Bullitt, E. et al. Vessel tortuosity and brain tumor malignancy: a blinded study. Acad. radiology 12, 1232–1240 (2005).
Google Scholar
Bullitt, E. et al. Computerized assessment of vessel morphological changes during treatment of glioblastoma multiforme: report of a case imaged serially by MRA over four years. Neuroimage 47, T143–T151 (2009).
PubMed Google Scholar
Peng, H., Long, F. & Ding, C. Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy. IEEE Transactions on pattern analysis machine intelligence 27, 1226–1238 (2005).
PubMed Google Scholar
Tumer, K. & Agogino, A. K. Ensemble clustering with voting active clusters. Pattern Recognit. Lett. 29, 1947–1953 (2008).
Google Scholar
McHugh, M. L. The chi-square test of independence. Biochemia Medica 23, 143–149 (2013).
CAS PubMed PubMed Central Google Scholar
Siegel, R., Ward, E., Brawley, O. & Jemal, A. The impact of eliminating socioeconomic and racial disparities on premature cancer deaths. CA-A Cancer J. for Clin. 61, 212–236 (2011).
Google Scholar
Palma, J. F., Das, P. & Liesenfeld, O. Lung cancer screening: utility of molecular applications in conjunction with low-dose computed tomography guidelines. Expert. review molecular diagnostics 16, 435–447 (2016).
CAS Google Scholar
Olak, J. & Ng, A. Diagnosis and treatment of early-stage non-small cell lung cancer. The oncologist 1, 201–209 (1996).
CAS PubMed Google Scholar
Van Riel, S. J. et al. Observer variability for classification of pulmonary nodules on low-dose CT images and its effect on nodule management. Radiol. 277, 863–871 (2015).
Google Scholar
Kinsinger, L. S. et al. Implementation of lung cancer screening in the veterans health administration. JAMA internal medicine 177, 399–406 (2017).
PubMed Google Scholar
Sun, T., Zhang, R., Wang, J., Li, X. & Guo, X. Computer-aided diagnosis for early-stage lung cancer based on longitudinal and balanced data. PloS one 8, e63559 (2013).
ADS CAS PubMed PubMed Central Google Scholar
Yao, J., Dwyer, A., Summers, R. M. & Mollura, D. J. Computer-aided diagnosis of pulmonary infections using texture analysis and support vector machine classification. Acad. Radiol. 18, 306–314 (2011).
PubMed PubMed Central Google Scholar
Alilou, M. et al. An integrated segmentation and shape-based classification scheme for distinguishing adenocarcinomas from granulomas on lung CT. Med. Phys. 44, 3556–3569 (2017).
CAS PubMed PubMed Central Google Scholar
He, L. et al. Effects of contrast-enhancement, reconstruction slice thickness and convolution kernel on the diagnostic performance of radiomics signature in solitary pulmonary nodule. Sci. reports 6, 34921 (2016).
ADS CAS Google Scholar
Hawkins, S. et al. Predicting malignant nodules from screening CT scans. J. Thorac. Oncol. 11, 2120–2128 (2016).
PubMed PubMed Central Google Scholar
McCarville, M. B. et al. Distinguishing benign from malignant pulmonary nodules with helical chest CT in children with malignant solid tumors. Radiol. 239, 514–520 (2006).
Google Scholar
Alilou, M., Orooji, M. & Madabhushi, A. Intra-perinodular textural transition (ipris): A 3D descriptor for nodule diagnosis on lung CT. In International Conference on Medical Image Computing and Computer-Assisted Intervention–MICCAI, 647–655 (Springer, 2017).
Shah, S. K. et al. Computer-aided diagnosis of the solitary pulmonary nodule. Acad. Radiol. 12, 570–575 (2005).
PubMed Google Scholar
Dennie, C. et al. Role of quantitative computed tomography texture analysis in the differentiation of primary lung cancer and granulomatous nodules. Quant. imaging medicine surgery 6, 6 (2016).
Google Scholar
Gatenby, R. A., Grove, O. & Gillies, R. J. Quantitative imaging in cancer evolution and ecology. Radiol. 269, 8–14 (2013).
Google Scholar
Prasanna, P., Patel, J., Partovi, S., Madabhushi, A. & Tiwari, P. Radiomic features from the peritumoral brain parenchyma on treatment-naive multi-parametric MR imaging predict long versus short-term survival in glioblastoma multiforme: preliminary findings. Eur. radiology 27, 4188–4197 (2017).
Google Scholar
Braman, N. M. et al. Intratumoral and peritumoral radiomics for the pretreatment prediction of pathological complete response to neoadjuvant chemotherapy based on breast DCE-MRI. Breast Cancer Res. 19, 57 (2017).
PubMed PubMed Central Google Scholar
Hasan, N., Kumar, R. & Kavuru, M. S. Lung cancer screening beyond low-dose computed tomography: the role of novel biomarkers. Lung 192, 639–648 (2014).
Google Scholar
Dancey, J. E. et al. Guidelines for the development and incorporation of biomarker studies in early clinical trials of novel agents. Clinical cancer research 16, 1078–0432 (2010).
Google Scholar
Buckler, A. J., Bresolin, L., Dunnick, N. R. & Sullivan, D. C., Group. Quantitative imaging test approval and biomarker qualification: interrelated but distinct activities. Radiology 259, 875–884 (2011).
PubMed PubMed Central Google Scholar
Fedorov, A. et al. 3D slicer as an image computing platform for the quantitative imaging network. Magn. resonance imaging 30, 1323–1341 (2012).
Google Scholar
Hu, S., Hoffman, E. A. & Reinhardt, J. M. Automatic lung segmentation for accurate quantitation of volumetric X-ray CT images. Med. Imaging, IEEE Transactions on 20, 490–498 (2001).
CAS Google Scholar
Leader, J. K. et al. . Automated lung segmentation in X-ray computed tomography: development and evaluation of a heuristic threshold-based scheme. Acad. radiology 10, 1224–1236 (2003).
Google Scholar
Rudyanto, R. D. et al. Comparing algorithms for automated vessel segmentation in computed tomography scans of the lung: the vessel 12 study. Med. image analysis 18, 1217–1232 (2014).
Google Scholar
Adams, R. & Bischof, L. Seeded region growing. Pattern Analysis Mach. Intell. IEEE Transactions on 16, 641–647 (1994).
Google Scholar
Sethian, J. A. Fast marching methods. SIAM review 41, 199–235 (1999).
ADS MathSciNet MATH Google Scholar
Li, H., Yezzi, A. & Cohen, L. 3D multi-branch tubular surface and centerline extraction with 4D iterative key points. In Medical Image Computing and Computer-Assisted Intervention–MICCAI 2009, 1042–1050 (Springer, 2009).
Hartigan, J. A. & Wong, M. A. Algorithm as 136: A k-means clustering algorithm. J. Royal Stat. Soc. Ser. C (Applied Stat). 28, 100–108 (1979).
MATH Google Scholar
Rao, J. N. & Scott, A. J. The analysis of categorical data from complex sample surveys: chi-squared tests for goodness of fit and independence in two-way tables. J. Am. Stat. Assoc. 76, 221–230 (1981).
MathSciNet MATH Google Scholar
Zhao, B. et al. Reproducibility of radiomics for deciphering tumor phenotype with imaging. Sci. reports 6, 23428 (2016).
ADS CAS Google Scholar
Balagurunathan, Y. et al. Test–retest reproducibility analysis of lung CT image features. J. digital imaging 27, 805–823 (2014).
Google Scholar
Armato, S. et al. The reference image database to evaluate response to therapy in lung cancer (RIDER) project: A resource for the development of change-analysis software. Clin. Pharmacol. & Ther. 84, 448–456 (2008).
CAS Google Scholar
Aerts, H. J. et al. Corrigendum: Defining a Radiomic Response Phenotype: A Pilot Study using targeted therapy in NSCLC. Sci. reports 7, 41197 (2017).
ADS CAS Google Scholar
Suykens, J. A. & Vandewalle, J. Least squares support vector machine classifiers. Neural processing letters 9, 293–300 (1999).
Google Scholar
Ng, A. Y. & Jordan, M. I. On discriminative vs. generative classifiers: A comparison of logistic regression and naive Bayes. In Advances in neural information processing systems, 841–848 (2002).
Cover, T. & Hart, P. Nearest neighbor pattern classification. IEEE transactions on information theory 13, 21–27 (1967).
MATH Google Scholar
He, D.-C. & Wang, L. Texture unit, texture spectrum, and texture analysis. IEEE transactions on Geosci. Remote. Sens. 28, 509–512 (1990).
ADS Google Scholar
Haralick, R. M. Statistical and structural approaches to texture. Proc. IEEE 67, 786–804 (1979).
Google Scholar
Van Uitert, R. & Bitter, I. Subvoxel precise skeletons of volumetric data based on fast marching methods. Med. physics 34, 627–638 (2007).
ADS Google Scholar

Download references

Acknowledgements

Research reported in this publication was supported by the National Cancer Institute of the National Institutes of Health under award numbers 1U24CA199374-01, R01CA202752-01A1, R01CA216579-01A1, R01CA208236-01A1, R21CA179327-01; R21CA195152-01 the National Institute of Diabetes and Digestive and Kidney Diseases under award number R01DK098503-02, National Center for Research Resources under award number 1 C06 RR12463-01 the DOD Prostate Cancer Synergistic Idea Development Award (PC120857); the DOD Lung Cancer Idea Development New Investigator Award (LC130463), the DOD Prostate Cancer Idea Development Award; the DOD Peer Reviewed Cancer Research Program W81XWH-16-1-0329, W81XWH-15-1-0613, the Case Comprehensive Cancer Center Pilot Grant VelaSano Grant from the Cleveland Clinic the Wallace H. Coulter Foundation Program in the Department of Biomedical Engineering at Case Western Reserve University. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.

Author information

Authors and Affiliations

Department of Biomedical Engineering, Case Western Reserve University, Cleveland, OH, 44106, USA
Mehdi Alilou, Mahdi Orooji, Niha Beig, Prateek Prasanna & Anant Madabhushi
University Hospital Case Medical Center, Cleveland, OH, 44106, USA
Prabhakar Rajiah, Christopher Donatelli, Michael Yang, Robert Gilkeson & Philip Linden
Taussig Cancer Institute, Cleveland Clinic, Cleveland, OH, 44106, USA
Vamsidhar Velcheti & Sagar Rakshit
Louis Stokes Cleveland VA Medical Center, Cleveland, OH, 44106, USA
Frank Jacono

Authors

Mehdi Alilou
View author publications
You can also search for this author in PubMed Google Scholar
Mahdi Orooji
View author publications
You can also search for this author in PubMed Google Scholar
Niha Beig
View author publications
You can also search for this author in PubMed Google Scholar
Prateek Prasanna
View author publications
You can also search for this author in PubMed Google Scholar
Prabhakar Rajiah
View author publications
You can also search for this author in PubMed Google Scholar
Christopher Donatelli
View author publications
You can also search for this author in PubMed Google Scholar
Vamsidhar Velcheti
View author publications
You can also search for this author in PubMed Google Scholar
Sagar Rakshit
View author publications
You can also search for this author in PubMed Google Scholar
Michael Yang
View author publications
You can also search for this author in PubMed Google Scholar
Frank Jacono
View author publications
You can also search for this author in PubMed Google Scholar
Robert Gilkeson
View author publications
You can also search for this author in PubMed Google Scholar
Philip Linden
View author publications
You can also search for this author in PubMed Google Scholar
Anant Madabhushi
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.M. and M.A. conceived the experiment(s), M.A. conducted the experiment(s), M.A., M.O., N.B. analyzed the results, P.P. conducted deep learning experiments, P.R. did the manual annotations, R.G., S.R. and V.M. curated the datasets, C.D. and R.G. participated in the human machine experiments, M.Y., R.G., F.J. and P.L. provided clinical insights into the results and also helped formulate the hypothesis. M.A. and A.M. wrote the manuscript. All authors have reviewed the manuscript.

Corresponding author

Correspondence to Mehdi Alilou.

Ethics declarations

Competing Interests

Dr. Madabhushi is an equity holder in Elucid Bioimaging and in Inspirata Inc. He is also a scientific advisory consultant for Inspirata Inc. In addition he currently serves as a scientific advisory board member for Inspirata Inc. He also has sponsored research agreements with Philips and Inspirata Inc. His technology has been licensed to Elucid Bioimaging and Inspirata Inc. He is also involved in a NIH U24 grant with PathCore Inc and 3 R01 grants with Inspirata Inc.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

supplementary material

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Alilou, M., Orooji, M., Beig, N. et al. Quantitative vessel tortuosity: A potential CT imaging biomarker for distinguishing lung granulomas from adenocarcinomas. Sci Rep 8, 15290 (2018). https://doi.org/10.1038/s41598-018-33473-0

Download citation

Received: 08 January 2018
Accepted: 24 September 2018
Published: 16 October 2018
DOI: https://doi.org/10.1038/s41598-018-33473-0

Keywords

This article is cited by

Predicting cancer outcomes with radiomics and artificial intelligence in radiology
- Kaustav Bera
- Nathaniel Braman
- Anant Madabhushi
Nature Reviews Clinical Oncology (2022)
Distinguishing Adenocarcinomas from Granulomas in the CT scan of the chest: performance degradation evaluation in the automatic segmentation framework
- Mahsa Bank Tavakoli
- Mahdi Orooji
- Ramita Shahabifar
BMC Research Notes (2021)
Marginal radiomics features as imaging biomarkers for pathological invasion in lung adenocarcinoma
- Hwan-ho Cho
- Geewon Lee
- Hyunjin Park
European Radiology (2020)
Offbeat approaches to cancer research
- Nic Fleming
Nature (2019)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Identifying the most discriminating QVT features and cluster analysis

Stability analysis of the QVT features identified as most discriminating

Evaluating the ability of the QVT features to distinguish adenocarcinomas from granulomas

Comparing QVT with state of the art radiomic features

Comparison with deep learning networks

Comparison with human experts

Statistical analysis between patient and CT specific parameters with histopathologic diagnosis of the nodule

Comparing nodule adjacent QVT with normal lung QVT

Association of the nodule location and diagnostic class

Impact of spatial location of a nodule on its corresponding QVT feature measurements

The impact of smoking status on QVT measurements of healthy lung

Discussion

Methods

Dataset

Nodule detection and Segmentation of Vasculature

Segmentation of vasculature associated with a nodule of interest

Sensitivity of QVT features to minor vasculature segmentation errors

Feature extraction

Data analysis

Identifying the most discriminating QVT features and cluster analysis

Stability analysis of the QVT features identified as most discriminating

Evaluating the ability of the QVT features to distinguish adenocarcinomas from granulomas

Comparison with state of the art radiomic features

Comparison with deep learning networks

Comparison with human experts

Statistical analysis between patient and CT specific parameters with histopathologic diagnosis of the nodule

Comparing nodule adjacent QVT with normal lung QVT

Change history

29 October 2019

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing Interests

Additional information

Electronic supplementary material

Rights and permissions

About this article

Cite this article

Share this article

Keywords

This article is cited by

Comments

Search

Quick links