Radiomics feature robustness as measured using an MRI phantom

Lee, Joonsang; Steinmann, Angela; Ding, Yao; Lee, Hannah; Owens, Constance; Wang, Jihong; Yang, Jinzhong; Followill, David; Ger, Rachel; MacKin, Dennis; Court, Laurence E.

doi:10.1038/s41598-021-83593-3

Download PDF

Article
Open access
Published: 17 February 2021

Radiomics feature robustness as measured using an MRI phantom

Scientific Reports volume 11, Article number: 3973 (2021) Cite this article

4673 Accesses
44 Citations
1 Altmetric
Metrics details

Subjects

Abstract

Radiomics involves high-throughput extraction of large numbers of quantitative features from medical images and analysis of these features to predict patients’ outcome and support clinical decision-making. However, radiomics features are sensitive to several factors, including scanning protocols. The purpose of this study was to investigate the robustness of magnetic resonance imaging (MRI) radiomics features with various MRI scanning protocol parameters and scanners using an MRI radiomics phantom. The variability of the radiomics features with different scanning parameters and repeatability measured using a test–retest scheme were evaluated using the coefficient of variation and intraclass correlation coefficient (ICC) for both T1- and T2-weighted images. For variability measures, the features were categorized into three groups: large, intermediate, and small variation. For repeatability measures, the average T1- and T2-weighted image ICCs for the phantom (0.963 and 0.959, respectively) were higher than those for a healthy volunteer (0.856 and 0.849, respectively). Our results demonstrated that various radiomics features are dependent on different scanning parameters and scanners. The radiomics features with a low coefficient of variation and high ICC for both the phantom and volunteer can be considered good candidates for MRI radiomics studies. The results of this study will assist current and future MRI radiomics studies.

Comparing effectiveness of image perturbation and test retest imaging in improving radiomic model reliability

Article Open access 25 October 2023

Repeatability and reproducibility study of radiomic features on a phantom and human cohort

Article Open access 21 January 2021

Impact of rescanning and repositioning on radiomic features employing a multi-object phantom in magnetic resonance imaging

Article Open access 09 July 2021

Introduction

Medical imaging plays an important role in clinical cancer care for diagnosis, radiation therapy, treatment planning, and cancer management. Researchers have developed various analytical medical imaging methods, such as image segmentation, registration, pattern recognition, and multivariate pattern classification. One of these, radiomics^1,2,3,4, has recently emerged as a promising medical image analysis tool for diagnosis and prediction of response to treatment of various diseases. Radiomics involves the high-throughput extraction of large numbers of quantitative features from medical images and analysis of these features to predict patients’ outcome and support clinical decision-making, such as classifying benign and malignant tumors, determining molecular subtypes and/or mutation status, and predicting overall survival.

Several radiomics analyses have been used with various imaging modalities in oncology, such as computed tomography (CT), magnetic resonance imaging (MRI), and positron emission tomography (PET), and results showed that a large number of radiomics features have prognostic power in several studies, such as lung and head and neck cancer patients on CT images^3,5, prognosis of recurrence and survival in lung cancer patients on PET/CT images^6,7, and in brain tumor and breast cancer patients on MRI images^8,9,10,11,12. Radiomics features are sensitive to several factors, however, such as reconstruction settings^13,14, tumor delineation¹⁵, scanning protocols^16,17, different scanners¹⁸, and various noise sources. Several radiomics studies have investigated reproducibility and repeatability¹⁹. For example, Peerlings et al.²⁰ investigated on stability of radiomics features in apparent diffusion coefficient (ADC) maps. Schwier et al.²¹ investigated on repeatability of multiparametric prostate MRI radiomics features. Fave et al.²² evaluated how different image preprocessing techniques may impact both the volume dependence and prognostic potential of the features of non-small cell lung cancer in CT and investigated the variability in voxel size, slice thickness, and convolution kernels in CT²³. Also, Mackin et al.²⁴ investigated variability in radiomics features with the x-ray tube current used in CT. In a recent study, Shiri et al.²⁵ investigated the impact of image reconstruction settings on radiomics features using two PET/CT scanners. They found that the variability and robustness of PET/CT images are dependent on different features and concluded that radiomics features with a low coefficient of variation (COV) are good candidates for reproducible tumor quantification in multicenter studies. In a similar study of PET, Bailly et al.²⁶ investigated the variability of 15 textural features according to reconstruction parameters in multicenter trials and found that Homogeneity, Entropy, Dissimilarity, High Gray-Level Run Emphasis (HGRE), High Gray Level-Zone Emphasis (HGZE), and Zone Percentage (ZP) features are robust and suitable for use in multicenter trials.

However, not many studies have investigated the repeatability (variations when a patient is scanned twice on the same scanner with the same parameters) and variability when different scanning protocols are used for MRI radiomics studies. MRI is an important diagnostic imaging modality and has been widely used as a major diagnostic tool in both clinical imaging and scientific research, and quantitative radiomics analysis using MRI has increased recently.

Therefore, in the present study, we created an MRI radiomics phantom and used it to assess the robustness of MRI radiomics features with various MRI scanning protocols and two MRI scanners. First, we evaluated radiomics features of the MRI phantom by comparing each feature value with patient population data using the two-sigma range of feature values extracted from 97 T1- and T2-weighted MR images of patients with brain lesions. We then investigated the robustness of magnetic resonance imaging (MRI) radiomics features with various MRI scanning protocol parameters and scanners using an MRI radiomics phantom.

Results

We determined the suitability of the MRI phantom materials by comparing the radiomics feature values from the phantom materials with those of the brain lesions of the patient data (mean values ± two standard deviations [SDs]) (Table 1). Figure 1 illustrates this analysis, showing the values of the inverse variance texture feature for the phantom materials over various settings in a number of excitations (NEX). The orange solid lines and orange dashed line in the figure represent the mean ± two SDs bounds and mean patient population data for the inverse variance feature, respectively. Averages of 92.5% and 79.6% phantom radiomics features for the 20 materials were within the established patient population bounds for T1- and T2-weighted images, respectively.

Table 1 The percentages of radiomics features for the MRI phantom within the established patient population bounds (mean ± 2 SDs).

Full size table

We used the COV to assess the variability of radiomics features for the impact of different MRI parameter settings and plotted a heat map of the COV for both the phantom and volunteer. We repeated this with image intensity normalization, without normalization, with smoothing filter, and without smoothing filter as a preprocessing, respectively (Fig. 2). We used 3 sigma method²⁷ for the intensity normalization and the butterworth algorithm^28,29,30 for the smoothing filter. We also investigated the variability of radiomics features with different ROI size (diameter of 1.2 cm) (Fig. 2). Based on the COV, we categorized the features in terms of variation using three groups: large variation (COV > 30%), intermediate variation (10% < COV ≤ 30%), and small variation (COV ≤ 10%)²⁵. Without any image reconstruction such as normalization and filtering process, the average COVs in these three groups were 6.1%, 18.5%, and 45.5%, respectively, for T1-weighted images and 4.5%, 17.2%, and 51.4%, respectively, for T2-weighted images. Tables 2 and 3 summarize the radiomics features in the three groups for T1- and T2-weighted images, respectively. With normalization and filtering process, the average COVs for three groups summarized in Table 4. The detailed radiomics features in the three groups for T1 and T2- weighted images are listed in Tables S4, S5, S6, S7, S8 and S9 in the supplement information.

Table 2 Variations of radiomics features over different MRI scanning settings for T1-weighted images without normalization and smoothing.

Full size table

Table 3 Variations of radiomics features over different MRI scanning settings for T2-weighted images without normalization and smoothing.

Full size table

Table 4 Average COV for T1- and T2-weighted images.

Full size table

Figure 3 shows intraclass correlation coefficient (ICC) plots for T1- and T2-weighted images of the phantom and volunteer for a test–retest scheme on a single scanner. We found that the T1- and T2-weighted image repeatability measures for the phantom (average ICC, 0.963 and 0.959, respectively) were higher than those for the volunteer (average ICC, 0.856 and 0.849, respectively). In this study, we categorized repeatability variations using three groups: high repeatability (ICC ≥ 0.9), intermediate repeatability (0.6 ≤ ICC < 0.9), and poor repeatability (ICC < 0.6)³¹. Tables 5 and 6 summarize the repeatability of the radiomics features for various MRI scanning parameters for all three groups for the phantom and volunteer, respectively. For the phantom, the ICC for all features except the Gray Level Non-uniformity (T1), Inter Quartile Range (T2), and Information Measure Corr 1 (T2) was greater than 0.6 for both T1- and T2-weighted images. For the feature comparison between with and without normalization, with and without smoothing effects, and different ROI sizes, we summarized the results in Tables S10, S11, S12, respectively. Based on these results, we can see that features in GLRL and NID are more invariant compared to other feature categories.

Table 5 Repeatability of the radiomics features with different MRI scanning settings using the same scanner for the phantom.

Full size table

Table 6 Repeatability of the radiomics features with different MRI scanning settings using the same scanner for the volunteer.

Full size table

Discussion

In recent years, radiomic studies have become increasingly important for medical image analysis to assist the diagnosis, prognosis, and prediction of treatment response within clinical-decision making systems. However, radiomics features are sensitive to different image reconstruction settings, scanning protocols, scanners, and noise sources, so we must identify the radiomics features that remain stable to provide accurate and reliable decision support for patient care. In the present study, we made our phantom with 20 homogeneous and heterogeneous materials selected carefully (Fig. 4B). So, our phantom is similar to the human brain as brain has both homogeneous and heterogeneous regions for fair comparison. We showed the suitability of the phantom materials by comparing radiomics features obtained from phantom materials with those of the brain lesions of patients. We used the brain MRI data over other patient anatomies because of its stable movement. Various studies showed that respiratory motion was a major factor leading to irreproducibility in various modalities such as MRI, PET, and CT³². Next, we investigated the variability and repeatability in radiomics features extracted from T1- and T2-weighted MR images of an MRI phantom and a healthy volunteer to identify radiomics feature robustness for various scanning protocols and different scanners. Our results showed that the robustness of the MRI radiomics features across the different scanning protocols varies depending on radiomics features. According to our results, most intensity-based and gray level co-occurrence matrix (GLCM) features were in the intermediate or small variation group, whereas most neighborhood gray-tone difference (NGTD) features were in the high variation group. NGTD features are extracted from an image inside the region of interest (ROI), and intensity difference is computed in a two-dimensional neighborhood. NGTD features provide fundamental texture properties, such as coarseness, contrast, busyness, complexity, and texture strength³³. Of the GLCM features, variance, cluster shade, cluster tendency, and cluster prominence varied highly across different MRI scanning settings for both the volunteer and phantom, implying that these features are associated with poor robustness. Yang et al.³⁴ investigated on the impact of contouring variability on PET radiomics features in the lung. They reported that the impact of contouring variability is present to varying degrees. In this study, we used the same uniform ROI size for both the volunteer and phantom. Our results showed that some features vary more than other features with different settings. The reason is that each feature has its own formula to express its characteristics of the image and some features are dealing with pixel-wise changes such as NGTD features that describe the differences between each voxel and the neighboring voxels, while other features are dealing with overall (average) changes in an image such as sum average that quantify the mean of the sum histogram of an image. Although NGTD features and these four GLCM features are sensitive to different scanning parameters, they have high reproducibility if the parameters are kept the same. These features, therefore, may be useful for intrascanner studies with fixed protocol settings.

In this study, we performed several scans with various scanning protocol parameters such as NEX, slice thickness, phasing steps, and FOV for T1 and T2 respectively with a multi-center scanner. We also performed all scans twice for each setting to evaluate the reliability of scans. Although we limited the number of scans, our results of repeatability showed highly reproducible. For the repeatability measures, we computed the ICC for radiomics features obtained using the two MRI scanners and showed that the repeatability for the phantom was very high (average ICC, 0.963 and 0.959 for T1- and T2-weighted images, respectively) but that the repeatability for the volunteer was intermediate (average ICC, 0.865 and 0.849 for T1- and T2-weighted images, respectively). The repeatability of the volunteer is slightly lower than that of the phantom. This is not surprising, as humans have factors such as patient movement, respiration, and blood flow that can affect radiomics features, and also highlights the fact that phantom measurements alone are not sufficient for understanding variabilities in MRI-based radiomics features. Also, we showed that for the volunteer, the overall repeatability for T1-weighted images was slightly lower than that for T2-weighted images. Of note is that 39 radiomics features were highly reproducible for T1-weighted images of the volunteer, and 41 radiomics features were highly reproducible for T2-weighted images. The variability results for the normalization and filtering effect (Table 4) did not show much difference between them in average COV values.

We also found that radiomics features have different effects depending on the scanning parameters, which similar studies by other groups also demonstrated. For example, Ford et al.³⁵ investigated the impact of pulse sequence parameter selection (i.e., echo time [TE] and repetition time [TR]) on MRI textural features of the brain. They found that the variability in radiomics features with the choice of pulse sequence and imaging parameters was feature-dependent and can be substantial. In another study, Saha et al.¹⁷ assessed the impact of various MRI scanner parameters on the radiomics features in breast MRI studies. They found that the feature group related to variation in fibroglandular tissue enhancement was the most sensitive to the scanner manufacturer and parameters.

Our study had some limitations. First, we could not remove the effect of the volunteer’s movement including blood flow, which influences radiomics feature values. We sought to minimize this effect by using an immobilization mask to fix the volunteer’s head in place during the scan. Also, we simulated a movement effect with the phantom on an MR image. For example, we shifted an image 1 mm to the right and generated a new image by averaging this shifted image with the original image to simulate an image for NEX 2. However, this simulation study did not change the radiomics feature values and does not explain the effect of the volunteer’s motion artifacts including blood flow. For repeatability measures, we took about a 30-min break between two scans for the volunteer. This may have resulted in uncertainties when the volunteer returned to the original position. In this study, we performed image preprocessing to reduce uncertainty in the feature analysis and used a uniform ROI size. However, there is an uncertainty remaining in the lesion segmentation procedure of the patient data, which may affect the suitability test for our phantom materials. Lastly, it should be noted that our previous study and other work reported volume-dependent and gray level-dependent features^22,36, respectively. In the current study, Tables S2 and S3 are provided in the supplementary information to show the corrected formulas along with the original formulas for the volume-dependent and the gray level-dependent GLCM features, respectively. In this study, corrected formulas were used for the volume-dependent features (Table S2) but original formulas were used for the gray level-dependent GLCM features (Table S3). Please note that since our analysis is based on the same gray levels with various MRI parameter settings for GLCM features, different gray levels with different MRI parameter settings could have different results although the GLCM features in the large variation (COV > 30%) would still be in the same category. Also, it should be noted that our repeatability test will not be affected since the repeatability analysis used the same parameter settings.

In this study, we aimed to identify the robustness of MRI radiomics features with various scanning parameters and multi-scanner variation using an MRI radiomics phantom, which is very useful for calibrating, testing, and evaluating new MRI techniques and variability and repeatability measurements. In this study, we focused on the scanning parameters such as NEX, slice thickness, phasing steps, and FOV, which are the most commonly used in MRI scanning and we fixed all other parameters including filtering, smoothing, and coil sensitivity to avoid introducing other uncertainty factors in this study. We showed that all of the materials in the phantom were suitable by comparing its radiomics features with the patient data from the 97 T1- and T2-weighted MR images and investigated the robustness of various radiomics features with different MRI scanning protocols and two scanners.

Conclusions

In the present work, an MRI phantom was constructed with 20 MRI materials covering a wide range of radiomics feature values and several scans were performed with various scanning protocol parameters such as NEX, slice thickness, phasing steps, and FOV for T1 and T2 respectively. The ICC showed high repeatability for the phantom but intermediate repeatability for the volunteer, while the COV revealed little difference in variability between normalization and filtering effect.

We believe that this study is very useful for practice in the radiomics community, especially in MRI radiomics studies. Our results demonstrated that various radiomics features have different effects depending on the different scanning parameters and scanners. Furthermore, we identified the robust MRI radiomics features with various scanning parameters and multi-scanner variation using an MRI radiomics phantom. The radiomics features with a low COV and high ICC can be considered good candidates for MRI radiomics studies, whereas those with a high COV and low ICC must be used with caution.

Methods

MRI phantom and volunteer

An MRI phantom was created and used to investigate the repeatability and robustness in quantitative radiomics features with various MRI scanning protocol parameters, preprocessing (normalization and image filtering), and scanners. Figure 4 shows the MRI phantom, which was made of acrylic with dimensions of 14.5 × 17.8 × 10.3 cm. Inside the phantom, there were 20 cylinders and each cylinder had a diameter of 2.4 cm and length of 10.3 cm. The phantom could be filled with water through the hole on top of it (Fig. 4A). The MRI phantom was constructed of 20 MRI materials covering a wide range of radiomics feature values (Table 7).

Table 7 The 20 materials used in the MRI phantom.

Full size table

The phantom and the brain of the healthy volunteer were scanned using a 1.5 T Siemens MRI system (SIEMENS Magnetom Aera, Erlangen, Germany) with three-dimensional T1-weighted gradient echo sequence and T2-weighted fast spin echo sequence. A fixed TR (11 ms) and TE (4.77 ms) and flip angle of 30° with various scanning protocol parameters were used for T1-weighted images. For T2-weighted images, a TE of 281 ms, TR of 1530 ms, and flip angle of 160° with various scanning protocols were used. For comparison, scanning of the MRI phantom was also performed using a 1.5 T Philips MRI system (PHILIPS Marlin, Finland). For this scanner, a fixed TR (11 ms) and TE (4.61 ms) and flip angle of 30° were used for T1-weighted images, and a TE of 281 ms, TR of 1535 ms, and flip angle of 90° were used for T2-weighted images. We then varied the following scanning protocol parameters: number of excitation (NEX), slice thickness, phasing steps, and field of view (FOV). The detailed scanning protocols are listed in Table 8. Each scan was performed twice with the same setting for both scanners for the repeatability test. The phantom was removed from the scanner after the first scan and repositioned for the second scan. For the volunteer, the scan was also performed twice with the same setting and the volunteer took about a 30-min break between the two scans. The scans were performed each week for multi-scanner variability. In order to determine the variability from different scanning parameters and scanners accurately, we did not perform any intensity normalization on MR images to prevent another uncertainty on radiomics features or diminishing the effects of various scanning settings.

Table 8 The scanning protocols used with the Siemens and Philips 1.5 T MRI scanners.

Full size table

Patient data for the suitability test

First, we investigated the suitability of our phantom materials with brain lesions. A total of 97 patient data identified as having necrosis or progression of brain lesions were used to evaluate the suitability of each phantom material³⁷. The use of all patient data were approved and written informed consent was waived by The MD Anderson Cancer Center Institutional Review Board. All MR images of these patients were acquired using a GE 1.5 T MRI scanner with a slice thickness of 5 mm, slice spacing of 6.5 mm, and field-of-view of 22 cm for T1- and T2-weighted images. The brain lesions were segmented on the post-contrast T1 images by a radiation oncologist because the lesions were easier to identify. The post-contrast T1 contour was then rigidly mapped to the other scan sequences such as pre-contrast T1- and T2-weighted images for each patient at each time point using the Velocity AI software (version 3.0.1; Varian Medical Systems, Atlanta, GA, USA).

Phantom and a healthy volunteer data for the repeatability and variability

For the repeatability and variability of the radiomics features, we used the features from the phantom and a healthy volunteer from two scans. All ROIs on the phantom and a healthy volunteer were delineated semiautomatically using a contour tool available with our in-house imaging software program IBEX^23,38. Each ROI had a cylindrical shape with a diameter of 1.8 cm and a height of 10 cm for both the phantom and the volunteer. We used axial images where the height is along the z-axis. We used this uniform ROI size on MR images of the phantom and the volunteer to avoid uncertainty between the ROI size and radiomics features. Twenty ROIs on the phantom and volunteer’s brain were delineated (Fig. 4B,C, respectively); Twenty ROIs on a healthy volunteer’s brain were evenly selected over the brain. For patient data, each lesion on MR images for each patient was delineated by ValocityAI software (version 3.0.1; Varian Medical Systems, Atlanta, GA, USA). The radiation oncologist reviewed the contours on the MR images to ensure correct mapping and modified them when necessary.

In this study, we performed image preprocessing before extracting radiomics features to reduce uncertainty in the feature analysis; an edge-preserving smoothing filter was applied to the tumor volume before the feature calculations to preserve meaningful edge information while smoothing out undesirable imaging noise²⁹. Then, we extracted a total 76 radiomics features from delineated ROIs from MR images of the phantom, volunteer, and patients, respectively. The radiomics features consisted of 7 Gradient orient histogram features, 22 GLCM features, 11 GLRL features, 31 intensity features, 5 neighborhood gray-tone matrix (NGTDM). The detailed features are listed in Table 9 and Table S1 in the supplementary information. All quantitative image features were calculated and extracted using IBEX^23,38,39. This software was designed based on MATLAB (version 8.1.0; MathWorks, Natick, MA), and available at http://bit.ly/IBEX_MDAnderson. Our previous study and other work reported volume dependent and gray level dependent features^22,36. In this study, corrected formulas were used for the volume-dependent features and original formulas were used for the gray level-dependent GLCM features as shown in the Table S2 and S3.

Table 9 The examined radiomics features extracted from delineated ROIs on MR images.

Full size table

Data analysis

First, we investigated the suitability of each phantom material to see whether the range of radiomics features of each material was similar to the range of radiomics features of the brain lesions of patients. This was done by comparing each feature value from the phantom with those from brain lesions using mean values ± 2 SDs, where this range covers 95% of an approximately normal data set and excludes outliers of the data. This brain lesions of patients only used for the suitability of the phantom materials. Next, we investigated the robustness of the radiomics features obtained from the 20 phantom materials in T1- and T2-weighted images using various scanning protocols and the two scanners. To assess the robustness of the various radiomics features with the different MRI scanning protocol parameters, the COV was computed for each radiomics feature in each scan using Eq. (1)

$$\mathrm{COV}=\frac{\sigma }{\mu }\times 100$$

(1)

where σ is the standard deviation and μ is the mean when applying different scanning settings for each MRI parameter (i.e., NEX = 1, 2, and 3).

Next, the repeatability of the radiomics features in two scans was investigated. This was performed with the Siemens 1.5 T MRI scanner twice under the same conditions, such as the same range of whole scanning parameter settings. The repeatability of the radiomics features extracted from normalized images was assessed using the ICC, a measure of the reliability of measurements that can demonstrate how strongly measurements with the same settings resemble each other. For our test–retest scheme with two repeated scans, the ICC was computed using Eq. (2) ⁴⁰

$$ICC\left(\mathrm{1,1}\right)= \frac{BMS-WMS}{BMS+WMS}$$

(2)

where BMS is the between-subjects mean square and WMS is the within-subjects mean square. Therefore, the ICC considers the variation in repeated scans in relation to the total variation in the population⁴⁰.

Data availability

The datasets generated during and/or analyzed during the current study are available from the corresponding author on reasonable request.

References

Kumar, V. et al. Radiomics: The process and the challenges. Magn. Reson. Imaging 30, 1234–1248. https://doi.org/10.1016/j.mri.2012.06.010 (2012).
Article PubMed PubMed Central Google Scholar
Lambin, P. et al. Radiomics: Extracting more information from medical images using advanced feature analysis. Eur. J. Cancer 48, 441–446. https://doi.org/10.1016/j.ejca.2011.11.036 (2012).
Article PubMed PubMed Central Google Scholar
Aerts, H. J. et al. Decoding tumour phenotype by noninvasive imaging using a quantitative radiomics approach. Nat. Commun. 5, 4006. https://doi.org/10.1038/ncomms5006 (2014).
Article CAS PubMed PubMed Central ADS Google Scholar
Gillies, R. J., Kinahan, P. E. & Hricak, H. Radiomics: Images are more than pictures, they are data. Radiology 278, 563–577. https://doi.org/10.1148/radiol.2015151169 (2016).
Article PubMed Google Scholar
Yuan, M. et al. Prognostic impact of the findings on thin-section computed tomography in stage I lung adenocarcinoma with visceral pleural invasion. Sci. Rep. 8, 4743. https://doi.org/10.1038/s41598-018-22853-1 (2018).
Article CAS PubMed PubMed Central ADS Google Scholar
Oikonomou, A. et al. Radiomics analysis at PET/CT contributes to prognosis of recurrence and survival in lung cancer treated with stereotactic body radiotherapy. Sci. Rep. 8, 4003. https://doi.org/10.1038/s41598-018-22357-y (2018).
Article CAS PubMed PubMed Central ADS Google Scholar
Kirienko, M. et al. Prediction of disease-free survival by the PET/CT radiomic signature in non-small cell lung cancer patients undergoing surgery. Eur. J. Nucl. Med. Mol. Imaging 45, 207–217. https://doi.org/10.1007/s00259-017-3837-7 (2018).
Article PubMed Google Scholar
Lee, J. et al. Texture feature ratios from relative CBV maps of perfusion MRI are associated with patient survival in glioblastoma. AJNR Am. J. Neuroradiol. 37, 37–43. https://doi.org/10.3174/ajnr.A4534 (2016).
Article CAS PubMed Google Scholar
Arita, H. et al. Lesion location implemented magnetic resonance imaging radiomics for predicting IDH and TERT promoter mutations in grade II/III gliomas. Sci. Rep. 8, 11773. https://doi.org/10.1038/s41598-018-30273-4 (2018).
Article CAS PubMed PubMed Central ADS Google Scholar
Guo, J. et al. MR-based radiomics signature in differentiating ocular adnexal lymphoma from idiopathic orbital inflammation. Eur. Radiol. 28, 3872–3881. https://doi.org/10.1007/s00330-018-5381-7 (2018).
Article PubMed Google Scholar
Kim, S., Kim, M. J., Kim, E. K., Yoon, J. H. & Park, V. Y. MRI radiomic features: Association with disease-free survival in patients with triple-negative breast cancer. Sci. Rep. 10, 3750. https://doi.org/10.1038/s41598-020-60822-9 (2020).
Article CAS PubMed PubMed Central ADS Google Scholar
Park, J. E. et al. Radiomics prognostication model in glioblastoma using diffusion- and perfusion-weighted MRI. Sci. Rep. 10, 4250. https://doi.org/10.1038/s41598-020-61178-w (2020).
Article CAS PubMed PubMed Central ADS Google Scholar
Yan, J. et al. Impact of image reconstruction settings on texture features in 18F-FDG PET. J. Nucl. Med. 56, 1667–1673. https://doi.org/10.2967/jnumed.115.156927 (2015).
Article CAS PubMed Google Scholar
Galavis, P. E., Hollensen, C., Jallow, N., Paliwal, B. & Jeraj, R. Variability of textural features in FDG PET images due to different acquisition modes and reconstruction parameters. Acta Oncol. 49, 1012–1016. https://doi.org/10.3109/0284186X.2010.498437 (2010).
Article PubMed PubMed Central Google Scholar
Veeraraghavan, H. et al. Appearance constrained semi-automatic segmentation from DCE-MRI is reproducible and feasible for breast cancer radiomics: A feasibility study. Sci. Rep. 8, 4838. https://doi.org/10.1038/s41598-018-22980-9 (2018).
Article CAS PubMed PubMed Central ADS Google Scholar
Ger, R. B. et al. Quantitative image feature variability amongst CT scanners with a controlled scan protocol. Proc. Spie https://doi.org/10.1117/12.2293701 (2018).
Article Google Scholar
Saha, A., Yu, X. Z., Sahoo, D. & Mazurowski, M. A. Effects of MRI scanner parameters on breast cancer radiomics. Expert Syst. Appl. 87, 384–391. https://doi.org/10.1016/j.eswa.2017.06.029 (2017).
Article PubMed PubMed Central Google Scholar
Chirra, P. et al. Empirical evaluation of cross-site reproducibility in radiomic features for characterizing prostate MRI. Proc. Spie https://doi.org/10.1117/12.2293992 (2018).
Article Google Scholar
Hu, P. et al. Reproducibility with repeat CT in radiomics study for rectal cancer. Oncotarget 7, 71440–71446. https://doi.org/10.18632/oncotarget.12199 (2016).
Article PubMed PubMed Central Google Scholar
Peerlings, J. et al. Stability of radiomics features in apparent diffusion coefficient maps from a multi-centre test–retest trial. Sci. Rep. 9, 4800. https://doi.org/10.1038/s41598-019-41344-5 (2019).
Article CAS PubMed PubMed Central ADS Google Scholar
Schwier, M. et al. Repeatability of multiparametric prostate MRI radiomics features. Sci. Rep. 9, 9441. https://doi.org/10.1038/s41598-019-45766-z (2019).
Article CAS PubMed PubMed Central ADS Google Scholar
Fave, X. et al. Impact of image preprocessing on the volume dependence and prognostic potential of radiomics features in non-small cell lung cancer. Transl. Cancer Res. 5, 349–363. https://doi.org/10.21037/tcr.2016.07.11 (2016).
Article CAS Google Scholar
Fave, X. et al. Can radiomics features be reproducibly measured from CBCT images for patients with non-small cell lung cancer?. Med. Phys. 42, 6784–6797. https://doi.org/10.1118/1.4934826 (2015).
Article PubMed PubMed Central Google Scholar
Mackin, D. et al. Effect of tube current on computed tomography radiomic features. Sci. Rep. 8, 2354. https://doi.org/10.1038/s41598-018-20713-6 (2018).
Article CAS PubMed PubMed Central ADS Google Scholar
Shiri, I. et al. The impact of image reconstruction settings on 18F-FDG PET radiomic features: Multi-scanner phantom and patient studies. Eur. Radiol. 27, 4498–4509. https://doi.org/10.1007/s00330-017-4859-z (2017).
Article PubMed Google Scholar
Bailly, C. et al. Revisiting the robustness of PET-based textural features in the context of multi-centric trials. PLoS ONE 11, e0159984. https://doi.org/10.1371/journal.pone.0159984 (2016).
Article CAS PubMed PubMed Central Google Scholar
Collewet, G., Strzelecki, M. & Mariette, F. Influence of MRI acquisition protocols and image intensity normalization methods on texture classification. Magn. Reson. Imaging 22, 81–91. https://doi.org/10.1016/j.mri.2003.09.001 (2004).
Article CAS PubMed Google Scholar
Butterworth, S. J. W. E. On the theory of filter amplifiers. Sci. Res. 7, 536–541 (1930).
Google Scholar
Branco, L. R. F. et al. Technical note: Proof of concept for radiomics-based quality assurance for computed tomography. J. Appl. Clin. Med. Phys. 20, 199–205. https://doi.org/10.1002/acm2.12750 (2019).
Article PubMed PubMed Central Google Scholar
Tanaka, S. et al. Investigation of thoracic four-dimensional CT-based dimension reduction technique for extracting the robust radiomic features. Phys. Med. 58, 141–148. https://doi.org/10.1016/j.ejmp.2019.02.009 (2019).
Article PubMed Google Scholar
Koo, T. K. & Li, M. Y. A guideline of selecting and reporting intraclass correlation coefficients for reliability research. J. Chiropr. Med. 15, 155–163. https://doi.org/10.1016/j.jcm.2016.02.012 (2016).
Article PubMed PubMed Central Google Scholar
Oliver, J. A. et al. Variability of image features computed from conventional and respiratory-gated PET/CT images of lung cancer. Transl. Oncol. 8, 524–534. https://doi.org/10.1016/j.tranon.2015.11.013 (2015).
Article PubMed PubMed Central Google Scholar
Amadasun, M. & King, R. Textural features corresponding to textural properties. IEEE T. Syst. Man. Cybn. 19, 1264–1274. https://doi.org/10.1109/21.44046 (1989).
Article Google Scholar
Yang, F. et al. Impact of contouring variability on oncological PET radiomics features in the lung. Sci. Rep. 10, 369. https://doi.org/10.1038/s41598-019-57171-7 (2020).
Article CAS PubMed PubMed Central ADS Google Scholar
Ford, J., Dogan, N., Young, L. & Yang, F. Quantitative radiomics: Impact of pulse sequence parameter selection on mri-based textural features of the brain. Contrast Media Mol. Imaging 2018, 1729071. https://doi.org/10.1155/2018/1729071 (2018).
Article CAS PubMed PubMed Central Google Scholar
Shafiq-Ul-Hassan, M. et al. Intrinsic dependencies of CT radiomic features on voxel size and number of gray levels. Med. Phys. 44, 1050–1062. https://doi.org/10.1002/mp.12123 (2017).
Article CAS PubMed PubMed Central Google Scholar
Zhang, Z. et al. A predictive model for distinguishing radiation necrosis from tumour progression after gamma knife radiosurgery based on radiomic features from MR images. Eur. Radiol. 28, 2255–2263. https://doi.org/10.1007/s00330-017-5154-8 (2018).
Article PubMed Google Scholar
Zhang, L. et al. IBEX: An open infrastructure software platform to facilitate collaborative work in radiomics. Med. Phys. 42, 1341–1353. https://doi.org/10.1118/1.4908210 (2015).
Article PubMed PubMed Central Google Scholar
Fave, X. et al. Delta-radiomics: The prognostic value of therapy-induced changes in radiomics features for stage III non-small cell lung cancer patients. Med. Phys. 43, 3750–3750. https://doi.org/10.1118/1.4957510 (2016).
Article Google Scholar
Shrout, P. E. & Fleiss, J. L. Intraclass correlations—uses in assessing rater reliability. Psychol. Bull. 86, 420–428. https://doi.org/10.1037//0033-2909.86.2.420 (1979).
Article CAS PubMed Google Scholar

Download references

Acknowledgements

The authors would like to thank Donald Norwood of MD Anderson’s Department of Scientific Publications for scientific editing and suggestions. The funding for this work was provided by the generous support from the Scurlock Foundation to the Center for Radiation Oncology Research at the University of Texas MD Anderson Cancer Center.

Author information

Authors and Affiliations

Department of Radiation Physics, Unit 1420, The University of Texas MD Anderson Cancer Center, 1515 Holcombe Boulevard, Houston, TX, 77030, USA
Joonsang Lee, Angela Steinmann, Yao Ding, Hannah Lee, Constance Owens, Jihong Wang, Jinzhong Yang, David Followill, Rachel Ger, Dennis MacKin & Laurence E. Court
Department of Computational Medicine and Bioinformatics, University of Michigan, 500 S State Street, Ann Arbor, MI, 48109, USA
Joonsang Lee

Authors

Joonsang Lee
View author publications
You can also search for this author in PubMed Google Scholar
Angela Steinmann
View author publications
You can also search for this author in PubMed Google Scholar
Yao Ding
View author publications
You can also search for this author in PubMed Google Scholar
Hannah Lee
View author publications
You can also search for this author in PubMed Google Scholar
Constance Owens
View author publications
You can also search for this author in PubMed Google Scholar
Jihong Wang
View author publications
You can also search for this author in PubMed Google Scholar
Jinzhong Yang
View author publications
You can also search for this author in PubMed Google Scholar
David Followill
View author publications
You can also search for this author in PubMed Google Scholar
Rachel Ger
View author publications
You can also search for this author in PubMed Google Scholar
Dennis MacKin
View author publications
You can also search for this author in PubMed Google Scholar
Laurence E. Court
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Project conception and design was by J.L., J.W., J.Y., D.F., R.G., D.M., and L.C. The phantom design was by J.L., A.S., D.F., and L.C. The data collection was performed by J.L.., Y.D., H.L., J.Y., and C.O. The software programming, statistical analysis and interpretation were performed by J.L., and L.C. The manuscript was written by J.L. and L.C.

Corresponding author

Correspondence to Laurence E. Court.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Lee, J., Steinmann, A., Ding, Y. et al. Radiomics feature robustness as measured using an MRI phantom. Sci Rep 11, 3973 (2021). https://doi.org/10.1038/s41598-021-83593-3

Download citation

Received: 22 October 2018
Accepted: 15 January 2021
Published: 17 February 2021
DOI: https://doi.org/10.1038/s41598-021-83593-3

This article is cited by

Impact of harmonization on the reproducibility of MRI radiomic features when using different scanners, acquisition parameters, and image pre-processing techniques: a phantom study
- Ghasem Hajianfar
- Seyyed Ali Hosseini
- Habib Zaidi
Medical & Biological Engineering & Computing (2024)
Comparison of Machine Learning Models Using Diffusion-Weighted Images for Pathological Grade of Intrahepatic Mass-Forming Cholangiocarcinoma
- Li-Hong Xing
- Shu-Ping Wang
- Xiao-Ping Yin
Journal of Imaging Informatics in Medicine (2024)
Comparing effectiveness of image perturbation and test retest imaging in improving radiomic model reliability
- Jiang Zhang
- Xinzhi Teng
- Jing Cai
Scientific Reports (2023)
An Explainable MRI-Radiomic Quantum Neural Network to Differentiate Between Large Brain Metastases and High-Grade Glioma Using Quantum Annealing for Feature Selection
- Tony Felefly
- Camille Roukoz
- Ziad Francis
Journal of Digital Imaging (2023)
Achieving imaging and computational reproducibility on multiparametric MRI radiomics features in brain tumor diagnosis: phantom and clinical validation
- E.-Nae Cheong
- Ji Eun Park
- Ho Sung Kim
European Radiology (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Discussion

Conclusions

Methods

MRI phantom and volunteer

Patient data for the suitability test

Phantom and a healthy volunteer data for the repeatability and variability

Data analysis

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links