Introduction

Hypertrophic cardiomyopathy (HCM) is a genetically determined disease that affects about 1:500 people in the general adult population1. It is characterized by left ventricular (LV) hypertrophy, myofibrillar disarray, and myocardial fibrosis. Cardiac magnetic resonance (CMR) imaging has an important role in diagnosis, risk stratification, and treatment planning in HCM and, according to the European Society of Cardiology guidelines, it should be performed, at least as an initial evaluation, for all HCM patients2.

Actually, CMR imaging is considered the gold standard for the evaluation of ventricular morphology and function, as well as for the assessment of ventricular wall thickness3,4. Furthermore, late gadolinium enhancement (LGE) imaging enables the identification of focal myocardial fibrosis, which has been demonstrated to have a negative prognostic value in HCM patients3,4. More recently, cardiac T1 and T2 mapping techniques have been employed to obtain a more quantitative evaluation of myocardial tissue characteristics. Previous studies have consistently found that T1 and T2 values, as well as extracellular volume (ECV) values obtained from pre- and post-contrast T1 maps, are increased in HCM patients and have a potential prognostic role5,6,7,8,9,10,11. Actually, both native (i.e., without contrast administration) T1 and ECV values are considered markers of myocardial fibrosis12. On the other hand, T2 mapping is currently considered the gold standard for the evaluation of myocardial edema. Increased T2 values, probably related to low grade myocardial inflammation and dilated lymphatic channels13,14, have been demonstrated in HCM patients and have been proven to be associated with an increased arrhythmic risk15,16, as well as with more severe myocardial injury17. However, the usefulness of these mapping techniques is limited by a partial overlap of T1 and T2 values between HCM patients and healthy controls, as well as between HCM patients and patients with other myocardial diseases18,19,20,21.

Radiomics is a novel tool that involves the extraction of a large number of quantitative morphological and textural characteristics (i.e., radiomic features) from digital medical images22. The underlying idea is that medical images are data containing objective and quantitative information, which is not obtainable from qualitative visual inspection as usually performed in routine clinical practice. In addition to other available data from demographics, pathology, blood biomarkers, and genomics, radiomic features can be used for diagnostic, prognostic or predictive purposes exploiting statistical or machine learning (ML) methods22,23,24.

While radiomic analysis has been mainly applied in the field of oncology, there is a growing interest in improving the diagnostic accuracy and prognostic value of CMR imaging by exploiting radiomic techniques25,26,27,28,29,30. Recently, radiomics has aroused as a useful tool for unveiling myocardial tissue characteristics in HCM patients18,19,20,21,31,32,33,34,35,36. In this regard, previous studies focused mainly on traditional T1-weighted, T2-weighted, cine, and LGE images18,31,32,33,34,35. However, given their inherent quantitative nature, T1 and T2 maps might be particularly suitable for radiomic analysis. So far, only a few studies have applied texture analysis to T1 maps in patients with HCM19,20,21,36. They have found texture features able to discriminate among hypertensive heart disease (HHD), HCM, and healthy subjects19,21, between HCM patients with different genetic mutations20, as well as between LGE+ and LGE− patients36.

Despite the increasing interest in radiomics of CMR imaging, its proper application deserves some caution and a preliminary assessment of possible radiomic features dependence on various factors. Indeed, each step of the radiomic workflow (i.e., image acquisition and reconstruction, image segmentation, image preprocessing, image filtering, and feature extraction) could influence features estimation, thus potentially affecting their discriminative or predictive power37,38. Several previous studies have assessed the sensitivity of radiomic features estimation to various elements in computed tomography (CT)39,40,41 and nuclear medicine (NM)42,43 imaging, as well as in magnetic resonance imaging (MRI), for various applications44,45,46,47,48,49,50,51,52,53,54,55. On the other hand, only three studies have assessed some of these issues for CMR imaging56,57,58. In particular, Jang et al.56 have investigated both test–retest repeatability, as well as inter- and intra-observer reproducibility, in phantoms, healthy participants, and patients who were referred for clinical CMR imaging. They have observed different repeatability patterns among acquisition sequences in in vivo test–retest studies. Moreover, in an inter- and intra-observer reproducibility analysis, 32–47% and 61–73% out of 1023 features, respectively, were identified as reproducible, showing intraclass correlation coefficient (ICC) ≥ 0.8. Across all analyses, first order and gray level co-occurrence matrix were the most frequently identified reproducible radiomic feature classes. Another CMR imaging study by Jang et al.57 has evaluated, in a group of healthy participants and in a heterogeneous group of patients, the sensitivity of radiomic features estimate to changes in acquisition sequence parameters including flip angle, in-plane spatial resolution, slice thickness, and parallel imaging technique. As a whole, they have found that approximately 60% of the 4007 considered radiomic features were robust to changes in any acquisition parameters (i.e., standardized Cohen’s mean difference < 0.2), with qualitative acquisition sequences (i.e., cine balanced steady-state free-precession, T1-weighted spoiled gradient-echo, T2-weighted turbo spin-echo) and quantitative T1/T2 mapping most sensitive to changes in in-plane spatial resolution and flip angle, respectively. Alis et al.58 have evaluated the inter-observer reproducibility of radiomic features, as well as the influence of cardiac phases (i.e., systole, diastole or a 4D stack of cine images covering the cardiac cycle) on radiomic features estimation, from non-enhanced cine CMR images of healthy subjects. For the inter-observer analysis, they have found that the number of radiomic features with ICC > 0.8/0.85/0.9 changed with different phases of the cardiac cycle, and that texture features belonging to the gray level size zone matrix showed the worst performance. Moreover, approximately 55% out of 352 radiomic features showed a variability (in terms of coefficient of variation) > 20% through the cardiac cycle, with gray level dependence matrix and gray level run length matrix classes demonstrating less variability compared with other feature classes.

Image resampling and discretization, as well as spatial filtering, are some steps of the radiomic workflow that may be performed as preprocessing before the radiomic features extraction from the acquired image data59,60. In particular, image interpolation at the same voxel size is a common and recommended practice (especially in retrospective studies) to reduce any heterogeneity in acquisition voxel size, while image discretization is required to make texture feature estimation computationally less burdensome42,59. Moreover, the application of spatial filters before radiomic features estimation could allow uncovering further tissue characteristics. Notably, such a preprocessing actually alters acquired image data and possibly their radiomic characteristics. Nonetheless, so far, no work has specifically assessed the impact of both image resampling/discretization and spatial filtering on the estimate of radiomic features from CMR imaging. Therefore, this work aimed to comprehensively assess, for the first time, the effect of preprocessing—in terms of voxel size resampling, map values discretization, and spatial filtering—on radiomic features estimated from CMR T1 and T2 mapping in a group of HCM patients.

Methods

Subjects

We retrospectively identified 26 subjects (12 females, 14 males), referred for clinical CMR imaging between November 2013 and July 2020 for known or suspected HCM, for whom a comprehensive MRI study including both T1 and T2 mapping sequences was performed. The HCM diagnoses followed the latest European Society of Cardiology guidelines and were based on the presence of LV wall thickness ≥ 15 mm in one or more myocardial segments, not explained solely by loading conditions2. Clinical characteristics of the patient group are detailed in Table 1. The study was approved by the local ethics committee of the Azienda USL Toscana Nord Ovest (Pisa, Italy). Written informed consent was obtained from all subjects, and all experiments were performed in accordance with relevant guidelines and regulations.

Table 1 Clinical and CMR-derived chacteristics (mean ± standard deviation) of 26 patients with HCM.

CMR imaging

All CMR imaging examinations were performed using a 1.5 T MRI scanner system (MAGNETOM Avanto, Siemens Healthcare, Erlangen, Germany) equipped with 45 mT/m gradients strength and a 12-channel phased array coil.

Cine images were performed in the 2- and 4-chamber view planes (3 slices each) and in short-axis view (8–14 slices encompassing the entire LV), using a TrueFISP sequence (TR = 2.5 ms, TE = 1.2 ms, slice thickness = 8 mm).

Both T1 and T2 maps were obtained in the short axis view (a single slice located where myocardial thickness, evaluated with cine images, was maximum). T1 mapping was performed utilizing a modified look-locker inversion recovery (MOLLI) pulse sequence with a 3–3–5 acquisition scheme61. Pulse sequence parameters were as follows: TE/TR = 1.14/2.5 ms, flip angle = 35°, matrix size = 126 × 192, in-plane resolution ranged from 1.77 mm × 1.77 mm to 2.34 mm × 2.34 mm, typical field of view = 380 mm × 275 mm, slice thickness = 8 mm. T2 maps were obtained utilizing a T2-prepared TrueFISP sequence62 with the following parameters: T2 preparation time = 0/24/55 ms, TR = 4 × RR, flip angle = 70°, matrix size = 126 × 192, in-plane resolution ranged from 1.77 × 1.77 mm to 2.34 × 2.34 mm, typical field of view = 380 mm × 275 mm, slice thickness = 6 mm.

Then, 10–15 min after intravenous administration of 0.2 mmol/kg of gadolinium DTPA (Magnevist, Schering), gadoteric acid (Dotarem, Guerbet) or gadoteridol (Prohance, Bracco), LGE images were acquired in 2- and 4-chamber view of the LV (3 slices each), as well as in the short-axis view (8–14 slices encompassing the entire LV) using a 2D phase-sensitive inversion recovery (PSIR) sequence (TR = 700 ms, TE = 1.09 ms, slice thickness = 8 mm, inversion time = 200–300 ms, typical in-plane resolution = 2.4 mm × 3.2 mm).

Preprocessing of T1 and T2 maps

For each subject, a region of interest (ROI) covering the entire myocardium was manually segmented by a single expert radiologist in cardiac MRI, with 15 years of experience, using 3D Slicer (Version 4.11.2)63,64. ROIs were delineated on T1 and T2 maps separately, avoiding voxels with potential partial volume effect. Supplementary Figure S1 shows an example myocardium ROI for a representative HCM patient.

Given that T1 and T2 maps allowed acquiring only a single slice, resampling voxel size was performed through 2D interpolation by using the B-spline interpolation algorithm (with the origins of interpolation and original image grids aligned together59). The original in-plane spatial resolution ranged from 1.77 mm × 1.77 mm to 2.34 mm × 2.34 mm across patients for both T1 and T2 maps. Therefore, T1 and T2 maps were resampled to an in-plane isotropic spatial resolution of 1.8 mm, 1.9 mm, 2.0 mm, 2.1 mm, 2.2 mm, 2.3 mm, and 2.4 mm. Resampling voxel size was performed for different bin widths (see below).

T1 and T2 maps discretization was carried out following a fixed bin width approach, as recommended by the Image Biomarker Standardisation Initiative (IBSI) when dealing with quantitative data59. In particular, for each resampling voxel size (i.e., 1.8, 1.9, 2, 2.1, 2.2, 2.3, and 2.4 mm in-plane isotropic spatial resolution), bin width values of 3.60 ms, 3.95 ms, 4.30 ms, 4.65 ms, 5.00 ms, 5.35 ms, 5.70 ms, 6.05 ms, and 6.40 ms were employed for T1 maps, while bin width values of 0.49 ms, 0.50 ms, 0.51 ms, 0.52 ms, 0.53 ms, 0.54 ms, 0.55 ms, 0.56 ms, and 0.57 ms were employed for T2 maps. Indeed, given that the median (across HCM patients) range of T1 and T2 values (i.e., the difference between the maximum and minimum T1 and T2 values of voxels within the ROI) was 265 ms and 26 ms, respectively, bin width values were chosen in such a way that the number of quantization levels of T1 and T2 maps was within the range of 30–130, for each HCM patient. This approach, adopted also in previous technical studies40,41,52,54, has the potential to limit variability in radiomic features estimation43,65,66.

Different spatial filters were applied on T1 and T2 maps: 2D wavelets (Daubechies 3) yielding four filtered maps (i.e., wavelet-LH, -HL, -HH, -LL, where L/H refers to the combination of low-/high-pass filters applied in the horizontal and vertical direction), gradient magnitude of the map (i.e., gradient filter), the square of the map values (i.e., square filter), and the square root of the absolute map value (i.e., square-root filter). Specifically, in order to restrict the large number of possible combinations, image filtering was performed at fixed resampling in-plane isotropic spatial resolution of 2.1 mm, and at fixed bin width of 6 ms and 0.56 ms for T1 and T2 maps, respectively, yielding median (across subjects) numbers of quantization levels between 30 and 13043,65,66.

All preprocessing steps (i.e., T1/T2 maps resampling, discretization, and spatial filtering) and subsequent radiomic features estimation were carried out by using the open source PyRadiomics library67 (Version 3.0.1) with Python (Version 3.7.3), running on a MacBook Air (macOS Version 10.14) with a 1.8 GHz Intel Core i5 CPU.

Radiomic features estimation

Given that the used acquisition sequences allowed obtaining T1 and T2 maps on a single slice, the 2D versions of radiomic features were considered. For each ROI and preprocessing combination, in terms of resampling voxel size and bin width, a total of 98 features were obtained: 9 2D-shape features, 16 first order features (14 intensity-based statistical features and 2 intensity histogram features, namely Entropy and Uniformity), and 73 second order features (i.e., textural features) from gray level co-occurrence matrix (GLCM, 22 features), gray level run length matrix (GLRLM, 16 features), gray level size zone matrix (GLSZM, 16 features), gray level dependence matrix (GLDM, 14 features, with coarseness parameter α = 0), and neighborhood gray tone difference matrix (NGTDM, 5 features). Second order features estimation was performed according to the Chebyshev norm with a distance of 1 pixel. GLCM and GLRLM features were computed from each 2D directional matrix (i.e., at 0°, 45°, 90°, and 135°) and averaged over 2D directions.

For each ROI and spatial filter applied on T1 and T2 maps, a total of 89 features were estimated, i.e. all the above features but the shape features. Indeed, given that shape features are usually estimated regardless of the applied image filter, they were not included in our analysis.

All radiomic features were computed in accordance with the definitions provided by the IBSI, with shape features computed in 2D instead of the proposed 3D version. It is worth noting that the first order feature of Kurtosis calculated by PyRadiomics was in accordance with the IBSI except for an offset value (i.e., 3).

Statistical analysis

In this study, three different effects on radiomic features estimation were assessed for T1 and T2 maps singularly:

  1. A.

    For each bin width, the effect of using different resampling voxel sizes.

  2. B.

    For each voxel size, the effect of using different bin widths.

  3. C.

    At fixed resampling voxel size and bin width, the effect of using different spatial filters.

For each effect of interest, any variability in radiomic features estimate was assessed through ICC analysis68,69. In particular, the two-way mixed effects model, with single rater and absolute agreement, was selected. Accordingly, the ICC coefficient was calculated as:

$$ICC = \frac{{MS_{R} - MS_{E} }}{{MS_{R} + (k - 1)MS_{E} + \frac{k}{n}(MS_{C} - MS_{E} )}}.$$
(1)

where MSR = mean square for rows (i.e., between subjects), MSE = mean square for error, MSC = mean square for columns (i.e., between measurements), n = number of subjects and k = number of raters, with ICC matrices realized considering each resampling voxel size, quantization bin width or image filter as a rater. ICC values range between 0 (maximum variability) and 1 (minimum variability) and express the variability of radiomic features estimate associated with the effect of interest with respect to the variance between subjects (i.e., a relative variability). Radiomic features were stratified based on their degree of relative variability37,52: high (ICC ≤ 0.5), considerable (0.5 < ICC ≤ 0.75), moderate (0.75 < ICC ≤ 0.9), and low (0.9 < ICC ≤ 1) relative variability.

In order to characterize in greater detail and absolutely the variability in radiomic feature estimate, an additional analysis of the coefficient of variation (CV) was performed. In particular, CVs were computed as the percentage ratio between standard deviation and absolute mean values of features estimate obtained by varying one element of interest (resampling voxel size, bin width or spatial filter) while fixing the others. Accordingly, this yielded 26 × 9 CVs for effect A (i.e., 26 subjects and 9 fixed bin widths), 26 × 7 CVs for effect B (i.e., 26 subjects and 7 fixed resampling voxel size), and 26 CVs for effect C (i.e., 26 subjects with fixed resampling voxel size and bin width).

Any linear correlation between radiomic features estimate and resampling voxel size or bin width was assessed by a repeated measures correlation analysis, namely rmcorr70. This technique accounts for non-independence among observations (i.e., repeated measures of the same radiomic features of the same subject by varying one preprocessing element) using analysis of covariance (ANCOVA) to statistically adjust for the variability between subjects. A p-value < 0.05, adjusted for multiple comparisons using Bonferroni correction, was considered significant.

Statistical analysis was carried out by using R (Version 3.6.2) software package in the RStudio (Version 1.2.5033) environment71.

Results

The entire time taken by the PyRadiomics tool to extract all the combinations of radiomic features for effects A or B was approximately 35 s, while the estimation process for effect C took approximately 10 s.

ICC results for the effects of interest A and B are reported in Figs. 1 and 2, respectively, for both T1 and T2 maps. ICC values were > 0.9 for radiomic features belonging to the shape class. In general, first and second order features estimated from T2 maps showed a lower relative variability than those estimated from T1 maps, especially for effect B where ICC values were always in moderate or low ranges (with few exceptions for some resampling voxel size values, i.e., GLSZM-LargeAreaLowGrayLevelEmphasis, GLSZM-SmallAreaLowGrayLevelEmphasis, and GLDM-LargeDependenceLowGrayLevelEmphasis). For both T1 and T2 maps, effect C resulted in higher relative variability (ICC ≤ 0.5) of radiomic features estimate as compared to effect A and B (see Supplementary Fig. S2).

Figure 1
figure 1

ICC results for effect A, for both T1 and T2 maps. The heatmap of each radiomic features class (i.e., shape, first order, GLCM, GLRLM, GLSZM, GLDM, and NGTDM) shows the degree of relative variability in radiomic features estimate when using different resampling voxel sizes for different bin widths (i.e., 3.60, 3.95, 4.30, 4.65, 5.00, 5.35, 5.70, 6.05, and 6.40 ms for T1 maps and 0.49, 0.50, 0.51, 0.52, 0.53, 0.54, 0.55, 0.56, and 0.57 ms for T2 maps).

Figure 2
figure 2

ICC results for effect B, for both T1 and T2 maps. The heatmap of each radiomic features class (i.e., shape, first order, GLCM, GLRLM, GLSZM, GLDM, and NGTDM) shows the degree of relative variability in radiomic features estimate when using different bin widths for different resampling voxel sizes (i.e., 1.8, 1.9, 2.0, 2.1, 2.2, 2.3, and 2.4 mm for both T1 and T2 maps).

For effects A and B, Table 2 summarizes the median (across subjects and bin widths/voxel sizes for effect A/B) CV values of radiomic features estimated from T1 and T2 maps. The CV was less than 40% for both effects as well as for T1 and T2 (with few exceptions for effect B assessed on T1 maps, i.e., GLCM-ClusterShade, GLCM-ClusterProminence, NGTDM-Complexity, and GLDM-SmallDependenceHighGrayLevelEmphasis). Specifically, for effect B assessed on T2 maps, this absolute variability was < 20%. Moreover, Figs. 3, 4a–c show radiomic features with both median (across subjects and bin widths/voxel sizes for effect A/B) CV > 15% and ICC > 0.75 for effect A/B assessed on T1 and T2 maps, respectively. For effect C, median (across subjects) CV values of radiomic features estimated from T1 and T2 maps are summarized in Supplementary Table S1, showing that the absolute variability associated with this effect was markedly greater than that associated with effects A and B.

Table 2 Median CV (%) of radiomic features estimate for effect A and B, for both T1 and T2 maps. The median CV was calculated across subjects and bin widths/voxel sizes for effect A/B.
Figure 3
figure 3

Box and whisker plots of CV (%) values of enrolled subjects for effect A assessed on T1 maps. Radiomic features with both median (across subjects) CV > 15% and ICC > 0.75 for each bin width (i.e., 3.60, 3.95, 4.30, 4.65, 5.00, 5.35, 5.70, 6.05, and 6.40 ms) are shown.

Figure 4
figure 4

Box and whisker plots of CV (%) values of enrolled subjects for effect B assessed on T1 maps (a) and for effect A (b)/B (c) assessed on T2 maps. (a,c) Radiomic features with both median (across subjects) CV > 15% and ICC > 0.75 for each resampling voxel size (i.e., 1.8, 1.9, 2.0, 2.1, 2.2, 2.3, and 2.4) are shown for T1 (a) and T2 maps (c). (b) Radiomic features computed from T2 maps with both median (across subjects) CV > 15% and ICC > 0.75 for each bin width (i.e., 0.49, 0.50, 0.51, 0.52, 0.53, 0.54, 0.55, 0.56, and 0.57 ms) are shown.

The results of the analysis of the linear correlation between radiomic features estimate and voxel size/bin width are reported in Supplementary Tables S2S5 for T1 and T2 maps, respectively. Most of the textural features showed a significant linear correlation with voxel size and bin width within the considered range of variation, with some exceptions (see Supplementary Tables S4) among radiomic features estimated from T2 maps when varying resampling voxel size (i.e., 6 GLCM, 7 GLRLM, 6 GLSZM, 6 GLDM, and 3 NGTDM radiomic features).

Discussion

Recently, some studies have applied radiomic analysis to CMR imaging in order to assess whether this tool could reveal myocardial phenotypic alterations in HCM patients18,19,20,21,31,32,33,34,35,36. By using conventional CMR imaging (i.e., T1-weighted, T2-weighted STIR, LGE, and cine images), radiomic analysis has allowed discriminating between HCM patients and healthy controls18,31, as well as distinguishing among different aetiologies of left ventricular hypertrophy, such as HCM, HHD, cardiac amyloid, and aortic stenosis32. Moreover, some studies have focused on the prognostic role of radiomic analysis of LGE images in HCM, suggesting that texture features are related to the arrhythmic risk33,35 and adverse clinical outcome34. Notably, only a few pilot HCM studies19,20,21,36 have applied radiomic analysis to T1 maps. Specifically, two studies19,21 have focused on the diagnostic capability of radiomic analysis to discriminate between healthy controls and HCM/HHD patients. Also, Neisius et al.36, have found that radiomic analysis applied to T1 maps was able to distinguish between LGE+ and LGE–, identifying a subgroup of subjects in whom gadolinium administration could be safely avoided. Wang et al.20 have found that radiomic analysis of T1 maps can discriminate HCM patients with different mutations of sarcomere-related genes.

Previous studies have investigated the effect of image preprocessing on radiomic features estimation as a function of various factors, which include imaging technique/modality and anatomical region40,41,42,43,47,48,51,52,53,54,55,72. Nonetheless, to the best of our knowledge, this is the first study aimed at assessing the effect of both image resampling/discretization and spatial filtering on radiomic features estimated from CMR T1 and T2 maps. Thus, in a group of HCM patients, we performed a relatively comprehensive analysis, which considered multiple elements such as voxel size resampling, T1/T2 values discretization, and spatial filtering.

In radiomic studies, voxel size resampling is a recommended and commonly employed preprocessing step when analyzing data with different acquisition protocols or from different scanners, which can result in different acquisition voxel sizes59. For both T1 and T2 maps, we found that textural features belonging to GLCM and NGTDM classes showed moderate relative variability (0.75 < ICC ≤ 0.9) when varying resampling voxel size, except for a limited number of radiomic features (Fig. 1). Conversely, the estimate of textural features of GLRLM, GLSZM, and GLDM classes showed a higher relative variability when varying resampling voxel size, with few exceptions (e.g., GrayLevelVariance from GLRLM, GLSZM, and GLDM, which presented ICC > 0.9 for both T1 and T2 maps). In general, radiomic features belonging to shape and first order classes were characterized by higher ICC values than textural features (Fig. 1), indicating that resampling voxel size has less impact on their estimate. In particular, shape radiomic features had ICC values > 0.9 (Elongation was the only exception, showing 0.75 < ICC ≤ 0.9) and median CV values within 0.8%-2.7% (Table 2) for both T1 and T2 maps. Most of first order radiomic features presented ICC values > 0.75, with only few features showing considerable (i.e., Energy from T1 and T2 maps, Kurtosis from T1 maps, and Skewness from T2 maps) or high (i.e., Kurtosis from T2 maps) relative variability. Furthermore, we revealed that even some radiomic features from T1 and T2 maps with ICC > 0.75 can have a non-negligible absolute variability in terms of CV (up to 30% or more, for single subjects) when varying resampling voxel size. In this regard, we found that T1 maps presented an appreciably higher number of radiomic features with ICC > 0.75 and median CV > 15% than T2 maps (Figs. 3, 4b). For most textural features, a significant linear correlation between their estimates and resampling voxel sizes was observed (Supplementary Tables S2, S4).

Image discretization is another important step before radiomic features estimation, which allows simplifying rather complex computational operations. Given that T1 and T2 represent quantitative physical properties of tissues, we chose to apply a fixed bin width approach59. Bin width ranges of variation (i.e., 3.6–6.4 ms and 0.49–0.57 ms for T1 and T2 maps, respectively) were selected using the same criterion (i.e., median number of quantization levels between 30 and 13043,65,66), allowing a comparison between the results of T1 and T2 maps. A remarkable difference between T1 and T2 maps in sensitivity of radiomic features estimation to discretization was found. In general, textural features estimated from T1 maps showed higher variability than their estimates from T2 maps (ICC > 0.75). Only a limited number of T1 maps textural features were characterized by ICC > 0.75 (Fig. 2) and some of them still had median CV > 15% (Fig. 4a). Overall, CV values were greater for T1 maps (up to 75%) than for T2 maps (less than 20%) (see Table 2). As expected, shape and intensity-based statistical features (i.e., all first order features but Entropy and Uniformity) yielded ICC = 1 and CV = 0%. Indeed, these radiomic features are estimated by PyRadiomics, according to the IBSI recommendation, prior to discretization59 and hence this preprocessing step does not affect their estimate. The repeated measures correlation analysis showed that all radiomic textural features (with few exceptions) have a significant linear correlation between their estimates and bin width (Supplementary Tables S3, S5).

Digital image filters can be applied before radiomic features extraction to detect and emphasize tissue characteristics different from those usually obtained from original (i.e., unfiltered) images. In this regard, the IBSI has proposed a new reference manual, in order to define and standardize the implementation of image filters in radiomics software60. Given that a spatial filter can actually modify (even in a substantial way) T1 and T2 maps, we observed a relevant sensitivity of both first order and textural features estimates to image filter, with ICC ≤ 0.5 (Supplementary Fig. S2). On the other hand, the entity of this effect can vary with radiomic features and is not easily predictable especially for textural radiomic features. Indeed, as reported in Supplementary Table S1, median CVs of radiomic features estimate for different spatial filters can range from less than 10% to 200% or more.

The revealed effect of resampling voxel size and bin width on the estimate of myocardial radiomic features from T1 and T2 mapping, albeit limited in many cases, sustains the importance of reporting/describing in detail these aspects in clinical and research studies. In this regard, while previous studies involving myocardial T1 mapping have described the radiomic features extraction process quite exhaustively19,20,21,36, some information about image preprocessing was still missing. For instance, none of these studies have reported whether and how image discretization was performed (notably, Neisius et al.19 have specified the number of discretized intensity levels but only for the radiomic features belonging to the GLRLM class, while no discretization was indicated for the GLCM class). Furthermore, the studies conducted by Neisius et al.19 and Shi et al.21 have acquired images with different in-plane spatial resolutions (i.e., 2.1 and 1.3 mm, respectively) and employed slightly different preprocessing steps (e.g., range re-segmentation and intensity outlier filtering, respectively59). This could partly explain the obtained different results in terms of discriminative radiomic features and accuracy.

So far, previous HCM radiomic studies19,20,21,36 have exploited only T1 mapping, mainly because of its capability of revealing myocardial fibrosis. However, T2 maps are considered the gold standard for the evaluation of myocardial edema, which is a well-known negative prognostic factor in HCM15,16. Moreover, although T2 mapping is not per se capable of evaluating myocardial fibrosis, texture analyses might theoretically overcome this limitation, unveiling myocardial structural heterogeneity due to myofibrillar disarray and fibrosis18,27. Therefore, given also that, overall, radiomic features from T2 maps have proven to be characterized by a lower sensitivity to image resampling and discretization than radiomic features from T1 maps (especially when varying bin width), radiomics from T2 mapping might be exploited to obtain a complementary characterization of HCM, albeit this lower sensitivity does not necessarily imply higher discriminative or predictive power. In addition, it should be noted that, for both T1 and T2 mapping, the sensitivity of several radiomic features to resampling voxel size and bin width resulted rather independent of bin width and resampling voxel size, respectively. Moreover, only a limited number of radiomic features showed high relative variability (ICC ≤ 0.5) associated with different resampling voxel size or bin width. Some radiomic features with ICC > 0.75 still presented high variability (> 30%) in terms of CV for single subjects, suggesting that such radiomic data from different centers should be compared or pooled with caution.

We recognize and submit the following potential limitations and implications, respectively, of our study. First, while we analyzed only single slice acquisitions of T1 and T2 mapping, in HCM patients, myocardial changes may involve also visually non-hypertrophied myocardial segments. Therefore, a whole-heart coverage might provide a more comprehensive evaluation of disease burden and increase the diagnostic performance of CMR imaging. However, for a global/diffuse disease, it is considered adequate to analyze a single ROI on mid-cavity short-axis map73. Furthermore, the main aim of our exquisitely technical study was to assess the effect of image resampling and discretization on radiomic features estimated from typical cardiac T1 and T2 maps. Accordingly, we focused on a single short-axis slice, located where myocardial thickness was maximum and myocardial changes were assumed to be more severe, obtaining hence minimum partial volume effects, which can greatly affect regions of thinner myocardial segments. Moreover, this is a retrospective study that has enrolled subjects referred for clinical/routine CMR imaging, which requires avoiding too long acquisition times especially for non-collaborative patients. Second, myocardial T1 and T2 maps can be affected by noise, as well as by possible fitting errors and patient motion. In this regard, while appropriate denoising filters applied on T1- and T2-weighted images can be used to improve the precision and accuracy of computed T1 and T2 maps74,75, we adopted no denoising method. Third, in this technical study involving HCM patients, no clinical/diagnostic results tailored to this disease were reported. In this regard, in order to avoid any potential confounding effects in our analysis, we preferred to enroll only a group of homogeneous subjects with the same pathology. Therefore, this preliminary technical radiomic study on T1 and T2 mapping can be helpful and prodromic for further clinical studies on HCM. Nonetheless, we feel our analysis framework could be applied to other cardiac populations likely obtaining similar findings, albeit that would need further experimental studies to be confirmed. Fourth, image resampling and discretization are only two of the various steps of the radiomic pipeline, albeit important and recommended by the IBSI guidelines59. The issue of standardization in radiomic studies can involve additional factors, such as scanner performances, acquisition protocols, acquisition sequence parameters, and data analysis methods. However, we feel the results of our study can represent a potentially helpful step toward the standardization of radiomics in cardiac magnetic resonance imaging. Indeed, image resampling and discretization modify de facto acquired image data and possibly their radiomic characteristics. Therefore, professionals and researchers executing or planning clinical or research studies should be aware of this critical aspect of radiomics in cardiac magnetic resonance imaging and its entity for both T1 and T2 mapping. As for the practical implications of this study, we note that radiomic techniques allow extracting hundreds of features. In clinical and research studies employing radiomics and artificial intelligence methods, even when enrolling numerous subjects, it is hence mandatory to apply appropriate methods of features selection, in order to avoid any possible bias due to overfitting and obtain reliable results. While supervised or unsupervised methods of features selection can be used76, a limited sensitivity to image resampling and discretization can represent an additional criterion for selecting features. This does not necessarily imply higher discriminative or predictive power in clinical or research studies, which might also depend on the specific effect size associated with selected features, but still can improve reproducibility of results and comparison/pooling of data from different centers.

Conclusions

Toward an optimization and standardization in radiomic applications, any potential dependence of radiomic features on various aspect of the radiomic workflow should be possibly taken into account when planning a clinical or research study. Nowadays, there is a growing interest in exploiting radiomic techniques in various clinical applications. In particular, the study of myocardial diseases represents an emerging and promising field of application. In this HCM study, while we observed a sensitivity of myocardial radiomic features from T1 and T2 mapping to some conventional image preprocessing procedures such as image resampling/discretization, this effect was very remarkable only when considering image filtering. Indeed, the sensitivity of radiomic features to different resampling voxel sizes and bin widths was limited for many textural radiomic features, as well as for shape and first order radiomic features. The estimate of most textural radiomic features showed a linear significant correlation with resampling voxel size and bin width. In general, radiomic features from T1 mapping were more sensitive to image preprocessing than radiomic features from T2 mapping, especially when varying bin width. Overall, our results might corroborate the potential of radiomics from T1 and T2 mapping in HCM and hopefully in other myocardial diseases.