Abstract
CTP is an important diagnostic tool in managing patients with acute ischemic stroke, but challenges persist in the agreement of stroke lesion volumes and ischemic core-penumbra mismatch profiles determined with different CTP post-processing software. We investigated a systematic method of calibrating CTP stroke lesion thresholds between deconvolution algorithms using a digital perfusion phantom to improve inter-software agreement of mismatch profiles. Deconvolution-estimated cerebral blood flow (CBF) and Tmax was compared to the phantom ground truth via linear regression for one model-independent and two model-based deconvolution algorithms. Using the clinical standard of model-independent CBF < 30% and Tmax > 6 s as reference thresholds for ischemic core and penumbra, respectively, we determined that model-based CBF < 15% and Tmax > 6 s were the corresponding calibrated thresholds after accounting for quantitative differences revealed at linear regression. Calibrated thresholds were then validated in 63 patients with large vessel stroke by evaluating agreement (concordance and Cohen’s kappa, κ) between the two model-based and model-independent deconvolution methods in determining mismatch profiles used for clinical decision-making. Both model-based deconvolution methods achieved 95% concordance with model-independent assessment and Cohen’s kappa was excellent (κ = 0.87; 95% confidence interval [CI] 0.72–1.00 and κ = 0.86; 95% CI 0.70–1.00). Our systematic method of calibrating CTP stroke lesion thresholds may help harmonize mismatch profiles determined by different software.
Similar content being viewed by others
Introduction
Imaging-based assessment of acute ischemic stroke, most often with computed tomography (CT), remains an important factor in evaluating eligibility for reperfusion treatment. Current guidelines recommend endovascular thrombectomy in patients with a demonstrated large vessel anterior circulation occlusion at CT angiography, and either mild-to-moderate early ischemic changes at non-contrast CT within 6 h from stroke symptom onset or a favourable mismatch profile identified at CT perfusion (CTP) between 6 and 24 h after stroke onset1,2. A favourable mismatch is defined as a small ischemic core volume relative to the expected penumbra, which is assessed volumetrically by CTP3,4, or by the National Institutes of Health Stroke Scale (NIHSS)5. Similarly, favourable CTP mismatch profiles were associated with better outcomes in patients treated with thrombolysis compared to placebo between 4.5 and 9 h after stroke onset6, which is beyond the < 4.5-h time window currently recommended in patients demonstrating mild-to-moderate early ischemic changes at non-contrast CT1.
Ischemic core and penumbral volumes are therefore diagnostic markers of interest in acute ischemic stroke. These markers are most often delineated with CTP parametric maps by identifying voxels above or below a pre-determined parameter threshold. In multiple clinical trials evaluating treatment eligibility with CTP3,4,5,6, a single commercial CTP software (RAPID CTP, RapidAI, Menlo Park, CA) was used, which estimated ischemic core and penumbra by CBF < 30% relative to the mean CBF in normally perfused brain and Tmax > 6 s, respectively. These thresholds were validated in prior studies comparing CTP CBF lesions against near-concurrent diffusion-weighted MR imaging (DWI)7 and Tmax lesions against final infarct volume and clinical outcome8,9,10. However, thresholds are specific to each CTP software, and variability in ischemic lesions determined by different CTP software packages have been reported11,12. To demonstrate near-equivalence to RAPID CTP, software-specific optimal thresholds for ischemic core and penumbra have been re-derived, mainly by comparing against follow-up infarct volume, and were found to differ by software13,14,15,16. These empirical threshold calibration techniques assume that follow-up infarct volume is roughly equivalent to true infarction at admission CTP, but infarct growth is expected in the time interval to follow-up imaging, even after successful endovascular thrombectomy. Empirical threshold calibration techniques are therefore dependent on the dataset and the extent to which follow-up infarct matches true admission infarct.
One key component of CTP software is the deconvolution algorithm used to estimate perfusion, which may differ in estimated perfusion between deconvolution methods. Kudo et al. used a digital perfusion phantom in which CTP time-density curves (TDCs) were simulated to quantify CTP-estimated perfusion against the phantom’s ground truth perfusion17. Differences in estimated perfusion were found between commercial CTP software, which incorporated different deconvolution algorithms. Since this study, the diagnostic endpoint of identifying ischemic core and penumbral volumes with CTP has been clearly defined and validated in clinical trials, and it is of renewed interest to characterize how choices in CTP algorithms affect the estimation of stroke lesion volumes. Specifically, it is desirable to determine a method to calibrate thresholds between deconvolution algorithms such that estimates of lesion volumes may be reproducible between CTP software. In this study, we showed that CTP ischemic core and penumbral thresholds could be calibrated between three deconvolution algorithms by comparing against ground truth perfusion from a digital perfusion phantom. We utilized the digital perfusion phantom to establish a precise mapping of the relationship between known phantom perfusion values and the estimates generated by each deconvolution method. These mappings served as the foundation for identifying calibrated ischemic lesion thresholds, which helped harmonize the results between deconvolution methods by accounting for their quantitative disparities. Subsequently, these calibrated thresholds were applied to CTP studies of patients with acute ischemic stroke, facilitating a comparative analysis of their diagnostic mismatch profiles across different software platforms. Our proposed systematic method may help harmonize ischemic stroke lesion volumes and mismatch profiles determined by different CTP software.
Method
Theory
Concentration of contrast in tissue over time, \(Q(t)\), is related to that of a feeding artery by the governing equation:
where \(F\) is the blood flow (in units of ml/min/100 g) delivering contrast to tissue, \({C}_{a}(t)\) is the concentration of contrast in the feeding artery as a function of time, \(\otimes\) is the convolution operator, and \(R(t)\) is the impulse residue function (IRF), which describes the proportion of contrast remaining in tissue over time when a unit mass of contrast is injected as a short bolus into the tissue. A flow-scaled IRF, \({R}_{F}(t)\), is also commonly used and is defined as \({R}_{F}\left(t\right)=FR(t)\). The IRF is decoupled from the influence of the arterial input and describes the hemodynamic characteristics of the local tissue. As such, the perfusion parameters derived from the IRF are of diagnostic interest, and voxel-wise evaluation of the perfusion parameters result in quantitative maps. Perfusion parameters include the mean transit time (MTT) of contrast through the vasculature, which is the area underneath the IRF, and the Tmax, defined as the time-to-maximum of the IRF. Additionally, the cerebral blood volume (CBV) is related to CBF and MTT by the Central Volume Principle18:
CBV is the volume of flowing blood in units of ml/100 g in the tissue vasculature. MTT and Tmax are normally expressed in units of seconds.
In a CTP study, the IRF cannot be directly measured from the native CT images, but rather, must be calculated by voxel-wise deconvolution of the arterial TDC, \({C}_{a}(t)\), from the measured tissue TDC, \(Q(t)\). Different methods for deconvolution have been proposed, which lead to differences in estimated perfusion and thus CTP stroke lesion thresholds17. We compared CTP stroke lesion thresholds for three deconvolution algorithms: (1) Fourier Transform-based deconvolution19, (2) model-based deconvolution with a plug flow model of contrast transport20,21, and (3) model-based deconvolution with a Johnson-Wilson-Lee (JWL) model of contrast transport22. The first two were implemented in-house as representative model-independent and model-dependent deconvolution methods. Of note, the model-independent deconvolution algorithm has been described in detail in a previous study19 and was the algorithm incorporated into clinically-validated RAPID CTP software, so its CTP lesion thresholds (ischemic core: CBF < 30%; penumbra: Tmax > 6 s) were selected as the reference on which the other two methods’ thresholds were calibrated. The third method was available in commercial CT Perfusion 4D (CTP4D Version 16.0–2.216, GE Healthcare, Waukesha, WI), which we henceforth refer as the “CTP4D” algorithm to avoid confusion with the plug-flow model-dependent deconvolution method. We briefly review the details of each model-independent and model-dependent deconvolution method, though the technical details of the CTP4D algorithm are unavailable.
Model-independent deconvolution
The Fourier Transform-based method leverages the convolution theorem, which states that a convolution in the time domain is equivalent to the product of frequency spectra:
where \(\mathcal{F}\left\{\cdot \right\}\) is the Fourier Transform. In numerical implementation, \(Q(t)\) and \({C}_{a}(t)\) are linearly interpolated to a virtual sampling interval of 1 s and zero-padded to twice their length to avoid time aliasing19. This formulation is equivalent to circulant convolution in the time domain and therefore considered insensitive to delay in contrast arrival between the artery and the tissue (T0)19,23. The flow-scaled IRF can be estimated by the inverse Fourier Transform of the quotient between tissue and arterial TDC frequency spectrums. However, due to noise and other artifacts, the raw solution often results in flow-scaled IRFs with spurious oscillations that are not physiologically plausible. Fourier deconvolution is therefore regularized with a Wiener-like filter19 in the frequency domain using a 20% regularization threshold relative to the maximum magnitude of the arterial frequency spectrum. From the estimated flow-scaled IRF, CBF was determined as the peak height and Tmax as the time-to-maximum of the IRF. In the model-independent method, CBV was calculated as the area underneath the tissue TDC divided by the area underneath the arterial TDC. Areas were computed numerically by the trapezoidal rule. MTT was then calculated by the quotient of CBV and CBF as in the Central Volume Principle18.
Model-dependent deconvolution
The IRF of the model-dependent (plug-flow) method is given by:
where T0 is the delay time between contrast arriving at the artery to arrival at tissue, MTT is the mean (uniform) transit time of contrast through the vasculature, and \(U(t)\) is the Heaviside unit step function. Substituting (4) into (1), a closed-form solution to \(Q(t)\) can be obtained:
In other words, in the plug-flow model, the tissue TDC is the difference between the time-integral of the arterial TDC shifted by T0 and T0 + MTT, respectively, and linearly scaled by the CBF. We numerically solved (5) by a grid search of T0 and MTT, which linearizes the estimation of CBF. Specifically, all combinations of T0 \(\in \left[0, 1, 2, \dots , 23, 24\right]\) seconds and \(MTT\in [2, 3, \dots , 23, 24]\) seconds were searched, leading to a total of 575 grid search combinations per tissue TDC. A non-negative linear least squares algorithm24 solved for physiologically-constrained positive values of CBF for each T0 and MTT pair. The set of estimated parameters that produced the least squared difference between the measured and the estimated tissue TDC by (5) was considered the optimal parameters. In this method, T0 and MTT were directly estimated, and CBV was calculated by the Central Volume Principle in (2). Tmax was computed as Tmax = T0 + 0.5MTT, that is, time at half the width of the T0-shifted boxcar IRF.
While the implementation details of the CTP4D algorithm is unavailable, the IRF of the JWL model of contrast transport is22:
where \(W\) is the minimum transit time of contrast through the vasculature, \(E\) is the fraction of contrast with transit time \(>W\), and \(k\) is a decay rate constant related to the heterogeneity of vascular transit time. Compared to the plug flow model, which assumes uniform transit time of contrast through vasculature, the JWL model appends an exponential decay following the boxcar IRF to account for heterogeneity in contrast transit time. MTT was the area underneath the IRF and Tmax was defined as Tmax = T0 + 0.5MTT16,22.
Digital perfusion phantom-based threshold calibration
The quantitative accuracy of estimated perfusion parameters for each deconvolution algorithm was established by comparing against reference perfusion values in a digital perfusion phantom. Digital perfusion phantoms have been described previously17 and is briefly reviewed here. Tissue TDCs were simulated by convolving an assumed arterial TDC and simulated IRFs with known ground truth perfusion parameters (CBF, CBV, MTT, T0, and Tmax). Flow-scaled IRFs were simulated as gamma-variate functions to ensure they had a different shape than the IRFs used in the model-based deconvolution methods. Gamma-variate IRFs were simulated with a wide range of ground truth perfusion parameters: \({T}_{0}\in [0.0, 0.5, 1.0, 2.0, 3.0, 4.0, 8.0]\) s, \(MTT\in [3.4, 4.0, 6.0, 8.0, 10.0, 12.0, 16.0]\) s, and \(CBV\in [0.5, 1.0, 1.5, 2.0, 2.5, 3.0, 4.0, 5.0]\) ml/100 g. CBF was calculated as CBV/MTT by the Central Volume Principle18; accordingly, CBF ranged from 1.9 to 88.2 ml/min/100 g at non-uniform intervals. Ground truth tissue TDCs were calculated by numerically convolving the simulated IRF and the linearly interpolated patient arterial curve at 0.01 s interval then resampled at 2 s interval. Zero-mean Gaussian noise with standard deviation, \(\sigma =1.5\) HU was added to the tissue TDCs to simulate the expected noise variation in tissue TDCs after standard Gaussian filtering of dynamic CTP images (Supplemental Fig. 1). In total, 1024 noisy tissue TDCs were generated for each set of perfusion parameters by random sampling of Gaussian distributions with σ = 1.5 HU. Further details on generating the phantom are in the Supplemental Materials.
We proposed to correlate deconvolution-estimated and ground truth phantom perfusion to calibrate stroke lesion thresholds between deconvolution methods. We focused on CBF and Tmax as they are most commonly used in practice to identify ischemic core and penumbra, respectively. First, the mean deconvolution-estimated CBF and Tmax over the 1024 noise realizations for each set of perfusion parameters was computed. Linear regressions between the deconvolution-estimated mean values and ground truth values were then calculated. These regression relationships were then used to construct calibration relationships for predicting the equivalent lesion thresholds between the three deconvolution thresholds. The derivation of the calibration relationship is provided in the Supplementary Materials. The equivalent model-based relative CBF threshold, \({R}_{MB}\), to the reference model-independent relative CBF threshold, \({R}_{MI}\), is given by:
where \(\alpha\) and \(\beta\) are linear regression slopes and intercepts, respectively, of model-independent (MI) and model-based (MB) deconvolution-estimated CBF against ground truth CBF in the digital perfusion phantom. \({N}_{GT}=\) 50 ml/min/100 g and was the assumed ground truth CBF in the normally perfused brain for normalization of absolute CBF to relative CBF25. Similarly, the calibration relationship between model-independent Tmax and model-based Tmax thresholds was:
where \(\gamma\) and \(\delta\) are linear regression slopes and intercepts, respectively, of model-independent and model-based deconvolution-estimated Tmax against ground truth Tmax in the digital perfusion phantom. The reference model-independent ischemic core and penumbra thresholds were \({R}_{MI}=\) 30% and \({T}_{max,MI}=\) 6 s, respectively4.
Validation of calibrated threshold on patient CTP studies
The calibrated thresholds determined from the digital perfusion phantom experiments were validated in patient CTP studies. Validation data were from the ISLES 2018 Challenge (http://www.isles-challenge.org/ISLES2018)26, which consisted of anonymized imaging of 103 patients with a diagnosed acute large vessel occlusion within 8 h of onset. Patients were pooled from two multicentre ischemic stroke cohort studies27,28 and clinical information such as age, NIHSS, onset-to-CT time, CT-to-DWI time, and CT scanner manufacturer for the entire 103-patient dataset have been reported previously, but were not directly provided with the ISLES 2018 dataset7,26. CTP was performed at admission and diffusion-weighted MRI (DWI) within 3 h of CTP. In a subset of 63 patients, DWI lesion segmentations were provided to serve as the reference for CTP ischemic core delineation. Only these 63 patients with reference DWI lesion volumes were included in our study; they were a subset of patients described in a previous publication7, which was the main basis for the commonly used relative CBF < 30% ischemic core threshold in RAPID CTP software5. Partitioned clinical data for the subgroup of 63 included patients of this study were not available. CTP studies were acquired with a wide range of scanners from different manufacturers (GE Healthcare, Philips Healthcare, Siemens Healthineers, and Toshiba Medical), with different axial coverage (2–16-cm whole brain), and scan durations (43–64 s). All CTP studies were motion corrected and resampled to 1 s scan interval26. Of note, patients with dual-slab CTP studies (required for short axial coverage scanners, denoted with a suffix “A/B” in the ISLES dataset) were processed as individual studies but grouped together as a single patient for statistical analysis. All methods and experimental protocols involving the acquisition of imaging data in the ISLES 2018 dataset was approved by the local institutional review board at each participating centre.
CTP studies were automatically processed by an in-house Python-based 3D Slicer extension module for the model-independent and model-dependent deconvolution algorithms and the CTP4D software (Version 16.0-2.216, GE Healthcare, Waukesha, WI). The arterial and venous TDCs were automatically detected in a proximal healthy brain artery (e.g., middle or anterior cerebral artery or the basilar artery) and the sagittal sinus, respectively, and manually adjusted as required. In our in-house CTP software, skull-stripped dynamic CTP images were filtered by a 2D Gaussian kernel with a standard deviation of 2.4 mm. The model-independent and model-dependent deconvolution algorithms were then applied on identically pre-processed dynamic CTP studies to generate CBF, CBV, MTT, and Tmax maps. Similarly, CTP4D software pre-processed the dynamic images using a proprietary edge-preserving spatiotemporal filter and generated CBF, CBV, MTT, and Tmax maps using a JWL model-based deconvolution method22.
CTP4D software automatically segmented the ischemic core and penumbra using the determined calibrated CBF and Tmax thresholds. Uncalibrated threshold lesion volumes were also recorded for comparison. For our in-house software, the steps to segment the perfusion lesions were as follows. First, hypoperfused and normal tissue were considered parenchymal brain with Tmax greater than or lesser than or equal to 4 s, respectively. The mean CBF in the normally perfused brain was used as the reference CBF to calculate the relative CBF thresholds. Ischemic core was delineated within the hypoperfusion mask by model-independent CBF < 30% and the calibrated model-dependent CBF threshold. Model-dependent CBF < 30% lesions were also segmented for comparison. Similarly, the penumbra was segmented within the hypoperfused mask by model-independent Tmax > 6 s and the calibrated model-dependent Tmax threshold. Hypoperfusion, ischemic core, and penumbra segmentations were post-processed by morphological operations (dilation, hole filling, then erosion using a 5-mm disk structure element) to improve robustness to CTP map noise. Of note, model-independent Tmax > 6 s is an operational CTP definition of penumbra that may not reflect true penumbral tissue of intact but functionally inactive neurons29. The mismatch ratio was calculated as the quotient of the penumbral volume and the ischemic core volume.
Correlation and agreement of ischemic core volume compared to DWI lesion volumes were assessed by the Pearson correlation coefficient and Bland–Altman analysis, respectively. Similarly, ischemic core and penumbral volumes between CTP software were compared by Pearson correlation and Bland–Altman analysis. Bland–Altman differences were reported as the reference (DWI or model-independent lesion volume) minus the compared technique. Patient-wise target mismatch profiles (core volume < 70 ml, penumbra volume ≥ 15 ml, mismatch ratio ≥ 1.8)4 were evaluated for each CTP software. Agreement between model-independent versus model-dependent, and model-independent versus CTP4D target mismatch profiles were assessed by Cohen’s kappa coefficient (\(\kappa\)). Cohen’s kappa was categorically scored as no agreement (\(\kappa\) < 0.00), slight (\(\kappa\) = 0.00–0.20), fair (\(\kappa\) = 0.21–0.40), moderate (\(\kappa\) = 0.41–0.60), substantial (\(\kappa\) = 0.61–0.80), and excellent (\(\kappa\) = 0.81–1.00) agreement30. Accuracy, sensitivity, and specificity of target mismatch profiles were also computed using the model-independent determination as the reference. A two-tailed alpha < 0.05 was considered statistically significant.
Results
Threshold calibration
Figure 1 summarizes the scatter plots and linear regressions of ground truth versus CTP-estimated CBF and Tmax. Model-independent and model-dependent deconvolution methods underestimated higher ground truth CBF, but underestimation was more severe with model-independent deconvolution. CTP4D CBF agreed well with ground truth CBF. Linear regression slopes were 0.36, 0.84, and 0.94 for model-independent, model-dependent, and CTP4D, respectively, and corresponding intercepts were 6.11, 4.12, and 5.98. The goodness of fits (R2) of linear regression were 0.76, 0.97, and 0.98 for model-independent, model-dependent, and CTP4D methods, respectively. Using the calibration relationship in Eq. (7), the determined linear regression parameters, and \({R}_{MI}=\) 30% as the reference threshold, the calibrated relative CBF thresholds were \({R}_{MD}=14.4\%\) and \({R}_{4D}=16.6\%\) for model-dependent and CTP4D methods, respectively, which were both rounded to 15% for validation in patient studies.
For Tmax threshold calibration, we found that model-independent Tmax substantially overestimated the ground truth at CBV of 0.5 ml/100 g due to low SNR (Fig. 1, grey crosses). CTP-estimated Tmax values at CBV 0.5 ml/100 g were excluded for more robust linear regression against ground truth values. The linear regression slopes for ground truth versus model-independent, model-dependent, and CTP4D Tmax were 1.06, 1.11, and 1.10, and intercepts were 1.29 and 1.35, 1.54, respectively. Linear regression R2 were 0.89, 0.97, and 0.96 for model-independent, model-dependent, and CTP4D methods, respectively. Using the calibration relationship in Eq. (8), the determined linear regression parameters, and \({T}_{max,MI}=\) 6 s as the reference threshold, the calibrated Tmax thresholds were \({T}_{max, MD}=6.3\) s and \({T}_{max, 4D}=6.4\) s for the model-dependent and CTP4D methods, respectively, which were rounded to 6 s for application on patient datasets.
Validation of calibrated thresholds
Median (interquartile range, IQR) stroke lesion volumes from DWI and CTP are summarized in Table 1. Correlation plots and Bland–Altman analyses comparing DWI lesion versus CTP ischemic core volumes are shown in Fig. 2. Plots for the uncalibrated CBF < 30% ischemic core threshold are not shown as it clearly overestimated both DWI and model-independent relative CBF < 30% lesion volumes (Table 1). Mean volume differences (95% limits of agreement) between DWI lesion versus model-independent CBF < 30% and calibrated model-dependent CBF < 15% and CTP4D CBF < 15% were + 16.2 (− 30.9 to 63.3) ml, + 10.9 (− 32.9 to 54.7) ml, and + 13.8 (− 48.1 to 75.7) ml, respectively. Correlation and agreement of ischemic core and penumbra volumes estimated between CTP software was also strong (Fig. 3) after threshold calibration, albeit model-independent lesion volumes were marginally smaller than model-dependent and CTP4D ones.
Of the 63 included patients, 50 met the mismatch criteria and 13 did not by model-independent deconvolution assessment. Patients who did not meet the mismatch criteria by model-independent deconvolution had a core volume > 70 ml (N = 4), mismatch ratio < 1.8 (N = 3), or a combination of negative criteria (N = 6). Agreement and classification metrics between CTP software using model-independent target mismatch profile as the reference are summarized in Table 2, including subgroup analyses for core volume, penumbral volume, and mismatch ratio. Overall, excellent agreement was found in target mismatch profiles determined from model-independent deconvolution and model-dependent deconvolution (\(\kappa\) = 0.87, 95% confidence interval [CI] 0.72–1.00) and CTP4D software (\(\kappa\) = 0.86, 95% CI 0.70–1.00). Strong agreement was also found in classifying favourable/unfavourable core and penumbra volumes but weaker for mismatch ratios (Table 2). Sensitivity and accuracy of classifying a favourable/unfavourable stroke lesion profile was overall excellent. Figure 4 and 5 show examples of concordant and discordant target mismatch profiles between CTP software.
Discussion
CTP is an important diagnostic tool in managing patients with acute ischemic stroke, but challenges persist in the agreement of stroke lesion volumes and mismatch profiles determined with different software packages11. In this study, we reported a digital perfusion phantom-based method to systematically calibrate CTP lesion thresholds with respect to the deconvolution algorithm used. We found that model-independent deconvolution substantially underestimated higher ground truth CBF in the perfusion phantom, whereas the two model-based methods (plug flow20 and JWL22 models) had better agreement with ground truth CBF. The ischemic core CBF threshold for the two model-based methods therefore required reduction to CBF < 15% with guidance from the perfusion phantom to match that of the model-independent CBF < 30% threshold. Tmax penumbral thresholds were roughly similar between all deconvolution methods in our study. In patient CTP studies using DWI lesion volume as reference, we found ischemic core volume estimation was substantially improved after threshold calibration. Importantly, our phantom-based threshold calibration method was valid with commercial CTP4D software, which used proprietary processing steps different to those used in our in-house CTP software, but nonetheless achieved strong agreement to reference (DWI) stroke lesion volumes and predicted mismatch profiles. Our novel contribution was to leverage quantitative relationships between deconvolution-estimated and ground truth perfusion in digital perfusion phantoms to calibrate perfusion thresholds between deconvolution methods.
Linear regression to quantify the parameter estimation accuracy has been reported previously17, but has not been extended to systematically calibrate lesion thresholds between deconvolution methods. Previous threshold calibration studies empirically adjusted CTP lesion thresholds and demonstrated equivalence in estimating lesion volume to other validated CTP software13,14,15. Whereas this empirical approach is likely dataset dependent, our phantom-based calibration technique is systematic and based on the quantification accuracy of each deconvolution algorithm. As confirmed by our digital perfusion phantom experiment, different deconvolution algorithms can have substantially different accuracy in estimating CBF, and consequently, CBF thresholds for ischemic core must be adjusted. Specifically, the linear regression (Fig. 1) slope represents the sensitivity of the deconvolution method to true changes in perfusion parameters. The intercept can loosely be interpreted as a parameter estimation bias at low SNR. Although physiologically, we expect the intercept to cross the origin for all methods, this was not reflected in our data and thus we did not enforce a fixed zero intercept at linear regression. We believe that the physiological non-negativity constraint of perfusion parameters estimates caused a positive bias in low SNR TDCs. Because of the non-zero intercept, linear regression parameters did not differ between CTP methods by a constant scaling factor; as such, even relative perfusion parameters would not yield equivalent thresholds and parameter estimates between methods.
Accordingly, the need to adjust ischemic core relative CBF thresholds between deconvolution methods was supported by patient CTP studies. In our study, the model-dependent method and CTP4D software, the latter which also uses model-based deconvolution, achieved roughly the same calibrated thresholds. This result does not necessarily imply that all model-based deconvolution methods have the same stroke lesion thresholds. The phantom-based calibration experiment should be performed for each CTP software and whether it uses deconvolution or not. Importantly, the commonly used relative CBF < 30% threshold for ischemic core7,31,32 does not appear to be an absolute pathophysiological marker of infarction, but rather, is dependent on the deconvolution method used to estimate CBF. This may explain the well-known problem of widely different estimates of ischemic core and mismatch profiles from CTP maps generated by different software when a single relative CBF < 30% threshold was used15,33. In our analysis, the Tmax threshold did not have to be adjusted among the three deconvolution methods. This may be in part due to the simulated gamma-variate IRF was “forced” to have Tmax = T0 + 0.5MTT, which was the definition of Tmax in IRFs used in the two model-based deconvolution methods. Patient IRFs may have a different relationship between Tmax, T0, and MTT. Nonetheless, all three CTP software produced similar penumbral volumes in the patient CTP studies. Our digital perfusion phantom-based calibration technique may be incorporated in diagnostic quality assurance programs performed by users when deploying new CTP software or a software upgrade. Vendors may also use our method to determine optimal stroke lesion thresholds for their software relative to other clinically validated software.
Target mismatch profile is of the greatest diagnostic interest in acute ischemic stroke CTP. In our study, substantial agreement in identifying favourable/unfavourable mismatch profiles was achieved between the three CTP software after threshold calibration. As shown by subcategory classification metrics (core < 70 ml, penumbra ≥ 15 ml, and mismatch ratio ≥ 1.8), disagreement was mostly in detecting favourable/unfavourable mismatch ratios. Mismatch ratio is the quotient of penumbral and core volumes, so the error in each measurement propagates to the mismatch ratio and worsens agreement between software. However, these incremental differences may not be as severe in clinical practice because a clinician may be able to rule out artefacts, integrate critical information from clinical assessments, and holistically make judgment calls when mismatch profiles are at the borderline between favourable and unfavourable. The few cases with discordant mismatch profiles may be due to differences between CTP software that cannot be accounted by our proposed linear calibration method. These differences may include the dependence of each deconvolution algorithm on T0, MTT, and signal-to-noise ratio, or other algorithmic differences such as dynamic image filtering and lesion segmentation methods. Though our phantom-based technique was successful in calibrating the lesion threshold and improving diagnostic agreement in most cases, further work is required to reduce discrepancies in stroke lesion volumes and mismatch profiles caused by these other CTP processing steps.
The validity of the threshold calibration technique is contingent on how well the phantom reproduces conditions expected in patient CTP studies. Each component of the phantom can be scrutinized to evaluate its fidelity to patient CTP studies. First, the assumed arterial curve was taken from a patient CTP study, which contained a small amount of noise relative to the large arterial signal. Arterial TDC noise was therefore incorporated into the convolved tissue TDCs, which may cause the simulated tissue TDC to be noisier than the real ones. We expect the small amount of added noise to be of little consequence. Further, the patient arterial TDC used was an improvement on prior digital phantoms that used an analytical gamma-variate arterial curve17 because it included contrast recirculation. Second, only gamma-variate IRFs were modeled in our phantom. A prior study showed that estimation of perfusion parameters with both model-independent and model-dependent deconvolution algorithms are dependent on the IRF model17. As such, the calibration curve may depend on the selected IRF model for generating the digital perfusion phantom. However, canonical IRF shapes may vary by tissue subtype and status, such as normal/ischemic grey and white matter. Incorporating these factors into a digital perfusion phantom would be challenging. The gamma-variate IRF model was therefore chosen for practicality and to be different from the IRFs used in the model-dependent deconvolution methods so as not to provide an advantage to model-based deconvolution. Third, we simulated a wider range of ground truth perfusion parameters in the digital perfusion phantom compared to prior studies17,21 to better balance the prevalence of low and high ground truth CBF and CBV in linear regression. Lastly, the injected Gaussian noise level of \(\sigma\) = 1.5 HU appears low compared to other studies17,21, but this noise level was chosen to match that of dynamic CTP images after standard Gaussian filtering at the strength used in our study (Supplemental Fig. 1). A fixed noise level, as opposed to one adjusted to meet a prescribed SNR17, is required to simulate realistic conditions in which low-CBV tissue would have lower SNR than that of normal tissue. By injecting a post-filtering noise level, the need to filter the dynamic phantom images was also obviated. Gaussian noise added to the digital perfusion phantom is spatially uncorrelated, so Gaussian filtering would be more effective in the phantom compared to that in patient CTP studies where noise is spatially correlated.
This study had limitations. First, we did not have access to RAPID CTP software and its predicted stroke lesion volumes and mismatch profiles. Instead, reference stroke lesion volumes and target mismatch profiles were generated with our in-house implementation of the RAPID CTP model-independent method19. However, we validated our ischemic core volume estimation against acute DWI, which achieved similar agreement to a previous validation study by Cereda et al. in which Bland–Altman mean difference (95% limits of agreement) between DWI versus RAPID CTP CBF < 30% was + 12.0 (− 25.9 to 49.2) ml7. Our study was not a definitive comparison against RAPID CTP software, but rather, a demonstration that better agreement in mismatch profiles can be achieved between CTP software by using our proposed phantom-based threshold calibration method. Direct comparison against RAPID CTP should be performed in future studies to determine calibrated thresholds equivalent to RAPID CTP ischemic core and penumbral thresholds. Second, as with previous studies validating CTP ischemic core thresholds, the DWI lesion volume used as the reference was not the true infarct volume at the time of CTP. Sources of error include infarct growth in the time interval between CTP and DWI, co-registration errors, manual segmentation errors, and potential DWI lesion reversal34, which could not be confirmed due to the absence of additional delayed imaging in this dataset7,26. Furthermore, there was no ground truth for penumbral lesions; as such, only the agreement of Tmax > 6 s lesions between deconvolution algorithms could be assessed. It remains to be determined whether lesion volumes from the calibrated threshold can correctly identify stroke treatment candidates and are associated with clinical outcomes. Additional validation on a large cohort of patients with acute ischemic stroke is warranted. Lastly, the median time from stroke onset to CT in the ISLES 2018 challenge dataset was 185 (IQR: 180–238) min, which was outside the 6–24-h CTP time window indicated by common best practice guidelines1. Other guidelines do not discourage CTP in the early time window2, and there are studies that suggest its benefit over standard neuroimaging alone35,36. Our calibration method nonetheless requires additional validation in patients presenting in the 6–24-h time window.
In conclusion, we investigated a method to systematically calibrate CTP thresholds for the estimation of ischemic core and penumbral thresholds based on CBF and Tmax parametric maps generated by different deconvolution software. The quantitative accuracy of estimating perfusion parameters for each deconvolution algorithm was benchmarked by a digital perfusion phantom, and these relationships were used to perform inter-software perfusion parameter threshold calibration. Validation on patient CTP studies suggested that the calibrated thresholds substantially standardize stroke lesion volume estimation compared to uncalibrated thresholds. As such, diagnostic agreement between CTP software was substantially improved after threshold calibration as assessed by target mismatch profiles. To further improve diagnostic consistency between CTP software, future studies are warranted to better characterize how choices of pre- and post-deconvolution processing algorithms affect the detection of diagnostic markers in acute ischemic stroke.
Data availability
The ISLES 2018 dataset can be requested from https://www.smir.ch/ISLES/Start2018.
References
Powers, W. J. et al. Guidelines for the Early Management of Patients With Acute Ischemic Stroke: 2019 Update to the 2018 Guidelines for the Early Management of Acute Ischemic Stroke: A Guideline for Healthcare Professionals From the American Heart Association/American Stroke Association. Stroke 50, (2019).
Boulanger, J. et al. Canadian stroke best practice recommendations for acute stroke management: Prehospital, emergency department, and acute inpatient stroke care, 6th edition, update 2018. Int. J. Stroke 13, 949–984 (2018).
Campbell, B. C. V. et al. Endovascular therapy for ischemic stroke with perfusion-imaging selection. N. Engl. J. Med. 372, 1009–1018 (2015).
Albers, G. W. et al. Thrombectomy for Stroke at 6–16 h with selection by perfusion imaging. N. Engl. J. Med. 378, 708–718 (2018).
Nogueira, R. G. et al. Thrombectomy 6–24 h after stroke with a mismatch between deficit and infarct. N. Engl. J. Med. 378, 11–21 (2018).
Ma, H. et al. Thrombolysis guided by perfusion imaging up to 9 h after onset of stroke. N. Engl. J. Med. 380, 1795–1803 (2019).
Cereda, C. W. et al. A benchmarking tool to evaluate computer tomography perfusion infarct core predictions against a DWI standard. J. Cereb. Blood Flow Metab. 36, 1780–1789 (2016).
Olivot, J.-M. et al. Optimal Tmax threshold for predicting penumbral tissue in acute stroke. Stroke 40, 469–475 (2009).
Lansberg, M. G. et al. RAPID automated patient selection for reperfusion therapy: A pooled analysis of the echoplanar imaging thrombolytic evaluation trial (EPITHET) and the diffusion and perfusion imaging evaluation for understanding stroke evolution (DEFUSE) study. Stroke 42, 1608–1614 (2011).
Dehkharghani, S. et al. Performance and predictive value of a user-independent platform for CT perfusion analysis: Threshold-derived automated systems outperform examiner-driven approaches in outcome prediction of acute ischemic stroke. AJNR Am. J. Neuroradiol. 36, 1419–1425 (2015).
Austein, F. et al. Comparison of perfusion CT software to predict the final infarct volume after thrombectomy. Stroke 47, 2311–2317 (2016).
Bivard, A., Levi, C., Spratt, N. & Parsons, M. Perfusion CT in acute stroke: A comprehensive analysis of infarct and penumbra. Radiology 267, 543–550 (2013).
Koopman, M. S. et al. Comparison of three commonly used CT perfusion software packages in patients with acute ischemic stroke. J. NeuroIntervent. Surg. 11, 1249–1256 (2019).
Rava, R. A. et al. Assessment of a Bayesian Vitrea CT perfusion analysis to predict final infarct and penumbra volumes in patients with acute ischemic stroke: A comparison with RAPID. AJNR Am. J. Neuroradiol. 41, 206–212 (2020).
Muehlen, I. et al. Comparison of two automated computed tomography perfusion applications to predict the final infarct volume after thrombolysis in cerebral infarction 3 recanalization. Stroke 53, 1657–1664 (2022).
d’Esterre, C. D. et al. Time-dependent computed tomographic perfusion thresholds for patients with acute ischemic stroke. Stroke 46, 3390–3397 (2015).
Kudo, K. et al. Accuracy and reliability assessment of CT and MR perfusion analysis software using a digital phantom. Radiology 267, 201–211 (2013).
Meier, P. & Zierler, K. L. On the theory of the indicator-dilution method for measurement of blood flow and volume. J. Appl. Physiol. 6, 731–744 (1954).
Straka, M., Albers, G. W. & Bammer, R. Real-time diffusion-perfusion mismatch analysis in acute stroke. J. Magn. Reson. Imag. 32, 1024–1037 (2010).
Nambu, K., Takehara, R. & Terada, T. A method of regional cerebral blood perfusion measurement using dynamic CT with an iodinated contrast medium. Acta Neurol. Scand. 93, 28–31 (1996).
Bennink, E. et al. Fast nonlinear regression method for CT brain perfusion analysis. J. Med. Imag. 3, 026003 (2016).
Lee, T.-Y., Yang, D. M., Li, F. & Marants, R. CT perfusion techniques and applications in stroke and cancer. In Computed Tomography: Approaches, Applications, and Operations (eds Samei, E. & Pelc, N. J.) 347–365 (Springer International Publishing, 2020).
Wu, O. et al. Tracer arrival timing-insensitive technique for estimating flow in MR perfusion-weighted imaging using singular value decomposition with a block-circulant deconvolution matrix: Technique for Estimating CBF in PWI. Magn. Reson. Med. 50, 164–174 (2003).
Lawson, C. L. & Hanson, R. J. Solving Least Squares Problems (SIAM, 1995).
Lassen, N. A. Normal average value of cerebral blood flow in younger adults is 50 ml/100 g/min. J. Cereb. Blood Flow Metab. 5, 347–349 (1985).
Hakim, A. et al. Predicting infarct core from computed tomography perfusion in acute ischemia with machine learning: Lessons from the ISLES challenge. Stroke 52, 2328–2337 (2021).
Lansberg, M. G. et al. MRI profile and response to endovascular reperfusion after stroke (DEFUSE 2): A prospective cohort study. Lancet Neurol. 11, 860–867 (2012).
Lin, L., Bivard, A., Levi, C. R. & Parsons, M. W. Comparison of computed tomographic and magnetic resonance perfusion measurements in acute ischemic stroke: Back-to-back quantitative analysis. Stroke 45, 1727–1732 (2014).
Astrup, J., Symon, L., Branston, N. M. & Lassen, N. A. Cortical evoked potential and extracellular K+ and H+ at critical levels of brain ischemia. Stroke 8, 51–57 (1977).
Landis, J. R. & Koch, G. G. The measurement of observer agreement for categorical data. Biometrics 33, 159 (1977).
Campbell, B. C. V. et al. Cerebral blood flow is the optimal CT perfusion parameter for assessing infarct core. Stroke 42, 3435–3440 (2011).
Bivard, A. et al. Ischemic core thresholds change with time to reperfusion: A case control study. Ann. Neurol. 82, 995–1003 (2017).
Ospel, J. M. et al. Impact of multiphase computed tomography angiography for endovascular treatment decision-making on outcomes in patients with acute ischemic stroke. J. Stroke 23, 377–387 (2021).
Yoo, J. et al. Ischemic diffusion lesion reversal after endovascular treatment: Prevalence, prognosis, and predictors. Stroke 50, 1504–1509 (2019).
Tan, Z. et al. Comparison of computed tomography perfusion and multiphase computed tomography angiogram in predicting clinical outcomes in endovascular thrombectomy. Stroke 53(9), 2926–2934 (2022).
Dhillon, P. S. et al. Perfusion imaging for endovascular thrombectomy in acute ischemic stroke is associated with improved functional outcomes in the early and late time windows. Stroke 53(9), 2770–2778 (2022).
Acknowledgements
The authors acknowledge the organizers of the ISLES 2018 Challenge for access to the invaluable imaging dataset. Funding support was gratefully provided by the Canadian Institutes of Health Research (CIHR), Canada Foundation for Innovation (CFI), and Ontario Research Foundation (ORF). K.J.C. was supported in part by a CIHR Canada Graduate Scholarship-Doctoral (CGS-D).
Author information
Authors and Affiliations
Contributions
K.J.C. and T.Y.L. contributed to all aspects of the study. D.D.S. assisted with data collection and analysis and critical revision of the manuscript.
Corresponding author
Ethics declarations
Competing interests
Dr. Ting-Yim Lee licenses CT Perfusion 4D software to GE Healthcare. The other authors declare that there is no conflict of interest.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Chung, K.J., De Sarno, D. & Lee, TY. CT perfusion stroke lesion threshold calibration between deconvolution algorithms. Sci Rep 13, 21458 (2023). https://doi.org/10.1038/s41598-023-48700-6
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-023-48700-6
This article is cited by
-
Multiphase CT angiography perfusion maps for predicting target mismatch and ischemic lesion volumes
Scientific Reports (2023)
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.