Introduction

Cardiovascular magnetic resonance (CMR) imaging has been established as a key imaging modality for comprehensive myocardial function assessment in clinical cardiology over the past decades1,2. Within the scope of cardiac deformation analyses, especially CMR-derived myocardial strain assessments have been proven to possess accurate diagnostic capabilities and to provide substantial prognostic information3,4,5,6. Despite several clinically approved and widely used imaging biomarkers, efforts and new developments in non-invasive imaging strive for identifying even more sensitive and precise parameters of subclinical failing cardiac performance7. In this context, hemodynamic force (HDF) assessment provides novel and promising imaging parameters representing the force exchange between ventricular blood and surrounding myocardium but required knowledge of intracardiac velocity fields measured by CMR-based 4D flow tools up to now. Recently, CMR-based HDF analyses derived from routinely acquired CMR cine images have been introduced allowing non-invasive estimation of intracardiac pressure gradients solely based on LV geometry, endocardial tissue motion as well as aortic and mitral valve orifice areas8,9. Based on the assumption that HDF is mainly influenced by flow properties following myocardial deformation and the exchange of momentum across the mitral and aortic orifices, this novel approach estimates main features of HDF vectors from endocardial tissue dynamics. Of note, this novel method has been proven to possess high accuracy when validated against conventional 4D flow-based assessments10.

Importantly, HDF analyses were recently demonstrated to improve detection of early cardiac disease and myocardial dysfunction when volumetric and deformation cardiac measures are still intact9. There is even evidence to suggest that this technology identifies subtle systolic alterations in heart failure with preserved ejection fraction (HFpEF) that cannot be picked up with alternative technology11,12. However, for an unrestricted clinical use and in order to provide robust and valid measurements, image post-processing applications must be sufficiently reproducible13.

Consequently, the aim of this study was to evaluate the inter-study reproducibility of CMR cine image-based HDF assessments and to define the current potential and limitations of this novel deformation imaging tool.

Methods

Study population

The study population included 11 healthy participants (6 males and 5 females) with a mean age of 46 years (± 23 years). Study participants underwent CMR twice at a median interval of 63 days (IQR 49–87 days) using a standardized CMR scan protocol14,15. To avoid a recollection bias of the involved CMR staff a minimum time gap of 6 weeks between both scans was determined. Particular emphasis was laid on performing image acquisitions at the same anatomical levels of the heart. Furthermore, new onset of a cardiac disease was excluded before the second CMR scan. All subjects had stable sinus rhythm during CMR acquisition. The study was approved by the Ethics Committee of the Charité-University Medicine Berlin (registration: EA4/112/16; German Clinical Trials Registry, DRKS, DRKS00015615). All subjects gave written informed consent before participation and the study was conducted according to the principles of the Helsinki Declaration.

Cardiovascular magnetic resonance imaging

All subjects underwent CMR imaging using a standardized scanning protocol on a 1.5 Tesla scanner (Achieva, Philips Healthcare, Best, The Netherlands) equipped with a cardiac, five-element phased array coil. Conventional retrospectively ECG-gated balanced steady state free precession (bSSFP) cine sequences were acquired during breath-hold for the assessment of myocardial function. 2-dimensional cine sequences included 2-, 3- and 4- chamber views (CV) in long axis (LAX) orientation as well as short axis (SAX) stacks covering the entire left ventricle. Imaging parameters were as follows: 50 frames/ cardiac cycle, repetition time (TR) = 3.3 ms, echo time (TE) = 1.6 ms, flip angle = 60° and acquisition voxel size = 1.8 × 1.7 × 8.0 mm3. The same imaging parameters were applied for baseline and repeated CMR scans.

Dedicated commercially available software was used for CMR image post-processing (QStrain® and HDF module, Medical Imaging Systems, Leiden, Netherlands). Movements from deformation imaging of 2-,3- and 4-CV were used for apical-basal LV HDF calculation after initial feature-tracking (FT)16 and measurements of mitral and aortic valve width (Fig. 1). HDF evaluations report an integral of LV pressure gradients. To correct for different LV sizes and to facilitate comparisons between participants HDF evaluations are normalized to LV volume and blood specific weight and presented in percentage of gravity acceleration. Detailed HDF calculations and more mathematical features of the method transforming endocardial dynamics into flow forces have been described elsewhere17,18.

Figure 1
figure 1

Myocardial delineation for hemodynamic force analysis. Delineated epi- and endocardial contours in long axis 2-/3-/4-chamber view (CV) orientations during enddiastole (ED) and endsystole (ES) with a corresponding hemodynamic force (HDF) curve.

In brief, the general relationship between pressure gradient and velocity field can be derived from the Navier‐Stokes equation. The original formula required blood velocity (e.g., measured by 4D flow) but could be adapted so that this variable can be computed from an integral over the LV endocardial boundary allowing evaluation HDF from the knowledge of the moving ventricular geometry and the valve orifice. The HDF calculations in this study comprised the longitudinal force over an entire cardiac cycle with the direction of HDF pointing from higher towards lower pressure areas. Briefly explained, during the beginning of systole, apical-basal pressure gradients cause a positive deflection of the HDF curve resulting in blood ejection from the LV. After reaching the peak of systolic impulse, tension of the LV contraction decreases and the apical-basal gradient changes into a descending but still positive HDF curve.

Subsequently, with decelerating ventricular flow and aortic pressure surpassing the ventricular pressure, the apical-basal gradient reverses, which is depicted by the first half of the negative systolic-diastolic transition curve. After the aortic valve has closed, diastole begins and caused by relaxation and recoil of the LV myocardium, an early diastolic suction occurs generating a basal–apical gradient, that is represented by the second half of the negative systolic-diastolic transition curve. Subsequent diastolic deceleration is characterized by passive LV filling and upward movement of the mitral plane. In this phase, first LV pressure increases from apex to base and makes HDF grow in the positive ascending phase. By exceeding the atrial pressure LV filling decelerates and reduced passage of blood from the atrium to the LV progressively equilibrates the pressures in both chambers resulting in a positive but descending phase on the HDF curve. Atrial thrust reflects late diastolic filling due to atrial contraction, that causes a relative gradient from apex to base, resulting in negative HDF vectors. Finally, as the blood accumulates in the LV chamber, the gradient reverses again and the HDF vector becomes positive before systolic impulse begins18 (Fig. 2).

Figure 2
figure 2

Left ventricular hemodynamic force analysis in apical-basal direction. Illustration of an exemplary hemodynamic force (HDF) curve for the left ventricular apical-basal motion over a whole cardiac cycle. Measured quantitative HDF metrics correspond to the area under the curve of each HDF phase.

The mean longitudinal force amplitude over the entire cardiac cycle is expressed as a dimensionless root mean square (RMS) considering both positive and negative values. The peak of the systolic impulse curve was defined as systolic peak HDF. All HDF parameters were calculated from the area under the curve (AUC) normalised to the respective time interval and are reported as average values based on three independently repeated measurements19. In addition to the HDF measurements LV global longitudinal strain (GLS) values were assessed in LAX 2-, 3- and 4-CV cine images20.

Post-processing was performed by an experienced observer. The same observer repeated feature-tracking-based myocardial border delineation as well as measurements of mitral and aortic valve width on the same data-set to assess intra-observer variability. Similarly, a second observer performed feature-tracking in LAX orientations and measurement of valve widths for the calculation of inter-observer reproducibility.

Volumetric analyses were performed in LV SAX orientations comprising LV enddiastolic/-systolic and stroke volumes (EDV/ESV/SV) as well as LV ejection fraction (EF) and mass.

Statistical analyses

Statistical analyses were performed using SPSS version 28.0 (IBM, Armonk, New York, USA) and Microsoft Excel. All p-values are provided 2-tailed and an alpha level < 0.05 was considered statistically significant. Continuous parameters are reported as mean along corresponding standard deviations (SD). For dependent continuous parameters changes from Exam 1 to 2 were evaluated applying the Wilcoxon signed-rank test.

Inter-study and inter-observer variability was analysed using intra-class correlation coefficients (ICC) based on absolute agreement (excellent for ICC > 0.74, good between 0.60 and 0.74, fair between 0.4 and 0.59 and poor below 0.4)13, coefficient of variation (CoV, SD of mean difference divided by the mean (SD [mean difference])/mean) and Bland–Altman plots (mean difference between measurements with 95% confidence interval [CI])21.

Study samples required to show a respective 10%, 15% and 20% relative change in HDF measurements with a power of 90% and an α error of 0.05 were calculated as follows22,23:

$${\text{n}} = {\text{f}}\;\left( {\alpha ,{\text{ P}}} \right)*\sigma^{2} *\left( {2/\delta^{2} } \right)$$

where n is the sample size, α the significance level, P the study power required and f the value of the factor for different values of α and P (f = 10.5 for α = 0.05 and p = 0.90), with σ the inter-study SD and δ the desired difference to be detected23.

Results

Cardiovascular magnetic resonance imaging

Results of CMR analyses of both exams are summarized in Table 1. HDF profiles of all repetitions are presented in Fig. 3. Neither LV volumes nor LV GLS differed significantly between both exams, only LV mass showed a significant difference (91.3 vs. 88.2 g/m2; p = 0.02). Amongst HDF parameters, there were significant differences between both exams for the values of LV longitudinal force (p = 0.03), systolic peak (p = 0.02) and systolic impulse (p = 0.04), while the other HDF measurements did not differ significantly.

Table 1 Cardiovascular magnetic resonance image analyses.
Figure 3
figure 3

Hemodynamic force analyses profiles. Hemodynamic force (HDF) analyses profiles are displayed for both exams. The blue curve (first exam) and red curve (second exam) represent the respective average of the whole study group with the respective 95% confidence interval of the measurements (A). HDF analyses profiles for intra- (B) and inter-observer (C) measurements based on the first exam each representing the averages of the whole study group with the respective 95% confidence intervals (please see colour codes).

Inter-study and observer reproducibility

Results of inter-study reproducibility are reported in Table 2 and Fig. 4. Overall reproducibility was excellent for all HDF values according to ICC (ICC ≥ 0.80 for all). Atrial thrust (CoV 30.9%), the ratio of diastolic deceleration/atrial thrust (CoV 25.9%) and diastolic deceleration (CoV 22.2%) showed notably higher inter-study variability than systolic HDF measures (systolic impulse: CoV 15.2% and systolic peak: 15.6%).

Table 2 Inter-study reproducibility.
Figure 4
figure 4

Inter-study agreement of hemodynamic force analyses. Bland Altman plots with limits of agreement (95% confidence intervals) showing inter-study reproducibility of hemodynamic force (HDF) analyses. Delta values (Δ) displaying the difference for inter-study measurements. RMS root mean square, LV left ventricular.

Intra- and inter-observer reproducibility are summarized in Tables 3 and 4 as well as in Fig. 5 and 6. Both overall intra- and inter-observer reproducibility was excellent (ICC > 0.80 for all) with intra-observer assessments showing slightly lower variability compared to inter-observer analyses. Amongst intra-observer measurements LV longitudinal force showed the highest reproducibility (CoV 7.6% and ICC 0.98 [0.91–0.99]), while LV systolic/diastolic transition was most reproducible amongst inter-observer measurements (CoV 8.1% and ICC 0.97 [0.89–0.99]).

Table 3 Intra-observer reproducibility.
Table 4 Inter-observer reproducibility.
Figure 5
figure 5

Intra-observer agreement of hemodynamic force analyses. Bland Altman plots with limits of agreement (95% confidence intervals) showing intra-observer reproducibility of hemodynamic force (HDF) analyses. Delta values (Δ) displaying the difference for intra-observer measurements. RMS root mean square, LV left ventricular.

Figure 6
figure 6

Inter-observer agreement of hemodynamic force analyses. Bland Altman plots with limits of agreement (95% confidence intervals) showing inter-observer reproducibility of hemodynamic force (HDF) analyses. Delta values (Δ) displaying the difference for inter-observer measurements. RMS root mean square, LV left ventricular.

Sample size calculations

Sample sizes required for the detection of 10%, 15% or 20% relative changes in follow-up studies differed between HDF parameters (Table 5). While relatively fewer subjects allow sufficient detection of changes for systolic HDF parameters (ranging from n = 12 for the detection of a 20% relative change in systolic impulse to n = 53 for the detection of 10% relative change in LV longitudinal force) comparatively more subjects were required for diastolic parameters (ranging from n = 26 for the detection of a 20% relative change in diastolic deceleration to n = 200 for the detection of a 10% relative change in atrial thrust.

Table 5 Sample sizes.

Established imaging parameters LVEF and LV GLS required lower sample sizes (LVEF = 2, LV GLS = 22) (Table 5).

Discussion

This study aimed to assess the inter-study reproducibility of novel CMR-based HDF analyses derived from CMR cine images and has several notable findings. Firstly, the overall inter-study reproducibility of HDF parameters between repeated exams was sufficient with systolic HDF parameters having lower inter-study variation than diastolic HDF measurements. Secondly, smaller required sample sizes for the detection of relative changes of HDF parameters were demonstrated for systolic HDF parameters. Thirdly, overall intra- and inter-observer reproducibility for all HDF measures was excellent based on absolute agreement.

Specific blood motion patterns inside the ventricular chamber are responsible for flow-mediated forces that can be assessed by HDF analyses and were shown to be altered in failing myocardial performance24. Since the usage of three-dimensional/ three directional phase contrast based 4D flow techniques requires time-consuming procedures for both acquisition and post-processing25, advances in non-invasive CMR image post-processing software enable assessments of HDF from the motion of the LV endocardial boundary in routinely acquired cine images without the need of complex and time-consuming 4D blood flow measurements10.

However, at present CMR-FT-based HDF analyses are scarcely used in clinical routine and inter-study assessments for these novel parameters have not been reported yet. In our study, besides an overall sufficient inter-study reproducibility according to ICC, systolic HDF parameters showed lower inter-study variability than diastolic analyses. Similarly, required sample sizes varied between the respective HDF values and smaller sample sizes to detect relative changes of HDF measurements were observed for systolic HDF parameters. On the one hand, the relatively small study sample of the current work needs to be considered limiting a more precise detection especially of smaller absolute values and subsequently would also not allow a more precise detection of for example a 5% relative change13. On the other hand, required sample sizes to detect changes of other conventional imaging parameters (e.g. LVEF: 2 or LV GLS: 22 for a 10% relative change) were in line with preceding larger studies and therefore reported data can be supposed to reflect valid sample size results of HDF measurements as well22,26. Furthermore, previous studies assessing inter-study reproducibility of CMR-derived myocardial dyssynchrony, torsion, atrial or segmental strain on the basis of similar study participant numbers documented even larger required sample sizes (> 100) to detect similar changes for these parameters13,27,28,29. Consequently, the required sample sizes for HDF analyses can be considered within acceptable limits amongst deployed CMR parameters.

In general, comprehensive assessment and knowledge of inter-study reproducibility is a key element for successful implementation and application of a novel technique in clinical routine. Especially for the detection and monitoring of cardiovascular diseases, serial CMR examinations and subsequent post-processing rely on the assumption that changes of the imaging parameters are reliably detectable30.

Recently, first studies have demonstrated HDF analyses to unmask subtle impaired early diastolic filling in HFpEF patients and identifying those at an earlier stage of the disease cascade7. In this context, HDF analyses have been proven to outperform conventional CMR-based volumetric and deformation analyses for the detection of both systolic and diastolic impairment11.

Thus, HDF analyses could enable precise non-invasive monitoring of declining function in these patients. Furthermore, impaired HDF values were previously demonstrated to be associated with adverse LV remodelling after acute myocardial infarction and, moreover, HDF measurements were even suggested as potential early predictors of adverse outcome in cardiovascular diseases31. Consequently, CMR-based non-invasive HDF analyses possess the potential to become a clinically useful and important imaging technique for monitoring disease progression, potential treatment efficacy or deciding on the timing of therapy underlining the importance of sufficient reproducibility assessment. However, it must be noted that at present the inter-study reproducibility of HDF measurements was demonstrated to be lower compared to most commonly used LV strain assessments13,16,29. In this context there are several potential explanations that need to be considered when analysing HDF for deformation analyses. Since HDF assessments are based on CMR-FT measurements, they are likely to depend on known similar variability and limitations (e.g., through- or out-of-plane displacements of myocardial borders) of this technique32. However, the values of LV GLS did not significantly differ between both exams in our study while at least systolic HDF measurements did, indicating higher robustness of improved conventional strain analyses over just recently developed HDF assessments. In fact, the reproducibility of CMR-based strain analyses has considerably improved since their introduction and first clinical applications13. Similar advances might be anticipated for HDF analyses both by future software refinements and algorithm updates. Compared with strain the analyses of HDF parameters require measurements of mitral and aortic valve diameters implying an additional source of variability. This is particularly true because even minimal variations of anatomical levels occurring during image acquisition have significant impact on valve orifice measurements and subsequent HDF calculations. In this context, automated or preconfigured valve size calculation might also lead to improvements of reproducibility in the future.

Of note, while the current cine image-based HDF measurement approach requires a complete image set of 2-, 3- and 4-chamber LAX orientations, strain analyses are even possible in single image orientations and consequently allow regional assessments as compared to global HDF analyses33.

Besides the assessment of reproducibility, the current data suggest potential sample size adjustments, that need to be considered when applying different HDF parameters in clinical routine or studies with repeated examinations. The individual clinical usefulness of the different systolic or diastolic HDF parameters will need to be addressed in future clinical studies.

Interestingly, some discrepancies for 4D flow and cine image-derived HDF parameters detecting heart failure have been described34. In healthy hearts longitudinal shortening predominantly contributes to ventricular contraction, which is attenuated in heart failure or acute myocardial injury. While the 4D flow method for HDF quantification is more unlikely to be affected by decreased longitudinal contraction (at an early stage), it is possible that the endocardial dynamics-based model is more sensitive to such alterations25. On the other hand, the cine image-based approach might apply better in regular ventricular geometry whereas a more complex myocardial anatomy or pathological alterations could limit its applicability8. Considering these potential advantages and disadvantages of 4D flow and cine image-derived HDF estimations, their accuracy, comparability and interchangeability (in different cardiac diseases and altered loading conditions) have to be investigated by future studies.

Beyond the evaluation of the technique´s accuracy the current findings are also relevant for the application of these imaging biomarkers in clinical trials since higher reproducibility and increased reliability of an imaging technique could result in potentially higher cost-efficiency due to required smaller study populations35. In this context it is important to note, that the conditions of healthy subjects (e.g. stable lower heart rate or good scan compliance) might differ in patients and consequently results of the current work might not be fully transferrable to other (patient) cohorts. Furthermore, it is interesting to speculate whether a transfer and application of the mathematical models10 used by the software for CMR-FT-based HDF assessment to other myocardial strain evaluation techniques like tagging or strain-encoded (SENC) deformation imaging (both with even more challenging valve orifice assessments) would be feasible providing similar reproducibility for HDF parameters. Furthermore, the influence of CMR field strength and/or temporal as well as spatial resolution might be addressed by future studies36,37.

Beyond future technical refinements potentially improving reproducibility, further enhanced software applicability might be attained by fully automated HDF analyses. Artificial intelligence-based CMR-FT strain and volumetric analyses have been already shown to be feasible and to possess equally high accuracy for risk prediction compared to manual approaches38,39. In addition, implementation of HDF parameters beyond CMR-FT-derived strain values in novel risk prediction models might enable more precise myocardial shape and contraction analyses as well as improved risk stratification40,41.

Further and larger studies are needed to validate these findings and to enhance clinical applicability as well as utility of novel CMR-based HDF analyses.

Study limitations

The main limitation of this study is its small sample size and that derived conclusions are based on the inclusion of healthy volunteers rather than patients. Consequently, it will be interesting to evaluate whether the results can be extrapolated to patients with (distinct) myocardial dysfunction. However, it is common and important to assess novel post-processing tools in healthy volunteers at first before extending the application on various cardiovascular diseases. Only in this way it is possible to understand the new image analysis method and to provide a solid basis for further technique improvement and subsequent studies including different patient collectives.

Conclusion

CMR-FT-based non-invasive HDF analyses possess an overall good inter-study reproducibility with systolic HDF measurements showing lower inter-study variability than diastolic HDF parameters. Accordingly, at present smaller study sample sizes are required to detect relative changes for systolic HDF values compared to diastolic HDF metrics. Inter-study variability might benefit from further software refinements and future validation studies are required to enable a widespread and unrestricted clinical application of CMR-based HDF analyses.