Introduction

Speckle tracking echocardiography (STE) is an advanced echocardiographic methodology for the non-invasive determination of myocardial performance parameters, namely strain and strain rate1. It can be used to identify early subclinical cardiac dysfunction in asymptomatic patients with chronic disease and unremarkable conventional echocardiography such as adult patients with arterial hypertension2 and children with type 1 diabetes mellitus3. Moreover, STE is sensitive enough to detect even discretely altered cardiac mechanics due to transient changes in blood sugar levels4. Furthermore, STE was utilized to demonstrate subclinical myocardial alterations due to primarily non-cardiac diseases like inflammatory bowel disease in both adult and pediatric patient populations5,6.

Recently, in a pursuit to increase the diagnostic yield of functional cardiac imaging, STE is increasingly used in combination with dobutamine or ergometer stress testing7. The mainstay of this combination in adult medicine is non-invasive workup for coronary artery disease8. In pediatric cardiology, strain imaging has been implemented into clinical practice, e.g. to monitor cardiotoxicity in chemotherapy patients9. Recently, the hybrid use of speckle tracking imaging and stress echocardiography in children has become a promising method to unmask potential cardiac dysfunction that might remain occult at rest10. However, in contrast to diagnostic procedures in adult patients, accurate echocardiographic image acquisition in children is oftentimes more challenging. This is partly due to limited compliance during long-lasting examinations, a lack of understanding of the significance of a diagnostic procedure or a low threshold for personal discomfort. Nevertheless, reproducibility and the significance of image quality during pediatric bicycle ergometer stress testing are still poorly studied. This is the first study to assess the relation of echocardiographic image quality and inter- and intraobserver reproducibility of STE in combination with stress echocardiography in children.

Methods

Study population

We enrolled 127 children in this study. Inclusion criteria were good physical health and a written consent by the child itself and by the legal guardian(s). Primary exclusion criteria were the presence of any debilitating symptoms such as fever, pain, fatigue or other past or present health conditions likely affecting physical fitness or the cardiovascular system. This included but was not limited to kidney disease, LV dysfunction, acquired valvular disease, congenital heart disease, developmental delay, body mass index >30 kg/m2, pathologic ECG-changes as well as technical limitations such as short leg length or submaximal effort during exercise testing. 15 patients were excluded from the study during the echocardiographic examination due to several reasons: inadequate cycling effort (n = 2), heart disease (n = 1) or insufficient echocardiographic image quality (n = 12).

Initially, all study subjects were assessed with a thorough medical history and physical examination followed by ECG and standard echocardiography. Prior to enrollment, every child as well as their legal guardian had signed a written informed consent. The study sample size of 127 participants was achieved through assumption of power analysis under the consideration of the necessary number of children to form a representative random sample on the one hand as well as keeping the total participants number manageable for a single-center study, on the other hand. A priori, a study design was established sub-categorizing the study cohort into groups of flawless, near-optimal and sub-optimal echocardiographic image quality.

The study was carried out in accordance with the declaration of Helsinki’s ethical principles for medical research involving human subjects and approved by the Witten/Herdecke University ethics committee (clinical trial number: 103/2014).

Conventional echocardiography

We performed a comprehensive echocardiographic study including spectral and color flow Doppler examination in accordance with the standard guidelines of the American Heart Association in all included study participants11. The commercially available ultrasound device iE33 by Phillips Ultrasound Inc., USA, with a S5–1 Sector Array transducer (Sector 1–5 MHz) was used. All images were digitally recorded and subsequently transferred to an offline workstation for later analyses, using XCelera Version 3.1.1.422 by Phillips Ultrasound Inc., USA. Images were acquired in the apical 4-, 3- and 2-chamber views, the parasternal long axis view and in two short axis views at the mitral level and at the level of the papillary muscles. M-mode images were obtained at the level of the aortic valve and the LV for subsequent measurement of aortic root diameter, left atrial diameter, interventricular septum, LV cavity and LV posterior wall. Fractional shortening, LV mass, relative wall thickness, LV enddiastolic/endsystolic volume, ejection fraction (EF), stroke volume and cardiac output were assessed. EF was calculated using the modified Simpson’s biplane method. Utilizing PW-Doppler and PW-TDI E/A-ratio, E/E′-ratio and mitral deceleration time were detected for the assessment of LV diastolic function as described elsewhere12. All echocardiographic parameters were evaluated utilizing Z-scores13.

Speckle tracking echocardiography

Standard cross-sectional 2D grayscale LV images were acquired for myocardial deformation (strain and strain rate) analyses. Using conventional B-Mode imaging longitudinal strain and strain rate were measured in standard apical 4-chamber (AP4), 3-chamber (AP3) and 2-chamber (AP2) views as previously described in detail14. Specifically, circumferential strain (CS) was measured in the standard parasternal short-axis at the mitral valve plane (SAXB) and the papillary muscle plane (SAXM). As recently suggested, 3–4 consecutive cardiac cycles synchronized to a continuous ECG were recorded with frame rate adjusted between 60 and 90 frames per second15,16. To achieve accurate deformation parameters, a special focus was set to avoid noise and minimize artifacts during the entire process of echocardiographic image acquisition. Data was anonymized, digitally stored in DICOM format and transferred to an off-line workstation for postprocessing utilizing the commercially available software QLAB Version 10. Global and segmental strain and strain rate were assessed in seven segments per view for longitudinal strain (LS) and six segments for CS by semi-automated tracing of the endocardial border line at end-diastole. Tissue tracking quality was verified in real-time and full thickness coverage of the myocardium including the endocardial and epicardial contours was readjusted manually where necessary. A representative echocardiographic image example of STE assessment in a healthy child during bicycle stress testing is given in Figure S1.

Stress echocardiography

To unmask potential abnormalities in myocardial performance that might remain undiscovered at rest, and to evaluate STE reproducibility during stress testing, we additionally exposed the study population to bicycle ergometer stress testing and performed STE. A schematic diagram outlining the experimental setup of STE application during stress testing is provided in Fig. 1. After completion of resting echocardiography image acquisition, the children were asked to start peddling in a semisupine position on a standard bicycle ergometer at 60 rounds per minute. We utilized a standard ramp protocol with inclining resistance (25 Watts every 2 minutes). Echocardiographic images for STI analysis were acquired at baseline and at two different stress levels: at an intermediate level (approximately 0.5–1 Watt per kilogram body weight) and at the maximum level of physical exhaustion (approximately 1.5–2 Watts per kilogram body weight). A standardized pattern of consecutive images was acquired at all three levels in the above-mentioned viewing planes (SAXB, SAXM, AP4, AP2, and AP3). Peripheral blood pressure and heart rate measurements were obtained at 2-minute intervals and a 3-channel ECG was continuously monitored.

Figure 1
figure 1

Schematic diagram outlining the experimental setup of speckle tracking echocardiography at baseline and during bicycle ergometer stress testing.

Image quality scoring

A previously published score of “0”–“3” based on myocardial wall visualization was used by a trained echocardiographer to rate echocardiographic image quality17. A score of “3” was assigned to images with >95% myocardial wall visualization, “2” was given when 70–95% of the relevant wall structures were visualized, “1” referred to images of <70% wall visibility. A score of “0” referred to images without clear endocardial border delineation or absent views.

Assessment of reproducibility

Inter- and intraobserver variabilities were assessed by additional evaluation of baseline and exercise echocardiographic images by a second, independent interpreter, who was blinded to the results of the first echocardiographic reader. In detail, two experienced echocardiographers performed the exact same procedure measuring peak LV strain and strain rate values in all five viewing planes as specifically described above. Baseline and stress test images were assessed this way in all study subjects in a blinded manner. To determine intraobserver variability, one echocardiographer repeated the measurements in a randomly selected order with a minimum of 48 hours between the two analyses. Here, circumferential strain and strain rate were measured in the SAXM view and longitudinal strain and strain rate were repeatedly assessed in the apical 4- chamber view.

Biostatistical analyses

Demographics, clinical parameters, hemodynamics and echocardiographic data were described as mean and standard deviation. Clinical, hemodynamic and echocardiographic data of the three groups were compared utilizing the ANOVA test. P-values <0.05 constituted statistical significance. The data distribution was graphically displayed using Bland-Altman and correlation graphs. Linear regression analyses and Pearson’s correlation was performed to assess the potential association of echocardiographic variables and clinical parameters. Stata Version 13 (College Station, TX) and Microsoft Excel Version 16.0 for PC were used for all statistical analyses.

Ethics approval and consent to participate

The study was carried out in accordance with the declaration of Helsinki’s ethical principles for medical research involving human subjects and approved by the Witten/Herdecke University ethics committee (clinical trial number: 103/2014).

Consent for publication

Consent for publication was obtained from all included children, where applicable, as well as from their parents or legal guardians.

Availability of data and material

The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.

Results

Patient characteristics

127 children were included in this prospective study. Mean age ( ± standard deviation) was 13.4 ± 3.0 years and gender distribution was almost equal (50.4% female). All included study subjects had no signs of structural heart disease, and their anthropometric data was in accordance with their age and maturation level as evaluated by Z-scores. Mean tanner puberty stage was 2.83 ± 1.25, mean height was 159.3 ± 16.2 cm and mean body weight was 52.6 ± 17.6 kg. On average, the children participated in physical activity 1–2 times per week (Table S1). For all patients the mean heart rate, systolic and diastolic blood pressure at rest were 73 ± 11 beats/min, 111 ± 12 mmHg and 65 ± 9 mmHg, respectively. The mean heart rate increased to 105 ± 11 beats/min at low stress (0.5 ± 0.2 W/kg body weight) and 148 ± 17 beats/min at the maximal stress level (1.6 ± 0.6 W/kg body weight).

Conventional echocardiographic parameters

Conventional echocardiographic parameters were all normal as evaluated by Z-scores (Table S2)18. Specifically, mean LA/Ao ratio was 1.02 ± 0.17, mean estimated LV mass was 121.0 ± 43.5 g and mean LV stroke volume was calculated as 65.9 ± 26.0 ml. The conventional echocardiographic systolic performance indices fractional shortening and LV ejection fraction were 34.43 ± 3.9% and 61.0 ± 4.0%, respectively. Mean mitral annular plane systolic excursion (MAPSE) was 1.51 ± 0.17 cm. Diastolic function was normal as determined by mean E/A ratio of 1.85 ± 0.35 and E/E′ ratio of 8.23 ± 1.44.

Speckle tracking stress echocardiography

Peak LV global strain and strain rate were detected in all standard echocardiographic planes as specified above. Mean values and standard deviations are indicated for each individual echocardiographic viewing plane and for global strain and global strain rate in the resting state as well as under low and high resistance during exercise testing. As expected, peak LV strain rate was significantly increasing with inclining cycling resistance both longitudinal (−1.36 ± 0.38 vs. −1.54 ± 0.38 vs. −1.83 ± 0.5 s−1) and circumferential (−1.64 ± 0.38 vs. −1.79 ± 0.45 vs. −1.79 ± 0.45 s−1) throughout all echocardiographic viewing planes (p < 0.01, Figure S2, Table S3). Peak LV longitudinal strain did not differ significantly between the various stress levels (Table S4).

Strain and strain rate inter- and intraobserver reproducibility at rest and during stress testing

Overall inter- and intraobserver reproducibility for LV global longitudinal strain and strain rate at rest and during exercise testing are given numerically in Tables 1 and 2 and as Bland-Altman graphs in Figs 25. The latter include mean difference of the average values and limits of agreement for inter- and intraobserver variability for circumferential strain (Fig. 2), circumferential strain rate (Fig. 3), longitudinal strain (Fig. 4) and longitudinal strain rate (Fig. 5) at baseline and during stress testing. The mean difference represents bias. A difference of 0% would indicate complete agreement and a bias of more than 1.5% strain was considered significant.

Table 1 Inter- and intraobserver reproducibility of global circumferential and global longitudinal strain at rest and during exercise testing.
Table 2 Inter- and intraobserver reproducibility of global circumferential and global longitudinal strain rate at rest and during exercise testing.
Figure 2
figure 2

Bland-Altman graphic demonstrating the reproducibility of circumferential strain. (A) Interobserver variability of circumferential strain at baseline. (B) Interobserver variability of circumferential strain during exercise testing. (C) Intraobserver variability of circumferential strain at baseline. (D) Intraobserver variability of circumferential strain during exercise testing. The blue dashed line indicates the mean bias (mean difference) and the red dashed lines represent the limits of agreement.

Figure 3
figure 3

Bland-Altman graphic: reproducibility of circumferential strain rate. (A) Interobserver variability of circumferential strain rate at baseline. (B) Interobserver variability of circumferential strain rate during exercise testing. (C) Intraobserver variability of circumferential strain rate at baseline. (D) Intraobserver variability of circumferential strain rate during exercise testing. The blue dashed line indicates the mean bias (mean difference) and the red dashed lines represent the limits of agreement.

Figure 4
figure 4

Bland-Altman graphic: reproducibility of longitudinal strain. (A) Interobserver variability of longitudinal strain at baseline. (B) Interobserver variability of longitudinal strain during exercise testing. (C) Intraobserver variability of longitudinal strain at baseline. (D) Intraobserver variability of longitudinal strain during exercise testing. The blue dashed line indicates the mean bias (mean difference) and the red dashed lines represent the limits of agreement.

Figure 5
figure 5

Bland-Altman graphic: reproducibility of longitudinal strain rate. (A) Interobserver variability of longitudinal strain rate at baseline. (B) Interobserver variability of longitudinal strain rate during exercise testing. (C) Intraobserver variability of longitudinal strain rate at baseline. (D) Intraobserver variability of longitudinal strain rate during exercise testing. The blue dashed line indicates the mean bias (mean difference) and the red dashed lines represent the limits of agreement.

Global strain agreement was excellent without any significant bias for CS or LS for both inter- and intraobserver variability at rest and throughout all stages of bicycle ergometer stress testing (Table 1). Interestingly, global strain agreement does not significantly deteriorate but remains stable (interobserver) or even increases (intraobserver) in measurements during stress testing (mean CS difference 0.38 ± 2.73 vs. 0.86 ± 2.46 and −0.60 ± 2.50 vs. 0.34 ± 2.33). The second examiner obtained modestly higher circumferential and longitudinal strain values compared to the first examiner as reflected by nonsignificant bias of 0.4 and 0.6%, respectively. Intraobserver variability was equally low with comparably strong correlations and minor bias ranging from −0.6 to 0.3%. The clear majority of all CS and LS measurement repetitions range within the 95% confidence interval (Figs 2 and 4, Table 1).

Strain rate measurements were equally well reproducible (Table 2). Overall interobserver agreement was high with only marginal bias (mean difference for global CSR −0.04 ± 0.23 and LSR −0.01 ± 0.19). During exercise testing interobserver bias was comparably small (mean difference for global CSR 0.00 ± 0.31 and LSR 0.03 ± 0.28) with slightly increased limits of agreement (−0.48; 0.41 vs. −0.61; 0.62 for CSR and −0.36; 0.36 vs. −0.52; 0.58 at rest and during stress testing, respectively). Interestingly, agreement was best for values between 0 and −1.5 at all examined viewing planes. In contrast, distortion was increased for strain rate measurements between −1.5 and −3 (Figs 3 and 5). On average, strain rate intraobserver variability was similar to interobserver reproducibility with slightly increased mean difference values (mean difference of CSR −0.08 ± 0.26 and LSR −0.07 ± 0.22) and less pronounced differences of resting and stress testing limits of agreement (CSR −0.58; 0.42 at rest vs. −0.51; 0.59 during stress testing).

The influence of echocardiographic image quality on speckle tracking reproducibility

To analyze the impact of echocardiographic image quality on the reproducibility of strain and strain rate measurements, we compared interobserver (Fig. 6) and intraobserver (Fig. 7) agreement for optimal (95–100% accurate myocardial wall visualization), suboptimal (70–95% visualization) and limited (<70% visualization) quality of echocardiographic images both in the resting state and during ergometer stress testing. Circumferential strain and strain rate measurements were compared at the levels of the papillary muscles and the mitral anulus. Longitudinal strain and strain rate were assessed at the AP4, AP3 and AP2 viewing planes. Limits of agreement and p-values are indicated both for the total number of analyzed images as well as separately for the three degrees of visualization for strain in Table 3 and for strain rate in Table 4. Importantly, echocardiographic image quality did not significantly influence the degree of reproducibility, neither for strain (Table 3) nor for strain rate measurements (Table 4). Furthermore, there was no systematic difference in limits of agreement between the several levels of image quality (i.e. LOA for 4-chamber derived LSR −0.43; 0.42 in <70% vs. −0.44; 0.41 in 95–100%, p = 0.058). While single p-values fell below the <0.05 border, none was statistically significant after Bonferroni correction for multiple testing.

Figure 6
figure 6

Scatterplots of strain and strain rate measurement inter-rater variability subject to echocardiographic image quality. (A) Circumferential strain. (B) Circumferential strain rate. (C) Longitudinal strain. (D) Longitudinal strain rate.

Figure 7
figure 7

Scatterplots of strain and strain rate measurement intra-rater day-to-day variability subject to echocardiographic image quality. (A) Circumferential strain. (B) Circumferential strain rate. (C) Longitudinal strain. (D) Longitudinal strain rate.

Table 3 Inter-observer reproducibility and the influence of echocardiographic image quality for strain imaging. P-values were calculated using the Kruskal-Wallis-Test and Mann-Whitney-U-Test.
Table 4 Inter-observer reproducibility and the influence of echocardiographic image quality for strain rate imaging.

Discussion

Speckle tracking echocardiography derived strain and strain rate measurements in children are highly reproducible at rest and during ergometer stress testing

To assess reliability of pediatric STE at baseline and during bicycle stress testing, we analyzed inter- and intraobserver variability in 127 consecutive normotensive children without structural heart disease. Reproducibility was excellent at rest and, interestingly, did not deteriorate during exercise. Since its first steps in the early 2000s, STE has been used in a variety of clinical and experimental scenarios. Today, despite its step-wise implementation in clinical practice of pediatric medicine, i.e. to monitor cardiotoxicity in chemotherapy patients9, it is still mostly used for research purposes. Recently, clinical scientists increasingly utilize STE during functional assessments such as dobutamine stress echocardiography19 or ergometer stress testing in adult20 and pediatric patient populations21. However, the reproducibility of these combinatory assessments is still largely unknown, and data is scarce or incomplete in children22. Nonetheless, studies in which pediatric STE was analyzed, are mostly in agreement with our results. Firstly, three-dimensional STE derived LV strain was shown to be reproducible in children23. Secondly, Wisotzkey et al. found global longitudinal strain at rest well reproducible in pediatric heart transplant recipients24. Strain rate and reproducibility during stress testing, however, were not assessed in these studies. Thirdly, there is conflicting data on the required level of expertise for the reliable performance of STE dobutamine stress testing. In 41 adult patients, STE was shown to be highly reproducible at all stages of dobutamine stress testing for both expert and novice strain readers25. In contrast, STE measurements were highly specific in combination with stress echocardiography in 37 adult patients, but not without a significant degree of dependence on the examiner’s expertise, as reported in another study26. One study only assessed longitudinal but not circumferential strain. Moreover, strain rate or image quality were not assessed in both studies and sample size was considerably smaller when compared to the present investigation. Nevertheless, the importance of circumferential strain is not to be underestimated. Recently, Broch and colleagues reported that in patients with compensated aortic regurgitation a relatively high circumferential strain compensates for the reduced longitudinal strain in a manner that is consistent with the preserved EF of these patients27.

Importantly, while overall inter- and intraobserver reproducibility was excellent for strain and strain rate imaging, strain rate measurements tended to feature an increased amount of distortion in higher (more negative, to be specific) values between −1.5 and −3 (Figs 3 and 5). This is in line with findings from a similar study in adult patients comparing tissue-Doppler and STE derived strain and strain rate assessments before and during exercise echocardiography28. Similarly, Uusitalo and colleagues recently reported increased standard deviations of STE derived global LV strain rate at peak dobutamine stress testing29. While only a few studies analyzed STE reproducibility during exercise (mostly in adults), data on strain rate in pediatric stress testing is scarce30.

Echocardiographic image quality does not influence strain and strain rate reproducibility

In a recent well-conducted critical review of current approaches for echocardiographic reproducibility and reliability, the authors found that the statistical metrics used to assess data quality in echocardiographic clinical research vary substantially or were often not reported at all31. This is important for two distinct reasons. Firstly, when learning from a published study, the reader should be capable to have an idea of the input raw data quality that is the basis for the subsequently drawn conclusions to facilitate a critical approach to scientific literature. Secondly, when using a diagnostic modality, the clinician must comprehend how to interpret measurements made under suboptimal technical conditions, i.e. echocardiographic noise in STE – a frequent challenge not only in pediatric stress echocardiography. Examples of the latter are abundant in clinical echocardiography. Tissue-Doppler imaging measurements are underestimated with increasing angle-dependency32. For the assessment of LV ejection fraction, Simpson biplane method is superior to Teichholz in good image quality and vice versa. On the other hand, bare “eyeballing” is less affected by image quality33. Image position errors result in false LV volume quantification which can be overcome with the utilization of increasing numbers of imaging planes, i.e. three-dimensional echocardiography34. However, little is known about the STE derived strain and strain rate assessment in suboptimal imaging conditions, neither in adult nor in pediatric patient populations.

To close this gap, we assessed the impact of echocardiographic image quality on inter- and intraobserver STE reproducibility. Remarkably, we found that strain and strain rate measurements did not depend on echocardiographic image quality. This is surprising, as for instance three-dimensional STE derived LV volume measurements have been shown to depend on image quality35. On the one hand, limited echocardiography with poor echocardiographic image quality in patients who underwent cardiac surgery has been shown to be of use for the determination of crude measurements such as hemodynamic state36. For the three-dimensional echocardiographic LV volume assessment of unselected patients, as little as >60% endocardial border visualization was found to result in reliable measurements37. STE on the other hand requires visualization of speckles of each wall aspect to be analyzed. This naturally implies the necessity of optimal image quality. Some authors suggest restricting speckle tracking imaging to subjects with adequate imaging conditions to guarantee favorable accuracy and reproducibility38. Yet, our data suggest, that this is not strictly necessary.

Limitations

The surprising finding that STE reproducibility was not significantly affected by echocardiographic image quality in this study must not be over-interpreted. While the data supporting this result are robust, the study was not designed to evaluate for underlying causes of (un)altered STE accuracy in various image quality scenarios. Possibly, the underlying algorithms of STE post-processing software systems may extrapolate data to an extent, that may lead to false positive assumptions for strain and strain rate measurements of wall aspects that evade thorough tissue tracing due to inadequate image quality. For example, if only 4 out of 6 wall segments are well visualized in a given echocardiographic plane, deformation of the remaining 2 segments with substandard myocardial wall visualization may be corrupted. Clearly, if a segment is not visualized at all or moving out of the echocardiographic viewing plane during a cardiac cycle, it must be excluded from STI analysis. It is the grey area in between, that is challenging (i.e. echocardiographic images with mostly visible myocardium but minor endocardial blurring, etc.). Currently, there is no data to support a guideline on how to proceed in these cases. Therefore, we would recommend excluding substandard images from STI assessments until validation data of STI measurements in impaired echocardiographic image quality is available. Moreover, future studies should investigate difficult imaging scenarios (i.e. children, obese patients, etc.) and correlate STI strain and strain rate to cardiac MRI, taking into consideration echocardiographic intervendor consistency and different underlying post-processing algorithms. Experimental studies may utilize a validation with sonomicrometry39, the current gold standard for myocardial deformation imaging to validate STI accuracy in suboptimal echocardiographic image quality.

Conclusion

STE derived LV strain and strain rate measurements in healthy children feature excellent inter- and intraobserver reproducibility both at baseline and during ergometer stress testing. Importantly, substandard echocardiographic image quality does not influence speckle tracking reproducibility or result in obviously implausible strain (rate) values. Consequently, caution must be paid to critically review STE derived deformation values in suboptimal visualization conditions.