Clinical service evaluation of the feasibility and reproducibility of novel artificial intelligence based-echocardiographic quantification of global longitudinal strain and left ventricular ejection fraction in trastuzumab-treated patients

Jiang, J.; Liu, B.; Li, Y. W.; Hothi, S. S.

doi:10.3389/fcvm.2023.1250311

ORIGINAL RESEARCH article

Front. Cardiovasc. Med., 13 November 2023

Sec. Cardiovascular Imaging

Volume 10 - 2023 | https://doi.org/10.3389/fcvm.2023.1250311

This article is part of the Research Topic: The Combination of Data-Driven Machine Learning Approaches and Prior Knowledge for Robust Medical Image Processing and Analysis

J. Jiang¹

B. Liu²,³

Y. W. Li⁴

S. S. Hothi¹,³,⁵*

¹Heart and Lung Centre, New Cross Hospital, Royal Wolverhampton NHS Trust, Wolverhampton, United Kingdom
²Department of Cardiology, Manchester University NHS Foundation Trust, Manchester, United Kingdom
³Institute of Cardiovascular Sciences, University of Birmingham, Birmingham, United Kingdom
⁴Department of Anaesthesia, New Cross Hospital, Royal Wolverhampton NHS Trust, Wolverhampton, United Kingdom
⁵Research Centre for Health and Life Sciences, Coventry University, Coventry, United Kingdom

Introduction: Cardiotoxicity is a potentially prognostically important complication of certain chemotherapeutic agents that may result in preclinical or overt clinical heart failure. In some cases, chemotherapy must be withheld when left ventricular (LV) systolic function becomes significantly impaired, to protect cardiac function at the expense of a change in the oncological treatment plan, leading to associated changes in oncological prognosis. Accordingly, patients receiving potentially cardiotoxic chemotherapy undergo routine surveillance before, during and following completion of therapy, usually with transthoracic echocardiography (TTE). Recent advancements in AI-based cardiac imaging reveal areas of promise but key challenges remain. There are ongoing questions as to whether the ability of AI to detect subtle changes in individual patients is at a level equivalent to manual analysis. This raises the question as to whether AI-based left ventricular strain analysis could provide a potential solution to left ventricular systolic function analysis in a manner equivalent to or superior to conventional assessment, in a real-world clinical service. AI-based automated analyses may represent a potential solution for addressing the pressure of increasing echocardiographic demands within limited service-capacity healthcare systems, in addition to facilitating more accurate diagnoses.

Methods: This clinical service evaluation aims to establish whether AI-automated analysis compared to conventional methods (1) is a feasible method for assessing LV-GLS and LVEF, (2) yields moderate to good correlation between the two approaches, and (3) would lead to different clinical recommendations with serial surveillance in a real-world clinical population.

Results and Discussion: We observed a moderate correlation (r = 0.541) in GLS between AI-automated assessment and conventional methods. The LVEF quantification between methods demonstrated a strong correlation (r = 0.895). AI-generated GLS and LVEF values compared reasonably well with conventional methods, demonstrating a similar temporal pattern throughout echocardiographic surveillance. The apical three-chamber view demonstrated the lowest correlation (r = 0.423) and proved to be the least successful for acquisition of GLS and LVEF. Compared to conventional methodology, AI-automated analysis has a significantly lower feasibility rate, demonstrating a success rate of 14% (GLS) and 51% (LVEF).

Introduction

Cardiotoxicity is a significant, potential complication of certain chemotherapeutic agents that can lead to either preclinical or overt heart failure. In some cases, chemotherapy must be withheld when cardiac function, primarily left ventricular (LV) systolic function, becomes significantly impaired to protect cardiac function at the expense of a change in the oncological treatment plan and associated changes in prognosis (1). Accordingly, patients receiving potentially cardiotoxic chemotherapy are recommended to undergo routine surveillance before, during and following completion of therapy, usually with transthoracic echocardiography (TTE). Transthoracic echocardiography is a well-established and widely available imaging modality with an important role in determining cardiac structure and function. To date, it remains the preferred technique for assessing the development, progression and regression of cardiotoxicity among oncology patients undergoing cardiac surveillance (2).

Echocardiographic indices such as left ventricular ejection fraction (LVEF) by Simpson's biplane method have traditionally been used to assess changes in LV systolic function. However, in the modern era of speckle tracking echocardiography (STE), strain quantification has rapidly evolved into a valuable tool for the early detection of cardiotoxicity during oncological therapy and has since been incorporated into international guidance (3, 4).

Until now, global longitudinal strain (GLS) has been the most studied strain parameter with the largest body of literature supporting its diagnostic and prognostic value (5, 6). One early study evaluated eighty-one females with newly diagnosed HER2+ breast cancer for early alterations of myocardial strain during treatment with anthracycline and/or trastuzumab. Patients received three-monthly surveillance throughout the course of a fifteen-month study period. A reduction in LVEF was observed in the overall cohort (64 ± 5% to 59 ± 6%; p < 0.0001); twenty-six patients [32%, (22%–43%)] developed cardiotoxicity, and of these patients, 5 [6%, (2%–14%)] developed symptoms of heart failure (HF). Significant LVEF reduction (≥8%) was detected in 15% of patients that developed subsequent cardiotoxicity, whereas upon the application of strain analysis, the incidence rate increased to 78%. Among the patients that later developed HF, all had a reported GLS of less than −19% (7).

While strain quantification with speckle tracking echocardiography represents a sensitive method for assessing LV function, this postprocessing analysis remains laborious and time-consuming, and is subject to significant inter- and intra-observer variability related to the reproducibility of contouring cardiac structures manually or even semi-automatically. In recent years, the emergence of artificial intelligence (AI) in echocardiography has generated much interest among the cardiac imaging community. The technology is rapidly evolving but is yet to be widely adopted into clinical practice. Recent evidence has revealed promising findings, demonstrating that the application of AI enables data analysis free from human operator bias, accelerated workflow and quantification, along with a high feasibility rate in the absence of operator input. One multicentre study which assessed LVEF and longitudinal strain using visual, manual and fully AI-automated methods (TomTec-Arena 1.2, TomTec Imaging Systems) reported a high feasibility (98%) of AI-automated assessment (8). Good correlation and levels of agreement were observed between manual and automated assessment (ICC: 0.83; bias: 0.7%; 95% CI: 0.1%–1.3%). As expected, bias and levels of agreement were wider when visual assessments were compared. A key advantage of automated LVEF and LV-GLS compared to manual and visual assessment was the absence of inter-measurement variability on repeated assessments, with the AI method able to identify the same patterns each time. Finally, beat-to-beat variability was 0.96 ± 3.52% for automated LVEF, 2.7 ± 8.16% for manual LVEF, 0.19 ± 1.31% for automated GLS, and 1.09 ± 3.29% for manual GLS (8).

In support of these findings is another recent trial by Salte et al., which reported good correlation (R = 0.93, p < 0.001) and low bias of −1.4 ± 0.3% (p < 0.01) with estimated limits of agreement (LOA) of ±3.7% when comparing AI-automated vs. conventional methodology (EchoPAC v.202, GE), suggesting that the application of AI is potentially comparable to human expert performance using conventional methodology (9).

While AI-based cardiac imaging analysis appears promising, there are areas that require further assessment. AI-automated analysis must be able to perform at least as well as established methodologies to detect subtle changes in left ventricular function, whether LVEF or GLS. Hence, further research is needed to fully establish the vulnerability of automated image processing networks. Furthermore, this automated approach relies upon a large training dataset to implicitly learn features of the heart relevant to segmentation, which is resource intensive, demands close clinical supervision and raises potential ethical and privacy concerns.

If AI-automated analysis of LV function can be demonstrated to be equivalent to or superior to conventional methods within a real-world clinical service, then it may represent a potential solution for the challenges of limited clinical service capacity by reducing the pressures of increasing echocardiographic demands, in addition to facilitating more accurate diagnoses. This clinical service evaluation aims to establish whether AI-automated analysis: (1) is a feasible method for assessing LV-GLS and LVEF, (2) correlates well with conventional methods, and (3) would lead to different clinical recommendations during serial surveillance in a real-world clinical population.

Materials and methods

Patient population

This single-centre audit and service evaluation retrospectively reviewed all HER2+ breast cancer patients that underwent TTE surveillance and trastuzumab therapy between January 2019 and October 2022 at the Royal Wolverhampton NHS Trust (UK) and assessed the evaluation of cardiac function against international cardio-oncology guidance (Audit/Service evaluation number 5918, Royal Wolverhampton NHS Trust, UK). Informed consent was not required due to the retrospective nature of the clinical audit and evaluation. Patients undergoing combination therapy including anthracycline were excluded from the study. Patients with atrial fibrillation or other forms of arrhythmia during the echocardiographic studies were also excluded. To reflect a real-world patient population and feasibility, patients with partially suboptimal endocardial border definition were not excluded. Clinical characteristics of our cohort were collected from the image reporting system and hospital records and are summarised in Table 1.

Table 1. Demographic and clinical characteristics of the patient population.

Echocardiographic imaging protocol and analysis

A total of 648 TTE studies acquired from 142 oncology patients that received trastuzumab echocardiographic surveillance between 2019 and 2022 were retrospectively evaluated. All echocardiographic studies within our British Society of Echocardiography (BSE) accredited imaging laboratory were comprehensive studies which complied with BSE cardio-oncology guidelines. Echo imaging was performed by BSE accredited echocardiographers using commercial equipment (Affiniti, EPIQ and iE33, Philips Medical Systems, Andover, Massachusetts, USA).

Assessment of GLS and LVEF

AI-automated and conventionally measured GLS and LVEF were assessed from standard apical four- (A4C), three- (A3C), and two-chamber (A2C) cine loops in accordance with BSE guidance.

AI-automated assessments (GLS and LVEF) were performed on individual echocardiographic studies using an AI-based platform (Ultromics EchoGo Core, Oxford, UK). The investigators submitted individual clinical studies required for analysis from the local hospital archiving system to the AI pipeline (Ultromics SaaS). Individual views are identified and classified with the existing convolutional neural network (CNN) model and subsequently processed by a U-Net based architecture for view-specific LV contouring, myocardial segmentation, and myocardial motion tracking to compute GLS and LVEF in the absence of manual adjustments (10).

Conventional GLS assessment was performed in a semi-automated fashion from the apical four-, three- and two-chamber LV-focused cine images in dedicated conventional software (QLab, version 15.5, Philips Medical Systems). Upon detection of the endocardial border, the software automatically established a region of interest (ROI) and calculated the strain values of the selected view. The BSE-accredited or similarly experienced operator manually adjusted the ROI to optimise tracking if deemed necessary and strain values were recalculated to reflect this adjustment. Where image quality was insufficient to permit strain assessment of all three views, a global strain value could not be calculated. Conventional LVEF was manually performed using the Simpson's biplane method of discs (Modified Simpson's rule) for LV volumes and LVEF calculation. End-diastole was defined as the frame following mitral valve closure or the frame in which the cardiac dimension is largest, in preference to the onset of the QRS. End-systole was defined as the frame preceding mitral valve opening or the frame in which the cardiac dimension is smallest. This protocol was performed using the LV-focused A4C and A2C views.
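The biplane method of discs named above can be illustrated with a short sketch. It assumes the textbook formulation, in which the ventricle is divided into equal-height elliptical discs whose two diameters are taken from the A4C and A2C views; the function names and the example values are hypothetical and are not taken from QLab or from the study data.

```python
import math


def biplane_volume_ml(a4c_diameters_cm, a2c_diameters_cm, long_axis_cm):
    """LV volume (mL) by the biplane method of discs.

    Each disc is treated as an elliptical cylinder with one diameter from
    the A4C view and one from the A2C view; all discs share the same height.
    """
    if len(a4c_diameters_cm) != len(a2c_diameters_cm):
        raise ValueError("both views need the same number of discs")
    n_discs = len(a4c_diameters_cm)
    if n_discs == 0:
        raise ValueError("at least one disc is required")
    disc_height_cm = long_axis_cm / n_discs
    return sum(
        math.pi / 4.0 * a * b * disc_height_cm
        for a, b in zip(a4c_diameters_cm, a2c_diameters_cm)
    )


def ejection_fraction_percent(edv_ml, esv_ml):
    """LVEF (%) from end-diastolic and end-systolic volumes."""
    if edv_ml <= 0:
        raise ValueError("end-diastolic volume must be positive")
    return 100.0 * (edv_ml - esv_ml) / edv_ml


if __name__ == "__main__":
    # Hypothetical volumes (mL), for illustration only.
    print(ejection_fraction_percent(edv_ml=120.0, esv_ml=48.0))
```

In practice the disc diameters come from the traced endocardial border at end-diastole and end-systole, so any tracing error propagates directly into both volumes and into the resulting LVEF.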

Statistical analysis

Continuous variables were expressed as mean ± standard deviation and categorical variables were presented as n (%). Linear regression analysis was performed to evaluate the relationship between GLS and LVEF when assessed by either conventional or AI-automated methods. Bland-Altman analysis was used to assess the levels of agreement and quantify systematic differences between assessments. Comparison of mean values between the automated and conventional groups was performed using the paired sample Student's t-test. Analysis of variance (ANOVA) was used to compare the means of three or more groups. For all statistical tests performed, a p-value less than 0.05 was regarded as statistically significant. Statistical analyses were performed using IBM SPSS Statistics version 29 (New York, USA).
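For readers less familiar with these agreement statistics, the following is a minimal sketch of how the correlation coefficient, the Bland-Altman bias with limits of agreement, and the paired t statistic can be computed from paired measurements. It is not the SPSS procedure used in the analysis; the sign convention (conventional minus AI) and the 1.96 multiplier for the limits are assumptions, and the example values are hypothetical rather than study data.

```python
import math
from statistics import mean, stdev


def pearson_r(x, y):
    """Pearson correlation coefficient between two paired samples."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def bland_altman(conventional, automated):
    """Return (bias, lower limit, upper limit) of agreement.

    Differences are conventional minus automated (an assumed convention).
    Limits are bias +/- 1.96 sample standard deviations of the differences.
    """
    diffs = [c - a for c, a in zip(conventional, automated)]
    bias = mean(diffs)
    sd = stdev(diffs)
    return bias, bias - 1.96 * sd, bias + 1.96 * sd


def paired_t_statistic(conventional, automated):
    """t statistic of the paired-sample t-test (n - 1 degrees of freedom)."""
    diffs = [c - a for c, a in zip(conventional, automated)]
    return mean(diffs) / (stdev(diffs) / math.sqrt(len(diffs)))


if __name__ == "__main__":
    # Hypothetical paired GLS values (%), not study data.
    conventional_gls = [-19.5, -18.2, -20.1, -17.4, -21.0]
    ai_gls = [-18.1, -17.9, -18.8, -16.0, -19.2]
    print(pearson_r(conventional_gls, ai_gls))
    print(bland_altman(conventional_gls, ai_gls))
    print(paired_t_statistic(conventional_gls, ai_gls))
```

A high correlation coefficient alone does not establish agreement: two methods can correlate strongly while differing by a constant offset, which is why the bias and limits of agreement are reported alongside r.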

Results

Subject characteristics

The patient cohort included 142 patients who had undergone a total of 648 echocardiographic studies as part of their oncological therapy cardiac surveillance. The population comprised 140 females (99%), with mean age 59 ± 13 years (range 28–89 years). Oncological diagnoses predominantly comprised breast cancer (84%), but also included gastric (13%) and oesophageal cancer (3%). Patient demographic and clinical characteristics are summarised in Table 1.

Technical feasibility of AI-based compared to conventional assessment in GLS and LVEF

AI-generated GLS and LVEF values were acquired in 14% and 51% of all studies, respectively. Representative examples of normal and abnormal GLS studies analysed by AI-generated and conventional assessment are shown in Figures 1 and 2, respectively. The rate of success in obtaining strain results using AI vs. conventional methods for the three standard apical views was: A4C, 56% vs. 74%; A3C, 14% vs. 38%; A2C, 46% vs. 53%, respectively (Figure 3).

Figure 1. Normal GLS data yielded by (A) AI-based and (B) conventional semi-automated strain analysis.

Figure 2. Abnormal GLS data yielded by (A) AI-based and (B) conventional semi-automated strain analysis.

Figure 3. Feasibility of AI-based versus conventional semi-automated strain analysis and LVEF in the standard apical views.

Technical failure to derive strain from the A3C was therefore the main reason for the low rate of success in obtaining AI-generated GLS (ANOVA p = 0.028). Whilst the success rate of deriving longitudinal strain from the A3C via the conventional method was also low, it remained higher than that of AI. Factors contributing to suboptimal image quality, particularly affecting the A3C, included challenging body composition, tachyarrhythmias, ectopy, limited rib space and previous mastectomy.

GLS and LVEF using AI vs. conventional assessment

Mean GLS in the whole cohort was −17.9 ± 2.2% (AI) vs. −19.1 ± 2.0% (conventional). Mean LVEF in the whole cohort was 61.6 ± 5.7% (AI) vs. 60.7 ± 4.9% (conventional). Linear regression and Bland-Altman analysis for GLS revealed moderate correlation (r = 0.541, p < 0.001) and disagreement (mean bias −1.2%, 95% CI: −5.2% to 2.8%; Figures 4A,B). In contrast, LVEF showed strong correlation (r = 0.895, p < 0.001) with small biases (Figures 5A,B).

Figure 4. (A) Correlation between conventional and AI-automated global longitudinal strain. (B) Bland-Altman plot of conventional and AI-automated global longitudinal strain.

Figure 5. (A) Correlation between conventional and AI-automated left ventricular ejection fraction. (B) Bland-Altman plot of conventional and AI-automated left ventricular ejection fraction.

Comparison between strain at individual apical views using AI vs. conventional assessment

Mean longitudinal strain values from specific apical views for the AI method and the conventional method, respectively, were −18.7 ± 2.9% and −19.0 ± 2.6% (A4C) (Figures 6A,B), −18.1 ± 2.8% and −18.6 ± 2.6% (A2C) (Figures 7A,B), −15.7 ± 2.6% and −16.6 ± 1.6% (A3C) (Figures 8A,B), and −18.2 ± 2.7% and −18.6 ± 2.6% (A4C/A2C). Strong correlation and agreement between AI-automated and conventional methods were demonstrated for A4C strain (r = 0.883, p < 0.001, 95% CI: −3.0% to 2.4%) and for A4C/A2C strain (measurable values achieved from both A4C and A2C views within a given study; r = 0.853, p < 0.001, 95% CI: −3.2% to 2.4%) (Figures 9A,B). In comparison, the A2C strain revealed a moderate correlation (r = 0.771, p < 0.001). The weakest correlation (r = 0.423, p = 0.008) and widest limits of agreement among the individual apical views were observed in the A3C view.

Figure 6. (A) Correlation between conventional and AI-automated strain in apical-four chamber view. (B) Bland-Altman plot of conventional and AI-automated strain in apical-four chamber view.

Figure 7. (A) Correlation between conventional and AI-automated strain in apical-two chamber view. (B) Bland-Altman plot of conventional and AI-automated strain in apical-two chamber view.

Figure 8. (A) Correlation between conventional and AI-automated strain in apical-three chamber view. (B) Bland-Altman plot of conventional and AI-automated strain in apical-three chamber view.

Figure 9. (A) Correlation between conventional and AI-automated strain in apical-four/-two chamber view. (B) Bland-Altman plot of conventional and AI-automated strain in apical-four/-two chamber view.

Temporal changes in GLS and LVEF between AI vs. conventional assessments during surveillance

Serial changes in strain and LVEF during TTE surveillance are summarised in Table 2. Statistical differences between the conventional and AI-automated methods at each timepoint, assessed using the independent sample t-test, are shown in Table 3. Conventional and AI-automated values followed a similar temporal pattern in patients receiving trastuzumab therapy for both GLS and LVEF, in both the cardiotoxic cohort and the total study population (Figures 10, 11). At 3 months (T1), both the conventional and automated methods demonstrated a reduction in GLS and LVEF compared to baseline measurements (T0). By 6 months (T2), a further reduction in LV function was observed to a similar degree by both methods. GLS and LVEF were lowest at 9 months (T3) from the initiation of trastuzumab therapy. The AI-automated GLS values were consistently more negative (numerically lower) at each timepoint compared to the conventional method (Table 3 and Figure 11). The LVEF values at timepoints 3 to 5 were almost identical by both methods, although a higher degree of variation was observed with the AI-automated method (T3: 58.9 ± 8.7%, p = 0.422; T4: 58.9 ± 7.4%, p = 0.638; T5: 62 ± 6.0%, p = 0.038). At 12 months (T4) and 15 months (T5), AI-automated values demonstrated improvements in GLS and LVEF. Similar trends were observed with the conventional method, although the degree of improvement in LVEF at 15 months was smaller. There were no significant differences observed between the AI-automated and conventional methods for GLS. For LVEF, the conventional method yielded a significantly lower value at 15 months (59.5 ± 5.7% vs. 62 ± 6.0%, p = 0.038). Based on the GLS and LVEF criteria (11), six patients developed cardiotoxicity; this number was considered too small to allow statistical sub-analysis. Nevertheless, these limited cases highlight the ability of AI-automated analysis to detect left ventricular changes among the cardiotoxic cohort.

Table 2. Mean values and standard deviation of conventional GLS and AI-automated GLS at individual timepoints during trastuzumab therapy.

Table 3. AI-Automated and conventional global longitudinal strain and left ventricular ejection fraction at each timepoint.

Figure 10. Mean values and standard deviation of conventional GLS and AI-automated GLS at individual timepoints during trastuzumab therapy in the study population.

Figure 11. Mean values and standard deviation of conventional LVEF and AI-automated LVEF at individual timepoints during trastuzumab therapy in the study population.

Discussion

In this real-world service evaluation and audit of the assessment of left ventricular ejection fraction and strain in a cohort of patients receiving trastuzumab chemotherapy, we assessed whether an AI-automated solution to LV systolic function is a feasible and reliable methodology compared to conventional analysis. The main findings are, firstly, that GLS and LVEF quantification obtained from AI-automated assessment showed moderate to strong correlation with conventional methods. Secondly, AI-generated GLS and LVEF values compared reasonably well with conventional methods, demonstrating a similar temporal pattern throughout the echocardiographic surveillance. Thirdly, the apical three-chamber view demonstrated the lowest correlation and proved to be the least successful for acquisition of GLS and LVEF. Finally, compared to conventional methodology, AI-automated analysis has a significantly lower feasibility rate, demonstrating a success rate of 14% (GLS) and 51% (LVEF).

Clinical demand and relevance

While the introduction of speckle tracking has provided exciting opportunities in the field of cardiac imaging, its clinical application is rendered meritless if performed by inexperienced or suboptimally trained practitioners. Like any echocardiographic technique, there is a steep learning curve with performing and interpreting echocardiograms (12). Interpretation of echocardiographic studies is demanding and this can limit workflow, particularly among smaller centres with fewer trained echocardiographers. The application of AI echocardiography may potentially address these challenges by utilising an AI-based analysis of LV strain.

There are emerging data suggesting that a fully automated AI assessment could potentially reduce post-processing time with high reproducibility and reduced risk imposed by human-software interaction. However, in the presence of significant knowledge gaps the technology may fall short of this potential. Presently, semi-automated assessments are in clinical use and accepted as a standard, feasible method for LV strain assessment, supported by evidence from numerous studies (13–16). However, the human-software interaction is such that the current semi-automated approach yields values that are highly influenced by the level of experience and training of the sonographer.

Furthermore, research to date has rarely explored the application of AI-automated assessment in cancer therapy-related cardiac dysfunction but instead has largely focused on ischaemia-related cardiac abnormalities. Given that cancer therapy-induced heart failure carries a worse prognosis compared to heart failure related to other causes (17), the need for accurate and frequent echocardiographic surveillance is clear and of paramount importance. It follows that there is a clinical need for research into AI-automated detection of subclinical changes in cardiac function to accurately, reliably and rapidly detect changes earlier in the disease process. To the best of our knowledge, this is the first real-world evaluation of such an approach to validate and explore the clinical feasibility of AI-automated LV assessment in this patient cohort throughout the surveillance period.

The feasibility and accuracy of automated GLS and LVEF

The present findings reveal that the current version of AI-automated GLS possesses some limitations in feasibility, achieving successful acquisition of GLS in only 14% of all studies. The higher rate of success demonstrated from conventional methods (38%) suggests either that the AI-automated approach is inferior to the semi-automated approach or that the semi-automated approach is overly generous in the studies to which it is applied. The unifying consideration here is that of a threshold for acceptability for an echo study to be amenable to either of the assessment methods. We speculate that the two approaches accept image qualities of different levels. Standardising this threshold is not necessarily a straightforward proposition as, even with a group of selected studies, the AI-automated system uses different approaches to strain assessment than the semi-automated system.

In either analysis approach, the acquisition of GLS requires the strain values of three individual apical views. The present study found that the A3C view was the most frequently limiting view, followed by the A2C (46%), in preventing a GLS assessment. These findings are in keeping with a study by Kawakami et al. which examined the automated tracking quality in each individual LV segment (14). The study found that the LV segments in these views are often associated with considerably poorer automated tracking compared to segments in the A4C view.
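The dependence of GLS feasibility on the least successful view can be made concrete with a small sketch. It assumes, as a simplification, that GLS is the mean of the three view-level longitudinal strain values and is reported only when all three are available; commercial packages may instead average segmental values, so the function names and logic here are illustrative and do not describe the QLab or EchoGo implementations.

```python
from typing import Optional, Sequence


def global_longitudinal_strain(
    a4c: Optional[float], a3c: Optional[float], a2c: Optional[float]
) -> Optional[float]:
    """Mean of the three apical-view strains, or None if any view failed."""
    views = (a4c, a3c, a2c)
    if any(v is None for v in views):
        return None
    return sum(views) / len(views)


def feasibility_rate(values: Sequence[Optional[float]]) -> float:
    """Percentage of studies in which a value could be obtained."""
    if not values:
        raise ValueError("at least one study is required")
    return 100.0 * sum(v is not None for v in values) / len(values)


if __name__ == "__main__":
    # Hypothetical per-study strain values (%); None marks a failed view.
    studies = [
        (-18.5, -16.0, -18.0),
        (-19.0, None, -18.4),
        (-17.8, None, None),
        (None, None, -19.1),
    ]
    gls_values = [global_longitudinal_strain(*s) for s in studies]
    print(feasibility_rate(gls_values))
    print(feasibility_rate([s[1] for s in studies]))  # A3C alone
```

Under this all-or-nothing rule the GLS success rate can never exceed that of the weakest single view, which is consistent with the A3C acting as the limiting factor in both methods.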

In contrast to previous studies that excluded echo studies where image quality was deemed substandard (14), the present analysis did not exclude these patients and is therefore relevant to real-world clinical practice. All oncology patients that were administered trastuzumab and underwent echo surveillance were included to minimise selection bias and reflect real-world patient cohorts, including known imaging challenges often specific to cardio-oncology patients such as radiotherapy, breast reconstruction surgeries, mastectomy and breast implantation (18). This might explain the lower rate of successful acquisition compared to previous trials, as the availability of diagnostic quality images is reduced. Conversely, the possibility of over-analysis in potentially non-feasible images should not be excluded. It is not uncommon for an operator to repeatedly adjust the region of interest in the presence of limited or absent endocardial border definition and thereby “inaccurately” create a GLS value that is consistent with visual assessment, and this ought to be considered.

Previous validation studies (8, 9, 14, 19) comparing AI-automated and conventional methods have reported good feasibility and correlation values, often in patient groups with ischaemia-related heart diseases and other pathologies unrelated to chemotherapy. In the setting of cardio-oncology, our results are in line with previously reported evidence which demonstrated a reasonable correlation between AI-derived GLS and LVEF values and those of the conventional method, suggesting that there were no considerable differences between methods of assessment.

Although our reported values were lower compared to the literature, this may be influenced by the preselection of subjects with segments suited for assessment in previous studies. Our findings also demonstrated that serial monitoring of trastuzumab-treated oncology patients with AI-assisted technology to detect subtle changes in LVEF and GLS may be done with similar certainty to conventional assessment with the values generated from both methods being largely similar.

A significant difference in LVEF was observed at one timepoint although this may be attributed to smaller sample size at the final follow-up. Further work will be required to assess longitudinal echocardiographic trends in addition to correlation between AI-automated and conventional analyses, and there may be systematic differences in absolute values whether related to the vendor or system used.

In the cardiotoxic cohort, while the sample size was small, both methods demonstrated a similar temporal trend, highlighting the potential for AI-automated methods to reliably detect LV functional deterioration. Such findings suggest that AI-automated LV assessments represent a valuable method of serial echocardiographic monitoring in longitudinal patient care and can build a case for future prospective studies in this area.

Study limitations

There are a few potential limitations associated with the present analysis that deserve to be mentioned. First, we only studied patients in sinus rhythm, so the findings cannot be extrapolated to patients with irregular heart rhythms. Additionally, our study included a relatively small sample size. Despite this, our patient cohort included all patients during the study period to reflect a real-world clinical setting and is the first to study functional changes in this specific patient cohort, thereby providing valuable insight into the application of AI-automated analysis in serial echocardiographic studies in trastuzumab-treated patients. Our report and early insights thereby provide a basis for future studies to expand upon. Second, the potential vendor differences in AI-imaging software for strain and LVEF analysis due to differences in AI algorithms should be noted. Third, there was no gold standard reference against which to compare our strain and EF measurements. However, the primary objective was to determine the level of correlation between AI-automated and conventional methods, thus identifying the “true” reference value is of lesser significance. We therefore used the current clinically accepted semi-automated approach as the comparator. Finally, the analysis was conducted retrospectively, which meant that it suffered from the inherent limitations of a retrospective study design. Nevertheless, this report describes a straightforward comparison of imaging as opposed to patient outcomes, thus selection bias is of lesser relevance.

Future research directions

With increasing echocardiographic demands surpassing clinical capacity in the face of a shortage of echocardiographers, there is now an urgent need for the active incorporation of AI-guided technology to assist, or potentially substitute for, operator input into analysis of advanced echocardiographic techniques. Consequently, software solutions must be accurate enough to be applied confidently irrespective of the GLS experience of the operator. There are a number of challenges in the widespread clinical implementation of AI echocardiography, none of which are considered insurmountable.

The future appears positive for the application of AI in echocardiography and significant advances are anticipated to address the current knowledge gaps. Future work should explore: (1) whether AI-based assessment is superior to less experienced humans, (2) the appropriateness of image rejection thresholds, (3) the accuracy and reproducibility of automated, semi-automated and manually generated data, and (4) improvements in post-processing time and overall workflow in echocardiography services.

Conclusions

Despite enthusiasm for the application of AI technology in healthcare, it is yet to be widely embraced in the echocardiographic community. Owing to significant limitations and knowledge gaps in automation, AI technology in echocardiography remains premature for clinical use if adopted completely independently of operator intervention. Instead, at present, it could serve as a useful unbiased “second opinion” for “experienced” practitioners. Our analysis supports prospective studies into the utility and application of AI-based analysis of heart function by echocardiography in patients receiving potentially cardiotoxic chemotherapy.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary Material; further inquiries can be directed to the corresponding author.

Ethics statement

Ethical approval was not required for the study involving humans in accordance with the local legislation and institutional requirements. Written informed consent to participate in this study was not required from the participants or the participants’ legal guardians/next of kin in accordance with the national legislation and the institutional requirements.

Author contributions

SH and YL contributed to conception and design of the study. JJ performed the statistical analysis. JJ wrote the first draft of the manuscript. SH, BL and JJ critically revised and wrote sections of the manuscript. All authors contributed to manuscript revision, and read and approved the submitted version.

Conflict of interest

The Royal Wolverhampton NHS Trust received an NHSx phase 4 award for the use of Ultromics artificial intelligence stress echo analysis software within the trust. This comprised costs towards IT set-up and clinical implementation, and cost-free provision of the novel Ultromics Echo Core Pro software for a one-year period. SSH has research agreements with Ligence Heart and Ventripoint Medical System.

The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1. Sawaya H, Sebag IA, Plana JC, Januzzi JL, Ky B, Cohen V, et al. Early detection and prediction of cardiotoxicity in chemotherapy-treated patients. Am J Cardiol. (2011) 107(9):1375–80. doi: 10.1016/j.amjcard.2011.01.006

2. Thavendiranathan P, Negishi T, Coté MA, Penicka M, Massey R, Cho GY, et al. Single versus standard multiview assessment of global longitudinal strain for the diagnosis of cardiotoxicity during cancer therapy. JACC Cardiovasc Imaging. (2018) 11(8):1109–18. doi: 10.1016/j.jcmg.2018.03.003

3. Heimdal A, Støylen A, Torp H, Skjærpe T. Real-time strain rate imaging of the left ventricle by ultrasound. J Am Soc Echocardiogr. (1998) 11(11):1013–9. doi: 10.1016/S0894-7317(98)70151-8

4. Leitman M, Lysyansky P, Sidenko S, Shir V, Peleg E, Binenbaum M, et al. Two-dimensional strain–a novel software for real-time quantitative echocardiographic assessment of myocardial function. J Am Soc Echocardiogr. (2004) 17(10):1021–9. doi: 10.1016/j.echo.2004.06.019

5. Thavendiranathan P, Poulin F, Lim KD, Plana JC, Woo A, Marwick TH. Use of myocardial strain imaging by echocardiography for the early detection of cardiotoxicity in patients during and after cancer chemotherapy: a systematic review. J Am Coll Cardiol. (2014) 63(25):2751–68. doi: 10.1016/j.jacc.2014.01.073

6. Ye L, Yang ZG, Selvanayagam JB, Luo H, Yang TZ, Perry R, et al. Myocardial strain imaging by echocardiography for the prediction of cardiotoxicity in chemotherapy-treated patients: a meta-analysis. JACC Cardiovasc Imaging. (2020) 13(3):881–2. doi: 10.1016/j.jcmg.2019.09.013

7. Sawaya H, Sebag IA, Plana JC, Januzzi JL, Ky B, Tan TC, et al. Assessment of echocardiography and biomarkers for the extended prediction of cardiotoxicity in patients treated with anthracyclines, taxanes, and trastuzumab. Circ Cardiovasc Imaging. (2012) 5(5):596–603. doi: 10.1161/CIRCIMAGING.112.973321

8. Knackstedt C, Bekkers SC, Schummers G, Schreckenberg M, Muraru D, Badano LP, et al. Fully automated versus standard tracking of left ventricular ejection fraction and longitudinal strain: the FAST-EFs multicenter study. J Am Coll Cardiol. (2015) 66(13):1456–66. doi: 10.1016/j.jacc.2015.07.052

9. Salte IM, Østvik A, Smistad E, Melichova D, Nguyen TM, Karlsen S, et al. Artificial intelligence for automatic measurement of left ventricular strain in echocardiography. JACC Cardiovasc Imaging. (2021) 14(10):1918–28. doi: 10.1016/j.jcmg.2021.04.018

10. Upton R, Beqiri A, Parker A, Hawkes W, Gao S, Porumb M, et al. Automated echocardiographic detection of severe coronary artery disease using artificial intelligence. JACC Cardiovasc Imaging. (2022) 15(5):715–27. doi: 10.1016/j.jcmg.2021.10.013

11. Lyon AR, López-Fernández T, Couch LS, Asteggiano R, Aznar MC, Bergler-Klein J, et al. 2022 ESC guidelines on cardio-oncology developed in collaboration with the European Hematology Association (EHA), the European Society for Therapeutic Radiology and Oncology (ESTRO) and the International Cardio-Oncology Society (IC-OS) developed by the task force on cardio-oncology of the European Society of Cardiology (ESC). Eur Heart J. (2022) 43(41):4229–361. doi: 10.1093/eurheartj/ehac244

12. Khan AM, Wiegers SE. The importance of being expert: is it time to revisit the concept? J Am Soc Echocardiogr. (2012) 25(2):218–9. doi: 10.1016/j.echo.2011.12.001

13. Kitano T, Nabeshima Y, Abe Y, Otsuji Y, Takeuchi M. Accuracy and reliability of novel semi-automated two-dimensional layer specific speckle tracking software for quantifying left ventricular volumes and function. PLoS One. (2019) 14(8):e0221204. doi: 10.1371/journal.pone.0221204

14. Kawakami H, Wright L, Nolan M, Potter EL, Yang H, Marwick TH. Feasibility, reproducibility, and clinical implications of the novel fully automated assessment for global longitudinal strain. J Am Soc Echocardiogr. (2021) 34(2):136–45. doi: 10.1016/j.echo.2020.09.011

15. Medvedofsky D, Kebed K, Laffin L, Stone J, Addetia K, Lang RM, et al. Reproducibility and experience dependence of echocardiographic indices of left ventricular function: side-by-side comparison of global longitudinal strain and ejection fraction. Echocardiography. (2017) 34(3):365–70. doi: 10.1111/echo.13446

16. Chen Y, Hua W, Yang W, Shi Z, Fang Y. Reliability and feasibility of automated function imaging for quantification in patients with left ventricular dilation: comparison with cardiac magnetic resonance. Int J Cardiovasc Imaging. (2022) 38(6):1267–76. doi: 10.1007/s10554-021-02510-x

17. Nadruz W, West E, Sengeløv M, Grove GL, Santos M, Groarke JD, et al. Cardiovascular phenotype and prognosis of patients with heart failure induced by cancer therapy. Heart. (2019) 105(1):34–41. doi: 10.1136/heartjnl-2018-313234

18. Plana JC, Galderisi M, Barac A, Ewer MS, Ky B, Scherrer-Crosbie M, et al. Expert consensus for multimodality imaging evaluation of adult patients during and after cancer therapy: a report from the American society of echocardiography and the European association of cardiovascular imaging. Eur Heart J Cardiovasc Imaging. (2014) 15(10):1063–93. doi: 10.1093/ehjci/jeu192

19. Wierzbowska-Drabik K, Hamala P, Roszczyk N, Lipiec P, Plewka M, Kręcki R, et al. Feasibility and correlation of standard 2D speckle tracking echocardiography and automated function imaging derived parameters of left ventricular function during dobutamine stress test. Int J Cardiovasc Imaging. (2014) 30(4):729–37. doi: 10.1007/s10554-014-0386-z

Keywords: cardio-oncology, trastuzumab, cardiotoxicity, artificial intelligence, strain, echocardiography

Citation: Jiang J, Liu B, Li YW and Hothi SS (2023) Clinical service evaluation of the feasibility and reproducibility of novel artificial intelligence based-echocardiographic quantification of global longitudinal strain and left ventricular ejection fraction in trastuzumab-treated patients. Front. Cardiovasc. Med. 10:1250311. doi: 10.3389/fcvm.2023.1250311

Received: 5 July 2023; Accepted: 16 October 2023;
Published: 13 November 2023.

Edited by:

Gongning Luo, Harbin Institute of Technology, China

Reviewed by:

Kamran Shamsa, University of California, Los Angeles, United States
Wenqi Lu, Manchester Metropolitan University, United Kingdom

© 2023 Jiang, Liu, Li and Hothi. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: S. S. Hothi, s.hothi@nhs.net
