Spatiotemporal feature disentanglement for quality surveillance of left ventricular echocardiographic video using ST-R(2 + 1)D-ConvNeXt
Journal
Biomedical Signal Processing and Control
Date Issued
2025-07-01
Author(s)
DOI
10.1016/j.bspc.2025.107671
Abstract
The left ventricle (LV), as the primary chamber responsible for systemic circulation, plays a crucial role in cardiac function assessment. Echocardiography which particularly focuses on LV, is vital for cardiac disease diagnosis. However, the diagnostic accuracy heavily depends on image quality, which requires systematic assessment. In this study, we propose a two-stage deep learning approach for echocardiographic quality surveillance using a dataset of 514 annotated videos. The first stage employs EchoNet, to extract LV volumes of interest. The second stage introduces ST-R(2 + 1)D-ConvNeXt, a novel ConvNeXt-based model designed to disentangle spatiotemporal features and leverage echocardiographic hallmarks within the apical-four-chamber (A4C) dynamic echocardiogram data. The proposed approach achieves an accuracy of 82.63 %, an Area Under the Curve (AUC) of 0.89, a sensitivity of 84.10 %, and a specificity of 81.08 % in classifying echocardiographic videos into high and low quality. Furthermore, through explainable AI techniques, our model identifies specific quality issues such as missing cardiac walls, distorted or poorly positioned chambers, and other anomalies, providing interpretable feedback for clinical applications.
Subjects
Deep learning; Left ventricular echocardiography; Quality surveillance; Spatiotemporal feature disentanglement
Type
journal article
