Progress in Medical Devices, Issue 2 (ZENTIME PUBLISHING CORPORATION LIMITED)

Cardiac function state recognition model based on bimodal time–frequency representation

Mingzhi Zhang, Piding Li

School of Health Science and Engineering, University of Shanghai for Science and Technology, Shanghai 200093, China.

Address correspondence to: Piding Li, School of Health Science and Engineering, University of Shanghai for Science and Technology, No. 516 Jungong Road, Yangpu District, Shanghai 200093, China. E-mail: lpdbyusst@163.com.

DOI: https://doi.org/10.61189/784716ypyhmm

Received November 28, 2025; Accepted February 27, 2026; Published June 24, 2026

Highlights

● We combine two types of cardiac physiological signals, the phonocardiogram and the electrocardiogram. They complement each other and help improve the final classification accuracy.

● This study converts phonocardiograms and electrocardiograms into time–frequency images, which helps increase the positive detection rate and enables automatic learning of modality-specific features through a neural network.

● This study modifies the baseline model to achieve a more streamlined neural network architecture and incorporates an attention mechanism to better focus on information correlations.

Abstract

Objective: This study uses dual-modality signals, the phonocardiogram (PCG) and the electrocardiogram (ECG), together with machine learning methods to distinguish cardiac function states in subjects. Methods: We developed a model based on time–frequency representations. The model comprises data preprocessing, a time–frequency conversion module, a feature extraction module, and a feature-fusion classifier module. Complete ensemble empirical mode decomposition with adaptive noise (CEEMDAN) removes noise from the PCG, and filters reduce noise in the ECG. Mel-frequency cepstral coefficients (MFCCs) are extracted from the PCG, and the Fourier synchrosqueezed transform is applied to the ECG. VGG16 and ResNet18 are modified to serve as feature extractors, with a variant attention mechanism inserted into the feature extraction networks. Finally, the feature vector is fed into a support vector machine for classification. Results: On public datasets, the dual-modality time–frequency method achieves 95.4% accuracy and 97.4% sensitivity for positive cases, demonstrating strong performance in cardiac function classification. Conclusion: The approach improves both diagnostic accuracy and sensitivity, and the system provides valuable support for the preliminary screening of cardiac dysfunction.
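
For readers less familiar with the two metrics reported in the Results, the following minimal Python sketch shows how accuracy and sensitivity (the detection rate for positive cases) are conventionally computed from binary predictions. It is an illustration under stated assumptions, not the authors' evaluation code: the function names `confusion_counts`, `accuracy`, and `sensitivity` are hypothetical, and the labels are made-up toy values that do not come from the study's datasets.

```python
def confusion_counts(y_true, y_pred, positive=1):
    """Count TP, TN, FP, FN for binary labels (hypothetical helper)."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    tp = tn = fp = fn = 0
    for truth, pred in zip(y_true, y_pred):
        if truth == positive and pred == positive:
            tp += 1          # abnormal case correctly detected
        elif truth != positive and pred != positive:
            tn += 1          # normal case correctly passed
        elif truth != positive and pred == positive:
            fp += 1          # normal case flagged as abnormal
        else:
            fn += 1          # abnormal case missed
    return tp, tn, fp, fn


def accuracy(tp, tn, fp, fn):
    """Fraction of all cases classified correctly."""
    total = tp + tn + fp + fn
    if total == 0:
        raise ValueError("no samples")
    return (tp + tn) / total


def sensitivity(tp, fn):
    """Fraction of truly positive cases that were detected (recall)."""
    if tp + fn == 0:
        raise ValueError("no positive samples")
    return tp / (tp + fn)


# Toy labels for illustration only (1 = abnormal, 0 = normal); not study data.
y_true = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 0, 0, 0, 0, 1, 0]

tp, tn, fp, fn = confusion_counts(y_true, y_pred)
print(f"accuracy    = {accuracy(tp, tn, fp, fn):.2f}")
print(f"sensitivity = {sensitivity(tp, fn):.2f}")
```

In a screening setting, sensitivity is usually the more critical of the two figures, because a missed positive case (a false negative) costs more than a false alarm that is resolved at follow-up.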

Keywords: Multi-modal, Phonocardiogram signal, Electrocardiogram signal, Feature encoding, Heart disease screening
