Abstract:Accurately predicting heart activity and other biological signals is crucial for diagnosis and monitoring. Given that speech is an outcome of multiple physiological systems, a significant body of work studied the acoustic correlates of heart activity. Recently, self-supervised models have excelled in speech-related tasks compared to traditional acoustic methods. However, the robustness of data-driven representations in predicting heart activity remained unexplored. In this study, we demonstrate that self-supervised speech models outperform acoustic features in predicting heart activity parameters. We also emphasize the impact of individual variability on model generalizability. These findings underscore the value of data-driven representations in such tasks and the need for more speech-based physiological data to mitigate speaker-related challenges.

What problem does this paper attempt to address?

The problem that this paper attempts to solve is: **The effectiveness and feasibility of predicting cardiac activity parameters (such as heart rate and heart rate variability) through voice signals**. Specifically, the author aims to evaluate the performance of self - supervised learning models (SSMs) in predicting cardiac activity and compare them with traditional knowledge - based acoustic features. In addition, the study also explores the impact of inter - individual and intra - individual differences on the generalization ability of the model. ### Specific description of the problem: 1. **Limitations of existing methods**: - Although previous studies have shown that voice signals can reflect changes in cardiac activity, most studies rely on traditional acoustic features (such as spectral features), which perform poorly when generalizing across individuals. - Self - supervised learning models have performed well in voice - related tasks, but their application in predicting cardiac activity has not been fully explored. 2. **Research objectives**: - Evaluate the performance of self - supervised learning models (especially the Hybrid BYOL - S model) in predicting cardiac activity parameters (such as BPM and HRV). - Compare the performance of self - supervised learning models with traditional acoustic features. - Study the impact of inter - individual and intra - individual differences on the generalization ability of the model. - Explore the impact of different context window lengths on prediction performance. - Analyze which acoustic features are most important for predicting cardiac activity. ### Research background: - **Relationship between cardiac activity and voice**: Previous studies have shown that cardiac activity (such as heart rate and blood pressure) can affect voice features (such as fundamental frequency F0). Therefore, predicting cardiac activity by analyzing voice signals has potential application value. - **Advantages of self - supervised learning**: Self - supervised learning models can be pre - trained with a large amount of unlabeled data, thereby extracting more robust voice representations, which may help improve the accuracy of predicting cardiac activity. ### Main contributions: - **Evaluate for the first time the performance of self - supervised learning models in predicting cardiac activity**. - **Emphasize the impact of individual differences on the generalization ability of the model**, and point out that more voice physiological data are needed to alleviate speaker - related challenges. - **Propose that increasing the context window length can significantly improve prediction performance**. - **Identify the acoustic features that are most important for predicting cardiac activity**, providing directions for future research. Through these studies, the author hopes to provide new insights into predicting cardiac activity using voice signals and promote further development in this field.

Predicting Heart Activity from Speech using Data-driven and Knowledge-based features

Artificial intelligence framework for heart disease classification from audio signals

Speech Signal Analysis for the Estimation of Heart Rates Under Different Emotional States

Predicting Pulmonary Function From the Analysis of Voice: A Machine Learning Approach

Application of an end-to-end model with self-attention mechanism in cardiac disease prediction

Dataset of raw and pre-processed speech signals, Mel Frequency Cepstral Coefficients of Speech and Heart Rate measurements

A Robust Predictive Model for Early Detection of Heart Disease using Machine Learning

Voice-Driven Mortality Prediction in Hospitalized Heart Failure Patients: A Machine Learning Approach Enhanced with Diagnostic Biomarkers

Pre-Trained Foundation Model representations to uncover Breathing patterns in Speech

Cardiac disease prediction using AI algorithms with SelectKBest

Heart Disease Prediction Using Machine Learning Algorithms

Cardiovascular Disease Recognition Based on Heartbeat Segmentation and Selection Process

Learning Representations from Heart Sound: A Comparative Study on Shallow and Deep Models

Abnormal Heart Sound Detection using Time-Frequency Analysis and Machine Learning Techniques

Predictive Modeling of Biomedical Signals Using Controlled Spatial Transformation

Machine Learning-Based Heart Disease Prediction Model

Learning Generalizable Physiological Representations from Large-scale Wearable Data

Machine Learning-Enabled Hypertension Screening Through Acoustical Speech Analysis: Model Development and Validation

A Clinical Decision Support System for Heart Disease Prediction Using Deep Learning

Federated Abnormal Heart Sound Detection with Weak to No Labels

Experimental evaluation of artificial intelligence assisted heart disease prediction using deep learning principle