Abstract:Respiratory pattern is a representation of human breathing activity, which can reflect people's physical and psychological condition. Capturing the unexpected abnormal respiratory pattern unobtrusively of the patient or the potential patient has great significance. In the current work, we attempt to capitalize on depth camera and deep learning architecture to achieve the accurate and unobtrusive measurement of abnormal respiratory patterns, and the whole system can classify multiple people's respiratory patterns in a real-time manner. The challenges in this task are threefold: 1) the real-time online system means that the Region of Interest (ROI) needs to be located and tracked automatically; 2) the amount of real-world data is not enough for training to obtain the robust deep neural network; and 3) the intraclass variation is large and the outer class variation is small. Consequently, human joints tracking is applied to determine the location of subjects shoulder and chest. Based on the characteristics of actual respiratory signals, a novel and efficient respiratory simulation model (RSM) is proposed to generate abundant and high-quality training data. Finally, we apply a gated recurrent unit (GRU) neural network with bidirectional and attentional mechanisms (BI-AT-GRU) to classify six clinically significant respiratory patterns (Eupnea, Tachypnea, Bradypnea, Biots, Cheyne-Stokes, and Central-Apnea). The performance of the obtained BI-AT-GRU is tested by the data that is actually measured by the depth camera. The experimental results demonstrate that the proposed model can classify six different respiratory patterns with the accuracy, precision, recall, and F1 of 94.5%, 94.4%, 95.1%, and 94.8%, respectively. In comparative experiments, the obtained BI-AT-GRU specific to respiratory pattern classification outperforms the existing state-of-the-art, viz., BI-AT-LSTM, GRU, long short-term memory (LSTM), and BI-AT-GRU. Moreover, other experimental results indicate that the proposed online measuring system, deep neural network, and the modeling ideas have the potential to be extended to the large-scale applications, such as public places, sleep scenario, and office environment. The demo videos of the proposed system are available at: https://doi.org/10.6084/m9.figshare.11493666.v1.

Speaker and Posture Classification using Instantaneous Intraspeech Breathing Features

SmartSit: Sitting Posture Recognition Through Acoustic Sensing on Smartphones

Speaker identification from the sound of the human breath

I Sense You by Breath: Speaker Recognition Via Breath Biometrics

Pre-Trained Foundation Model representations to uncover Breathing patterns in Speech

Smartphone Based Human Breath Analysis from Respiratory Sounds

Investigation of Self-supervised Pre-trained Models for Classification of Voice Quality from Speech and Neck Surface Accelerometer Signals

SitPAA: Sitting Posture and Action Recognition Using Acoustic Sensing

Classification of Breathing Phase and Path with In-Ear Microphones

Unobtrusive and Automatic Classification of Multiple People's Abnormal Respiratory Patterns in Real Time Using Deep Neural Network and Depth Camera.

A boosting framework for human posture recognition using spatio-temporal features along with radon transform

Mask Detection and Breath Monitoring from Speech: on Data Augmentation, Feature Representation and Modeling

Respiratory Distress Detection from Telephone Speech using Acoustic and Prosodic Features

Coherent Feature Extraction with Swarm Intelligence Based Hybrid Adaboost Weighted ELM Classification for Snoring Sound Classification

BreathPass: Ultrasounic Authentication by Chest and Abdomen Movement While Breathing

Respiratory Disease Classification and Biometric Analysis Using Biosignals from Digital Stethoscopes

Medical Difficult Airway Detection using Speech Technology

Personalized breath based biometric authentication with wearable multimodality

Fine-Grained Classroom Activity Detection from Audio with Neural Networks

Development of High Accuracy Classifier for the Speaker Recognition System

CASA-Based Speaker Identification Using Cascaded GMM-CNN Classifier in Noisy and Emotional Talking Conditions