Abstract:BACKGROUND Multimodal wearable technologies have brought forward wide possibilities in human activity recognition, and more specifically personalized monitoring of eating habits. The emerging challenge now is the selection of most discriminative information from high-dimensional data collected from multiple sources. The available fusion algorithms with their complex structure are poorly adopted to the computationally constrained environment which requires integrating information directly at the source. As a result, more simple low-level fusion methods are needed. OBJECTIVE In the absence of a data combining process, the cost of directly applying high-dimensional raw data to a deep classifier would be computationally expensive with regard to the response time, energy consumption, and memory requirement. Taking this into account, we aimed to develop a data fusion technique in a computationally efficient way to achieve a more comprehensive insight of human activity dynamics in a lower dimension. The major objective was considering statistical dependency of multisensory data and exploring intermodality correlation patterns for different activities. METHODS In this technique, the information in time (regardless of the number of sources) is transformed into a 2D space that facilitates classification of eating episodes from others. This is based on a hypothesis that data captured by various sensors are statistically associated with each other and the covariance matrix of all these signals has a unique distribution correlated with each activity which can be encoded on a contour representation. These representations are then used as input of a deep model to learn specific patterns associated with specific activity. RESULTS In order to show the generalizability of the proposed fusion algorithm, 2 different scenarios were taken into account. These scenarios were different in terms of temporal segment size, type of activity, wearable device, subjects, and deep learning architecture. The first scenario used a data set in which a single participant performed a limited number of activities while wearing the Empatica E4 wristband. In the second scenario, a data set related to the activities of daily living was used where 10 different participants wore inertial measurement units while performing a more complex set of activities. The precision metric obtained from leave-one-subject-out cross-validation for the second scenario reached 0.803. The impact of missing data on performance degradation was also evaluated. CONCLUSIONS To conclude, the proposed fusion technique provides the possibility of embedding joint variability information over different modalities in just a single 2D representation which results in obtaining a more global view of different aspects of daily human activities at hand, and yet preserving the desired performance level in activity recognition.

Ameliorating multimodal food classification using state of the art deep learning techniques

Food Classification using Joint Representation of Visual and Textual Data

Fine-grained food image classification and recipe extraction using a customized deep neural network and NLP

Food Image Classification and Calorie Prediction for Dietary Analysis

Food Recognition using Fusion of Classifiers based on CNNs

A Model for Automated Food Logging Through Food Recognition and Attribute Estimation Using Deep Learning

DeepFood: Deep Learning-Based Food Image Recognition for Computer-Aided Dietary Assessment

Deep Learning–Based Multimodal Data Fusion: Case Study in Food Intake Episodes Detection Using Wearable Sensors (Preprint)

Deep Learning–Based Multimodal Data Fusion: Case Study in Food Intake Episodes Detection Using Wearable Sensors

Performance Evaluation of Indian Food Image Classification system using Transfer Learning with MobileNetV3

Deep neural network for food image classification and nutrient identification: A systematic review

NUTRINET: INDIAN FOOD NUTRITION CLASSIFICATION USING DEEP LEARNING

Food Recognition based on Deep Learning Algorithms

A Novel Method for Accurate & Real-time Food Classification: The Synergistic Integration of EfficientNetB7, CBAM, Transfer Learning, and Data Augmentation

Study for Food Recognition System Using Deep Learning

Multimodal medical image fusion and classification using deep learning techniques

A review of deep learning-based information fusion techniques for multimodal medical image classification

Multi-Task Image-Based Dietary Assessment for Food Recognition and Portion Size Estimation

Indian Food Image Classification Using Convolutional Neural Network

Few-Shot And Many-Shot Fusion Learning In Mobile Visual Food Recognition

VTnet+Handcrafted based approach for food cuisines classification