FTMAR: A Fusion Transformer Network for Multi-Resident Activity Recognition

Jingjing Cao, Jie Chu, Fukang Guo, Kai Liu, Ruitao Xie, Hu Qin
DOI: https://doi.org/10.2139/ssrn.4064632
2022-01-01
Abstract: The rapid development of smart homes has sparked great interest in human activity recognition (HAR). Existing work focuses mainly on single-resident activity recognition in a room, with little consideration of multi-resident scenarios. Moreover, traditional methods classify activities using only the temporal relationships in the data and ignore the relationships among different sensors, which makes it hard to achieve satisfactory classification performance. In this paper, we put forward an end-to-end framework named Fusion Transformer Network for Multi-resident Activity Recognition (FTMAR) to address these issues. First, FTMAR preprocesses binary ambient sensor sequences into embedding matrices that serve as network input. Second, to take full advantage of the ambient sensor information, two Transformer-based encoders, the Sequence Encoder and the Embedding Encoder, are constructed separately, operating along the time-sequence dimension and the embedding dimension, respectively. Then, we design an ensemble strategy that uses a convolution layer to fuse intermediate features between the encoders. Finally, to handle the multi-resident challenge, two classification heads identify residents and activities simultaneously. Experimental results on the CASAS dataset show that our approach improves recognition performance for both residents and activities.
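The data flow described in the abstract (attention along two dimensions, convolutional fusion, two classification heads) can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation: the abstract gives no layer sizes, projection matrices, or training details, so everything below is an assumption made for illustration. Specifically, attention uses identity query/key/value projections, fusion is reduced to a 1x1 convolution over the two stacked feature maps (a per-position weighted sum), the head weights are random and untrained, and the names `self_attention`, `fuse`, `linear_head`, and `forward` are hypothetical.

```python
import math
import random


def matmul(a, b):
    """Multiply an (m x k) matrix by a (k x n) matrix, both as nested lists."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def transpose(m):
    return [list(col) for col in zip(*m)]


def softmax(row):
    peak = max(row)  # subtract the max for numerical stability
    exps = [math.exp(v - peak) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(tokens):
    """Scaled dot-product self-attention with identity Q/K/V (an assumption)."""
    scale = math.sqrt(len(tokens[0]))
    scores = matmul(tokens, transpose(tokens))
    weights = [softmax([s / scale for s in row]) for row in scores]
    return matmul(weights, tokens)


def fuse(a, b, w_a=0.5, w_b=0.5, bias=0.0):
    """1x1 convolution over two stacked maps: a weighted sum at each position."""
    return [[w_a * x + w_b * y + bias for x, y in zip(ra, rb)]
            for ra, rb in zip(a, b)]


def linear_head(features, n_classes, rng):
    """Untrained linear classifier with random weights; returns raw logits."""
    weights = [[rng.uniform(-0.1, 0.1) for _ in range(n_classes)] for _ in features]
    return matmul([features], weights)[0]


def forward(x, n_residents, n_activities, seed=0):
    rng = random.Random(seed)
    seq_out = self_attention(x)                         # tokens are time steps
    emb_out = transpose(self_attention(transpose(x)))   # tokens are embedding channels
    fused = fuse(seq_out, emb_out)
    pooled = [sum(col) / len(col) for col in zip(*fused)]  # mean over time steps
    return (linear_head(pooled, n_residents, rng),
            linear_head(pooled, n_activities, rng))


# Toy input: 4 time steps x 3 binary sensor channels (made-up values).
window = [[1.0, 0.0, 0.0],
          [1.0, 1.0, 0.0],
          [0.0, 1.0, 0.0],
          [0.0, 0.0, 1.0]]
resident_logits, activity_logits = forward(window, n_residents=2, n_activities=5)
```

Transposing the input before the second attention call is what switches the token axis from time steps to embedding channels; transposing the result back lets the two outputs be fused position by position. A real model would add learned projections, multiple heads, feed-forward layers, and a trained convolution.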