OPERAnet: A Multimodal Activity Recognition Dataset Acquired from Radio Frequency and Vision-based Sensors

Mohammud J. Bocus, Wenda Li, Shelly Vishwakarma, Roget Kou, Chong Tang, Karl Woodbridge, Ian Craddock, Ryan McConville, Raul Santos-Rodriguez, Kevin Chetty, Robert Piechocki
DOI: https://doi.org/10.48550/arXiv.2110.04239
2021-10-09
Abstract: This paper presents a comprehensive dataset intended to evaluate passive Human Activity Recognition (HAR) and localization techniques with measurements obtained from synchronized Radio-Frequency (RF) devices and vision-based sensors. The dataset consists of RF data including Channel State Information (CSI) extracted from a WiFi Network Interface Card (NIC), Passive WiFi Radar (PWR) built upon a Software Defined Radio (SDR) platform, and Ultra-Wideband (UWB) signals acquired via commercial off-the-shelf hardware. It also consists of vision/Infra-red based data acquired from Kinect sensors. Approximately 8 hours of annotated measurements are provided, which are collected across two rooms from 6 participants performing 6 daily activities. This dataset can be exploited to advance WiFi and vision-based HAR, for example, using pattern recognition, skeletal representation, deep learning algorithms or other novel approaches to accurately recognize human activities. Furthermore, it can potentially be used to passively track a human in an indoor environment. Such datasets are key tools required for the development of new algorithms and methods in the context of smart homes, elderly care, and surveillance applications.
Signal Processing, Image and Video Processing
What problem does this paper attempt to address?
The paper sets out to build a comprehensive multimodal dataset for evaluating passive human activity recognition (HAR) and localization techniques, using measurements obtained from synchronized radio-frequency (RF) devices and vision-based sensors. Specifically, the paper aims to:

1. **Create a multimodal dataset**: The dataset includes channel state information (CSI) extracted from a WiFi network interface card (NIC), passive WiFi radar (PWR) built on a software-defined radio (SDR) platform, ultra-wideband (UWB) signals acquired through commercial off-the-shelf hardware, and vision/infrared data from Kinect sensors.
2. **Cover various daily activities**: The dataset contains approximately 8 hours of annotated measurements collected across two rooms from 6 participants performing 6 daily activities.
3. **Support non-cooperative localization**: Beyond activity recognition, the dataset can potentially be used to passively track a human in an indoor environment. Here, passive means the target subjects are unaware of the sensing process and merely reflect or scatter signals from the transmitter to the receiver.
4. **Facilitate the development of algorithms and technologies**: The dataset can be used to develop and test pattern recognition, skeletal representation, deep learning algorithms, and other novel approaches that recognize human activities more accurately.
5. **Accelerate the development of self-supervised learning techniques**: The paper presents this as the first dataset explicitly aimed at accelerating the development of self-supervised learning techniques, which require much larger datasets than traditional supervised learning.

Through these contributions, the dataset provides key tools for research in fields such as smart homes, elderly care, and surveillance.