Abstract:During recent years, human activity recognition (HAR) using smart wearable sensors has become a main research focus in ubiquitous computing scenario. Deep convolutional neural networks (CNNs) have achieved significant success in HAR due to their automatic feature extracting ability in capturing local activity details. Due to superior performance, previous most works always prefer to apply small kernels instead of large kernels to handle time series sensor data for activity recognition. However, they do not intend to answer the key questions: why do large kernels underperform small kernels? How to close the performance gap? Intuitively, benefiting from larger receptive field (RF), larger kernels should have a great potential to model long-range dependencies in time series sensor data. So far, there has been little effort devoted to the larger-kernel design. In this article, we revisit the design of larger-kernel convolutions, which long have been neglected in the context of HAR. We find that both identity shortcut and structural re-parameterization can fully unleash the potential of larger-kernel convolutions. Extensive experiments and ablation studies on four mainstream benchmark datasets including PAMAP2, USC-HAD, UniMiB-SHAR, and OPPORTUNITY, show that our larger-kernel convolutions can further push the limit of small-kernel CNN performances under similar inference time, which can be used a drop-in replacement for small-kernel conv layers. For example, compared to the small-kernel baselines, our proposed approach can consistently boost recognition accuracy by 0.55%, 1.00%, 3.94%, and 1.64% on PAMAP2, USC-HAD, UniMiB-SHAR, and OPPORTUNITY, respectively, which is very competitive among the state-of-the-arts (SOTA). We believe that the incurred high performance is mainly due to larger effective RFs built via large kernels. The practical inference time is evaluated on a real hardware device. Our code can be available at: https://github.com/MinghuiYao/ELK-HAR/ .

Revisiting Large-Kernel CNN Design Via Structural Re-Parameterization for Sensor-Based Human Activity Recognition

Large Receptive Field Attention: An Innovation in Decomposing Large-Kernel Convolution for Sensor-Based Activity Recognition

Deep Neural Networks for Sensor-Based Human Activity Recognition Using Selective Kernel Convolution

Multiscale Deep Feature Learning for Human Activity Recognition Using Wearable Sensors

Real-time Human Activity Recognition Using Conditionally Parametrized Convolutions on Mobile and Wearable Devices

A multi-scale feature extraction fusion model for human activity recognition

Innovative Dual-Decoupling CNN with Layer-wise Temporal-Spatial Attention for Sensor-Based Human Activity Recognition

Deep Convolutional Networks With Tunable Speed–Accuracy Tradeoff for Human Activity Recognition Using Wearables

Scaling Up Your Kernels to 31x31: Revisiting Large Kernel Design in CNNs

Human activity recognition using wearable sensors by heterogeneous convolutional neural networks

Scaling Up Your Kernels: Large Kernel Design in ConvNets towards Universal Representations

Dual-Branch Interactive Networks on Multichannel Time Series for Human Activity Recognition

Shallow Convolutional Neural Networks for Human Activity Recognition Using Wearable Sensors

An Efficient Diverse-Branch Convolution Scheme for Sensor-Based Human Activity Recognition

A Multidimensional Parallel Convolutional Connected Network Based on Multisource and Multimodal Sensor Data for Human Activity Recognition

Layer-Wise Training Convolutional Neural Networks With Smaller Filters for Human Activity Recognition Using Wearable Sensors

Micro-network-based deep convolutional neural network for human activity recognition from realistic and multi-view visual data

Cross-Attention Enhanced Pyramid Multi-Scale Networks for Sensor-based Human Activity Recognition

A Multi-dimensional Parallel Convolutional Connected Network Based on Multi-source and Multi-modal Sensor Data for Human Activity Recognition

Human activity recognition with fine-tuned CNN-LSTM

The Convolutional Neural Networks Training With Channel-Selectivity for Human Activity Recognition Based on Sensors