Bullying10K: A Large-Scale Neuromorphic Dataset towards Privacy-Preserving Bullying Recognition

Yiting Dong,Yang Li,Dongcheng Zhao,Guobin Shen,Yi Zeng
2023-10-23
Abstract:The prevalence of violence in daily life poses significant threats to individuals' physical and mental well-being. Using surveillance cameras in public spaces has proven effective in proactively deterring and preventing such incidents. However, concerns regarding privacy invasion have emerged due to their widespread deployment. To address the problem, we leverage Dynamic Vision Sensors (DVS) cameras to detect violent incidents and preserve privacy since it captures pixel brightness variations instead of static imagery. We introduce the Bullying10K dataset, encompassing various actions, complex movements, and occlusions from real-life scenarios. It provides three benchmarks for evaluating different tasks: action recognition, temporal action localization, and pose estimation. With 10,000 event segments, totaling 12 billion events and 255 GB of data, Bullying10K contributes significantly by balancing violence detection and personal privacy persevering. And it also poses a challenge to the neuromorphic dataset. It will serve as a valuable resource for training and developing privacy-protecting video systems. The Bullying10K opens new possibilities for innovative approaches in these domains.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The paper aims to address the threat to personal physical and mental health posed by violent incidents in daily life, with a particular focus on the issue of privacy invasion when using surveillance cameras for violence detection in public places. To balance the need for violence detection with personal privacy protection, the research team developed a large-scale neuromorphic dataset called Bullying10K using Dynamic Vision Sensor (DVS) cameras. This dataset contains real-world video clips under various complex actions, fast movements, and occlusions, collecting a total of 10,000 event clips, 12 billion events, and 255GB of data. The Bullying10K dataset provides three benchmark tasks to evaluate the performance of different methods: action recognition, temporal action localization, and pose estimation. By using DVS cameras instead of traditional RGB cameras, this dataset can effectively protect personal privacy while capturing violent events, as DVS cameras record pixel brightness changes rather than static images. Additionally, the research team provides keypoint labels for the human pose estimation task. This work not only offers valuable training resources for developing privacy-protecting video systems but also opens up new possibilities for future related research. In summary, the Bullying10K dataset aims to meet the real-time violence detection needs while maximizing the privacy protection of the individuals being recorded.