Abstract:In the current surveillance system, video streams are firstly captured and compressed at the cameras, and then transmitted to the backend severs or cloud for big data analysis. It is impractical to aggregate all video streams from hundreds of thousands of cameras for big data analysis. Transcoding the videos to low-bitrate ones is the conventional solution to solve the aggregation bottleneck. However, it is recognized that transcoding will inevitably affect visual feature extraction, consequently degrading the subsequent analysis performance. To address these challenges, we thus propose a new video big data analysis framework, called end-edge-cloud collaborative system. Under the end-edge-cloud collaborative framework, a camera can output two streams simultaneously, including a compressed video stream for viewing and data storage, and a compact feature stream extracted from the original video signals for visual analysis. Video stream and feature stream are synchronized by unified identification. We identify three key technologies to enable the end-edge-cloud collaborative system, including analysis-friendly video coding, visual feature compact descriptor, and user-defined neural network and parameter updating. By real-time feeding only the feature streams into the cloud center, these cameras thus form a large-scale brain-like vision system for the smart city. A prototype has been implemented to demonstrate its feasibility. Experiment results show that our system can achieve high efficient video compression and guarantee the analysis performance. Furthermore, our system makes the big data analysis feasible which only need aggregate low bit-rate compressed feature stream.

RT3C: Real-Time Crowd Counting in Multi-Scene Video Streams Via Cloud-Edge-Device Collaboration

Relevant Region Prediction for Crowd Counting

Multi-branch Progressive Embedding Network for Crowd Counting

Edge-Cloud Collaborative Streaming Video Analytics with Multi-agent Deep Reinforcement Learning

Edge Computing Enabled Real-Time Video Analysis via Adaptive Spatial-Temporal Semantic Filtering

Frame-Recurrent Video Crowd Counting

CrowdVision: A Computing Platform for Video Crowdprocessing Using Deep Learning

Edge-Cloud Collaboration for Human Activity Recognition on Multiple Subjects

Over-crowdedness Alert! Forecasting the Future Crowd Distribution

Meta-Knowledge and Multi-Task Learning-Based Multi-Scene Adaptive Crowd Counting

Scheduling Massive Camera Streams to Optimize Large-Scale Live Video Analytics

A Flow Base Bi-path Network for Cross-scene Video Crowd Understanding in Aerial View

Optimizing Cloud-Based Video Crowdsensing.

End-Edge-Cloud Collaborative System: A Video Big Data Processing and Analysis Architecture.

Motion-guided Non-local Spatial-Temporal Network for Video Crowd Counting

Enhanced 3D convolutional networks for crowd counting

CrossVision: Real-time On-Camera Video Analysis via Common RoI Load Balancing

CLRNet: A Cross Locality Relation Network for Crowd Counting in Videos

Dense Crowd Counting Based on Adaptive Scene Division

A Dynamic-Attention On Crowd Region With Physical Optical Flow Features For Crowd Counting

Proffler: Toward Collaborative and Scalable Edge-Assisted Crowdsourced Livecast