Abstract:Collaborative perception (CP) is emerging as a promising solution to the inherent limitations of stand-alone intelligence. However, current wireless communication systems are unable to support feature-level and raw-level collaborative algorithms due to their enormous bandwidth demands. In this paper, we propose DiffCP, a novel CP paradigm that utilizes a specialized diffusion model to efficiently compress the sensing information of collaborators. By incorporating both geometric and semantic conditions into the generative model, DiffCP enables feature-level collaboration with an ultra-low communication cost, advancing the practical implementation of CP systems. This paradigm can be seamlessly integrated into existing CP algorithms to enhance a wide range of downstream tasks. Through extensive experimentation, we investigate the trade-offs between communication, computation, and performance. Numerical results demonstrate that DiffCP can significantly reduce communication costs by 14.5-fold while maintaining the same performance as the state-of-the-art algorithm.

What problem does this paper attempt to address?

The main problem that this paper attempts to solve is the contradiction between the high - bandwidth requirements and low communication costs faced by Collaborative Perception (CP) in Intelligent Unmanned Systems (IUSs) under the bandwidth limitations of existing wireless communication systems. Specifically: 1. **Background problems**: - A single - agent framework (such as an autonomous vehicle, an intelligent robot) is limited by factors such as sensor failures, limited sensing range, and environmental occlusion, and it is difficult to meet the requirements for safety and reliability. - Although Device - to - Device (D2D) communication technologies (such as the sidelink in C - V2X networks) enable agents to share sensing information through wireless channels, it is still challenging to achieve high - reliability and low - latency transmissions in densely deployed, highly mobile, and obstructed environments. 2. **Limitations of existing methods**: - The raw - data - level CP method, although retaining detailed information, requires a huge amount of bandwidth (for example, a 64 - line LiDAR requires approximately 360 Mbps, and a single HD camera requires approximately 20 Mbps), far exceeding the current C - V2X channel capacity. - The object - level CP method reduces the bandwidth requirement (approximately 150 Kbps) by transmitting detection results, but it depends on the individual detection capabilities of each agent, limiting the overall performance. - Feature - level CP methods (such as F - Cooper and V2VNet) compress raw data for communication, but these methods still have limitations in performance or bandwidth efficiency. 3. **Solutions proposed in the paper**: - DiffCP, a novel CP paradigm based on the diffusion model, is proposed, which can achieve feature - level collaboration within the object - level communication cost. - DiffCP uses geometric and semantic conditional generation models to efficiently compress the perception information of collaborators, thereby significantly reducing communication costs while maintaining high performance. - Through experimental verification, DiffCP can maintain the same performance as the state - of - the - art algorithms while reducing the communication cost by 14.5 times. In summary, this paper aims to solve the high - communication - cost problem of existing CP methods in bandwidth - limited environments by introducing DiffCP based on the diffusion model, thereby promoting the practical application and development of collaborative perception systems.

DiffCP: Ultra-Low Bit Collaborative Perception via Diffusion Model

Slim-FCP: Lightweight-Feature-Based Cooperative Perception for Connected Automated Vehicles

Task-Oriented Wireless Communications for Collaborative Perception in Intelligent Unmanned Systems

Efficient Vehicular Collaborative Perception Based on Saptial-Temporal Feature Compression

Pragmatic Communication in Multi-Agent Collaborative Perception

What2comm: Towards Communication-efficient Collaborative Perception Via Feature Decoupling

Cooperative Networking Towards Maritime Cyber Physical Systems

UMC: A Unified Bandwidth-efficient and Multi-resolution based Collaborative Perception Framework

DCP-Net: A Distributed Collaborative Perception Network for Remote Sensing Semantic Segmentation

Communication-Efficient Collaborative Perception via Information Filling with Codebook

DMCE: Diffusion Model Channel Enhancer for Multi-User Semantic Communication Systems

Multi-User Probabilistic Semantic Communication with Semantic Compression Ratio Optimization

Diff-GO: Diffusion Goal-Oriented Communications to Achieve Ultra-High Spectrum Efficiency

RSU-Aided Energy-Efficient Collaborative Perception for Connected Autonomous Vehicles

R-ACP: Real-Time Adaptive Collaborative Perception Leveraging Robust Task-Oriented Communications

SparseComm: an Efficient Sparse Communication Framework for Vehicle-Infrastructure Cooperative 3D Detection

C-MASS: Combinatorial Mobility-Aware Sensor Scheduling for Collaborative Perception with Second-Order Topology Approximation

PACP: Priority-Aware Collaborative Perception for Connected and Autonomous Vehicles

How2comm: Communication-Efficient and Collaboration-Pragmatic Multi-Agent Perception

Task-Oriented Communication for Multi-Device Cooperative Edge Inference