Abstract:In this paper, we propose Point-Voxel Correlation Fields to explore relations between two consecutive point clouds and estimate scene flow that represents 3D motions. Most existing works only consider local correlations, which are able to handle small movements but fail when there are large displacements. Therefore, it is essential to introduce all-pair correlation volumes that are free from local neighbor restrictions and cover both short- and long-term dependencies. However, it is challenging to efficiently extract correlation features from all-pairs fields in the 3D space, given the irregular and unordered nature of point clouds. To tackle this problem, we present point-voxel correlation fields, proposing distinct point and voxel branches to inquire about local and long-range correlations from all-pair fields respectively. To exploit point-based correlations, we adopt the K-Nearest Neighbors search that preserves fine-grained information in the local region, which guarantees the scene flow estimation precision. By voxelizing point clouds in a multi-scale manner, we construct pyramid correlation voxels to model long-range correspondences, which are utilized to handle fast-moving objects. Integrating these two types of correlations, we propose Point-Voxel Recurrent All-Pairs Field Transforms (PV-RAFT) architecture that employs an iterative scheme to estimate scene flow from point clouds. To adapt to different flow scope conditions and obtain more fine-grained results, we further propose Deformable PV-RAFT (DPV-RAFT), where the Spatial Deformation deforms the voxelized neighborhood, and the Temporal Deformation controls the iterative update process. We evaluate the proposed method on the FlyingThings3D and KITTI Scene Flow 2015 datasets and experimental results show that we outperform state-of-the-art methods by remarkable margins.

RPPformer-Flow: Relative Position Guided Point Transformer for Scene Flow Estimation

Unsupervised Learning of Scene Flow Estimation Fusing with Local Rigidity.

SAFIT: Segmentation-Aware Scene Flow with Improved Transformer

TransFlow: Transformer as Flow Learner

HCRF-Flow: Scene Flow from Point Clouds with Continuous High-order CRFs and Position-aware Flow Embedding

Optical Flow as Spatial-Temporal Attention Learners

DELFlow: Dense Efficient Learning of Scene Flow for Large-Scale Point Clouds

RCP: Recurrent Closest Point for Scene Flow Estimation on 3D Point Clouds

Hierarchical Attention Learning of Scene Flow in 3D Point Clouds

3D Point-Voxel Correlation Fields for Scene Flow Estimation.

Self-Supervised Scene Flow Estimation with Point-Voxel Fusion and Surface Representation

PV-RAFT: Point-Voxel Correlation Fields for Scene Flow Estimation of Point Clouds

Bi-PointFlowNet: Bidirectional Learning for Point Cloud Based Scene Flow Estimation

RigidFlow: Self-Supervised Scene Flow Learning on Point Clouds by Local Rigidity Prior

Exploring Point-BEV Fusion for 3D Point Cloud Object Tracking with Transformer

RMS-FlowNet++: Efficient and Robust Multi-scale Scene Flow Estimation for Large-Scale Point Clouds

ProSTformer: Pre-trained Progressive Space-Time Self-attention Model for Traffic Flow Forecasting

Self-Supervised 3D Scene Flow Estimation and Motion Prediction using Local Rigidity Prior

PTTR: Relational 3D Point Cloud Object Tracking with Transformer

RPEFlow: Multimodal Fusion of RGB-PointCloud-Event for Joint Optical Flow and Scene Flow Estimation