Abstract:Thoroughly testing autonomy systems is crucial in the pursuit of safe autonomous driving vehicles. It necessitates creating safety-critical scenarios that go beyond what can be safely collected from real-world data, as many of these scenarios occur infrequently on public roads. However, the evaluation of most existing NVS methods relies on sporadic sampling of image frames from the training data, comparing the rendered images with ground truth images using metrics. Unfortunately, this evaluation protocol falls short of meeting the actual requirements in closed-loop simulations. Specifically, the true application demands the capability to render novel views that extend beyond the original trajectory (such as cross-lane views), which are challenging to capture in the real world. To address this, this paper presents a novel driving view synthesis dataset and benchmark specifically designed for autonomous driving simulations. This dataset is unique as it includes testing images captured by deviating from the training trajectory by 1-4 meters. It comprises six sequences encompassing various time and weather conditions. Each sequence contains 450 training images, 150 testing images, and their corresponding camera poses and intrinsic parameters. Leveraging this novel dataset, we establish the first realistic benchmark for evaluating existing NVS approaches under front-only and multi-camera settings. The experimental findings underscore the significant gap that exists in current approaches, revealing their inadequate ability to fulfill the demanding prerequisites of cross-lane or closed-loop simulation. Our dataset is released publicly at the project page: <a class="link-external link-https" href="https://3d-aigc.github.io/XLD/" rel="external noopener nofollow">this https URL</a>.

FreeVS: Generative View Synthesis on Free Driving Trajectory

Driving Scene Synthesis on Free-form Trajectories with Generative Prior

Ivs-Net: Learning Human View Synthesis from Internet Videos

FreeSim: Toward Free-viewpoint Camera Simulation in Driving Scenes

Asymmetric Bidirectional View Synthesis for Free Viewpoint and Three-Dimensional Video

DriveScape: Towards High-Resolution Controllable Multi-View Driving Video Generation

Pose-Based View Synthesis for Vehicles: A Perspective Aware Method

Drive-1-to-3: Enriching Diffusion Priors for Novel View Synthesis of Real Vehicles

DreamDrive: Generative 4D Scene Modeling from Street View Images

Generative View Synthesis: From Single-view Semantics to Novel-view Images

3D-free meets 3D priors: Novel View Synthesis from a Single Image with Pretrained Diffusion Guidance

Epipolar-Free 3D Gaussian Splatting for Generalizable Novel View Synthesis

Novel View Synthesis of Dynamic Human with Sparse Cameras.

Free3D: Consistent Novel View Synthesis without 3D Representation

StreetCrafter: Street View Synthesis with Controllable Video Diffusion Models

WayveScenes101: A Dataset and Benchmark for Novel View Synthesis in Autonomous Driving

AUTO3D: Novel view synthesis through unsupervisely learned variational viewpoint and global 3D representation

ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis

VDG: Vision-Only Dynamic Gaussian for Driving Simulation

XLD: A Cross-Lane Dataset for Benchmarking Novel Driving View Synthesis

Self-Supervised Visibility Learning for Novel View Synthesis