EF-3DGS: Event-Aided Free-Trajectory 3D Gaussian Splatting

Bohao Liao,Wei Zhai,Zengyu Wan,Tianzhu Zhang,Yang Cao,Zheng-Jun Zha
2024-10-23
Abstract:Scene reconstruction from casually captured videos has wide applications in real-world scenarios. With recent advancements in differentiable rendering techniques, several methods have attempted to simultaneously optimize scene representations (NeRF or 3DGS) and camera poses. Despite recent progress, existing methods relying on traditional camera input tend to fail in high-speed (or equivalently low-frame-rate) scenarios. Event cameras, inspired by biological vision, record pixel-wise intensity changes asynchronously with high temporal resolution, providing valuable scene and motion information in blind inter-frame intervals. In this paper, we introduce the event camera to aid scene construction from a casually captured video for the first time, and propose Event-Aided Free-Trajectory 3DGS, called EF-3DGS, which seamlessly integrates the advantages of event cameras into 3DGS through three key components. First, we leverage the Event Generation Model (EGM) to fuse events and frames, supervising the rendered views observed by the event stream. Second, we adopt the Contrast Maximization (CMax) framework in a piece-wise manner to extract motion information by maximizing the contrast of the Image of Warped Events (IWE), thereby calibrating the estimated poses. Besides, based on the Linear Event Generation Model (LEGM), the brightness information encoded in the IWE is also utilized to constrain the 3DGS in the gradient domain. Third, to mitigate the absence of color information of events, we introduce photometric bundle adjustment (PBA) to ensure view consistency across events and frames. We evaluate our method on the public Tanks and Temples benchmark and a newly collected real-world dataset, RealEv-DAVIS. Our project page is <a class="link-external link-https" href="https://lbh666.github.io/ef-3dgs/" rel="external noopener nofollow">this https URL</a>.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The problem that this paper attempts to solve is scene reconstruction from casually - filmed videos in high - speed or low - frame - rate scenarios. Specifically, existing methods, when dealing with high - speed motion scenes, lead to a decline in the quality of scene reconstruction due to large inter - frame pixel displacements and insufficient observations. The paper proposes a new framework - Event - Aided Free - Trajectory 3D Gaussian Splatting (EF - 3DGS), which improves these problems by introducing data from event cameras. Event cameras can record pixel - luminance changes with high temporal resolution, providing scene and motion information missing between traditional camera frames, thereby improving the quality of scene reconstruction and the accuracy of camera - pose estimation. The main contributions of the paper include: 1. **Introducing event cameras into free - trajectory scene reconstruction for the first time**: Utilizing the high - temporal - resolution and low - latency characteristics of event cameras, demonstrating the potential of event data in challenging scenarios. 2. **Designing loss functions based on the Event Generation Model (EGM) and the Linear Event Generation Model (LEGM)**: These loss functions mine the motion and luminance information in event data and seamlessly integrate them into the 3D Gaussian Splatting (3DGS) optimization process. 3. **Experimental verification**: Experiments on public benchmarks and real - world datasets show that this method significantly outperforms existing methods in terms of rendering quality and trajectory - estimation accuracy. In particular, in high - speed scenarios, this method can achieve a PSNR improvement of up to 2 dB and a 40% reduction in absolute trajectory error (ATE). Through these innovations, the paper effectively solves the problem of scene reconstruction in high - speed or low - frame - rate scenarios, improving the reconstruction quality and robustness.