Multi-Object Tracking in the Dark

Xinzhe Wang,Kang Ma,Qiankun Liu,Yunhao Zou,Ying Fu

2024-05-11

Abstract:Low-light scenes are prevalent in real-world applications (e.g. autonomous driving and surveillance at night). Recently, multi-object tracking in various practical use cases have received much attention, but multi-object tracking in dark scenes is rarely considered. In this paper, we focus on multi-object tracking in dark scenes. To address the lack of datasets, we first build a Low-light Multi-Object Tracking (LMOT) dataset. LMOT provides well-aligned low-light video pairs captured by our dual-camera system, and high-quality multi-object tracking annotations for all videos. Then, we propose a low-light multi-object tracking method, termed as LTrack. We introduce the adaptive low-pass downsample module to enhance low-frequency components of images outside the sensor noises. The degradation suppression learning strategy enables the model to learn invariant information under noise disturbance and image quality degradation. These components improve the robustness of multi-object tracking in dark scenes. We conducted a comprehensive analysis of our LMOT dataset and proposed LTrack. Experimental results demonstrate the superiority of the proposed method and its competitiveness in real night low-light scenes. Dataset and Code: https: //github.com/ying-fu/LMOT

Computer Vision and Pattern Recognition

What problem does this paper attempt to address?

The paper primarily addresses the issue of multi-object tracking in low-light environments, aiming to solve two core challenges: 1. **Lack of multi-object tracking datasets under low-light conditions**: Existing multi-object tracking datasets are mainly collected under well-lit conditions, while collecting high-quality videos and their annotations in low-light environments is very difficult and costly. 2. **Technical challenges of multi-object tracking under low-light conditions**: In low-light conditions, image quality is poor and noise is high, which directly affects the performance of object detectors and appearance-based association modules, thereby impacting the effectiveness of multi-object tracking. To address the above issues, the paper proposes the following contributions: - **Constructed the first multi-object tracking dataset under low-light conditions (LMOT)**: Researchers designed a dual-camera system that can simultaneously capture video frames under well-lit and low-light conditions. This setup allows annotation work to be done on well-lit videos, while these videos can also provide additional supervision information for the model during the training phase. The LMOT dataset contains 32 video sequences, over 35,000 frames, and more than 815,000 bounding boxes. - **Proposed a multi-object tracking method under low-light conditions (LTrack)**: To improve tracking performance, the LTrack method includes two key components: - **Adaptive Low-pass Downsampling Module (ALD)**: Enhances feature maps by extracting low-frequency components from images through spatial low-pass convolution and filtering out high-frequency noise. - **Degradation Suppression Learning Strategy (DSL)**: Utilizes paired low-light videos to help the model suppress image noise and encourage content response in the feature domain, thereby improving the model's robustness to noise. Experimental results show that LTrack demonstrates superiority and competitiveness in multi-object tracking tasks under low-light conditions, especially in real night scene tests.

Multi-Object Tracking in the Dark

Low-Light Object Tracking: A Benchmark

Supplementary Material: Quasi-Dense Similarity Learning for Multiple Object Tracking

Object-Level Pseudo-3D Lifting for Distance-Aware Tracking

A Comprehensive Study of Object Tracking in Low-Light Environments

Robust Unsupervised Multi-Object Tracking in Noisy Environments

Cross-Modal Object Tracking via Modality-Aware Fusion Network and a Large-Scale Dataset

Deep learning and multi-modal fusion for real-time multi-object tracking: Algorithms, challenges, datasets, and comparative study

Cross-Modal Object Tracking: Modality-Aware Representations and a Unified Benchmark

DarkVision: A Benchmark for Low-light Image/Video Perception

Transnational Image Object Detection Datasets from Nighttime Driving

CAMO-MOT: Combined Appearance-Motion Optimization for 3D Multi-Object Tracking With Camera-LiDAR Fusion

MotionTrack: Learning Robust Short-term and Long-term Motions for Multi-Object Tracking

MCTrack: A Unified 3D Multi-Object Tracking Framework for Autonomous Driving

SparseTrack: Multi-Object Tracking by Performing Scene Decomposition based on Pseudo-Depth

Analysis Based on Recent Deep Learning Approaches Applied in Real-Time Multi-Object Tracking: A Review

Awesome Multi-modal Object Tracking

Getting to Know Low-light Images with The Exclusively Dark Dataset

Multi-Granularity Language-Guided Multi-Object Tracking

Matching in the Dark: A Dataset for Matching Image Pairs of Low-light Scenes