Foundation Model-Powered 3D Few-Shot Class Incremental Learning via Training-free Adaptor

Sahar Ahmadi,Ali Cheraghian,Morteza Saberi,Md.Towsif Abir,Hamidreza Dastmalchi,Farookh Hussain,Shafin Rahman
2024-10-12
Abstract:Recent advances in deep learning for processing point clouds hold increased interest in Few-Shot Class Incremental Learning (FSCIL) for 3D computer vision. This paper introduces a new method to tackle the Few-Shot Continual Incremental Learning (FSCIL) problem in 3D point cloud environments. We leverage a foundational 3D model trained extensively on point cloud data. Drawing from recent improvements in foundation models, known for their ability to work well across different tasks, we propose a novel strategy that does not require additional training to adapt to new tasks. Our approach uses a dual cache system: first, it uses previous test samples based on how confident the model was in its predictions to prevent forgetting, and second, it includes a small number of new task samples to prevent overfitting. This dynamic adaptation ensures strong performance across different learning tasks without needing lots of fine-tuning. We tested our approach on datasets like ModelNet, ShapeNet, ScanObjectNN, and CO3D, showing that it outperforms other FSCIL methods and demonstrating its effectiveness and versatility. The code is available at \url{<a class="link-external link-https" href="https://github.com/ahmadisahar/ACCV_FCIL3D" rel="external noopener nofollow">this https URL</a>}.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The problem that this paper attempts to solve is the Few - Shot Class Incremental Learning (FSCIL) problem in the 3D point - cloud environment. Specifically, the author proposes a new method to address the challenges of FSCIL in 3D computer vision, especially how to avoid catastrophic forgetting and overfitting when dealing with limited new - class data. ### Problem Background 1. **Catastrophic Forgetting**: When a model is learning a new task, it often forgets the knowledge it has learned before. This is especially serious for 3D point - cloud data because the data of new classes are usually very limited. 2. **Overfitting**: Since the amount of data for new classes is small, the model is prone to overfit these small amounts of data, resulting in a decline in generalization ability. 3. **Domain Gap**: The base tasks of 3D point - clouds usually use synthetic data, while new tasks use real - scan data, which leads to the domain - gap problem and increases the complexity of FSCIL. ### Research Motivation Existing FSCIL methods usually rely on rehearsal strategies or fine - tuning. Although these methods can alleviate the forgetting problem to a certain extent, they also bring the risk of overfitting. In addition, these methods require additional training, increasing the computational cost and complexity. ### Proposed Method To solve the above problems, the author proposes a training - free adaptor based on a pre - trained foundation model. The main features of this method are as follows: 1. **Utilizing the Pre - trained Foundation Model**: The author uses a foundation model pre - trained on a large amount of 3D point - cloud data. This model has strong generalization ability and can transfer knowledge between different tasks. 2. **Training - Free Adaptor**: By introducing an adaptor module that does not require additional training, this module contains two cache systems: - **Base - Task Cache**: Stores test samples from the base task to prevent forgetting the knowledge of old classes. - **New - Task Cache**: Stores a small number of training samples of new classes to prevent overfitting. 3. **Dynamic Adaptation**: The adaptor module dynamically adjusts the output of the model according to the characteristics of the input samples, ensuring that the model can effectively learn new classes without additional training. ### Experimental Results The author conducted experiments on multiple 3D datasets (such as ModelNet, ShapeNet, ScanObjectNN, and CO3D). The results show that this method has achieved state - of - the - art performance in the FSCIL task, especially in the cross - dataset incremental learning scenarios. ### Summary This paper successfully solves the catastrophic forgetting and overfitting problems in FSCIL in the 3D point - cloud environment by introducing a training - free adaptor based on a pre - trained foundation model, demonstrating its robustness and generalization ability in different tasks and datasets.