Parameter-efficient Prompt Learning for 3D Point Cloud Understanding.

Sun Hongyu, Wang Yongcai, Chen Wang, Deng Haoran, Li Deying
DOI: https://doi.org/10.1109/icra57147.2024.10610093
2024-01-01
Abstract: This paper presents a parameter-efficient prompt tuning method, named PPT, to adapt a large multi-modal model for 3D point cloud understanding. Existing strategies are quite expensive in computation and storage, and depend on time-consuming prompt engineering. We address the problems from three aspects. Firstly, a PromptLearner module is devised to replace hand-crafted prompts with learnable contexts to automate the prompt tuning process. Then, we lock the pre-trained backbone instead of adopting the full fine-tuning paradigm to substantially improve the parameter efficiency. Finally, a lightweight PointAdapter module is arranged near target tasks to enhance prompt tuning for 3D point cloud understanding. Comprehensive experiments are conducted to demonstrate the superior parameter and data efficiency of the proposed method. Meanwhile, we obtain new records on 4 public datasets and multiple 3D tasks, i.e., point cloud recognition, few-shot learning, and part segmentation. The implementation is available at https://github.com/auniquesun/PPT.
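The three design choices in the abstract (learnable contexts, a locked backbone, and a small adapter near the task output) can be illustrated with a toy, dependency-free sketch. This is not the authors' implementation: the frozen "encoders" are random matrices standing in for a pre-trained multi-modal model, and the mean pooling, max pooling, residual adapter form, cosine-similarity scoring, and every name and size below (DIM, N_CTX, FROZEN, TRAINABLE, encode_text, encode_points, classify) are assumptions made for illustration only.

```python
import math
import random

DIM, N_CTX = 8, 4  # toy embedding width and context length (assumed values)
rng = random.Random(0)

def rand_vec(n, scale=1.0):
    return [rng.gauss(0.0, scale) for _ in range(n)]

def rand_mat(rows, cols, scale=1.0):
    return [rand_vec(cols, scale) for _ in range(rows)]

def matvec(m, v):
    return [sum(w * x for w, x in zip(row, v)) for row in m]

def normalize(v):
    norm = math.sqrt(sum(x * x for x in v)) or 1.0  # guard against zero norm
    return [x / norm for x in v]

# Frozen stand-ins for the pre-trained text and point encoders (never updated).
FROZEN = {
    "text_proj": rand_mat(DIM, DIM),
    "point_proj": rand_mat(DIM, 3),
}

# The only parameters a training loop would update in this sketch:
# learnable context vectors (prompt side) and a small adapter (point side).
TRAINABLE = {
    "ctx": rand_mat(N_CTX, DIM, 0.02),
    "adapter": rand_mat(DIM, DIM, 0.02),
}

def encode_text(class_embedding):
    """Learnable contexts + class token, mean-pooled, then frozen projection."""
    tokens = TRAINABLE["ctx"] + [class_embedding]
    pooled = [sum(col) / len(tokens) for col in zip(*tokens)]
    return normalize(matvec(FROZEN["text_proj"], pooled))

def encode_points(points):
    """Frozen per-point projection, max-pooled, then a residual adapter."""
    feats = [matvec(FROZEN["point_proj"], p) for p in points]
    pooled = [max(col) for col in zip(*feats)]
    delta = matvec(TRAINABLE["adapter"], pooled)
    return normalize([f + d for f, d in zip(pooled, delta)])

def classify(points, class_embeddings):
    """Cosine similarity between the point feature and each class prompt."""
    pf = encode_points(points)
    return [sum(a * b for a, b in zip(pf, encode_text(c)))
            for c in class_embeddings]

def count(params):
    return sum(len(row) for m in params.values() for row in m)

classes = [rand_vec(DIM) for _ in range(3)]  # toy class-name embeddings
cloud = [rand_vec(3) for _ in range(16)]     # toy point cloud of 16 xyz points
logits = classify(cloud, classes)
print("scores:", [round(s, 3) for s in logits])
print("trainable:", count(TRAINABLE), "frozen:", count(FROZEN))
```

In this toy the frozen matrices are tiny, so the printed counts say nothing about real parameter ratios; with an actual pre-trained backbone the frozen part would dominate, which is where the parameter efficiency claimed in the abstract would come from.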