Evaluating SAM2's Role in Camouflaged Object Detection: From SAM to SAM2

Lv Tang, Bo Li
2024-07-31
Abstract: The Segment Anything Model (SAM), introduced by Meta AI Research as a generic object segmentation model, quickly garnered widespread attention and significantly influenced the academic community. To extend its application to video, Meta further developed Segment Anything Model 2 (SAM2), a unified model capable of both video and image segmentation. SAM2 shows notable improvements over its predecessor in terms of applicable domains, promptable segmentation accuracy, and running speed. However, this report reveals that, compared to SAM, SAM2 is less able to perceive different objects in images when run without prompts in its auto mode. Specifically, we employ the challenging task of camouflaged object detection to assess this performance decrease, hoping to inspire further exploration of the SAM model family by researchers. The results of this paper are available at https://github.com/luckybird1994/SAMCOD.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
This paper evaluates how the Segment Anything Model 2 (SAM2) performs on Camouflaged Object Detection (COD), particularly in comparison to its predecessor, SAM. It focuses on two aspects:

1. **Promptable Segmentation**: How accurately SAM2 segments objects when segmentation prompts are provided.
2. **Auto Mode Segmentation**: Whether SAM2's ability to automatically identify and segment all objects in an image declines when no prompts are given.

Through these evaluations, the paper aims to reveal the strengths and limitations of SAM2 in different application scenarios and to inspire researchers to further explore the potential of the SAM model family.
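
To make the two evaluation settings concrete, here is a minimal sketch of how a prompted result and an auto-mode result could each be scored against a ground-truth camouflaged-object mask. It is an illustration under stated assumptions, not the paper's protocol: this summary does not name the metrics used, so intersection over union (IoU) is assumed here, and the function names `iou` and `best_auto_mask_iou`, the "best-matching candidate" rule, and the toy masks are all hypothetical.

```python
from typing import List, Sequence

# A binary mask as rows of 0/1 values (assumed representation for this sketch).
Mask = Sequence[Sequence[int]]


def iou(pred: Mask, truth: Mask) -> float:
    """Intersection over union of two same-sized binary masks."""
    inter = 0
    union = 0
    for pred_row, truth_row in zip(pred, truth):
        for p, t in zip(pred_row, truth_row):
            inter += 1 if (p and t) else 0
            union += 1 if (p or t) else 0
    # Two empty masks agree perfectly by convention.
    return inter / union if union else 1.0


def best_auto_mask_iou(candidates: List[Mask], truth: Mask) -> float:
    """Score auto mode by its best-matching candidate mask.

    Auto mode returns many masks with no notion of which one is the
    camouflaged object, so this sketch (an assumption) credits the model
    with the candidate that overlaps the ground truth most, and 0.0 if
    it proposes no masks at all.
    """
    return max((iou(c, truth) for c in candidates), default=0.0)


# Toy 3x3 example: the object occupies the top-right 2x2 block.
truth = [[0, 1, 1], [0, 1, 1], [0, 0, 0]]

# Promptable setting: a single mask returned for a given prompt.
prompted = [[0, 1, 1], [0, 1, 0], [0, 0, 0]]

# Auto setting: several candidate masks, none targeted at the object.
auto = [
    [[1, 0, 0], [1, 0, 0], [1, 0, 0]],
    [[0, 0, 1], [0, 0, 1], [0, 0, 0]],
]

print(iou(prompted, truth))             # 0.75
print(best_auto_mask_iou(auto, truth))  # 0.5
```

In this toy case the prompted mask scores higher than the best auto-mode candidate, which is the kind of gap the comparison between the two settings is meant to expose; the numbers themselves are illustrative only.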