SAM2-Adapter: Evaluating & Adapting Segment Anything 2 in Downstream Tasks: Camouflage, Shadow, Medical Image Segmentation, and More

Tianrun Chen, Ankang Lu, Lanyun Zhu, Chaotao Ding, Chunan Yu, Deyi Ji, Zejian Li, Lingyun Sun, Papa Mao, Ying Zang
2024-08-10
Abstract: The advent of large models, also known as foundation models, has significantly transformed the AI research landscape, with models like Segment Anything (SAM) achieving notable success in diverse image segmentation scenarios. Despite its advancements, SAM encountered limitations in handling some complex low-level segmentation tasks, such as camouflaged object detection and medical image segmentation. In response, in 2023, we introduced SAM-Adapter, which demonstrated improved performance on these challenging tasks. Now, with the release of Segment Anything 2 (SAM2), a successor with an enhanced architecture and a larger training corpus, we reassess these challenges. This paper introduces SAM2-Adapter, the first adapter designed to overcome the persistent limitations observed in SAM2 and achieve new state-of-the-art (SOTA) results in specific downstream tasks, including medical image segmentation, camouflaged (concealed) object detection, and shadow detection. SAM2-Adapter builds on SAM-Adapter's strengths, offering enhanced generalizability and composability for diverse applications. We present extensive experimental results demonstrating SAM2-Adapter's effectiveness. We demonstrate this potential and encourage the research community to use the SAM2 model with our SAM2-Adapter to achieve superior segmentation outcomes. Code, pre-trained models, and data processing protocols are available at http://tianrun-chen.github.io/SAM-Adaptor/
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The paper addresses the difficulties that the Segment Anything Model (SAM) and its successor, Segment Anything 2 (SAM2), face in specific downstream tasks. Specifically:

1. **Limitations of the Base Model**: Although SAM has achieved significant success in image segmentation, it still struggles with some complex low-level segmentation tasks, such as camouflaged object detection, medical image segmentation, and shadow detection.
2. **Improvements in SAM2**: SAM2 has an enhanced architecture and was trained on a larger dataset, but its performance on the challenging tasks listed above still needs improvement.
3. **Proposing SAM2-Adapter**: To overcome these challenges, the authors propose SAM2-Adapter, a multi-adapter configuration scheme that aims to leverage the enhanced components of SAM2 to achieve new state-of-the-art (SOTA) results. Experiments validate its effectiveness on multiple tasks, including medical image segmentation, camouflaged object detection, and shadow detection.

In summary, the main goal of the paper is to improve SAM2's performance on specific downstream tasks, particularly those that the base model finds difficult to cover or adapt to, by developing SAM2-Adapter.