Iris-SAM: Iris Segmentation Using a Foundation Model

Parisa Farmanifard,Arun Ross
2024-05-31
Abstract:Iris segmentation is a critical component of an iris biometric system and it involves extracting the annular iris region from an ocular image. In this work, we develop a pixel-level iris segmentation model from a foundational model, viz., Segment Anything Model (SAM), that has been successfully used for segmenting arbitrary objects. The primary contribution of this work lies in the integration of different loss functions during the fine-tuning of SAM on ocular images. In particular, the importance of Focal Loss is borne out in the fine-tuning process since it strategically addresses the class imbalance problem (i.e., iris versus non-iris pixels). Experiments on ND-IRIS-0405, CASIA-Iris-Interval-v3, and IIT-Delhi-Iris datasets convey the efficacy of the trained model for the task of iris segmentation. For instance, on the ND-IRIS-0405 dataset, an average segmentation accuracy of 99.58% was achieved, compared to the best baseline performance of 89.75%.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
This paper primarily discusses the problem of iris segmentation using the Segment Anything Model (SAM). Iris segmentation is crucial in biometric systems and involves extracting the annular iris region from eye images. The researchers developed a pixel-level iris segmentation model by fine-tuning SAM on eye images, particularly using Focal Loss to address class imbalance. Experimental results demonstrate that this approach achieves high accuracy in iris segmentation on the ND-IRIS-0405, CASIA-Iris-Interval-v3, and IIT-Delhi-Iris datasets, such as achieving an average segmentation accuracy of 99.58% on ND-IRIS-0405, surpassing the baseline method of 89.75%. The paper points out that iris segmentation faces challenges such as lighting variations, occlusions from eyelashes or eyelids, reflections, and low resolution in long-distance captures. By fine-tuning SAM and combining different loss functions, particularly Focal Loss, the model is able to better handle these challenges. Focal Loss adjusts the model's focus on non-iris pixels strategically to address class imbalance. The study also discusses the application of the base model (e.g., SAM) in specific domains such as iris segmentation, considering factors like iris pattern complexity, lighting variations, and eyelash occlusions. By combining extensively trained base models with fine-tuning for specific tasks, the model's performance can be improved. Experimental results demonstrate that the SAM model with Focal Loss performs well in iris segmentation tasks, indicating the potential of base models in fine-grained tasks.