HaSPeR: An Image Repository for Hand Shadow Puppet Recognition

Syed Rifat Raiyan,Zibran Zarif Amio,Sabbir Ahmed
2024-08-20
Abstract:Hand shadow puppetry, also known as shadowgraphy or ombromanie, is a form of theatrical art and storytelling where hand shadows are projected onto flat surfaces to create illusions of living creatures. The skilled performers create these silhouettes by hand positioning, finger movements, and dexterous gestures to resemble shadows of animals and objects. Due to the lack of practitioners and a seismic shift in people's entertainment standards, this art form is on the verge of extinction. To facilitate its preservation and proliferate it to a wider audience, we introduce ${\rm H{\small A}SP{\small E}R}$, a novel dataset consisting of 8,340 images of hand shadow puppets across 11 classes extracted from both professional and amateur hand shadow puppeteer clips. We provide a detailed statistical analysis of the dataset and employ a range of pretrained image classification models to establish baselines. Our findings show a substantial performance superiority of traditional convolutional models over attention-based transformer architectures. We also find that lightweight models, such as MobileNetV2, suited for mobile applications and embedded devices, perform comparatively well. We surmise that such low-latency architectures can be useful in developing ombromanie teaching tools, and we create a prototype application to explore this surmission. Keeping the best-performing model InceptionV3 under the limelight, we conduct comprehensive feature-spatial, explainability, and error analyses to gain insights into its decision-making process. To the best of our knowledge, this is the first documented dataset and research endeavor to preserve this dying art for future generations, with computer vision approaches. Our code and data are publicly available.
Computer Vision and Pattern Recognition,Artificial Intelligence
What problem does this paper attempt to address?
The main problem that this paper attempts to solve is the preservation and promotion of Hand Shadow Puppetry (also known as Shadowgraphy or Ombromanie), a traditional art form. Specifically, the authors hope to protect and revive this endangered art form by creating a hand - shadow image dataset named HASPER and using computer vision technology. ### Problem Background Hand Shadow Puppetry is a traditional performing art that creates phantoms of animals or other objects by projecting shadows on a flat surface through hand movements. Due to the lack of practitioners and the change in people's entertainment methods, this art form is in danger of extinction. To help preserve and promote Hand Shadow Puppetry, researchers introduced the HASPER dataset. ### Research Objectives 1. **Create a high - quality dataset**: HASPER contains 8,340 images from professional and amateur Hand Shadow Puppetry performers, covering 11 categories. 2. **Evaluate the performance of different models**: Use a series of pre - trained image classification models (such as Inception V3, MobileNetV2, etc.) to benchmark the dataset to determine the best model. 3. **Develop teaching tools**: Based on the best - performing model (such as Inception V3), develop a lightweight application for teaching and learning Hand Shadow Puppetry. 4. **Analyze model performance**: Conduct a detailed feature space, interpretability, and error analysis of the Inception V3 model to understand its decision - making process. ### Main Contributions - **The first publicly available Hand Shadow Puppetry dataset**: HASPER is the first image dataset specifically designed for Hand Shadow Puppetry. - **Model performance evaluation**: Research shows that traditional convolutional models (such as Inception V3) outperform Transformer architectures based on the attention mechanism in the Hand Shadow Puppetry image classification task. - **Application potential**: The research results show that low - latency architectures (such as MobileNetV2) can be used to develop teaching tools suitable for mobile devices. Through these efforts, researchers hope to use AI tools to revive this little - known cinematic art form and provide a basis for future research and applications.