SCENIC: A JAX Library for Computer Vision Research and Beyond

Mostafa Dehghani,Alexey Gritsenko,Anurag Arnab,Matthias Minderer,Yi Tay
DOI: https://doi.org/10.48550/arXiv.2110.11403
2021-10-18
Computer Vision and Pattern Recognition
Abstract:Scenic is an open-source JAX library with a focus on Transformer-based models for computer vision research and beyond. The goal of this toolkit is to facilitate rapid experimentation, prototyping, and research of new vision architectures and models. Scenic supports a diverse range of vision tasks (e.g., classification, segmentation, detection)and facilitates working on multi-modal problems, along with GPU/TPU support for multi-host, multi-device large-scale training. Scenic also offers optimized implementations of state-of-the-art research models spanning a wide range of modalities. Scenic has been successfully used for numerous projects and published papers and continues serving as the library of choice for quick prototyping and publication of new research ideas.
What problem does this paper attempt to address?