Efficient Single-Image Depth Estimation on Mobile Devices, Mobile AI & AIM 2022 Challenge: Report

Andrey Ignatov,Grigory Malivenko,Radu Timofte,Łukasz Treszczotko,Xin Chang,Piotr Książek,Michał Łopuszyński,Maciej Pióro,Rafal Rudnicki,Maciej Smyl,Yujie Ma,Zhenyu Li,Zehui Chen,Jialei Xu,Xianming Liu,Junjun Jiang,XueChao Shi,Di-Fan Xu,Yanan Li,Xiaotao Wang,Lei,Ziyu Zhang,Yicheng Wang,Zilong Huang,Guozhong Luo,Gang Yu,Bin Fu,Jiaqi Li,Yiran Wang,Zihao Huang,Zhiguo Cao,Marcos V. Conde,Denis Sapozhnikov,Byeong Hyun Lee,Dong-Won Park,Seong-Min Hong,Joon‐Hee Lee,Seunggyu Lee,Se Young Chun
DOI: https://doi.org/10.1007/978-3-031-25066-8_4
2023-01-01
Abstract:Various depth estimation models are now widely used on many mobile and IoT devices for image segmentation, bokeh effect rendering, object tracking and many other mobile tasks. Thus, it is very crucial to have efficient and accurate depth estimation models that can run fast on low-power mobile chipsets. In this Mobile AI challenge, the target was to develop deep learning-based single image depth estimation solutions that can show a real-time performance on IoT platforms and smartphones. For this, the participants used a large-scale RGB-to-depth dataset that was collected with the ZED stereo camera capable to generated depth maps for objects located at up to 50 m. The runtime of all models was evaluated on the Raspberry Pi 4 platform, where the developed solutions were able to generate VGA resolution depth maps at up to 27 FPS while achieving high fidelity results. All models developed in the challenge are also compatible with any Android or Linux-based mobile devices, their detailed description is provided in this paper.
What problem does this paper attempt to address?