Abstract:The knowledge of environmental depth is essential in multiple robotics and computer vision tasks for both terrestrial and underwater scenarios. Moreover, the hardware on which this technology runs, generally IoT and embedded devices, are limited in terms of power consumption, and therefore, models with a low-energy footprint are required to be designed. Recent works aim at enabling depth perception using single RGB images on deep architectures, such as convolutional neural networks and vision transformers, which are generally unsuitable for real-time inferences on low-power embedded hardware. Moreover, such architectures are trained to estimate depth maps mainly on terrestrial scenarios due to the scarcity of underwater depth data. Purposely, we present two lightweight architectures based on optimized MobileNetV3 encoders and a specifically designed decoder to achieve fast inferences and accurate estimations over embedded devices, a feasibility study to predict depth maps over underwater scenarios, and an energy assessment to understand which is the effective energy consumption during the inference. Precisely, we propose the MobileNetV3S75 configuration to infer on the 32-bit ARM CPU and the MobileNetV3LMin for the 8-bit Edge TPU hardware. In underwater settings, the proposed design achieves comparable estimations with fast inference performances compared to state-of-the-art methods. Moreover, we statistically proved that the architecture of the models has an impact on the energy footprint in terms of Watts required by the device during the inference. Then, the proposed architectures would be considered to be a promising approach for real-time monocular depth estimation by offering the best trade-off between inference performances, estimation error and energy consumption, with the aim of improving the environment perception for underwater drones, lightweight robots and Internet of things.

Efficient Single-Image Depth Estimation on Mobile Devices, Mobile AI & AIM 2022 Challenge: Report

Efficient Single-Image Depth Estimation on Mobile Devices, Mobile AI & AIM 2022 Challenge: Report

A Robust Monocular Depth Estimation Framework Based on Light-Weight ERF-Pspnet for Day-Night Driving Scenes

MobiDepth: Real-Time Depth Estimation Using On-Device Dual Cameras.

A Depth Estimation Framework Based on Unsupervised Learning and Cross-Modal Translation

Depth Generation Network: Estimating Real World Depth From Stereo And Depth Images

HiMoDepth: Efficient Training-Free High-Resolution On-Device Depth Perception

MIPI 2022 Challenge on RGB+ToF Depth Completion: Dataset and Report

The RoboDepth Challenge: Methods and Advancements Towards Robust Depth Estimation

Anytime Stereo Image Depth Estimation on Mobile Devices

Efficient and Accurate Quantized Image Super-Resolution on Mobile NPUs, Mobile AI & AIM 2022 Challenge: Report

Depth Estimation from Monocular Images Using Dilated Convolution and Uncertainty Learning.

DELTAR: Depth Estimation from a Light-Weight ToF Sensor and RGB Image

MobileXNet: An Efficient Convolutional Neural Network for Monocular Depth Estimation

Depth Estimation of Traffic Scenes from Image Sequence Using Deep Learning.

Real-time single image depth perception in the wild with handheld devices

MIPI 2023 Challenge on RGB+ToF Depth Completion: Methods and Results

Power Efficient Video Super-Resolution on Mobile NPUs with Deep Learning, Mobile AI & AIM 2022 challenge: Report

Lightweight and Energy-Aware Monocular Depth Estimation Models for IoT Embedded Devices: Challenges and Performances in Terrestrial and Underwater Scenarios

Real-time Monocular Depth Estimation on Embedded Systems

Real-Time Video Super-Resolution on Smartphones with Deep Learning, Mobile AI 2021 Challenge: Report

Efficient Single-Image Depth Estimation on Mobile Devices, Mobile AI &amp; AIM 2022 Challenge: Report

Efficient Single-Image Depth Estimation on Mobile Devices, Mobile AI & AIM 2022 Challenge: Report

A Robust Monocular Depth Estimation Framework Based on Light-Weight ERF-Pspnet for Day-Night Driving Scenes

MobiDepth: Real-Time Depth Estimation Using On-Device Dual Cameras.

A Depth Estimation Framework Based on Unsupervised Learning and Cross-Modal Translation

Depth Generation Network: Estimating Real World Depth From Stereo And Depth Images

HiMoDepth: Efficient Training-Free High-Resolution On-Device Depth Perception

MIPI 2022 Challenge on RGB+ToF Depth Completion: Dataset and Report

The RoboDepth Challenge: Methods and Advancements Towards Robust Depth Estimation

Anytime Stereo Image Depth Estimation on Mobile Devices

Efficient and Accurate Quantized Image Super-Resolution on Mobile NPUs, Mobile AI &amp; AIM 2022 Challenge: Report

Depth Estimation from Monocular Images Using Dilated Convolution and Uncertainty Learning.

DELTAR: Depth Estimation from a Light-Weight ToF Sensor and RGB Image

MobileXNet: An Efficient Convolutional Neural Network for Monocular Depth Estimation

Depth Estimation of Traffic Scenes from Image Sequence Using Deep Learning.

Real-time single image depth perception in the wild with handheld devices

MIPI 2023 Challenge on RGB+ToF Depth Completion: Methods and Results

Power Efficient Video Super-Resolution on Mobile NPUs with Deep Learning, Mobile AI & AIM 2022 challenge: Report

Lightweight and Energy-Aware Monocular Depth Estimation Models for IoT Embedded Devices: Challenges and Performances in Terrestrial and Underwater Scenarios

Real-time Monocular Depth Estimation on Embedded Systems

Real-Time Video Super-Resolution on Smartphones with Deep Learning, Mobile AI 2021 Challenge: Report

Efficient Single-Image Depth Estimation on Mobile Devices, Mobile AI & AIM 2022 Challenge: Report

Efficient and Accurate Quantized Image Super-Resolution on Mobile NPUs, Mobile AI & AIM 2022 Challenge: Report