Monocular Depth Estimation Based on Active Learning

?

Monocular Depth Estimation Based on Active Learning

P. 78–85.

Saleh H., Goncharov D., Shadi S., Avdoshin S. M., Naixue X.

Estimating depth is a necessary task to understand and navigate the environment surrounding us. Over the years,
many active sensors have been developed to measure depth, but they are expensive and require additional space for mounting. A cheaper alternative is to estimate depth from a single RGB image taken by an ordinary monocular camera, which can be placed even inside the smartphone. However, it is a well-known problem that neural networks require huge amount of labeled data to be effectively learned. That fact serves a barrier to the further development of the monocular depth estimation. In this paper, we address this problem. We propose a novel active deep learning training framework that reduces the dataset volume ratio by adaptively selecting the most informative data for labeling that focus on the most relevant human vision features for monocular depth estimation, which help us identify the image pixels that are most relevant for depth estimation. Our methodology indicates that it is possible to reduce the amount of labeled training data by 81% and at the same time preserve the comparable accuracy on the KITTI Odometry dataset.

Language: English

Full text

Text on another site

In book

Proceedings 2026 IEEE 11th International Conference on Smart Cloud SmartCloud 2026 8-10 May 2026

Los Alamitos: IEEE Computer Society, 2026.

Object Localization Based on a Single RGB Camera for a 4-DOF Robotic Arm

Chebotareva E., Mukhamedshin A., Imamov N. et al., , in: 2025 11th International Conference on Automation, Robotics, and Applications (ICARA), 12-14 Feb. 2025.: IEEE, 2025. Ch. 2025 P. 252–256.

Added: March 17, 2026

Pose Networks Unveiled: Bridging the Gap for Monocular Depth Perception

Dayoub Y., Andrey V. Savchenko, Makarov I., , in: 2024 IEEE International Symposium on Mixed and Augmented Reality Adjunct (ISMAR-Adjunct).: IEEE, 2024. P. 584–587.

Depth estimation is essential in Augmented Reality applications, enabling realistic object placement, scene understanding, spatial mapping, interaction, and environment awareness. This paper proposes a method to enhance depth model performance without increasing inference costs by improving the pose network in a selfsupervised learning setup. In particular, we enrich spatial information in the pose network by ...

Added: December 3, 2024

Inpainting Semantic and Depth Features to Improve Visual Place Recognition in the Wild

Semenkov I., Karpov A., Savchenko A. et al., IEEE Access 2024 Vol. 12 P. 5163–5176

Visual place recognition is one of the core modern computer vision tasks concerned with identifying location based on the image taken there. Modern state-of-the-art approaches heavily rely on RGB images which are largely affected by changes in the same scene such as varying daytime, illumination, seasonal changes, and presence of dynamic objects (people, vehicles). This ...

Added: March 15, 2024

Efficient Monocular Depth Estimation for Edge Computing Platforms

Saleh S., Saleh H., Dmitry Goncharov et al., , in: 2023 International Symposium ELMAR, 11-13 September 2023, Zadar, Croatia.: IEEE, 2023. P. 23–27.

Estimating depth is necessary to understand and navigate the environment surrounding us. Over the years, many active sensors have been developed to measure depth, but they are expensive and require additional space for mounting. A cheaper alternative is estimating depth from a single RGB image taken by an ordinary monocular camera, which can be placed ...

Added: January 26, 2024