3-D Depth Reconstruction from a Single Still Image |
| |
Authors: | Ashutosh Saxena Sung H Chung Andrew Y Ng |
| |
Affiliation: | (1) Computer Science Department, Stanford University, Stanford, CA 94305, USA |
| |
Abstract: | We consider the task of 3-d depth estimation from a single still image. We take a supervised learning approach to this problem,
in which we begin by collecting a training set of monocular images (of unstructured indoor and outdoor environments which
include forests, sidewalks, trees, buildings, etc.) and their corresponding ground-truth depthmaps. Then, we apply supervised
learning to predict the value of the depthmap as a function of the image. Depth estimation is a challenging problem, since
local features alone are insufficient to estimate depth at a point, and one needs to consider the global context of the image.
Our model uses a hierarchical, multiscale Markov Random Field (MRF) that incorporates multiscale local- and global-image features,
and models the depths and the relation between depths at different points in the image. We show that, even on unstructured
scenes, our algorithm is frequently able to recover fairly accurate depthmaps. We further propose a model that incorporates
both monocular cues and stereo (triangulation) cues, to obtain significantly more accurate depth estimates than is possible
using either monocular or stereo cues alone. |
| |
Keywords: | Monocular vision Learning depth 3D reconstruction Dense reconstruction Markov random field Depth estimation Monocular depth Stereo vision Hand-held camera Visual modeling |
本文献已被 SpringerLink 等数据库收录! |
|