Monocular image depth estimation using dilated convolution and spatial pyramid polling structure

2019 
In this work, we address the problem of depth estimation from a single image. This is a challenging task because a single still image on its own does not give much depth cue, while recent advances in CNNs have made learning and predicting depth from a single image possible. We propose a new residual convolutional neural network (CNN) with dilated convolution and spatial pyramid pooling (SPP) structure to model the ambiguous mapping from a monocular 2D image to its depth map. The advantages of our method come from the use of dilated convolution and multi spatial scale information. Compared with existing deep CNN based methods, our method achieves much better results in indoor and outdoor scenarios.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    1
    References
    0
    Citations
    NaN
    KQI
    []