GCCN: Geometric Constraint Co-attention Network for 6D Object Pose Estimation

2021 
In 6D object pose estimation task, object models are usually available and represented as the point cloud set in canonical object frame, which are important references for estimating object poses to the camera frame. However, directly introducing object models as the prior knowledge (i.e., object model point cloud) will cause potential perturbations and even degenerate pose estimation performance. To make the most of object model priors and eliminate the problem, we present an end-to-end deep learning approach called the Geometric Constraint Co-attention Network (GCCN) for 6D object pose estimation. GCCN is designed to explicitly leverage the object model priors effectively with the co-attention mechanism. We add explicit geometric constraints to a co-attention module to inform the geometric correspondence relationships between points in the scene and object model priors and develop a novel geometric constraint loss to guide the training. In this manner, our method effectively eliminates the side effect of directly introducing the object model priors into the network. Experiments on the YCB-Video and LineMOD datasets demonstrate that our GCCN substantially improves the performance of pose estimation and is robust against heavy occlusions. We also demonstrate that GCCN is accurate and robust enough to be deployed in real-world robotic tasks.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    15
    References
    0
    Citations
    NaN
    KQI
    []