Pyramidal region context module for semantic segmentation

2019 
Context modeling is widely exploited to enhance semantic correlation in semantic segmentation task. Recent approaches (e.g., OCNet, CCNet and DANet) apply non-local type of network to capture the context information. However, they are not accurate enough for handling scale-varying objects due to that they consider very little local dependencies of the adjacent pixels. In this work, we address the complex scene segmentation problem by combining region dependencies and global contextual information. Motivated by the fact that scale of objects largely varies on images, we design the Pyramidal Region Context Module(PRCM) to handle the neighbor relationship of multi-scale regions. In addition, we adopt a depth-to-space layer(PixelShuffle) to form the Scale Transfer Classifier (STC). Based on the two newly proposed modules, we introduce an end-to-end segmentation network - Pyramidal Region Network(PRNet). We empirically demonstrate the effectiveness of our approach on Cityscapes dataset, the results have shown impressive improvement compared with baselines. Notably, PRNet obtains mean IoU of 81.3 on test set of Cityscapes.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    37
    References
    0
    Citations
    NaN
    KQI
    []