A fast X-shaped foreground segmentation network with CompactASPP

2021 
Abstract Foreground segmentation models are designed to extract moving objects of varying sizes from the background, which can benefit from representations of various scales. As an effective module for capturing multi-scale contexts, Atrous Spatial Pyramid Pooling (ASPP) convolves a final feature representation via multiple parallel atrous convolutions with different dilation rates. However, as the dilation rate increases, ASPP gradually loses its large-scale modeling ability because the sampling of atrous kernel becomes progressively sparse within the receptive field. To solve this problem, we design a CompactASPP module to convolve feature maps compactly. Without significantly increasing the module size, the CompactASPP can encode multi-scale features from all neurons within the receptive field rather than from neurons in several sparsely distributed positions. Furthermore, we leverage CompactASPP modules to enhance our previous X-Net. The proposed Fast X-Net substantially improves the segmentation speed by over 63.6% and attains new state-of-the-art performances on CDnet2014, SBI2015 and UCSD benchmarks.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    40
    References
    1
    Citations
    NaN
    KQI
    []