ST-Tracking: Spatial-temporal Graph Convolution Neural Network for Multi-object Tracking

2021 
Multi-Object Tracking (MOT) which aims to estimate trajectories for objects of interest in videos, is a crucial component in intelligent transportation systems and has gained considerable attention from both academia and industry. However, existing MOT approaches apply simple motion and appearance features, ignoring the spatial-temporal correlations of interactions with other objects. To fill this gap, this paper proposes a novel MOT framework ST-Tracking to extract spatial-temporal features for tracking objects, which is trained in an end-to-end manner. Specifically, a directed graph is first constructed to represent the spatial-temporal correlations, where the node attributes include both geometric and appearance features, and the edges are deliberately defined to embrace location and topology cues. Then, a spatial-temporal graph convolutional module is proposed to aggregate information from both spatial and temporal dimensions, thus extracting discriminative features. Extensive experiments on the challenging KITTI benchmark demonstrate the effectiveness and superior of ST-Tracking as compared with several state-of-the-art methods.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    36
    References
    0
    Citations
    NaN
    KQI
    []