VDetor: An Effective and Efficient Neural Network for Vehicle Detection in Aerial Image

2019 
Vehicle detection in aerial images is the foundation of applications such as traffic management and parking lot utilization analysis. Recently, general-purpose object detection methods based on convolutional neural networks (CNNs) have achieved state-of-the-art performance, mainly because CNNs extract more effective features than the handcrafted features used in earlier mainstream methods. Extracting effective features is crucial for vehicle detection in aerial images, where the vehicles are small and the background is complicated. As a result, CNN-based methods have been used to detect vehicles in aerial images. However, performance may be poor when these general-purpose methods are applied directly. First, most existing methods detect vehicles with horizontal bounding boxes, which do not match vehicles in aerial images that appear at arbitrary orientations and with multiple aspect ratios. This mismatch directly harms detection accuracy. In addition, these methods are mostly computationally expensive, so they are not suitable for platforms with limited computational resources, such as unmanned aerial vehicles. To address these problems, we propose a vehicle detection network, VDetor, which introduces rotated bounding box regression and a lightweight network into SSD. Specifically, we use rotated bounding boxes rather than horizontal ones to fit the vehicles more closely, and we use a lightweight network, PeleeNet, as the backbone of SSD to speed up inference. Experiments on the VEDAI dataset show that our model outperforms SSD in both detection accuracy and detection speed.
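The mismatch between horizontal boxes and rotated vehicles can be illustrated with a small geometric sketch. The abstract does not say how VDetor encodes a rotated box, so the (cx, cy, w, h, theta) parameterization, the function names, and the 40 x 20 pixel vehicle size below are assumptions chosen only for illustration. The sketch computes how much of the tightest horizontal box is actually covered by a rotated vehicle-sized rectangle.

```python
import math

def rotated_box_corners(cx, cy, w, h, theta):
    """Corners of a w x h box centered at (cx, cy), rotated by theta radians.

    The (cx, cy, w, h, theta) encoding is an assumed parameterization,
    not necessarily the one used by VDetor.
    """
    c, s = math.cos(theta), math.sin(theta)
    half = [(-w / 2, -h / 2), (w / 2, -h / 2), (w / 2, h / 2), (-w / 2, h / 2)]
    # Rotate each corner offset, then translate to the box center.
    return [(cx + x * c - y * s, cy + x * s + y * c) for x, y in half]

def enclosing_horizontal_box(corners):
    """Tightest axis-aligned box (x_min, y_min, x_max, y_max) around the corners."""
    xs = [p[0] for p in corners]
    ys = [p[1] for p in corners]
    return min(xs), min(ys), max(xs), max(ys)

def fill_ratio(w, h, theta):
    """Fraction of the enclosing horizontal box covered by the rotated box."""
    corners = rotated_box_corners(0.0, 0.0, w, h, theta)
    x0, y0, x1, y1 = enclosing_horizontal_box(corners)
    return (w * h) / ((x1 - x0) * (y1 - y0))

if __name__ == "__main__":
    # Hypothetical 40 x 20 pixel vehicle at several orientations.
    for degrees in (0, 15, 30, 45):
        ratio = fill_ratio(40.0, 20.0, math.radians(degrees))
        print(f"{degrees:>2} deg: vehicle covers {ratio:.2f} of the horizontal box")
```

For an axis-aligned vehicle the ratio is 1, while at 45 degrees the hypothetical vehicle covers less than half of its horizontal box; the remainder is background that a horizontal-box detector has to include, which is the mismatch that rotated boxes avoid.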