Large-scale building extraction in very high-resolution aerial imagery using Mask R-CNN

2019 
Urban areas are hotspots of complex and dynamic alterations of the Earth’s surface. Using deep learning (DL) techniques in remote sensing applications can significantly contribute to document these tremendous changes. Open source building data at a very high level of detail are still scarce or incomplete for many regions, therefore, hindering research and policy to properly provide knowledge on urban structures. In this study, we use a convolutional neural network to extract buildings for the city of Santiago de Chile. We deploy the recently released Mask R-CNN and use a pretrained model (PM) which already has been trained with remote sensing imagery. We fine-tune PM with very high-resolution (VHR) airborne RGB images from our study region and generate the fine-tuned model (FM). To extend the number of training data, we test several data augmentation methods for training purposes and evaluate their performance in context of urban environments. We achieve highest overall accuracy of 92 % by using augmentations and the generated FM. Our findings encourage to use DL methods in the urban context. The presented method can be adapted and applied to other global urban regions, and, help to overcome lacks in open source building data to assess urban environments.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    15
    References
    8
    Citations
    NaN
    KQI
    []