Open world plant image identification based on convolutional neural network

2016 
In this paper, we propose several enhancements to the well-known 16-layer VGG Convolutional Neural Network (CNN) model for open-world image classification, taking plant identification as an example. First, we replace the last pooling layer of the 16-layer VGG model with a Spatial Pyramid Pooling layer, enabling the model to accept input images of arbitrary size. Second, we replace the Rectified Linear Unit (ReLU) activation function with Parametric ReLU, whose negative-side slope is learned, to increase the adaptability of parameter learning. In addition, we introduce the Unseen Category Query Identification algorithm to identify and omit images of unseen categories, thus preventing their false classification into predefined categories. Such an algorithm is essential in real-world use, since there is no guarantee that a given image falls into a predefined category. We use the dataset from the LifeCLEF 2016 plant identification task. We compare our results with those of other participants and demonstrate that our enhanced model with the proposed algorithm exhibits outstanding performance.
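
The two architectural changes named in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the pyramid levels `(1, 2, 4)`, the fixed slope `alpha=0.25`, and the function names `prelu` and `spatial_pyramid_pool` are assumptions chosen for illustration, and in a trained network the PReLU slope would be learned rather than fixed. The sketch shows why Spatial Pyramid Pooling yields a fixed-length vector regardless of the spatial size of the feature map.

```python
import numpy as np


def prelu(x, alpha):
    """Parametric ReLU: identity for positive inputs, slope `alpha` otherwise.

    `alpha` is a learned parameter in a real network; here it is passed in
    as a fixed value (an assumption for illustration).
    """
    return np.where(x > 0, x, alpha * x)


def spatial_pyramid_pool(feature_map, levels=(1, 2, 4)):
    """Max-pool a (C, H, W) feature map into a vector of fixed length.

    For each pyramid level n, the map is split into an n x n grid and each
    cell is max-pooled per channel. The output length is C * sum(n * n),
    which does not depend on H or W. The levels are hypothetical defaults.
    """
    _, h, w = feature_map.shape
    pooled = []
    for n in levels:
        for i in range(n):
            # Floor for the start and ceil for the end, so cells cover the
            # whole map and are never empty.
            r0 = (i * h) // n
            r1 = -(-((i + 1) * h) // n)
            for j in range(n):
                c0 = (j * w) // n
                c1 = -(-((j + 1) * w) // n)
                cell = feature_map[:, r0:r1, c0:c1]
                pooled.append(cell.max(axis=(1, 2)))
    return np.concatenate(pooled)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    small = prelu(rng.standard_normal((3, 7, 5)), alpha=0.25)
    large = prelu(rng.standard_normal((3, 13, 9)), alpha=0.25)
    # Both feature maps produce vectors of the same length: 3 * (1 + 4 + 16).
    print(spatial_pyramid_pool(small).shape, spatial_pyramid_pool(large).shape)
```

Because the pooled vector has the same length for every input size, the fully connected layers that follow can keep a fixed input dimension while the convolutional layers process images of arbitrary size.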