A Research on Image Captioning by Different Encoder Networks

2020 
Many current research issues of image captioning focus on modifying the CNN (Convolutional Neural Network) or RNN (Recurrent Neural Network), while supplementing the attention mechanism to enhance the long-term memory ability of the RNN. However, the relationship with input data and CNN model could be another important point. This paper defines the image complexity to enhance model's accuracy. After analyzing the data set, some criteria of the image complexity are defined according to the image grayscale entropy and the two-dimensional entropy for image Captioning. In this paper, a new model is setup to compare with the other model. Although the result is better than the other model by a revised bilingual evaluation understudy (R-BLEU) evaluation index which is a new calculation formula to evaluate image captioning performance.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    13
    References
    0
    Citations
    NaN
    KQI
    []