A Study of General Data Improvement for Large-Angle Head Pose Estimation

2021 
Predicting Euler angles of head pose using end-to-end CNN from a single RGB image is a popular application in recent years. However, the existing methods ignored the information about the rotation order contained in the Euler angles, always following the traditional pitch-yaw-roll order. They also neglected the error sources from outlier samples with large-angle poses. We analyzed current shortcomings and made suggestions for improvement from the perspective of data distribution. We studied the influence of different rotation orders on the data distribution and showed choosing an appropriate rotation order to learn head pose can significantly optimize the data distribution and improve the prediction accuracy. Then a data enhancement method was proposed to increase the large-angle poses by rotating the 2D images randomly and solving the corresponding head poses, which can improve network performance on the large-angle poses. Evaluated on two popular networks and different datasets, our methods were proved to be effective and general.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    16
    References
    0
    Citations
    NaN
    KQI
    []