DS-YOLOv5: Deformable and Scalable YOLOv5 for Mathematical Formula Detection in Scientific Documents

2021 
Mathematical formula detection (MFD) is a prerequisite step for the digitization of scientific documents. The MFD task has two key challenges, i.e. a large scale span between embedded formula and isolated formula, and a huge variation of the ratio between height and width. However, the detection accuracy of the most existing approaches rely on page segmentation still needs improvement due to the errors of complex documents. In this work, to solve the important problem of scale variation, we aim to assess the performance of a multi-scaled deformable method for the MFD task based on deformable convolution, image representation, and YOLOv5 detector. For the experimental study, the proposed method has been evaluated on the Marmot dataset, which is an existing benchmark. In our evaluation, the experimental results show that the proposed method outperforms previous methods on the Marmot dataset by a large margin. Moreover, we accomplished correct detection accuracy of 82.42% on embedded formulas and 90.69% on isolated formulas on the Marmot dataset, which results in a significant error reduction.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    19
    References
    0
    Citations
    NaN
    KQI
    []