Hierarchical clustering model for pixel-based classification of document images

Rémi Vieux,Jean-Philippe Domenger

Hierarchical clustering model for pixel-based classification of document images

2012

Rémi Vieux
Jean-Philippe Domenger

We propose a method to learn and classify pixels in document images, e.g., to separate text from illustrations or other predefined classes. We extract texture information using a bank of Gabor filters, and learn a hierarchical clustering model that can be used as a K-Nearest Neighbours (KNN) classifier. The model has advantages over other local document image classification methods, making it efficient for real industrial applications: we do not rely on the accuracy of preprocessing steps such as binarisation or segmentation, the model can be efficiently trained using zone level annotations and it seamlessly supports multi-class classification. We demonstrate the performance of the method on a public dataset containing complex documents from magazines and technical journals.

Keywords:

Artificial intelligence
Computer vision
Hierarchical clustering
Image texture
Preprocessor
Contextual image classification
Pattern recognition
Feature extraction
Computer science
Pixel
Classifier (linguistics)
Segmentation

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations