FROM PIXELS TO TRUE XML STRUCTURES IN DIGITAL DOCUMENT IMAGES

2004 
XML is widely used as metadata for image retrieval. As a standard, it makes information easier to index and retrieve across different platforms. However, automatically converting an image into XML remains a challenge. This paper presents a system for generating structured XML documents from digitally captured document images. The system aims to provide an easy-to-use tool for average users, without requiring in-depth knowledge of document processing. Further, an XML/XSL generator is developed to represent a document accurately as an XML structure while still reflecting its original layout.
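To make the idea of a layout-preserving XML representation concrete, the following is a minimal sketch and not the paper's actual generator. It assumes a hypothetical upstream step has already segmented the page into blocks, each with a type, a bounding box in pixels, and recognized text. The block fields (`type`, `x`, `y`, `w`, `h`, `text`), the element names, and the function `blocks_to_xml` are all illustrative assumptions.

```python
import xml.etree.ElementTree as ET


def blocks_to_xml(blocks):
    """Build a <document> element from hypothetical layout blocks.

    Each block is a dict with keys: type, x, y, w, h, text.
    The schema here is an assumption made for illustration only.
    """
    root = ET.Element("document")
    # Order blocks top-to-bottom, then left-to-right, as a simple reading order.
    for block in sorted(blocks, key=lambda b: (b["y"], b["x"])):
        # Keep the bounding box as attributes so a stylesheet could
        # later reproduce the original placement on the page.
        attrs = {k: str(block[k]) for k in ("x", "y", "w", "h")}
        element = ET.SubElement(root, block["type"], attrs)
        element.text = block["text"]
    return root


# Hypothetical blocks, deliberately listed out of reading order.
sample_blocks = [
    {"type": "paragraph", "x": 40, "y": 120, "w": 500, "h": 60, "text": "Body text."},
    {"type": "title", "x": 40, "y": 30, "w": 500, "h": 40, "text": "A Heading"},
]

document = blocks_to_xml(sample_blocks)
xml_string = ET.tostring(document, encoding="unicode")
print(xml_string)
```

Storing coordinates as attributes keeps the logical structure (element names and order) separate from the layout, which is the kind of split an XSL stylesheet could use to render the document close to its original appearance.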