Semantic indexing for XML documents using RDBMS

2015 
Indexing is a common technique used by search engines for fast and efficient search and retrieval, and XML search engines are no different. However, these search engines treat an XML file as a single unit, ignoring the fact that an XML document contains records in the form of semi-structured data. The hierarchical structure of XML implies a parent/child relationship, which makes two tags semantically related if they share the same parent. Indexing a document as a whole results in low precision. This paper proposes an indexing scheme that preserves the parent/child relationship information using the document structure. This information is then used to identify the semantic relations between items. Semantic-based search and retrieval on XML documents can provide more accurate results. The paper uses an RDBMS to store these indices in the form of tables. To check the accuracy of the proposed scheme, a case study is also performed using two queries on a sample XML file that share semantically related data.
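The idea of keeping parent/child information in a relational table can be illustrated with a minimal sketch. It is not the paper's actual schema: the table name `node_index`, its columns, the helper functions `build_index` and `siblings_of`, and the sample XML are all assumptions made for illustration, and SQLite stands in for the RDBMS.

```python
import sqlite3
import xml.etree.ElementTree as ET

# Hypothetical sample document; not taken from the paper.
SAMPLE_XML = """
<library>
  <book>
    <title>Dune</title>
    <author>Herbert</author>
  </book>
  <book>
    <title>Emma</title>
    <author>Austen</author>
  </book>
</library>
"""


def build_index(xml_text, conn):
    """Store one row per element, recording the id of its parent element."""
    conn.execute(
        "CREATE TABLE node_index ("
        "node_id INTEGER PRIMARY KEY, parent_id INTEGER, tag TEXT, value TEXT)"
    )
    root = ET.fromstring(xml_text)
    next_id = 0
    stack = [(root, None)]  # (element, id of its parent row); root has no parent
    while stack:
        elem, parent_id = stack.pop()
        next_id += 1
        node_id = next_id
        text = (elem.text or "").strip()
        conn.execute(
            "INSERT INTO node_index VALUES (?, ?, ?, ?)",
            (node_id, parent_id, elem.tag, text),
        )
        for child in elem:
            stack.append((child, node_id))
    conn.commit()


def siblings_of(conn, tag, value, wanted_tag):
    """Return values of `wanted_tag` elements sharing a parent with tag=value."""
    rows = conn.execute(
        "SELECT b.value FROM node_index a "
        "JOIN node_index b ON a.parent_id = b.parent_id "
        "WHERE a.tag = ? AND a.value = ? AND b.tag = ? "
        "AND a.node_id != b.node_id "
        "ORDER BY b.node_id",
        (tag, value, wanted_tag),
    ).fetchall()
    return [row[0] for row in rows]


conn = sqlite3.connect(":memory:")
build_index(SAMPLE_XML, conn)
print(siblings_of(conn, "title", "Dune", "author"))  # prints ['Herbert']
```

Because each row carries its parent's id, a self-join on `parent_id` returns only items that belong to the same record, whereas a whole-document index would match both authors for a query on either title.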