A Domain-Specific Web Document Re-ranking Algorithm

2017 
To build a domain-specific knowledge hub for learning on the web, the web resources crawled by generic search engines need to be sifted and sorted before use. We propose a re-ranking algorithm that identifies highly domain-relevant web data to feed into the domain knowledge learning hub. The algorithm studies the structure and semantics of the domain ontology (a graph) and constructs computational relations among its nodes. By mining the terms that match between the ontology dictionary and the textual content (text and metadata) of the documents retrieved by reputable web search engines, we calculate a three-dimensional information score (distance, direction, and attributes) for each document, and then re-rank the retrieved documents to provide learners with more meaningful knowledge in their chosen domain.
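The pipeline described above can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not define how the distance, direction, and attribute scores are computed or combined, so the toy ontology, the focus concept, the scoring formulas, and the equal weighting below are all assumptions made for illustration, and every function name is hypothetical.

```python
import re
from collections import deque

# Hypothetical toy ontology: parent concept -> child concepts.
ONTOLOGY = {
    "machine learning": ["supervised learning", "unsupervised learning"],
    "supervised learning": ["classification", "regression"],
    "unsupervised learning": ["clustering"],
}
# Hypothetical attribute terms attached to ontology nodes.
ATTRIBUTES = {
    "classification": ["label", "accuracy"],
    "clustering": ["centroid"],
}


def all_terms():
    """Ontology dictionary: every concept that appears as a parent or child."""
    terms = set(ONTOLOGY)
    for children in ONTOLOGY.values():
        terms.update(children)
    return terms


def neighbors(node):
    """Undirected neighbours of a node: its children plus its parents."""
    parents = [p for p, children in ONTOLOGY.items() if node in children]
    return list(ONTOLOGY.get(node, [])) + parents


def hop_distance(src, dst):
    """Breadth-first hop count between two nodes, or None if unreachable."""
    seen, queue = {src}, deque([(src, 0)])
    while queue:
        node, hops = queue.popleft()
        if node == dst:
            return hops
        for nxt in neighbors(node):
            if nxt not in seen:
                seen.add(nxt)
                queue.append((nxt, hops + 1))
    return None


def is_descendant(node, ancestor):
    """True if `node` lies strictly below `ancestor` in the ontology."""
    stack = list(ONTOLOGY.get(ancestor, []))
    while stack:
        current = stack.pop()
        if current == node:
            return True
        stack.extend(ONTOLOGY.get(current, []))
    return False


def score_document(text, focus):
    """Sum assumed distance, direction, and attribute scores for one document."""
    text = text.lower()
    # Whole-phrase matching, so "supervised" does not match "unsupervised".
    matched = [t for t in all_terms()
               if re.search(r"\b" + re.escape(t) + r"\b", text)]
    distance = direction = attributes = 0.0
    for term in matched:
        hops = hop_distance(focus, term)
        if hops is None:
            continue
        distance += 1.0 / (1 + hops)            # assumed: closer terms score higher
        if term == focus or is_descendant(term, focus):
            direction += 1.0                    # assumed: reward the focus subtree
        attributes += sum(a in text for a in ATTRIBUTES.get(term, []))
    return distance + direction + attributes    # assumed: equal weights


def rerank(documents, focus):
    """Return documents sorted by descending score (stable for ties)."""
    return sorted(documents, key=lambda d: score_document(d, focus), reverse=True)
```

With "supervised learning" as the focus concept, a document that mentions classification, labels, and accuracy would rank above one about clustering, and both would rank above a document with no ontology matches. A real system would replace the toy dictionaries with a parsed domain ontology and tune the weights of the three scores.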