Office Document Search Engine

2019 
Search engines are widely used to find documents for many purposes. One frequently used kind of document is the office document. Office documents are classified as semi-structured because documents within the same category often share a consistent structure. Office documents also come in various categories and formats. Building a search engine requires implementing two main processes: the indexing process and the query process. Each process consists of several methods, each with its own function. Not every method is suitable for these processes, so suitable methods must be selected in order to produce an optimal search engine for a specific, defined domain. This paper explains how to recognize the patterns of office documents that are used to build a search engine. It also explains the selection of the methods used to build an optimal search engine for the office document domain. The search engine is evaluated with several testing scenarios to calculate its precision for a set of queries and to determine how optimal it is. The proposed search engine focuses more on producing effective results than on processing efficiency. However, the evaluation still covers both the effectiveness and the efficiency of the system.
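
The abstract does not say which indexing, query, or evaluation methods the paper selects, so the following is only a generic sketch of the two processes and of a precision calculation, not the authors' implementation. It assumes a plain inverted index with whitespace tokenization and AND-style matching. The function names (`build_index`, `search`, `precision`) and the sample documents are hypothetical and chosen for illustration.

```python
from collections import defaultdict


def build_index(docs):
    """Indexing process: map each lowercased term to the ids of documents containing it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for term in text.lower().split():
            index[term].add(doc_id)
    return index


def search(index, query):
    """Query process: return the ids of documents that contain every query term."""
    terms = query.lower().split()
    if not terms:
        return set()
    result = set(index.get(terms[0], set()))
    for term in terms[1:]:
        result &= index.get(term, set())
    return result


def precision(retrieved, relevant):
    """Fraction of retrieved documents that are relevant (0.0 if nothing was retrieved)."""
    if not retrieved:
        return 0.0
    return len(retrieved & relevant) / len(retrieved)


# Hypothetical office documents, invented for this sketch.
docs = {
    "memo-1": "meeting schedule for budget review",
    "letter-2": "invitation letter for annual meeting",
    "report-3": "budget report for the first quarter",
}
index = build_index(docs)
hits = search(index, "budget meeting")
print(hits, precision(hits, {"memo-1"}))
```

A real system for semi-structured office documents would likely index fields recognized from the document pattern separately and rank results rather than return an unordered set; this sketch leaves both out.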