Topic Level Disambiguation for Weak Queries

Despite limited success, today’s information retrieval (IR) systems are not intelligent or reliable. IR systems return poor search results when users formulate their information needs into incomplete or ambiguous queries (i.e., weak queries). Therefore, one of the main challenges in modern IR research is to provide consistent results across all queries by improving the performance on weak queries. However, existing IR approaches such as query expansion are not overly effective because they make little effort to analyze and exploit the meanings of the queries. Furthermore, word sense disambiguation approaches, which rely on textual context, are ineffective against weak queries that are typically short. Motivated by the demand for a robust IR system that can consistently provide highly accurate results, the proposed study implemented a novel topic detection method that leveraged both the language model and structural knowledge of Wikipedia and systematically evaluated the effect of query disambiguation and topic-based retrieval approaches on TREC collections. The results not only confirm the effectiveness of the proposed topic detection and topic-based retrieval approaches but also demonstrate that query disambiguation does not improve IR as expected.