Exploring Influence of Topic Segmentation on Information Retrieval Quality

2018 
In the present paper we address the issue of how an information retrieval system might be improved via text segmentation and to what extent. We assume that topic text segmentation allows one to better model text structure and therefore language itself, which influences the quality of text representation. We propose a search pipeline based on text segmentation by means of BigARTM tool and TopicTiling algorithm. We test the initial hypothesis by conducting experiments with several baseline models on two textual collections. The results are rather contradictory: while one collection showed that segmentation does improve the quality of retrieval, the other one demonstrated that segmentation does not influence the quality significantly.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    17
    References
    2
    Citations
    NaN
    KQI
    []