Method of Similar Textual Content Selection Based on Thematic Information Retrieval

2019 
This paper describes a software tool called Text Search, which identifies a set of similar content based on a thematic analysis of text. It works with English fiction texts. The input is a user's query in English, given as a text; the query undergoes thematic analysis, and the result is compared with the thematic analyses of the texts in the database. The program serves as a database application that stores, processes, and returns information. The interface is implemented in C# with Windows Forms components, and the database uses Transact-SQL, an extension of SQL developed by Sybase and Microsoft that adds procedural functionality. Modeling an information system is an important step: it makes it possible to assess the scale of the system and to examine all of its processes, the data it uses, and its communications. Using several types of diagrams, the paper identifies the main purpose of the system: identifying a set of content based on a thematic analysis of similar texts. It presents a hierarchical sequence of intermediate objectives that lead to the main goal, shows the main processes of the information system and the data flows they use, and shows the sub-processes at the second level of decomposition. In addition, a tree-like process hierarchy is constructed, showing the main processes, the tasks that arise during their implementation, and the relationships among them.
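The abstract does not say how the thematic analysis or the comparison is computed, so the following is only an illustrative sketch of the general idea of matching a query against stored texts by theme. It is written in Python rather than the C# and Transact-SQL used by Text Search, and everything in it is an assumption: the `THEMES` keyword lexicon, the keyword-count profile, the cosine similarity measure, and the function names are hypothetical and are not taken from the paper.

```python
import math
import re
from collections import Counter

# Hypothetical theme lexicon; the paper does not list its themes or keywords.
THEMES = {
    "sea": {"ship", "sea", "captain", "sail", "harbor"},
    "war": {"war", "soldier", "battle", "army", "sword"},
    "love": {"love", "heart", "kiss", "wedding", "beloved"},
}


def theme_profile(text):
    """Count how many words of the text belong to each theme's keyword set."""
    words = re.findall(r"[a-z']+", text.lower())
    counts = Counter()
    for word in words:
        for theme, keywords in THEMES.items():
            if word in keywords:
                counts[theme] += 1
    return counts


def cosine(a, b):
    """Cosine similarity of two theme profiles; 0.0 if either is empty."""
    dot = sum(a[t] * b[t] for t in THEMES)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_similar(query, stored_texts):
    """Return (title, score) pairs, most thematically similar first."""
    query_profile = theme_profile(query)
    scored = [
        (title, cosine(query_profile, theme_profile(body)))
        for title, body in stored_texts.items()
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Stand-in for the database of fiction texts (assumed sample data).
stored = {
    "voyage": "The captain watched the sea as the ship left the harbor.",
    "siege": "Every soldier in the army knew the battle would decide the war.",
}
ranking = rank_similar("A ship and its captain sail across the sea.", stored)
```

Here `ranking` lists the stored titles ordered by thematic closeness to the query. In a system like the one described, the stored profiles would presumably be kept in the database rather than recomputed for each query, but the abstract does not confirm this.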