Detection and filtering method for network community garbage information based on topic consensus coverage rate

2013 
The invention provides a method for detecting and filtering network community garbage information based on a topic consensus coverage rate. It belongs to the field of data quality research and relates to the analysis of user behavior features, the evaluation of network information quality, feature value extraction from text content, and the building and optimization of text classification models. To address the lack of an effective automatic detection and filtering mechanism for network community garbage information, the method builds a garbage information detection model: a topic convergence constraint relationship is constructed from the main topic content and the normal reply content, a feature value called the topic consensus coverage rate is proposed and applied to a text classifier, and garbage information in the network community is thereby detected and filtered automatically. The method can be widely applied to content screening in network community quality management, where it automatically identifies and removes irrelevant advertisements, invalid content, and even malicious opinions, improving network community information quality to a certain degree.
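The abstract does not give a formula for the topic consensus coverage rate, so the following is only a minimal sketch under stated assumptions. It assumes the feature can be approximated as the fraction of a reply's tokens that fall inside a vocabulary built from the main topic post and known-normal replies. The function names, the simple tokenizer, and the fixed threshold are all hypothetical; the threshold in particular stands in for the text classifier that the invention actually uses.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase alphanumeric tokenizer (illustrative only; a real system
    would use a proper word segmenter for the community's language)."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_consensus_vocabulary(topic_post, normal_replies, min_count=1):
    """Collect terms from the main topic post and normal replies.

    Assumption: the 'topic consensus' is approximated by the set of terms
    that occur at least `min_count` times across these texts.
    """
    counts = Counter(tokenize(topic_post))
    for reply in normal_replies:
        counts.update(tokenize(reply))
    return {term for term, n in counts.items() if n >= min_count}


def coverage_rate(reply, vocabulary):
    """Fraction of the reply's tokens covered by the consensus vocabulary.

    Returns 0.0 for an empty reply.
    """
    tokens = tokenize(reply)
    if not tokens:
        return 0.0
    covered = sum(1 for t in tokens if t in vocabulary)
    return covered / len(tokens)


def is_garbage(reply, vocabulary, threshold=0.3):
    """Hypothetical decision rule: flag replies with low coverage.

    The source feeds the feature into a text classifier; this fixed
    threshold is a stand-in for that classifier.
    """
    return coverage_rate(reply, vocabulary) < threshold


topic = "Best lens for landscape photography"
normal = [
    "I use a wide angle lens for landscape shots",
    "A tripod helps landscape photography a lot",
]
vocab = build_consensus_vocabulary(topic, normal)

on_topic = coverage_rate("wide angle lens for landscape", vocab)
off_topic = coverage_rate("buy cheap watches online now", vocab)
```

In practice the coverage value would more likely be one feature among several (for example alongside bag-of-words features) passed to a trained classifier, rather than compared against a hand-picked cutoff.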