Community Discovery Algorithm and Its Technical Improvement Based on Link Structure–Taking Web Community Algorithm as an Example

2020 
Link analysis is an important way to discover potential web communities. This paper analyzes the characteristics of the Web link structure, and studies the traditional maximum flow algorithm and the maximum flow algorithm based on HITS algorithm for edge capacity allocation. The traditional maximum flow problem, but there are still shortcomings. Based on the analysis of existing link similarity definitions, this paper proposes a new definition of link similarity and topic dissimilarity to better describe the relationship between linked pages, and to measure based on link similarity and topic dissimilarity. The similarity of the pages gives a more reasonable and efficient maximum flow-side capacity allocation scheme. The community discovery of 6 topics shows that the maximum flow algorithm proposed in this paper using the link similarity and topic dissimilarity model to allocate edge capacity can better solve the problems of existing algorithms and significantly improve the quality of the Web community. This paper proposes new methods and ideas for page similarity measurement, and provides new strategies for improving the Web community discovery algorithm based on link analysis.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    9
    References
    0
    Citations
    NaN
    KQI
    []