A scholar disambiguation method based on heterogeneous relation-fusion and attribute enhancement

2020 
Name disambiguation becomes increasingly important in information retrieval in the era of big data, while on how to further improve the accuracy of disambiguating duplicate names, the existing models are encountering many problems or facing many challenges, such as 1) how to capture and sufficiently use the global structure features of the network; 2) how to handle the complexity of feature data sets; 3) how to differentiate different types of feature relations, and 4) how to deal with the feature information missing etc. All these challenges are very important issues in further improving the accuracy. In order to address the above issues, this paper proposed a novel model which is called HRFAENE (Heterogeneous Relation Fusion and Attribute Enhanced Network Embedding Model). This model considers both feature network structure information (multiple relations) and document attribute features. The feature network structure information is represented by a scalable loss function which is designed based on pairwise constraints. The document attribute features are comprehensively extracted through multiple heterogeneous information networks which are constructed based on strong features. In order to better identify the disambiguation entity, this model uses weak features as node attributes in strong feature networks and iteratively learns network structure information and node attribute information. The experimental results show that the proposed model significantly outperforms the existing models in the disambiguation accuracy and has good stability, indicating that the model is very effective as expected and can be applied in reality.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    15
    References
    1
    Citations
    NaN
    KQI
    []