A two-step deep learning approach to data classification and modeling and a demonstration on subject type relationship analysis in the Web of Science

2020 
It is common sense that some subjects have strong relationships while others are perhaps almost mutually independent, but a quantitative and systematic approach to describe such sense is a deficiency. A technique called pointwise mutual information (PMI) from information science helps to fulfill the request, but the calculation through a large-scale database is computationally infeasible if one requires an instantaneous value. This work provides a two-step remedy via deep learning for estimating and predicting relationships among two subject types that are found in the large-scale citation database called the Web of Science. The resulting model successfully replicates existing PMI values among subject types, and it can be used for predicting PMI values of two subject types if one or both subject types does not exist in the database.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    28
    References
    1
    Citations
    NaN
    KQI
    []