String similarity computing based on position and cosine

2017 
E-Business platform needs to have the production selection functionalities according to the products' feature and their cost performance, and at the same time, we need to clean data in the production and sale process, so it is important to calculate similarity between products. This paper proposes a new way to compute the similarity of string by segmenting string into words, numbering the corresponding positions and vectorizing the string. Then the similarity between the strings is computed by computing the cosine angle of the two vectors. Experiments show that the method avoids the maximum or minimum of LCS and GST. In addition, the proposed method also improves the accuracy of similarity calculation.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    3
    References
    1
    Citations
    NaN
    KQI
    []