Two phase cluster validation approach towards measuring cluster quality in unstructured and structured numerical datasets

2020 
This paper presents an improved cluster validation scheme called two phase cluster validation (TPCV) and aims to estimate the inter closeness and inter separation among the clusters in the cluster set of unsupervised clustering schemes based on probability measure for validating the cluster quality without prior identification. First phase, the TPCV computes the representative cluster centroid of each individual cluster in the cluster set based on standard mean operation and then it estimates the probability of inter closeness of each cluster with other clusters in the cluster set based on cluster centroid. Next phase, it calculates the probability of separation among the clusters in the cluster set based on cluster centroid by distance measure. Experimental results show that the TPCV scheme is simple and effective to estimate the cluster quality by measuring the probability of closeness and separation between the clusters in the result of unsupervised clustering scheme.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    23
    References
    6
    Citations
    NaN
    KQI
    []