A Polynomial Algorithm for Balanced Clustering via Graph Partitioning

2020 
Abstract The objective of clustering is to discover natural groups in datasets and to identify geometrical structures which might reside there, without assuming any prior knowledge on the characteristics of the data. The problem can be seen as detecting the inherent separations between groups of a given point set in a metric space governed by a similarity function. The pairwise similarities between all data objects form a weighted graph whose adjacency matrix contains all necessary information for the clustering process. Consequently, the clustering task can be formulated as a graph partitioning problem. In this context, we propose a new cluster quality measure which uses the ratio of intra- and inter-cluster variance and allows us to compute the optimal clustering under the min-max principle in polynomial time. Our algorithm can be applied to both partitional and hierarchical clustering.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    24
    References
    0
    Citations
    NaN
    KQI
    []