EGA-FMC: enhanced genetic algorithm-based fuzzy k-modes clustering for categorical data

2018 
Categorical data clustering is the unsupervised technique of grouping similar objects which have categorical attributes. We propose a genetic algorithm-based fuzzy k-modes categorical data clustering algorithm using multi-objective rank-based selection with enhanced elitism operation. Compactness of the clusters and inter-cluster separation were chosen as objectives to be optimised. During elitism, in every iteration, the best parent chromosomes were identified. The entire population was passed through the selection, crossover and mutation steps. The worst children were then replaced by the best parents. Our method was evaluated on three real-world datasets and resulted in clusters of better quality as compared to current methods with a significant reduction in computation time. Additionally, statistical significance tests were conducted to show the superiority of our approach over other clustering solutions.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    0
    References
    1
    Citations
    NaN
    KQI
    []