On non-traditional spatial clustering frameworks

2010 
In traditional clustering, grouping of objects is obtained by running a clustering algorithm once on a single dataset. The main goal of this dissertation is the investigation of frameworks, tools, techniques, and algorithms for cluster analysis, which significantly deviate from the traditional clustering paradigm. We propose techniques to cluster multiple datasets to mine interesting relationships between them. The first approach, called Correspondence Analysis by Interestingness Comparison, clusters individual datasets separately and then obtains knowledge by analyzing the relationship between different clusterings. A variant of this approach is a technique called Correspondence Clustering that clusters one dataset by using clusters of the other datasets as guidance. The third approach called CMDJ (Clustering Multiple Datasets Jointly) clusters multiple datasets jointly. We cluster multiple datasets in a single run of a clustering algorithm and the clusters obtained by this approach contain objects that originate from different datasets. Post analysis techniques are then used to extract knowledge from the obtained clusters. Finally, we introduce a clustering technique called Multi-run Clustering that clusters a single dataset multiple times to obtain the better clustering results by combining clusters that originate from different runs. The presented frameworks are evaluated in case studies centering on progression of glaucoma, ozone pollution, and earthquake analysis.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    0
    References
    0
    Citations
    NaN
    KQI
    []