On non-traditional spatial clustering frameworks
2010
In traditional clustering, grouping of objects is obtained by running a clustering algorithm once on a single dataset. The main goal of this dissertation is the investigation of frameworks, tools, techniques, and algorithms for cluster analysis, which significantly deviate from the traditional clustering paradigm. We propose techniques to cluster multiple datasets to mine interesting relationships between them. The first approach, called Correspondence Analysis by Interestingness Comparison, clusters individual datasets separately and then obtains knowledge by analyzing the relationship between different clusterings. A variant of this approach is a technique called Correspondence Clustering that clusters one dataset by using clusters of the other datasets as guidance. The third approach called CMDJ (Clustering Multiple Datasets Jointly) clusters multiple datasets jointly. We cluster multiple datasets in a single run of a clustering algorithm and the clusters obtained by this approach contain objects that originate from different datasets. Post analysis techniques are then used to extract knowledge from the obtained clusters. Finally, we introduce a clustering technique called Multi-run Clustering that clusters a single dataset multiple times to obtain the better clustering results by combining clusters that originate from different runs. The presented frameworks are evaluated in case studies centering on progression of glaucoma, ozone pollution, and earthquake analysis.
Keywords:
- k-medians clustering
- Correlation clustering
- Single-linkage clustering
- FLAME clustering
- Cluster analysis
- Canopy clustering algorithm
- CURE data clustering algorithm
- Machine learning
- Artificial intelligence
- Brown clustering
- Computer science
- Data mining
- Hierarchical clustering
- Determining the number of clusters in a data set
- Fuzzy clustering
- Correction
- Source
- Cite
- Save
- Machine Reading By IdeaReader
0
References
0
Citations
NaN
KQI