Generalized Independent Subspace Clustering
2016
Data can encapsulate different object groupings in subspaces of arbitrary dimension and orientation. Finding such subspaces and the groupings within them is the goal of generalized subspace clustering. In this work we present a generalized subspace clustering technique capable of finding multiple non-redundant clusterings in arbitrarily-oriented subspaces. We use Independent Subspace Analysis (ISA) to find the subspace collection that minimizes the statistical dependency (redundancy) between clusterings. We then cluster in the arbitrarily-oriented subspaces identified by ISA. Our algorithm ISAAC (Independent Subspace Analysis and Clustering) uses the Minimum Description Length principle to automatically choose parameters that are otherwise difficult to set. We comprehensively demonstrate the effectiveness of our approach on synthetic and real-world data.
Keywords:
- Data mining
- Linear subspace
- Kernel (linear algebra)
- Minimum description length
- Probability density function
- Machine learning
- Principal component analysis
- Cluster analysis
- Random variable
- Subspace topology
- Artificial intelligence
- Pattern recognition
- Mathematics
- Computer science
- Redundancy (engineering)
- Data modeling
- Correction
- Source
- Cite
- Save
- Machine Reading By IdeaReader
22
References
17
Citations
NaN
KQI