Abstract
Current spectral clustering algorithms suffer from both sensitivity to scaling parameter selection in similarity matrix construction, and data perturbation. This paper aims to improve robustness in clustering algorithms and combat these two limitations based on heat kernel theory. Heat kernel can statistically depict traces of random walk, so it has an intrinsic connection with diffusion distance, with which we can ensure robustness during any clustering process. By integrating heat distributed along time scale, we propose a novel method called Aggregated Heat Kernel (AHK) to measure the distance between each point pair in their eigenspace. Using AHK and Laplace-Beltrami Normalization (LBN) we are able to apply an advanced noise-resisting robust spectral mapping to original dataset. Moreover it offers stability on scaling parameter tuning. Experimental results show that, compared to other popular spectral clustering methods, our algorithm can achieve robust clustering results on both synthetic and UCI real datasets.
Original language | English (US) |
---|---|
Title of host publication | Proceedings - 11th IEEE International Conference on Data Mining, ICDM 2011 |
Pages | 270-279 |
Number of pages | 10 |
DOIs | |
State | Published - Dec 1 2011 |
Externally published | Yes |
Event | 11th IEEE International Conference on Data Mining, ICDM 2011 - Vancouver, BC, Canada Duration: Dec 11 2011 → Dec 14 2011 |
Other
Other | 11th IEEE International Conference on Data Mining, ICDM 2011 |
---|---|
Country/Territory | Canada |
City | Vancouver, BC |
Period | 12/11/11 → 12/14/11 |
All Science Journal Classification (ASJC) codes
- Engineering(all)