1 Department of Computer Science, Science and Technology, Aarhus University2 Department of Computing and Information Systems, The University of Melbourne3 Department of Computer Science, Science and Technology, Aarhus University
Clustering, the grouping of data based on mutual similarity, is often used as one of principal tools to analyze and understand data. Unfortunately, most conventional techniques aim at finding only a single clustering over the data. For many practical applications, especially those being described in high dimensional data, it is common to see that the data can be grouped into different yet meaningful ways. This gives rise to the recently emerging research area of discovering alternative clusterings. In this preliminary work, we propose a novel framework to generate multiple clustering views. The framework relies on a constrained data projection approach by which we ensure that a novel alternative clustering being found is not only qualitatively strong but also distinctively different from a reference clustering solution. We demonstrate the potential of the proposed framework using both synthetic and real world datasets and discuss some future research directions with the approach.
2012 Siam International Conference on Data Mining: 3rd Multiclust Workshop: Discovering, Summarizing and Using Multiple Clusterings, 2012, p. 23-30