CluSim: a python package for calculating clustering similarity

Abstract

Clustering is a primary method to reveal the structure of data (Jain, Murty, & Flynn, 1999). To understand, evaluate, and leverage data clusterings, we need to quantitatively compare them. Clustering comparison is the basis for method evaluation, consensus clustering, and tracking the temporal evolution of clusters, among many other tasks. For instance, the evaluation of a clustering method is usually achieved by comparing the method’s result to a planted reference clustering, assuming that the more similar the method’s solution is to the reference clustering, the better the method. Despite the importance of clustering comparison, no consensus has been reached for a standardized assessment; each similarity measure rewards and penalizes different criteria, sometimes producing contradictory conclusions.
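
To make the last point concrete, here is a minimal sketch of two classic pair-counting measures, the Rand index and the Jaccard index, written in plain Python. It does not use CluSim's own API. The function names (`pair_counts`, `rand_index`, `jaccard_index`) and the toy clusterings are assumptions made for illustration only. Each clustering is represented as a list of cluster labels, one per element.

```python
from itertools import combinations


def pair_counts(labels_a, labels_b):
    """Classify every element pair by whether each clustering groups it together."""
    if len(labels_a) != len(labels_b):
        raise ValueError("both clusterings must label the same elements")
    both = only_a = only_b = neither = 0
    for i, j in combinations(range(len(labels_a)), 2):
        same_a = labels_a[i] == labels_a[j]
        same_b = labels_b[i] == labels_b[j]
        if same_a and same_b:
            both += 1          # pair grouped together in both clusterings
        elif same_a:
            only_a += 1        # grouped together only in the first clustering
        elif same_b:
            only_b += 1        # grouped together only in the second clustering
        else:
            neither += 1       # separated in both clusterings
    return both, only_a, only_b, neither


def rand_index(labels_a, labels_b):
    """Fraction of pairs on which the two clusterings agree."""
    both, only_a, only_b, neither = pair_counts(labels_a, labels_b)
    total = both + only_a + only_b + neither
    return (both + neither) / total if total else 1.0


def jaccard_index(labels_a, labels_b):
    """Like the Rand index, but ignores pairs separated in both clusterings."""
    both, only_a, only_b, _ = pair_counts(labels_a, labels_b)
    denom = both + only_a + only_b
    return both / denom if denom else 1.0


# Hypothetical planted reference: two clusters of three elements each.
reference = [0, 0, 0, 1, 1, 1]
# Two deliberately extreme candidate results.
singletons = [0, 1, 2, 3, 4, 5]   # every element in its own cluster
one_cluster = [0, 0, 0, 0, 0, 0]  # all elements in a single cluster

for name, candidate in [("singletons", singletons), ("one_cluster", one_cluster)]:
    print(name, rand_index(reference, candidate), jaccard_index(reference, candidate))
```

On this toy input the Rand index scores the all-singletons result higher (0.6 against 0.4), while the Jaccard index scores the single-cluster result higher (0.4 against 0.0). The Rand index rewards pairs that both clusterings separate, and the Jaccard index ignores them, so the two measures rank the same candidates in opposite order.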

Citation

Gates, Alexander J., and Ahn, Yong Yeol. "CluSim: a python package for calculating clustering similarity." Journal of Open Source Software, vol. 4, no. 35, 2019-03-21, https://doi.org/10.21105/joss.01264.

Journal

Journal of Open Source Software
