CluSim: a python package for calculating clustering similarity

dc.contributor.authorGates, Alexander J.
dc.contributor.authorAhn, Yong Yeol
dc.date.accessioned2025-02-20T16:03:07Z
dc.date.available2025-02-20T16:03:07Z
dc.date.issued2019-03-21
dc.description.abstractClustering is a primary method to reveal the structure of data (Jain, Murty, & Flynn, 1999). To understand, evaluate, and leverage data clusterings, we need to quantitatively compare them. Clustering comparison is the basis for method evaluation, consensus clustering, and tracking the temporal evolution of clusters, among many other tasks. For instance, the evaluation of a clustering method is usually achieved by comparing the method’s result to a planted reference clustering, assuming that the more similar the method’s solution is to the reference clustering, the better the method. Despite the importance of clustering comparison, no consensus has been reached for a standardized assessment; each similarity measure rewards and penalizes different criteria, sometimes producing contradictory conclusions.
dc.identifier.citationGates, Alexander J., and Ahn, Yong Yeol. "CluSim: a python package for calculating clustering similarity." Journal of Open Source Software, vol. 4, no. 35, 2019-03-21, https://doi.org/10.21105/joss.01264.
dc.identifier.issn2475-9066
dc.identifier.otherBRITE 4392
dc.identifier.urihttps://hdl.handle.net/2022/31324
dc.language.isoen
dc.relation.isversionofhttps://doi.org/10.21105/joss.01264
dc.relation.isversionofhttps://joss.theoj.org/papers/10.21105/joss.01264.pdf
dc.relation.journalJournal of Open Source Software
dc.rightsThis work may be protected by copyright unless otherwise stated.
dc.titleCluSim: a python package for calculating clustering similarity

Files

Can’t use the file because of accessibility barriers? Contact us