Bisect and Conquer: Hierarchical Clustering via Max-Uncut Bisection

dc.contributor.author: Ahmadian, Sara
dc.contributor.author: Chatziafratis, Vaggos
dc.contributor.author: Epasto, Alessandro
dc.contributor.author: Lee, Euiwoong
dc.contributor.author: Mahdian, Mohammad
dc.contributor.author: Makarychev, Konstantin
dc.contributor.author: Yaroslavtsev, Grigory
dc.date.accessioned: 2025-02-20T15:55:52Z
dc.date.available: 2025-02-20T15:55:52Z
dc.date.issued: 2019-12-15
dc.description.abstract: Hierarchical Clustering is an unsupervised data analysis method which has been widely used for decades. Despite its popularity, it had an underdeveloped analytical foundation and to address this, Dasgupta recently introduced an optimization viewpoint of hierarchical clustering with pairwise similarity information that spurred a line of work shedding light on old algorithms (e.g., Average-Linkage), but also designing new algorithms. Here, for the maximization dual of Dasgupta’s objective (introduced by Moseley-Wang), we present polynomial-time .4246 approximation algorithms that use Max-Uncut Bisection as a subroutine. The previous best worst-case approximation factor in polynomial time was .336, improving only slightly over Average-Linkage which achieves 1/3. Finally, we complement our positive results by providing APX-hardness (even for 0-1 similarities), under the Small Set Expansion hypothesis.
dc.identifier.citation: Ahmadian, Sara, et al. "Bisect and Conquer: Hierarchical Clustering via Max-Uncut Bisection." 2019-12-15.
dc.identifier.other: BRITE 7282
dc.identifier.uri: https://hdl.handle.net/2022/32175
dc.language.iso: en
dc.relation.isversionof: https://arxiv.org/pdf/1912.06983.pdf
dc.title: Bisect and Conquer: Hierarchical Clustering via Max-Uncut Bisection
