hdbscan: Hierarchical density based clustering

McInnes, Leland; Healy, John; Astels, Steve

doi:10.21105/joss.00205


Is Open Access


Publication date: March 2017

Journal: The Journal of Open Source Software

Publisher: The Open Journal


Most cited references (3)


Density-Based Clustering Based on Hierarchical Density Estimates

Ricardo Campello, Davoud Moulavi, Joerg Sander (2013)


Hierarchical Density Estimates for Data Clustering, Visualization, and Outlier Detection

Ricardo Campello, Davoud Moulavi, Arthur Zimek … (2015)


Is Open Access

Consistent procedures for cluster tree estimation and pruning

Ulrike von Luxburg, Kamalika Chaudhuri, Sanjoy Dasgupta … (2014)

For a density \(f\) on \(\mathbb{R}^d\), a high-density cluster is any connected component of \(\{x: f(x) \geq \lambda\}\), for some \(\lambda > 0\). The set of all high-density clusters forms a hierarchy called the cluster tree of \(f\). We present two procedures for estimating the cluster tree given samples from \(f\). The first is a robust variant of the single linkage algorithm for hierarchical clustering. The second is based on the \(k\)-nearest neighbor graph of the samples. We give finite-sample convergence rates for these algorithms which also imply consistency, and we derive lower bounds on the sample complexity of cluster tree estimation. Finally, we study a tree pruning procedure that guarantees, under milder conditions than usual, to remove clusters that are spurious while recovering those that are salient.
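
The definition of a high-density cluster in the abstract above can be illustrated with a small sketch. It is not either of the estimation procedures from the paper: it assumes the density is known exactly (a hypothetical two-component Gaussian mixture on the real line) and evaluates it on a grid, so that the connected components of \(\{x: f(x) \geq \lambda\}\) are simply runs of consecutive grid points. The names density and level_set_components are made up for this illustration.

```python
import math


def density(x):
    """Assumed example density: equal mixture of N(-2, 1) and N(+2, 1)."""
    def normal_pdf(value, mean):
        return math.exp(-0.5 * (value - mean) ** 2) / math.sqrt(2 * math.pi)
    return 0.5 * normal_pdf(x, -2.0) + 0.5 * normal_pdf(x, 2.0)


def level_set_components(grid, lam):
    """Return (lo, hi) intervals of consecutive grid points with density >= lam.

    On a 1-D grid, each maximal run of such points approximates one
    connected component of the level set {x : f(x) >= lam}.
    """
    components = []
    start = None
    previous = None
    for x in grid:
        if density(x) >= lam:
            if start is None:
                start = x          # a new component begins here
            previous = x
        elif start is not None:
            components.append((start, previous))  # the run just ended
            start = None
    if start is not None:
        components.append((start, previous))      # run reaching the grid edge
    return components


# Grid on [-6, 6] with spacing 0.01.
grid = [i / 100.0 for i in range(-600, 601)]

low = level_set_components(grid, 0.02)   # threshold below the valley at x = 0
high = level_set_components(grid, 0.15)  # threshold above the valley

print(len(low), low)
print(len(high), high)
```

At the lower threshold the level set is a single interval, and at the higher threshold it splits into two intervals, each contained in the first. That nesting across values of \(\lambda\) is the hierarchy the abstract calls the cluster tree.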