Jorsorokin/HDBSCAN

HDBSCAN - hierarchical density-based clustering for applications with noise

https://github.com/Jorsorokin/HDBSCAN


This is a MATLAB implementation of HDBSCAN, a hierarchical version of DBSCAN. HDBSCAN is described in Campello et al. 2013 and Campello et al. 2015. Please see the extensive documentation in the GitHub repository. Suggestions for improvements and offers of collaboration are welcome!
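
For readers unfamiliar with how HDBSCAN differs from DBSCAN, the sketch below illustrates the mutual reachability distance that the published algorithm builds on. It is a minimal illustration in plain Python and is not taken from this repository. The function names and the `min_pts` parameter are hypothetical, and the core distance here counts the point itself as its own first neighbor, which is one common convention.

```python
import math


def core_distances(points, min_pts):
    """Distance from each point to its min_pts-th nearest neighbor.

    Assumption: the point itself counts as its first neighbor,
    so min_pts=1 gives a core distance of 0 for every point.
    """
    cores = []
    for p in points:
        dists = sorted(math.dist(p, q) for q in points)
        cores.append(dists[min_pts - 1])
    return cores


def mutual_reachability(points, min_pts):
    """Pairwise mutual reachability distances as a nested list.

    For distinct points a and b this is
    max(core(a), core(b), euclidean(a, b)); the diagonal is 0.
    """
    cores = core_distances(points, min_pts)
    n = len(points)
    matrix = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            d = max(cores[i], cores[j], math.dist(points[i], points[j]))
            matrix[i][j] = d
            matrix[j][i] = d
    return matrix


if __name__ == "__main__":
    # Three nearby points and one isolated point (made-up data).
    pts = [(0, 0), (0, 1), (0, 2), (10, 0)]
    for row in mutual_reachability(pts, min_pts=2):
        print([round(v, 3) for v in row])
```

In the algorithm as described by Campello et al., a minimum spanning tree over these distances is then used to build the cluster hierarchy. Points in sparse regions get large core distances, so they are pushed away from every other point and tend to end up labelled as noise.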

Cite As

Jordan Sorokin (2026). Jorsorokin/HDBSCAN (https://github.com/Jorsorokin/HDBSCAN), GitHub. Retrieved .

Acknowledgements

Inspired by: gaimc : Graph Algorithms In Matlab Code

General Information

MATLAB Release Compatibility

  • Compatible with any release

Platform Compatibility

  • Windows
  • macOS
  • Linux


Version 1.0.0.0 Release Notes

Added the "minClustNum" parameter to the HDBSCAN object. It helps recover child clusters when the algorithm returns only a few large clusters and the user considers that result too coarse.

Updated the main algorithm for a large speedup (5-10x) by switching away from the native MATLAB "graph" class during fitting. Prediction of new points is also faster and more accurate.

Improved performance and memory usage for very large (>15,000 point) data sets. Also added "sparse_to_csr.m", a file by the author of "bfs.m" and "mst_prim.m", for converting sparse matrices to compressed sparse row (CSR) form.

To view or report issues in this GitHub add-on, visit the GitHub Repository.