Balanced Iterative Reducing and Clustering Using Hierarchies
A Hierarchical Clustering method which uses the following two phases:
- Phase 1: Builld an inital CF-Tree (Clustering Feature) which is a multi-level compression of the data that preserves inherent clustering structure of the data
- Phase 2: Use any Clustering method to cluster the leaf nodes of the CF-Tree
Algorithm
- For each point
- Find closest leaf-node entry
- Add point to leaf-node entry and update CF
- If entry_diameter > max_diameter, then split leaf node and possibly parents
- Information about new point is passed to root
Laufzeit is and incremental.
Advantages
- Finds good clustering with a single scan
- Can finetune the clustering with additional scans