Frequent Pattern Growth
Avoids explicit candidate generation (compared to A priori - Frequent Item Approach) and thus has a Depth-first search approach.
Major philosophy
Grow long patterns from short ones using local frequent items only. For example restricting database to frequent item abc. Now local frequent item d on restricted database will lead to frequent item abcd on whole dataset.
Method
1. Construct FP-tree from Transaction Database
- Scan DB once to find frequent 1-Itemset based on
min_sup
- Sort frequent items in frequency-descending order → F-list
- Scan DB again to construct FP-tree
2. Find Itemset p from its Conditional Pattern Base
- Frequent Item - Header Table
- Conditional Pattern Base
- Conditional FP-tree
- Recursion on Conditional FP-tree
Scaling by Database Projection
Parallel Projection Partition Projection
Advantages
- Can use “Divide and Conquer” Approach
- Compressed DB using FP-tree
- No candidate generation and test needed → see A priori - Frequent Item Approach
- No repeated scans of the Database