Frequent Pattern Growth

Avoids explicit candidate generation (compared to A priori - Frequent Item Approach) and thus has a Depth-first search approach.

Major philosophy

Grow long patterns from short ones using local frequent items only. For example restricting database to frequent item abc. Now local frequent item d on restricted database will lead to frequent item abcd on whole dataset.

Method

1. Construct FP-tree from Transaction Database

  1. Scan DB once to find frequent 1-Itemset based on min_sup
  2. Sort frequent items in frequency-descending order → F-list
  3. Scan DB again to construct FP-tree

2. Find Itemset p from its Conditional Pattern Base

Scaling by Database Projection

Parallel Projection Partition Projection

Advantages