BOAT

Decision Tree Induction algorithm that is scalable.

Uses Bootstraping to create several smaller samples each fitting in memory. Each subset will be used to create a tree which will then be combined into the final Decision Tree.

Advantages

  • scalable → Database only needs to be scanned twice
  • incremental → can be update with new data