Binning

Used to discretize continous/numerical data sequences.

Equal Width (distance)

Dividing the data into equal sized intervals with a width that can be calculated like this:

  • Outlier can dominate
  • Skewed data isn’t handled well

Equal Depth (frequency)

Dividing the data into intervals, each containing approximately the same numper of samples.

Other Methods