Feature Selection

Feature selection reduces the dimensionality of a dataset by keeping only a subset of the original features.

Methods

  • Remove redundant or irrelevant attributes (e.g. IDs)

    • manual selection using expert knowledge
  • Heuristic Search

  • LASSO

  • Elastic-Net Regression

  • Sensitivity Analysis

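  • Univariate selection: compute the correlation coefficient between each feature and the target, and keep only features that reach a minimum threshold

    • looks at one feature at a time, so it might miss features that are only informative in combination with others (interactions)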
  • Forward Selection

    • train a model on each feature alone and keep the best-performing feature
    • add the remaining feature that improves performance the most
    • repeat until the desired number of features is reached
  • Backward Selection

    • start with all features
    • remove each feature in turn and discard the one whose removal gives the best performance
    • repeat
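
The univariate correlation filter listed above could be sketched roughly as follows. This is a minimal illustration, not a reference implementation: the function names, the toy data, and the threshold of 0.5 are all assumptions made for the example, and the absolute value of the Pearson coefficient is used so that strong negative correlations are kept as well.

```python
import math

def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0  # a constant column carries no linear signal
    return cov / math.sqrt(var_x * var_y)

def select_by_correlation(features, target, threshold=0.5):
    """Keep features whose |correlation| with the target meets the threshold.

    `features` maps a feature name to its column of values.
    """
    return [name for name, col in features.items()
            if abs(pearson(col, target)) >= threshold]

# Toy data, made up for illustration only.
features = {
    "size": [10, 20, 30, 40, 50, 60],      # tracks the target closely
    "noise": [3, 1, 3, 1, 3, 1],           # weakly related to the target
    "constant": [7, 7, 7, 7, 7, 7],        # never varies
}
target = [12, 19, 33, 41, 48, 62]

selected = select_by_correlation(features, target, threshold=0.5)
print(selected)
```

Because each feature is scored on its own, a feature that only matters jointly with another (for example, a target equal to the XOR of two balanced binary features) would score near zero here and be dropped.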
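
Forward and backward selection as outlined above are both greedy searches over feature subsets. The sketch below assumes a caller-supplied `score(subset)` function (hypothetical; in practice it would usually be something like the cross-validated performance of a model trained on that subset) and assumes that both searches stop at a fixed `n_features`, which the notes only state for forward selection.

```python
def forward_selection(all_features, score, n_features):
    """Greedily add the feature that yields the best score until n_features are chosen."""
    selected = []
    remaining = list(all_features)
    while remaining and len(selected) < n_features:
        # Try each remaining feature together with the ones already selected.
        best = max(remaining, key=lambda f: score(selected + [f]))
        selected.append(best)
        remaining.remove(best)
    return selected

def backward_selection(all_features, score, n_features):
    """Start with all features and greedily drop one at a time."""
    selected = list(all_features)
    while len(selected) > n_features:
        # Drop the feature whose removal leaves the best-scoring subset.
        drop = max(selected,
                   key=lambda f: score([g for g in selected if g != f]))
        selected.remove(drop)
    return selected

# Toy scoring function, made up for illustration: each feature has a fixed
# "usefulness", and every extra feature costs a small penalty.
usefulness = {"age": 0.5, "income": 0.3, "id": 0.0, "zip": 0.1}

def toy_score(subset):
    return sum(usefulness[f] for f in subset) - 0.01 * len(subset)

names = ["age", "income", "id", "zip"]
forward_result = forward_selection(names, toy_score, n_features=2)
backward_result = backward_selection(names, toy_score, n_features=2)
print(forward_result, backward_result)
```

With a real model the two directions do not have to agree, since each only explores one greedy path through the possible subsets. Passing the scorer in as a function keeps the search logic independent of any particular model or metric.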