Linear Discriminant Analysis
A Supervised Learning type of Dimensionality Reduction. It takes class labels into account.
It tries to find the feature subspace that optimizes class separability.
Assumptions
- normally distributed
- identical Covariance matrices
- independent
but still works reasonably well with slight violations of these assumptions.
General Idea
Steps
- Standardization
- d-dimensional mean vector for every class
- Between-Class Scatter Matrix and Within-Class Scatter Matrix
- Eigenvectors and Eigenvalues for
- Sort
- Choose top k and create transformation matrix
- Project into subspace with transformation matrix