Shapley Additive Explanations

A more efficient way to approximate Shapley Values.

Global SHAP

Run SHAP for every instance and obtain a matrix of shapley values.

SHAP Feature Importance, average over all shapley values (for each instance) for a given feature. So we get the average contribution on model output of a feature.

Pros

Cons

  • all Shapley Values properties
  • not selective
  • training data access needed
  • predictions on unrealistic datapoints if data is correlated
  • KernelSHAP still slow
  • KernelSHAP ignores dependence between subsets of included and absent features

References

https://www.youtube.com/watch?v=VB9uV-x0gtg