AdaBoost
An ensemble model based on Boosting: weak classifiers are trained in sequence, each one paying more attention to the training tuples misclassified by its predecessors, and their weighted votes are combined into the final prediction.
Training
We have a dataset $D$ with $d$ class-labeled samples $(X_1, y_1), \dots, (X_d, y_d)$.
- Initialize lists
    - List to hold the weights: $W$
    - List to hold the classifiers: $M$
    - List to hold the weight-updates: $B$
- Initialize weights for the first classifier so that each tuple has the same probability of being sampled: $w_j = \frac{1}{d}$
- Generate $k$ classifiers in $k$ iterations
    - At iteration $i$ do:
        - Calculate the normalized weights: $p_j = \frac{w_j}{\sum_{l=1}^{d} w_l}$
        - Use the Bootstrap method to sample the dataset $D$ with replacement according to the previously assigned weights $p_j$ to form the training set $D_i$ for classifier $M_i$
        - Derive model $M_i$ from $D_i$
        - Test model $M_i$ with test set $D_i$ by calculating the error as the sum of the weights of all misclassified tuples: $error(M_i) = \sum_{j} w_j \cdot err(X_j)$, where $err(X_j) = 1$ if $X_j$ is misclassified and $0$ otherwise
        - If this error is bigger than $0.5$, abandon this classifier and go back to the sampling step to draw a new training set
        - Calculate the weight update as $\beta_i = \frac{error(M_i)}{1 - error(M_i)}$
        - Update the weights for the next iteration by multiplying them with $\beta_i$ if they have been correctly classified: $w_j \leftarrow w_j \cdot \beta_i$. Since $\beta_i < 1$, this reduces the weight of a tuple if it was classified correctly and leaves the weight as it is if it has been misclassified.
        - Add $w$, $M_i$ and $\beta_i$ to their respective lists
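The training loop above can be sketched in Python. The decision-stump weak learner, the cap on resampling attempts, and the small clamp on the error (to avoid $\beta_i = 0$ for a perfect classifier) are illustrative assumptions, not part of the algorithm as stated:

```python
import numpy as np

def fit_stump(X, y):
    """Hypothetical weak learner: exhaustive 1-D decision stump on feature 0,
    assuming binary labels {0, 1}."""
    best = None
    for t in np.unique(X[:, 0]):
        for left, right in ((0, 1), (1, 0)):
            pred = np.where(X[:, 0] <= t, left, right)
            err = np.mean(pred != y)
            if best is None or err < best[0]:
                best = (err, (t, left, right))
    return best[1]

def stump_predict(stump, X):
    t, left, right = stump
    return np.where(X[:, 0] <= t, left, right)

def adaboost_train(X, y, k=5, seed=0):
    d = len(X)
    rng = np.random.default_rng(seed)
    w = np.full(d, 1.0 / d)              # equal initial weights: w_j = 1/d
    models, betas = [], []
    attempts = 0
    while len(models) < k and attempts < 20 * k:  # bounded retries (assumption)
        attempts += 1
        p = w / w.sum()                  # normalized weights p_j
        idx = rng.choice(d, size=d, replace=True, p=p)  # bootstrap sample D_i
        stump = fit_stump(X[idx], y[idx])               # derive model M_i
        miss = stump_predict(stump, X) != y
        error = (w * miss).sum() / w.sum()              # sum of misclassified weights
        if error > 0.5:                  # abandon this classifier and resample
            continue
        error = max(error, 1e-10)        # clamp so a perfect stump gives beta > 0
        beta = error / (1.0 - error)     # weight update beta_i
        w = np.where(miss, w, w * beta)  # shrink weights of correctly classified tuples
        models.append(stump)
        betas.append(beta)
    return models, betas
```

Each pass through the loop either commits one weak classifier and its $\beta_i$ to the lists, or discards the sample and tries again.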
Prediction
- Initialize the weights of each class to zero
- For each classifier $M_i$:
    - Calculate the weight of its vote: $w_i = \log\frac{1 - error(M_i)}{error(M_i)} = \log\frac{1}{\beta_i}$
    - Get the prediction $c = M_i(X)$ from that weak classifier
    - Add $w_i$ to the weight for class $c$
- Return the class with the largest weight
Or in short: $M(X) = \arg\max_{c} \sum_{i:\, M_i(X) = c} \log\frac{1}{\beta_i}$
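The weighted vote can be sketched as follows; representing the weak classifiers as plain callables and clamping $\beta_i$ away from zero are assumptions made for illustration:

```python
import numpy as np

def adaboost_predict(classifiers, betas, X, classes):
    """Weighted majority vote: classifier M_i votes with weight log(1 / beta_i).
    `classifiers` is assumed to be a list of callables mapping X to label arrays."""
    votes = np.zeros((len(X), len(classes)))
    for clf, beta in zip(classifiers, betas):
        vote_w = np.log(1.0 / max(beta, 1e-10))  # clamp: a perfect classifier has beta = 0
        pred = clf(X)
        for ci, c in enumerate(classes):
            votes[pred == c, ci] += vote_w       # add the vote weight to the predicted class
    return np.array(classes)[votes.argmax(axis=1)]  # class with the largest total weight
```

Note that a single low-error classifier (small $\beta_i$, hence a large $\log\frac{1}{\beta_i}$) can outvote several mediocre ones, which is exactly the intent of the voting weights.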