Marcs Notes

❯

❯

❯

Machine Learning

❯

Classification

10. Juni 20251 min read

Classification

Predicts categorical class labels (discrete, nominal) by constructing a model based on training data to use for classifying new data → Decision Boundary.

Class Imbalance

General Process

Model construction:

training set with class labels
classification rules, decision trees, formulae

Model usage:

Estimate Accuracy
- Test set (independent of training set) to compare with results from model
- Accuracy Rate → hit rate
if acceptable use model to classify new data

Methods

Bayesian Classifier
Naive Bayes Classifier
Decision Tree
Random Forest
Logistic Regression
Perceptron
Adaline
SVM
K-Nearest Neighbors

Perceptron & Adaline

Both are not able to converge for data that is not linearly seperable

SVM vs. Logistic Regression

Logistic Regression is more prone to Outliers. SVM only look at the support vectors so it isnt as sensitive.
Logistic Regression can be updated easier with streamed data

also see Performance Evaluation Metrics

Graphansicht

Classification
General Process
Methods
Perceptron & Adaline
SVM vs. Logistic Regression

Backlinks

Data Mining in the ML and Statistics Community
Hierarchy
Binary Classification
Class Imbalance
One-Versus-Rest
Soft Margin
K-Nearest Neighbors
Decision Tree
Bagging
Boosting
Confusion Matrix
Classification and Regression Trees
Linear Regression
Logistic Regression
MOC - Intro to XAI
Unit
Forest-RC
Text Classification
Logistic Function
Lasso Regression
Supervised Learning
Support Vector Machine
Mining Frequent Patterns, Associations and Correlations
Subsumption Test

Erstellt mit Quartz v4.5.0 © 2025

GitHub