SMOTE & SMOTE-NC Calculator

Machine Learning Lectures SS2026 HKA/EIT

Step-by-step synthetic data generation with full calculations

1 Original Imbalanced Dataset

Total Samples: 25
Minority: 5
Majority: 20
Imbalance Ratio: 1:4
Index X₁ (Feature 1) X₂ (Feature 2) Class
Index X₁ (Feature 1) X₂ (Feature 2) Class
Index X₁ (Feature 1) X₂ (Feature 2) Class

📊 Data Visualization: Before Oversampling

Status: Imbalanced

Feature Space Distribution BEFORE

Minority Class
Majority Class
Minority
5
Majority
20
Ratio
1:4

Class Distribution

⚠ Problem: The minority class is significantly underrepresented, which can lead to biased model predictions favoring the majority class.