Interactive visualization of PCA dimensionality reduction with simulated projections from classic ML datasets
📊 Simulated 2D/3D projection of 4D iris measurements
Original features: sepal length, sepal width, petal length, petal width
Classes: Setosa, Versicolor, Virginica
Real PCA typically preserves ~95% variance in 2 components
Step 1: Center data by subtracting mean
Step 2: Compute covariance matrix
Step 3: Find eigenvalues/eigenvectors
Step 4: Select principal components
Step 5: Transform data
Unsupervised clustering
Density-based spatial clustering
Identify outliers using various algorithms