A summary of the clustering work I've been doing.
Based on 2010 Census data (Statistical Abstract of the United States).
The optimal number of clusters (k) was 2, chosen by majority rule across NbClust's 30 tests. I used Box-Cox-transformed, PCA-whitened, and weighted variables, along with fviz_dist to visualize the similarity between clusters.
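The majority rule itself can be sketched in a few lines of Python. This is only an illustration of the tallying, not the NbClust implementation: the function name, the tie-breaking rule, and the vote list are all hypothetical, and the votes are not the actual results from this analysis.

```python
from collections import Counter


def majority_rule_k(votes):
    """Return (k, count) for the most frequently proposed number of clusters.

    Ties are broken in favor of the smaller k (an assumption of this sketch).
    """
    if not votes:
        raise ValueError("need at least one vote")
    counts = Counter(votes)
    # Pick the highest count; among equal counts, prefer the smaller k.
    best_k, best_count = min(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return best_k, best_count


# Hypothetical votes, one per test; NOT the real NbClust output.
example_votes = [2, 2, 3, 2, 4, 2, 3, 2]
best_k, n_votes = majority_rule_k(example_votes)
print(f"k = {best_k} proposed by {n_votes} of {len(example_votes)} tests")
```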
Violin plots summarize the resulting clusters.
A 3D PCA plot shows the top 3 principal components, which together explain 77% of the variance.
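For reference, a figure like this is a cumulative share: the variances (eigenvalues) of the leading components divided by the total variance. A minimal Python sketch of that arithmetic follows; the function name is hypothetical and the eigenvalues are made up for illustration, not taken from this analysis.

```python
def cumulative_variance_share(eigenvalues, n_components):
    """Share of total variance captured by the n largest eigenvalues."""
    if not 0 <= n_components <= len(eigenvalues):
        raise ValueError("n_components must be between 0 and len(eigenvalues)")
    total = sum(eigenvalues)
    if total <= 0:
        raise ValueError("total variance must be positive")
    # Sort descending so the leading components come first.
    top = sorted(eigenvalues, reverse=True)[:n_components]
    return sum(top) / total


# Made-up eigenvalues for illustration only.
example_eigenvalues = [4.0, 2.0, 1.0, 0.5, 0.5]
share = cumulative_variance_share(example_eigenvalues, 3)
print(f"Top 3 components explain {share:.1%} of the variance")
```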
And finally, a pair plot shows the 2 clusters mapped onto every pairwise scatterplot of the original data.