KAy-means for MIxed LArge data(English)
- algorithm that enables simultaneous handling of continuous and categorical data. This method has the capacity to manage mixed data types while maintaining the data’s native scale, integrating k-means for continuous variables with k-modes for categorical variables. The optimal number of clusters can be identified using multiple validation methods: silhouette analysis (cluster package), within-cluster sum of squares, and gap statistics
- ICM, SHAP
- Statistics, Classification, Cardiology
- https://doi.org…554-025-03541-4