Authors
Byoung Jik Lee, Western Illinois University, USA
Abstract
We propose a valid approach to find the degree of important attributes in unlabeled dataset to improve the clustering performance. The significant degrees of attributes are extracted through the training of unsupervised simple competitive learning with the raw unlabeled data. These significant degrees are applied to the original dataset and generate the weighted dataset reflected by the degrees of influentialvalues for the set ofattributes. This work is simulated on the UCI Machine Learning repository dataset. The Scikit-learn K-Means clustering with raw data, scaled data, and the weighted data are tested. The result shows that the proposed approach improves the performance.
Keywords
Unsupervised MachineLearning, Simple Competitive Learning, SignificantDegree of Attributes, Scikitlearn K-Means Clustering, Weighted Data, UCI Machine Learning Data.