The Evalutation Of The K-Means Clustering Algorithm Using Different Distancing Methods

Publication Year2015

0
Citations
34
Usage
0
Captures
0
Mentions
0
Social Media

Metric Options: Counts1 Year3 Year

Metrics Details

Usage
34
- Abstract Views
  34

Artifact Description

This research primarily focused on finding differences in various distancing methods used in the k-means clustering algorithm. The distancing methods used throughout the experiment are the Euclidean, Manhattan, and Earth Movers Distance. To accomplish this task, code in Python, wrapped around some C and Fortran code, was used to process images and determine the quality of the clusters made in the algorithm. The tests executed were performed on a ground truth to determine the quality of the measurements. For this experiment, Kylberg’s Texture Set was used as that primary ground truth. After initial results were determined, with an assumed cluster count of 28 (1 cluster per texture), further testing was required to search for significant differences in the data. So, the cluster count was optimized using a Q-test and the Anderson-Darling Statistic. All the optimization data was used to find more accurate results of the primary experiment, and the cluster count, for Kylberg’s Texture set, was actually optimized around 400 or 500. Results for the k-means clustering run on Kylberg’s Texture set clusters, using the optimized cluster count, are not included in this paper.

Bibliographic Details

REPOSITORY URLhttps://scholarexchange.furman.edu/scjas/2015/all/112

URL IDhttps://scholarexchange.furman.edu/scjas/2015/all/112; https://scholarexchange.furman.edu/cgi/viewcontent.cgi?article=1222&context=scjas

AUTHOR(S)

Alex Hoover

Provide Feedback

Have ideas for a new metric? Would you like to see something else here?Let us know