Preprint Article Version 1 Preserved in Portico This version is not peer-reviewed

A Comparison of K-Means and Mean Shift Algorithms

Version 1 : Received: 3 August 2021 / Approved: 5 August 2021 / Online: 5 August 2021 (11:00:32 CEST)

How to cite: Shumaila, M.N. A Comparison of K-Means and Mean Shift Algorithms. Preprints 2021, 2021080140 (doi: 10.20944/preprints202108.0140.v1). Shumaila, M.N. A Comparison of K-Means and Mean Shift Algorithms. Preprints 2021, 2021080140 (doi: 10.20944/preprints202108.0140.v1).

Abstract

Clustering, or otherwise known as cluster analysis, is a learning problem that takes place without any human supervision. This technique has often been utilized, much efficiently, in data analysis, and serves for observing and identifying interesting, useful, or desired patterns in the said data. The clustering technique functions by performing a structured division of the data involved, in similar objects based on the characteristics that it identifies. This process results in the formation of groups, and each group that is formed, is called a cluster. A single said cluster consists of objects from the data, that have similarities among other objects found in the same cluster, and resemble differences when compared to objects identified from the data that now exist in other clusters. The process of clustering is very significant in various aspects of data analysis, as it determines and presents the intrinsic grouping of objects present in the data, based on their attributes, in a batch of unlabeled raw data. A textbook or otherwise said, good criteria, does not exist in this method of cluster analysis. That is because this process is so different and so customizable for every user, that needs it in his/her various and different needs. There is no outright best clustering algorithm, as it massively depends on the user’s scenario and needs. This paper is intended to compare and study two different clustering algorithms. The algorithms under investigation are k-mean and mean shift. These algorithms are compared according to the following factors: time complexity, training, prediction performance and accuracy of the clustering algorithms.

Keywords

K-Mean, Mean-Shift, Performance, Accuracy

Subject

MATHEMATICS & COMPUTER SCIENCE, Algebra & Number Theory

Comments (0)

We encourage comments and feedback from a broad range of readers. See criteria for comments and our diversity statement.

Leave a public comment
Send a private comment to the author(s)
Views 0
Downloads 0
Comments 0
Metrics 0


×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.
We use cookies on our website to ensure you get the best experience.
Read more about our cookies here.