Page 1 of 1

What are the main limitations of K-means?

Posted: Wed Apr 06, 2022 4:35 am
by Mark
Hello
I need to answer this question for my assignment:

What are the main limitations of K-means?

I think the key issue is how to set the parameter k. Is there some other limitations?

thanks in advance

Re: What are the main limitations of K-means?

Posted: Wed Apr 06, 2022 12:46 pm
by Lin
also kmeans tend to find clusters having a spherical shape, dont work well with clusters of different shapes and densities

Re: What are the main limitations of K-means?

Posted: Wed Apr 06, 2022 11:31 pm
by gmc
About the parameter k, first it Is hard to set and second there Is the problem of cluster initialization. I mean that depending on how the cluster centers are initialized can give very different results.

Re: What are the main limitations of K-means?

Posted: Wed Jun 01, 2022 2:57 am
by Mark
gmc wrote: Wed Apr 06, 2022 11:31 pm About the parameter k, first it Is hard to set and second there Is the problem of cluster initialization. I mean that depending on how the cluster centers are initialized can give very different results.
thanks All for your answers. That means a lot to me

Re: What are the main limitations of K-means?

Posted: Wed Jan 04, 2023 1:47 am
by Gerald
The main limitations of K-Means are that it needs the number of clusters as an input, it is a biased algorithm, and also it fails to detect outliers. Outliers can be classified as any data that are not near the cluster or do not belong to the cluster. These outliers have a tendency to break down the dataset, thereby rendering the accuracy and performance of the K-Means algorithm weak.

Re: What are the main limitations of K-means?

Posted: Tue Jan 31, 2023 1:10 am
by admin
Some important limitations of K-means are that it is sensitive to outliers, it requires the specification of the number of clusters, and it tend to produce clusters that have globular shapes.

Also K-means is not suitable for clustering data with non-numeric features or categorical data, as it relies on the mean of the data points in order to determine the clusters.

Finally, K-means can fail to converge on the optimal clustering solution if the initial cluster centers are chosen poorly.