K-means：每次添加后连续更新质心

K-means:Updaing centroids successively after each addition

k-means

假设我们有五个数据点 A、B、C、D、E，我们使用 K 均值聚类算法将它们聚类成两个 clusters.Can 我们更新质心如下：

让我们select前两个，即A，B作为初始簇的质心。然后计算C到A的距离以及B.SayC离A更近。在下一步之前用质心 A 更新簇的质心，即现在新的质心是 (A+C)/2 和 B。然后计算D与这些新质心的距离等等。

是的，我们似乎可以按照 Kumar 的 “数据挖掘导论”第 8 章中的解释在 k-means 中逐步更新质心。这是实际的文本：

Updating Centroids Incrementally

Instead of updating cluster centroids after all points have been assigned to a cluster, the centroids can be updated incrementally, after each assignment of a point to a cluster. Notice that this requires either zero or two updates to cluster centroids at each step, since a point either moves to a new cluster (two updates) or stays in its current cluster (zero updates). Using an incremental update strategy guarantees that empty clusters are not produced since all clusters start with a single point, and if a cluster ever has only one point, then that point will always be reassigned to the same cluster.

In addition, if incremental updating is used, the relative weight of the point being added may be adjusted; e.g., the weight of points is often decreased as the clustering proceeds. While this can result in better accuracy and faster convergence, it can be difficult to make a good choice for the relative weight, especially in a wide variety of situations. These update issues are similar to those involved in updating weights for artificial neural networks.

Yet another benefit of incremental updates has to do with using objectives other than “minimize SSE.” Suppose that we are given an arbitrary objective function to measure the goodness of a set of clusters. When we process an individual point, we can compute the value of the objective function for each possible cluster assignment, and then choose the one that optimizes the objective. Specific examples of alternative objective functions are given in Section 8.5.2.

On the negative side, updating centroids incrementally introduces an order dependency. In other words, the clusters produced may depend on the order in which the points are processed. Although this can be addressed by randomizing the order in which the points are processed, the basic K-means approach of updating the centroids after all points have been assigned to clusters has no order dependency. Also, incremental updates are slightly more expensive. However, K-means converges rather quickly, and therefore, the number of points switching clusters quickly becomes relatively small.