All Implemented Interfaces:
Program, ProgramDescription
public class KMeansSingleStep
extends Object
implements Program, ProgramDescription
K-means is a well-known clustering algorithm (see
http://en.wikipedia.org/wiki/K-means_clustering). KMeansIteration is a PACT
program that computes a single iteration of the k-means algorithm. The job
has two inputs: a set of data points and a set of cluster centers. A Cross
PACT computes the distance from every cluster center to every data point. A
subsequent Reduce PACT assigns each data point to the cluster center
closest to it. Finally, a second Reduce PACT computes the new locations of
all cluster centers.
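
The three stages described above can be sketched in plain Java, without the PACT API. This is an illustration only, not the implementation of this class: the names KMeansStepSketch, Distance, cross, assign, recompute and singleStep are hypothetical, Euclidean distance is assumed, and a center that receives no points is assumed to keep its old position.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical in-memory sketch of one k-means step; it does not use the PACT API.
class KMeansStepSketch {

    /** One (point, center, distance) record, as a cross stage might emit it. */
    static final class Distance {
        final int pointId;
        final int centerId;
        final double value;

        Distance(int pointId, int centerId, double value) {
            this.pointId = pointId;
            this.centerId = centerId;
            this.value = value;
        }
    }

    /** Stage 1 (cross): compute the distance for every (point, center) pair. */
    static List<Distance> cross(double[][] points, double[][] centers) {
        List<Distance> out = new ArrayList<>();
        for (int p = 0; p < points.length; p++) {
            for (int c = 0; c < centers.length; c++) {
                double sum = 0.0;
                for (int d = 0; d < points[p].length; d++) {
                    double diff = points[p][d] - centers[c][d];
                    sum += diff * diff;
                }
                // Assumption: Euclidean distance.
                out.add(new Distance(p, c, Math.sqrt(sum)));
            }
        }
        return out;
    }

    /** Stage 2 (reduce keyed by point id): keep the closest center per point. */
    static Map<Integer, Distance> assign(List<Distance> distances) {
        Map<Integer, Distance> best = new HashMap<>();
        for (Distance d : distances) {
            Distance current = best.get(d.pointId);
            if (current == null || d.value < current.value) {
                best.put(d.pointId, d);
            }
        }
        return best;
    }

    /** Stage 3 (reduce keyed by center id): average the points of each center. */
    static double[][] recompute(double[][] points, double[][] centers,
                                Map<Integer, Distance> assignment) {
        int dims = centers[0].length;
        double[][] sums = new double[centers.length][dims];
        int[] counts = new int[centers.length];
        for (Distance d : assignment.values()) {
            for (int i = 0; i < dims; i++) {
                sums[d.centerId][i] += points[d.pointId][i];
            }
            counts[d.centerId]++;
        }
        double[][] result = new double[centers.length][dims];
        for (int c = 0; c < centers.length; c++) {
            for (int i = 0; i < dims; i++) {
                // Assumption: a center with no assigned points keeps its old position.
                result[c][i] = counts[c] == 0 ? centers[c][i] : sums[c][i] / counts[c];
            }
        }
        return result;
    }

    /** Runs the three stages once and returns the new cluster centers. */
    static double[][] singleStep(double[][] points, double[][] centers) {
        return recompute(points, centers, assign(cross(points, centers)));
    }
}
```

Calling singleStep repeatedly, feeding each result back in as the centers, would give the full iterative algorithm. A single call corresponds to the one iteration this program computes.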