This is based on the concept of dynamically adding and modifying clusters as new data is presented. Modifying a cluster changes both its position and its size, where size is expressed as a radius parameter associated with each cluster that determines the boundary of that cluster. ECM has only one parameter, the distance threshold $D_{thr}$, which drives the addition of clusters. When a new cluster is added, its centre is set equal to the example that triggered its creation, and its radius $R$ is initially set to zero. $R$ grows as more vectors are allocated to the cluster. Due to the mechanism by which $R$ is updated, it cannot exceed $D_{thr}$. The ECM algorithm is shown below:
Create the first cluster centre from the first example
for each subsequent vector $x$ do
    Find the minimum distance $D_{min}$ between $x$ and each cluster centre
    if $D_{min}$ is less than any cluster radius then
        Add $x$ to the nearest cluster
    else
        Find the cluster $a$ with the minimum value of $S_a$, where $S_a = D_a + R_a$, $D_a$ is the distance between the cluster centre and vector $x$, and $R_a$ is the radius of cluster $a$
        if $S_a > 2 D_{thr}$ then
            Create a new cluster
        else
            Update cluster $a$
        end if
    end if
end for
When cluster $a$ is updated, its centre is shifted closer to $x$ and its radius is set according to the equation below:
$$R_a^{new} = \frac{S_a}{2}$$
The new centre of $a$ is set so that it lies on the line between $x$ and the old centre of $a$, at a distance of $R_a^{new}$ from $x$.
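The procedure above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not a reference implementation: the function name `ecm` and the argument `d_thr` are hypothetical, Euclidean distance is assumed, and the membership test is read as "the vector falls within the radius of at least one cluster". The test uses `<=` rather than a strict `<`, so that a vector coinciding with a zero-radius centre counts as a member.

```python
import math

def ecm(vectors, d_thr):
    """One-pass clustering sketch; returns (centres, radii)."""
    it = iter(vectors)
    # The first example becomes the first cluster centre, with radius zero.
    centres = [list(next(it))]
    radii = [0.0]
    for x in it:
        dists = [math.dist(x, c) for c in centres]
        # Assumed reading of the membership test: x lies inside some cluster.
        if any(d <= r for d, r in zip(dists, radii)):
            continue  # x joins an existing cluster; no cluster is modified
        # Otherwise find the cluster a that minimises S = D + R.
        s_vals = [d + r for d, r in zip(dists, radii)]
        a = min(range(len(centres)), key=s_vals.__getitem__)
        if s_vals[a] > 2 * d_thr:
            # Too far from every cluster: x seeds a new zero-radius cluster.
            centres.append(list(x))
            radii.append(0.0)
        else:
            new_r = s_vals[a] / 2
            # Place the centre on the line from x to the old centre,
            # at distance new_r from x (dists[a] > 0 in this branch).
            t = new_r / dists[a]
            centres[a] = [xi + t * (ci - xi) for xi, ci in zip(x, centres[a])]
            radii[a] = new_r
    return centres, radii

centres, radii = ecm([(0.0, 0.0), (1.0, 0.0), (5.0, 0.0), (0.5, 0.0)], d_thr=1.0)
print(centres, radii)  # [[0.5, 0.0], [5.0, 0.0]] [0.5, 0.0]
```

In this run the second vector enlarges the first cluster to radius 0.5 and moves its centre to (0.5, 0), the third vector is too far away and starts a new cluster, and the fourth falls inside the first cluster and changes nothing. Because a cluster is only updated when $S_a \le 2 D_{thr}$, the new radius $S_a / 2$ never exceeds $D_{thr}$.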