1 This algorithm is historically known as the K-means method in statistics and pattern recognition.

[2] We already have an interface to a linear discriminant although we still do not have experimental results.

3 For reasons of simplicity we shall use the term class instead of dependent variable from now on.

4 We always use the median as a centrality statistic. We prefer it to the mean to avoid outliers influence.