Hi,
Could someone advise me please as to whether it is acceptable to use K means clustering on a dataset that contains binary variables. I did not think this was appropriate, as the distance measure is the Euclidean distance measure for continuous data, but I have recently seen examples of this.
Thanks
Jenny
|