Hi,
I have a lot of data (around 300,000 rows) and 5 clustering variables. 3 of these have only positive integral values, one is binary and the last one (a variable for day of week) is what is really killing me. Essentially the day of week variable (dow) takes values from Mon-Sun. I dont know how to treat this variable as Mon-Wed distance is the same as Sat-Mon distance. Is it possible to define a distance matrix specifically for this variable. I'm really new to this so any help would be really appreciated.
Thanks