A Data Science Central Community
I am facing a simple problem and trying to find the optimum solution:
Y(cont) = x1(cat) + x2(cat) + x3(cat) + x4(cat) + x5(cont)
Where: cat = categorical and cont = continuous. My categorical variables have 100 classes.
So my Y is continuous and 4 of the 5 Xs are categorical. What is the optimal approach? ANOVA? I think ANOVA would be appropriate only if ALL of my Xs were categorical. If I simply apply a linear regression, then I…Continue
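One way to read the model above is as an ordinary linear regression in which each categorical X is expanded into 0/1 dummy columns (one per class, minus a reference class) while the continuous X enters as-is; this mix of factors and a covariate is what is usually called ANCOVA. The sketch below is only an illustration under stated assumptions: the data are made up, just one categorical predictor with three classes is shown, and the helper names (`dummy_code`, `solve`, `fit_ols`) are hypothetical. A categorical variable with 100 classes would add 99 dummy columns in the same way.

```python
# Minimal sketch: least squares with a dummy-coded categorical predictor
# plus one continuous covariate. All data and helper names are illustrative.

def dummy_code(values):
    """Reference coding: one 0/1 column per class except the first (sorted) class."""
    levels = sorted(set(values))
    coded = [[1.0 if v == lvl else 0.0 for lvl in levels[1:]] for v in values]
    return coded, levels

def solve(a, b):
    """Solve the square system a @ x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        x[i] = (m[i][n] - sum(m[i][j] * x[j] for j in range(i + 1, n))) / m[i][i]
    return x

def fit_ols(rows, y):
    """Least-squares coefficients from the normal equations (X'X) beta = X'y."""
    p = len(rows[0])
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(p)] for i in range(p)]
    xty = [sum(r[i] * yi for r, yi in zip(rows, y)) for i in range(p)]
    return solve(xtx, xty)

# Toy data: x1 is categorical (classes a, b, c), x5 is continuous.
x1 = ["a", "a", "b", "b", "c", "c"]
x5 = [1.0, 2.0, 1.0, 3.0, 2.0, 4.0]
# y is generated without noise as 1 + 2*[x1 == b] + 5*[x1 == c] + 0.5*x5.
effect = {"a": 0.0, "b": 2.0, "c": 5.0}
y = [1.0 + effect[g] + 0.5 * v for g, v in zip(x1, x5)]

dummies, levels = dummy_code(x1)
# Design matrix columns: intercept, dummy for b, dummy for c, covariate x5.
design = [[1.0] + d + [v] for d, v in zip(dummies, x5)]
beta = fit_ols(design, y)
print(levels, beta)
```

Because the toy y is noise-free, the fitted coefficients recover the generating values up to floating-point error. In practice a statistics package would handle the coding and also report standard errors, which this sketch does not attempt.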
I have this question. I have a dataset with unique IDs (people).
Each one has some attributes. I want to classify them as good or bad customers.
Since I do not have a training set (i.e., a score of 0 or 1 for some of the IDs), how can I classify them into 2 groups?
I understand that regression (logistic, for example) cannot be used since I do not have a dependent variable.
One solution could be clustering, for example, and have only 2…Continue
Added by Konstantinos Chlouverakis on March 16, 2014 at 5:43pm — No Comments
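The clustering idea mentioned in the second question could look roughly like the sketch below: k-means with two clusters on unlabeled records. This is an illustration only. The two numeric attributes, the toy values, and the function name `kmeans2` are assumptions, and the starting centers are chosen deterministically (first point and the point farthest from it) just to keep the example reproducible. Note that the algorithm only returns two groups numbered 0 and 1; deciding which group, if either, corresponds to "good" customers needs outside knowledge, and attributes on different scales would normally be standardized first.

```python
# Minimal sketch: k-means with k = 2 on unlabeled records.
# The attributes and values below are hypothetical.

def dist2(p, q):
    """Squared Euclidean distance between two points."""
    return sum((a - b) ** 2 for a, b in zip(p, q))

def mean(points):
    """Coordinate-wise mean of a non-empty list of points."""
    n = len(points)
    return [sum(col) / n for col in zip(*points)]

def kmeans2(points, max_iter=100):
    # Deterministic start: the first point and the point farthest from it.
    c0 = points[0]
    c1 = max(points, key=lambda p: dist2(p, c0))
    centers = [list(c0), list(c1)]
    labels = None
    for _ in range(max_iter):
        # Assignment step: each point goes to its nearest center.
        new_labels = [
            0 if dist2(p, centers[0]) <= dist2(p, centers[1]) else 1
            for p in points
        ]
        if new_labels == labels:
            break  # assignments stopped changing
        labels = new_labels
        # Update step: each center moves to the mean of its members.
        for k in (0, 1):
            members = [p for p, lab in zip(points, labels) if lab == k]
            if members:
                centers[k] = mean(members)
    return labels, centers

# Six toy customers described by two numeric attributes.
customers = [
    [1.0, 1.2], [0.8, 1.0], [1.1, 0.9],
    [5.0, 5.2], [5.3, 4.8], [4.9, 5.1],
]
labels, centers = kmeans2(customers)
print(labels, centers)
```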