A Data Science Central Community
I have performed a desiccation tree analysis. The problem is that I get an "impossible" tree, the combination that the tree gives can’t be true.
The first split is with a variable, how many donations the customer has given during a period. One of node is 'Missing', zero gifts during that time period.
Then this node splittes into several child nodes with help of a component variable that holds info about recency and frequency. When I read the child nodes titles I see that a component value that cannot be here is there. I have checked I the raw data and that combination don’t exist.
I have not run the score code, just locked at the tree diagram and that’s when a saw the odd split. Can it be like this, the tree diagram gives wrong info but the core code does it right? Can it be like this because of some 'missing' vägue bug I EM or because of that there are several hundreds of different values of the component variable and the diagram just shows med one or two.
Anyone met this problem before?