A Data Science Central Community
Tags: R, forests, partitioning, random, trees
There is a great book called "Elements of Statistical Learning" by Hastie, Tibshirani, and Friedman which describes in detail classification and regression trees and many other data mining methods. You can buy the book or download it in pdf. The link is:
http://www-stat.stanford.edu/~tibs/ElemStatLearn/
Lester Wollman
Isn't it funny that CART is one of the best algorithms 27 years after it was introduced? :-)
Logistic regression has similar story - very old and very useful for today's problems.
If you want to improve accuracy learn more about ensembles - boosting, bagging, adaboost, random forests. But don't expect miracles, they all improve CART's performance by relatively small margin. And all got some pros&cons.
http://www.salford-systems.com/doc/newhybridmethods.pdf - here's nice presentation about CART, only 13 years old, but really worth reading.
© 2021 TechTarget, Inc.
Powered by
Badges | Report an Issue | Privacy Policy | Terms of Service
Most Popular Content on DSC
To not miss this type of content in the future, subscribe to our newsletter.
Other popular resources
Archives: 2008-2014 | 2015-2016 | 2017-2019 | Book 1 | Book 2 | More
Most popular articles