A Data Science Central Community

Here are 13 books on Machine Learning and Data Mining that are great resources, references, and refreshers for Data Scientists. (This is definitely a small selective subsample of the many excellent books available.)

- The Top Ten Algorithms in Data Mining, by Xindong Wu and Vipin Kumar (editors)
- Learning from Data, by Y.Abu-Mostafa, M.Magdon-Ismail, H-S.Lin
- Mining of Massive Datasets, by Jeffrey David Ullman and Anand Rajaraman
- Handbook of Statistical Analysis and Data Mining Applications, by G.Miner, J.Elder, R.Nisbet
- Machine Learning for Hackers, by Drew Conway and John Myles White
- Mahout in Action, by S.Owen, R.Anil, T.Dunning, E.Friedman
- Statistical and Machine-Learning Data Mining: Techniques for Better..., by Bruce Ratner
Networks, Crowds, and Markets: Reasoning About a Highly Connected W..., by David Easley and Jon Kleinberg

- Bayesian Reasoning and Machine Learning, by David Barber
Ensemble Methods in Data Mining: Improving Accuracy Through Combini..., by Giovanni Seni and John Elder (Older Edition is also available)

- Data Mining with R: Learning with Case Studies, by Luis Torgo
- Using R for Data Management, Statistical Analysis, and Graphics, by Nicholas Horton and Ken Kleinman
- Introduction to Data Mining, by P-N.Tan, M.Steinbach, V.Kumar

And for my astronomer friends, here are a couple of additional suggestions:

14. Statistics, Data Mining, and Machine Learning in Astronomy: A Pract..., by Z.Ivezic, A.Connolly, J.VanderPlas, A.Gray

15. Advances in Machine Learning and Data Mining for Astronomy, by M.Way, J.Scargle, K.Ali, A.Srivastava

© 2020 TechTarget, Inc. Powered by

Badges | Report an Issue | Privacy Policy | Terms of Service

**Most Popular Content on DSC**

To not miss this type of content in the future, subscribe to our newsletter.

- Book: Applied Stochastic Processes
- Long-range Correlations in Time Series: Modeling, Testing, Case Study
- How to Automatically Determine the Number of Clusters in your Data
- New Machine Learning Cheat Sheet | Old one
- Confidence Intervals Without Pain - With Resampling
- Advanced Machine Learning with Basic Excel
- New Perspectives on Statistical Distributions and Deep Learning
- Fascinating New Results in the Theory of Randomness
- Fast Combinatorial Feature Selection

**Other popular resources**

- Comprehensive Repository of Data Science and ML Resources
- Statistical Concepts Explained in Simple English
- Machine Learning Concepts Explained in One Picture
- 100 Data Science Interview Questions and Answers
- Cheat Sheets | Curated Articles | Search | Jobs | Courses
- Post a Blog | Forum Questions | Books | Salaries | News

**Archives:** 2008-2014 |
2015-2016 |
2017-2019 |
Book 1 |
Book 2 |
More

**Most popular articles**

- Free Book and Resources for DSC Members
- New Perspectives on Statistical Distributions and Deep Learning
- Time series, Growth Modeling and Data Science Wizardy
- Statistical Concepts Explained in Simple English
- Machine Learning Concepts Explained in One Picture
- Comprehensive Repository of Data Science and ML Resources
- Advanced Machine Learning with Basic Excel
- Difference between ML, Data Science, AI, Deep Learning, and Statistics
- Selected Business Analytics, Data Science and ML articles
- How to Automatically Determine the Number of Clusters in your Data
- Fascinating New Results in the Theory of Randomness
- Hire a Data Scientist | Search DSC | Find a Job
- Post a Blog | Forum Questions