Subscribe to DSC Newsletter

Hi, all:
I am a SAS/STAT user in my company and our company has no budget for any expensive data mining packages like SAS EM or SPSS CLEMENTINE. Can someone suggest to me good open source data mining packges that are good for commercial usage? What I am trying to get is a package that is capable of handling SAS table with size of about 14 million records and about 10 variables.

Also, I know there are tons of data mining algorithms out there and I am totally new to this field. As I said, we use classical statistics methods in our work. Can someone recommend some resources that will help me learn these algorithms? For example, a book or an web site that explains to me when and what algorithm is good for what situaton. Thanks a lot.

Views: 98

Reply to This

Replies to This Discussion

Hello,
i would recommends you to tike a look at this data mining software called Knime (www.knime.org). it looks similar to clementine and you have built-in algorithm and tools. the advantage is that it integrates R & weka so you have a wide range of algorithms and statistical packages.
regarding the resources for learning data-mining i would recommends you to visit http://videolectures.net/ you have plenty of video related to data mining, algorithm and statistical learning.
hope this suggestion will be helpful.
kind regards,
Ramzi Temanni,
Hi, Ramzi:
Thanks for your recommendation. I thought no one would reply to my post. Anyway, I know that there are WEKA, RapidMiner, Knime and even R out there but I don't know which one to start with. How do you think?

RSS

On Data Science Central

© 2020   TechTarget, Inc.   Powered by

Badges  |  Report an Issue  |  Privacy Policy  |  Terms of Service