Subscribe to DSC Newsletter

4 open source data mining tools (with GUI)

I was at the Semantic Web Meetup @ the Hearst Building in NYC (amazing venue, the first green building completed in NYC) yesterday and someone asked about open source tools available for data mining, specifically for clustering. Unfortunately I had to run out after the meetup and couldn’t provide these to him. The one mentioned by the presenter was Weka, which also the first free open source tool I came across.

Anyway, here are the ones I have found that are worth checking out and I’m sure there are others and more to come.

These 4 have GUIs for us non-programmers.

Use at your own risk! I cannot speak to the accuracy of the algorithms although many of them seem to be well established in the field. There are really 2 issues here - you have to be concerned not only about the underlying algorithm being used but also how effectively and accurately it was translated into code. Nevertheless these seem like solid apps


Weka
Java based, open source, with GUI
From the University of Waikato in New Zealand (data mining in New Zealand sounds like lots of fun)


Orange
Python based, open source, GUI
From the AI Laboratory in Ljubljana, Slovenia


Rattle
based on R, open source, GUI
http://rattle.togaware.com/

I just downloaded this one but haven’t had a chance to look at it yet. I thought it should be considered due to the popularity of R.

Rapid Miner
Java based, open source, GUI

This implements the full WEKA catalog as well as their own library

Views: 20565

Comment

You need to be a member of AnalyticBridge to add comments!

Join AnalyticBridge

Comment by Ralf Klinkenberg on December 21, 2009 at 11:11am
Getting started with the open source data mining software RapidMiner:

Introductory video about RapidMiner 5: The most important new features of the completely re-designed new version of the most widely used open source data mining softare RapidMiner:
http://rapid-i.com/videos/rm_5_demo_EN/rm_5_demo_EN.html

KDnuggets Polls 2007, 2008, and 2009: RapidMiner is among the top 3 data mining tools worldwide and the leading open source data mining software:
http://www.pressebox.de/pressemeldungen/rapid-i-gmbh/boxid-276804.html
http://www.kdnuggets.com/polls/2009/data-mining-tools-used.htm

Download and use RapidMiner Community Edition free of charge (no license costs) for unlimited data mining, text mining, web mining, predictive analytics, time series analysis and forecasting:
http://rapid-i.com/content/view/26/84/

Get professional support with guaranteed response times and all the guarantees you need with the RapidMiner Enterprise Edition directly from the RapidMiner team:
http://rapid-i.com/content/view/123/141/

Best regards,
Ralf Klinkenberg

Rapid-I

Follow Us

On Data Science Central

On DataViz

On Hadoop

© 2017   AnalyticBridge.com is a subsidiary and dedicated channel of Data Science Central LLC   Powered by

Badges  |  Report an Issue  |  Terms of Service