Subscribe to DSC Newsletter

Venkatesh Umaashankar's Blog (4)

De-duplicating, merging customer records with clustering

Frustrated with multiple records of the same customer which just differ due to a typo or abbreviation or different possible representations of same address?



Customer duplicate records could be very tricky. They suffer the problems such as abbreviating the address, typos and various possible representation of same address and name.



read more @...…

Continue

Added by Venkatesh Umaashankar on October 30, 2012 at 8:50am — 1 Comment

Optimization plugin for RapidMiner

 Optimization in general means selecting a best choice out of various alternatives, which reduces the cost or disadvantage of an objective.  Optimization problems are very popular in the fields such as economics, finance, logistics, etc.

 

more... @ …

Continue

Added by Venkatesh Umaashankar on October 29, 2012 at 4:12am — No Comments

Real time example for Normalization

In

this article, we are going to see normalization in action in a popular

web application. People who are not familiar with normalization please refer to my previous post.



We all know very well the capability of Google to exploit the… Continue

Added by Venkatesh Umaashankar on September 16, 2010 at 11:23am — No Comments

Data Preprocessing – Normalization

Data Preprocessing – Normalization



Further

to introduction, in this article I am going to discuss “Data

Preprocessing” an important step in the knowledge discovery process, can

be even considered as a fundamental building block of data mining.

People who come from data…
Continue

Added by Venkatesh Umaashankar on September 16, 2010 at 4:06am — 1 Comment

On Data Science Central

© 2021   TechTarget, Inc.   Powered by

Badges  |  Report an Issue  |  Privacy Policy  |  Terms of Service