Subscribe to DSC Newsletter

Data Preprocessing – Normalization


Further
to introduction, in this article I am going to discuss “Data
Preprocessing” an important step in the knowledge discovery process, can
be even considered as a fundamental building block of data mining.
People who come from data warehousing background may already be familiar
with the term ETL ( Stands for Extraction,Transformation and Loading).
Any data mining or data warehousing effort's success is dependent on how
good the ETL is
performed. DP ( I am going to refer Data preprocessing as DP henceforth)
is a part of ETL, its nothing but transforming the data. To be more
precise modifying the source data in to a different format which

(i) enables data mining algorithms to be applied easily
(ii) improves the effectiveness and the performance of the mining algorithms
(iii) represents the data in easily understandable format for both humans and machines
(iv) supports faster data retrieval from databases
(v) makes the data suitable for a specific analysis to be performed.

Read more @
http://intelligencemining.blogspot.com/2009/07/data-preprocessing-n...

Views: 833

Comment

You need to be a member of AnalyticBridge to add comments!

Join AnalyticBridge

Comment by Venkatesh Umaashankar on September 16, 2010 at 11:20am
@Tom - Thanks Tom. I came up with this article to help the beginners under the need for normalization, this was the kind of article which i wish I could ve got when I started. Thanks for your appreciation.

On Data Science Central

© 2020   TechTarget, Inc.   Powered by

Badges  |  Report an Issue  |  Privacy Policy  |  Terms of Service