A Data Science Central Community
Tags:
We can simply normalize data by subtracting it from mean & dividing that term by standard deviation.
(x1-mean(x1,x2,..xn))/sd(x1,x2,....xn)
I hope, it will be useful to you.
Vivek
Hi Sagar,
I have got your concern. Could you please share sample data on which normalization needed. I'll also try from my end & will share you the approach.
regards,
Vivek
Hi Sagar,
If I am correct, your main objective is to transform non-normal data to normal. You can use Johnson transformation.
I hope, it will help you.
Shahid
Hi Sagar,
I'd like to help with your question but want to get some background information first. I assume you are trying to satisfy the requirement of normality of errors in your regression model?
How are you determining that your independent variables are skewed? What measure or method are you using?
Try plotting the residuals of the independent variable and the dependent variable and see what the pattern or shape looks like. That can help you determine what transformation you need to use on your data. Normalizing your data wont hurt either, it will give you more robust coefficient estimates but will change the interpretation slightly.
Let us know how it works!
Kevin
Box Cox transformation, apply boxcox to see the closest transformation power you can apply to make the relationship linear, if first difference and log normalization did not work.
Hello Sagar,
I would suggest bootstrap and/or jackknife methods (in general, resampling is useful in highly non-normal data)
Hi Sagar
I would have taken the approached Mortal Kolle mentioned. However my starting point would also have been Box Cox, but as you mentioned this was unsuccessful.
Regards
Daniel
© 2021 TechTarget, Inc. Powered by
Badges | Report an Issue | Privacy Policy | Terms of Service
Most Popular Content on DSC
To not miss this type of content in the future, subscribe to our newsletter.
Other popular resources
Archives: 2008-2014 | 2015-2016 | 2017-2019 | Book 1 | Book 2 | More
Most popular articles