How to determine the 'Optimum' sample size - AnalyticBridge2020-09-24T19:03:56Zhttps://www.analyticbridge.datasciencecentral.com/forum/topics/how-to-determine-the-optimum?feed=yes&xn_auth=no
Minethedata:
http://www.m…tag:www.analyticbridge.datasciencecentral.com,2011-02-05:2004291:Comment:875382011-02-05T18:45:26.002ZRalph Wintershttps://www.analyticbridge.datasciencecentral.com/profile/RalphWinters
<p> </p>
<p>Minethedata:</p>
<p> </p>
<p><a href="http://www.med.univ-rennes1.fr/wkf/stock/RENNES20100407100014blaviolllogistic1.pdf" target="_blank">http://www.med.univ-rennes1.fr/wkf/stock/RENNES20100407100014blaviolllogistic1.pdf</a></p>
Ralph can you share the detai…tag:www.analyticbridge.datasciencecentral.com,2011-02-05:2004291:Comment:871282011-02-05T15:09:38.701ZMinethedatahttps://www.analyticbridge.datasciencecentral.com/profile/Minethedata
Ralph, can you share the details of the paper?
by variables, I mean main ef…tag:www.analyticbridge.datasciencecentral.com,2011-02-03:2004291:Comment:872302011-02-03T21:41:17.297ZRalph Wintershttps://www.analyticbridge.datasciencecentral.com/profile/RalphWinters
<p>By variables, I mean main effects in the model. There is a paper by Peduzzi that discusses this, in which he shows that 10 times the number of parameters, divided by the proportion of the least likely outcome (in your case .08 churn), yields an adequate sample size. However, I'm not sure what you mean by "optimum" sample size. The required size will always depend on the number of variables in the model, so if you end up throwing out variables for whatever reason, it will change.</p>
<p> </p>
<p>-Ralph Winters</p>
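<p>To illustrate why the figure moves with the number of variables, here is a minimal sketch that runs the same rule of thumb in reverse: given a dataset size and the rate of the rarer outcome, it estimates how many model parameters the data could support. The function name <code>max_supported_params</code> and its arguments are hypothetical, and the default of 10 events per parameter is simply the rule-of-thumb value quoted in this thread, not a guarantee.</p>

```python
import math


def max_supported_params(n_records, event_rate, events_per_param=10):
    """Rough count of parameters supportable under an events-per-parameter rule.

    Assumption: events_per_param=10 is the rule-of-thumb value from this thread.
    """
    if n_records < 0 or not 0 <= event_rate <= 1:
        raise ValueError("n_records must be >= 0 and event_rate in [0, 1]")
    # Expected number of records with the rarer outcome (e.g. churners);
    # round() strips floating-point noise before the floor below.
    rare_events = round(n_records * event_rate, 6)
    return math.floor(rare_events / events_per_param)


# With 8% churn, see how the supportable model size grows with the data.
for n in (1000, 3125, 10000):
    print(n, max_supported_params(n, event_rate=0.08))
```

<p>Dropping variables from the model lowers the number of records this rule asks for, which is why a single "optimum" value is hard to pin down in advance.</p>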
Ralph,
Thanks for your respon…tag:www.analyticbridge.datasciencecentral.com,2011-02-03:2004291:Comment:873212011-02-03T19:55:56.295ZSharath Dandamudihttps://www.analyticbridge.datasciencecentral.com/profile/SharathDandamudi
<p>Ralph,</p>
<p>Thanks for your response. I have a few queries based on your reply, and I would appreciate it if you could resolve them.</p>
<p>1. When you say 25 variables in the model, do you mean the 25 raw variables in the dataset available to me initially?</p>
<p>2. Could you explain the formula/function that you have mentioned? Precisely, how do we get the values of 10 and 0.08?</p>
<p> </p>
<p>In your reply I see the word 'minimum', but I would like to know the 'optimum' sample size instead.</p>
<p> </p>
<p>Regards,</p>
<p>Sharath</p>
This is a function of the n…tag:www.analyticbridge.datasciencecentral.com,2011-02-03:2004291:Comment:874322011-02-03T19:44:02.602ZRalph Wintershttps://www.analyticbridge.datasciencecentral.com/profile/RalphWinters
<p> </p>
<p>This is a function of the number of variables in your model. For example, if you have 25 variables in your model, as a rule of thumb you will need a minimum sample size of 25*10 / .08 = 3125. Then you need to scale up to accommodate your 70%/30% validation split.</p>
<p> </p>
<p>-Ralph Winters</p>
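<p>The arithmetic above can be sketched in a few lines of Python. The helper names <code>min_sample_size</code> and <code>total_with_holdout</code> are hypothetical, and the scale-up step assumes the rule-of-thumb minimum should be met by the 70% training portion alone, which is one reading of the 70%/30% requirement rather than the only one.</p>

```python
import math


def min_sample_size(n_params, event_rate, events_per_param=10):
    """Rule-of-thumb minimum: events_per_param * n_params / event_rate."""
    if n_params <= 0 or not 0 < event_rate <= 1:
        raise ValueError("n_params must be positive and event_rate in (0, 1]")
    # round() strips floating-point noise before taking the ceiling
    return math.ceil(round(events_per_param * n_params / event_rate, 6))


def total_with_holdout(n_train, train_fraction=0.7):
    """Total records needed so the training share alone reaches n_train.

    Assumption: the minimum applies to the training set, not the full data.
    """
    if not 0 < train_fraction <= 1:
        raise ValueError("train_fraction must be in (0, 1]")
    return math.ceil(round(n_train / train_fraction, 6))


# 25 variables, 8% churn, 10 events per variable
n_train = min_sample_size(n_params=25, event_rate=0.08)
# Scale up for a 70%/30% train/validation split
n_total = total_with_holdout(n_train, train_fraction=0.7)
print(n_train, n_total)  # 3125 4465
```

<p>Under these assumptions, roughly 4465 records in total would leave 3125 for model building after 30% is held out for validation.</p>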