Cut off point in logistic regression - AnalyticBridge 2019-10-22T16:28:50Z https://www.analyticbridge.datasciencecentral.com/forum/topics/cut-off-point-in-logistic?feed=yes&xn_auth=no The key point is balancing be… tag:www.analyticbridge.datasciencecentral.com,2017-01-04:2004291:Comment:357869 2017-01-04T00:48:20.293Z Nethra Sambamoorthi https://www.analyticbridge.datasciencecentral.com/profile/NethraSambamoorthi <p>The key point is balancing between predicting true positives in the presence of false positives.  So <span>Use ROC. </span></p> <p>Another method is to use a cost/revenue function for the true positive vs. false positives, so one can use a loss/profit as a measure of balancing true positive with respect to false positives.</p> <p>The key point is balancing between predicting true positives in the presence of false positives.  So <span>Use ROC. </span></p> <p>Another method is to use a cost/revenue function for the true positive vs. false positives, so one can use a loss/profit as a measure of balancing true positive with respect to false positives.</p> I would assume that Hari has… tag:www.analyticbridge.datasciencecentral.com,2014-09-09:2004291:Comment:308165 2014-09-09T20:40:25.773Z Sunpreet Singh Khanuja https://www.analyticbridge.datasciencecentral.com/profile/SunpreetSinghKhanuja <p>I would assume that Hari has balanced the data set to a near 50% probability of event rate and he has not accounted for the balanced sampling in his question.</p> <p>If the apriori probability of event is 17% in the original sample and the balanced sampling yields a near/exact 50% probability, then a 50% cutoff will minimize misclassification.</p> <p>I would assume that Hari has balanced the data set to a near 50% probability of event rate and he has not accounted for the balanced sampling in his question.</p> <p>If the apriori probability of event is 17% in the original sample and the balanced sampling yields a near/exact 50% probability, then a 50% cutoff will minimize misclassification.</p> If your event rate is around… tag:www.analyticbridge.datasciencecentral.com,2011-03-30:2004291:Comment:94655 2011-03-30T16:45:09.416Z Arun https://www.analyticbridge.datasciencecentral.com/profile/Arun <p>If your event rate is around 17% and you say that at 50% cutoff you're getting a very good classification, there's something fishy! How can a logistic model trained to fit only 17% be better than what information the dataset has?</p> <p>Unless, you're measure of accuracy of fit is different from misclassification! Remember, the model usually fits the remaining 83% well, so the misclassification there would be low as compared to the 17%. But I'm unsure how you're getting a 50% cutoff more…</p> <p>If your event rate is around 17% and you say that at 50% cutoff you're getting a very good classification, there's something fishy! How can a logistic model trained to fit only 17% be better than what information the dataset has?</p> <p>Unless, you're measure of accuracy of fit is different from misclassification! Remember, the model usually fits the remaining 83% well, so the misclassification there would be low as compared to the 17%. But I'm unsure how you're getting a 50% cutoff more accurate in terms of misclassification - since, a decrease here, is going to increase it there.</p> <p> </p> <p>The best way to find out the cutoff is by plotting for different values as already suggested, but it's usually got to be around the event rate! In cases where you fit multiple logistic models for homogeneous segments, you could generally lift the cutoff point, not otherwise from my experience!</p> <p> </p> <p>Would be interesting to know what you find out...</p> Hi Hari,   if u r talking abo… tag:www.analyticbridge.datasciencecentral.com,2011-03-30:2004291:Comment:94092 2011-03-30T08:22:48.697Z Triveni Hiremath https://www.analyticbridge.datasciencecentral.com/profile/TriveniHiremath <p>Hi Hari,</p> <p> </p> <p>if u r talking about cut point for probability value, u can decide it by 2 ways .</p> <p> 1. Calculate the misclassification cost for different probability values, and choose the one which will have least misclassification cost. .</p> <p>2 . Draw lift chart for probility values, number of acuretely classified events per decile (In precise Results of KS test). Point where u get highest distance is the cut of point for your probability</p> <p> </p> <p>Hope this…</p> <p>Hi Hari,</p> <p> </p> <p>if u r talking about cut point for probability value, u can decide it by 2 ways .</p> <p> 1. Calculate the misclassification cost for different probability values, and choose the one which will have least misclassification cost. .</p> <p>2 . Draw lift chart for probility values, number of acuretely classified events per decile (In precise Results of KS test). Point where u get highest distance is the cut of point for your probability</p> <p> </p> <p>Hope this helps</p> Sandeep, create plot: risk ra… tag:www.analyticbridge.datasciencecentral.com,2011-03-26:2004291:Comment:93966 2011-03-26T13:07:23.017Z Jozo Kovac https://www.analyticbridge.datasciencecentral.com/profile/JozoKovac Sandeep, create plot: risk rate in percentyls of scored population. And give it to decision maker/risk manager.<br /> <br />  Cut off score is bussiness decision based on risk strategy. There isnt formula for it. Sandeep, create plot: risk rate in percentyls of scored population. And give it to decision maker/risk manager.<br /> <br />  Cut off score is bussiness decision based on risk strategy. There isnt formula for it. May I know which accuracy mea… tag:www.analyticbridge.datasciencecentral.com,2011-02-17:2004291:Comment:90498 2011-02-17T07:38:36.098Z Sandeep Sunkara https://www.analyticbridge.datasciencecentral.com/profile/SandeepSunkara May I know which accuracy measure you are using? May I know which accuracy measure you are using? I have a case where the event… tag:www.analyticbridge.datasciencecentral.com,2011-02-17:2004291:Comment:90496 2011-02-17T07:30:50.183Z Minethedata https://www.analyticbridge.datasciencecentral.com/profile/Minethedata I have a case where the event rate is 17% but a very high accuracy is obtained if the cut off point is 50%. and the accuracy obtained at 17% is very low. So in this case what should be the cut off point. I have a case where the event rate is 17% but a very high accuracy is obtained if the cut off point is 50%. and the accuracy obtained at 17% is very low. So in this case what should be the cut off point. Yes it is. If you have event… tag:www.analyticbridge.datasciencecentral.com,2011-02-15:2004291:Comment:90456 2011-02-15T08:33:36.495Z Sandeep Sunkara https://www.analyticbridge.datasciencecentral.com/profile/SandeepSunkara <p>Yes it is.</p> <p>If you have event rate of 10%, then the predicted probabilities will cluster around 0.1 and hence the cut-off point will also be arount 0.1.</p> <p>If you have event rate of 70%, then the predicted probabilities will cluster around 0.7 and hence the cut-off point will also be arount 0.7.</p> <p> </p> <p>Hope it helps.</p> <p>Yes it is.</p> <p>If you have event rate of 10%, then the predicted probabilities will cluster around 0.1 and hence the cut-off point will also be arount 0.1.</p> <p>If you have event rate of 70%, then the predicted probabilities will cluster around 0.7 and hence the cut-off point will also be arount 0.7.</p> <p> </p> <p>Hope it helps.</p>