<p>Hi all,</p>
<p>I have data containing a few categorical columns, each with a very large number of categories (more than 1,000 distinct values per column). I have to build a predictive model on this data using logistic regression (I cannot use a model that handles categorical data as is, such as Random Forest or Naïve Bayes).</p>
<p>Applying the standard 1-to-N method, which converts each categorical value to a 0-1 vector, produces a very high-dimensional feature space and causes the algorithm…</p>
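<p>To make the dimensionality problem concrete, here is a minimal sketch of 1-to-N coding stored in sparse form, where each row keeps only the positions of its 1-entries. The column names and values (<code>zip</code>, <code>merchant</code>) and the helper names are hypothetical, chosen only for illustration; this is not a recommendation of a particular encoding.</p>

```python
def build_index(rows):
    """Map every distinct (column, value) pair to one indicator position."""
    pairs = sorted({(col, val) for row in rows for col, val in row.items()})
    return {pair: pos for pos, pair in enumerate(pairs)}


def encode_sparse(row, index):
    """Return the sorted positions of the 1-entries for a single row."""
    return sorted(index[(col, val)] for col, val in row.items())


# Hypothetical toy data: two categorical columns.
rows = [
    {"zip": "10001", "merchant": "m1"},
    {"zip": "10002", "merchant": "m1"},
    {"zip": "10003", "merchant": "m2"},
]

index = build_index(rows)
width = len(index)  # number of 0-1 columns the dense coding would need
encoded = [encode_sparse(row, index) for row in rows]
```

<p>The dense width grows with the total number of distinct values across all columns, but every row has only as many non-zero entries as there are categorical columns, which is why a sparse representation stays small even when the width is in the thousands.</p>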
<p>Thanks in advance!</p>
<p>Payment projection scorecard (Janvi, 2012-05-03T09:24:06.624Z, https://www.analyticbridge.datasciencecentral.com/profile/ManishaSadhwani)</p>
<p>Hi dear fellow members,</p>
<p>I am working on a payment projection scorecard for the Collections team. I want to build a continuous-outcome model in which the observed % payment received is split into one event and one non-event with suitable weights (the proportion recovered would be the weight for the event, and 1 minus the proportion recovered, i.e. the proportion not recovered, would be the weight for the non-event).</p>
<p>Would PROC LOGISTIC with the WEIGHT statement be a good option, or should I consider using PROC SURVEYLOGISTIC? I am not using any…</p>
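<p>The event/non-event split described above can be sketched as a simple data-preparation step. This is only an illustration under assumed inputs: the account IDs, the proportions and the function name are hypothetical, and the resulting rows would still need to be passed to a weighted logistic regression routine.</p>

```python
def split_into_weighted_rows(accounts):
    """Turn (account_id, proportion_recovered) pairs into weighted 0/1 rows.

    Each account yields an event row (target 1, weight = proportion) and a
    non-event row (target 0, weight = 1 - proportion).
    """
    rows = []
    for account_id, proportion in accounts:
        if not 0.0 <= proportion <= 1.0:
            raise ValueError(f"proportion out of range for {account_id}")
        rows.append((account_id, 1, proportion))        # event
        rows.append((account_id, 0, 1.0 - proportion))  # non-event
    return rows


# Hypothetical accounts with the observed share of the balance recovered.
accounts = [("A", 0.25), ("B", 1.0), ("C", 0.0)]
weighted_rows = split_into_weighted_rows(accounts)
```

<p>Each account contributes a total weight of 1, so the weighted sample size equals the number of accounts. Rows with zero weight (fully paid or fully unpaid accounts) are kept here for clarity and could be dropped before fitting.</p>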
<p>How to reduce high concordance (more than 85) in logistic regression model? (Biswajit Pal, 2010-08-03T18:20:32.746Z, https://www.analyticbridge.datasciencecentral.com/profile/BiswajitPal)</p>
<p align="left" class="MsoNormal" style="MARGIN: 0cm 0cm 10pt"><font color="#000000" face="Calibri" size="3">Hi</font></p>
<p align="left" class="MsoNormal" style="MARGIN: 0cm 0cm 10pt"><font color="#000000" face="Calibri" size="3">I am getting a very high concordance in one of my logistic regression models.</font></p>
<p align="left" class="MsoNormal" style="MARGIN: 0cm 0cm 10pt"><font color="#000000" face="Calibri" size="3">Can anybody explain its effect on the model, or why it is not…</font></p>
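<p>For readers unfamiliar with the statistic, here is a minimal sketch of the usual pairwise definition of concordance: every event is paired with every non-event, and a pair is concordant when the event has the higher predicted probability. The function name and the toy data are hypothetical, and statistical packages may bin probabilities before comparing them, so their reported figures can differ slightly.</p>

```python
def concordance(actual, predicted):
    """Return percent concordant, discordant and tied event/non-event pairs."""
    events = [p for y, p in zip(actual, predicted) if y == 1]
    nonevents = [p for y, p in zip(actual, predicted) if y == 0]
    total = len(events) * len(nonevents)
    if total == 0:
        raise ValueError("need at least one event and one non-event")

    concordant = discordant = tied = 0
    for e in events:
        for n in nonevents:
            if e > n:
                concordant += 1
            elif e < n:
                discordant += 1
            else:
                tied += 1

    return {
        "concordant": 100.0 * concordant / total,
        "discordant": 100.0 * discordant / total,
        "tied": 100.0 * tied / total,
    }


# Hypothetical outcomes and predicted probabilities.
actual = [1, 1, 0, 0, 0]
predicted = [0.9, 0.4, 0.5, 0.4, 0.1]
result = concordance(actual, predicted)
```

<p>In this toy example 4 of the 6 pairs are concordant, 1 is discordant and 1 is tied. A value above 85 therefore means the model ranks events above non-events in more than 85% of pairs.</p>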
<p>Most people use logistic regression for modeling response, attrition, risk, etc. And in the world of business, these are usually rare occurrences.</p>
<p> </p>
<p>One widely accepted practice is oversampling or undersampling to model these rare events. Some time back, I was working on a campaign response model using logistic regression. After getting frustrated with the model performance/accuracy, I used weights to oversample the responders. I remember clearly that I got the same or a very…</p>
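<p>As a rough illustration of oversampling through weights, the sketch below computes the single weight to give each responder so that the weighted response rate reaches a chosen target, with non-responders keeping a weight of 1. The counts, the target rate and the function name are assumptions made up for this example, not figures from the campaign described above.</p>

```python
def responder_weight(n_responders, n_nonresponders, target_rate):
    """Weight per responder so that the weighted response rate equals target_rate.

    Solves w * n1 / (w * n1 + n0) = t for w, giving w = t * n0 / ((1 - t) * n1).
    """
    if n_responders <= 0 or n_nonresponders <= 0:
        raise ValueError("both classes must be present")
    if not 0.0 < target_rate < 1.0:
        raise ValueError("target_rate must be strictly between 0 and 1")
    return target_rate * n_nonresponders / ((1.0 - target_rate) * n_responders)


# Hypothetical campaign: 50 responders out of 1,000 contacts (5% response).
n1, n0 = 50, 950
w = responder_weight(n1, n0, target_rate=0.5)

# Check the weighted response rate that results from using w.
weighted_rate = w * n1 / (w * n1 + n0)
```

<p>With these numbers each responder gets a weight of 19, which makes the weighted sample balanced at 50% responders.</p>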
