All Discussions Tagged 'rule' - AnalyticBridge2019-08-18T17:26:03Zhttps://www.analyticbridge.datasciencecentral.com/forum/topic/listForTag?tag=rule&feed=yes&xn_auth=noOutliers : Capping ruletag:www.analyticbridge.datasciencecentral.com,2013-06-15:2004291:Topic:2514662013-06-15T19:48:51.041ZRaghuhttps://www.analyticbridge.datasciencecentral.com/profile/Raghu9
<p>Can anyone please help me understand the logic and reasoning behind this piece of code :</p>
<p></p>
<p><span style="color: #ff6600;">data xyz;</span></p>
<p><span style="color: #ff6600;">set xyz;</span></p>
<p><span style="color: #ff6600;">if(_n_ eq 1) then set incdata(keep=incstd inc99);</span></p>
<p><span style="color: #ff6600;">if incstd>2*inc99 then inc_est2= min(inc_est,(4*inc99));</span></p>
<p><span style="color: #ff6600;">else inc_est2=inc_est;…</span></p>
<p></p>
<p>Can anyone please help me understand the logic and reasoning behind this piece of code :</p>
<p></p>
<p><span style="color: #ff6600;">data xyz;</span></p>
<p><span style="color: #ff6600;">set xyz;</span></p>
<p><span style="color: #ff6600;">if(_n_ eq 1) then set incdata(keep=incstd inc99);</span></p>
<p><span style="color: #ff6600;">if incstd>2*inc99 then inc_est2= min(inc_est,(4*inc99));</span></p>
<p><span style="color: #ff6600;">else inc_est2=inc_est;</span></p>
<p><span style="color: #ff6600;">end;</span></p>
<p></p>
<p>Thanks!</p>
<p>Raghu</p> How to prevent scores from caking in scoring models?tag:www.analyticbridge.datasciencecentral.com,2008-03-14:2004291:Topic:63192008-03-14T12:14:42.709ZVincent Granvillehttps://www.analyticbridge.datasciencecentral.com/profile/VincentGranville
The general question is actually about how to produce a nice score distribution, with no large gaps and no huge spikes.<br />
<br />
For instance, if a score S = A1*R1 + A2*R2 + A3*R3 + A4*R4, where R1, R2, R3, R4 are four binary rules (e.g. R4 is "no late payment in last 12 months"), and A1, A2, A3, A4 are weights (penalties) respectively equal to 5, 5, 10 and 20 points, then we have few unique scores because 5+5 =10, 5+5+10 = 20. The weights 4, 5, 10, 20 eliminate this problem, but still produce large…
The general question is actually about how to produce a nice score distribution, with no large gaps and no huge spikes.<br />
<br />
For instance, if a score S = A1*R1 + A2*R2 + A3*R3 + A4*R4, where R1, R2, R3, R4 are four binary rules (e.g. R4 is "no late payment in last 12 months"), and A1, A2, A3, A4 are weights (penalties) respectively equal to 5, 5, 10 and 20 points, then we have few unique scores because 5+5 =10, 5+5+10 = 20. The weights 4, 5, 10, 20 eliminate this problem, but still produce large gaps. Gaps can be reduced by choosing the weights 2, 4, 8, 16, but then this is a too drastic change to the weights, and if rules have highly variable triggering rates ranging from 2 to 60%, we can still end up with an "ugly" score distribution.<br />
<br />
I was wondering if there is some literature on this subject, or how did you address this issue? In particular, in systems with more than 100 rules.