Has anyone used, or does anyone have thoughts on using, a two-step hurdle model to address the imbalance between "GOODS" and "BADS" that is often present in a sample of borrowers?

That is, first run a logistic regression on Good vs Bad, then take all of your Bads and run a separate linear regression with the % paid on the loan as the dependent variable. In the linear model, those who defaulted after 3 months would fare worse than those who nearly paid off the loan completely.
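
To make the two steps concrete, here is a minimal sketch in plain Python. Everything in it is an assumption for illustration only: the toy data, the single predictor (a hypothetical debt-to-income ratio), and the helper names `fit_logistic`, `fit_ols`, `p_bad` and `frac_paid_if_bad`. A real scorecard would use many predictors and a proper statistics library. Note that a linear regression is unbounded, so the predicted fraction paid is clipped to [0, 1] here.

```python
import math

# Hypothetical toy data: (debt_to_income, is_bad, fraction_of_loan_paid).
# Assumption: Goods repaid in full, so their fraction paid is 1.0.
borrowers = [
    (0.10, 0, 1.00), (0.15, 0, 1.00), (0.20, 0, 1.00), (0.25, 0, 1.00),
    (0.30, 0, 1.00), (0.35, 0, 1.00), (0.45, 0, 1.00), (0.50, 0, 1.00),
    (0.40, 1, 0.80), (0.55, 1, 0.60), (0.65, 1, 0.35), (0.75, 1, 0.10),
]

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def fit_logistic(xs, ys, lr=1.0, epochs=20000):
    """Fit P(y=1 | x) = sigmoid(b0 + b1*x) by batch gradient descent."""
    b0 = b1 = 0.0
    n = len(xs)
    for _ in range(epochs):
        g0 = g1 = 0.0
        for x, y in zip(xs, ys):
            err = sigmoid(b0 + b1 * x) - y  # gradient of the log-loss
            g0 += err
            g1 += err * x
        b0 -= lr * g0 / n
        b1 -= lr * g1 / n
    return b0, b1

def fit_ols(xs, ys):
    """Closed-form simple linear regression y = a0 + a1*x."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    a1 = sxy / sxx
    return my - a1 * mx, a1

# Step 1: logistic regression on Good vs Bad, using the full sample.
b0, b1 = fit_logistic([x for x, _, _ in borrowers],
                      [bad for _, bad, _ in borrowers])

# Step 2: linear regression on fraction paid, using the Bads only.
bads = [(x, paid) for x, bad, paid in borrowers if bad == 1]
a0, a1 = fit_ols([x for x, _ in bads], [paid for _, paid in bads])

def p_bad(x):
    """Step-1 output: estimated probability that a borrower goes Bad."""
    return sigmoid(b0 + b1 * x)

def frac_paid_if_bad(x):
    """Step-2 output: estimated fraction paid given Bad, clipped to [0, 1]."""
    return min(1.0, max(0.0, a0 + a1 * x))

for dti in (0.20, 0.50, 0.70):
    print(dti, round(p_bad(dti), 3), round(frac_paid_if_bad(dti), 3))
```

One thing the sketch makes visible: the second model only ever sees the Bads, so it is fitted on a much smaller sample than the first, and its estimates will be correspondingly noisier.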

Finally, combine the outputs of the two models to create a scorecard.
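
One possible way to combine the two outputs, sketched in Python, is to compute an expected fraction repaid and rescale it to a score. This is only an assumption about how the combination could work, not an established method: it treats Goods as repaying 100%, and the function names and the 300 to 850 score range are hypothetical choices.

```python
def expected_fraction_repaid(p_bad, frac_paid_if_bad):
    """Expected share of the loan repaid.

    Assumption: a Good repays 100%; a Bad repays the fraction predicted
    by the second (linear) model, clipped to [0, 1].
    """
    frac = min(1.0, max(0.0, frac_paid_if_bad))
    return (1.0 - p_bad) * 1.0 + p_bad * frac

def to_score(expected, lo=300, hi=850):
    """Map an expected fraction in [0, 1] onto a hypothetical score range."""
    return round(lo + (hi - lo) * expected)

# Hypothetical model outputs for two applicants.
safe = expected_fraction_repaid(p_bad=0.05, frac_paid_if_bad=0.80)
risky = expected_fraction_repaid(p_bad=0.60, frac_paid_if_bad=0.30)
print(to_score(safe), to_score(risky))
```

With this combination, a likely defaulter who is expected to repay most of the loan can outrank a less likely defaulter who is expected to repay almost nothing, which seems to be the point of adding the second step.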

Any thoughts?



Tags: Credit, Data, Hurdle, Imbalanced, Model, Modeling


On Data Science Central

© 2021 TechTarget, Inc.
