I have received a data where only 0.23% claims are fraudulent rest 99.73% are legitimate claims. Can I build a logistic regression model using this data set to identify future suspicious claims/ fraudulent claims?
My worry is such a low % of fraudulent claims in the present data set may not give me a proper result if I use it as it is.
Can you suggest me any particular technique?…
You can share this discussion in two ways…
Share this link:
Send it with your computer's email program: Email this