计算机应用 ›› 2009, Vol. 29 ›› Issue (11): 3088-3091.

• 数据库与数据挖掘 • 上一篇    下一篇

基于Logistic的信用卡套现侦测评分模型

姜盛   

  1. 中国工商银行股份有限公司牡丹卡中心
  • 收稿日期:2009-05-06 修回日期:2009-07-13 发布日期:2009-11-26 出版日期:2009-11-01
  • 通讯作者: 姜盛

Credit scoring model of detecting illegal cash advance based on Logistic

Sheng JIANG   

  • Received:2009-05-06 Revised:2009-07-13 Online:2009-11-26 Published:2009-11-01
  • Contact: Sheng JIANG

摘要: 信用卡套现是信用卡产业面临的一种主要风险。单笔的套现交易与正常交易间无显著差异,难以进行基于特征的过滤筛选。为自动、高效地识别套现账户,根据统计学特征先遴选出各相关变量,并结合业务分析,利用Logistic回归模型的非线性曲线特征缺陷,克服其自变量多维相关敏感性缺陷,计算出各变量的影响权重系数,构建了信用卡套现侦测评分模型。实践表明识别准确率达到82.72%。

关键词: 数据挖掘, Logistic回归, Logit变换, χ2检验, 积矩相关系数, 最大似然估计, 马氏距离, K-S统计量

Abstract: The illegal-cash-advance is one of the major fraud risks of the credit-card industry. There is almost no difference between single illegal-cash- advance transaction and the normal one, so it could not distinguish them based on different characteristics. To detect the illegal-cash-advance accounts automatically and accurately, the authors picked up correlative variables first, then made a business analysis, and took advantage of the nonlinear curve feature—the one defection of Logistic, and overcame the sensitivity of the multidimensional relativity between independent variables — the another defection, at last computed the weight of coefficient, and constructed a credit scoring model. The applications indicate that the accuracy of the model has achieved 82.72%.

Key words: data mining, Logistic regression, Logit transformation, χ2 test, product moment correlation coefficient, Maximum Likelihood Estimate (MLE), Mahalanobis distance, K-S statistic