计算机应用 ›› 2016, Vol. 36 ›› Issue (6): 1757-1761.DOI: 10.11772/j.issn.1001-9081.2016.06.1757

• 行业与领域应用 • 上一篇    下一篇

基于并行分类算法的电力客户欠费预警

陈羽中1,2, 郭松荣1,2, 陈宏3, 李婉华1,2, 郭昆1,2, 黄启成1,2   

  1. 1. 福州大学 数学与计算机科学学院, 福州 350116;
    2. 福建省网络计算与智能信息处理重点实验室, 福州 350116;
    3. 国网信通亿力科技有限责任公司, 福州 350001
  • 收稿日期:2015-12-04 修回日期:2016-03-16 出版日期:2016-06-10 发布日期:2016-06-08
  • 通讯作者: 郭昆
  • 作者简介:陈羽中(1979-),男,福建福州人,副教授,博士,CCF会员,主要研究方向:复杂网络、计算智能、数据挖掘;郭松荣(1991-),男,福建泉州人,硕士研究生,主要研究方向:云计算、数据挖掘;陈宏(1974-),男,福建龙岩人,高级工程师,主要研究方向:大数据;李婉华(1991-),女,福建泉州人,硕士研究生,主要研究方向:云计算、数据挖掘;郭昆(1979-),男,福建福州人,副教授,博士,CCF会员,主要研究方向:灰色系统理论、复杂大数据挖掘、云计算;黄启成(1991-),男,福建漳州人,硕士研究生,主要研究方向:云计算、数据挖掘。
  • 基金资助:
    国家自然科学基金资助项目(61300104,61300103);福建省科技创新平台建设项目(2009J1007);福建省自然科学基金资助项目(2013J01230,2013J01232);福建省高校杰出青年科学基金资助项目(JA12016);福建省高等学校新世纪优秀人才支持计划资助项目(JA13021);福建省教育厅科技重点项目(JK2012003);福建省科技厅产学重大项目(2014H6014)。

Electricity customers arrears alert based on parallel classification algorithm

CHEN Yuzhong1,2, GUO Songrong1,2, CHEN Hong3, LI Wanhua1,2, GUO Kun1,2, HUANG Qicheng1,2   

  1. 1. College of Mathematics and Computer Science, Fuzhou University, Fuzhou Fujian 350116, China;
    2. Fujian Provincial Key Laboratory of Network Computing and Intelligent Information Processing, Fuzhou Fujian 350116, China;
    3. State Grid ICT Billion of Technology Company Limited, Fuzhou Fujian 350001, China
  • Received:2015-12-04 Revised:2016-03-16 Online:2016-06-10 Published:2016-06-08
  • Supported by:
    This work is partially supported by the National Natural Science Foundation of China (61300104, 61300103), the Technology Innovation Platform Project of Fujian Province (2009J1007), the Natural Science Foundation of Fujian Province (2013J01230, 2013J01232), the Fujian Province High School Science Fund for Distinguished Young Scholars (JA12016), the Program for New Century Excellent Talents in Fujian Province University (JA13021), the Key Project of Fujian Education Committee (JK2012003), the Key Project of Industry-Academic Cooperation of Fujian Science and Technology Committee (2014H6014).

摘要: 针对供电企业"先消费后付款"的经营模式可能造成用电客户因失信引发的欠费风险,需要在用电客户欠费行为发生之前实时快速地分析海量的用电用户的数据,给出潜在的欠费客户名单的问题,提出一种基于并行分类算法的电力客户欠费预警方法。首先,该方法使用基于Spark的随机森林(RF)分类算法对欠费用户进行建模;其次,根据用户以往历史用电行为和缴费记录使用时间序列进行预测得到其未来用电和缴费行为特征;最后,使用之前得到的模型对用户进行分类得到未来潜在高危险欠费用户。将该方法与并行化后的支持向量机(SVM)算法和在线序列极限学习机(OSELM)算法进行对比分析,实验结果表明,所提方法相对于对比算法在准确率上有较大提高,便于电费回收管理人员进行提前催缴,确保电费回收的及时性,有利于电力企业进行客户欠费风险管理。

关键词: 欠费预警, 随机森林, 并行算法, 时间序列, 海量数据

Abstract: The "consumption first and replenishment afterward" operation model of the power supply companies may cause the risk of arrears due to poor credit of some power consumers. Therefore, it is necessary to analyze of the tremendous user data in real-time and quickly before the arrears' happening and provide a list of the potential customers in arrear. In order to solve the problem, a method for arrears alert of power consumers based on the parallel classification algorithm was proposed. Firstly, the arrear behaviors were modeled by the parallel Random Forest (RF) classification algorithm based on the Spark framework. Secondly, based on previous consumption behaviors and payment records, the future characteristics of consumption and payment behavior were predicted by time series. Finally, the list of the potential hig-risk customers in arrear was obtained by using the obtained model for classifying users. The proposed algorithm was compared with the parallel Support Vector Machine (SVM) algorithm and Online Sequential Extreme Learning Machine (OSELM) algorithm. The experimental results demonstrate that, the prediction accuracy of the proposed algorithm performs better than the other algorithms in comparison. Therefore, the proposed method is a convenient way for electricity recycling management to remind the customers of paying the electricity bills ahead of time, which can ensure timeliness electricity recovery. Moreover, the proposed method is also beneficial for consumer arrear risk management of the power supply companies.

Key words: arrears alert, Random Forest (RF), parallel algorithm, time series, massive data

中图分类号: