[1] 雷君召,禹冰.电费回收及风险防范对策[J].煤炭技术,2013,32(6):280-282.(LEI J Z, YU B. Electricity recovery and risk prevention countermeasures[J]. Coal Technology, 2013, 32(6): 280-282.) [2] 张东霞,苗新,刘丽平,等.智能电网大数据技术发展研究[J].中国电机工程学报,2015,35(1):2-12.(ZHANG D X, MIAO X, LIU L P, et al. Research on development strategy for smart grid big data[J]. Proceedings of the CSEE, 2015, 35(1): 2-12.) [3] 宋亚奇,周国亮,朱永利.智能电网大数据处理技术现状与挑战[J].电网技术,2013,37(4):927-935.(SONG Y Q, ZHOU G L, ZHU Y L. Present status and challenges of big data processing in smart grid[J]. Power System Technology, 2013, 37(4): 927-935.) [4] WANG J, WEN Y. Application of data mining in arrear risks prediction of power customers[C]//KAM'08: Proceedings of the 2008 International Symposium on Knowledge Acquisition and Modeling. Piscataway, NJ: IEEE, 2008: 206-210. [5] 周晖,王毅,王玮,等.基于Logistic回归模型的电力客户欠费违约概率的预测[J].电网技术,2007,31(17):85-88. (ZHOU H,WANG Y,WANG W, et al. Predication of default probability of clients' electricity charges arrears based on Logistic regression model[J]. Power System Technology, 2007, 31(17): 85-88.) [6] 周晖,王毅,王玮,等.市场条件下电力客户欠费预警模型[J].中国电机工程学报,2008,28(22):107-112.(ZHOU H, WANG Y, WANG W, et al. Arrears forewarning model for power clients in electricity market[J]. Proceedings of the CSEE, 2008, 28(22): 107-112.) [7] KARAU H, KONWINSKI A, WENDELL P, et al. Learning spark: lightning-fast big data snalysis[M]. Sebastopol, CA: O'Reilly Media, 2015: 1-7. [8] 胡俊,胡贤德,程家兴.基于Spark的大数据混合计算模型[J].计算机系统应用,2015,24(4):214-218.(HU J, HU X D, CHEN J X. Big data hybrid computing mode based on spark[J]. Computer Systems & Applications, 2015, 24(4): 214-218) [9] Apache. Spark Lightning-fast cluster computing[EB/OL].[2015-12-02]. http://spark.apache.org/. [10] 谢桂兰,罗省贤.基于Hadoop MapReduce模型的应用研究[J].微型机与应用,2010,29(8):4-7.(XIE G L, LUO S X. Study on application of MapReduce model based on Hadoop[J]. Microcomputer & Its Applications, 2010, 29(8): 4-7.). [11] WANG B T, HUANG S, QIU J H, et al. Parallel online sequential extreme learning machine based on MapReduce[J]. Neurocomputing, 2015, 149(Part A): 224-232. [12] 曹正凤.随机森林算法优化研究[D].北京:首都经济贸易大学,2014:6-15. (CAO Z F. Study on optimization of random forests algorithm[D]. Beijing: Capital University of Economics and Business, 2014: 6-15.) [13] 陈慧萍,林莉莉,王建东,等.WEKA数据挖掘平台及其二次开发[J].计算机工程与应用,2008,44(19):76-79. (CHEN H P, LIN L L, WANG J D, et al. Data mining platform-WEKA and secondary development on WEKA[J]. Computer Engineering and Applications, 2008, 44(19): 76-79.) [14] CRYER J D, CHAN K S.时间序列分析及应用:R语言[M].2版.潘红宇,译.北京:机械工业出版社,2011:56-57. (CRYER J D, CHAN K S. Time Series Analysis with Applications in R[M]. 2nd ed. PAN H Y, translated. Beijing: China Machine Press, 2011: 56-57.) [15] HAN J W, KAMBER M, PEI J.数据挖掘:概念与技术[M].3版.范明,孟小峰,译.北京:机械工业出版社,2012:74-76.(HAN J W, KAMBER M, PEI J. Data Mining: Concepts and Techniques[M]. 3rd ed. FAN M, MENG X F, translated. Beijing: China Machine Press, 2012: 74-76.) [16] WITTEN I H, FRANK E, HALL M A.数据挖掘:实用机器学习工具与技术[M].3版.李川,张永辉,译.北京:机械工业出版社,2014:104-125.(WITTEN I H, FRANK E, HALL M A. Data Mining: Practial Machine Learning Tools and Techniques[M]. 3rd ed. LI C, ZHANG Y H, translated. Beijing: China Machine Press, 2014: 104-125.) |