Journal of Computer Applications ›› 2017, Vol. 37 ›› Issue (9): 2678-2683.DOI: 10.11772/j.issn.1001-9081.2017.09.2678

Previous Articles     Next Articles

Interval-value attribute reduction algorithm for meteorological observation data based on genetic algorithm

ZHENG Zhongren1, CHENG Yong2, WANG Jun1,2, ZHONG Shuiming1, XU Liya3   

  1. 1. School of Computer & Software, Nanjing University of Information Science and Technology, Nanjing Jiangsu 210044, China;
    2. Information Construction and Management Department, Nanjing University of Information Science and Technology, Nanjing Jiangsu 210044, China;
    3. School of Information Science and Technology, Jiujiang University, Jiujiang Jiangxi 332005, China
  • Received:2017-03-17 Revised:2017-04-25 Online:2017-09-10 Published:2017-09-13
  • Supported by:
    This work is partially supported by the National Natural Science Foundation of China (61402236, 61373064), the "Six Talent Peaks Project in Jiangsu Province (2015-DZXX-015), CERNET Innovation Project (NGⅡ20160318).

基于遗传算法的气象观测数据区间值属性约简算法

郑忠仁1, 程勇2, 王军1,2, 钟水明1, 徐利亚3   

  1. 1. 南京信息工程大学 计算机与软件学院, 南京 210044;
    2. 南京信息工程大学 信息化建设与管理处, 南京 210044;
    3. 九江学院 信息科学与技术学院, 江西 九江 332005
  • 通讯作者: 郑忠仁,zrzheng@foxmail.com
  • 作者简介:郑忠仁(1991-),男,江苏淮安人,硕士研究生,主要研究方向:大数据;程勇(1980-),男,重庆人,高级工程师,博士,CCF会员,主要研究方向:无线传感器网络、大数据;王军(1970-),男,安徽铜陵人,教授,博士,CCF会员,主要研究方向:无线传感器网络、大数据;钟水明(1971-),男,江西瑞金人,讲师,博士,CCF会员,主要研究方向:人工神经网络、模式识别、数据挖掘;徐利亚(1984-),男,江西九江人,讲师,博士,主要研究方向:无线传感器网络、大数据。
  • 基金资助:
    国家自然科学基金资助项目(61402236, 61373064); 江苏省"六大人才高峰"项目(2015-DZXX-015); 赛尔网络下一代互联网技术创新项目(NGⅡ20160318)。

Abstract: Aiming at the problems that the purpose of the meteorological observation data acquisition is weak, the redundancy of data is high, and the number of single values in the observation data interval is large, the precision of equivalence partitioning is low, an attribute reduction algorithm for Meteorological Observation data Interval-value based on Genetic Algorithm (MOIvGA) was proposed. Firstly, by improving the similarity degree of interval value, the proposed algorithm could be suitable for both single value equivalence relation judgment and interval value similarity analysis. Secondly, the convergence of the algorithm was improved by the improved adaptive genetic algorithm. Finally, the simulation experiments show that the number of the iterations of the proposed algorithm is reduced by 22, compared with the method which operated AGAv (Adaptive Genetic Attribute reduction) algorithm to solve the optimal value. In the time interval of 1 hour precipitation classification, the average classification accuracy of the MOIvGA (λ-Reduction in Interval-valued decision table based on Dependence) algorithm is 6.3% higher than that of RIvD algorithm; the accuracy of no rain forecasting is increased by 7.13%; at the same time, the classification accuracy can be significantly impoved by the attribute subset received by operating the MOIvGA algorithm. Therefore, the MOIvGA algorithm can increase the convergence rate and the classification accuracy in the analysis of interval value meteorological observation data.

Key words: meteorological observation data, attribute reduction, interval-value similarity, genetic algorithm, attribute subset

摘要: 针对气象观测数据采集目的性弱、数据冗余度较高以及观测数据区间化中单值较多、等价类划分精度低的问题,提出一种基于遗传算法的气象观测数据区间值属性约简算法(MOIvGA)。首先,通过改进区间值相似度,使其能够同时适用于单值等价关系判断和区间值相似度分析;其次,通过改进自适应遗传算法,提高其收敛性;最后,通过仿真实验证明,相对于运行自适应遗传属性约简(AGAv)算法求解最优值,所提算法迭代代数减少了22代;在区间长度为1 h降水分类中,基于依赖度的区间值决策表λ-约简(MOIvGA)平均分类准确率比RIvD算法提高了6.3%,对无雨的预测准确率提高了7.13%;同时约简后的属性子集显著提高了分类准确率。由此可见,MOIvGA在区间值气象观测数据分析中能够提高收敛速度以及分类准确率。

关键词: 气象观测数据, 属性约简, 区间值相似度, 遗传算法, 属性子集

CLC Number: