计算机应用 ›› 2013, Vol. 33 ›› Issue (11): 3049-3051.

• 数据库技术 • 上一篇    下一篇

空间关联规则的增量维护

董林,舒红   

  1. 武汉大学 测绘遥感信息工程国家重点实验室,武汉 430079
  • 收稿日期:2013-05-31 修回日期:2013-07-25 出版日期:2013-11-01 发布日期:2013-12-04
  • 通讯作者: 董林
  • 作者简介:董林(1984-),男,河北石家庄人,博士研究生,主要研究方向:空间数据挖掘;舒红(1970-),男,湖北黄冈人,教授,博士,主要研究方向:地理时空计算。
  • 基金资助:
    国家863计划项目;国家自然科学基金资助项目;地理空间信息工程国家测绘地理信息局重点实验室开放研究基金资助项目

Incremental maintenance of discovered spatial association rules

DONG Lin,SHU Hong   

  1. State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, Wuhan Hubei 430079, China
  • Received:2013-05-31 Revised:2013-07-25 Online:2013-12-04 Published:2013-11-01
  • Contact: DONG Lin

摘要: 为了得到有趣且有效的空间关联规则通常需要多次执行挖掘操作,可以使用增量维护算法来提高挖掘效率。然而,能够直接使用空间数据的关联规则增量更新算法尚属空白。为解决这一问题,对挖掘阈值改变和空间数据集更新后通过筛选或增量挖掘等方法实现规则维护的策略进行了分析,并提出适用于支持度阈值减小和空间图层增加这两类情况的增量挖掘算法——ISA。ISA算法不依赖于空间事务表的构建与更新,可以直接使用空间图层作为输入数据。在基于实际数据的实验中,采用ISA算法所得结果与类Apriori算法一致,耗时则相对缩短20.0%至71.0%;此外,对1372772条规则进行了基于筛选的更新,耗时低于0.1s。实验结果表明,所提出的空间关联规则增量维护策略和算法是可行、正确且高效的。

关键词: 空间数据, 关联规则, 增量更新, 空间分析, 数据挖掘

Abstract: Executing spatial association rule mining repeatedly is often necessary to get interesting and effective rules. Though incremental maintenance algorithms can be introduced to improve the efficiency of association rule mining, currently there exists no such algorithm that can use spatial datasets directly. To solve this problem, the update strategy of the discovered rules was discussed. Both threshold changes and spatial datasets updates were taken into consideration, and an incremental mining algorithm called Incremental Spatial Apriori (ISA) was suggested. ISA algorithm aimed to update frequent predicate sets and association rules after the minimum support threshold decreased or new spatial layers added. This algorithm did not rely on the creation and update of spatial transaction tables; it directly used spatial layers as input data. In experiments with real-world data, the mining result extracted by ISA and Apriori-like algorithms are identical, but ISA can save 20.0% to 71.0% time comparatively. Besides, 1372722 rules were successfully updated with the filtering method, costing less than 0.1 seconds. These results indicate the incremental update strategy and algorithm for spatial association rules suggested in this paper are correct, efficient and applicable.

Key words: spatial data, association rule, incremental update, spatial analysis, data mining

中图分类号: