计算机应用 ›› 2018, Vol. 38 ›› Issue (2): 491-496.DOI: 10.11772/j.issn.1001-9081.2017081938

• 数据科学与技术 • 上一篇    下一篇

带邻近作用的高增益率co-location模式挖掘

曾新, 李晓伟, 杨健   

  1. 大理大学 数学与计算机学院, 云南 大理 671003
  • 收稿日期:2017-08-09 修回日期:2017-09-12 出版日期:2018-02-10 发布日期:2018-02-10
  • 通讯作者: 曾新
  • 作者简介:曾新(1986-),男,湖北随州人,讲师,硕士,主要研究方向:空间数据挖掘;李晓伟(1985-),男,吉林通化人,讲师,博士,主要研究方向:信息安全、计算机网络;杨健(1976-),男,浙江上虞人,副教授,博士,主要研究方向:云计算、数据安全、隐私保护。
  • 基金资助:
    国家自然科学基金资助项目(71462001);云南省科技厅应用基础青年项目(2016FD071);云南省教育厅资助项目(2016ZZX192)。

Mining high gain rate co-location patterns with neighboring effection

ZENG Xin, LI Xiaowei, YANG Jian   

  1. College of Mathematics and Computer, Dali University, Dali Yunnan 671003, China
  • Received:2017-08-09 Revised:2017-09-12 Online:2018-02-10 Published:2018-02-10
  • Supported by:
    This work is partially supported by the National Natural Science Foundation of China (71462001), the Application Foundation Youth Project of Yunnan Provincial Science and Technology Department (2016FD071), the Project of Yunnan Provincial Education Department (2016ZZX192).

摘要: 大多数空间co-location模式挖掘将距离阈值作为衡量不同对象实例间邻近关系的标准,进而挖掘出频繁co-location模式,并没有考虑具有邻近关系的实例间的相互影响和模式的增益率问题。在空间co-location模式挖掘过程中,引入实例间的相互作用率和对象的季均收益,定义了对象作用率、套间总收益和增益率等概念,并提出挖掘高增益率co-location模式的基础算法(NAGA)和有效的剪枝算法(NAGA_JZ)。最后通过大量的实验来验证基础算法的正确性和实用性,并对基础算法和剪枝算法的挖掘效率进行了对比,验证了剪枝算法的高效性。

关键词: co-location模式, 邻近作用, 增益率, 高增益率模式

Abstract: For most spatial co-location pattern mining methods, distance threshold is used as a standard to measure the neighboring relation among instances of different objects, then to mine frequent co-location patterns, but the interation between instances with neighboring relations and the gain rate of patterns are not considered. In the spatial co-location patterns mining process, by introducing the interation rate between instances and the seasonal average income of objects, the concepts of object effect rate, suite total income and gain rate were defined, and a basic algorithm named NAGA and an efficient pruning algorithm named NAGA_JZ for mining high gain rate co-location patterns were put forward. Finally, a large number of experiments were carried out to verify the correctness and practicability of the basic algorithm, and the mining efficiency of the basic algorithm and the pruning algorithm were compared. The experimental results prove the high efficiency of the pruning algorithm.

Key words: co-location pattern, neighboring effection, gain rate, high gain rate pattern

中图分类号: