计算机应用 ›› 2013, Vol. 33 ›› Issue (10): 2796-2800.

• 人工智能 • 上一篇    下一篇

基于Apriori的相容数据集间关联规则演绎方法

张春生,庄丽艳   

  1. 内蒙古民族大学 计算机科学与技术学院,内蒙古 通辽 028043
  • 收稿日期:2013-04-24 修回日期:2013-06-12 出版日期:2013-10-01 发布日期:2013-11-01
  • 通讯作者: 张春生
  • 作者简介:张春生(1965-),男,河北乐亭人,教授,主要研究方向:数据库、数据挖掘、软件理论;庄丽艳(1976-),女,内蒙古通辽人,讲师,硕士,主要研究方向:人工智能。
  • 基金资助:
    国家自然科学基金资助项目;内蒙古自然科学基金资助项

Deductive method of association rules among compatible datasets based on Apriori

ZHANG Chunsheng,ZHUANG Liyan   

  1. College of Computer Science and Technology, Inner Mongolia University for Nationalities,Tongliao Inner Mongolia 028043,China
  • Received:2013-04-24 Revised:2013-06-12 Online:2013-11-01 Published:2013-10-01
  • Contact: ZHANG Chunsheng

摘要: Apriori关联规则数据挖掘算法只针对一类相关数据集进行数据挖掘,而现实世界中各种不同的数据集非常庞大,如何在不相关数据集间进行数据挖掘,拓展规则的数量具有挑战性。目前Apriori关联规则算法研究基本上集中在算法性能优化和针对不同数据形式的基础上,没有突破不相关数据集的界限。针对这个问题,首先给出了相关数据集、不相关数据集、相容数据集的概念,进一步给出了一种基于Apriori的不相关数据集中相容数据集间的关联规则演绎算法,给出了算法演绎规则,通过构建法证明了算法的正确性。通过实例演示了应用方法,该算法可实现基于Apriori的相容数据集间关联规则的规则演绎,是普通数据挖掘算法无法实现的,扩展了关联规则算法的应用领域;同时,由于关联规则是在相容数据集上独立挖掘出来的,没有进行原始数据间的交换,在一定程度上实现了隐私保护

关键词: 相容数据集, 关联规则, 规则演绎, Apriori算法

Abstract: Data mining algorithm based on Apriori of association rules mines data only for a class of correlated datasets. However, various datasets are very large in the real world, and how to mine data among uncorrelated datasets and how to expand the number of rules are the challenging issues. The study of Apriori algorithm of association rules basically focus on the performance optimization of algorithm and different data forms at present, which does not breakthrough the limit of the uncorrelated datasets. For this, the concepts of correlated datasets, uncorrelated datasets and compatible datasets were given in the paper, furthermore a deductive method of association rules among uncorrelated datasets based on Apriori was given in this paper, and in which deductive rules of the algorithm were given. The correctness of the algorithm was proved by construction method, and the application method was demonstrated by examples. The algorithm can realize rules deduction among correlated rules based on Apriori for uncorrelated datasets, which cannot be realized by common data mining algorithms. The algorithm expands the application field of correlated rules algorithm; meanwhile, it realizes the privacy protection in a certain extent because the rules are mined independently out on the basis of compatible datasets and have not shared original data.

Key words: compatible datasets, association rules, rules deduction, Apriori algorithm

中图分类号: