Journal of Computer Applications ›› 2011, Vol. 31 ›› Issue (02): 435-437.

• Database and Data Mining •

Effectiveness and Implementation of Low Frequency Rule Based on Apriori Algorithm

Zhang Chunsheng (张春生)1, Zhuang Liyan (庄丽艳)2, Li Yan (李艳)2

  1. Inner Mongolia University for Nationalities
  2.
  • Received: 2010-07-27 Revised: 2010-09-15 Online: 2011-02-01 Published: 2011-02-01
  • Corresponding author: Zhang Chunsheng
  • Funding:
    Inner Mongolia Talent Fund; Inner Mongolia Education Research Project

Abstract: The classical Apriori algorithm rests on two conditions: rules are evaluated globally, over the whole transaction database, and only high-frequency itemsets are retained. This paper identifies that reliance as a defect and argues that low-frequency rules in a transaction database can be valid. Rules constructed from a C4.5 decision tree are used to show further that such low-frequency rules exist in transaction databases. On this basis, a low-frequency rule mining algorithm for Apriori is given. The algorithm is compatible with classical Apriori, yet it is not a simple extension of it: in theory it removes Apriori's dependence on the global and high-frequency conditions. Finally, an example database is mined with both the low-frequency rule mining algorithm and the C4.5 algorithm. The results show that the two methods are consistent, that low-frequency Apriori rules are valid, and that the low-frequency rule mining algorithm is effective.

Key words: Apriori algorithm, low frequency rule, effectiveness, C4.5 algorithm, data mining
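
The defect described in the abstract can be illustrated with the standard definitions of support and confidence. The sketch below is not the paper's algorithm, which the abstract does not specify. It only shows, on a hypothetical toy database, how a single global minimum-support threshold discards a rule whose confidence is perfect. The transactions, item names, function names, and thresholds are all assumptions made for illustration.

```python
# Hypothetical toy transaction database (not taken from the paper).
TRANSACTIONS = [
    {"bread", "milk"},
    {"bread", "milk"},
    {"bread", "milk", "butter"},
    {"bread", "butter"},
    {"milk", "butter"},
    {"bread", "milk"},
    {"bread", "milk"},
    {"bread", "milk"},
    {"caviar", "champagne"},
    {"caviar", "champagne"},
]


def count(itemset, transactions):
    """Number of transactions that contain every item in `itemset`."""
    itemset = frozenset(itemset)
    return sum(1 for t in transactions if itemset <= t)


def support(itemset, transactions):
    """Global support: fraction of all transactions containing `itemset`."""
    return count(itemset, transactions) / len(transactions)


def confidence(antecedent, consequent, transactions):
    """Confidence of antecedent -> consequent, computed from raw counts."""
    base = count(antecedent, transactions)
    if base == 0:
        return 0.0
    joint = count(set(antecedent) | set(consequent), transactions)
    return joint / base


def passes_classic_filter(antecedent, consequent, transactions,
                          min_support, min_confidence):
    """Classical Apriori-style acceptance: global support AND confidence."""
    joint_support = support(set(antecedent) | set(consequent), transactions)
    rule_confidence = confidence(antecedent, consequent, transactions)
    return joint_support >= min_support and rule_confidence >= min_confidence


if __name__ == "__main__":
    # Assumed thresholds, chosen only to make the contrast visible.
    MIN_SUPPORT, MIN_CONFIDENCE = 0.3, 0.8
    for lhs, rhs in [({"bread"}, {"milk"}), ({"caviar"}, {"champagne"})]:
        print(
            sorted(lhs), "->", sorted(rhs),
            "support:", support(lhs | rhs, TRANSACTIONS),
            "confidence:", confidence(lhs, rhs, TRANSACTIONS),
            "kept:", passes_classic_filter(
                lhs, rhs, TRANSACTIONS, MIN_SUPPORT, MIN_CONFIDENCE),
        )
```

In this toy data the rule caviar -> champagne holds in every transaction that contains caviar, yet it is rejected because only two of the ten transactions contain it. That is the kind of low-frequency but valid rule the abstract refers to.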