计算机应用

• 数据库技术 • 上一篇    下一篇

一种改进的FP-Growth算法及其在业务关联中的应用

赵孝敏 何松华 李贤鹏 尹波   

  1. 湖南大学计算机与通信学院 湖南大学计算机与通信学院 湖南大学计算机与通信学院 湖南大学计算机与通信学院
  • 收稿日期:2008-03-12 修回日期:1900-01-01 发布日期:2008-09-01 出版日期:2008-09-01
  • 通讯作者: 赵孝敏

Improved FP-Growth algorithm and its applications in the business association

<a href="http://www.joca.cn/EN/article/advancedSearchResult.do?searchSQL=(((ZHAO Xiao-Min[Author]) AND 1[Journal]) AND year[Order])" target="_blank">ZHAO Xiao-Min</a> HE Song-Hua LI Xian-Peng YIN Bo   

  • Received:2008-03-12 Revised:1900-01-01 Online:2008-09-01 Published:2008-09-01
  • Contact: ZHAO Xiao-Min 赵孝敏

摘要: 基于FP-树的FP-Growth算法在挖掘频繁模式过程中需要递归地产生大量的条件FP-树,效率不高,并且不太适合应用在移动通信业务交叉销售等具有业务约束的关联规则挖掘中。因此,提出了基于项目约束的频繁模式树ICFP-树和直接在此树上进行挖掘的新算法——ICFP-Mine。理论分析和实验结果表明,ICFP-Mine算法在内存占用和时间开销等方面比FP-Growth算法更优越,在移动通信业务交叉销售领域的应用中取得了较好的效果。

关键词: 频繁模式, 项目约束, ICFP-树, 交叉销售

Abstract: The FP-Growth algorithm, based on FP-Tree, needs to create a large number of conditional FP-Trees recursively in the process of mining frequent patterns. It is not efficient and not good to apply in mobile communication business cross-selling, in which the association rules mining is business-constraint. Therefore, an items-constraint frequent pattern tree ICFP-Tree and a new ICFP-Mine algorithm which directly mines in the tree were proposed. Theoretical analysis and experimental results show that the ICFP-Mine algorithm is superior to FP-Growth algorithm in memory occupancy and time costs. It has achieved better results in the field of mobile communication business cross-selling applications.

Key words: frequent patterns, items-constraint, ICFP-Tree, cross-selling