Text categorization rule extraction based on fuzzy decision tree

doi:10.3724/SP.J.1087.2005.01634

Journal of Computer Applications, 2005, Vol. 25, Issue (07): 1634-1637. DOI: 10.3724/SP.J.1087.2005.01634

• Artificial intelligence •


WANG Yu¹,², WANG Zheng-ou¹

1. Institute of Systems Engineering, Tianjin University;
2. School of Mathematics and Computer Science, Hebei University

Received: 2005-01-14; Revised: 2005-02-25; Online: 2011-04-22; Published: 2005-07-01

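Postal addresses: 1. Institute of Systems Engineering, Tianjin University, Tianjin 300072; 2. School of Mathematics and Computer Science, Hebei University, Baoding, Hebei 071002

About the authors: WANG Yu (born 1971), female, from Baoding, Hebei; lecturer and Ph.D. candidate; main research interest: text mining. WANG Zheng-ou (born 1938), male, from Shanghai; professor and doctoral supervisor; main research interests: neural networks, data mining, and knowledge discovery.
Supported by:
National Natural Science Foundation of China (60275020)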

Abstract

A new method is presented that classifies similar text categories with a fuzzy decision tree whose branches are merged, and that extracts fuzzy categorization rules with relatively high accuracy. The χ² statistic is first analyzed and improved. Feature terms of the text are then aggregated according to the improved χ² statistic, which greatly reduces the dimension of the text vector space. A fuzzy decision tree with merged branches is then applied to text categorization, and merging branches substantially reduces the number of extracted rules. The method thus preserves the accuracy and speed of decision tree classification while yielding understandable fuzzy categorization rules.

Key words: similar text categorization, rule extraction, χ² statistic, fuzzy decision tree
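
The abstract refers to an improved χ² statistic, but this page does not give its formula. As a point of reference only, the sketch below computes the standard χ² term-category statistic commonly used for feature selection in text categorization, not the improved variant proposed in the paper. The function names, the document representation (a set of terms plus a label), and the toy corpus are all assumptions made for illustration.

```python
from typing import Iterable, Set, Tuple


def chi_square(a: int, b: int, c: int, d: int) -> float:
    """Standard chi-square score from a 2x2 term/category contingency table.

    a: documents in the category that contain the term
    b: documents outside the category that contain the term
    c: documents in the category that lack the term
    d: documents outside the category that lack the term
    """
    n = a + b + c + d
    denom = (a + c) * (b + d) * (a + b) * (c + d)
    if denom == 0:
        # Degenerate table (an empty row or column): treat the term as uninformative.
        return 0.0
    return n * (a * d - c * b) ** 2 / denom


def term_category_chi_square(
    docs: Iterable[Tuple[Set[str], str]], term: str, category: str
) -> float:
    """Count the four contingency cells over labeled documents, then score."""
    a = b = c = d = 0
    for terms, label in docs:
        has_term = term in terms
        in_category = label == category
        if has_term and in_category:
            a += 1
        elif has_term:
            b += 1
        elif in_category:
            c += 1
        else:
            d += 1
    return chi_square(a, b, c, d)


# Hypothetical toy corpus: each document is (set of terms, category label).
docs = [
    ({"goal", "match"}, "sports"),
    ({"goal", "team"}, "sports"),
    ({"market", "stock"}, "finance"),
    ({"stock", "goal"}, "finance"),
]

stock_score = term_category_chi_square(docs, "stock", "finance")
goal_score = term_category_chi_square(docs, "goal", "sports")
```

In this toy corpus "stock" separates the two categories perfectly and so scores higher than "goal", which also appears in a finance document. Terms with low scores for every category are the usual candidates for removal when reducing the vector space dimension.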

CLC Number:

TP391.1

WANG Yu, WANG Zheng-ou. Text categorization rule extraction based on fuzzy decision tree[J]. Journal of Computer Applications, 2005, 25(07): 1634-1637.


