计算机应用 ›› 2012, Vol. 32 ›› Issue (08): 2212-2215.DOI: 10.3724/SP.J.1087.2012.02212

• 人工智能 • 上一篇    下一篇

求三支决策最优阈值的新算法

陈刚1,2,刘秉权2,吴岩2   

  1. 1. 广东科技学院 计算机系,广东 东莞 523083
    2. 哈尔滨工业大学 计算机科学与技术学院,哈尔滨 150001
  • 收稿日期:2012-02-23 修回日期:2012-04-16 发布日期:2012-08-28 出版日期:2012-08-01
  • 通讯作者: 陈刚
  • 作者简介:陈刚(1977-),男,江西高安人,讲师,硕士,CCF会员,主要研究方向:数据挖掘、人工智能;
    刘秉权(1970-),男,黑龙江哈尔滨人,副教授,博士,CCF高级会员,主要研究方向:自然语言处理、人工智能;
    吴岩(1965-),女,辽宁大连人,副教授,博士,主要研究方向:自然语言处理、人工智能。
  • 基金资助:
    国家自然科学基金资助项目(60803024)

New algorithm to get optimal threshold for three-decision-making

CHEN Gang1,2,LIU Bing-quan2,WU Yan2   

  1. 1. Department of Computer, Guangdong Univesity of Science and Technology, Dongguan Guangdong 523083, China
    2. School of Computer Sicence and Technology, Harbin Institute of Technology, Harbin Heilongjiang 150001, China
  • Received:2012-02-23 Revised:2012-04-16 Online:2012-08-28 Published:2012-08-01
  • Contact: CHEN Gang

摘要: 传统的三支决策模型是依靠专家经验来设置阈值的,从而阻碍了三支决策模型在许多领域的广泛应用。针对此不足,提出不需要依赖于专家经验的基于网格搜索的最优阈值生成算法,即以三支决策风险损失函数为模型,以决策风险最小为目标,以网格搜索为手段,以样本的条件概率为搜索空间,找出能使风险损失最小的参数组合——最优阈值。最后将以该算法得到的阈值构建的三支分类器与贝叶斯分类器分别应用于UCI部分数据集,结果显示三支分类器分类性能更优,从而说明该算法有效。

关键词: 三支决策, 最优阈值, 网格搜索, 风险损失, 数据集

Abstract: The traditional three-decision-making model relies on the experience of experts to set the threshold, thus impeding the wide application of three-decision-making model in many fields. To minimize the decision-making risk, a computational model of the risk-loss was built, and a new classification algorithm which needs no priori knowledge was given. The algorithm used model conditions to determine the range of parameters value which minimized the risk-loss, then divided the range into several equal grids, got the smallest range of parameters through searching these grids, and the smallest range was the optimal threshold. At last, a three-decision-making classifier was built by using the threshold, and then this classifier and Bayesian classifier were applied to part of UCI data sets. The comparison shows that the performance of three-decision-making classifier is superior, which shows the effectiveness of the algorithm.

Key words: three-decision-making, optimal threshold, gid searching, risk-loss, data set

中图分类号: