计算机应用 ›› 2011, Vol. 31 ›› Issue (07): 1781-1784.DOI: 10.3724/SP.J.1087.2011.01781

• 人工智能 • 上一篇    下一篇

基于机器学习的类目映射方法

靳雪茹1,齐建东1,王立臣2,周林志3   

  1. 1. 北京林业大学 信息学院,北京 100083
    2. 北京信息科技大学 自动化学院,北京 100192
    3. 北京航空航天大学 网络信息与计算中心,北京 100191
  • 收稿日期:2011-01-18 修回日期:2011-03-07 发布日期:2011-07-01 出版日期:2011-07-01
  • 通讯作者: 靳雪茹
  • 作者简介:靳雪茹(1986-),女,河北邢台人,硕士研究生,主要研究方向:分类法映射;齐建东(1976-),男,内蒙古赤峰人,副教授,博士,主要研究方向:计算机网络、智能信息处理;王立臣(1982-),男,吉林敦化人, 硕士研究生, 主要研究方向:虚拟现实、智能信息处理;周林志(1984-),男,浙江台州人,硕士研究生,主要研究生方向:计算机网络、智能信息处理。

Approach of classification mapping between international patent classification and chinese library classification based on machine learning

Xue-ru JIN1,Jian-dong QI1,Li-chen WANG2,Lin-zhi ZHOU3   

  1. 1. School of Information Science and Technology,Beijing Forestry University,Beijing 100083,China
    2. School of Automation,Beijing Information Science and Technology University,Beijing 100192,China
    3. Network Center,Beihang University,Beijing 100191,China
  • Received:2011-01-18 Revised:2011-03-07 Online:2011-07-01 Published:2011-07-01
  • Contact: Xue-ru JIN

摘要: 专利和期刊隶属于不同的知识组织体系,要实现专利与期刊文献的交叉浏览和检索必须解决两种分类法(中国图书馆分类法(CLC)和国际专利分类法(IPC))之间的映射问题。在调研现有分类法类目映射方法的基础上,讨论了基于机器学习实现中国图书馆分类法和国际专利分类法之间类目映射的方法。通过对中图法某个类目标识的语料进行训练得到该类目的分类器,然后用其对国际专利分类法标识的语料进行分类,对分类结果进行分析得出类目间的映射关系。对比实验证明了该方法的有效性。

关键词: 类目映射, 国际专利分类法, 中国图书馆分类法, 朴素贝叶斯, 决策树

Abstract: Patents and journals belong to different knowledge organization systems. To achieve the crossbrowsing and crossretrieval between journal literature and patents,the mapping problem between two classifications Chinese Library Classification (CLC) and International Patent Classification (IPC), must be addressed. According to the survey of the existing methods of classification mapping, this paper discussed a method to achieve the mapping between CLC and IPC based on machine learning. The learner was got by training the corpus identified by the CLC category, with which to classify the corpus identified by the IPC category. The mapping relations can be found after analyzing the classification results. And the comparison experiment proves the effectiveness of this method.

Key words: classification mapping, International Patent Classification (IPC), Chinese Library Classification (CLC), Naive Bayes, decision tree