Journal of Computer Applications ›› 2016, Vol. 36 ›› Issue (2): 465-471.DOI: 10.11772/j.issn.1001-9081.2016.02.0465

Previous Articles     Next Articles

Patent knowledge extraction method for innovation design

MA Jianhong, ZHANG Mingyue, ZHAO Yanan   

  1. School of Computer Science and Engineering, Hebei University of Technology, Tianjin 300401, China
  • Received:2015-08-29 Revised:2015-09-14 Online:2016-02-10 Published:2016-02-03

面向创新设计的专利知识抽取方法

马建红, 张明月, 赵亚男   

  1. 河北工业大学 计算机科学与软件学院, 天津 300401
  • 通讯作者: 马建红(1965-),女,河北保定人,教授,博士生导师,博士,CCF会员,主要研究方向:软件工程、计算机辅助创新设计、发明问题解决理论、计算机辅助教学软件。
  • 作者简介:张明月(1990-),女,河北邢台人,硕士研究生,主要研究方向:计算机辅助创新设计、软件工程、专利分析、自然语言处理;赵亚男(1990-),女,河北衡水人,硕士研究生,主要研究方向:软件工程、专利分析、自然语言处理。
  • 基金资助:
    河北省科技厅工作专项资助项目(2014055907)。

Abstract: Patent contains lots of information about background, technology, function and so on, which plays an important role in innovation field. Patent is something created by innovation knowledge, at the same time, it promotes us to make more use of innovation knowledge and break the inherent thinking and the limitation of knowledge, which inspires designers in the process of product design. From the term of innovation design, a new method for extracting innovation knowledge was proposed based on combination feature and maximum entropy classifier. The natural language processing was used, patent terms recognition algorithm was given, and word feature and syntactic feature of the closed package tree in the shortest path were jointed to compute the middle result. After that, the maximum entropy algorithm was applied to extract innovation knowledge based on semantic analysis and mark the attributes of knowledge. The results show that the combination feature can effectively deal with patent issues which need to be solved, and the relationships among the semantic role of knowledge innovation about target function, function principle and position feature in the technical scheme.

Key words: semantic analysis, patent knowledge, innovation knowledge, information extraction, semantic role, maximum entropy

摘要: 专利蕴含丰富的背景、技术、功能等知识,对创新设计领域起着重要的作用。对创新知识进行有效提取,能推动人们对知识的利用,助于突破固有的思维定势及知识面的限制,启发设计者从独特、新颖的角度进行产品设计。从创新设计的角度,提出基于组合特征和最大熵分类器的专利创新知识抽取方法。该方法运用自然语言处理方法,增加专利领域术语识别算法,联合词特征和最短路径闭包树句法特征,最后采用最大熵进行基于语义分析的知识提取,并对知识属性进行标注。实验结果表明,引入组合特征,能高效地处理专利要解决的问题,以及技术方案中的目标功能、作用原理、位置特征等创新知识之间的语义角色关系。

关键词: 语义分析, 专利知识, 创新知识, 信息抽取, 语义角色, 最大熵

CLC Number: