计算机应用 ›› 2019, Vol. 39 ›› Issue (6): 1639-1645.DOI: 10.11772/j.issn.1001-9081.2018102184

• 人工智能 • 上一篇    下一篇

基于注意力和字嵌入的中文医疗问答匹配方法

陈志豪1, 余翔1, 刘子辰2, 邱大伟2, 顾本刚1   

  1. 1. 重庆邮电大学 通信与信息工程学院, 重庆 400065;
    2. 移动计算与新型终端北京重点实验室(中国科学院 计算技术研究所), 北京 100190
  • 收稿日期:2018-10-31 修回日期:2018-12-31 出版日期:2019-06-10 发布日期:2019-06-17
  • 通讯作者: 陈志豪
  • 作者简介:陈志豪(1994-),男,重庆人,硕士研究生,主要研究方向:自然语言处理、智能问答系统;余翔(1964-),男,四川成都人,正高级工程师,主要研究方向:数字通信、无线信号处理;刘子辰(1985-),男,山东临沂人,助理研究员,主要研究方向:网络通信、大数据挖掘;邱大伟(1991-),男,内蒙古赤峰市人,博士研究生,主要研究方向:模式识别、机器学习、自然语言处理;顾本刚(1992-),男,安徽淮南人,硕士研究生,主要研究方向:网络通信。
  • 基金资助:
    国家重大科技专项(2016ZX03002010-003)。

Chinese medical question answer matching method based on attention mechanism and character embedding

CHEN Zhihao1, YU Xiang1, LIU Zichen2, QIU Dawei2, GU Bengang1   

  1. 1. School of Communication and Information Engineering, Chongqing University of Posts and Telecommunications, Chongqing 400065, China;
    2. Beijing Key Laboratory of Mobile Computing and Pervasive Device(Institute of Computing Technology, Chinese Academy of Sciences), Beijing 100190, China
  • Received:2018-10-31 Revised:2018-12-31 Online:2019-06-10 Published:2019-06-17
  • Supported by:
    This work is partially supported by National Science and Technology Major Project (2016ZX03002010-003).

摘要: 针对当前的分词工具在中文医疗领域无法有效切分出所有医学术语,且特征工程需消耗大量人力成本的问题,提出了一种基于注意力机制和字嵌入的多尺度卷积神经网络建模方法。该方法使用字嵌入结合多尺度卷积神经网络用以提取问题句子和答案句子不同尺度的上下文信息,并引入注意力机制来强调问题和答案句子之间的相互影响,该方法能有效学习问题句子和正确答案句子之间的语义关系。由于中文医疗领域问答匹配任务没有标准的评测数据集,因此使用公开可用的中文医疗问答数据集(cMedQA)进行评测,实验结果表明该方法优于词匹配、字匹配和双向长短时记忆神经网络(BiLSTM)建模方法,并且Top-1准确率为65.43%。

关键词: 自然语言处理, 问答对匹配, 卷积神经网络, 字嵌入, 注意力机制

Abstract: Aiming at the problems that the current word segmentation tool can not effectively distinguish all medical terms in Chinese medical field, and feature engineering has high labor cost, a multi-scale Convolutional Neural Network (CNN) modeling method based on attention mechanism and character embedding was proposed. In the proposed method, character embedding was combined with multi-scale CNN to extract context information at different scales of question and answer sentences, and attention mechanism was introduced to emphasize the interaction between question sentences and answer sentences, meanwhile the semantic relationship between the question sentence and the correct answer sentence was able to be effectively learned. Since the question and answer matching task in Chinese medical field does not have a standard evaluation dataset, the proposed method was evaluated using the publicly available Chinese Medical Question and Answer dataset (cMedQA). The experimental results show that the proposed method is superior to word matching, character matching and Bi-directional Long Short-Term Memory network (BiLSTM) modeling method, and the Top-1 accuracy is 65.43%.

Key words: natural language processing, question answer matching, Convolutional Neural Network (CNN), character embedding, attention mechanism

中图分类号: