Journal of Computer Applications ›› 2021, Vol. 41 ›› Issue (8): 2427-2431. DOI: 10.11772/j.issn.1001-9081.2020101568

Special topic: The 8th CCF Big Data Conference (CCF Bigdata 2020)

• The 8th CCF Big Data Conference •

Case reading comprehension method combining syntactic guidance and character attention mechanism

HE Zhenghai1,2, XIAN Yantuan1,2, WANG Meng1,2, YU Zhengtao1,2

  1. Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming Yunnan 650504, China;
  2. Yunnan Key Laboratory of Artificial Intelligence (Kunming University of Science and Technology), Kunming Yunnan 650504, China
  • Received: 2020-07-15 Revised: 2020-09-18 Online: 2021-08-10 Published: 2021-01-27
  • Corresponding author: XIAN Yantuan
  • About the authors: HE Zhenghai (1991-), male, born in Lanzhou, Gansu, M.S. candidate, CCF member; research interests: natural language processing, information retrieval. XIAN Yantuan (1981-), male, born in Kunming, Yunnan, associate professor, Ph.D. candidate, CCF member; research interests: natural language processing, information extraction. WANG Meng (1981-), male, born in Kunming, Yunnan, associate professor, Ph.D.; research interests: computer vision, image processing, machine learning, speech recognition. YU Zhengtao (1970-), male, born in Kunming, Yunnan, professor, Ph.D., CCF member; research interests: natural language processing, information retrieval, machine translation, machine learning, intelligent systems, decision analysis.
  • Supported by:
    This work is partially supported by the National Key Research and Development Program of China (2018YFC0830100) and the Special Project of Basic Research of Yunnan Province (202001AT070046).

Abstract: Case reading comprehension is the application of machine reading comprehension to the judicial domain: a computer reads judgment documents and answers questions about them, which makes it one of the important applications of judicial intelligence. The mainstream approach to machine reading comprehension uses deep learning models to encode the words of a text and thereby obtain a vector representation of the text. The core problems in building such a model are how to obtain a semantic representation of the text and how to match the question against the context. Because syntactic information helps a model learn the backbone of a sentence, and because Chinese characters carry latent semantic information, a case reading comprehension method that combines syntactic guidance with a character attention mechanism is proposed. Fusing syntactic information with Chinese character information improves the model's ability to encode case text. Experimental results on the reading comprehension dataset of Law Research Cup 2019 (法研杯 2019, also known as CAIL 2019) show that, compared with the baseline model, the proposed method improves the Exact Match (EM) value by 0.816 and the F1 value by 1.809%.

Key words: reading comprehension, judgment document, character attention, syntactically guided attention, deep learning
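
The abstract reports results as EM and F1. As a rough illustration of what these two scores measure for a Chinese answer span, the sketch below computes both for a single prediction. It assumes character-level tokens and a SQuAD-style bag-of-tokens overlap; the function names `normalize`, `exact_match`, and `char_f1` are hypothetical, and the official evaluation script for the dataset may tokenize or normalize differently.

```python
from collections import Counter


def normalize(text: str) -> list[str]:
    """Split an answer into characters, dropping whitespace (an assumed normalization)."""
    return [ch for ch in text if not ch.isspace()]


def exact_match(prediction: str, reference: str) -> float:
    """Return 1.0 if the normalized prediction equals the normalized reference, else 0.0."""
    return float(normalize(prediction) == normalize(reference))


def char_f1(prediction: str, reference: str) -> float:
    """Character-level F1: harmonic mean of precision and recall over shared characters."""
    pred_chars = normalize(prediction)
    ref_chars = normalize(reference)
    if not pred_chars or not ref_chars:
        # Both empty counts as a match; exactly one empty counts as a miss.
        return float(pred_chars == ref_chars)
    # Multiset intersection counts each shared character at most min(count) times.
    overlap = sum((Counter(pred_chars) & Counter(ref_chars)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_chars)
    recall = overlap / len(ref_chars)
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    # Made-up strings for illustration only, not taken from the dataset.
    print(exact_match("昆明市", "昆明市"), char_f1("昆明市", "昆明"))
```

A dataset-level score is typically the mean of these per-question values, usually taking the best score over the available reference answers for each question.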

CLC number: