面向阅读理解的句子组合模型

doi:10.11772/j.issn.1001-9081.2017.06.1741

计算机应用 ›› 2017, Vol. 37 ›› Issue (6): 1741-1746.DOI: 10.11772/j.issn.1001-9081.2017.06.1741

面向阅读理解的句子组合模型

王元龙

山西大学计算机与信息技术学院, 太原 030006

收稿日期:2016-11-21 修回日期:2017-02-06 出版日期:2017-06-10 发布日期:2017-06-14
通讯作者: 王元龙
作者简介:王元龙(1983-),男,山西大同人,讲师,博士,CCF会员,主要研究方向:虚拟现实、自然语言处理、高性能计算。
基金资助:
国家863计划项目（2015AA015407）；山西省自然科学基金资助项目（201601D102030）。

Sentence composition model for reading comprehension

WANG Yuanlong

School of Computer and Information Technology, Shanxi University, Taiyuan Shanxi 030006, China

Received:2016-11-21 Revised:2017-02-06 Online:2017-06-10 Published:2017-06-14
Supported by:
This work is partially supported by the National High Technology Research and Development Program (863 Program) of China (2015AA015407), the Natural Science Foundation of Shanxi Province (201601D102030).

摘要/Abstract

摘要： 阅读理解任务需要综合运用文本的表示、理解、推理等自然语言处理技术。针对高考语文中文学作品阅读理解的选项题问题，提出了基于分层组合模式的句子组合模型，用来实现句子级的语义一致性计算。首先，通过单个词和短语向量组成的三元组来训练一个神经网络模型；然后，通过训练好的神经网络模型来组合句子向量（两种组合方法：一种为递归方法；另一种为循环方法），得到句子的分布式向量表示。句子间的一致性利用两个句子向量之间的余弦相似度来表示。为了验证所提方法，收集了769篇模拟材料+13篇北京高考语文试卷材料（包括原文与选择题）作为测试集。实验结果表明，与传统最优的基于知网语义方法相比，循环方法准确率在高考材料中提高了7.8个百分点，在模拟材料中提高了2.7个百分点。

关键词: 自然语言理解, 句子组合模型, 阅读理解, 语义相似度计算

Abstract: The reading comprehension of document in Natural Language Processing (NLP) requires the technologies such as representation, understanding and reasoning on the document. Aiming at the choice questions of literature reading comprehension in college entrance examination, a sentence composition model based on the hierarchical composition model was proposed, which could achieve the semantic consistency measure at the sentence level. Firstly, a neural network model was trained by the triple consisted of single word and phrase vector. Then, the sentence vectors were combined by the trained neural network model (two composition methods:the recursion method and the recurrent method) to obtain the distributed vector of sentence. The similarity between sentences was presented by the cosine similarity between the two sentence vectors. In order to verify the proposed method, the 769 simulation materials and 13 Beijing college entrance examination materials (including the source text and the choice question) were collected as the test set. The experimental results show that, compared with the traditional optimal method based on HowNet semantics, the precision of the proposed recurrent method is improved by 7.8 percentage points in college entrance examination materials and 2.7 percentage points in simulation materials respectively.

Key words: natural language comprehension, sentence composition model, reading comprehension, semantic similarity computation

中图分类号:

TP391.1

王元龙. 面向阅读理解的句子组合模型[J]. 计算机应用, 2017, 37(6): 1741-1746.

WANG Yuanlong. Sentence composition model for reading comprehension[J]. Journal of Computer Applications, 2017, 37(6): 1741-1746.

参考文献

[1] CHEN D Q, BOLTON J, MANING C D. A thorough examination of the CNN/Daily Mail reading comprehension task[C]//Proceeding of the 2016 54th Annual Meeting of the Association for Computational Linguistics. Stroudsburg, PA:ACL, 2016:2359-2367.
[2] 刘知远,孙茂松,林衍凯,等.知识表示学习研究进展[J].计算机研究与发展,2016,53(2):247-261.(LIU Z Y, SUN M S, LIN Y K, et al. Knowledge representation learning:a review[J]. Journal of Computer Research and Development, 2016, 53(2):247-261.)
[3] TURNEY P D, PANTEL P. From frequency to meaning:vector space models of semantics[J]. Journal of Artificial Intelligence Research, 2010, 37(1):141-188.
[4] WIDDOWS D. Semantic vector products:some initial investigations[C/OL]//Proceedings of the 2008 Second AAAI Symposium on Quantum Interaction.[2016-10-09]. http://www.puttypeg.net/papers/semantic-vector-products.pdf.
[5] MARELLI M, BENTIVOGLI L, BARONI M, et al. Semeval-2014 Task 1:evaluation of compositional distributional semantic models on full sentences through semantic relatedness and textual entailment[C]//Proceedings of the 2014 8th International Workshop on Semantic Evaluation. Stroudsburg, PA:ACL, 2014:1-8.
[6] WIDDOWS D. Geometry and Meaning[M]. Stanford, CA:CSLI Publications, 2004:23-28.
[7] MITCHELL J, LAPATA M. Vector based models of semantic composition[C]//Proceedings of the 2008 Annual Meeting of the Association for Computational Linguistics. Stroudsburg, PA:ACL, 2008:236-244.
[8] BLACOE W, LAPATA M. A comparison of vector-based representations for semantic composition[C]//Proceeding of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning. Stroudsburg, PA:ACL, 2012:546-556.
[9] GUEVARA E. A regression model of adjective-noun compositionality in distributional semantics[C]//Proceedings of the 2010 Workshop on GEometrical Models of Natural Language Semantics. Stroudsburg, PA:ACL, 2010:33-37.
[10] MITCHELL J, LAPATA M. Composition in distributional models of semantics[J]. Cognitive Science, 2010, 34(8):1388-1429.
[11] SOCHER R, HUANG E, PENNINGTON J, et al. Dynamic pooling and unfolding recursive autoencoders for paraphrase detection[C]//Proceedings of the 2011 International Conference on Neural Information Processing Systems. Cambridge, MA:MIT Press, 2011:801-809.
[12] ZANZOTTO F M, KORKONTZELOS I, FALLUCCHI F, et al. Estimating linear models for compositional distributional semantics[C]//Proceedings of the 2010 23rd International Conference on Computational Linguistics. Stroudsburg, PA:ACL, 2010:1263-1271.
[13] SOCHER R, HUVAL B, MANNING C D, et al. Semantic compositionality through recursive matrix-vector spaces[C]//Proceedings of the 2012 Joint Conference on Empirical Methods in Natural language Processing and Computational Natural Language Learning. Stroudsburg, PA:ACL, 2012:1201-1211.
[14] GUEVARA E. A regression model of adjective-noun compositionality in distributional semantics[C]//Proceedings of the 2010 Workshop on GEometrical Models of Natural Language Semantics. Stroudsburg, PA:ACL, 2010:33-37.
[15] BARONI M, ZAMPARELLI R. Nouns are vectors, adjectives are matrices:representing adjective-noun constructions in semantic space[C]//Proceedings the 2010 Conference on Empirical Methods in Natural Language Processing. Stroudsburg, PA:ACL, 2010:1183-1193.
[16] PAPERNO D, PHAM N, BARONI M. A practical and linguistically-motivated approach to compositional distributional semantics[C]//Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics. Stroudsburg, PA:ACL, 2014:90-99.
[17] TAI K S, SOCHER R, MANNING C D. Improved semantic representations from tree-structured long short-term memory networks[C]//Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing. Stroudsburg, PA:ACL, 2015:1556-1566.
[18] ZAREMBA W, SUTSKEVER I. Learning to execute[EB/OL].[2016-10-09]. http://www.cs.nyu.edu/~zaremba/docs/Learning%20to%20Execute.pdf.
[19] 王智强,李茹,梁吉业,等.基于汉语篇章框架语义分析的阅读理解问答研究[J].计算机学报,2016,39(4):795-807.(WANG Z Q, LI R, LIANG J Y, et al. Research on question answering for reading comprehension based on Chinese discourse frame semantic parsing[J]. Chinese Journal of Computers, 2016, 39(4):795-807.)
[20] MIKOLOV T, CHEN K,CORRADO G, et al. Efficient estimation of word representations in vector space[EB/OL].[2016-10-09]. https://core.ac.uk/download/pdf/24794691.pdf.
[21] 张志昌,张宇,刘挺,等.基于浅层语义树核的阅读理解答案句抽取[J].中文信息学报,2008,22(1):80-86.(ZHANG Z C, ZHANG Y, LIU T, et al. Answer sentence extraction of reading comprehension based on shallow semantic tree kernel[J]. Journal of Chinese Information Processing, 2008, 22(1):80-86.)
[22] 朱征宇,孙俊华.改进的基于《知网》的词汇语义相似度计算[J].计算机应用,2013,33(8):2276-2279.(ZHU Z Y, SUN J H. Improved vocabulary semantic similarity calculation based on HowNet[J]. Journal of Computer Applications, 2013, 33(8):2276-2279.)

面向阅读理解的句子组合模型

Sentence composition model for reading comprehension

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 2

编辑推荐

Metrics

[1]	何正海, 线岩团, 王蒙, 余正涛. 融合句法指导与字符注意力机制的案情阅读理解方法[J]. 计算机应用, 2021, 41(8): 2427-2431.
[2]	张宗仁杨天奇. 基于自然语言理解的SPARQL本体查询[J]. 计算机应用, 2010, 30(12): 3397-3400.