计算机应用 ›› 2020, Vol. 40 ›› Issue (10): 2845-2849.DOI: 10.11772/j.issn.1001-9081.2020020280
收稿日期:
2020-03-14
修回日期:
2020-04-22
出版日期:
2020-10-10
发布日期:
2020-05-18
通讯作者:
余正涛
作者简介:
王剑(1976-),男,浙江长兴人,副教授,硕士,主要研究方向:自然语言处理、机器学习、软件过程与演化;唐珊(1993-),女,辽宁义县人,硕士研究生,主要研究方向:自然语言处理、跨语言情感分析;黄于欣(1983-),男,河南洛阳人,博士研究生,CCF会员,主要研究方向:自然语言处理、文本摘要;余正涛(1970-),男,云南曲靖人,教授,博士,CCF会员,主要研究方向:自然语言处理、机器翻译、信息检索。
基金资助:
WANG Jian1,2, TANG Shan1,2, HUANG Yuxin1,2, YU Zhengtao1,2
Received:
2020-03-14
Revised:
2020-04-22
Online:
2020-10-10
Published:
2020-05-18
Supported by:
摘要: 传统的观点句识别多利用句子内部的情感特征进行分类,而在跨语言的多文档观点句识别任务中,不同语言、不同文档的句子之间具有密切的关联,这些关联特征对于观点句识别有一定的支撑作用。因此,提出一种基于双向长短期记忆(Bi-LSTM)网络框架并融入句子关联特征的汉越双语多文档新闻观点句识别方法。首先提取汉越双语句子的情感要素和事件要素,构建句子关联图,并利用TextRank算法得到句子关联特征;然后基于双语词嵌入和Bi-LSTM将汉语和越语的新闻文本编码在同一个语义空间;最后联合考虑句子编码特征和关联特征进行观点句识别。理论分析和模拟结果表明,融入句子关联图能够有效地提升多文档观点句识别的准确率。
中图分类号:
王剑, 唐珊, 黄于欣, 余正涛. 基于句子关联图的汉越双语多文档新闻观点句识别[J]. 计算机应用, 2020, 40(10): 2845-2849.
WANG Jian, TANG Shan, HUANG Yuxin, YU Zhengtao. Chinese-Vietnamese bilingual multi-document news opinion sentence recognition based on sentence association graph[J]. Journal of Computer Applications, 2020, 40(10): 2845-2849.
[1] KIM S M,HOVY E. Determining the sentiment of opinions[C]//Proceedings of the 20th International Conference on Computational Linguistics. Stroudsburg, PA:Association for Computational Linguistics,2004:1367-1373. [2] MIHALCEA R, BANEA C, WIEBE J. Learning multilingual subjective language via cross-lingual projections[C]//Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics. Stroudsburg, PA:Association for Computational Linguistics,2007:976-983. [3] 刘培玉, 荀静, 费绍栋, 等. 基于隐马尔可夫模型的主观句识别[J]. 中文信息学报,2016,30(4):206-212.(LIU P Y,XUN J, FEI S D,et al. Subjective sentence recognition based on hidden Markov model[J]. Journal of Chinese Information Processing, 2016,30(4):206-212.) [4] ALMEIDA M S C,PINTO C,FIGUEIRA H,et al. Aligning opinions:cross-lingual opinion mining with dependencies[C]//Proceeding of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing. Stroudsburg,PA:Association for Computational Linguistics,2015:408-418. [5] BANEA C, MIHALCEA R, WIEBE J. Porting multilingual subjectivity resources across languages[J]. IEEE Transactions on Affective Computing,2013,4(2):211-225. [6] BANEA C, MIHALCEA R, WIEBE J, et al. Multilingual subjectivity analysis using machine translation[C]//Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing. Stroudsburg, PA:Association for Computational Linguistics,2008:127-135. [7] ZHOU X,WAN X,XIAO J. Attention-based LSTM network for cross-lingual sentiment classification[C]//Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing. Stroudsburg, PA:Association for Computational Linguistics, 2016:247-256. [8] ZHOU X,WAN X,XIAO J. Cross-lingual sentiment classification with bilingual document representation learning[C]//Proceeding of the 54th Annual Meeting of the Association for Computational Linguistics. Stroudsburg, PA:Association for Computational Linguistics,2016:1403-1412. [9] ZHOU H,YANG Y,LIU Z,et al. Jointly learning bilingual sentiment and semantic representation for cross-language sentiment classification[C]//Proceedings of the 23rd China Conference on Information Retrieval, LNCS 10390. Cham:Springer, 2017:149-160. [10] 林思琦, 余正涛, 郭军军, 等. 融入多特征的汉越新闻观点句抽取方法[J]. 中文信息学报,2019,33(11):101-106.(LIN S Q, YU Z T,GUO J J,et al. Chinese-Vietnamese news perspective sentence extraction methods incorporating multiple features[J]. Journal of Chinese Information Processing, 2019, 33(11):101-106.) [11] 刘书龙. 汉越双语新闻观点句抽取及分析方法研究[D]. 昆明:昆明理工大学,2017:13-17.(LIU S L. Research on the analysis method of Chinese-Vietnamese bilingual news opinion sentences extraction[D]. Kunming:Kunming University of Science and Technology,2017:13-17.) [12] MAYFIELD E. Sentence diagram generation using dependency parsing[C]//Proceedings of the ACL-IJCNLP 2009 Student Research Workshop. Stroudsburg, PA:Association for Computational Linguistics,2009:45-53. [13] HOCHREITER S,SCHMIDHUBER J. LSTM can solve hard long time lag problems[C]//Proceedings of the 9th International Conference on Neural Information Processing Systems. Cambridge:MIT Press,1996:473-479. [14] MIHALCEA R,TARAU P. TextRank:bringing order into texts[C]//Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing. Stroudsburg,PA:Association for Computational Linguistics,2004:404-411. [15] KU L W,CHEN H H. Mining opinions from the Web:Beyond relevance retrieval[J]. Journal of the American Society for Information Science and Technology,2007,58(12):1838-1850. |
[1] | 吴军 欧阳艾嘉 张琳. 基于影响度的统计显著序列模式挖掘算法[J]. 计算机应用, 0, (): 0-0. |
[2] | 张璐 方春 祝铭. 基于Res2Net-YOLACT和融合特征的室内跌倒检测算法[J]. 计算机应用, 0, (): 0-0. |
[3] | 殷雨昌 王洪元 陈莉 冯尊登 肖宇. 基于单标注样本的多损失学习与联合度量视频行人重识别[J]. 计算机应用, 0, (): 0-0. |
[4] | 胡军 许正康 刘立 钟福金 张清华. 融合多粒度社区信息的网络嵌入方法[J]. 计算机应用, 0, (): 0-0. |
[5] | 李润泽 孙雪姣. 基于时间条件提取序列的数据流偏好查询[J]. 计算机应用, 0, (): 0-0. |
[6] | 罗圣钦 陈金怡 李洪均. 基于注意力机制的多尺度残差UNet实现乳腺癌灶分割[J]. 计算机应用, 0, (): 0-0. |
[7] | 曹一珉 蔡磊 高敬阳. 基于生成对抗网络的基因数据生成方法[J]. 计算机应用, 0, (): 0-0. |
[8] | 陈冲 闫珠 赵继轩 何为 梁华庆. 基于集合经验模态分解和长短期记忆网络的催化裂化装置NOx排放预测[J]. 计算机应用, 0, (): 0-0. |
[9] | 徐光柱 林文杰 陈莎 匡婉 雷帮军 周军. U-Net与自适应阈值脉冲耦合神经网络相结合的眼底血管分割方法[J]. 计算机应用, 0, (): 0-0. |
[10] | 杨鼎康 黄帅 王顺利 翟鹏 李一丹 张立华. 基于对抗生成网络和网络集成的面部表情识别方法EE-GAN[J]. 计算机应用, 0, (): 0-0. |
[11] | 李讷 徐光柱 雷帮军 马国亮 石勇涛. 交通道路行驶车辆车标识别算法[J]. 计算机应用, 0, (): 0-0. |
[12] | 孟杰 王莉 杨延杰 廉飚. 基于多模态深度融合的虚假信息检测[J]. 计算机应用, 0, (): 0-0. |
[13] | 秦庭威 赵鹏程 秦品乐 曾建朝 柴锐 黄永琦. 基于残差注意力机制的点云配准算法[J]. 计算机应用, 0, (): 0-0. |
[14] | 鲁永帅 唐英杰 马鑫然. 基于深度特征融合的无纺布低对比度浆丝缺陷检测方法[J]. 计算机应用, 0, (): 0-0. |
[15] | 王宇航 周永霞 吴良武. 基于高斯函数的池化算法[J]. 计算机应用, 0, (): 0-0. |
阅读次数 | ||||||
全文 |
|
|||||
摘要 |
|
|||||