Multi-intention recognition model combining syntactic features and convolutional neural network
YANG Chunni1, FENG Chaosheng1,2
1. School of Computer Science, Sichuan Normal University, Chengdu Sichuan 610101, China; 2. School of Information and Software Engineering, University of Electronic Science and Technology of China, Chengdu Sichuan 610054, China
Abstract: Multi-Intention (MI) recognition of short texts is a key problem in Spoken Language Understanding (SLU). Because short texts have sparse features and contain few words carrying much information, it is difficult to extract effective features for classification. To solve this problem, a multi-intention recognition model combining syntactic features and a Convolutional Neural Network (CNN) was proposed. Firstly, the sentence was syntactically parsed to determine whether it contained multiple intentions. Secondly, the number of intentions and a distance matrix were calculated using Term Frequency-Inverse Document Frequency (TF-IDF) and word embeddings. Then the distance matrix was used as the input of the CNN model to classify the intentions. Finally, the emotional polarity of each intention was judged so as to return the user's true intentions. Experiments were carried out on real data from an existing intelligent customer service system. The experimental results show that the single-intention classification precision of the model combining syntactic features and CNN reaches 93.5% over 10 intentions, which is 1.4 percentage points higher than that of the original CNN model without syntactic features; in multi-intention recognition, its classification precision is about 30 percentage points higher than that of the compared methods.
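The step of building a TF-IDF- and word-embedding-based distance matrix as CNN input can be illustrated with a minimal sketch. The code below is not the authors' implementation; it assumes whitespace-tokenised text, a pretrained embedding dictionary (embeddings), a hypothetical keyword list per intention (intention_keywords), a reference corpus for IDF estimation, and a token-by-keyword cosine-distance layout, all of which are illustrative choices rather than details taken from the paper.

    # Minimal sketch (not the authors' code): build a TF-IDF-weighted distance
    # matrix between a short text's tokens and a set of intention keywords,
    # which could then be fed to a CNN classifier as described in the abstract.
    import numpy as np
    from sklearn.feature_extraction.text import TfidfVectorizer

    def distance_matrix(text, intention_keywords, embeddings, corpus):
        """Return a (num_tokens x num_keywords) weighted cosine-distance matrix."""
        # Estimate IDF weights on the reference corpus (assumed pre-tokenised,
        # whitespace-separated documents).
        vec = TfidfVectorizer()
        vec.fit(corpus)
        idf = dict(zip(vec.get_feature_names_out(), vec.idf_))

        tokens = text.split()
        dist = np.zeros((len(tokens), len(intention_keywords)))
        for i, tok in enumerate(tokens):
            v = embeddings.get(tok)          # hypothetical word-embedding lookup
            if v is None:
                continue
            w = idf.get(tok, 1.0)            # rarer words get larger weights
            for j, kw in enumerate(intention_keywords):
                u = embeddings.get(kw)
                if u is None:
                    continue
                cos = np.dot(v, u) / (np.linalg.norm(v) * np.linalg.norm(u) + 1e-8)
                dist[i, j] = w * (1.0 - cos)  # TF-IDF-weighted cosine distance
        return dist

In such a setup, the resulting matrix could be zero-padded to a fixed size and passed to a standard convolutional classifier; the keyword list, embedding table and weighting scheme shown here are placeholders for whatever resources the production system actually provides.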
YANG Chunni, FENG Chaosheng. Multi-intention recognition model combining syntactic features and convolutional neural network[J]. Journal of Computer Applications, 2018, 38(7): 1839-1845.