[1] CHEN K T,CHANG C J,WU C C,et al. Quadrant of euphoria:a crowdsourcing platform for QoE assessment[J]. IEEE Network,2010,24(2):28-35.
[2] KOHLER T. Crowdsourcing-based business models[J]. California Management Review,2015,57(4):63-84.
[3] SCHUTTE N S,MALOUFF J M,THORSTEINSSON E B. Increasing emotional intelligence through training:current status and future directions[J]. International Journal of Emotional Education,2013,5(1):56-72.
[4] 韩文静,李海峰,阮华斌,等. 语音情感识别研究进展综述[J]. 软件学报,2014,25(1):37-50.(HAN W J,LI H F,RUAN H B,et al. Review on speech emotion recognition[J]. Journal of Software,2014,25(1):37-50.)
[5] LOCATELLO F,BAUER S,LUCIC M,et al. Challenging common assumptions in the unsupervised learning of disentangled representations[J]. Statistics,2019,2(6):238-249.
[6] WILLIS C G,LAW E,WILLIAMS A C,et al. CrowdCurio:an online crowdsourcing platform to facilitate climate change studies using herbarium specimens[J]. New Phytologist,2017,215(1):479-488.
[7] 李宏言,范利春,高鹏,等. 大数据语音语料库的社会标注技术[J]. 清华大学学报(自然科学版),2013,53(6):908-912.(LI H Y,FAN L C,GAO P,et al. Social annotation for large speech corpora[J]. Journal of Tsinghua University (Science and Technology),2013,53(6):908-912.)
[8] FU K,LI J,JIN J,et al. Image-text surgery:efficient concept learning in image captioning by generating pseudopairs[J]. IEEE Transactions on Neural Networks and Learning Systems,2018,29(12):5910-5921.
[9] GAO L,FAN K,SONG J,et al. Deliberate attention networks for image captioning[C]//Proceedings of the 33rd AAAI Conference on Artificial Intelligence. Palo Alto,CA:AAAI,2019:8320-8327.
[10] CHEN F,JI R,SUN X,et al. GroupCap:group-based image captioning with structured relevance and diversity constraints[C]//Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway:IEEE,2018:1345-1353.
[11] 徐世鹏,杨鸿武,王海燕. 面向藏语语音合成的语音基元自动标注方法[J]. 计算机工程与应用,2015,51(6):199-203.(XU S P,YANG H W,WANG H Y. Speech unit segmentation for Tibetan speech synthesis[J]. Computer Engineering and Applications,2015,51(6):199-203.)
[12] 唐素勤,孙亚茹,李志欣,等. 基于强化学习的壮语词性标注[J]. 计算机工程,2020,46(4):309-315.(TANG S Q,SUN Y R,LI Z X,et al. Part of speech tagging of Zhuang language based on reinforcement learning[J]. Computer Engineering,2020,46(4):309-315.)
[13] 傅睿博,陶建华,李雅,等. 基于静音时长和文本特征融合的韵律边界自动标注[J]. 清华大学学报(自然科学版),2018,58(1):61-66,74.(FU R B,TAO J H,LI Y,et al. Automatic prosodic boundary labeling based on fusing the silence duration with the lexical features[J]. Journal of Tsinghua University (Science and Technology),2018,58(1):61-66,74.)
[14] COWIE R,DOUGLAS-COWIE E,SAVVIDOU S,et al. 'FEELTRACE':an instrument for recording perceived emotion in real time[C]//Proceedings of the 2000 ISCA Tutorial and Research Workshop on Speech and Emotion. Belfast:International Speech Communication Association,2000:19-24.
[15] EYBEN F,WOLLMER M,SCHULLER B. openSMILE:the Munich versatile and fast open-source audio feature extractor[C]//Proceedings of the 18th ACM International Conference on Multimedia. New York:ACM,2010:1459-1462.
[16] MCKEOWN G,VALSTAR M F,COWIE R,et al. The SEMAINE corpus of emotionally coloured character interactions[C]//Proceedings of the 2010 IEEE International Conference on Multimedia and Expo. Piscataway:IEEE,2010:1079-1084.
[17] 陈盼弟,黄华,何凌. 基于自相关和倒谱法的基音检测改进算法[J]. 计算机应用与软件,2015,32(1):163-166.(CHEN P D,HUANG H,HE L. Improved algorithm for pitch detection based on ACF and CEP[J]. Computer Applications and Software,2015,32(1):163-166.)
[18] GHARAVIAN D,SHEIKHAN M,ASHOFTEDEL F. Emotion recognition improvement using normalized formant supplementary features by hybrid of DTW-MLP-GMM model[J]. Neural Computing and Applications,2013,22(6):1181-1191.
[19] NALINI N J,PALANIVEL S,BALASUBRAMANIAN M. Speech emotion recognition using residual phase and MFCC features[J]. International Journal of Engineering and Technology,2014,5(6):4515-4527.
[20] 魏云超,赵耀. 基于DCNN的图像语义分割综述[J]. 北京交通大学学报,2016,40(4):82-91.(WEI Y C,ZHAO Y. A review on image semantic segmentation based on DCNN[J]. Journal of Beijing Jiaotong University,2016,40(4):82-91.)
[21] ŚMIEJA M,TABOR J,SPUREK P. SVM with a neutral class[J]. Pattern Analysis and Applications,2019,22(2):573-582.
[22] WAN H,GUO S,YIN K,et al. CTS-LSTM:LSTM-based neural networks for correlated time series prediction[J]. Knowledge-Based Systems,2020,191:No. 105239.
[23] YUAN T,DENG C,SHI W. Speech emotion recognition based on fuzzy K-NN algorithm with fractionally spaced blind equalization[C]//Proceedings of the 2nd Workshop on Advanced Research and Technology in Industry Applications. Paris:Atlantis Press,2016:1806-1809.
[24] 陈康. 彝语方言研究[M]. 北京:中央民族大学出版社,2010:73-81.(CHEN K. A Study of Yi Dialect[M]. Beijing:China Minzu University Press,2010:73-81.)
[25] 庄莉. 彝族的语言使用情况调查[D]. 重庆:四川外国语大学,2015:47-49.(ZHUANG L. A survey of Yi language use[D]. Chongqing:Sichuan International Studies University,2015:47-49.)