Journal of Computer Applications, 2022, Vol. 42, Issue (6): 1762-1769. DOI: 10.11772/j.issn.1001-9081.2021061390

• Papers of the 2021 National Annual Conference on Open Distributed and Parallel Computing (DPCS 2021) •

基于单标签射频识别的唇语识别算法

张瑛琪, 彭大卫, 李森, 孙莹, 牛强

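  1. School of Computer Science and Technology, China University of Mining and Technology, Xuzhou, Jiangsu 221116
  • Received: 2021-08-03 Revised: 2021-08-31 Accepted: 2021-10-15 Released: 2022-01-10 Published: 2022-06-10
  • Corresponding author: NIU Qiang
  • About the authors: ZHANG Yingqi (born 1996), female, from Yingkou, Liaoning, M. S. candidate, CCF member. Her main research interests are the Internet of Things and wireless sensing.
    PENG Dawei (born 2001), male, from Pizhou, Jiangsu. His main research interest is wireless sensing.
    LI Sen (born 1998), male, from Zhengzhou, Henan. His main research interest is wireless sensing.
    SUN Ying (born 2000), female, from Taixing, Jiangsu. Her main research interest is wireless sensing.
  • Funding:
    National Natural Science Foundation of China (51674255)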

Lip language recognition algorithm based on single-tag radio frequency identification

Yingqi ZHANG, Dawei PENG, Sen LI, Ying SUN, Qiang NIU

  1. School of Computer Science and Technology, China University of Mining and Technology, Xuzhou, Jiangsu 221116, China
  • Received: 2021-08-03 Revised: 2021-08-31 Accepted: 2021-10-15 Online: 2022-01-10 Published: 2022-06-10
  • Contact: Qiang NIU
  • About author: ZHANG Yingqi, born in 1996, M. S. candidate. Her research interests include Internet of Things and wireless sensing.
    PENG Dawei, born in 2001. His research interests include wireless sensing.
    LI Sen, born in 1998. His research interests include wireless sensing.
    SUN Ying, born in 2000. Her research interests include wireless sensing.
  • Supported by:
    National Natural Science Foundation of China (51674255)

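Abstract (translated from Chinese):

In recent years, a wireless platform for speech recognition that uses multiple customized, stretchable Radio Frequency Identification (RFID) tags has been proposed. However, such tags have difficulty accurately capturing the large frequency shifts caused by stretching, multiple tags must be detected, and recalibration is required whenever a tag falls off or wears out naturally. To address these problems, a lip language recognition algorithm based on single-tag RFID is proposed: a single general-purpose RFID tag that is flexible, easy to conceal and non-invasive is attached to the face, so that lip language can be recognized from facial micro-movements alone, even when the user makes no sound. First, a model is built to process the Received Signal Strength (RSS) and phase changes, over time and frequency, of the single tag's response received by the RFID reader. Next, a Gaussian function is used to smooth and denoise the raw data, and the Dynamic Time Warping (DTW) algorithm is used to evaluate and analyze the collected signal features, which resolves the mismatch in pronunciation length. Finally, a wireless speech recognition system is built to recognize and distinguish the facial expressions that correspond to sounds, thereby achieving lip language recognition. Experimental results show that, in recognizing 200 groups of digital signal features from different users, the RSS-based accuracy of the method reaches more than 86.5%.

Key words: radio frequency identification, lip language recognition, single tag, received signal strength, dynamic time warping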

Abstract:

In recent years, a wireless platform for speech recognition using multiple customized and stretchable Radio Frequency Identification (RFID) tags has been proposed. However, it is difficult for such tags to accurately capture the large frequency shifts caused by stretching, multiple tags need to be detected, and recalibration is required when tags fall off or wear out naturally. To address these problems, a lip language recognition algorithm based on single-tag RFID was proposed, in which a single flexible, easily concealable and non-invasive general-purpose RFID tag was attached to the face, allowing lip language to be recognized from facial micro-movements alone, even if the user makes no sound. Firstly, a model was established to process the Received Signal Strength (RSS) and phase changes, over time and frequency, of the single tag's response received by the RFID reader. Then, a Gaussian function was used to smooth and denoise the raw data, and the Dynamic Time Warping (DTW) algorithm was used to evaluate and analyze the collected signal characteristics, so as to solve the problem of mismatched pronunciation lengths. Finally, a wireless speech recognition system was created to recognize and distinguish the facial expressions corresponding to sounds, thereby achieving lip language recognition. Experimental results show that, in identifying 200 groups of digital signal characteristics of different users, the RSS-based accuracy of the proposed algorithm reaches more than 86.5%.

Key words: Radio Frequency IDentification (RFID), lip language recognition, single tag, Received Signal Strength (RSS), Dynamic Time Warping (DTW)
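
The two processing steps named in the abstract, Gaussian smoothing followed by DTW matching, can be sketched roughly as below. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names, the `sigma` value, the absolute-difference local cost, and the nearest-template classifier are all hypothetical choices made for the example, and the signals are synthetic rather than measured RSS traces.

```python
import numpy as np


def gaussian_smooth(signal, sigma=2.0):
    """Smooth a 1-D series by convolving it with a normalized Gaussian kernel."""
    signal = np.asarray(signal, dtype=float)
    radius = int(3 * sigma)
    offsets = np.arange(-radius, radius + 1)
    kernel = np.exp(-(offsets ** 2) / (2 * sigma ** 2))
    kernel /= kernel.sum()
    # Pad with the edge values so the output has the same length as the input.
    padded = np.pad(signal, radius, mode="edge")
    return np.convolve(padded, kernel, mode="valid")


def dtw_distance(a, b):
    """Classic O(n*m) dynamic time warping distance between two 1-D series."""
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)
    n, m = len(a), len(b)
    cost = np.full((n + 1, m + 1), np.inf)
    cost[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = abs(a[i - 1] - b[j - 1])  # assumed local cost
            cost[i, j] = local + min(cost[i - 1, j],      # step in a only
                                     cost[i, j - 1],      # step in b only
                                     cost[i - 1, j - 1])  # step in both
    return cost[n, m]


def classify(sample, templates, sigma=2.0):
    """Return the label of the template with the smallest DTW distance
    to the smoothed sample (hypothetical nearest-template rule)."""
    smoothed = gaussian_smooth(sample, sigma)
    return min(templates, key=lambda label: dtw_distance(smoothed, templates[label]))


if __name__ == "__main__":
    # Synthetic stand-ins for per-word signal templates (not real data).
    t = np.linspace(0, 1, 50)
    templates = {"word_a": np.sin(2 * np.pi * t), "word_b": np.sin(4 * np.pi * t)}
    # A slower, noisy utterance of "word_a": 70 samples instead of 50.
    rng = np.random.default_rng(0)
    slow = np.sin(2 * np.pi * np.linspace(0, 1, 70)) + 0.05 * rng.standard_normal(70)
    print(classify(slow, templates))
```

Because DTW lets one sample align with several samples of the other series, the 70-sample utterance can still be compared with a 50-sample template, which is the length-mismatch problem the abstract says DTW is used to solve.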

CLC number: