Journal of Computer Applications ›› 2023, Vol. 43 ›› Issue (3): 723-727.DOI: 10.11772/j.issn.1001-9081.2022020175

• Artificial intelligence • Previous Articles    

Hidden state initialization method for recurrent neural network-based human motion model

Nanfan LI1, Wenwen SI1, Siyuan DU1, Zhiyong WANG2,3, Chongyang ZHONG2,3(), Shihong XIA2,3   

  1. 1.State Grid Beijing Urban Power Supply Company,Beijing 100034,China
    2.Institute of Computing Technology,Chinese Academy of Sciences,Beijing 100190,China
    3.University of Chinese Academy of Sciences,Beijing 100049,China
  • Received:2022-02-18 Revised:2022-05-12 Accepted:2022-05-13 Online:2022-08-16 Published:2023-03-10
  • Contact: Chongyang ZHONG
  • About author:LI Nanfan, born in 1993, M. S., engineer. His research interests include human motion simulation.
    SI Wenwen, born in 1984, M. S., senior engineer. Her research interests include human motion simulation.
    DU Siyuan, born in 1994, assistant engineer. His research interests include human motion simulation.
    WANG Zhiyong, born in 1990, Ph. D. His research interests include computer graphics.
    XIA Shihong, born in 1974, Ph. D., research fellow. His research interests include computer graphics, virtual reality, artificial intelligence.
  • Supported by:
    Special Project of Science and Technology for Winter Olympics of National Key Research and Development Program of China(2020YFF0304701);Science and Technology Project of Beijing Electric Power Company(52020220004B)

基于循环神经网络的人体运动模型的隐状态初始化方法

李南帆1, 司文文1, 杜思远1, 王志勇2,3, 钟重阳2,3(), 夏时洪2,3   

  1. 1.国网北京城区供电公司,北京 100034
    2.中国科学院计算技术研究所,北京 100190
    3.中国科学院大学,北京 100049
  • 通讯作者: 钟重阳
  • 作者简介:李南帆(1993—),男,北京人,工程师,硕士,主要研究方向:人体运动仿真
    司文文(1984—),女,山东济南人,高级工程师,硕士,主要研究方向:人体运动仿真
    杜思远(1994—),男,北京人,助理工程师,主要研究方向:人体运动仿真
    王志勇(1990—),男,天津人,博士,主要研究方向:计算机图形学
    钟重阳(1994—),男,重庆人,博士,主要研究方向:计算机图形学
    夏时洪(1974—),男,四川平昌人,研究员,博士,主要研究方向:计算机图形学、虚拟现实、人工智能。
  • 基金资助:
    国家重点研发计划科技冬奥重点专项(2020YFF0304701);北京市电力公司科技项目(52020220004B)

Abstract:

Aiming at the problem of the jump existed in the first frame of human motion synthesis method based on Recurrent Neural Network (RNN), which affects the quality of generated motion, a human motion synthesis method with hidden state initialization was proposed. The initial hidden state was used as independent variable, the objective function of the neural network was used as optimization goal, and the gradient descent method was used to optimize and solve the problem to obtain a suitable initial hidden state. Compared with Encoder-Recurrent-Decoder (ERD) model and Residual Gate Recurrent Unit (RGRU) model, the proposed method with initial hidden state estimation reduces the prediction error of the first frame by 63.51% and 6.90% respectively, and decreases the total error of 10 frames by 50.00% and 4.89% respectively. Experimental results show that the proposed method is better than the method without initial hidden state estimation in both motion synthesis quality and motion prediction accuracy. And the proposed method accurately estimates the hidden state of the first frame of RNN-based human motion model, which improves the quality of motion synthesis and provides reliable data support for action recognition model in real-time security monitoring.

Key words: human motion synthesis, Recurrent Neural Network (RNN), hidden state estimation, action recognition, motion model

摘要:

针对基于循环神经网络(RNN)的人体运动合成方法存在首帧跳变,进而影响生成运动的质量的问题,提出一种带有隐状态初始化的人体运动合成方法,将初始隐状态作为自变量,利用神经网络的目标函数作为优化目标,并使用梯度下降的方法进行优化求解,以得到一个合适的初始隐状态。相较于编码器-循环-解码器(ERD)、残差门控循环单元(RGRU)模型,所提方法在首帧的预测误差分别减小63.51%和6.90%,10帧的总误差分别减小50.00%和4.89%。实验结果表明,该方法无论是运动合成质量还是运动预测精度都优于不进行初始隐状态估计的方法;它通过准确估计基于RNN的人体运动模型的首帧隐状态可提升运动合成的质量,并且为实时安全监测中的动作识别模型提供可靠的数据支持。

关键词: 人体运动合成, 循环神经网络, 隐状态估计, 动作识别, 运动模型

CLC Number: