计算机应用 ›› 2018, Vol. 38 ›› Issue (4): 1146-1150.DOI: 10.11772/j.issn.1001-9081.2017092154

• 虚拟现实与多媒体计算 • 上一篇    下一篇

光流估计下的移动端实时人脸检测

魏震宇1, 文畅1, 谢凯2, 贺建飚3   

  1. 1. 长江大学 计算机科学学院, 湖北 荆州 434023;
    2. 长江大学 电子信息学院, 湖北 荆州 434023;
    3. 中南大学 信息科学与工程学院, 长沙 410083
  • 收稿日期:2017-09-05 修回日期:2017-11-13 出版日期:2018-04-10 发布日期:2018-04-09
  • 通讯作者: 文畅
  • 作者简介:魏震宇(1997-),男,江苏泰兴人,硕士研究生,CCF会员,主要研究方向:人脸识别、图像与视频处理;文畅(1979-),女,湖北荆州人,讲师,硕士,主要研究方向:信号与信息处理、模式识别、三维建模;谢凯(1974-),男,湖北潜江人,教授,博士,主要研究方向:信号与信息处理;贺建飚(1964-),男,湖南长沙人,副教授,博士,主要研究方向:信号处理。
  • 基金资助:
    国家自然科学基金资助项目(61272147);长江大学大学生创新创业训练计划项目(2017008)。

Real-time face detection for mobile devices with optical flow estimation

WEI Zhenyu1, WEN Chang1, XIE Kai2, HE Jianbiao3   

  1. 1. College of Computer Science, Yangtze River University, Jingzhou Hubei 434023, China;
    2. College of Electronic Information, Yangtze River University, Jingzhou Hubei 434023, China;
    3. College of Information Science and Engineering, Central South University, Changsha Hunan 410083, China
  • Received:2017-09-05 Revised:2017-11-13 Online:2018-04-10 Published:2018-04-09
  • Supported by:
    This work is partially supported by the National Natural Science Foundation of China (61272147), the Innovation and Entrepreneurship Training Program for Yangtze University Students (2017008)

摘要: 为了提高移动设备人脸检测准确率,提出一种应用于移动设备的实时人脸检测算法。通过改进Viola-Jones方法进行人脸区域快速分割,在不损失速度的情况下提高分割精度;同时应用了光流估计方法将卷积神经网络子网络在离散关键帧上的特征提取结果传播至非关键帧,提高神经网络实际检测运行效率。实验使用YouTube视频人脸数据库、自建20人各1 min正位人脸视频数据库和实际检测项目在不同分辨率下进行,实验结果表明运行速度在2.35帧/秒~22.25帧/秒,达到了一般人脸检测水平;人脸检测在10%误检率下召回率由Viola-Jones的65.93%提高到82.5%~90.8%,接近卷积神经网络检测精度,满足了移动设备实时人脸检测的速度和精度要求。

关键词: 人脸检测, 快速区域分割, 瀑布型分类器, 卷积神经网络, 光流估计

Abstract: To improve the face detection accuracy of mobile devices, a new real-time face detection algorithm for mobile devices was proposed. The improved Viola-Jones was used for a quick region segmentation to improve segmentation precision without decreasing segmentation speed. At the same time, the optical flow estimation method was used to propagate the features of discrete keyframes extracted by the sub-network of a convolution neural network to other non-keyframes, which increased the efficiency of convolution neural network. Experiments were conducted on YouTube video face database, a self-built one-minute face video database of 20 people and the real test items at different resolutions. The results show that the running speed is between 2.35 frames per second and 22.25 frames per second, reaching the average face detection level; the recall rate of face detection is increased from 65.93% to 82.5%-90.8% at rate of 10% false alarm, approaching the detection accuracy of convolution neural network, which satisfies the speed and accuracy requirements for real-time face detection of mobile devices.

Key words: face detection, quick region segmentation, cascade classifier, Convolution Neural Network (CNN), optical flow estimation

中图分类号: