Real-time face pose estimation system based on 3D face model on Android mobile platform

doi:10.11772/j.issn.1001-9081.2015.08.2321

Journal of Computer Applications, 2015, Vol. 35, Issue (8): 2321-2326.

WANG Haipeng¹, WANG Zhengliang², XU Weiwei¹, FAN Ran¹

1. Institute of Service Engineering, Hangzhou Normal University, Hangzhou Zhejiang 311121, China;
2. Information Resource Center, Institute of Scientific and Technical Information of Zhejiang Province, Hangzhou Zhejiang 311121, China

Received: 2015-01-28 Revised: 2015-03-28 Online: 2015-08-10 Published: 2015-08-14
Corresponding author: XU Weiwei (born 1975), male, from Jixi, Anhui; professor, Ph.D.; main research interests: computer graphics and image processing; weiwei.xu.g@gmail.com
About the authors: WANG Haipeng (born 1989), male, from Dongying, Shandong; master's student; main research interests: computer vision, computer graphics. WANG Zhengliang (born 1978), male, from Taizhou, Zhejiang; librarian; main research interest: computer vision. FAN Ran (born 1984), male, from Jinan, Shandong; Ph.D.; main research interests: computer graphics, 3D geometry processing.
Supported by:
National Natural Science Foundation of China (61322204, 61272392).

Abstract:

Face pose estimation places high demands on system performance and therefore struggles to run in real time on mobile phones. To address this, a real-time face pose estimation system was implemented for Android mobile phones. First, one frontal face image and one face image offset by a certain angle were captured by the camera, and a simple 3D face model was built from them with the Structure from Motion (SfM) algorithm. Second, feature points corresponding to the 3D face model were extracted from the real-time face image, and the face pose angles were estimated with the POSIT (Pose from Orthography and Scaling with ITerations) algorithm. Finally, the 3D face model was displayed on the phone screen in real time using OpenGL (Open Graphics Library). The experimental results show that the system detects and displays the face pose at up to 20 frame/s in real-time video, which is close to the speed of a computer-based 3D face pose estimation algorithm built on affine correspondences, and that detection over large image sequences reaches 50 frame/s. These results indicate that the system meets both the performance constraints of Android phones and the real-time requirement of face pose detection.

Key words: face pose, Structure from Motion (SfM) algorithm, explicit shape regression, Pose from Orthography and Scaling with ITerations (POSIT), Random Sample Consensus (RANSAC), Android, augmented reality
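
To make the pose-estimation step in the abstract more concrete, the following is a minimal NumPy sketch of the classic POSIT iteration for four or more non-coplanar model points. It is not the authors' Android implementation. The function names `posit` and `euler_angles`, the parameter names, and the Euler-angle convention (R = Rz(roll) · Ry(yaw) · Rx(pitch)) are assumptions chosen for illustration; the abstract does not state which convention the system uses.

```python
import numpy as np


def posit(object_pts, image_pts, focal, max_iter=100, tol=1e-10):
    """Sketch of classic POSIT for >= 4 non-coplanar points.

    object_pts: (N, 3) model points; row 0 is the reference point M0.
    image_pts:  (N, 2) image coordinates relative to the principal point.
    focal:      focal length in the same units as image_pts (e.g. pixels).
    Returns (R, T) with camera coordinates P = R @ (M - M0) + T.
    """
    object_pts = np.asarray(object_pts, dtype=float)
    image_pts = np.asarray(image_pts, dtype=float)
    vecs = object_pts[1:] - object_pts[0]   # model vectors M0 -> Mi
    pinv = np.linalg.pinv(vecs)             # pseudo-inverse, computed once
    eps = np.zeros(len(vecs))               # perspective correction terms
    for _ in range(max_iter):
        # Image vectors of the scaled orthographic projection implied by eps.
        xp = image_pts[1:, 0] * (1.0 + eps) - image_pts[0, 0]
        yp = image_pts[1:, 1] * (1.0 + eps) - image_pts[0, 1]
        I = pinv @ xp
        J = pinv @ yp
        scale = 0.5 * (np.linalg.norm(I) + np.linalg.norm(J))
        i = I / np.linalg.norm(I)
        j = J / np.linalg.norm(J)
        k = np.cross(i, j)
        k /= np.linalg.norm(k)
        z0 = focal / scale                  # depth of the reference point
        new_eps = vecs @ k / z0
        converged = np.max(np.abs(new_eps - eps)) < tol
        eps = new_eps
        if converged:
            break
    j = np.cross(k, i)                      # re-orthogonalise the second row
    R = np.vstack([i, j, k])
    T = np.array([image_pts[0, 0], image_pts[0, 1], focal]) / scale
    return R, T


def euler_angles(R):
    """Pitch (x), yaw (y), roll (z) in radians.

    Assumes R = Rz(roll) @ Ry(yaw) @ Rx(pitch); other conventions differ.
    """
    pitch = np.arctan2(R[2, 1], R[2, 2])
    yaw = -np.arcsin(np.clip(R[2, 0], -1.0, 1.0))
    roll = np.arctan2(R[1, 0], R[0, 0])
    return pitch, yaw, roll
```

In a pipeline like the one described, `object_pts` would come from the SfM face model and `image_pts` from the feature points matched in the current frame. The iteration only needs one pseudo-inverse per model and a few vector operations per frame, which is one plausible reason a POSIT-style method suits a phone-class processor.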

CLC number:

TP391.41

WANG Haipeng, WANG Zhengliang, XU Weiwei, FAN Ran. Real-time face pose estimation system based on 3D face model on Android mobile platform[J]. Journal of Computer Applications, 2015, 35(8): 2321-2326.

