基于多任务学习的多姿态人脸重建与识别

doi:10.11772/j.issn.1001-9081.2017.03.896

计算机应用 ›› 2017, Vol. 37 ›› Issue (3): 896-900.DOI: 10.11772/j.issn.1001-9081.2017.03.896

• 应用前沿、交叉与综合 • 上一篇下一篇

基于多任务学习的多姿态人脸重建与识别

欧阳宁^1,2, 马玉涛², 林乐平^1,2

1. 认知无线电与信息处理省部共建教育部重点实验室(桂林电子科技大学), 广西桂林 541004;
2. 桂林电子科技大学信息与通信学院, 广西桂林 541004

收稿日期:2016-08-01 修回日期:2016-09-07 发布日期:2017-03-22 出版日期:2017-03-10
通讯作者: 林乐平
作者简介:欧阳宁(1972-),男,湖南宁远人,教授,硕士,主要研究方向:数字图像处理、智能信息处理;马玉涛(1991-),女,内蒙古乌兰察布人,硕士研究生,主要研究方向:人脸识别、深度学习;林乐平(1980-),女,广西桂平人,博士,主要研究方向:模式识别、智能信息处理、图像信号处理。
基金资助:
国家自然科学基金资助项目（61362021，61661017）；广西自然科学基金资助项目（2013GXNSFDA019030，2014GXNSFDA118035）；广西科技创新能力与条件建设计划项目（桂科能1598025-21）；桂林科技开发项目（20150103-6）。

Multi-pose face reconstruction and recognition based on multi-task learning

OUYANG Ning^1,2, MA Yutao², LIN Leping^1,2

1. Key Laboratory of Cognitive Radio and Information Processing, Ministry of Education(Guilin University of Electronic Technology), Guilin Guangxi 541004, China;
2. School of Information and Communication, Guilin University of Electronic Technology, Guilin Guangxi 541004, China

Received:2016-08-01 Revised:2016-09-07 Online:2017-03-22 Published:2017-03-10
Supported by:
This work is partially supported by the Natural Science Foundation of China (61362021, 61661017), the Natural Science Foundation of Guangxi (2013GXNSFDA019030, 2014GXNSFDA118035), the Scientific and Technological Innovation Ability and Condition Construction Plan of Guangxi (1598025-21), the Scientific and Technological Development Project of Guilin (20150103-6).

摘要/Abstract

摘要： 针对当前人脸识别中姿态变化会影响识别性能，以及姿态恢复过程中脸部局部细节信息容易丢失的问题，提出一种基于多任务学习的多姿态人脸重建与识别方法——多任务学习堆叠自编码器（MtLSAE）。该方法通过运用多任务学习机制，联合考虑人脸姿态恢复和脸部局部细节信息保留这两个相关的任务，在步进逐层恢复正面人脸姿态的同时，引入非负约束稀疏自编码器，使得非负约束稀疏自编码器能够学习到人脸部的部分特征；其次在姿态恢复和局部信息保留两个任务之间通过共享参数的方式来学习整个网络框架；最后将重建出来的正脸图像通过Fisherface进行降维并提取具有判别信息的特征，并用最近邻分类器来识别。实验结果表明，MtLSAE方法获得了较好的姿态重建质量，保留的局部纹理信息清晰，而且与局部Gabor二值模式（LGBP）、基于视角的主动外观模型（VAAM）以及堆叠步进自编码器（SPAE）等经典方法相比，识别率性能得以提升。

关键词: 多任务学习, 姿态恢复, 局部细节信息, 自编码器, 共享参数

Abstract: To circumvent the influence of pose variance on face recognition performance and considerable probability of losing the facial local detail information in the process of pose recovery, a multi-pose face reconstruction and recognition method based on multi-task learning was proposed, namely Multi-task Learning Stacked Auto-encoder (MtLSAE). Considering the correlation between pose recovery and retaining local detail information, multi-task learning mechanism was used and sparse auto-encoder with non-negativity constraints was introduced by MtLSAE to learn part features of the face when recovering frontal images using step-wise approach. And then the whole net framework was learned by sharing parameters between above two related tasks. Finally, Fisherface was used for dimensionality reduction and extracting discriminative features of reconstructed positive face image, and the nearest neighbor classifier was used for recognition. The experimental results demonstrate that MtLSAE achieves good pose reconstruction quality and makes facial local texture information clear; on the other hand, it also achieves higher recognition rate than some classical methods such as Local Gabor Binary Pattern(LGBP), View-Based Active Appearance (VAAM) and Stacked Progressive Auto-encoder (SPAE).

Key words: multi-task learning, pose recovery, local detail information, auto-encoder, sharing parameter

中图分类号:

TP391.3

欧阳宁, 马玉涛, 林乐平. 基于多任务学习的多姿态人脸重建与识别[J]. 计算机应用, 2017, 37(3): 896-900.

OUYANG Ning, MA Yutao, LIN Leping. Multi-pose face reconstruction and recognition based on multi-task learning[J]. Journal of Computer Applications, 2017, 37(3): 896-900.

参考文献

[1] TAN X, TRIGGS B. Enhanced local texture feature sets for face recognition under difficult lighting conditions[J]. IEEE Transactions on Image Processing, 2010, 19(6):1635-1650.
[2] HUANG G B, RAMESH M, BERG T, et al. Labeled faces in the wild:a database for studying face recognition in unconstrained environments[R]. Cambridge:University of Massachusetts, 2007:49.
[3] GÜNTHER M, COSTA-PAZO A, DING C, et al. The 2013 face recognition evaluation in mobile environment[C]//ICB 2013:Proceedings of the 2013 International Conference on Biometrics. Piscataway, NJ:IEEE, 2013:1-7.
[4] ZHANG W, SHAN S, GAO W, et al. Local Gabor Binary Pattern Histogram Sequence (LGBPHS):a novel non-statistical model for face representation and recognition[C]//ICCV'05:Proceedings of the Tenth IEEE International Conference on Computer Vision. Washington, DC:IEEE Computer Society, 2005, 1:786-791.
[5] ASTHANA A, MARKS T K, JONES M J, et al. Fully automatic pose-invariant face recognition via 3D pose normalization[C]//ICCV'11:Proceedings of the 2011 International Conference on Computer Vision. Washington, DC:IEEE Computer Society, 2011:937-944.
[6] HO H T, CHELLAPPA R. Pose-invariant face recognition using Markov random fields[J]. IEEE Transactions on Image Processing, 2013, 22(4):1573-1584.
[7] ZHU Z, LUO P, WANG X, et al. Deep learning identity-preserving face space[C]//ICCV'13:Proceedings of the 2013 IEEE International Conference on Computer Vision. Washington, DC:IEEE Computer Society, 2013:113-120.
[8] ZHU Z, LUO P, WANG X, et al. Multi-view perceptron:a deep model for learning face identity and view representations[C]//NIPS 2014:Advances in Neural Information Processing Systems. Cambridge, MA:MIT Press, 2014:217-225.
[9] BENGIO Y. Learning deep architectures for AI[J]. Foundations and Trends in Machine Learning, 2009, 2(1):1-127.
[10] KAN M, SHAN S, CHANG H, et al. Stacked Progressive Auto-Encoders (SPAE) for face recognition across poses[C]//CVPR'14:Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition.Washington, DC:IEEE Computer Society, 2014:1883-1890.
[11] SHIELDS T J, AMER M R, EHRLICH M, et al. Action-affect classification and morphing using multi-task representation learning[J/OL]. arXiv preprint arXiv:1603.06554, 2016[2016-03-21]. https://arxiv.org/abs/1603.06554.
[12] ARGYRIOU A, EVGENIOU T, PONTIL M. Multi-task feature learning[C]//NIPS 2006:Advances in Neural Information Processing Systems. Cambridge, MA:MIT Press, 2007, 19:41-48.
[13] HOSSEINI-ASL E, ZURADA J M, NASRAOUI O. Deep learning of part-based representation of data using sparse autoencoders with nonnegativity constraints[J]. IEEE Transactions on Neural Networks and Learning Systems, 2015, 27(12):1-13.
[14] BELHUMEUR P N, HESPANHA J P, KRIEGMAN D J. Eigenfaces vs. fisherfaces:recognition using class specific linear projection[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 1997, 19(7):711-720.
[15] NAIR V, HINTON G E. Rectified linear units improve restricted Holtzmann machines[C]//ICML-10:Proceedings of the 27th International Conference on Machine Learning. Haifa:Omnipress, 2010:807-814.
[16] GRAVELINES C. Deep learning via stacked sparse autoencoders for automated voxel-wise brain parcellation based on functional connectivity[D]. Ontario, Canada:The University of Western Ontario, 2014:1-76.
[17] LEE H, EKANADHAM C, NG A Y. Sparse deep belief net model for visual area V2[C]//NIPS 2007:Advances in Neural Information Processing Systems. Cambridge, MA:MIT Press, 2008:873-880.
[18] NG A, NGIAM J, FOO C Y, et al. UFLDL Tutorial[EB/OL]. (2013-04-07)[2016-08-26].http://deeplearning.stanford.edu/wiki/index.php/Gradient_checking_and_advanced_optimization.
[19] GROSS R, MATTHEWS I, COHN J, et al. The CMU multi-pose, illumination, and expression (Multi-PIE) face database, TR-07-08[R]. Pittsburgh:CMU Robotics Institute, 2007.
[20] 李航.统计学习方法[M].北京:清华大学出版社,2012:14-15. (LI H. Statical Learning Methods[M]. Beijing:Tsinghua University Press, 2012:14-15.)

基于多任务学习的多姿态人脸重建与识别

Multi-pose face reconstruction and recognition based on multi-task learning

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

[1]	邓凯丽, 魏伟波, 潘振宽. 改进掩码自编码器的工业缺陷检测方法[J]. 《计算机应用》唯一官方网站, 2024, 44(8): 2595-2603.
[2]	李宗禹, 强思维, 郭晓波, 朱振峰. 重加权的对抗变分自编码器及其在工业因果效应估计中的应用[J]. 《计算机应用》唯一官方网站, 2024, 44(4): 1099-1106.
[3]	李威, 陈玲, 徐修远, 朱敏, 郭际香, 周凯, 牛颢, 张煜宸, 易珊烨, 章毅, 罗凤鸣. 基于多任务学习的间质性肺病分割算法[J]. 《计算机应用》唯一官方网站, 2024, 44(4): 1285-1293.
[4]	尚爱国, 朱欣娟. 基于多任务学习的意图检测和槽位填充联合方法[J]. 《计算机应用》唯一官方网站, 2024, 44(3): 690-695.
[5]	张卓, 陈花竹. 基于一致性和多样性的多尺度自表示学习的深度子空间聚类[J]. 《计算机应用》唯一官方网站, 2024, 44(2): 353-359.
[6]	廖存燚, 郑毅, 刘玮瑾, 于欢, 刘守印. 自动驾驶环境感知多任务去耦-融合算法[J]. 《计算机应用》唯一官方网站, 2024, 44(2): 424-431.
[7]	宋钰丹, 王晶, 王雪徽, 马朝阳, 林友芳. 基于自适应多任务学习的睡眠生理时序分类方法[J]. 《计算机应用》唯一官方网站, 2024, 44(2): 654-662.
[8]	蒋辉, 闫秋艳, 姜竹郡. 面向多元时间序列异常检测的对称正定自编码器方法[J]. 《计算机应用》唯一官方网站, 2024, 44(10): 3294-3299.
[9]	郭晓, 陈艳平, 唐瑞雪, 黄瑞章, 秦永彬. 融合行为词的罪名预测多任务学习模型[J]. 《计算机应用》唯一官方网站, 2024, 44(1): 159-166.
[10]	王静红, 周志霞, 王辉, 李昊康. 双路自编码器的属性网络表示学习[J]. 《计算机应用》唯一官方网站, 2023, 43(8): 2338-2344.
[11]	黄梦林, 段磊, 张袁昊, 王培妍, 李仁昊. 基于Prompt学习的无监督关系抽取模型[J]. 《计算机应用》唯一官方网站, 2023, 43(7): 2010-2016.
[12]	何建辉, 胡春龙, 束鑫. 基于多峰标签分布学习的多任务年龄估计方法[J]. 《计算机应用》唯一官方网站, 2023, 43(5): 1578-1583.
[13]	尹春勇, 周立文. 基于再编码的无监督时间序列异常检测模型[J]. 《计算机应用》唯一官方网站, 2023, 43(3): 804-811.
[14]	徐少康, 张战成, 姚浩男, 邹智伟, 张宝成. 基于姿态编码器的2D/3D脊椎医学图像实时配准方法[J]. 《计算机应用》唯一官方网站, 2023, 43(2): 589-594.
[15]	马志峰, 于俊洋, 王龙葛. 多样性表示的深度子空间聚类算法[J]. 《计算机应用》唯一官方网站, 2023, 43(2): 407-412.