Multi-view feature projection and synthesis-analysis dictionary learning for image classification

doi:10.11772/j.issn.1001-9081.2017.07.1960

Journal of Computer Applications ›› 2017, Vol. 37 ›› Issue (7): 1960-1966.DOI: 10.11772/j.issn.1001-9081.2017.07.1960

Previous Articles Next Articles

Multi-view feature projection and synthesis-analysis dictionary learning for image classification

FENG Hui, JING Xiaoyuan, ZHU Xiaoke

School of Computer, Wuhan University, Wuhan Hubei 430072, China

Received:2016-12-15 Revised:2017-03-06 Online:2017-07-18 Published:2017-07-10
Supported by:
This work is partially supported by the National Natural Science Foundation of China (61272273).

基于多视图特征投影与合成解析字典学习的图像分类

冯辉, 荆晓远, 朱小柯

武汉大学计算机学院, 武汉 430072

通讯作者: 冯辉
作者简介:冯辉(1992-),男,湖北黄冈人,硕士研究生,主要研究方向:模式识别、计算机视觉;荆晓远(1971-),男,江苏南京人,教授,博士,CCF会员,主要研究方向:模式识别、机器学习、软件工程;朱小柯(1981-),男,河南开封人,博士研究生,CCF会员,主要研究方向:模式识别、计算机视觉。
基金资助:
国家自然科学基金资助项目（61272273）。

Abstract

Abstract: Concerning the problem that the existing synthesis-analysis dictionary learning method can not effectively eliminate the differences between the samples of the same class and ignore the different effects of different features on the classification, an image classification method based on Multi-view Feature Projection and Synthesis-analysis Dictionary Learning (MFPSDL) was put forward. Firstly, different feature projection matrices were learned for different features in the process of synthesis-analysis dictionary learning, so the influence of the within-class differences on recognition was reduced. Secondly, discriminant constraint was added to the synthesis-analysis dictionary, so that similar sparse representation coefficients were obtained for samples of the same class. Finally, by learning different weights for different features, multiple features could be fully integrated. Several experiments were carried out on the Labeled Faces in the Wild (LFW) and Modified National Institute of Standards and Technology (MNIST) database, the training time of MFPSDL method on LFW and MNIST databases were 61.236 s and 52.281 s. Compared with Fisher Discrimination Dictionary Learning (FDDL), Lable Consistent K Singular Value Decomposition (LC-KSVD) and Dictionary Pair Learning (DPL), the recognition rate of MFPSDL method on LFW and MNIST was increased by at least 2.15 and 2.08 percentage points. The experimental results show that MFPSDL method can obtain higher recognition rate while keeping low time complexity, and it is suitable for image classification.

Key words: image classification, dictionary learning, sparse representation, multi-view learning, feature learning

摘要： 针对目前存在的合成解析字典学习方法不能有效地消除同类样本之间的差异性和忽略了不同特征对分类的不同影响的问题，提出了一种基于多视图特征投影与合成解析字典学习（MFPSDL）的图像分类方法。首先，在合成解析字典学习过程中为每种特征学习不同的特征投影矩阵，减小了类内样本间的差异对识别带来的影响；其次，对合成解析字典添加鉴别性的约束，使得同类样本具有相似的稀疏表示系数；最后通过为不同类型的特征学习权重，充分地融合多种特征。在公开人脸数据库（LFW）和手写体识别数据库（MNIST）上进行多项对比实验，MFPSDL方法在LFW和MNIST数据库上的训练时间分别为61.236 s和52.281 s，MFPSDL方法相比Fisher鉴别字典学习（FDDL）、类别一致的K奇异值分解（LC-KSVD）、字典对学习（DPL）等字典学习方法，在LFW和MNIST上的识别率提高了至少2.15和2.08个百分点。实验结果表明，所提方法在保证较低的时间复杂度的同时，获得了更好的识别效果，适用于图像分类。

关键词: 图像分类, 字典学习, 稀疏表示, 多视图学习, 特征学习

CLC Number:

FENG Hui, JING Xiaoyuan, ZHU Xiaoke. Multi-view feature projection and synthesis-analysis dictionary learning for image classification[J]. Journal of Computer Applications, 2017, 37(7): 1960-1966.

冯辉, 荆晓远, 朱小柯. 基于多视图特征投影与合成解析字典学习的图像分类[J]. 计算机应用, 2017, 37(7): 1960-1966.

References

[1] ZHANG H, LAO S. Multi-view discriminant analysis[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2016, 38(1):188-194.
[2] XU C, TAO D, XU C. Multi-view intact space learning[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2015, 37(12):2531-2544.
[3] YANG M, ZHANG L, YANG J, et al. Metaface learning for sparse representation based face recognition[C]//ICIP 2010:Proceedings of the 2010 IEEE International Conference on Image Processing. Piscataway, NJ:IEEE, 2010:1601-1604.
[4] MAIRAL J, BACH F, PONCE J. Task-driven dictionary learning[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2012, 34(4):791-804.
[5] WANG Z, YANG J, NASRABADI N, et al. A max-margin per-spective on sparse representation-based classification[C]//ICCV 2013:Proceedings of the 2013 IEEE International Conference on Computer Vision. Piscataway, NJ:IEEE, 2013:1217-1224.
[6] JIANG Z, LIN Z, DAVIS L S. Label consistent K-SVD:learning a discriminative dictionary for recognition[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2013, 35(11):2651-2664.
[7] YANG M, ZHANG L, FENG X, et al. Sparse representation based fisher discrimination dictionary learning for image classification[J]. International Journal of Computer Vision, 2014, 109(3):209-232.
[8] GU S, ZHANG L, ZUO W, et al. Projective dictionary pair learning for pattern classification[C]//NIPS 2014:Proceedings of the 2014 Annual Conference on Neural Information Processing Systems. Cambridge:MIT, 2014:793-801.
[9] 程晓雅,王春红.基于特征化字典的低秩表示人脸识别[J].计算机应用,2016,36(12):3423-3428.(CHENG X Y, WANG C H. Characterized dictionary-based low-rank representation for face recognition[J]. Journal of Computer Applications, 2016, 36(12):3423-3428.)
[10] ZHANG L, YANG M, FENG X. Sparse representation or collaborative representation:which helps face recognition?[C]//ICCV 2011:Proceedings of the 2011 IEEE International Conference on Computer Vision. Piscataway, NJ:IEEE, 2011:471-478.
[11] YANG M, ZHANG L. Gabor Feature Based Sparse Representation For Face Recognition With Gabor Occlusion Dictionary[M]. Berlin:Springer, 2010:448-461.
[12] TAN X, TRIGGS B. Enhanced local texture feature sets for face recognition under difficult lighting conditions[J]. IEEE Transactions on Image Processing, 2010, 19(6):1635-1650.
[13] HINTON G E, SALAKHUTDINOV R R. Reducing the dimensionality of data with neural networks[J]. Science, 2006, 313(5786):504-507.
[14] 余凯,贾磊,陈雨强,等.深度学习的昨天、今天和明天[J].计算机研究与发展,2013,50(9):1799-1804.(YU K, JIA L, CHEN Y Q, et al. Deep learning:yesterday, today, and tomorrow[J]. Journal of Computer Research and Development, 2013, 50(9):1799-1804.)
[15] BOYD S, PARIKH N, CHU E, et al. Distributed optimization and statistical learning via the alternating direction method of multipliers[J]. Foundations and Trends in Machine Learning, 2011, 3(1):1-122.
[16] HUANG G, MATTAR M, LEE H, et al. Learning to align from scratch[C]//NIPS 2012:Proceedings of the 2012 Annual Conference on Neural Information Processing Systems. Cambridge:MIT, 2012:764-772.
[17] LECUN Y, BOTTOU L, BENGIO Y, et al. Gradient-based learning applied to document recognition[J]. Proceedings of the IEEE, 1998, 86(11):2278-2324.
[18] KRIZHEVSKY A, SUTSKEVER I, HINTON G E. ImageNet classification with deep convolutional neural networks[C]//NIPS 2012:Proceedings of the 2012 Annual Conference on Neural Information Processing Systems. Cambridge:MIT, 2012:1097-1105.

Multi-view feature projection and synthesis-analysis dictionary learning for image classification

基于多视图特征投影与合成解析字典学习的图像分类

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics

[1]	Pengqi GAO, Heming HUANG, Yonghong FAN. Fusion of coordinate and multi-head attention mechanisms for interactive speech emotion recognition [J]. Journal of Computer Applications, 2024, 44(8): 2400-2406.
[2]	Dongwei WANG, Baichen LIU, Zhi HAN, Yanmei WANG, Yandong TANG. Deep network compression method based on low-rank decomposition and vector quantization [J]. Journal of Computer Applications, 2024, 44(7): 1987-1994.
[3]	Yao DONG, Yixue FU, Yongfeng DONG, Jin SHI, Chen CHEN. Survey of incomplete multi-view clustering [J]. Journal of Computer Applications, 2024, 44(6): 1673-1682.
[4]	Feiyu ZHAI, Handa MA. Hybrid classical-quantum classification model based on DenseNet [J]. Journal of Computer Applications, 2024, 44(6): 1905-1910.
[5]	Bin XIAO, Mo YANG, Min WANG, Guangyuan QIN, Huan LI. Domain generalization method of phase-frequency fusion from independent perspective [J]. Journal of Computer Applications, 2024, 44(4): 1002-1009.
[6]	Xue LI, Guangle YAO, Honghui WANG, Jun LI, Haoran ZHOU, Shaoze YE. Remote sensing image classification based on sample incremental learning [J]. Journal of Computer Applications, 2024, 44(3): 732-736.
[7]	Jingxin LIU, Wenjing HUANG, Liangsheng XU, Chong HUANG, Jiansheng WU. Unsupervised feature selection model with dictionary learning and sample correlation preservation [J]. Journal of Computer Applications, 2024, 44(12): 3766-3775.
[8]	Li XIE, Weiping SHU, Junjie GENG, Qiong WANG, Hailin YANG. Few-shot cervical cell classification combining weighted prototype and adaptive tensor subspace [J]. Journal of Computer Applications, 2024, 44(10): 3200-3208.
[9]	Wen ZHOU, Yuzhang CHEN, Zhiyuan WEN, Shiqi WANG. Fish image classification based on positional overlapping patch embedding and multi-scale channel interactive attention [J]. Journal of Computer Applications, 2024, 44(10): 3209-3216.
[10]	Tong CHEN, Jiwei WEI, Shiyuan HE, Jingkuan SONG, Yang YANG. Adversarial training method with adaptive attack strength [J]. Journal of Computer Applications, 2024, 44(1): 94-100.
[11]	Zhixiong ZHENG, Jianhua LIU, Shuihua SUN, Ge XU, Honghui LIN. Aspect-based sentiment analysis model fused with multi-window local information [J]. Journal of Computer Applications, 2023, 43(6): 1796-1802.
[12]	Haitao TANG, Hongjun WANG, Tianrui LI. Discriminative multidimensional scaling for feature learning [J]. Journal of Computer Applications, 2023, 43(5): 1323-1329.
[13]	Bin WANG, Tian XIANG, Yidong LYU, Xiaofan WANG. Adaptive multi-scale feature channel grouping optimization algorithm based on NSGA‑Ⅱ [J]. Journal of Computer Applications, 2023, 43(5): 1401-1408.
[14]	Zhenliang LI, Bo LI. Improved method of convolution neural network based on matrix decomposition [J]. Journal of Computer Applications, 2023, 43(3): 685-691.
[15]	Kai WEN, Xiao XUE, Juan JI. Shared transformation matrix capsule network for complex image classification [J]. Journal of Computer Applications, 2023, 43(11): 3411-3417.