Zero-shot image classification based on visual error and semantic attributes

doi:10.11772/j.issn.1001-9081.2019081475

Journal of Computer Applications ›› 2020, Vol. 40 ›› Issue (4): 1016-1022.DOI: 10.11772/j.issn.1001-9081.2019081475

• Artificial intelligence • Previous Articles Next Articles

Zero-shot image classification based on visual error and semantic attributes

XU Ge¹, XIAO Yongqiang^2,3,4, WANG Tao^1,2, CHEN Kaizhi^2,3,4, LIAO Xiangwen^2,3,4, WU Yunbing^2,3,4

1. College of Computer and Control Engineering, Minjiang University, Fuzhou Fujian 350108, China;
2. College of Mathematics and Computer Science, Fuzhou University, Fuzhou Fujian 350116, China;
3. Fujian Provincial Key Laboratory of Networking Computing and Intelligent Information Processing(Fuzhou University), Fuzhou Fujian 350116, China;
4. Digital Fujian Financial Big Data Institute, Fuzhou Fujian 350116, China

Received:2019-09-03 Revised:2019-10-23 Online:2019-11-18 Published:2020-04-10
Supported by:
This work is partially supported by the National Natural Science Foundation of China(61772135,U1605251,61703195),the Open Fund of the Key Laboratory of Network Data Science and Technology of the Chinese Academy of Sciences(CASNDST201708,CASNDST 201606), the Open Fund of the State Key Laboratory of Pattern Recognition (201900041),the Surface Program of Fujian Natural Science Foundation (2017J01755), the CERNET Innovation Project(NGII20160501).

基于视觉误差与语义属性的零样本图像分类

徐戈¹, 肖永强^2,3,4, 汪涛^1,2, 陈开志^2,3,4, 廖祥文^2,3,4, 吴运兵^2,3,4

1. 闽江学院计算机与控制工程学院, 福州 350108;
2. 福州大学数学与计算机科学学院, 福州 350116;
3. 福建省网络计算与智能信息处理重点实验室(福州大学), 福州 350116;
4. 数字福建金融大数据研究所, 福州 350116

通讯作者: 廖祥文
作者简介:徐戈(1978-),男,福建福州人,副教授,博士,主要研究方向:人工智能、自然语言处理;肖永强(1994-),男,福建龙岩人,硕士研究生,主要研究方向:多模态、人工智能;汪涛(1987-),男,福建福州人,讲师,博士,主要研究方向:场景理解、目标检测与分割;陈开志(1983-),男,福建福州人,讲师,博士,主要研究方向:自然语言处理、图像识别、深度学习算法;廖祥文(1980-),男,福建福州人,教授,博士,主要研究方向:观点挖掘、情感分析、自然语言处理;吴运兵(1976-),男,福建福州人,副教授,硕士,主要研究方向:知识表示与知识发现。
基金资助:
国家自然科学基金资助项目（61772135，U1605251，61703195）；中国科学院网络数据科学与技术重点实验室开放课题基金资助项目（CASNDST201708，CASNDST201606）；模式识别国家重点实验室开放课题基金资助项目（201900041）；福建省自然科学基金面上项目（2017J01755）；赛尔网络下一代互联网技术创新项目（NGII20160501）。

Abstract

Abstract: In the practical applications of image classification,some categories may have no labeled training data at all. The purpose of Zero-Shot Learning(ZSL)is to transfer knowledge such as image features of labeled categories to unlabeled categories and to correctly classify the unlabeled categories. However,the existing state-of-the-art methods cannot explicitly distinguish the input image belonging to the known categories or unknown categories,which leads to a large performance gap for unlabeled categories between the traditional ZSL prediction and the Generalized ZSL(GZSL)prediction. Therefore,a method of fusing of visual error and semantic attributes was proposed to alleviate the prediction bias problem in zero-shot image classification. Firstly,a semi-supervised learning based generative adversarial network framework was designed to obtain visual error information,so as to predict whether the image belongs to the known categories. Then,a zero-shot image classification network combining semantic attributes was proposed to achieve zero-shot image classification. Finally,the performance of zero-shot image classification algorithm combining visual error and semantic attributes was tested on AwA2 (Animal with Attributes) and CUB (Caltech-UCSD-Birds-200-2011) datasets. The experimental results show that, compared to the baseline models,the proposed method can effectively alleviate the prediction bias problem,and has the harmonic index H increased by 31. 7 percentage points on AwA2 dataset and 8. 7 percentage points on CUB dataset.

Key words: Zero-Shot Learning (ZSL), image classification, generative adversarial network, visual error, semantic attribute

摘要： 在图像分类的实际应用过程中，部分类别可能完全没有带标签的训练数据。零样本学习（ZSL）的目的是将带标签类别的图像特征等知识迁移到无标签的类别上，实现无标签类别的正确分类。现有方法在测试时无法显式地区分输入图像属于已知类还是未知类，很大程度上导致未知类在传统设定下的ZSL和广义设定下的ZSL（GZSL）上的预测效果相差甚远。为此，提出一种融合视觉误差与属性语义信息的方法来缓解零样本图像分类中的预测偏置问题。首先，设计一种半监督学习方式的生成对抗网络架构来获取视觉误差信息，由此预测图像是否属于已知类；然后，提出融合属性语义信息的零样本图像分类网络来实现零样本图像分类；最后，测试融合视觉误差与属性语义的零样本图像分类方法在数据集AwA2和CUB上的效果。实验结果表明，与对比模型相比，所提方法有效缓解了预测偏置问题，其调和指标H在AwA2（Animal with Attributes）上提升了31.7个百分点，在CUB（Caltech-UCSD-Birds-200-2011）上提升了8.7个百分点。

关键词: 零样本学习, 图像分类, 生成对抗网络, 视觉误差, 属性语义

CLC Number:

TP391

XU Ge, XIAO Yongqiang, WANG Tao, CHEN Kaizhi, LIAO Xiangwen, WU Yunbing. Zero-shot image classification based on visual error and semantic attributes[J]. Journal of Computer Applications, 2020, 40(4): 1016-1022.

徐戈, 肖永强, 汪涛, 陈开志, 廖祥文, 吴运兵. 基于视觉误差与语义属性的零样本图像分类[J]. 计算机应用, 2020, 40(4): 1016-1022.

References

[1] LAROCHELLE H,ERHAN D,BENGIO Y. Zero-data learning of new tasks[C]//Proceedings of the 23rd AAAI Conference on Artificial Intelligence. Palo Alto, CA:AAAI Press, 2008:646-651.
[2] LAMPERT C H,NICKISCH H,HARMELING S. Learning to detect unseen object classes by between-class attribute transfer[C]//Proceedings of the 2009 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway:IEEE,2009:951-958.
[3] LAMPERT C H,NICKISCH H,HARMELING S. Attribute-based classification for zero-shot visual object categorization[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence,2014, 36(3):453-465.
[4] ROMERA-PAREDES B,TORR P H S. An embarrassingly simple approach to zero-shot learning[C]//Proceedings of the 32nd International Conference on Machine Learning. New York:JMLR. org, 2015:2152-2161.
[5] LIU M,ZHANG D,CHEN S. Attribute relation learning for zero-shot classification[J]. Neurocomputing,2014,139:34-46.
[6] 巩萍, 程玉虎, 王雪松. 基于属性关系图正则化特征选择的零样本分类[J]. 中国矿业大学学报,2015,44(6):1097-1104. (GONG P,CHEN Y H,WANG X S. Zero-shot classification based on attribute correlation graph regularized feature selection[J]. Journal of China University of Mining and Technology,2015,44(6):1097-1104.)
[7] AKATA Z,REED S,WALTER D,et al. Evaluation of output embeddings for fine-grained image classification[C]//Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway:IEEE,2015:2927-2936.
[8] XIAN Y,AKATA Z,SHARMA G,et al. Latent embeddings for zero-shot classification[C]//Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway:IEEE,2016:69-77.
[9] AKATA Z, MALINOWSKI M, FRITZ M, et al. Multi-cue zero-shot learning with strong supervision[C]//Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway:IEEE,2016:59-68.
[10] ZHANG L,XIANG T,GONG S. Learning a deep embedding model for zero-shot learning[C]//Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway:IEEE,2017:3010-3019.
[11] ZHU Y,ELHOSEINY M,LIU B,et al. A generative adversarial approach for zero-shot learning from noisy texts[C]//Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway:IEEE,2018:1004-1013.
[12] SUNG F,YANG Y,ZHANG L,et al. Learning to compare:relation network for few-shot learning[C]//Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway:IEEE,2018:1199-1208.
[13] LI Y,ZHANG J,ZHANG J,et al. Discriminative learning of latent features for zero-shot recognition[C]//Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway:IEEE,2018:7463-7471.
[14] FU J,ZHENG H,MEI T. Look closer to see better:recurrent attention convolutional neural network for fine-grained image recognition[C]//Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway:IEEE, 2017:4476-4484.
[15] AKCAY S, ATAPOUR-ABARGHOUEI A, BRECKON T P. GANomaly:semi-supervised anomaly detection via adversarial training[C]//Proceedings of the 2018 Asian Conference on Computer Vision,LNCS 11363. Cham:Springer,2018:622-637.
[16] RUSSAKOVSKY O,DENG J,SU H,et al. ImageNet large scale visual recognition challenge[J]. International Journal of Computer Vision,2015,115(3):211-252.
[17] HE K,ZHANG X,REN S,et al. Deep residual learning for image recognition[C]//Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway:IEEE, 2016:770-778.

Zero-shot image classification based on visual error and semantic attributes

基于视觉误差与语义属性的零样本图像分类

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics

[1]	Dongwei WANG, Baichen LIU, Zhi HAN, Yanmei WANG, Yandong TANG. Deep network compression method based on low-rank decomposition and vector quantization [J]. Journal of Computer Applications, 2024, 44(7): 1987-1994.
[2]	Li LIU, Haijin HOU, Anhong WANG, Tao ZHANG. Generative data hiding algorithm based on multi-scale attention [J]. Journal of Computer Applications, 2024, 44(7): 2102-2109.
[3]	Feiyu ZHAI, Handa MA. Hybrid classical-quantum classification model based on DenseNet [J]. Journal of Computer Applications, 2024, 44(6): 1905-1910.
[4]	Haoran WANG, Dan YU, Yuli YANG, Yao MA, Yongle CHEN. Domain transfer intrusion detection method for unknown attacks on industrial control systems [J]. Journal of Computer Applications, 2024, 44(4): 1158-1165.
[5]	Bin XIAO, Mo YANG, Min WANG, Guangyuan QIN, Huan LI. Domain generalization method of phase-frequency fusion from independent perspective [J]. Journal of Computer Applications, 2024, 44(4): 1002-1009.
[6]	Xue LI, Guangle YAO, Honghui WANG, Jun LI, Haoran ZHOU, Shaoze YE. Remote sensing image classification based on sample incremental learning [J]. Journal of Computer Applications, 2024, 44(3): 732-736.
[7]	Yi ZHENG, Cunyi LIAO, Tianqian ZHANG, Ji WANG, Shouyin LIU. Image denoising-based cell-level RSRP estimation method for urban areas [J]. Journal of Computer Applications, 2024, 44(3): 855-862.
[8]	Sunjie YU, Hui ZENG, Shiyu XIONG, Hongzhou SHI. Incentive mechanism for federated learning based on generative adversarial network [J]. Journal of Computer Applications, 2024, 44(2): 344-352.
[9]	Li XIE, Weiping SHU, Junjie GENG, Qiong WANG, Hailin YANG. Few-shot cervical cell classification combining weighted prototype and adaptive tensor subspace [J]. Journal of Computer Applications, 2024, 44(10): 3200-3208.
[10]	Wen ZHOU, Yuzhang CHEN, Zhiyuan WEN, Shiqi WANG. Fish image classification based on positional overlapping patch embedding and multi-scale channel interactive attention [J]. Journal of Computer Applications, 2024, 44(10): 3209-3216.
[11]	Tong CHEN, Jiwei WEI, Shiyuan HE, Jingkuan SONG, Yang YANG. Adversarial training method with adaptive attack strength [J]. Journal of Computer Applications, 2024, 44(1): 94-100.
[12]	Hui ZHOU, Yuling CHEN, Xuewei WANG, Yangwen ZHANG, Jianjiang HE. Deep shadow defense scheme of federated learning based on generative adversarial network [J]. Journal of Computer Applications, 2024, 44(1): 223-232.
[13]	Shaoquan CHEN, Jianping CAI, Lan SUN. Differential privacy generative adversarial network algorithm with dynamic gradient threshold clipping [J]. Journal of Computer Applications, 2023, 43(7): 2065-2072.
[14]	Anyang LIU, Huaici ZHAO, Wenlong CAI, Zechao XU, Ruideng XIE. Adaptive image deblurring generative adversarial network algorithm based on active discrimination mechanism [J]. Journal of Computer Applications, 2023, 43(7): 2288-2294.
[15]	Xin JIN, Yangchuan LIU, Yechen ZHU, Zijian ZHANG, Xin GAO. Sinogram inpainting for sparse-view cone-beam computed tomography image reconstruction based on residual encoder-decoder generative adversarial network [J]. Journal of Computer Applications, 2023, 43(6): 1950-1957.