Image labeling based on fully-connected conditional random field

doi:10.11772/j.issn.1001-9081.2017.10.2841

Journal of Computer Applications ›› 2017, Vol. 37 ›› Issue (10): 2841-2846.DOI: 10.11772/j.issn.1001-9081.2017.10.2841

Previous Articles Next Articles

Image labeling based on fully-connected conditional random field

LIU Tong, HUANG Xiutian, MA Jianshe, SU Ping

Graduate School at Shenzhen, Tsinghua University, Shenzhen Guangdong 518055, China

Received:2017-04-10 Revised:2017-05-22 Online:2017-10-16 Published:2017-10-10
Supported by:
This work is partially supported by the National High Technology Research and Development Program (863 Program) of China (2015AA043302), the Program of Industry-university-research Cooperation in Guangdong Province (2013A090100002), the Development Program of Technology Research in Guangdong Huadu District (HD14ZD004).

基于完全联系的条件随机场的图像标注

刘彤, 黄修添, 马建设, 苏萍

清华大学深圳研究生院, 广东深圳 518055

通讯作者: 黄修添(1991-),男,福建宁德人,硕士研究生,主要研究方向:模式识别、计算机视觉,E-mail:709794457@qq.com
作者简介:刘彤(1970-),男,河南洛阳人,讲师,博士,主要研究方向:计算机视觉、LED封装;黄修添(1991-),男,福建宁德人,硕士研究生,主要研究方向:模式识别、计算机视觉;马建设(1965-),男,河南郑州人,副教授,博士,主要研究方向:计算机视觉、机器人控制;苏萍(1975-),女,河南洛阳人,讲师,博士,主要研究方向:三维全息显示系统、图像处理.
基金资助:
国家863计划项目（2015AA043302）；2013年广东省省部产学研合作项目（2013A090100002）；2014年广东省花都区科技计划项目（HD14ZD004）。

Abstract

Abstract: The traditional image labeling models often have two deficiencies; they only can model short-range contextual information in pixel-level of the image and have a complicated inference. To improve the precision of image labeling, the fully-connected Conditional Random Field (CRF) model was used; to simplify the inference of the model, the mean filed approximation based on Gaussian kd-tree for inference was proposed. To verify the effectiveness of the proposed algorithm, the experimental image datasets not only contained the standard picture library MSRC-9, but also contained MyDataset_1 (machine parts) and MyDataset_2 (office table) which made by authors. The precisions of the proposed method on those three datasets are 77.96%, 97.15% and 95.35% respectively, and the mean cost time of each picture is 2s. The results indicate that the fully-connected CRF model can improve the precision of image labeling by considering the contextual information of image and the mean field approximation using Gaussian kd-tree can raise the efficiency of inference.

Key words: Conditional Random Field (CRF), image labeling, contextual information, Gaussian kd-tree, model inference

摘要： 传统的图像标注模型通常存在两个问题：只能够对短距离的像素上下文信息进行建模和复杂的模型推理过程。为了提高图像标注的精度、简化图像标注的模型推理过程，采用完全联系的条件随机场模型进行图像标注，提出利用基于高斯kd树的平均场估计方法实现该模型的高效推理。为了更好地验证算法的有效性，实验的图片数据库不仅包含标准的图片库--剑桥大学微软研究图片库（MSRC-9），还包含作者制作的机械零件图片库（MyDataset_1）和办公桌图片库（MyDataset_2）。新算法在三个图片库上的平均标注精度分别可以达到77.96%、97.15%和95.35%，每幅图的平均运行时间为2s。实验结果表明，基于完全联系的条件随机场的图像标注能够更充分地考虑不同的像素上下文信息来提高标注精度，而基于高斯kd树的模型推理能够提高模型推理的效率。

关键词: 条件随机场, 图像标注, 上下文信息, 高斯kd树, 模型推理

CLC Number:

TP391.413

LIU Tong, HUANG Xiutian, MA Jianshe, SU Ping. Image labeling based on fully-connected conditional random field[J]. Journal of Computer Applications, 2017, 37(10): 2841-2846.

刘彤, 黄修添, 马建设, 苏萍. 基于完全联系的条件随机场的图像标注[J]. 计算机应用, 2017, 37(10): 2841-2846.

References

[1] LAFFERTY J D, McCALLUM A, PEREIRA F C N. Conditional random fields: probabilistic models for segmenting and labeling sequence data[C]//ICML 2001: Proceedings of the Eighteenth International Conference on Machine Learning. San Francisco: Morgan Kaufmann Publishers, 2001: 282-289.
[2] 杨耘, 徐丽. 基于分层特征关联条件随机场的遥感图像分类[J]. 计算机应用, 2014, 34(6): 1741-1745. (YANG Y, XU L. Remote sensing image classification based on conditional random field with hierarchical correlated features[J]. Journal of Computer Applications, 2014, 34(6): 1741-1745.)
[3] 李林, 练金, 吴跃, 等. 基于概率图模型的图像整体场景理解综述[J]. 计算机应用, 2014, 34(10): 2913-2921. (LI L, LIAN J, WU Y, et al. The review of image scenario understanding based on graphical models[J]. Journal of Computer Applications, 2014, 34(10): 2913-2921.)
[4] 张微, 汪西莉. 基于超像素的条件随机场图像分类[J]. 计算机应用, 2012, 32(5): 1272-1275. (ZHANG W, WANG X L. Image classification based on super-pixel condition random fields[J]. Journal of Computer Applications, 2012, 32(5): 1272-1275.)
[5] KOHLIEMAIL P, LADICKY L, TORR P H S. Robust higher order potentials for enforcing label consistency[J]. International Journal of Computer Vision, 2009, 82(3): 302-324.
[6] LADICKY L, RUSSELL C, KOHLI P, et al. Associative hierarchical CRFs for object class image segmentation[C]//Proceedings of the 2009 IEEE 12th International Conference on Computer Vision. Piscataway, NJ: IEEE, 2009: 739-746.
[7] SHOTTON J, WINN J, ROTHER C, et al. TextonBoost for image understanding: multi-class object recognition and segmentation by jointly modeling texture, layout, and context[J]. International Journal of Computer Vision, 2009, 81(1): 2-23.
[8] KRÄHENBVHL P, KOLTUN V. Efficient inference in fully connected CRFs with Gaussian edge potentials[EB/OL]. [2017-01-10]. https://arxiv.org/pdf/1210.5644.pdf.
[9] VINEET V, WARRELL J, TORR P H S. Filter-based mean-field inference for random fields with higher-order terms and product label-spaces[J]. International Journal of Computer Vision, 2014, 110(3): 290-307.
[10] HE X, ZEMEL R S, CARREIRA-PERPI M A. Multiscale conditional random fields for image labeling[C]//CVPR 2004: Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Washington, DC: IEEE Computer Society, 2004,2:695-702.
[11] ABNEY S, SCHAPIRE R E, SINGER Y. Boosting applied to tagging and PP attachment[EB/OL]. [2017-01-10]. http://www. vinartus. net/spa/98b. pdf.
[12] PINTO D, McCALLUM A, WEI X, et al. Table extraction using conditional random fields[C]//Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. New York: ACM, 2003: 235-242.
[13] TOYODA T, HASEGAWA O. Random field model for integration of local information and global information[J]. IEEE Transactions on Pattern Analysis & Machine Intelligence, 2008, 30(8): 1483.
[14] ADAMS A, GELFAND N, DOLSON J, et al. Gaussian KD-trees for fast high-dimensional filtering[J]. ACM Transactions on Graphics, 2009, 28(3): 1-12.
[15] KOLLER D, FRIEDMAN N. Probabilistic Graphical Models: Principles and Techniques-Adaptive Computation and Machine Learning[M]. Cambridge: MIT Press, 2009.
[16] SMITH S W. The Scientist and Engineer's Guide to Digital Signal Processing[M]. San Diego, CA: California Technical Publishing, 1997: 503-534.
[17] PARIS S, DURAND F. A fast approximation of the bilateral filter using a signal processing approach[J]. International Journal of Computer Vision, 2009, 81(1): 24-52.

Image labeling based on fully-connected conditional random field

基于完全联系的条件随机场的图像标注

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics

[1]	Yongfeng DONG, Jiaming BAI, Liqin WANG, Xu WANG. Chinese named entity recognition combining prior knowledge and glyph features [J]. Journal of Computer Applications, 2024, 44(3): 702-708.
[2]	Qingtang LIU, Xinqian MA, Jie ZHOU, Linjing WU, Pengxiao ZHOU. Understanding of math word problems integrating commonsense knowledge base and grammatical features [J]. Journal of Computer Applications, 2023, 43(2): 356-364.
[3]	Zixing YU, Shaojun QU, Xin HE, Zhuo WANG. High-low dimensional feature guided real-time semantic segmentation network [J]. Journal of Computer Applications, 2023, 43(10): 3077-3085.
[4]	Kai WEN, Weiwei TANG, Junchen XIONG. Real-time segmentation algorithm based on attention mechanism and effective factorized convolution [J]. Journal of Computer Applications, 2022, 42(9): 2659-2666.
[5]	HU Tiantian, DAN Yabo, HU Jie, LI Xiang, LI Shaobo. News named entity recognition and sentiment classification based on attention-based bi-directional long short-term memory neural network and conditional random field [J]. Journal of Computer Applications, 2020, 40(7): 1879-1883.
[6]	LIU Jing, WU Yingfei, YUAN Zhenming, SUN Xiaoyan. Blood pressure prediction with multi-factor cue long short-term memory model [J]. Journal of Computer Applications, 2019, 39(5): 1551-1556.
[7]	LIAO Bin, LI Haowen. Image depth estimation model based on atrous convolutional neural network [J]. Journal of Computer Applications, 2019, 39(1): 267-274.
[8]	ZHOU Shuangshuang, XU Jin'an, CHEN Yufeng, ZHANG Yujie. New words detection method for microblog text based on integrating of rules and statistics [J]. Journal of Computer Applications, 2017, 37(4): 1044-1050.
[9]	WANG Ting, WANG Qi, HUANG Yueqi, YIN Yichao, GAO Ju. Automatic hyponymy extracting method based on symptom components [J]. Journal of Computer Applications, 2017, 37(10): 2999-3005.
[10]	HUANG Nian'e, HUANG He, WANG Rujing. Agriculture-related product name extraction and category labeling based on ontology and conditional random field [J]. Journal of Computer Applications, 2017, 37(1): 233-238.
[11]	LIU Chunli, LI Xiaoge, LIU Rui, FAN Xian, DU Liping. Chinese word segment based on character representation learning [J]. Journal of Computer Applications, 2016, 36(10): 2794-2798.
[12]	ZHOU Xiang, LI Shaobo, YANG Guanci. Entity recognition of clothing commodity attributes [J]. Journal of Computer Applications, 2015, 35(7): 1945-1949.
[13]	LIU Li, WANG Yongheng, WEI Hang. Fine-grained sentiment analysis oriented to product comment [J]. Journal of Computer Applications, 2015, 35(12): 3481-3486.
[14]	MO Yiwen, JI Donghong, HUANG Jiangping. Slight-pause marks boundary identification based on conditional random field [J]. Journal of Computer Applications, 2015, 35(10): 2838-2842.
[15]	YANG Yufei DAI Qi JIA Zhen YI Hongfeng. Weakly supervised method for attribute relation extraction [J]. Journal of Computer Applications, 2014, 34(1): 64-68.