基于多模态关联图的图像语义标注方法

计算机应用 ›› 2010, Vol. 30 ›› Issue (12): 3295-3297.

基于多模态关联图的图像语义标注方法

郭玉堂¹,罗斌²

1. 合肥师范学院
2.

收稿日期:2010-05-20 修回日期:2010-08-08 发布日期:2010-12-22 出版日期:2010-12-01
通讯作者: 郭玉堂
基金资助:
基于内容的视频信息结构化建模方法研究;基于图理论的图像语义自动标注研究

Image semantic annotation method based on multi-modal relational graph

Received:2010-05-20 Revised:2010-08-08 Online:2010-12-22 Published:2010-12-01
Contact: Guo YuTang

摘要/Abstract

摘要： 为了改善图像标注的性能，提出了一种基于多模态关联图的图像语义标注方法。该方法用一个无向图表达了图像区域特征、标注词以及图像三者之间的关系，结合图像区域特征相似性和语义间的相关性提取图像语义信息，提高了图像标注的精度。利用逆向文档频率（IDF）修正图像节点与其标注词节点之间边的权值，克服了传统方法中因高频词引起的偏差，有效地提高了图像标注的性能。在Corel图像数据集上进行了实验，实验结果验证了该方法的有效性。

关键词: 图像语义, 多模态图, 逆向文档频率, 高频词

Abstract: In order to improve the performance of the image annotation, an image semantic annotation method based on multi-modal relational graph was proposed. The relationship between the low-level features of the image region, annotated words and images was presented by an undirected graph. Semantic information was extracted by combining similarity measured in the region feature space and the correlation of annotation words to improve the accuracy of the extracted semantics. Inverse Document Frequency (IDF) was introduced to adjust the weights of edges between the image node and its annotation words node in order to overcome the deviation caused by high-frequency words. It can effectively improve the image annotation performance. The experimental results on the Corel image datasets show the effectiveness of the proposed approach in terms of quality of the image annotation.

Key words: image semantic, multi-modal graph, Inverse Document Frequency （IDF）, high-frequency word

郭玉堂罗斌. 基于多模态关联图的图像语义标注方法[J]. 计算机应用, 2010, 30(12): 3295-3297.

[1]	赵小虎, 李晓. 基于多特征提取的图像语义描述算法[J]. 计算机应用, 2021, 41(6): 1640-1646.
[2]	胡嵽, 冯子亮. 基于深度学习的轻量级道路图像语义分割算法[J]. 计算机应用, 2021, 41(5): 1326-1331.
[3]	董阳, 潘海为, 崔倩娜, 边晓菲, 滕腾, 王邦菊. 面向多模态磁共振脑瘤图像的小样本分割方法[J]. 计算机应用, 2021, 41(4): 1049-1054.
[4]	赵一, 段兴, 谢仕义, 梁春林. 面向特定目标自识别的交通图像语义检索方法[J]. 计算机应用, 2020, 40(2): 553-560.
[5]	王玉龙, 蒲军, 赵江华, 黎建辉. 基于生成对抗网络的地面新增建筑检测[J]. 计算机应用, 2019, 39(5): 1518-1522.
[6]	王丽芳, 成茜, 秦品乐, 高媛. 基于多通道稀疏编码的非刚性多模态医学图像配准[J]. 计算机应用, 2018, 38(4): 1127-1133.
[7]	王丽芳, 董侠, 秦品乐, 高媛. 基于自适应联合字典学习的脑部多模态图像融合方法[J]. 计算机应用, 2018, 38(4): 1134-1140.
[8]	史婷婷闫大顺沈玉利. 基于个性化本体的图像语义标注和检索[J]. 计算机应用, 2010, 30(1): 90-93.