Journal of Computer Applications ›› 2025, Vol. 45 ›› Issue (4): 1310-1316.DOI: 10.11772/j.issn.1001-9081.2024030387

• Multimedia computing and computer simulation • Previous Articles     Next Articles

Remote sensing image building extraction network based on dual promotion of semantic and detailed features

Yang ZHOU, Hui LI()   

  1. National Key Laboratory of Fundamental Science on Synthetic Vision (Sichuan University),Chengdu Sichuan 610065,China
  • Received:2024-04-03 Revised:2024-07-24 Accepted:2024-07-26 Online:2024-08-28 Published:2025-04-10
  • Contact: Hui LI
  • About author:ZHOU Yang, born in 1991, M. S. candidate. His research interests include computer vision, image processing, urban 3D construction.
  • Supported by:
    National Natural Science Foundation of China(U20A20161)

基于语义和细节特征双促进的遥感影像建筑物提取网络

周阳, 李辉()   

  1. 视觉合成图形图像技术国家级重点实验室(四川大学),成都 610065
  • 通讯作者: 李辉
  • 作者简介:周阳(1991—),男,江苏盐城人,硕士研究生,主要研究方向:计算机视觉、图像处理、城市三维构建
  • 基金资助:
    国家自然科学基金资助项目(U20A20161)

Abstract:

Accurate edge information extraction is crucial for building segmentation. Current approaches often simply fuse multi-scale detailed features with semantic features or design complex loss functions to guide the network’s focus on edge information, ignoring the mutual promotion effect between semantic and detailed features. To address these issues, a remote sensing image building extraction network based on dual promotion of semantic and detailed features was developed. The structure of the proposed network was similar to the framework of U-Net. The shallow high-resolution detailed feature maps were extracted in the encoder, and the deep Semantic and Detail Feature dual Facilitation module(SDFF) was embedded in the backbone network in the decoder, so as to enable the network to have both good semantic and detail feature extraction capabilities. After that, channel fusion was performed on semantic and detailed features, and combined with edge loss supervision of images with varying resolutions, the ability to extract building details and the generalization of the network were enhanced. Experimental results demonstrate that compared to various mainstream methods such as U-Net and Dual-Stream Detail-Concerned Network (DSDCNet), the proposed network achieves superior semantic segmentation results on WHU and Massachusetts buildings (Massachusetts) datasets, showing better preservation of building edge features and effective improvement of building segmentation accuracy in remote sensing images.

Key words: remote sensing image, feature fusion, edge extraction, deep learning, semantic segmentation

摘要:

提取准确的边缘信息对分割建筑物至关重要。将多尺度细节与语义特征进行简单融合,或者设计复杂的损失函数引导网络关注边缘信息是当前较常见的方法,然而这些方法很少关注语义和细节特征的相互促进作用。针对该问题,提出一种基于语义和细节特征双促进的遥感影像建筑物提取网络。所提网络的结构类似U-Net框架,在编码端提取浅层高分辨率细节特征图,在解码端将深层的语义与细节特征双促进模块(SDFF)嵌入主干网络中,从而使网络同时具备较好的语义特征和细节特征的提取能力。之后对语义和细节特征进行通道融合,并结合不同分辨率影像的边缘损失监督,提高网络对建筑物细节的提取能力和泛化性。实验结果表明:与U-Net和双路细节关注网络(DSDCNet)等多种主流方法相比,所提网络在WHU数据集和马萨诸塞州建筑物(Massachusetts)数据集上均取得了最佳的语义分割结果。可见,所提网络能更好地保留建筑物边缘特征,有效提升遥感影像中的建筑物分割精度。

关键词: 遥感影像, 特征融合, 边缘提取, 深度学习, 语义分割

CLC Number: