Journal of Computer Applications ›› 2022, Vol. 42 ›› Issue (10): 3177-3183.DOI: 10.11772/j.issn.1001-9081.2021091614

• Multimedia computing and computer simulation • Previous Articles    

Waterweed image segmentation method based on improved U-Net

Qiwen WU, Jianhua WANG, Xiang ZHENG, Ju FENG, Hongyan JIANG, Yubo WANG   

  1. Key Laboratory of Marine Technology and Control Engineering,Ministry of Transport (Shanghai Maritime University),Shanghai 201306,China
  • Received:2021-09-13 Revised:2021-12-24 Accepted:2022-01-04 Online:2022-04-15 Published:2022-10-10
  • Contact: Jianhua WANG
  • About author:WU Qiwen, born in 1997, M. S. candidate. Her research interests include machine vision.
    WANG Jianhua, born in 1965, Ph. D. , associate professor. His research interests include surface robot, machine vision.
    ZHENG Xiang, born in 1981, Ph. D. , lecturer. Her research interests include motion control of unmanned surface vehicles, vibration control of smart structures.
    FENG Ju, born in 1997, M. S. candidate. Her research interests include unmanned surface vehicle perception decision-making and control.
    JIANG Hongyan, born in 1997, M. S. candidate. His research interests include ship and port automation technology.
    WANG Yubo, born in 1997, M. S. candidate. His research interests include unmanned surface vehicle visual navigation,autonomous berthing.
  • Supported by:
    National Natural Science Foundation of China(62176150)

基于改进U-Net的水草图像分割方法

吴奇文, 王建华, 郑翔, 冯居, 姜洪岩, 王昱博   

  1. 航运技术与控制工程交通运输行业重点实验室(上海海事大学),上海 201306
  • 通讯作者: 王建华
  • 作者简介:第一联系人:吴奇文(1997—),女,江西上饶人,硕士研究生,主要研究方向:机器视觉
    王建华(1965—),男,云南鹤庆人,副教授,博士,主要研究方向:水面机器人、机器视觉; jhwang@shmtu.edu.cn
    郑翔(1981—),女,江苏扬州人,讲师,博士,主要研究方向:无人艇运动控制、智能结构减振控制
    冯居(1997—),女,江苏盐城人,硕士研究生,主要研究方向:无人艇感知决策与控制
    姜洪岩(1997—),男,吉林德惠人,硕士研究生,主要研究方向:船舶与港口自动化技术
    王昱博(1997—),男,河南洛阳人,硕士研究生,主要研究方向:无人艇视觉导航、自主靠泊。
  • 基金资助:
    国家自然科学基金资助项目(62176150)

Abstract:

During the operation of the Unmanned Surface Vehicles (USVs), the propellers are easily gotten entangled by waterweeds, which is a problem encountered by the whole industry. Concerning the global distribution, dispersivity, and complexity of the edge and texture of waterweeds in the water surface images, the U-Net was improved and used to classify all pixels in the image, in order to reduce the feature loss of the network, and enhance the extraction of both global and local features, thereby improving the overall segmentation performance. Firstly, the image data of waterweeds in multiple locations and multiple periods were collected, and a comprehensive dataset of waterweeds for semantic segmentation was built. Secondly, three scales of input images were introduced into the network to enable full extraction of the features via the network, and three loss functions for the upsampled images were introduced to balance the overall loss brought by the three different scales of input images. In addition, a hybrid attention module, including the dilated convolution branch and the channel attention enhancement branch, was proposed and introduced to the network. Finally, the proposed network was verified on the newly built waterweed dataset. Experimental results show that the accuracy, mean Intersection over Union (mIoU) and mean Pixel Accuracy (mPA) values of the proposed method can reach 96.8%, 91.22% and 95.29%, respectively, which are improved by 4.62 percentage points, 3.87 percentage points and 3.12 percentage points compared with those of U-Net (VGG16) segmentation method. The proposed method can be applied to unmanned surface vehicles for detection of waterweeds, and perform the corresponding path planning to realize waterweed avoidance.

Key words: waterweed, U-Net, semantic segmentation, attention mechanism, multi-scale input, loss function

摘要:

无人艇(USV)在河道水面作业过程中,水草会缠绕推进器,这是整个业界应用都遇到的困扰。针对水面图像中水草分布的全局性、分散性以及边缘和纹理的复杂性,对U-Net进行改进并用于对图像所有的像素进行分类,以减少网络特征信息的丢失,并加强全局和局部特征的提取,从而提高分割性能。首先,采集多地多时段水草图像数据,制作了一个比较全面的水草语义分割数据集;其次,提出在U-Net中引入三个尺度的图像输入,从而使得网络对特征进行充分提取,并引进三种上采样图像的损失函数来平衡三种尺度的输入图像带来的总体损失;此外,还提出了一种混合注意力模块并引入到网络中,其包含空洞卷积和通道注意增强两个分支;最后,在新构建的水草数据集上对所提网络进行验证。实验结果显示,所提方法的准确率、均交并比(mIoU)和平均像素精度(mPA)值分别可达96.8%、91.22%和95.29%,与U-Net(VGG16)分割方法相比,分别提高了4.62个百分点、3.87个百分点和3.12个百分点。所提方法可应用于水面无人艇对水草的检测,并进行相应的路径规划来实现水草避让。

关键词: 水草, U-Net, 语义分割, 注意力机制, 多尺度输入, 损失函数

CLC Number: