《计算机应用》唯一官方网站 ›› 2022, Vol. 42 ›› Issue (4): 1284-1291.DOI: 10.11772/j.issn.1001-9081.2021071279

• CCF第36届中国计算机应用大会 (CCF NCCA 2021) • 上一篇    

结合长尾数据解决方法的野生动物目标检测

蔡前舟1, 郑伯川2(), 曾祥银2, 侯金3   

  1. 1.西华师范大学 数学与信息学院,四川 南充 637009
    2.西华师范大学 计算机学院,四川 南充 637009
    3.西华师范大学 生命科学学院,四川 南充 637009
  • 收稿日期:2021-07-16 修回日期:2021-08-23 接受日期:2021-08-27 发布日期:2022-04-28 出版日期:2022-04-10
  • 通讯作者: 郑伯川
  • 作者简介:蔡前舟(1996—),男,四川通江人,硕士研究生,CCF会员,主要研究方向:深度学习、目标检测
    曾祥银(1994—),男,四川大竹人,硕士研究生,主要研究方向:机器学习、深度学习
    侯金(1995—),男,四川攀枝花人,硕士研究生,主要研究方向:生物多样性保护、可持续发展。
  • 基金资助:
    国家自然科学基金资助项目(62176217);四川省科技创新苗子工程项目(2020029);西华师范大学大学生创新创业项目(cxcy2020150)

Wildlife object detection combined with solving method of long-tail data

Qianzhou CAI1, Bochuan ZHENG2(), Xiangyin ZENG2, Jin HOU3   

  1. 1.School of Mathematics and Information,China West Normal University,Nanchong Sichuan 637009,China
    2.School of Computer Science,China West Normal University,Nanchong Sichuan 637009,China
    3.College of Life Sciences,China West Normal University,Nanchong Sichuan 637009,China
  • Received:2021-07-16 Revised:2021-08-23 Accepted:2021-08-27 Online:2022-04-28 Published:2022-04-10
  • Contact: Bochuan ZHENG
  • About author:CAI Qianzhou, born in 1996, M. S. candidate. His research interests include deep learning, object detection.
    ZENG Xiangyin, born in 1994, M. S. candidate. His research interests include machine learning, deep learning.
    HOU Jin, born in 1995, M. S. candidate. His research interests include biodiversity conservation, sustainable development.
  • Supported by:
    National Natural Science Foundation of China(62176217);Program of Sichuan Science and Technology Innovation Seedling Project(2020029);Innovation and Entrepreneurship Project of College Students of China West Normal University(cycx2020150)

摘要:

基于红外相机图像的野生动物目标检测有利于研究和保护野生动物。由于不同种类的野生动物数量差别大,红外相机采集到的野生动物数据集存在种类数量分布不均的长尾数据问题,进而影响目标检测神经网络模型的整体性能提升。针对野生动物的长尾数据导致的目标检测精度低的问题,提出了一种基于两阶段学习和重加权相结合的长尾数据解决方法,并将该方法用于基于YOLOv4-Tiny的野生动物目标检测。首先,采集、标注并构建了一个新的野生动物数据集,该数据集具有明显的长尾数据特征;其次,采用基于迁移学习的两阶段方法训练神经网络,第一阶段在分类损失函数中采用无加权方式进行训练,而在第二阶段提出了两种改进的重加权方法,并以第一阶段所得权重作为预训练权重进行重加权训练;最后,对野生动物测试集进行测试。实验结果表明,在分类损失采用交叉熵损失函数和焦点损失函数下,所提出的长尾数据解决方法达到了60.47%和61.18%的平均精确率均值(mAP),相较于无加权方法在两种损失函数下分别提高了3.30个百分点和5.16个百分点,相较于所提改进的有效样本加权方法在焦点损失函数下提高了2.14个百分点,说明该方法能提升YOLOv4-Tiny网络对具有长尾数据特征的野生动物数据集的目标检测性能。

关键词: 长尾数据, 目标检测, 两阶段学习, 重加权, YOLOv4-Tiny

Abstract:

Wild animal object detection based on infrared camera images is conducive to the research and protection of wild animals. Because of the large difference in the number of different species of wildlife, there is the long-tail data problem of uneven distribution of numbers of species in the wildlife dataset collected by infrared cameras. This problem affects the overall performance improvement of the object detection neural network models. In order to solve the problem of low accuracy of object detection caused by long-tail data of wild animals, a method based on two-stage learning and re-weighting to solve long-tail data was proposed, and the method was applied to wildlife object detection based on YOLOv4-Tiny. Firstly, a new wildlife dataset with obvious long-tail data characteristics was collected, labelled and constructed. Secondly, a two-stage method based on transfer learning was used to train the neural network. In the first stage, the classification loss function was trained without weighting. In the second stage, two improved re-weighting methods were proposed, and the weights obtained in the first stage were used as the pre-training weights for re-weighting training. Finally, the wildlife test set was used to tested. Experimental results showed that the proposed long-tail data solving method achieved 60.47% and 61.18% mAP (mean Average Precision) with cross-entropy loss function and focal loss function as classification loss respectively, which was 3.30 percentage points and 5.16 percentage points higher than that the no-weighting method under the two loss functions, and 2.14 percentage points higher than that of the proposed improved effective sample weighting method under focus loss function. It shows that the proposed method can improve the object detection performance of YOLOv4-Tiny network for wildlife datasets with long-tail data characteristics.

Key words: long-tail data, object detection, two-stage learning, re-weighting, YOLOv4-Tiny

中图分类号: