Journal of Computer Applications

    Next Articles

Lightweight network based on segmentation guidance and frequency domain enhancement for earhead counting

LI Yuxi1, WANG Yaying1, TIAN Shiqin2, ZHENG Bochuan1,3   

  1. 1 School of Computer Science, China West Normal University 2 School of Mathematics & Information, China West Normal University 3 Institute of Artificial Intelligence, China West Normal University
  • Received:2025-09-23 Revised:2025-11-03 Online:2026-03-13 Published:2026-03-13
  • Contact: Bo-Chuan ZHENG
  • About author:LI Yuxi, born in 2000, M. S. candidate. His research interests include machine learning, deep learning, object counting. WANG Yaying, born in 2002, M. S. candidate. Her research interests include machine learning, deep learning, object counting. TIAN Shiqin, born in 2001, M. S. candidate. Her research interests include machine learning, deep learning, image segmentation. ZHENG Bochuan, born in 1974, Ph. D. professor. His research interests include machine learning, deep learning, computer vision.
  • Supported by:
    National Nature Science Foundation of China (62176217)

基于分割引导和频域增强的轻量级谷穗计数网络

李雨希1,王雅莹1,田诗芹2,郑伯川1,3   

  1. 1.西华师范大学 计算机学院 2.西华师范大学 数学科学学院 3.西华师范大学 人工智能研究所
  • 通讯作者: 郑伯川
  • 作者简介:李雨希(2000—),男,四川遂宁人,硕士研究生,主要研究方向:机器学习、深度学习、目标计数;王雅莹(2002—),女,四川资阳人,硕士研究生,主要研究方向:机器学习、深度学习、目标计数;田诗芹(2001—),女,四川成都人,硕士研究生,主要研究方向:机器学习、深度学习、图像分割;郑伯川(1974—),男,四川自贡人,教授,博士,CCF会员,主要研究方向:机器学习、深度学习、计算机视觉。
  • 基金资助:
    国家自然科学基金面上项目(62176217)。

Abstract: Convolutional neural network (CNN) and Transformers, which are widely used for Earhead counting, continue to struggle with capturing long-range dependencies efficiently and often exhibit false positives or false negatives in densely populated regions. To overcome these limitations, a novel counting approach named PCLMamba is proposed in this paper. This lightweight network is built on a Mamba-based architecture and incorporates segmentation guidance and frequency domain enhancement. A Grouping Directional Mamba Model (GDMM) is introduced, which processes image sequences in groups and performs parallel selective scanning in different orders, thereby efficiently capturing long-range dependencies with low computational cost. Local features and globally scanned features are integrated via a Channel Attention Fusion Module (CAFM). To reduce false positives and negatives, a segmentation module (Segmenter) is designed to supply prior knowledge, which suppresses background interference while highlighting foreground regions. Additionally, a Frequency Spatial Enhancement module(FSE) is proposed to reinforce intermediate features in both spatial and frequency domains, further boosting counting accuracy. Experimental evaluations on multiple plant counting datasets including MTC, WED, and SHC perform well. In particular, the Mean Absolute Error (MAE) of the three datasets above reaches 4.4, 4.2, and 15.4 respectively, and other performance is also better than the comparison method as a whole.

Key words: smart agriculture, earhead counting, State-Space Model (SSM), Fast Fourier Transform (FFT), frequency enhancement

摘要: 基于卷积神经网络(CNN)与Transformer的谷穗计数方法仍存在长距离建模困难、计算效率低以及密度误报与密度漏报的问题。为解决这些问题,提出名为PCLMamba的新计数方法。PCLMamba是一个基于分割引导与频域增强的轻量Mamba网络。该网络通过引入分组的多方向状态空间模型(GDMM)分组处理图像序列且并行地送入不同顺序的选择性扫描中,轻量且高效地捕获长距离关系。然后利用通道注意力融合模块(CAFM)融合局部特征与全局扫描特征。为了解决密度误报与密度漏报的问题,设计了分割模块(Segmenter)提供先验知识,以抑制背景和增强前景。同时设计了双域信息增强模块(FSE),该模块将中间特征在空域与频域上进行处理增强,进一步提高计数精度。实验结果表明,PCLMamba在MTC、WED、SHC数据集的计数任务上表现出色,平均绝对误差(MAE)分别达到4.4、4.2、15.4,且其他指标也整体优于对比方法。

关键词: 智慧农业, 谷穗计数, 状态空间模型, 快速傅里叶变换, 频域增强

CLC Number: