《计算机应用》唯一官方网站 ›› 2022, Vol. 42 ›› Issue (2): 485-492.DOI: 10.11772/j.issn.1001-9081.2021020332

• 数据科学与技术 • 上一篇    

基于邻域熵的高光谱波段选择算法

翟东昌1, 陈红梅2()   

  1. 1.西南交通大学 唐山研究院,河北 唐山 063000
    2.西南交通大学 计算机与人工智能学院,成都 611756
  • 收稿日期:2021-03-05 修回日期:2021-04-30 接受日期:2021-05-07 发布日期:2021-05-18 出版日期:2022-02-10
  • 通讯作者: 陈红梅
  • 作者简介:翟东昌(1993—),男,甘肃兰州人,硕士研究生,主要研究方向:数据挖掘;
    陈红梅(1971—),女,四川成都人,教授,博士,CCF会员,主要研究方向:智能信息处理、数据挖掘。
  • 基金资助:
    国家自然科学基金资助项目(61976128);四川省国际科技创新合作重点项目(2019YFH0097)

Hyperspectral band selection algorithm based on neighborhood entropy

Dongchang ZHAI1, Hongmei CHEN2()   

  1. 1.TangShan Institute,Southwest Jiaotong University,Tangshan Hebei 063000,China
    2.School of Computer and Artificial Intelligence,Southwest Jiaotong University,Chengdu Sichuan 611756,China
  • Received:2021-03-05 Revised:2021-04-30 Accepted:2021-05-07 Online:2021-05-18 Published:2022-02-10
  • Contact: Hongmei CHEN
  • About author:ZHAI Dongchang, born in 1993, M. S. candidate. His research interests include data mining.
    CHEN Hongmei, born in 1971, Ph. D., professor. Her research interests include intelligent information processing, data mining.
  • Supported by:
    National Natural Science Foundation of China(61976182);Key Program for International Science and Technology Cooperation of Sichuan Province(2019YFH0097)

摘要:

为了减少高光谱图像数据中的冗余信息,优化计算效率,并提升图像数据后续应用的有效性,提出一种基于邻域熵(NE)的高光谱波段选择算法。首先,为了高效计算样本的邻域子集,采用了局部敏感哈希(LSH)作为近似最近邻的搜索策略;然后,引入了NE理论来度量波段和类之间的互信息(MI),并把最小化特征集合与类变量之间的条件熵作为选取有效波段的方法;最后,采用两个数据集,通过支持向量机(SVM)和随机森林(RM)进行分类实验。实验结果表明,相较于四种基于MI的特征选择算法,从总体精度以及Kappa系数上看,所提算法能够在30个波段内较快地选取有效波段子集,并达到局部最优。该算法的部分实验结果的总体精度以及Kappa系数分别达到全局最优的92.99%以及0.860 8,表明所提算法能有效地处理高光谱波段选择问题。

关键词: 波段选择, 高光谱图像, 互信息, 邻域熵, 近似最近邻搜索

Abstract:

In order to reduce the redundant information of hyperspectral image data, optimize the computational efficiency and improve the effectiveness of subsequent applications of image data, a hyperspectral band selection algorithm based on Neighborhood Entropy (NE) was proposed. Firstly, in order to efficiently calculate the neighborhood subset of samples, the Local Sensitive Hashing (LSH) was used as the nearest neighbor search strategy. Then, the NE theory was introduced to measure the Mutual Information (MI) between bands and classes, and minimization of the conditional entropy between feature sets and class variables was used as a method to select effective bands. Finally, two datasets were used to carry out classification experiments through Support Vector Machine (SVM) and Random Forest (RM). Experimental results show that, compared with four MI based feature selection algorithms, from the perspectives of overall accuracy and Kappa coefficient, the proposed algorithm can select effective band subset within 30 bands faster and achieve local optimization. Some experimental results of the proposed algorithm reach 92.99% and 0.860 8 at the global optimum on overall accuracy and Kappa coefficient respectively, verifying that the proposed algorithm can effectively deal with hyperspectral band selection problem.

Key words: band selection, hyperspectral image, Mutual Information (MI), Neighborhood Entropy (NE), approximate nearest neighbor search

中图分类号: