Journal of Computer Applications ›› 2023, Vol. 43 ›› Issue (12): 3755-3763. DOI: 10.11772/j.issn.1001-9081.2023010094

Special Issue: Data science and technology

• Data science and technology •

Agglomerative hierarchical clustering algorithm based on hesitant fuzzy set

Wenquan LI, Yimin MAO, Xindong PENG

  1. School of Information Engineering, Shaoguan University, Shaoguan, Guangdong 512005, China
  • Received: 2023-02-07 Revised: 2023-05-05 Accepted: 2023-05-08 Online: 2023-06-06 Published: 2023-12-10
  • Contact: Wenquan LI
  • About author: MAO Yimin, born in 1970, Ph. D., professor. Her research interests include data mining and big data security.
    PENG Xindong, born in 1990, Ph. D., associate professor. His research interests include fuzzy mathematics and artificial intelligence.
  • Supported by:
    National Natural Science Foundation of China (62006155); Scientific Research Project of Department of Education of Guangdong Province (2022ZDJS048); Characteristic Innovation Project in Ordinary Universities in Guangdong Province (2023KTSCX137)


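  • About author: LI Wenquan, born in 1980, a native of Longnan, Jiangxi, M. S., associate professor. His research interests include data mining and fuzzy mathematics.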


To address information distortion, poor objectivity of attribute weights, and high time complexity in hesitant fuzzy clustering analysis, an Agglomerative Hierarchical Clustering algorithm based on Hesitant Fuzzy set (AHCHF) was proposed. Firstly, data objects with a smaller hesitation degree were extended using the average value of their hesitant fuzzy elements. Secondly, the weights of the data objects before and after extension were calculated using the original information entropy and the internal maximum difference, and the comprehensive attribute weight was determined from the minimum discrimination information between the two weight vectors. Finally, a method for constructing center points with a constant hesitation degree was given, with the goal of reducing the sum of weighted distances. Experimental results on specific examples and synthetic datasets show that, compared with the classic Hesitant Fuzzy Hierarchical Clustering algorithm (HFHC) and the recent Fuzzy Hierarchical Clustering Algorithm (FHCA), AHCHF increases the mean Silhouette Coefficient (SC) by 23.99% and 9.28% respectively, and shortens the running time by 27.18% and 6.40% on average, respectively. These results indicate that the proposed algorithm effectively mitigates information distortion and poor objectivity of attribute weights, and improves both clustering quality and performance.
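The first step of the abstract, extending shorter hesitant fuzzy elements with their own average, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `extend_with_mean` and `weighted_hamming_distance` are hypothetical, and the distance used here is the commonly used weighted normalized Hamming distance for hesitant fuzzy sets, chosen as an assumption because the abstract does not state the exact distance formula. Sorting the values before comparison is likewise an assumed convention.

```python
from statistics import mean


def extend_with_mean(hfe, target_len):
    """Pad a hesitant fuzzy element (membership degrees in [0, 1]) with its
    own mean until it holds target_len values, then sort ascending.

    Padding with the mean keeps the element's average unchanged.
    """
    if not hfe:
        raise ValueError("a hesitant fuzzy element needs at least one value")
    if len(hfe) > target_len:
        raise ValueError("target_len is shorter than the element")
    padded = list(hfe) + [mean(hfe)] * (target_len - len(hfe))
    return sorted(padded)


def weighted_hamming_distance(a, b, weights):
    """Weighted normalized Hamming distance between two data objects.

    a, b: lists of hesitant fuzzy elements, one per attribute.
    weights: attribute weights (assumed non-negative and summing to 1).
    This distance is an assumption for illustration only.
    """
    total = 0.0
    for hfe_a, hfe_b, w in zip(a, b, weights):
        n = max(len(hfe_a), len(hfe_b))
        ext_a = extend_with_mean(hfe_a, n)  # the shorter element is extended
        ext_b = extend_with_mean(hfe_b, n)
        total += w * sum(abs(x - y) for x, y in zip(ext_a, ext_b)) / n
    return total


# Two objects described by two attributes; the first attribute has
# different hesitation degrees (2 values versus 3 values).
obj_a = [[0.2, 0.6], [0.5]]
obj_b = [[0.3, 0.4, 0.7], [0.9]]
distance = weighted_hamming_distance(obj_a, obj_b, [0.5, 0.5])
```

In an agglomerative scheme, a distance of this kind would be evaluated between cluster centers at each merge step; how AHCHF builds those centers and derives the attribute weights is described in the paper itself and is not reproduced here.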

Key words: hesitant fuzzy set, clustering analysis, hesitation, data mining, fuzzy entropy



