Journal of Computer Applications ›› 2017, Vol. 37 ›› Issue (10): 2973-2977.DOI: 10.11772/j.issn.1001-9081.2017.10.2973

Previous Articles     Next Articles

Component retrieval method based on identification of faceted classification and cluster tree

QIAN Xiaojie, DU Shenghao   

  1. School of Information Engineering, Zhengzhou University, Zhengzhou Henan 450001, China
  • Received:2017-05-11 Revised:2017-06-29 Online:2017-10-10 Published:2017-10-16
  • Supported by:
    This work is partially supported by the National Social Science Foundation of China (14BYY096).

基于刻面分类标识和聚类树的构件检索方法

钱晓捷, 杜胜浩   

  1. 郑州大学 信息工程学院, 郑州 450001
  • 通讯作者: 杜胜浩(1990-),男,河南濮阳人,硕士研究生,主要研究方向:软件体系结构、数据挖掘,E-mail:iezzdxdsh@126.com
  • 作者简介:钱晓捷(1963-),男,江苏无锡人,副教授,硕士,CCF会员,主要研究方向:嵌入式系统、计算机系统结构;杜胜浩(1990-),男,河南濮阳人,硕士研究生,主要研究方向:软件体系结构、数据挖掘.
  • 基金资助:
    国家社会科学基金资助项目(14BYY096)。

Abstract: To quickly and efficiently retrieve the target component from a large software component library, a component retrieval method based on identification of faceted classification and cluster tree was proposed. The component with facet classification identification was described by using the set of component identification, which overcomes the impact of subjective factors when only using facets classification to describe and retrieve components. By introducing cluster tree, the component cluster tree was established by analysis clustering of components based on semantic similarity, thus narrowing the retrieval area, reducing the number of comparisons with component libary, and improving the search efficiency. Finally, the proposed method was experimented and compared with other common retrieval methods. The results show that the precision of the proposed method is 88.3% and the recall ratio is 93.1%; moreover, the proposed method also has a good retrieval effect when searching in a large-scale component library.

Key words: software component, faceted classification, component identification, cluster tree, component retrieval

摘要: 针对如何从规模庞大的软件构件库中快速且高效地检索出目标构件的问题,提出一种基于刻面分类标识和聚类树的构件检索方法。使用构件标识集合对构件进行刻面分类标识描述,克服了单纯采用刻面分类法对构件进行分类描述和检索时带来的主观因素的影响;引入聚类树的思想,对构件进行基于语义相似度的聚类分析,建立构件聚类树,能有效地缩小检索范围,减少检索构件与构件库中构件比较的次数,提高检索效率。最后在实验中与一般检索方法对比,实验结果表明该方法的构件查准率为88.3%,查全率为93.1%;而且在大规模的构件库中使用时依然有良好的检索效果。

关键词: 软件构件, 刻面分类, 构件标识, 聚类树, 构件检索

CLC Number: