《计算机应用》唯一官方网站 ›› 2020, Vol. 40 ›› Issue (2): 459-464.DOI: 10.11772/j.issn.1001-9081.2019091662

• 第36届CCF中国数据库学术会议(NDBC 2019) • 上一篇    下一篇

基于BTM的物联网服务发现方法

王舒漫, 李爱萍(), 段利国, 付佳, 陈永乐   

  1. 太原理工大学 信息与计算机学院,太原 030024
  • 收稿日期:2019-08-12 修回日期:2019-09-29 接受日期:2019-10-24 发布日期:2019-10-31 出版日期:2020-02-10
  • 通讯作者: 李爱萍
  • 作者简介:王舒漫(1994—),女,山西长治人,硕士研究生,主要研究方向:服务计算、信息处理
    段利国(1970—),男,山西繁峙人,副教授,博士,CCF高级会员,主要研究方向:中文信息处理、知识图谱
    付佳(1995—),女,内蒙古通辽人,硕士研究生,主要研究方向:服务计算、信息处理
    陈永乐(1983—),男,山东潍坊人,副教授,博士,CCF会员,主要研究方向:无线传感网络、物联网安全。
  • 基金资助:
    国家重点研发计划“网络空间安全”专项子课题资助项目(2018YFB0803402)

Service discovery method for Internet of Things based on Biterm topic model

Shuman WANG, Aiping LI(), Liguo DUAN, Jia FU, Yongle CHEN   

  1. College of Information and Computer,Taiyuan University of Technology,Taiyuan Shanxi 030024,China
  • Received:2019-08-12 Revised:2019-09-29 Accepted:2019-10-24 Online:2019-10-31 Published:2020-02-10
  • Contact: Aiping LI
  • About author:WANG Shuman, born in 1994, M. S. candidate. Her research interests include service computing, information processing.
    DUAN Liguo, born in 1970, Ph. D., associate professor. His research interests include Chinese information processing, knowledge mapping.
    FU Jia, born in 1995, M. S. candidate. Her research interests include service computing, information processing.
    CHEN Yongle, born in 1983, Ph. D., associate professor. His research interests include wireless sensor network, IoT security.
  • Supported by:
    the National Key R&D Programe of China (Rsearch and Development Program — "cyberspace security"(2018YFB0803402)

摘要:

针对物联网(IoT)服务描述文本篇幅较短、特征稀疏,直接采用传统的主题模型对IoT服务建模得到的聚类效果不佳,从而导致无法发现最佳服务的问题,提出了一种基于BTM的IoT服务发现方法。该方法首先利用BTM挖掘现有IoT服务的隐含主题,并通过全局主题分布和主题-词分布计算推理得到服务文档-主题概率分布;其次利用K-means算法对服务进行聚类,并返回服务请求的最佳匹配结果。实验结果分析表明,该方法能够有效提高IoT服务的聚类效果,从而得到匹配的最佳服务。与现有的HDP(Hierarchical Dirichlet Process)、基于K-means的隐狄利克雷分配(LDA-K)等方法相比,该方法进行最佳服务发现的准确度(Precision)和归一化折损累积增益(NDCG)均有一定幅度的提高。

关键词: 物联网服务, BTM, 短文本, 主题建模, 服务发现

Abstract:

Service description texts for Internet of Things (IoT) are short in length and sparse in text features, and direct modeling the IoT service by using traditional topic model has poor clustering effect, so that the best service cannot be discovered. To solve this problem, an IoT service discovery method based on Biterm Topic Model (BTM) was proposed. Firstly, BTM was employed to mine the latent topic of the existing IoT services, and the service document-topic probability distribution was calculated and deduced through global topic distribution and theme-word distribution. Then, K-means algorithm was used to cluster the services and return the best matching results of service requests. Experimental results show that the proposed method can improve the clustering effect of services for IoT and thus obtain the matched best service. Compared with the methods of HDP (Hierarchical Dirichlet Process) and LDA-K (Latent Dirichlet Allocation based on K-means), the proposed method achieves better performance in terms of Precision and Normalized Discounted Cumulative Gain (NDCG) for best service discovery.

Key words: service for Internet of Things (IoT), Biterm Topic Model (BTM), short text, topic modeling, service discovery

中图分类号: