Journal of Computer Applications ›› 2010, Vol. 30 ›› Issue (9): 2335-2338.

• Database and knowledge engineering • Previous Articles     Next Articles

Multi-keyword search over P2P based on Bloom filter

  

  • Received:2010-03-17 Revised:2010-05-12 Online:2010-09-03 Published:2010-09-01

基于Bloom滤波器的对等网多关键字检索

严华云1,关佶红2   

  1. 1. 湖州师范学院信息工程学院
    2.
  • 通讯作者: 严华云
  • 基金资助:
    对等网络环境下文本检索技术研究

Abstract: In keyword search over Peer-to-Peer (P2P) based on standard Bloom Filter (BF), it is difficult to estimate the maximum number of the data sets because they are increasing continuously; hence, two problems show up: it is difficult to determine the upper value of the length of BF vector, and it cannot handle the multi-keyword search efficiently. To solve these problems, the authors proposed a new structure called Block Dynamic Bloom Filter (BDBF) which partitioned the keyword based on the frequency, and presented a new Top-k multi-keyword search model: the node sends the higher frequency DBF firstly, and then sends the secondary higher frequency DBF if need. The experimental results show that the proposed method can be applicable for the increasing data of index list, and it can also decrease the network's traffic and efficiently resolve the problem of Top-k query in multi-keyword search over P2P.

Key words: Peer-to-Peer (P2P) network, multi-keyword search, Bloom Filter (BF), Block Dynamic Bloom Filter (BDBF)

摘要: 现有基于Bloom滤波器(BF)的对等网(P2P)检索,由于索引表的不断增长且不能确定数据量的上限,存在两个问题:一是难以确定BF向量长度;二是不能高效处理P2P多关键字Top-k查询。提出了一种基于关键词频率进行分块的分块Dynamic Bloom Filter(BDBF)以解决上述问题;并给出了相应的P2P多关键字Top-k查询模型,即当节点传送BF时先传送高频DBF,如不能满足Top-k查询则继续传送次高频的BF。实验分析发现,该结构更能适应数据量的连续增长,降低网络传输流量,并能高效处理多关键字检索中的Top-k查询问题。

关键词: 对等网, 多关键字检索, Bloom滤波器, 分块动态Bloom滤波器

CLC Number: