Journal of Computer Applications ›› 2022, Vol. 42 ›› Issue (6): 1729-1747.DOI: 10.11772/j.issn.1001-9081.2021061392

• National Open Distributed and Parallel Computing Conference 2021 (DPCS 2021) • Previous Articles    

Research on Bloom filter: a survey

Wendi HUA1,2,3,4, Yuan GAO1,2,3,4, Meng LYU1,2,3,4, Ping XIE1,2,3,4()   

  1. 1.Computer College,Qinghai Normal University,Xining Qinghai 8100016,China
    2.Key Laboratory of Internet of Things of Qinghai Province,Xining Qinghai 810008,China
    3.State Key Laboratory of Tibetan Intelligent Information Processing and Application,Xining Qinghai 810018,China
    4.Institute of Plateau Science and Sustainability,Xining Qinghai 810016,China
  • Received:2021-08-04 Revised:2021-09-29 Accepted:2021-09-30 Online:2022-01-10 Published:2022-06-10
  • Contact: Ping XIE
  • About author:HUA Wendi,born in 1998,M. S. candidate. His research interests include computer architecture,mass storage system,embedded system.
    GAO Yuan,born in 1989,M. S. candidate. His research interests include network storage.
    LYU Meng,born in 1998,M. S. candidate. Her research interests include network storage.
  • Supported by:
    National Natural Science Foundation of China(61762075);Natural Science Foundation of Qinghai Province(2020-ZJ-926)

布隆过滤器研究综述

华文镝1,2,3,4, 高原1,2,3,4, 吕萌1,2,3,4, 谢平1,2,3,4()   

  1. 1.青海师范大学 计算机学院, 西宁 810016
    2.青海省物联网重点实验室, 西宁 810008
    3.省部共建藏语智能信息处理及应用国家重点实验室, 西宁 810008
    4.高原科学与可持续发展研究院, 西宁 810016
  • 通讯作者: 谢平
  • 作者简介:华文镝(1998—),男,辽宁抚顺人,硕士研究生,CCF会员,主要研究方向:计算机体系结构、大规模存储系统、嵌入式系统
    高原(1989—),男,山东泰安人,硕士研究生,CCF会员,主要研究方向:网络存储
    吕萌(1998—),女,山东青岛人,硕士研究生,CCF会员,主要研究方向:网络存储
  • 基金资助:
    国家自然科学基金资助项目(61762075);青海省自然科学基金资助项目(2020?ZJ?926)

Abstract:

Bloom Filter (BF) is a binary vector data structure based on hashing strategy. With the idea of sharing hash collisions, the characteristic of one-way misjudgment and the very small time complexity of constant query, BF is often used to represent membership and as an “accelerator” for membership query operations. As the best mathematical tool to solve the membership query problem in computer engineering, BF has been widely used and developed in network engineering, storage system, database, file system, distributed system and some other fields. In the past few years, in order to adapt to various hardware environments and application scenarios, a large number of variant optimization schemes of BF based on the ideas of changing structure and optimizing algorithm appeared. With the development of big data era, it has become an important direction of membership query to improve the characteristics and operation logic of BF.

Key words: Bloom Filter (BF), membership query, Approximate Membership Query (AMQ) structure, hashing strategy, False Positive Rate (FPR)

摘要:

布隆过滤器(BF)是一种基于哈希策略的二进制向量数据结构,凭借分摊哈希碰撞的思想、存在单向误判性的特点以及极小常数查询时间复杂度,常用于表示集合元素并作为进行集合元素查询操作的“加速器”。作为计算机工程中解决集合元素查询问题最好的数学工具,BF在网络工程、存储系统、数据库、文件系统、分布式系统等领域得到了广泛的应用和发展。近几年来,为了适用于各种硬件环境和应用场景,BF出现了大量基于改变结构、优化算法等思想的变种方案。随着大数据时代的发展,对BF自身特点和操作逻辑进行改进已经成为现有集合元素查询研究的一个重要方向。

关键词: 布隆过滤器, 集合元素查询, 近似成员查询结构, 哈希策略, 误判率

CLC Number: