《计算机应用》唯一官方网站 ›› 2020, Vol. 40 ›› Issue (2): 316-320.DOI: 10.11772/j.issn.1001-9081.2019091635

• 2019年全国开放式分布与并行计算学术年会(DPCS 2019)论文 • 上一篇    下一篇

基于远程直接内存访问的高性能键值存储系统

王成1, 叶保留1(), 梅峰2, 卢文达2   

  1. 1.计算机软件新技术国家重点实验室(南京大学),南京 210023
    2.国网浙江省电力有限公司,杭州 310007
  • 收稿日期:2019-07-31 修回日期:2019-09-25 接受日期:2019-09-29 发布日期:2019-11-18 出版日期:2020-02-10
  • 通讯作者: 叶保留
  • 作者简介:王成(1994—),男,江苏泗洪人,硕士研究生,主要研究方向:高性能网络
    梅峰(1977—),男,浙江湖州人,高级工程师,硕士,主要研究方向:电力信息系统、大数据
    卢文达(1989—),男,吉林松原人,助理工程师,硕士,主要研究方向:数据挖掘、云计算。
  • 基金资助:
    国家重点研发计划项目(2018YFB1004704);国家自然科学基金资助项目(61832005);国家电网公司科技项目(52110418001M)

High performance key-value storage system based on remote direct memory access

Cheng WANG1, Baoliu YE1(), Feng MEI2, Wenda LU2   

  1. 1.State Key Laboratory for Novel Software Technology (Nanjing University),Nanjing Jiangsu 210023,China
    2.State Grid Zhejiang Electric Power Company Limited,Hangzhou Zhejiang 310007,China
  • Received:2019-07-31 Revised:2019-09-25 Accepted:2019-09-29 Online:2019-11-18 Published:2020-02-10
  • Contact: Baoliu YE
  • About author:WANG Cheng, born in 1994, M. S. candidate. His research interests include high-performance network.
    MEI Feng, born in 1977, M. S., senior engineer. His research interests include power information system, big data.
    LU Wenda, born in 1989, M. S., assistant engineer. His research interests include data mining, cloud computing.
  • Supported by:
    National Key Technology Research and Development Program(2018YFB1004704);the National Natural Science Foundation of China(61832005);the Science and Technology Project of State Grid Corporation of China(52110418001M)

摘要:

随着数据与系统规模的不断扩大,网络传输成为了键值存储系统的性能瓶颈。同时,远程直接内存访问(RDMA)技术能够支持高带宽和低时延的数据传输,为键值存储系统设计提供了新的思路。结合高性能网络中的RDMA技术,设计并实现了高性能、低CPU负载的键值存储系统Chequer;结合RDMA原语的特性,重新设计了键值存储系统的基本操作工作流程;并设计了基于线性探测的共享hash表,解决客户端缓存失效的问题以及提高hash命中率来减少客户端的读取轮数,进一步提高了系统的性能。在小规模集群上实现了Chequer系统,并通过实验验证了其性能。

关键词: 远程直接内存访问, 哈希表, 键值存储, 高性能网络

Abstract:

With the continuous increment of data and system size, network communication becomes a performance bottleneck of key-value storage systems. Meanwhile, Remote Direct Memory Access (RDMA) technique can support high bandwidth, low latency data transmission, which provides a new idea for designing key-value storage systems. Based on RDMA technique in the high performance network, a key-value storage system named Chequer with high performance and low CPU overhead was designed and implemented. By combining the characteristics of RDMA primitives, the basic operation workflow of key-value storage system was redesigned. And a linear probing based shared hash table was designed to reduce the number of client reading rounds by solving the problem of client cache invalidation as well as increasing the hash hit rate, which can further improve the performance of the system. The Chequer system was implemented on the small-scale cluster, and its performance was demonstrated by experiments.

Key words: Remote Direct Memory Access (RDMA), hash table, key-value storage, high performance network

中图分类号: