计算机应用 ›› 2012, Vol. 32 ›› Issue (03): 620-624.

• 先进计算 • 上一篇    下一篇

大规模云存储系统副本布局研究

董继光,陈卫卫,田浪军,吴海佳   

  1. 解放军理工大学 指挥自动化学院,南京 210007
  • 收稿日期:2011-09-19 修回日期:2011-11-16 发布日期:2012-03-01 出版日期:2012-03-01
  • 通讯作者: 董继光
  • 作者简介:董继光(1986-),男,河南周口人,硕士研究生,CCF会员,主要研究方向:分布式网络存储、云存储;陈卫卫(1967-),女,四川隆昌人,教授,CCF会员,主要研究方向:软件工程、云计算、分布式网络存储;田浪军(1985-),男,陕西渭南人,硕士研究生,CCF会员,主要研究方向:分布式数据库;吴海佳(1986-),男,江苏南通人,博士研究生,CCF会员,主要研究方向:分布式网络存储、分布式文件系统。
  • 基金资助:

    国家自然科学基金资助项目(60603029);国家863计划项目(2008AA01A309)。

Replica placement study in large-scale cloud storage system

DONG Ji-guang, CHEN Wei-wei, TIAN Lang-jun, WU Hai-jia   

  1. Institute of Command Automation, PLA University of Science and Technology, Nanjing Jiangsu 210007, China
  • Received:2011-09-19 Revised:2011-11-16 Online:2012-03-01 Published:2012-03-01

摘要: 在基于副本冗余机制的大规模云存储系统中,以往的副本布局算法只能部分地满足副本布局中高可靠、高可扩展并且高效的要求,针对这一问题,提出了一种基于分组和一致性哈希的副本布局算法。首先,将关联性高的存储节点进行分组;然后,通过改进的一致性哈希算法将同一数据对象的多个副本分别分配到不同分组中;最后,再通过改进的一致性哈希算法将分配到各分组的数据副本放置在组内对应的存储节点上。理论分析可知,该方法大大提高数据的可靠性。仿真结果表明,该算法能满足副本布局的均衡性、自适应性要求,并能在几十微秒内完成副本定位。

关键词: 云存储, 副本布局, 一致性哈希算法, 分组, 可靠性

Abstract: In the large-scale cloud storage system based on copy redundancy, previous layout algorithm can only partially meet the requirements of high reliability, high scalability and high efficiency in the replica layout. To solve this problem, this paper proposed a Replica Placement algorithm based on Grouping and Consistent Hashing (RPGCH). The storage nodes were classified into different groups by their correlativity, then the replicas of one object were assigned in different groups by consistent hashing algorithm, after that each replica was placed into corresponding storage node in the group by consistent Hashing algorithm. The theoretical analysis proves that the reliability of data is improved. The simulation results show that RPGCH can assign data evenly among storage nodes and adapt well with the changing scale of cloud storage system. Moreover, RPGCH is time efficient with little memory overhead.

Key words: cloud storage, replica layout, consistent hashing algorithm, grouping, reliability

中图分类号: