计算机应用 ›› 2015, Vol. 35 ›› Issue (12): 3392-3397.DOI: 10.11772/j.issn.1001-9081.2015.12.3392

• 先进计算 • 上一篇    下一篇

内存云分级存储架构下的数据迁移模型

郭刚1, 于炯1,2, 鲁亮2, 英昌甜2, 尹路通1   

  1. 1. 新疆大学软件学院, 乌鲁木齐 830008;
    2. 新疆大学信息科学与工程学院, 乌鲁木齐 830046
  • 收稿日期:2015-06-08 修回日期:2015-08-10 出版日期:2015-12-10 发布日期:2015-12-10
  • 通讯作者: 于炯(1964-),男,北京人,教授,博士生导师,博士,CCF高级会员,主要研究方向:网络安全、网格与分布式计算
  • 作者简介:郭刚(1990-),男,山东枣庄人,硕士研究生,CCF会员,主要研究方向:云计算、数据迁移;鲁亮(1990-),男,新疆乌鲁木齐人,博士研究生,CCF会员,主要研究方向:云计算、分布式计算;英昌甜(1989-),女,新疆乌鲁木齐人,博士研究生,CCF会员,主要研究方向:云计算、存储节能;尹路通(1992-),男,河南驻马店人,硕士研究生,主要研究方向:大数据、数据挖掘、推荐系统。
  • 基金资助:
    国家自然科学基金资助项目(61462079,61363083,61262088)。

Data migration model based on RAMCloud hierarchical storage architecture

GUO Gang1, YU Jiong1,2, LU Liang2, YING Changtian2, YIN Lutong1   

  1. 1. School of Software, Xinjiang University, Urumqi Xinjiang 830008, China;
    2. School of Information Science and Engineering, Xinjiang University, Urumqi Xinjiang 830046, China
  • Received:2015-06-08 Revised:2015-08-10 Online:2015-12-10 Published:2015-12-10

摘要: 为了实现在线海量数据的高效存储与访问,在内存云分级存储架构下,提出一种基于数据重要性的迁移模型(MMDS)。首先,通过数据本身的大小、时间重要性、用户访问总量等因素对数据本身的重要性进行计算;其次,采用推荐系统中相似用户和PageRank算法中的重要性排名思想对数据的潜在价值进行评估,数据重要性和潜在价值共同决定了数据的重要程度;然后基于数据的重要性,设计了数据迁移机制。实验结果表明:该模型能够识别出数据的重要程度并分级放置数据,相比最近最少使用(LRU)、最近最不常用(LFU)、基于价值评估的数据迁移(MSDV)等算法,提高了存储系统的数据访问命中率。该模型能够缓解部分存储压力,数据访问性能也有了一定的提高。

关键词: 分级存储, 内存云, 数据重要性, 数据迁移, 潜在价值

Abstract: In order to achieve the efficient storage and access to the huge amounts of data online, under the hierarchical storage architecture of memory cloud, a model of Migration Model based on Data Significance (MMDS) was proposed. Firstly, the importance of data itself was calculated based on factors of the size of the data itself, the importance of time, the total amount of user access, and so on. Secondly, the potential value of the data was evaluated by adopting users' similarity and the importance ranking of the PageRank algorithm in the recommendation system. The importance of the data was determined by the importance of data itself and its potential value together. Then, data migration mechanism was designed based on the importance of data, The experimental results show that, the proposed model can identify the importance of the data and place the data in a hierarchical way and improved the data access hit rate from the storage system compared with the algorithms of Least Recently Used (LRU), Least Frequently Used (LFU), Migration Strategy based on Data Value (MSDV). The proposed model can alleviate the part pressure of storage and has improved the data access performance.

Key words: Hierarchical Storage Management (HSM), RAMCloud, data importance, data migration, potential value

中图分类号: