计算机应用 ›› 2015, Vol. 35 ›› Issue (8): 2158-2163.DOI: 10.11772/j.issn.1001-9081.2015.08.2158

• 先进计算 • 上一篇    下一篇

结合云存储的新一代博客系统架构

张宝军1,2, 潘瑞芳1,2   

  1. 1. 浙江传媒学院 新媒体学院, 杭州 310018;
    2. 浙江省媒体传播优化协同创新中心, 杭州 310018
  • 收稿日期:2015-03-05 修回日期:2015-04-10 出版日期:2015-08-10 发布日期:2015-08-14
  • 通讯作者: 张宝军(1977-),男,湖北荆门人,讲师,博士,CCF会员,主要研究方向:计算机网络、体系结构,zbjhover@zju.edu.cn
  • 作者简介:潘瑞芳(1959-),女,江西赣州人,教授,主要研究方向:数字媒体、数据库、计算机网络。
  • 基金资助:

    浙江省重点科技创新团队项目(2011R50019);国家新闻出版广电总局科研项目(2014-42)。

Architecture of new generation blog system with cloud storage

ZHANG Baojun1,2, PAN Ruifang1,2   

  1. 1. School of New Media, Zhejiang University of Media and Communications, Hangzhou Zhejiang 310018, China;
    2. Zhejiang Province Media Optimization of Collaborative Innovation Center, Hangzhou Zhejiang 310018, China
  • Received:2015-03-05 Revised:2015-04-10 Online:2015-08-10 Published:2015-08-14

摘要:

为解决新一代博客系统海量信息的存储问题,结合云存储技术,提出了一种新的博客系统架构BlogCloud。该架构以分布式存储技术为核心,避免了集中式存储的性能瓶颈问题,可扩展性高;采用半分布式P2P网络拓扑结构,能快速定位网络中的存储资源;只将稳定节点作为存储节点,避免了不稳定节点带来的网络波动问题;遵循就近存储原则,同时在客户端缓存文件,减少了网络传输;允许用户自定义文件分块的大小,对大的文件可分块并行传输,提高了文件传输的速度,对小的文件则不用分块,节省了文件分块、合并的开销;具备数据冗余备份功能,在网络中多个存储节点保留文件副本,并实行异地备份,增强了数据的安全性和可靠性。在虚拟机上对BlogCloud和ZSWIN博客系统进行比较测试,结果显示:BlogCloud的吞吐量明显高于ZSWIN;将不稳定节点作为存储节点会降低BlogCloud的性能;在存储节点和索引节点减少的情况下BlogCloud仍然能够稳定运行,可靠性较高。结果表明,BlogCloud架构能够满足新一代博客系统的存储要求。

关键词: 云存储, 博客, 海量信息, 对等网络, 半分布式

Abstract:

To solve the storage problem of massive information for new generation blog system, combined with the cloud storage, a new blog system architecture named BlogCloud was proposed. By using the distributed storage technology, BlogCloud avoided the bottleneck problem in centralized storage, and had high scalability. With semi-distributed Peer-to-Peer (P2P) architecture, it located storage resources quickly. To avoid the network flipping caused by the instable nodes, only the stable nodes were regarded as storage nodes. The rule of nearest storage was adopted and the data was kept in the client cache to reduce the network transmission. Users were allowed to customize the block size of file, it meant that the large files could be devided into blocks and transmitted parallel to enhance the transmission speed, while the small files did not need to be devided, which saved the cost for segmentation and combination. It also had functions of redundancy to enhance the security and reliability of the data, including storing copies of files in multi-storage nodes and remote backup. The comparison test between BlogCloud and ZSWIN was given on the virtual machine. The results show that the throughput of BlogCloud is higher than ZSWIN obviously; the performance of BlogCloud is degraded when using instable nodes as the storage nodes; BlogCloud has high reliability, it still can run stably when reducing the storage nodes and index nodes. The results verify that BlogCloud can satisfy the storage requirements for the new generation blog system.

Key words: cloud storage, blog, massive information, Peer-to-Peer (P2P), semi-distributed

中图分类号: