Journal of Computer Applications ›› 2018, Vol. 38 ›› Issue (1): 50-55. DOI: 10.11772/j.issn.1001-9081.2017071836

• Papers from the 2017 National Annual Conference on Open Distributed and Parallel Computing (DPCS 2017) •

Load balancing mechanism for large-scale data access system

ZHOU Yue, CHEN Qingkui

  1. School of Optical-Electrical and Computer Engineering, University of Shanghai for Science and Technology, Shanghai 200093, China
  • Received: 2017-07-25 Revised: 2017-08-04 Online: 2018-01-10 Published: 2018-01-22
  • Corresponding author: CHEN Qingkui
  • About the authors: ZHOU Yue (born 1991), male, from Suqian, Jiangsu, master's student; research interests: distributed computing, Internet of Things. CHEN Qingkui (born 1966), male, from Harbin, Heilongjiang, professor, doctoral supervisor, Ph.D., CCF member; research interests: network computing, parallel computing, Internet of Things.
  • Supported by:
    This work is partially supported by the National Natural Science Foundation of China (60970012, 61572325), the Shanghai Key Science and Technology Project (14511107902, 16DZ1203603), the Shanghai Engineering Research Center Construction Project (GCZX14014), and the Shanghai Leading Academic Discipline Project (XTKX2012).

Abstract: Current load balancing algorithms for distributed systems have several problems: 1) the role of each node is fixed, so the system is not adaptive; 2) the algorithms are not general; 3) the load migration workload is large and the load balancing cycle is too long. To address these problems, a hybrid load balancing algorithm is proposed. First, a receiving model for distributed systems is designed that divides system tasks into three layers: a receiving layer, a processing layer, and a storage layer. The receiving layer uses a custom communication protocol to improve the reception performance of the system. Second, the load balancing algorithm adopts a random load migration strategy that migrates load tasks at random according to the load status of the nodes, which addresses both the long load balancing cycle and the problem of load migrating back. Finally, a distributed control node selection strategy makes the nodes in the system adaptive. Experimental results show that, with fewer than one million data sources, the average delay in each layer of the system is at the millisecond level and load balancing takes less than 3 min on average. The experiments demonstrate that the proposed mechanism has a short load balancing cycle and responds quickly to tasks, and that it can improve the reception performance of distributed systems.

Key words: large-scale data receiving system, distributed system, hybrid load balancing, random load migration, data access system
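
The abstract does not spell out the random load migration strategy, so the following is only a minimal sketch of the general idea, not the authors' algorithm. The function name `random_migrate`, the integer load units, and the thresholds `high`, `low`, and `step` are all assumptions made for illustration. Overloaded nodes hand fixed-size batches of load to randomly chosen lightly loaded nodes, and a receiver can never end up above `high`, so a batch never has to move back.

```python
import random


def random_migrate(loads, high=80, low=50, step=5, rng=None, max_moves=10_000):
    """Hypothetical sketch: shift load from overloaded nodes to random light nodes.

    loads: dict mapping node name -> integer load (e.g. percent of capacity).
    high:  a node above this value is treated as overloaded (assumed threshold).
    low:   a node below this value may receive load (assumed threshold).
    step:  size of one migrated batch (assumed unit).
    """
    if step > high - low:
        raise ValueError("step must not exceed high - low")
    rng = rng or random.Random()
    loads = dict(loads)          # work on a copy, leave the input untouched
    moves = []                   # record of (source, destination) pairs
    for _ in range(max_moves):
        overloaded = [n for n, v in loads.items() if v > high]
        # A receiver starts below `low`, and step <= high - low, so it stays
        # at or below `high` after the move and never becomes a sender itself.
        receivers = [n for n, v in loads.items() if v < low]
        if not overloaded or not receivers:
            break
        src = rng.choice(overloaded)
        dst = rng.choice(receivers)
        loads[src] -= step
        loads[dst] += step
        moves.append((src, dst))
    return loads, moves


if __name__ == "__main__":
    balanced, moves = random_migrate({"a": 95, "b": 20, "c": 30})
    print(balanced, len(moves))
```

Choosing the receiver at random avoids a global sort of node loads on every round, which is one plausible reason a random strategy could shorten the balancing cycle; whether the paper makes the same trade-off is not stated in the abstract.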

CLC number: