计算机应用 ›› 2015, Vol. 35 ›› Issue (4): 944-949.DOI: 10.11772/j.issn.1001-9081.2015.04.0944

• 先进计算 • 上一篇    下一篇

多租户集群中基于服务水平目标的调度机制

杜雄杰1, 王旻2, 汤学海2,3, 张章2   

  1. 1. 国家计算机网络应急技术处理协调中心, 北京 100029;
    2. 中国科学院 信息工程研究所, 北京 100093;
    3. 中国科学院大学, 北京 100049
  • 收稿日期:2014-11-13 修回日期:2015-01-13 出版日期:2015-04-10 发布日期:2015-04-08
  • 通讯作者: 王旻
  • 作者简介:杜雄杰(1986-),男,湖南双峰人,工程师,博士,主要研究方向:云计算、网络安全; 王旻(1989-),男,天津人,实习研究员,硕士,主要研究方向:分布式计算; 汤学海(1989-),男,湖南株洲人,硕士研究生,主要研究方向:云计算; 张章(1987-),男,湖北武汉人,研究实习员,硕士,主要研究方向:分布式系统。
  • 基金资助:

    国家自然科学基金资助项目(60903047);国家863计划项目(2012AA01A401);国家科技支撑计划项目(2012BAH37B02);中国科学院先导专项(XDA06030200)。

Scheduling mechanism based on service level objective in multi-tenant cluster

DU Xiongjie1, WANG Min2, TANG Xuehai2,3, ZHANG Zhang2   

  1. 1. National Computer Network Emergency Response Technical Team/Coordination Center of China, Beijing 100029, China;
    2. Institute of Information Engineering, Chinese Academy of Sciences, Beijing 100093, China;
    3. University of Chinese Academy of Sciences, Beijing 100049, China
  • Received:2014-11-13 Revised:2015-01-13 Online:2015-04-10 Published:2015-04-08

摘要:

针对多租户集群中无法保证作业服务水平目标(SLO)的问题,提出了一种多租户场景下基于SLO的调度机制,其中包括优先调度算法和资源抢占算法。优先调度算法区别考虑超额使用资源的租户和未超额使用资源的租户,赋予后者的作业更高的优先级,在此前提下选择紧急度最高的作业,优先为其分配资源;资源抢占算法在资源受限的情况下,选择紧急度超过阈值的作业实施资源抢占,并根据租户的资源使用情况,在相应的运行作业范围内选择紧急度最低的作业,抢占其资源。实验结果表明,与现有保证公平的多租户调度器Capacity Scheduler相比,该调度机制可以在兼顾作业执行效率和租户间公平的前提下,显著提高作业的截止时间保证率,从而保证业务的服务水平目标。

关键词: 云计算, 多租户, 服务水平目标, 集群调度, 公平

Abstract:

A scheduling mechanism based on Service Level Objective (SLO) in multi-tenant cluster, including a preference scheduling algorithm and a resource preemption algorithm, was proposed to solve the problem of the inability to guarantee the SLOs of jobs in multi-tenant clusters. The preference scheduling algorithm considered the users who overused resources above their quota and the users who did not, then assigned a higher priority to the jobs of the latter users, under this condition, the job with highest priority was preferentially allocated resources. When the resources was limited, the resource preemption algorithm preempted the resource for the jobs whose urgency was above the threshold, and chose the jobs with the lowest urgency in the corresponding range of the running jobs according to the resource usages of tenants.The experimental results show that, compared with the current multi-tenant scheduler named Capacity Scheduler, the proposed mechanism can significantly improve the deadline guarantee rate of jobs and SLO with guaranteeing the job execution efficiency and the equity among tenants at the same time.

Key words: cloud computing, multi-tenant, Service Level Objective (SLO), cluster scheduling, fairness

中图分类号: