计算机应用 ›› 2019, Vol. 39 ›› Issue (8): 2431-2437.DOI: 10.11772/j.issn.1001-9081.2019010073

• 应用前沿、交叉与综合 • 上一篇    下一篇

基于累计工作量的在线大数据分析作业调度算法

李叶飞1, 徐超2, 许道强3, 邹云峰2, 张晓达1, 钱柱中1   

  1. 1. 南京大学 计算机科学与技术系, 南京 210023;
    2. 国网江苏省电力有限公司 电力科学研究院, 南京 210000;
    3. 国网江苏省电力有限公司, 南京 210000
  • 收稿日期:2019-01-11 修回日期:2019-04-12 发布日期:2019-04-17 出版日期:2019-08-10
  • 通讯作者: 钱柱中
  • 作者简介:李叶飞(1979-),男,安徽池州人,工程师,硕士,主要研究方向:大数据处理;徐超(1989-),男,山东莱芜人,工程师,硕士,主要研究方向:大数据处理、信息安全;许道强(1978-),男,辽宁丹东人,高级工程师,主要研究方向:网络虚拟化、电力信息化;邹云峰(1977-),男,江西丰城人,高级工程师,硕士,主要研究方向:电力信息技术、大数据;张晓达(1991-),男,河北邢台人,博士研究生,主要研究方向:分布式大数据处理;钱柱中(1980-),男,江苏常熟人,副教授,博士,主要研究方向:分布式系统、数据中心网络。
  • 基金资助:
    国家自然科学基金项目(61472181);江苏省自然科学基金项目(BK20151392)。

Online task scheduling algorithm for big data analytics based on cumulative running work

LI Yefei1, XU Chao2, XU Daoqiang3, ZOU Yunfeng2, ZHANG Xiaoda1, QIAN Zhuzhong1   

  1. 1. Department of Computer Science and Technology, Nanjing University, Nanjing Jiangsu 210023, China;
    2. Electric Power Research Institute, State Grid Jiangsu Electric Power Company Limited, Nanjing Jiangsu 210000, China;
    3. State Grid Jiangsu Electric Power Company Limited, Nanjing Jiangsu 210000, China
  • Received:2019-01-11 Revised:2019-04-12 Online:2019-04-17 Published:2019-08-10
  • Supported by:
    This work is partially supported by the Natural Science Foundation of China (61472181), the Natural Science Foundation of Jiangsu Province (BK20151392).

摘要: 针对Hadoop和Spark等大数据分析系统中无先验知识任务的高效执行问题,设计了基于累计工作量(CRW)的任务调度器CRWScheduler。该调度器根据CRW将任务在低权重队列与高权重队列间切换;在为作业分配资源时,同时考虑到作业所在的队列和其瞬时占用资源量,无需作业先验知识即显著提升系统性能。基于Apache Hadoop YARN实现了CRWScheduler原型,在28个节点的基准测试集群上的实验表明,与YARN的公平调度机制相比,作业流时间(JFT)平均降低21%,其中95百分位的作业流时间(JFT)最多降低了35%,并且在与任务级调度程序协作时可获得进一步的性能提升。

关键词: 数据分析系统, 作业流时间, 公平性, 饥饿避免

Abstract: A Cumulative Running Work (CRW) based task scheduler CRWScheduler was proposed to effectively process tasks without any prior knowledge for big data analytics platform like Hadoop and Spark. The running job was moved from a low-weight queue to a high-weight one based on CRW. When resources were allocated to a job, both the queue of the job and the instantaneous resource utilization of the job were considered, significantly improving the overall system performance without prior knowledge. The prototype of CRWScheduler was implemented based on Apache Hadoop YARN. Experimental results on 28-node benchmark testing cluster show that CRWScheduler reduces average Job Flow Time (JFT) by 21% and decreases JFT of 95th percentile by up to 35% compared with YARN fair scheduler. Further improvements can be obtained when CRWScheduler cooperates with task-level schedulers.

Key words: data analytics system, Job Flow Time (JFT), fairness, starvation-free

中图分类号: