《计算机应用》唯一官方网站 ›› 2022, Vol. 42 ›› Issue (1): 198-208.DOI: 10.11772/j.issn.1001-9081.2021071291

• 先进计算 • 上一篇    

融合XGBoost和Multi-GRU的数据中心服务器能耗优化算法

申明尧, 韩萌(), 杜诗语, 孙蕊, 张春砚   

  1. 北方民族大学 计算机科学与工程学院,银川 750021
  • 收稿日期:2021-07-19 修回日期:2021-09-03 接受日期:2021-09-15 发布日期:2021-09-03 出版日期:2022-01-10
  • 通讯作者: 韩萌
  • 作者简介:申明尧(1994—),男,山东菏泽人,硕士研究生,CCF会员,主要研究方向:数据挖掘、人工智能
    韩萌(1982—),女,河南商丘人,副教授,博士,CCF会员,主要研究方向:数据挖掘
    杜诗语(1996—),女,辽宁抚顺人,硕士,主要研究方向:数据挖掘
    孙蕊(1993—),女,山东济宁人,硕士,主要研究方向:数据挖掘
    张春砚(1995—),女,河北张家口人,硕士研究生,主要研究方向:数据挖掘。
  • 基金资助:
    国家自然科学基金资助项目(62062004);宁夏自然科学基金资助项目(2020AAC03216)

Data center server energy consumption optimization algorithm combining XGBoost and Multi-GRU

Mingyao SHEN, Meng HAN(), Shiyu DU, Rui SUN, Chunyan ZHANG   

  1. School of Computer Science and Engineering,North Minzu University,Yinchuan Ningxia 750021,China
  • Received:2021-07-19 Revised:2021-09-03 Accepted:2021-09-15 Online:2021-09-03 Published:2022-01-10
  • Contact: Meng HAN
  • About author:SHEN Mingyao, born in 1994, M. S. candidate. His research interests include data mining, artificial intelligence.
    HAN Meng, born in 1982, Ph. D., associate professor. Her research interests include data mining.
    DU Shiyu, born in 1996, M. S. Her research interests include data mining.
    SUN Rui, born in 1993, M. S. Her research interests include data mining.
    ZHANG Chunyan, born in 1995, M. S. candidate. Her research interests include data mining.
  • Supported by:
    National Natural Science Foundation of China(62062004);Natural Science Foundation of Ningxia(2020AAC03216)

摘要:

随着云计算技术的快速发展,数据中心的数量大幅增加,随之而来的能源消耗问题逐渐成为一个研究热点。针对服务器能耗优化问题,提出了一种融合极限梯度提升(XGBoost)和多个门控循环单元(Multi-GRU)的数据中心服务器能耗优化(ECOXG)算法。首先利用Linux终端监控命令和功耗仪收集服务器各部件的资源占用信息和能耗等数据,并对其进行数据预处理来得到资源利用率。其次将资源利用率串联构造成向量形式的时间序列,用其训练Multi-GRU负载预测模型,并根据预测结果对服务器进行模拟降频,以得到降频后的负载数据。然后将服务器的资源利用率与相同时刻的能耗数据相结合,并用其训练XGBoost能耗预测模型。最后将降频后的负载数据输入到训练后的XGBoost模型中,从而预测出降频后的服务器能耗。在6台物理服务器实际资源利用率数据上的实验表明,与卷积神经网络(CNN)、长短期记忆(LSTM)网络、CNN-GRU和CNN-LSTM模型相比,ECOXG算法在均方根误差(RMSE)上分别降低了50.9%、31.0%、32.7%、22.9%;同时,与LSTM、CNN-GRU和CNN-LSTM模型相比,ECOXG算法在训练时间上分别节省了43.2%、47.1%、59.9%。实验结果表明,ECOXG算法能够在服务器能耗预测和能耗优化方面提供一定的理论基础,且在准确性和运行效率方面明显优于对比算法。此外,模拟降频后的服务器能耗已明显低于真实能耗,且在服务器的利用率较低时降耗效果显著。

关键词: 数据中心, 能耗优化, 负载, 极限梯度提升, 多个门控循环单元

Abstract:

With the rapid development of cloud computing technology, the number of data centers have increased significantly, and the subsequent energy consumption problem gradually become one of the research hotspots. Aiming at the problem of server energy consumption optimization, a data center server energy consumption optimization combining eXtreme Gradient Boosting (XGBoost) and Multi-Gated Recurrent Unit (Multi-GRU) (ECOXG) algorithm was proposed. Firstly, the data such as resource occupation information and energy consumption of each component of the servers were collected by the Linux terminal monitoring commands and power consumption meters, and the data were preprocessed to obtain the resource utilization rates. Secondly, the resource utilization rates were constructed in series into a time series in vector form, which was used to train the Multi-GRU load prediction model, and the simulated frequency reduction was performed to the servers according to the prediction results to obtain the load data after frequency reduction. Thirdly, the resource utilization rates of the servers were combined with the energy consumption data at the same time to train the XGBoost energy consumption prediction model. Finally, the load data after frequency reduction were input into the trained XGBoost model, and the energy consumption of the servers after frequency reduction was predicted. Experiments on the actual resource utilization data of 6 physical servers showed that ECOXG algorithm had a Root Mean Square Error (RMSE) reduced by 50.9%, 31.0%, 32.7%, 22.9% compared with Convolutional Neural Network (CNN), Long Short-Term Memory (LSTM) network, CNN-GRU and CNN-LSTM models, respectively. Meanwhile, compared with LSTM, CNN-GRU and CNN-LSTM models, ECOXG algorithm saved 43.2%, 47.1%, 59.9% training time, respectively. Experimental results show that ECOXG algorithm can provide a theoretical basis for the prediction and optimization of server energy consumption optimization, and it is significantly better than the comparison algorithms in accuracy and operating efficiency. In addition, the power consumption of the server after the simulated frequency reduction is significantly lower than the real power consumption, and the effect of reducing energy consumption is outstanding when the utilization rates of the servers are low.

Key words: data center, energy consumption optimization, load, eXtreme Gradient Boosting (XGBoost), Multiple Gated Recurrent Units (Multi-GRU)

中图分类号: