Integrated scheduling optimization of multiple data centers based on deep reinforcement learning

  

  • Received: 2022-05-17  Revised: 2022-08-04  Online: 2022-09-23

FANG Heping1, LIU Shuguang1, RAN Yongyi2, ZHONG Kunhua1,3

  1. Chongqing Institute of Green and Intelligent Technology, Chinese Academy of Sciences
    2. Chongqing University of Posts and Telecommunications
    3. Chengdu Institute of Computer Applications, Chinese Academy of Sciences
  • Corresponding author: LIU Shuguang

Abstract: The purpose of a multiple-data-center task scheduling strategy is to allocate computing tasks to different servers in each data center so as to improve resource utilization and energy efficiency. A deep reinforcement learning-based integrated scheduling strategy for multiple data centers is proposed, consisting of two stages: data center selection and task allocation within the selected data center. In the data center selection stage, computing resources are pooled to raise overall utilization: first, a Deep Q Network with Prioritized Experience Replay (PER-DQN) is used to obtain the communication path to each data center in a network whose nodes are data centers; then the resource-use cost and the network-communication cost are computed, and the data center that minimizes the sum of the two costs is selected. In the task allocation stage, tasks are divided within the selected data center and added to the scheduling queue on a First-Come-First-Served (FCFS) basis; taking into account computing-device status and ambient temperature, a task assignment algorithm based on Double Deep Q Network (Double DQN) is then used to obtain the optimal assignment policy, which selects the server to execute each computing task, avoids hot spots, and reduces the energy consumed by cooling equipment. Experimental results show that the average total cost of the PER-DQN-based data center selection algorithm is 3.6% and 10% lower than that of the Computing Resource First (CRF) and Shortest Path First (SPF) path selection methods, respectively. Compared with Round-Robin scheduling (RR) and greedy scheduling (Greedy), the Double DQN-based task deployment algorithm reduces the average Power Usage Effectiveness (PUE) by 2.6% and 1.7%, respectively. The proposed strategy effectively reduces the total cost and data center energy consumption, enabling efficient operation of multiple data centers.
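The stage-one selection rule described in the abstract — choose the data center whose resource-use cost plus network-communication cost is smallest — can be sketched minimally as follows. The function name, cost values, and data-center ids are illustrative assumptions, not the paper's actual implementation; in the paper the communication cost would be derived from the path found by PER-DQN.

```python
def select_data_center(resource_cost, comm_cost):
    """Return the data center id whose total cost (resource-use cost plus
    network-communication cost) is smallest. Both arguments map
    data-center id -> cost."""
    return min(resource_cost, key=lambda dc: resource_cost[dc] + comm_cost[dc])

# Illustrative costs for three hypothetical data centers
resource_cost = {"dc_a": 4.0, "dc_b": 2.5, "dc_c": 3.0}
comm_cost     = {"dc_a": 1.0, "dc_b": 3.5, "dc_c": 1.5}

print(select_data_center(resource_cost, comm_cost))  # -> dc_c (3.0 + 1.5 = 4.5)
```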

Key words: deep reinforcement learning, multiple data centers, task scheduling, temperature aware, power usage effectiveness
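The stage-two assignment algorithm builds on Double DQN, whose defining idea is that the online network chooses the next action while a separate target network evaluates it, reducing the value over-estimation of vanilla DQN. The sketch below shows only that standard target computation; the Q-value arrays are illustrative stand-ins for real network outputs, and the state/reward design of the paper (device status, ambient temperature, PUE) is not modeled here.

```python
import numpy as np

def double_dqn_target(reward, gamma, q_online_next, q_target_next, done):
    """Compute the Double DQN learning target
    y = r + gamma * Q_target(s', argmax_a Q_online(s', a))."""
    if done:
        return reward
    best_action = int(np.argmax(q_online_next))        # action picked by the online net
    return reward + gamma * q_target_next[best_action]  # value given by the target net

# Example: the online net prefers action 1; the target net scores it 2.0
y = double_dqn_target(reward=1.0, gamma=0.9,
                      q_online_next=np.array([0.5, 1.2, 0.3]),
                      q_target_next=np.array([0.4, 2.0, 0.1]),
                      done=False)
print(y)  # 1.0 + 0.9 * 2.0 = 2.8
```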
