《计算机应用》唯一官方网站 ›› 2023, Vol. 43 ›› Issue (7): 2271-2279.DOI: 10.11772/j.issn.1001-9081.2022060884

• 计算机软件技术 • 上一篇    

离散事件系统最优监督控制算法

胡瑜洪, 王德光(), 何家汉, 张志恒   

  1. 贵州大学 电气工程学院,贵阳 550025
  • 收稿日期:2022-06-20 修回日期:2022-08-04 接受日期:2022-08-11 发布日期:2022-08-26 出版日期:2023-07-10
  • 通讯作者: 王德光
  • 作者简介:胡瑜洪(1999—),男,广东揭阳人,硕士研究生,CCF会员,主要研究方向:监督控制理论、机器人路径规划和调度;
    王德光(1991—),男,山西侯马人,教授,博士,CCF会员,主要研究方向:监督控制理论、离散事件系统故障诊断、机器人路径规划和调度;
    何家汉(1998—),男,广东云浮人,硕士研究生,主要研究方向:离散事件系统故障诊断、不透明性验证;
    张志恒(1996—),男,安徽阜阳人,硕士研究生,主要研究方向:离散事件系统故障诊断、可诊断性验证。
  • 基金资助:
    贵州省省级科技计划项目(黔科合基础-ZK[2022]一般103);贵州大学科研基金资助项目(贵大特岗合字[2021]04号);贵州省教育厅创新群体项目(黔科合支撑[2021]012)

Optimal supervisory control algorithm of discrete-event systems

Yuhong HU, Deguang WANG(), Jiahan HE, Zhiheng ZHANG   

  1. The Electrical Engineering College,Guizhou University,Guiyang Guizhou 550025,China
  • Received:2022-06-20 Revised:2022-08-04 Accepted:2022-08-11 Online:2022-08-26 Published:2023-07-10
  • Contact: Deguang WANG
  • About author:HU Yuhong, born in 1999, M. S. candidate. His research interests include supervisory control theory, robot path planning and scheduling.
    WANG Deguang, born in 1991, Ph. D., professor. His research interests include supervisory control theory, fault diagnosis for discrete-event systems, robot path planning and scheduling.
    HE Jiahan, born in 1998, M. S. candidate. His research interests include fault diagnosis for discrete-event systems, verification of opacity.
    ZHANG Zhiheng, born in 1996, M. S. candidate. His research interests include fault diagnosis for discrete-event systems, diagnostic verification.
  • Supported by:
    Guizhou Provincial Science and Technology Program (QianKeHeJiChu-ZK [2022] YiBan 103), Scientific Research Fund of GuiZhou University(GuiDaTeGang [2021]04);Education Department of Guizhou Province Innovation Group Project(QianKeHeZhiCheng [2021]012)

摘要:

离散事件系统的监控器可以通过禁止可控事件来使系统满足安全性和活性规范。然而,监控器并不对允许发生的可控事件主动进行选择,所以存在同时允许多个可控事件发生的情况。但在实际应用中,如交通调度、机器人路径规划中,要求系统在每个状态下最多只允许一个可控事件的发生。针对上述问题,引入一种最优机制来量化控制成本,并提出一种离散事件系统最优监督控制算法,以确保系统的安全性和活性,并使事件执行累计的成本最小。首先,给定受控系统和行为约束的自动机模型,并基于Ramadge和Wonham的监督控制理论求解出无阻塞和行为最大许可的监控器;其次,通过定义的成本函数为监控器中每个事件的执行赋予相应成本;最后,利用动态规划思想迭代计算求解出最优定向监控器,从而实现每个状态下最多发生一个可控事件和事件执行累计的成本最小的目标。使用单向列车导轨案例和多轨道列车控制案例来验证所提算法的有效性和正确性。对于上述两个案例,所提算法求解的定向监控器到达目标状态所需的事件执行累计的成本分别为26.0和14.0,低于贪心算法的27.5和16.0,以及Q-learning算法的26.5和14.0。

关键词: 离散事件系统, 监督控制, 最优定向监控器, 成本函数, 交通系统调度

Abstract:

A supervisor of a discrete-event system can prohibit controllable events to ensure the safety and liveness specifications of the system. However, the supervisor does not actively select the controllable events that are allowed to occur, so it is possible that several controllable events occur simultaneously. In practice, such as traffic scheduling and robot path planning, the system is required to allow at most one controllable event to occur in each state. In response to the above problem, an optimal mechanism was introduced to quantify control cost, and an optimal supervisory control algorithm of discrete-event systems was proposed, which not only can guarantee the safety and liveness of the system, but also can minimize the cumulative cost of event execution. Firstly, the automata model of controlled system and behavioral constraints was given, and a nonblocking supervisor with maximum allowable behaviors was solved on the basis of the supervisory control theory of Ramadge and Wonham. Secondly, a cost function was defined to assign the corresponding cost to the execution of each event in the supervisor. Finally, an optimal directed supervisor was calculated iteratively based on dynamic programming to achieve the goals of at most one controllable event occurring in each state and minimizing the cumulative cost of event execution. To verify the effectiveness and correctness of the proposed algorithm, a one-way train guideway example and a multi-track train control example were used. For the above two examples, the cumulative cost of the event execution required for the directed supervisor solved by the proposed algorithm to reach the target state is 26.0 and 14.0 respectively, which is lower than the 27.5 and 16.0 of greedy algorithm and the 26.5 and 14.0 of Q-learning.

Key words: discrete-event system, supervisory control, optimal directed supervisor, cost function, traffic system scheduling

中图分类号: