《计算机应用》唯一官方网站

• •    下一篇

改进时域卷积网络与多头自注意力的间歇过程质量预测模型

赵小强,柳勇勇,惠永永,刘凯   

  1. 兰州理工大学
  • 收稿日期:2024-07-08 修回日期:2024-09-27 发布日期:2024-11-19 出版日期:2024-11-19
  • 通讯作者: 柳勇勇
  • 基金资助:
    国家自然科学基金;甘肃省高校产业支撑计划项目;甘肃省工业过程先进控制重点实验室开放基金

Batch process quality prediction model using improved time-domain convolutional network with multi-head self-attention

  • Received:2024-07-08 Revised:2024-09-27 Online:2024-11-19 Published:2024-11-19

摘要: 为了提高时域卷积网络(Time-domain Convolutional Network, TCN)在批量大小变化时训练的稳定性,解决间歇过程质量预测在捕捉长期依赖性和全局关联性上存在不足而导致出现预测准确度不高的问题,文中提出了一种批量组规范化(Batch Group Normalization,BGN)和Mish激活函数改进残差结构的时域卷积网络与多头自注意力机制(Batch Group Normalization and Mish activation function-enhanced residual structure for a Time-domain Convolutional Network with Multi-head Self-attention, BMTCN-MHSA)的间歇过程质量预测模型。首先,将间歇过程的三维数据展开为二维矩阵形式,并对数据归一化处理,再引入奇异谱分析法(Singular Spectrum Analysis, SSA)分解重构。然后,在时域卷积的残差部分融入BGN降低网络模型在批量大小变化时的敏感度,引入Mish激活函数提高模型的泛化能力,利用多头自注意力机制对序列中不同位置的特征信息进行关联和权重分配,进一步提取序列中的关键特征信息和相互依赖关系,以更好地捕捉间歇过程的动态特征。最后,使用青霉素仿真实验数据进行实验验证,实验结果表明,相较于时域卷积网络, BMTCN-MHSA模型的平均绝对误差(Mean Absolute Error, MAE)降低了56.86%,均方根误差(Mean Squared Error, MSE)降低了48.80%,决定系数(R-Squared, R2)达到了99.48%,这表明文中所提模型提高了质量预测的准确度。

关键词: 间歇过程, 质量预测, 奇异谱分析法, 时域卷积网络, 多头自注意力机制

Abstract: To improve the stability of Time-domain Convolutional Network (TCN) training when batch size changes, and to address the issue of batch process quality prediction failing to capture long-term dependence and global correlation, which led to poor prediction accuracy, a model for batch process quality prediction was proposed. Batch Group Normalization (BGN) and Mish activation function were incorporated to enhance the residual structure of the time-domain convolutional network with a multi-head self-attention (BMTCN-MHSA) mechanism. Firstly, the three-dimensional data of the batch process was unfolded into a two-dimensional matrix form and normalized. Singular Spectrum Analysis (SSA) was then introduced to decompose the reconstruction. BGN was added to the residual part of the time-domain convolution to reduce the network model's sensitivity to batch size changes. The Mish activation function was introduced to improve the model's generalization ability, and the multiple self-attention mechanism was utilized to correlate and assign weights to feature information at different sequence locations, further extracting key feature information and interdependence relationships in the sequence to better capture the dynamic characteristics of the batch process. Finally, the experimental validation is carried out using penicillin simulation experimental data, and the experimental results show that compared with the time-domain convolutional network, the mean absolute error of the BMTCN-MHSA model is reduced by 56.86%, and the mean square error is reduced by 48.80%, and the r-squared reaches 99.48%, which suggests that the model proposed in this paper improves the accuracy of the quality prediction.

Key words: batch process, quality prediction, singular spectrum analysis, time-domain convolutional network, multi-head self-attention mechanism

中图分类号: