Journal of Computer Applications ›› 2024, Vol. 44 ›› Issue (10): 3167-3176.DOI: 10.11772/j.issn.1001-9081.2023101460
• Computer software technology • Previous Articles Next Articles
Hua HUANG1, Ziyi YANG1, Xiaolong LI1,2(), Chuang LI1
Received:
2023-10-27
Revised:
2023-12-05
Accepted:
2023-12-15
Online:
2024-10-15
Published:
2024-10-10
Contact:
Xiaolong LI
About author:
HUANG Hua, born in 1981, Ph.D., associate professor. His research interests include cloud computing, service computing, artificial intelligence, blockchain.Supported by:
通讯作者:
李小龙
作者简介:
黄华(1981—),男,湖南衡阳人,副教授,博士,CCF会员,主要研究方向:云计算、服务计算、人工智能、区块链基金资助:
CLC Number:
Hua HUANG, Ziyi YANG, Xiaolong LI, Chuang LI. Predictive business process monitoring method based on concept drift[J]. Journal of Computer Applications, 2024, 44(10): 3167-3176.
黄华, 杨子仪, 李小龙, 李闯. 基于概念漂移的预测性业务流程监控方法[J]. 《计算机应用》唯一官方网站, 2024, 44(10): 3167-3176.
Add to citation manager EndNote|Ris|BibTeX
URL: https://www.joca.cn/EN/10.11772/j.issn.1001-9081.2023101460
事件日志 | 案例数 | 事件数 | 活动数 | 案例属性数 |
---|---|---|---|---|
许可 | 7 065 | 86 581 | 51 | 168 |
国际申报 | 6 449 | 72 151 | 34 | 18 |
国内申报 | 10 500 | 56 437 | 17 | 5 |
预付差旅费用 | 2 099 | 18 246 | 29 | 17 |
请求付款 | 6 886 | 36 796 | 19 | 9 |
Tab. 1 Metadata for different logs
事件日志 | 案例数 | 事件数 | 活动数 | 案例属性数 |
---|---|---|---|---|
许可 | 7 065 | 86 581 | 51 | 168 |
国际申报 | 6 449 | 72 151 | 34 | 18 |
国内申报 | 10 500 | 56 437 | 17 | 5 |
预付差旅费用 | 2 099 | 18 246 | 29 | 17 |
请求付款 | 6 886 | 36 796 | 19 | 9 |
事件日志 | 日志解析 |
---|---|
Declaration APPROVED by ADMINISTRATION | Declaration APPROVED |
Declaration FINAL APPROVED by DIRECTOR | Declaration FINAL APPROVED |
Declaration FOR APPROVAL by ADMINISTRATION | Declaration FOR APPROVAL |
Declaration REJECTED by ADMINISTRATION | Declaration REJECTED |
Tab. 2 Examples of event log parsing
事件日志 | 日志解析 |
---|---|
Declaration APPROVED by ADMINISTRATION | Declaration APPROVED |
Declaration FINAL APPROVED by DIRECTOR | Declaration FINAL APPROVED |
Declaration FOR APPROVAL by ADMINISTRATION | Declaration FOR APPROVAL |
Declaration REJECTED by ADMINISTRATION | Declaration REJECTED |
事件日志 | 活动数 | 资源数 | 轨迹数 | 事件数 | 轨迹变化数 |
---|---|---|---|---|---|
BPI20T | 51 | 2 | 86 581 | 7 065 | 1 478 |
BPI20I | 34 | 2 | 72 151 | 6 449 | 753 |
BPI20D | 17 | 2 | 56 437 | 5 | 10 500 |
BPI20P | 29 | 2 | 18 246 | 2 099 | 202 |
BPI20R | 19 | 2 | 36 796 | 6 886 | 89 |
BPI12L | 36 | 69 | 262 200 | 13 087 | 4 366 |
Tab. 3 Characteristics of event logs used in experiments
事件日志 | 活动数 | 资源数 | 轨迹数 | 事件数 | 轨迹变化数 |
---|---|---|---|---|---|
BPI20T | 51 | 2 | 86 581 | 7 065 | 1 478 |
BPI20I | 34 | 2 | 72 151 | 6 449 | 753 |
BPI20D | 17 | 2 | 56 437 | 5 | 10 500 |
BPI20P | 29 | 2 | 18 246 | 2 099 | 202 |
BPI20R | 19 | 2 | 36 796 | 6 886 | 89 |
BPI12L | 36 | 69 | 262 200 | 13 087 | 4 366 |
事件日志 | SVM | LR | RF | LSTM | Bi-LSTM | Att-Bi-LSTM |
---|---|---|---|---|---|---|
平均值 | 0.67 | 0.66 | 0.63 | 0.69 | 0.74 | 0.78 |
BPI20T | 0.58 | 0.56 | 0.54 | 0.62 | 0.63 | 0.79 |
BPI20I | 0.72 | 0.69 | 0.73 | 0.77 | 0.80 | 0.83 |
BPI20D | 0.77 | 0.73 | 0.67 | 0.79 | 0.83 | 0.85 |
BPI20P | 0.80 | 0.79 | 0.77 | 0.83 | 0.85 | 0.88 |
BPI20R | 0.63 | 0.60 | 0.58 | 0.66 | 0.71 | 0.72 |
BPI12A | 0.64 | 0.61 | 0.58 | 0.60 | 0.70 | 0.73 |
BPI12J | 0.60 | 0.59 | 0.53 | 0.62 | 0.65 | 0.68 |
BPI12C | 0.64 | 0.67 | 0.60 | 0.65 | 0.71 | 0.72 |
Tab. 4 Comparison of overall AUC of different process result prediction models on different datasets
事件日志 | SVM | LR | RF | LSTM | Bi-LSTM | Att-Bi-LSTM |
---|---|---|---|---|---|---|
平均值 | 0.67 | 0.66 | 0.63 | 0.69 | 0.74 | 0.78 |
BPI20T | 0.58 | 0.56 | 0.54 | 0.62 | 0.63 | 0.79 |
BPI20I | 0.72 | 0.69 | 0.73 | 0.77 | 0.80 | 0.83 |
BPI20D | 0.77 | 0.73 | 0.67 | 0.79 | 0.83 | 0.85 |
BPI20P | 0.80 | 0.79 | 0.77 | 0.83 | 0.85 | 0.88 |
BPI20R | 0.63 | 0.60 | 0.58 | 0.66 | 0.71 | 0.72 |
BPI12A | 0.64 | 0.61 | 0.58 | 0.60 | 0.70 | 0.73 |
BPI12J | 0.60 | 0.59 | 0.53 | 0.62 | 0.65 | 0.68 |
BPI12C | 0.64 | 0.67 | 0.60 | 0.65 | 0.71 | 0.72 |
事件日志 | SVM | LR | RF | LSTM | Bi-LSTM | Att-Bi-LSTM |
---|---|---|---|---|---|---|
平均值 | 23 | 23 | 23 | 23 | 23 | 20 |
BPI20R | 31 | 30 | 24 | 31 | 30 | 22 |
BPI12A | 17 | 17 | 16 | 17 | 17 | 16 |
BPI12J | 27 | 28 | 34 | 27 | 27 | 25 |
BPI12C | 16 | 15 | 16 | 16 | 16 | 15 |
Tab. 5 Earliness comparison of different process result prediction models
事件日志 | SVM | LR | RF | LSTM | Bi-LSTM | Att-Bi-LSTM |
---|---|---|---|---|---|---|
平均值 | 23 | 23 | 23 | 23 | 23 | 20 |
BPI20R | 31 | 30 | 24 | 31 | 30 | 22 |
BPI12A | 17 | 17 | 16 | 17 | 17 | 16 |
BPI12J | 27 | 28 | 34 | 27 | 27 | 25 |
BPI12C | 16 | 15 | 16 | 16 | 16 | 15 |
事件日志 | SVM | LR | RF | LSTM | Bi-LSTM | Att-Bi-LSTM |
---|---|---|---|---|---|---|
平均值 | 11 621 | 4 306 | 14 716 | 11 587 | 12 167 | 11 532 |
BPI20T | 23 385 | 9 066 | 28 943 | 23 014 | 24 821 | 21 605 |
BPI20I | 19 463 | 8 170 | 21 934 | 19 782 | 20 694 | 19 617 |
BPI20D | 483 | 57 | 74 | 163 | 107 | 736 |
BPI20P | 16 801 | 6 711 | 18 006 | 15 097 | 15 826 | 14 357 |
BPI20R | 21 859 | 8 539 | 28 412 | 25 493 | 27 845 | 21 375 |
BPI12A | 6 890 | 1 164 | 7 732 | 7 032 | 3 659 | 7 365 |
BPI12J | 1 907 | 545 | 7 954 | 974 | 974 | 1 237 |
BPI12C | 2 183 | 196 | 4 673 | 1 137 | 3 407 | 5 964 |
Tab. 6 Offline time of training different models on each dataset
事件日志 | SVM | LR | RF | LSTM | Bi-LSTM | Att-Bi-LSTM |
---|---|---|---|---|---|---|
平均值 | 11 621 | 4 306 | 14 716 | 11 587 | 12 167 | 11 532 |
BPI20T | 23 385 | 9 066 | 28 943 | 23 014 | 24 821 | 21 605 |
BPI20I | 19 463 | 8 170 | 21 934 | 19 782 | 20 694 | 19 617 |
BPI20D | 483 | 57 | 74 | 163 | 107 | 736 |
BPI20P | 16 801 | 6 711 | 18 006 | 15 097 | 15 826 | 14 357 |
BPI20R | 21 859 | 8 539 | 28 412 | 25 493 | 27 845 | 21 375 |
BPI12A | 6 890 | 1 164 | 7 732 | 7 032 | 3 659 | 7 365 |
BPI12J | 1 907 | 545 | 7 954 | 974 | 974 | 1 237 |
BPI12C | 2 183 | 196 | 4 673 | 1 137 | 3 407 | 5 964 |
事件日志 | SVM | LR | RF | LSTM | Bi-LSTM | Att-Bi-LSTM |
---|---|---|---|---|---|---|
平均值 | 20 | 17 | 22 | 2 | 2 | 4 |
BPI20T | 21 | 15 | 23 | 2 | 3 | 3 |
BPI20I | 20 | 17 | 22 | 3 | 2 | 2 |
BPI20D | 37 | 32 | 41 | 1 | 1 | 2 |
BPI20P | 15 | 14 | 19 | 1 | 2 | 3 |
BPI20R | 30 | 26 | 34 | 2 | 3 | 5 |
BPI12A | 10 | 9 | 11 | 4 | 3 | 9 |
BPI12J | 12 | 13 | 12 | 1 | 1 | 3 |
BPI12C | 12 | 11 | 12 | 3 | 3 | 2 |
Tab. 7 Online prediction time of different models trained on each dataset
事件日志 | SVM | LR | RF | LSTM | Bi-LSTM | Att-Bi-LSTM |
---|---|---|---|---|---|---|
平均值 | 20 | 17 | 22 | 2 | 2 | 4 |
BPI20T | 21 | 15 | 23 | 2 | 3 | 3 |
BPI20I | 20 | 17 | 22 | 3 | 2 | 2 |
BPI20D | 37 | 32 | 41 | 1 | 1 | 2 |
BPI20P | 15 | 14 | 19 | 1 | 2 | 3 |
BPI20R | 30 | 26 | 34 | 2 | 3 | 5 |
BPI12A | 10 | 9 | 11 | 4 | 3 | 9 |
BPI12J | 12 | 13 | 12 | 1 | 1 | 3 |
BPI12C | 12 | 11 | 12 | 3 | 3 | 2 |
1 | VIRIYASITAVAT W, XU L D, BI Z, et al. Blockchain-based Business Process Management (BPM) framework for service composition in Industry 4.0[J]. Journal of Intelligent Manufacturing, 2020, 31(7): 1737-1748. |
2 | DIJKMAN R, TURETKEN O, VAN IJZENDOORN G R, et al. Business processes exceptions in relation to operational performance[J]. Business Process Management Journal, 2019, 25(5): 908-922. |
3 | KIR H, ERDOGAN N. A knowledge-intensive adaptive business process management framework[J]. Information Systems, 2021, 95: No.101639. |
4 | MÁRQUEZ-CHAMORRO A E, RESINAS M, RUIZ-CORTÉS A. Predictive monitoring of business processes: a survey[J]. IEEE Transactions on Services Computing, 2018, 11(6): 962-977. |
5 | CHEN J, JING H, CHANG Y, et al. Gated recurrent unit based recurrent neural network for remaining useful life prediction of nonlinear deterioration process[J]. Reliability Engineering and System Safety, 2019, 185: 372-382. |
6 | PASQUADIBISCEGLIE V, APPICE A, CASTELLANO G, et al. Using convolutional neural networks for predictive process analytics[C]// Proceedings of the 2019 International Conference on Process Mining. Piscataway: IEEE, 2019: 129-136. |
7 | TAX N, VERENICH I, LA ROSA M, et al. Predictive business process monitoring with LSTM neural networks[C]// Proceedings of the 2017 International Conference on Advanced Information Systems Engineering, LNCS 10253. Cham: Springer, 2017: 477-492. |
8 | MEHDIYEV N, EVERMANN J, FETTKE P. A multi-stage deep learning approach for business process event prediction[C]// Proceedings of the IEEE 19th Conference on Business Informatics — Volume 1. Piscataway: IEEE, 2017: 119-128. |
9 | BAYRAM F, AHMED B S, KASSLER A. From concept drift to model degradation: an overview on performance-aware drift detectors[J]. Knowledge-Based Systems, 2022, 245: No.108632. |
10 | DELGADO A, MAROTTA A, GONZÁLEZ L, et al. Towards a data science framework integrating process and data mining for organizational improvement[C]// Proceedings of the 15th International Conference on Software Technologies. Setúbal: SciTePress, 2020: 492-500. |
11 | SATO D M V, DE FREITAS S C, BARDDAL J P, et al. A survey on concept drift in process mining[J]. ACM Computing Surveys, 2021, 54(9): No.189. |
12 | LU J, LIU A, DONG F, et al. Learning under concept drift: a review[J]. IEEE Transactions on Knowledge and Data Engineering, 2019, 31(12): 2346-2363. |
13 | GAMA J, MEDAS P, CASTILLO G, et al. Learning with drift detection[C]// Proceedings of the 2004 Brazilian Symposium on Artificial Intelligence, LNCS 3171. Berlin: Springer, 2004: 286-295. |
14 | GU F, ZHANG G, LU J, et al. Concept drift detection based on equal density estimation[C]// Proceedings of the 2016 International Joint Conference on Neural Networks. Piscataway: IEEE, 2016: 24-30. |
15 | YU S, WANG X, PRÍNCIPE J C. Request-and-reverify: hierarchical hypothesis testing for concept drift detection with expensive labels[C]// Proceedings of the 27th International Joint Conference on Artificial Intelligence. California: ijcai.org, 2018:3033-3039. |
16 | RAMA-MANEIRO E, VIDAL J C, LAMA M. Deep learning for predictive business process monitoring: review and benchmark[J]. IEEE Transactions on Services Computing, 2023, 16(1): 739-756. |
17 | SCHÖNIG S, JASINSKI R, ACKERMANN L, et al. Deep learning process prediction with discrete and continuous data features[C]// Proceedings of the 13th International Conference on Evaluation of Novel Approaches to Software Engineering. Setúbal: SciTePress, 2018: 314-319. |
18 | LEONTJEVA A, CONFORTI R, DI FRANCESCOMARINO C, et al. Complex symbolic sequence encodings for predictive monitoring of business processes[C]// Proceedings of the 2015 International Conference on Business Process Management, LNCS 9253. Cham: Springer, 2015: 297-313. |
19 | 王娇娇,马小雨,刘畅,等. 基于XGBoost增量实现业务流程执行结果的预测性监控方法[J]. 计算机集成制造系统,2024,30(8):2756-2775. |
WANG J J, MA X Y, LIU C, et al. Incremental outcome-oriented predictive process monitoring based on XGBoost[J]. Computer Integrated Manufacturing Systems, 2024, 30(8): 2756-2775. | |
20 | YAO L, GUAN Y. An improved LSTM structure for natural language processing[C]// Proceedings of the 2018 IEEE International Conference of Safety Produce Informatization. Piscataway: IEEE, 2018: 565-569. |
21 | SHEWALKAR A, NYAVANANDI D, LUDWIG S A. Performance evaluation of deep neural networks applied to speech recognition: RNN, LSTM and GRU[J]. Journal of Artificial Intelligence and Soft Computing Research, 2019, 9(4): 235-245. |
22 | STAUDEMEYER R C, MORRIS E R. Understanding LSTM — a tutorial into long short-term memory recurrent neural networks[EB/OL]. (2019-09-12) [2013-09-23].. |
23 | 孙笑笑,侯文杰,应钰柯,等. 基于双层机器学习的业务流程剩余时间预测[J]. 计算机学报, 2021,44(11):2283-2294. |
SUN X X, HOU W J, YING Y K, et al. Business process remaining time prediction based on two-layer machine learning[J]. Chinese Journal of Computers, 2021, 44(11):2283-2294. | |
24 | 夏灿铭,邢玛丽,何胜煌. 基于XLNet的业务流程下一活动预测方法[J]. 计算机集成制造系统, 2023, 29(10):3496-3503. |
XIA C M, XING M L, HE S H. XLNet-based next activity prediction method of business process[J]. Computer Integrated Manufacturing Systems, 2023, 29(10):3496-3503. | |
25 | 郑婷婷,陈洁璇,许洋,等. 业务流程中一种个性化的任务完成时间预测方法[J]. 计算机集成制造系统, 2019, 25(4): 993-1000. |
ZHENG T T, CHEN J X, XU Y, et al. Approach for individual task completion time prediction in business processes[J]. Computer Integrated Manufacturing Systems, 2019, 25(4): 993-1000. | |
26 | TEINEMAA I, DUMAS M, LA ROSA M, et al. Outcome-oriented predictive process monitoring: review and benchmark[J]. ACM Transactions on Knowledge Discovery from Data, 2019, 13(2): No.17. |
27 | DU M, LI F, ZHENG G, et al. DeepLog: anomaly detection and diagnosis from system logs through deep learning[C]// Proceedings of the 2017 ACM SIGSAC Conference on Computer and Communications Security. New York: ACM, 2017: 1285-1298. |
28 | ATHIWARATKUN B, WILSON A G, ANANDKUMAR A. Probabilistic FastText for multi-sense word embeddings[C]// Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Stroudsburg: ACL, 2019: 1-11. |
29 | QAISER S, ALI R. Text mining: use of TF-IDF to examine the relevance of words to documents[J]. International Journal of Computer Applications, 2018, 181(1): 25-29. |
[1] | Jing QIN, Zhiguang QIN, Fali LI, Yueheng PENG. Diagnosis of major depressive disorder based on probabilistic sparse self-attention neural network [J]. Journal of Computer Applications, 2024, 44(9): 2970-2974. |
[2] | Liting LI, Bei HUA, Ruozhou HE, Kuang XU. Multivariate time series prediction model based on decoupled attention mechanism [J]. Journal of Computer Applications, 2024, 44(9): 2732-2738. |
[3] | Zhiqiang ZHAO, Peihong MA, Xinhong HEI. Crowd counting method based on dual attention mechanism [J]. Journal of Computer Applications, 2024, 44(9): 2886-2892. |
[4] | Kaipeng XUE, Tao XU, Chunjie LIAO. Multimodal sentiment analysis network with self-supervision and multi-layer cross attention [J]. Journal of Computer Applications, 2024, 44(8): 2387-2392. |
[5] | Pengqi GAO, Heming HUANG, Yonghong FAN. Fusion of coordinate and multi-head attention mechanisms for interactive speech emotion recognition [J]. Journal of Computer Applications, 2024, 44(8): 2400-2406. |
[6] | Zhonghua LI, Yunqi BAI, Xuejin WANG, Leilei HUANG, Chujun LIN, Shiyu LIAO. Low illumination face detection based on image enhancement [J]. Journal of Computer Applications, 2024, 44(8): 2588-2594. |
[7] | Shangbin MO, Wenjun WANG, Ling DONG, Shengxiang GAO, Zhengtao YU. Single-channel speech enhancement based on multi-channel information aggregation and collaborative decoding [J]. Journal of Computer Applications, 2024, 44(8): 2611-2617. |
[8] | Wu XIONG, Congjun CAO, Xuefang SONG, Yunlong SHAO, Xusheng WANG. Handwriting identification method based on multi-scale mixed domain attention mechanism [J]. Journal of Computer Applications, 2024, 44(7): 2225-2232. |
[9] | Huanhuan LI, Tianqiang HUANG, Xuemei DING, Haifeng LUO, Liqing HUANG. Public traffic demand prediction based on multi-scale spatial-temporal graph convolutional network [J]. Journal of Computer Applications, 2024, 44(7): 2065-2072. |
[10] | Dianhui MAO, Xuebo LI, Junling LIU, Denghui ZHANG, Wenjing YAN. Chinese entity and relation extraction model based on parallel heterogeneous graph and sequential attention mechanism [J]. Journal of Computer Applications, 2024, 44(7): 2018-2025. |
[11] | Li LIU, Haijin HOU, Anhong WANG, Tao ZHANG. Generative data hiding algorithm based on multi-scale attention [J]. Journal of Computer Applications, 2024, 44(7): 2102-2109. |
[12] | Song XU, Wenbo ZHANG, Yifan WANG. Lightweight video salient object detection network based on spatiotemporal information [J]. Journal of Computer Applications, 2024, 44(7): 2192-2199. |
[13] | Dahai LI, Zhonghua WANG, Zhendong WANG. Dual-branch low-light image enhancement network combining spatial and frequency domain information [J]. Journal of Computer Applications, 2024, 44(7): 2175-2182. |
[14] | Wenliang WEI, Yangping WANG, Biao YUE, Anzheng WANG, Zhe ZHANG. Deep learning model for infrared and visible image fusion based on illumination weight allocation and attention [J]. Journal of Computer Applications, 2024, 44(7): 2183-2191. |
[15] | Xiaolu WANG, Wangfei QIAN. Gait recognition method based on two-branch convolutional network [J]. Journal of Computer Applications, 2024, 44(6): 1965-1971. |
Viewed | ||||||
Full text |
|
|||||
Abstract |
|
|||||