Journal of Computer Applications, 2023, Vol. 43, Issue (12): 3740-3746. DOI: 10.11772/j.issn.1001-9081.2022121828

Special Issue: Data science and technology

Contrast order-preserving pattern mining algorithm

Yufei MENG1, Youxi WU1, Zhen WANG1, Yan LI2

  1. School of Artificial Intelligence, Hebei University of Technology, Tianjin 300401, China
  2. School of Economics and Management, Hebei University of Technology, Tianjin 300401, China
  • Received: 2022-12-09 Revised: 2023-02-24 Accepted: 2023-02-28 Online: 2023-03-06 Published: 2023-12-10
  • Contact: Youxi WU
  • About author:MENG Yufei, born in 1995, M. S. candidate. Her research interests include data mining.
    WANG Zhen, born in 1997, M. S. candidate. Her research interests include data mining.
    LI Yan, born in 1975, Ph. D., associate professor. Her research interests include data mining, supply chain management.
  • Supported by:
    Natural Science Foundation of Hebei Province (F20202013)


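Yufei MENG1, Youxi WU1, Zhen WANG1, Yan LI2

  1. School of Artificial Intelligence and Data Science, Hebei University of Technology, Tianjin 300401, China
  2. School of Economics and Management, Hebei University of Technology, Tianjin 300401, China
  • Contact: Youxi WU
  • About author: MENG Yufei (born 1995), female, from Shijiazhuang, Hebei, M. S. candidate. Her main research interest is data mining.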


Existing contrast sequential pattern mining methods mainly target character sequence datasets and are difficult to apply to time series datasets. To address this problem, a Contrast Order-preserving Pattern Mining (COPM) algorithm was proposed. First, in the candidate pattern generation stage, a pattern fusion strategy was used to reduce the number of candidate patterns. Then, in the support calculation stage, the support of each super-pattern was calculated from the matching results of its sub-patterns. Finally, a dynamic pruning strategy for the minimum support threshold was designed to prune the candidate patterns further. Experimental results on six real time series datasets show that the memory consumption of COPM is at least 52.1% lower than that of COPM-o (COPM-original), 36.8% lower than that of COPM-e (COPM-enumeration), and 63.6% lower than that of COPM-p (COPM-prune). The running time of COPM is at least 30.3% lower than that of COPM-o, 8.8% lower than that of COPM-e, and 41.2% lower than that of COPM-p. COPM therefore outperforms COPM-o, COPM-e, and COPM-p in both memory consumption and running time. The results also verify that COPM can effectively mine contrast order-preserving patterns and thereby reveal the differences between classes of time series datasets.
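To make the notion of a contrast order-preserving pattern concrete, the following is a minimal brute-force sketch, not the COPM algorithm itself. It assumes that a pattern is the tuple of ranks of the values in a contiguous window, that support is the number of windows whose ranks match the pattern, and that a pattern is "contrast" when its total support reaches a minimum in one class and stays at or below a maximum in the other. The function names and the thresholds `min_pos` and `max_neg` are hypothetical; the abstract does not specify them, and the fusion, sub-pattern reuse, and dynamic pruning strategies are not reproduced here.

```python
from typing import Sequence, Tuple


def relative_order(window: Sequence[float]) -> Tuple[int, ...]:
    """Return the rank (1 = smallest) of each value in the window.

    Assumption: ties are broken by position, because sorted() is stable.
    """
    order = sorted(range(len(window)), key=lambda i: window[i])
    ranks = [0] * len(window)
    for rank, idx in enumerate(order, start=1):
        ranks[idx] = rank
    return tuple(ranks)


def support(series: Sequence[float], pattern: Tuple[int, ...]) -> int:
    """Count the contiguous windows of the series whose relative order equals the pattern."""
    m = len(pattern)
    return sum(
        1
        for i in range(len(series) - m + 1)
        if relative_order(series[i:i + m]) == pattern
    )


def is_contrast(
    pattern: Tuple[int, ...],
    positive: Sequence[Sequence[float]],
    negative: Sequence[Sequence[float]],
    min_pos: int,
    max_neg: int,
) -> bool:
    """Hypothetical contrast test: frequent in the positive class, rare in the negative class."""
    pos_sup = sum(support(s, pattern) for s in positive)
    neg_sup = sum(support(s, pattern) for s in negative)
    return pos_sup >= min_pos and neg_sup <= max_neg


# Usage: an "up then partly down" shape occurs in the zigzag series only.
zigzag = [1, 3, 2, 5, 4, 6]
rising = [1, 2, 3, 4, 5, 6]
print(support(zigzag, (1, 3, 2)))
print(is_contrast((1, 3, 2), [zigzag], [rising], min_pos=2, max_neg=0))
```

This sketch recomputes ranks for every window, so each support query costs roughly O(n · m log m). The strategies named in the abstract are aimed at avoiding this kind of repeated work.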

Key words: pattern mining, sequential pattern mining, time series, contrast pattern, order-preserving pattern



