Journal of Computer Applications ›› 2023, Vol. 43 ›› Issue (7): 2057-2064. DOI: 10.11772/j.issn.1001-9081.2022091365

Special Issue: The 39th CCF National Database Conference (NDBC 2022)

• The 39th CCF National Database Conference (NDBC 2022) •

PrivSPM: frequent sequential pattern mining algorithm under local differential privacy

Shuo HUANG, Yanhui LI, Jianqiu CAO

  1. School of Information Science and Engineering, Chongqing Jiaotong University, Chongqing 400074, China
  • Received: 2022-09-12 Revised: 2022-11-15 Accepted: 2022-11-21 Online: 2023-07-20 Published: 2023-07-10
  • Contact: Yanhui LI
  • About author: HUANG Shuo, born in 1998, a native of Luohe, Henan, M. S. candidate. His research interests include data privacy and differential privacy.
    LI Yanhui, born in 1989, Ph. D., lecturer. Her research interests include data privacy, differential privacy, and big data analysis.
    CAO Jianqiu, born in 1967, M. S., professor. His main research interests include graphics and image processing, information visualization, traffic informatization, and intelligent control.
  • Supported by:
    National Natural Science Foundation of China (62002036); Opening Project of Shanghai Key Laboratory of Integrated Administration Technologies for Information Security (AGK2020006); Natural Science Foundation of Chongqing (cstc2021jcyj-msxmX0859); Science and Technology Research Program of Chongqing Municipal Education Commission (KJQN202000707)




Sequential data may contain a large amount of sensitive information, so directly mining frequent patterns from it poses a significant risk to individual privacy. Local Differential Privacy (LDP) resists attackers with arbitrary background knowledge and can therefore provide more comprehensive protection for sensitive information. However, the inherent sequentiality and high dimensionality of sequential data make it challenging to mine frequent sequential patterns under LDP. To tackle this problem, a top-k frequent sequential pattern mining algorithm satisfying ε-LDP, called PrivSPM, was proposed. In this algorithm, padding and sampling techniques, an adaptive frequency estimation algorithm, and a frequent item prediction technique were integrated to construct a candidate set. Based on this new domain, an exponential-mechanism-based strategy was employed to perturb the user data, and the final frequent sequential patterns were identified with the help of the frequency estimation algorithm. Theoretical analysis proves that the proposed algorithm satisfies ε-LDP. Experimental results on three real datasets demonstrate that PrivSPM outperforms the comparison algorithm in terms of True Positive Rate (TPR) and Normalized Cumulative Rank (NCR), and effectively improves the accuracy of the mined results.

Key words: Local Differential Privacy (LDP), privacy protection, frequent sequential pattern mining, exponential mechanism, data mining
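
As an illustration of the perturbation step and the two evaluation metrics named in the abstract, the following is a minimal Python sketch, not the authors' implementation. It assumes a hypothetical binary utility (1 if a candidate pattern is a subsequence of the user's sequence, 0 otherwise) with sensitivity 1, and it assumes the definition of NCR commonly used in the LDP literature, where the i-th ranked true pattern scores k - i + 1. All function names are illustrative.

```python
import math
import random


def is_subsequence(pattern, sequence):
    """Return True if `pattern` occurs in `sequence` in order (gaps allowed)."""
    it = iter(sequence)
    return all(item in it for item in pattern)


def em_probabilities(user_sequence, candidates, epsilon):
    """Output distribution of the exponential mechanism over `candidates`.

    Assumed utility: 1 if the candidate is a subsequence of the user's
    sequence, else 0, so the sensitivity is 1. Weights exp(eps * u / 2)
    give an eps-LDP guarantee for any pair of user inputs.
    """
    weights = [
        math.exp(epsilon * (1.0 if is_subsequence(c, user_sequence) else 0.0) / 2.0)
        for c in candidates
    ]
    total = sum(weights)
    return [w / total for w in weights]


def perturb(user_sequence, candidates, epsilon, rng=random):
    """Sample one candidate to report in place of the user's raw sequence."""
    probs = em_probabilities(user_sequence, candidates, epsilon)
    return rng.choices(candidates, weights=probs, k=1)[0]


def true_positive_rate(true_topk, mined_topk):
    """Fraction of the true top-k patterns that appear in the mined top-k."""
    return len(set(true_topk) & set(mined_topk)) / len(true_topk)


def normalized_cumulative_rank(true_topk, mined_topk):
    """NCR under the assumed scoring: the i-th true pattern (1-based) scores k - i + 1."""
    k = len(true_topk)
    score = {p: k - i for i, p in enumerate(true_topk)}
    return sum(score.get(p, 0) for p in set(mined_topk)) / (k * (k + 1) / 2)


if __name__ == "__main__":
    candidates = [("a", "b"), ("b", "a"), ("c",)]
    report = perturb(("a", "c", "b"), candidates, epsilon=1.0)
    print("perturbed report:", report)
```

Each user would send only the sampled candidate to the aggregator, which then estimates pattern frequencies from the noisy reports. The choice of utility function strongly affects accuracy; the binary utility here is only the simplest option that keeps the sensitivity at 1.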



