《计算机应用》唯一官方网站 ›› 2025, Vol. 45 ›› Issue (11): 3453-3462.DOI: 10.11772/j.issn.1001-9081.2024121835

• 第七届CCF中国区块链技术大会 • 上一篇    

基于区块链的重复数据删除和数据完整性审计方案

高婷婷1, 姚中原1(), 贾淼1, 斯雪明1,2, 谭焕明3, 詹玉峰4   

  1. 1.中原工学院 前沿信息技术研究院,郑州 450007
    2.河南省区块链与数据共享国际联合实验室(中原工学院),郑州 450007
    3.福建福链科技有限公司,福州 350108
    4.北京理工大学 自动化学院,北京 100081
  • 收稿日期:2024-12-31 修回日期:2025-01-07 接受日期:2025-01-16 发布日期:2025-04-02 出版日期:2025-11-10
  • 通讯作者: 姚中原
  • 作者简介:高婷婷(2000—),女,甘肃平凉人,硕士研究生,主要研究方向:密码学、区块链
    贾淼(1999—),女,河南南阳人,硕士研究生,主要研究方向:密码学、区块链
    斯雪明(1966—),男,浙江诸暨人,教授,博士,CCF会员,主要研究方向:密码学、区块链
    谭焕明(1974—),男,浙江上虞人,副研究员,硕士,CCF会员,主要研究方向:密码学、区块链
    詹玉峰(1989—),男,北京人,副教授,博士,主要研究方向:云边端协同感知与控制。
  • 基金资助:
    国家重点研发计划“区块链”专项(2022YFB2703200)

Blockchain-based deduplication and data integrity audit scheme

Tingting GAO1, Zhongyuan YAO1(), Miao JIA1, Xueming SI1,2, Huanming TAN3, Yufeng ZHAN4   

  1. 1.The Frontier Information Technology Research Institute,Zhongyuan University of Technology,Zhengzhou Henan 450007,China
    2.Henan International Joint Laboratory of Blockchain and Data Sharing (Zhongyuan University of Technology),Zhengzhou Henan 450007,China
    3.Fujian Fulian Technology Company Limited,Fuzhou Fujian 350108,China
    4.School of Automation,Beijing Institute of Technology,Beijing 100081,China
  • Received:2024-12-31 Revised:2025-01-07 Accepted:2025-01-16 Online:2025-04-02 Published:2025-11-10
  • Contact: Zhongyuan YAO
  • About author:GAO Tingting, born in 2000, M. S. candidate. Her research interests include cryptology, blockchain.
    JIA Miao, born in 1999, M. S. candidate. Her research interests include cryptology, blockchain.
    SI Xueming, born in 1966, Ph. D., professor. His research interests include cryptology, blockchain.
    TAN Huanming, born in 1974, M. S., associate research fellow. His research interests include cryptology, blockchain.
    ZHAN Yufeng, born in 1989, Ph. D., associate professor. His research interests include cloud-edge-device collaborative sensing and control.
  • Supported by:
    This work is partially supported by National Key Research and Development Plan “Blockchain” Special Project(2022YFB2703200)

摘要:

为解决当前云存储系统中的数据冗余和数据完整性保障问题,提出一种基于区块链的云存储数据去重与完整性审计方案。该方案结合重复数据删除技术与区块链分布式审计机制,在实现去重存储的同时,可确保数据的机密性和完整性。首先,利用消息锁定加密(MLE)方法为不同用户的相同数据生成相同的加密密文,对于重复数据的上传者,采用基于沃克尔树(VT)的所有权证明(PoW)机制进行所有权验证,从而实现安全的密文去重;其次,基于区块链不可篡改的特性,设计了一种高效的数据完整性审计机制,能够在不影响用户数据隐私的前提下进行透明的审计和验证,可抵御包括外部攻击者、云服务提供商(CSP)恶意行为及密钥管理风险在内的多种威胁;最后,通过功能性分析和密钥安全、抗合谋攻击等安全性分析,证明所提方案具有较高的安全性和实用性。性能分析结果表明,完整性验证时间与挑战块数成正比,当挑战块数为1 000时,完整性验证时间约为85 ms。实验分析结果表明,该方案能为云存储系统提供可靠的去重与数据完整性审计服务。

关键词: 重复数据删除, 数据完整性, 区块链, 消息锁定加密, 数据隐私

Abstract:

To address the issues of data redundancy and data integrity protection in current cloud storage systems, a blockchain-based cloud storage data deduplication and integrity audit scheme was proposed. This approach combined deduplication technology with a blockchain distributed auditing mechanism, ensuring data confidentiality and integrity while achieving deduplicated storage. Firstly, the Message-Locked Encryption (MLE) method was utilized to generate identical encrypted ciphertexts for the same data from different users. For duplicate data uploaders, ownership verification was conducted using a Verkle Tree (VT)-based Proof of Ownership (PoW) mechanism, thus enabling secure ciphertext deduplication. Secondly, based on the immutable characteristic of blockchain, an efficient data integrity auditing mechanism was designed, allowing transparent auditing and verification without compromising user data privacy. This mechanism can defend against various threats, including external attackers, malicious behaviors from Cloud Service Providers (CSPs), and key management risks. Finally, through functional analysis and security analyses such as key security and resistance to collusion attacks, it is demonstrated that the proposed scheme has high security and practicality. Performance analysis results show that the integrity verification time is proportional to the number of challenge blocks, and when the number of challenge blocks is 1 000, the integrity verification time is approximately 85 ms. Experimental analysis results show that the scheme can provide with reliable deduplication and data integrity auditing services for the cloud storage systems.

Key words: deduplication, data integrity, blockchain, Message-Locked Encryption (MLE), data privacy

中图分类号: