Journal of Computer Applications ›› 2024, Vol. 44 ›› Issue (1): 242-251.DOI: 10.11772/j.issn.1001-9081.2023010031

Special Issue: Cyberspace Security

• Cyber security

User plagiarism identification scheme in social network under blockchain

Li LI1, Chunyan YANG2, Jiangwen ZHU2, Ronglei HU1

  1. College of Electronic and Communication Engineering, Beijing Electronic Science and Technology Institute, Beijing 100071, China
    2. College of Computer Science and Technology, Xidian University, Xi’an, Shaanxi 710071, China
  • Received: 2023-01-15 Revised: 2023-04-28 Accepted: 2023-05-12 Online: 2023-06-06 Published: 2024-01-10
  • Contact: Li LI
  • About author:YANG Chunyan, born in 1998, M. S. candidate. Her research interests include information security, blockchain.
    ZHU Jiangwen, born in 1997, M. S. candidate. His research interests include cryptography, information security.
    HU Ronglei, born in 1977, Ph. D., associate research fellow. His research interests include network information security, blockchain security.
  • Supported by:
    Fundamental Research Funds for Central Universities (3282023017)


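  • About author: YANG Chunyan, born in 1998 in Zhoukou, Henan, female, M. S. candidate. Her main research interests include information security and blockchain.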


User plagiarism in social networks is difficult to identify. To protect the rights of original authors and hold users accountable for plagiarism, a blockchain-based plagiarism identification scheme for social network users was proposed. Because existing blockchains lack a universal tracing model, a blockchain-based traceability information management model was designed to record user operation information and provide a basis for text similarity detection. A new index structure, BHMerkle, was designed based on the Merkle tree and Bloom filter structures; it reduced the computational overhead of block construction and query and enabled rapid positioning of transactions. In addition, a multi-feature weighted Simhash algorithm was proposed to improve the precision of word weight calculation and the efficiency of the signature value matching stage. In this way, malicious users who plagiarize could be identified, and malicious behavior could be curbed through a reward and punishment mechanism. On news datasets with different topics, the average precision and recall of the plagiarism detection scheme were 94.8% and 88.3%, respectively. Compared with the multi-dimensional Simhash algorithm and the Simhash algorithm based on information Entropy weighting (E-Simhash), the average precision was increased by 6.19 and 4.01 percentage points, respectively, and the average recall was increased by 3.12 and 2.92 percentage points, respectively. Experimental results show that the proposed scheme improves the query and detection efficiency for plagiarized text and identifies plagiarism with high accuracy.
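
The abstract does not give the details of the multi-feature weighting, so the following is only a minimal sketch of plain weighted Simhash, the baseline technique that such a scheme would build on. The helper `term_frequency_weights` is a hypothetical placeholder that uses raw term counts as weights; it is not the weighting proposed in the paper. The 64-bit fingerprint width and the use of MD5 as the per-token hash are likewise assumptions made for illustration.

```python
import hashlib
from collections import Counter


def simhash(weighted_tokens, bits=64):
    """Return a `bits`-wide Simhash fingerprint for a {token: weight} mapping.

    Assumption: `bits` is a multiple of 8 and at most 128 (the MD5 digest size).
    """
    totals = [0.0] * bits
    for token, weight in weighted_tokens.items():
        digest = hashlib.md5(token.encode("utf-8")).digest()
        token_hash = int.from_bytes(digest[: bits // 8], "big")
        for i in range(bits):
            # Add the weight where the token's hash bit is 1, subtract it otherwise.
            totals[i] += weight if (token_hash >> i) & 1 else -weight
    fingerprint = 0
    for i, total in enumerate(totals):
        if total > 0:
            fingerprint |= 1 << i
    return fingerprint


def hamming_distance(a, b):
    """Number of bit positions in which two fingerprints differ."""
    return bin(a ^ b).count("1")


def term_frequency_weights(text):
    """Hypothetical placeholder weighting: raw term counts per lowercase token."""
    return dict(Counter(text.lower().split()))


if __name__ == "__main__":
    original = simhash(term_frequency_weights("blockchain records every user post"))
    suspect = simhash(term_frequency_weights("blockchain records each user post"))
    # A small Hamming distance suggests near-duplicate text; the decision
    # threshold is application-specific and is not taken from the paper.
    print(hamming_distance(original, suspect))
```

Two texts are treated as candidate near-duplicates when the Hamming distance between their fingerprints falls below a chosen threshold. A richer weighting function can be swapped in for `term_frequency_weights` without changing the fingerprinting step.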

Key words: blockchain, plagiarism identification, Simhash algorithm, similarity detection, social network



