Cyber security

Select

Review on privacy-preserving technologies in federated learning

Teng WANG, Zheng HUO, Yaxin HUANG, Yilin FAN

Journal of Computer Applications 2023, 43 (2): 437-449. DOI: 10.11772/j.issn.1001-9081.2021122072

Abstract （1725）

HTML （165）

PDF （2014KB）（1293）

Save

In recent years， federated learning has become a new way to solve the problems of data island and privacy leakage in machine learning. Federated learning architecture does not require multiple parties to share data resources， in which participants only needed to train local models on local data and periodically upload parameters to the server to update the global model， and then a machine learning model can be built on large-scale global data. Federated learning architecture has the privacy-preserving nature and is a new scheme for large-scale data machine learning in the future. However， the parameter interaction mode of this architecture may lead to data privacy disclosure. At present， strengthening the privacy-preserving mechanism in federated learning architecture has become a new research hotspot. Starting from the privacy disclosure problem in federated learning， the attack models and sensitive information disclosure paths in federated learning were discussed， and several types of privacy-preserving techniques in federated learning were highlighted and reviewed， such as privacy-preserving technology based on differential privacy， privacy-preserving technology based on homomorphic encryption， and privacy-preserving technology based on Secure Multiparty Computation （SMC）. Finally， the key issues of privacy protection in federated learning were discussed， the future research directions were prospected.

Table and Figures | Reference | Related Articles | Metrics

Select

Survey of anonymity and tracking technology in Monero

Dingkang LIN, Jiaqi YAN, Nandeng BA, Zhenhao FU, Haochen JIANG

Journal of Computer Applications 2022, 42 (1): 148-156. DOI: 10.11772/j.issn.1001-9081.2021020296

Abstract （1587）

HTML （78）

PDF （723KB）（765）

Save

Virtual digital currency provides a breeding ground for terrorist financing， money laundering， drug trafficking and other criminal activities. As a representative emerging digital currency， Monero has a universally acknowledged high anonymity. Aiming at the problem of using Monroe anonymity to commit crimes， Monero anonymity technology and tracking technology were explored as well as the research progresses were reviewed in recent years， so as to provide technical supports for effectively tackling the crimes based on blockchain technology. In specific， the evolution of Monero anonymity technology was summarized， and the tracking strategies of Monero anonymity technology in academic circles were sorted out. Firstly， in the anonymity technologies， ring signature， guaranteed unlinkability （one-off public key）， guaranteed untraceability， and the important version upgrading for improving anonymity were introduced. Then， in tracking technologies， the attacks such as zero mixin attack， output merging attack， guess-newest attack， closed set attack， transaction flooding attack， tracing attacks from remote nodes and Monero ring attack were introduced. Finally， based on the analysis of anonymity technologies and tracking strategies， four conclusions were obtained： the development of anonymity technology and the development of tracking technology of Monero promote each other； the application of Ring Confidential Transactions （RingCT） is a two-edged sword， which makes the passive attack methods based on currency value ineffective， and also makes the active attack methods easier to succeed； output merging attack and zero mixin attack complement each other； Monero’s system security chain still needs to be sorted out.

Table and Figures | Reference | Related Articles | Metrics

Select

Optimized CKKS scheme based on learning with errors problem

ZHENG Shangwen, LIU Yao, ZHOU Tanping, YANG Xiaoyuan

Journal of Computer Applications 2021, 41 (6): 1723-1728. DOI: 10.11772/j.issn.1001-9081.2020091447

Abstract （1107）

PDF （760KB）（1042）

Save

Focused on the issue that the CKKS (Cheon-Kim-Kim-Song) homomorphic encryption scheme based on the Learning With Errors (LWE) problem has large ciphertext, complicated calculation key generation and low homomorphic calculation efficiency in the encrypted data calculation, an optimized scheme of LWE type CKKS was proposed through the method of bit discarding and homomorphic calculation key reorganization. Firstly, the size of the ciphertext in the homomorphic multiplication process was reduced by discarding part of the low-order bits of the ciphertext vector and part of the low-order bits of the ciphertext tensor product in the homomorphic multiplication. Secondly, the method of bit discarding was used to reorganize and optimize the homomorphic calculation key, so as to remove the irrelevant extension items in powersof2 during the key exchange procedure and reduce the scale of the calculation key as well as the noise increase in the process of homomorphic multiplication. On the basis of ensuring the security of the original scheme, the proposed optimized scheme makes the dimension of the calculation key reduced, and the computational complexity of the homomorphic multiplication reduced. The analysis results show that the proposed optimized scheme reduces the computational complexity of the homomorphic calculation and calculation key generation process to a certain extent, so as to reduce the storage overhead and improve the efficiency of the homomorphic multiplication operation.

Reference | Related Articles | Metrics

Select

Federated learning algorithm based on personalized differential privacy

Chunyong YIN, Rui QU

Journal of Computer Applications 2023, 43 (4): 1160-1168. DOI: 10.11772/j.issn.1001-9081.2022030337

Abstract （915）

HTML （44）

PDF （1800KB）（600）

Save

Federated Learning （FL） can effectively protect users' personal data from attackers. Differential Privacy （DP） is applied to enhance the privacy of FL， which can solve the problem of privacy disclose caused by parameters in the model training. However， existing FL methods based on DP on concentrate on the unified privacy protection budget and ignore the personalized privacy requirements of users. To solve this problem， a two-stage Federated Learning with Personalized Differential Privacy （PDP-FL） algorithm was proposed. In the first stage， the user's privacy was graded according to the user's privacy preference， and the noise meeting the user's privacy preference was added to achieve the purpose of personalized privacy protection. At the same time， the privacy level corresponding to the privacy preference was uploaded to the central aggregation server. In the second stage， in order to fully protect the global data， the simultaneous local and central protection strategy was adopted. And according to the privacy level uploaded by the user， the noise conforming to the global DP threshold was added to quantify the global privacy protection level. Experimental results show that on MNIST and CIFAR-10 datasets， the classification accuracy of PDP-FL algorithm reaches 93.8% to 94.5% and 43.4% to 45.2% respectively， which is better than those of Federated learning with Local Differential Privacy （LDP-Fed） algorithm and Federated Learning with Global Differential Privacy （GDP-FL） algorithm， PDP-FL algorithm meets the needs of personalized privacy protection.

Table and Figures | Reference | Related Articles | Metrics

Select

Improved practical Byzantine fault tolerance consensus algorithm based on Raft algorithm

WANG Jindong, LI Qiang

Journal of Computer Applications 2023, 43 (1): 122-129. DOI: 10.11772/j.issn.1001-9081.2021111996

Abstract （876）

HTML （40）

PDF （2834KB）（411）

Save

Since Practical Byzantine Fault Tolerance （PBFT） consensus algorithm applied to consortium blockchain has the problems of insufficient scalability and high communication overhead， an improved practical Byzantine fault tolerance consensus algorithm based on Raft algorithm named K-RPBFT （K-medoids Raft based Practical Byzantine Fault Tolerance） was proposed. Firstly， blockchain was sharded based on K-medoids clustering algorithm， all nodes were divided into multiple node clusters and each node cluster constituted to a single shard， so that global consensus was improved to hierarchical multi-center consensus. Secondly， the consus between the cluster central nodes of each shard was performed by adopting PBFT algorithm， and the improved Raft algorithm based on supervision nodes was used for intra-shard consensus. The supervision mechanism in each shard gave a certain ability of Byzantine fault tolerance to Raft algorithm and improved the security of the algorithm. Experimental analysis shows that compared with PBFT algorithm， K-RPBFT algorithm greatly reduces the communication overhead and consensus latency， improves the consensus efficiency and throughput while having Byzantine fault tolerance ability， and has good scalability and dynamics， so that the consortium blockchain can be applied to a wider range of fields.

Reference | Related Articles | Metrics

Select

Cross-chain mechanism based on Spark blockchain

Jiagui XIE, Zhiping LI, Jian JIN

Journal of Computer Applications 2022, 42 (2): 519-527. DOI: 10.11772/j.issn.1001-9081.2021020353

Abstract （868）

HTML （56）

PDF （888KB）（701）

Save

Considering different blockchains being isolated and the data interaction and sharing difficulties in the current rapid development process of blockchain technology， a cross-chain mechanism based on Spark blockchain was proposed. Firstly， common cross-chain technologies and current mainstream cross-chain projects were analyzed， the implementation principles of different technologies and projects were studied， and their differences， advantages and disadvantages were summarized. Then， using the blockchain architecture maned main-sub blockchain mode， the key core components such as smart contract component， transaction verification component， transaction timeout component were designed， and the four stages of cross-chain process were elaborated in detail， including transaction initiation， transaction routing， transaction verification and transaction confirmation. Finally， the feasible experiments were designed for performance test and security test， and the security was analyzed. Experimental results show that Spark blockchain has significant advantages compared to other blockchains in terms of transaction delay， throughput and spike testing. Besides， when the proportion of malicious nodes is low， the success rate of cross-chain transactions is 100%， and different sub chains can conduct cross-chain transactions safely and stably. This mechanism solves the problem of data interaction and sharing between blockchains， and provides technical reference for the design of Spark blockchain application scenarios in the next step.

Table and Figures | Reference | Related Articles | Metrics

Select

Abnormal flow detection based on improved one-dimensional convolutional neural network

HANG Mengxin, CHEN Wei, ZHANG Renjie

Journal of Computer Applications 2021, 41 (2): 433-440. DOI: 10.11772/j.issn.1001-9081.2020050734

Abstract （778）

PDF （1011KB）（827）

Save

In order to solve the problems that traditional machine learning based abnormal flow detection methods rely heavily on features, and the detection methods based on deep learning are inefficient and easy to overfit, an abnormal flow detection method based on Improved one-Dimentional Convolutional Neural Network (ICNN-1D) was proposed, namely AFM-ICNN-1D. Different from "convolution-pooling-full connection" structure of the traditional CNN, the ICNN-1D is mainly composed of 2 convolutional layers, 2 global pooling layers, 1 dropout layer and 1 fully connected output layer. The preprocessed data were put into ICNN-1D, and the result after two convolutional layers was used as the input of the global average pooling layer and the global maximum pooling layer, then the obtained output data were merged and sent to the fully connected layer to classify. The model was optimized according to the classification result and the real dataset, then it was used to the abnormal flow detection. The experimental results on the CIC-IDS-2017 dataset showed that the accuracy and recall rate of AFM-ICNN-1D reached 98%, which is better than that of the comparative k-Nearest Neighbor (kNN) and Random Forest (RF) methods. Moreover, compared with traditional CNN, the model parameters were reduced by about 97%, and the training time was shortened by about 40%. Experimental results show that AFM-ICNN-1D has high detection performance, which can reduce training time and avoid over fitting with better retaining the local characteristics of traffic data.

Reference | Related Articles | Metrics

Select

Review on blockchain smart contract vulnerability detection and automatic repair

Juncheng TONG, Bo ZHAO

Journal of Computer Applications 2023, 43 (3): 785-793. DOI: 10.11772/j.issn.1001-9081.2022020179

Abstract （766）

HTML （57）

PDF （2782KB）（692）

PDF（mobile）（582KB）（39）

Save

Smart contract technology， as a milestone of blockchain 2.0， has received widespread attention from both academic and industry circles. It runs on an underlying infrastructure without trusted computing environment and has characteristics that distinguish it from traditional programs， and there are many vulnerabilities with huge influence in its own security， so that the research on security auditing for it has become a popular and urgent key scientific problem in the field of blockchain security. Aiming at the detection and automatic repair of smart contract vulnerabilities， firstly， main types and classifications of smart contract vulnerabilities were introduced. Secondly， three most important methods of smart contract vulnerability detection in the past five years were reviewed， and representative and innovative research techniques of each method were introduced. Thirdly， smart contract upgrade schemes and cutting-edge automatic repair technologies were introduced in detail. Finally， challenges and future work of smart contract vulnerability detection and automatic repair technologies for online， real-time， multi-platform， automatic， and intelligent requirements were analyzed and prospected as a framework of technical solutions.

Table and Figures | Reference | Related Articles | Metrics

Select

Poisoning attack detection scheme based on generative adversarial network for federated learning

Qian CHEN, Zheng CHAI, Zilong WANG, Jiawei CHEN

Journal of Computer Applications 2023, 43 (12): 3790-3798. DOI: 10.11772/j.issn.1001-9081.2022121831

Abstract （762）

HTML （43）

PDF （2367KB）（957）

Save

Federated Learning （FL） emerges as a novel privacy-preserving Machine Learning （ML） paradigm. However， the distributed training structure of FL is more vulnerable to poisoning attack， where adversaries contaminate the global model through uploading poisoning models， resulting in the convergence deceleration and the prediction accuracy degradation of the global model. To solve the above problem， a poisoning attack detection scheme based on Generative Adversarial Network （GAN） was proposed. Firstly， the benign local models were fed into the GAN to output testing samples. Then， the testing samples were used to detect the local models uploaded by the clients. Finally， the poisoning models were eliminated according to the testing metrics. Meanwhile， two test metrics named F1 score loss and accuracy loss were defined to detect the poisoning models and extend the detection scope from one single type of poisoning attacks to all types of poisoning attacks. Besides， a threshold determination method was designed to deal with misjudgment， so that the robust of misjudgment was confirmed. Experimental results on MNIST and Fashion-MNIST datasets show that the proposed scheme can generate high-quality testing samples， and then detect and eliminate poisoning models. Compared with the global models trained with the detection scheme based on directly gathering test data from clients and the detection scheme based on generating test data and using test accuracy as the test metric， the global model trained with the proposed scheme has significant accuracy improvement from 2.7 to 12.2 percentage points.

Table and Figures | Reference | Related Articles | Metrics

Select

Review of zero trust network and its key technologies

Qun WANG, Quan YUAN, Fujuan LI, Lingling XIA

Journal of Computer Applications 2023, 43 (4): 1142-1150. DOI: 10.11772/j.issn.1001-9081.2022030453

Abstract （747）

HTML （53）

PDF （2001KB）（587）

Save

With increasingly severe network security threats and increasingly complex security defense means， zero trust network is a new evaluation and review of traditional boundary security architecture. Zero trust emphasizes never always trusting anything and verifying things continuously. Zero trust network emphasizes that the identity is not identified by location， all access controls strictly execute minimum permissions， and all access processes are tracked in real time and evaluated dynamically. Firstly， the basic definition of zero trust network was given， the main problems of traditional perimeter security were pointed out， and the zero trust network model was described. Secondly， the key technologies of zero trust network， such as Software Defined Perimeter （SDP）， identity and access management， micro segmentation and Automated Configuration Management System （ACMS）， were analyzed. Finally， zero trust network was summarized and its future development was prospected.

Table and Figures | Reference | Related Articles | Metrics

Select

Visual image encryption algorithm based on Hopfield chaotic neural network and compressive sensing

SHEN Ziyi, WANG Weiya, JIANG Donghua, RONG Xianwei

Journal of Computer Applications 2021, 41 (10): 2893-2899. DOI: 10.11772/j.issn.1001-9081.2020121942

Abstract （732）

PDF （4865KB）（519）

Save

At present, most image encryption algorithms directly encrypt the plaintext image into a ciphertext image without visual meaning, which is easy to be found by hackers during the transmission process and therefore subjected to various attacks. In order to solve the problem, combining Hopfield chaotic neural network and compressive sensing technology, a visually meaningful image encryption algorithm was proposed. Firstly, the two-dimensional discrete wavelet transform was used to sparse the plaintext image. Secondly, the sparse matrix after threshold processing was encrypted and measured by compressive sensing. Thirdly, the quantized intermediate ciphertext image was filled with random numbers, and Hilbert scrambling and diffusion operations were performed to the image. Finally, the generated noise-like ciphertext image was embedded into the Alpha channel of the carrier image though the Least Significant Bit (LSB) replacement to obtain the visually meaningful steganographic image. Compared with the existing visual image encryption algorithms, the proposed algorithm demonstrates very good visual security, decryption quality and robustness, showing that it has widely application scenarios.

Reference | Related Articles | Metrics

Select

Unsupervised time series anomaly detection model based on re-encoding

Chunyong YIN, Liwen ZHOU

Journal of Computer Applications 2023, 43 (3): 804-811. DOI: 10.11772/j.issn.1001-9081.2022010006

Abstract （721）

HTML （52）

PDF （1769KB）（349）

Save

In order to deal with the problem of low accuracy of anomaly detection caused by data imbalance and highly complex temporal correlation of time series， a re-encoding based unsupervised time series anomaly detection model based on Generative Adversarial Network （GAN）， named RTGAN （Re-encoding Time series based on GAN）， was proposed. Firstly， multiple generators with cycle consistency were used to ensure the diversity of generated samples and thereby learning different anomaly patterns. Secondly， the stacked Long Short-Term Memory-dropout Recurrent Neural Network （LSTM-dropout RNN） was used to capture temporal correlation. Thirdly， the differences between the generated samples and the real samples were compared in the latent space by improved re-encoding. As the re-encoding errors， these differences were served as a part of anomaly score to improve the accuracy of anomaly detection. Finally， the new anomaly score was used to detect anomalies on univariate and multivariate time series datasets. The proposed model was compared with seven baseline anomaly detection models on univariate and multivariate time series. Experimental results show that the proposed model obtains the highest average F1-score （0.815） on all datasets. And the overall performance of the proposed model is 36.29% and 8.52% respectively higher than those of the original AutoEncoder （AE） model Dense-AE （Dense-AutoEncoder） and latest benchmark model USAD （UnSupervised Anomaly Detection on multivariate time series）. The robustness of the model was detected by different Signal-to-Noise Ratio （SNR）. The results show that the proposed model consistently outperforms LSTM-VAE （Variational Autoencoder based on LSTM）， USAD and OmniAnomaly， especially in the case of 30% SNR， the F1-score of RTGAN is 13.53% and 10.97% respectively higher than those of USAD and OmniAnomaly. It can be seen that RTGAN can effectively improve the accuracy and robustness of anomaly detection.

Table and Figures | Reference | Related Articles | Metrics

Select

Secure storage and sharing scheme of internet of vehicles data based on hybrid architecture of blockchain and cloud-edge computing

WU Guangfu, WANG Yingjun

Journal of Computer Applications 2021, 41 (10): 2885-2892. DOI: 10.11772/j.issn.1001-9081.2020121938

Abstract （716）

PDF （897KB）（642）

Save

In order to solve the problems such as high time delay, data leakage and malicious vehicle nodes tampering data of cloud computing in Internet of Vehicles (IoV), a secure storage and sharing scheme of IoV data based on hybrid architecture of blockchain and cloud-edge computing was proposed. Firstly, the dual-chain decentralized storage structure of consortium blockchain-private blockchain was adopted to ensure the security of communication data. Then, the identity-based digital signcryption algorithm and the discrete central binomial distribution-based ring signature scheme were used to solve the security problem in the communication process. Finally, the Dynamic-layering and Reputation-evaluation Practical Byzantine Fault Tolerant mechanism (DRPBFT) was proposed, and the edge computing technology was combined with the cloud computing technology, so as to solve the high time delay problem. Security analysis shows that the proposed scheme can guarantee the security and integrity of data during the information sharing process. Experimental simulation and performance evaluation results show that, DRPBFT has the time delay within 6 s, and effectively improves the throughput of the system. The proposed IoV scheme effectively improves the enthusiasm of vehicle data sharing, leads to more efficient and stable operation of IoV system, and achieves the real-time and efficient purposes of IoV.

Reference | Related Articles | Metrics

Select

Hierarchical access control and sharing system of medical data based on blockchain

Meng CAO, Sunjie YU, Hui ZENG, Hongzhou SHI

Journal of Computer Applications 2023, 43 (5): 1518-1526. DOI: 10.11772/j.issn.1001-9081.2022050733

Abstract （713）

HTML （35）

PDF （2871KB）（356）

Save

Focusing on coarse granularity of access control， low sharing flexibility and security risks such as data leakage of centralized medical data sharing platform， a blockchain-based hierarchical access control and sharing system of medical data was proposed. Firstly， medical data was classified according to sensitivity， and a Ciphertext-Policy Attribute-Based Hierarchical Encryption （CP-ABHE） algorithm was proposed to achieve access control of medical data with different sensitivity. In the algorithm， access control trees were merged and symmetric encryption methods were combinined to improve the performance of Ciphertext-Policy Attribute-Based Encryption （CP-ABE） algorithm， and the multi-authority center was used to solve the key escrow problem. Then， the medical data sharing mode based on permissioned blockchain was used to solve the centralized trust problem of centralized sharing platform. Security analysis shows that the proposed system ensures the security of data during the data sharing process， and can resist user collusion attacks and authority collusion attacks. Experimental results also show that the proposed CP-ABHE algorithm has lower computational cost than CP-ABE algorithm， the maximum average delay of the proposed system is 7.8 s， and the maximum throughput is 236 transactions per second， which meets the expected performance requirements.

Table and Figures | Reference | Related Articles | Metrics

Select

Review of white-box adversarial attack technologies in image classification

Jiaxuan WEI, Shikang DU, Zhixuan YU, Ruisheng ZHANG

Journal of Computer Applications 2022, 42 (9): 2732-2741. DOI: 10.11772/j.issn.1001-9081.2021071339

Abstract （682）

HTML （38）

PDF （2101KB）（576）

Save

In the research of image classification tasks in deep learning， the phenomenon of adversarial attacks brings severe challenges to the secure application of deep learning models， which arouses widespread attention of researchers. Firstly， around the adversarial attack technologies for generating the adversarial perturbations， the important white-box adversarial attack algorithms in the image classification tasks were introduced in detail， and the advantages and disadvantages of different attack algorithms were analyzed. Then， from three realistic application scenarios： mobile application， face recognition and autonomous driving， the application status of the white-box adversarial attack technologies was illustrated. Additionally， some typical white-box adversarial attack algorithms were selected to perform experiments on different target models， and the experimental results were analyzed. Finally， the white-box adversarial attack technologies were summarized， and their valuable research directions were prospected.

Table and Figures | Reference | Related Articles | Metrics

Select

Adversarial attack algorithm for deep learning interpretability

Quan CHEN, Li LI, Yongle CHEN, Yuexing DUAN

Journal of Computer Applications 2022, 42 (2): 510-518. DOI: 10.11772/j.issn.1001-9081.2021020360

Abstract （640）

HTML （20）

PDF （1283KB）（425）

Save

Aiming at the problem of model information leakage caused by interpretability in Deep Neural Network （DNN）， the feasibility of using the Gradient-weighted Class Activation Mapping （Grad-CAM） interpretation method to generate adversarial samples in a white-box environment was proved， moreover， an untargeted black-box attack algorithm named dynamic genetic algorithm was proposed. In the algorithm， first， the fitness function was improved according to the changing relationship between the interpretation area and the positions of the disturbed pixels. Then， through multiple rounds of genetic algorithm， the disturbance value was continuously reduced while increasing the number of the disturbed pixels， and the set of result coordinates of each round would be maintained and used in the next round of iteration until the perturbed pixel set caused the predicted label to be flipped without exceeding the perturbation boundary. In the experiment part， the average attack success rate under the AlexNet， VGG-19， ResNet-50 and SqueezeNet models of the proposed algorithm was 92.88%， which was increased by 16.53 percentage points compared with that of One pixel algorithm， although with the running time increased by 8% compared with that of One pixel algorithm. In addition， in a shorter running time， the proposed algorithm had the success rate higher than the Adaptive Fast Gradient Sign Method （Ada-FGSM） algorithm by 3.18 percentage points， higher than the Projection & Probability-driven Black-box Attack （PPBA） algorithm by 8.63 percentage points， and not much different from Boundary-attack algorithm. The results show that the dynamic genetic algorithm based on the interpretation method can effectively execute the adversarial attack.

Table and Figures | Reference | Related Articles | Metrics

Select

Encrypted traffic classification method based on data stream

GUO Shuai, SU Yang

Journal of Computer Applications 2021, 41 (5): 1386-1391. DOI: 10.11772/j.issn.1001-9081.2020071073

Abstract （636）

PDF （948KB）（1311）

Save

Aiming at the problems of fast classification and accurate identification of encrypted traffic in current network, a new feature extraction method for data stream was proposed. Based on the characteristics of sequential data and the law of the SSL (Secure Sockets Layer) handshake protocol, an end-to-end one-dimensional convolutional neural network model was adopted, and five-tuples were used to label the data stream. By selecting the data stream representation manner, the number of data packets, and the length of feature bytes, the key field positions of sample classification were located more accurately, and the features with little impact on sample classification were removed, so that the 784 bytes used by a single data stream during the original input were reduced to 529 bytes, which reduced 32% of the original length, and the classification of 12 encrypted traffic service types was implemented with the accuracy of 95.5%. These results show that the proposed method can reduce the original input feature dimension and improve the efficiency of data processing on the basis of ensuring the accuracy of the current research.

Reference | Related Articles | Metrics

Select

Malware detection method based on perceptual hash algorithm and feature fusion

JIANG Qianyu, WANG Fengying, JIA Lipeng

Journal of Computer Applications 2021, 41 (3): 780-785. DOI: 10.11772/j.issn.1001-9081.2020060906

Abstract （621）

PDF （995KB）（534）

Save

In the current detection of the malware family, the local features or global features extracted through the grayscale image of the malware cannot fully describe the malware. Aiming at the problem and to improve the detection effect, a malware detection method based on perceptual hash algorithm and feature fusion was proposed. Firstly, the grayscale image samples of malware were detected through the perceptual hash algorithm, and samples of specific malware families and uncertain malware families were quickly divided. Experimental tests showed that about 67% malwares were able to be detected by the perceptual hash algorithm. Then, the local features of Local Binary Pattern (LBP) and global features of Gist were further extracted for the samples of uncertain families, and the features of merging the above two features were used to classify and detect the malware samples by the machine learning algorithm. Finally, experimental results of the detection of 25 types of malware families show that the detection accuracy is higher when using the fusion feature of LBP and Gist compared to that when using a single feature only, and the proposed method is more efficient in classification and detection than the detection algorithm using machine learning only with the detection speed increased by 93.5%.

Reference | Related Articles | Metrics

Select

Efficient homomorphic neural network supporting privacy-preserving training

Yang ZHONG, Renwan BI, Xishan YAN, Zuobin YING, Jinbo XIONG

Journal of Computer Applications 2022, 42 (12): 3792-3800. DOI: 10.11772/j.issn.1001-9081.2021101775

Abstract （610）

HTML （19）

PDF （1538KB）（275）

Save

Aiming at the problems of low computational efficiency and insufficient accuracy in the privacy-preserving neural network based on homomorphic encryption， an efficient Homomorphic Neural Network （HNN） under three-party collaborative supporting privacy-preserving training was proposed. Firstly， in order to reduce the computational cost of ciphertext-ciphertext multiplication in homomorphic encryption， the idea of secret sharing was combined to design a secure fast multiplication protocol to convert the ciphertext-ciphertext multiplication into plaintext-ciphertext multiplication with low complexity. Then， in order to avoid multiple iterations of ciphertext polynomials generated during the construction of HNN and improve the nonlinear calculation accuracy， a secure nonlinear calculation method was studied， which executed the corresponding nonlinear operator for the confused plaintext message with random mask. Finally， the security， correctness and efficiency of the proposed protocols were analyzed theoretically， and the effectiveness and superiority of HNN were verified by experiments. Experimental results show that compared with the dual server scheme PPML （Privacy Protection Machine Learning）， HNN has the training efficiency improved by 18.9 times and the model accuracy improved by 1.4 percentage points.

Table and Figures | Reference | Related Articles | Metrics

Select

Malicious code detection based on multi-channel image deep learning

JIANG Kaolin, BAI Wei, ZHANG Lei, CHEN Jun, PAN Zhisong, GUO Shize

Journal of Computer Applications 2021, 41 (4): 1142-1147. DOI: 10.11772/j.issn.1001-9081.2020081224

Abstract （599）

PDF （2386KB）（718）

Save

Existing deep learning-based malicious code detection methods have problems such as weak deep-level feature extraction capability, relatively complex model and insufficient model generalization capability. At the same time, code reuse phenomenon occurred in large number of malicious samples of the same type, resulting in similar visual features of the code. This similarity can be used for malicious code detection. Therefore, a malicious code detection method based on multi-channel image visual features and AlexNet was proposed. In the method, the codes to be detected were converted into multi-channel images at first. After that, AlexNet was used to extract and classify the color texture features of the images, so as to detect the possible malicious codes. Meanwhile, the multi-channel image feature extraction, the Local Response Normalization(LRN) and other technologies were used comprehensively, which effectively improved the generalization ability of the model with effective reduction of the complexity of the model. The Malimg dataset after equalization was used for testing, the results showed that the average classification accuracy of the proposed method was 97.8%, and the method had the accuracy increased by 1.8% and the detection efficiency increased by 60.2% compared with the VGGNet method. Experimental results show that the color texture features of multi-channel images can better reflect the type information of malicious codes, the simple network structure of AlexNet can effectively improve the detection efficiency, and the local response normalization can improve the generalization ability and detection effect of the model.

Reference | Related Articles | Metrics

Select

Network intrusion detection model based on efficient federated learning algorithm

Shaochen HAO, Zizuan WEI, Yao MA, Dan YU, Yongle CHEN

Journal of Computer Applications 2023, 43 (4): 1169-1175. DOI: 10.11772/j.issn.1001-9081.2022020305

Abstract （588）

HTML （25）

PDF （1650KB）（534）

Save

After the introduction of federated learning technology in intrusion detection scenarios， there is a problem that the traffic data between nodes is non-independent and identically distributed （non-iid）， which makes it difficult for models to aggregate and obtain a high recognition rate. To solve this problem， an efficient federated learning algorithm named H?E?Fed was constructed， and a network intrusion detection model based on this algorithm was proposed. Firstly， a global model for traffic data was designed by the coordinator and was sent to the intrusion detection nodes for model training. Then， by the coordinator， the local models were collected and the skewness of the covariance matrix of the local models between nodes was evaluated， so as to measure the correlation of models between nodes， thereby reassigning model aggregation parameters and generating a new global model. Finally， multiple rounds of interactions between the coordinator and the nodes were carried out until the global model converged. Experimental results show that compared with the models based on FedAvg （Federated Averaging） algorithm and FedProx algorithm， under data non-iid phenomenon between nodes， the proposed model has the communication consumption relatively low. And on KDDCup99 dataset and CICIDS2017 dataset， compared with baseline models， the proposed model has the accuracy improved by 10.39%， 8.14% and 4.40%， 5.98% respectively.

Table and Figures | Reference | Related Articles | Metrics

Select

Reversible data hiding algorithm in encrypted domain based on secret image sharing

Zexi WANG, Minqing ZHANG, Yan KE, Yongjun KONG

Journal of Computer Applications 2022, 42 (5): 1480-1489. DOI: 10.11772/j.issn.1001-9081.2021050823

Abstract （587）

HTML （22）

PDF （4022KB）（276）

Save

The current reversible data hiding algorithms in encrypted domain have the problems that the ciphertext images carrying secret have poor fault tolerance and disaster resistance after embedding secret data， once attacked or damaged， the original image cannot be reconstructed and the secret data cannot be extracted. In order to solve the problems， a new reversible data hiding algorithm in encrypted domain based on secret image sharing was proposed， and its application scenarios in cloud environment were analyzed. Firstly， the encrypted image was divided into n different ciphertext images carrying secret with the same size. Secondly， in the process of segmentation， the random quantities in Lagrange interpolation polynomial were taken as redundant information， and the mapping relationship between secret data and each polynomial coefficient was established. Finally， the reversible embedding of the secret data was realized by modifying the built-in parameters of the encryption process. When k ciphertext images carrying secret were collected， the original image was able to be fully recovered and the secret data was able to be extracted. Experimental results show that， the proposed algorithm has the advantages of low computational complexity， large embedding capacity and complete reversibility. In the （3，4） threshold scheme， the maximum embedding rate of the proposed algorithm is 4 bit per pixel （bpp）， and in the （4，4） threshold scheme， the maximum embedding rate of the proposed algorithm is 6 bpp. The proposed algorithm gives full play to the disaster recovery characteristic of secret sharing scheme. Without reducing the security of secret sharing， the proposed algorithm enhances the fault tolerance and disaster resistance of ciphertext images carrying secret， improves the embedding capacity of algorithm and the disaster recovery ability in the application scenario of cloud environment， and ensures the security of carrier image and secret data.

Table and Figures | Reference | Related Articles | Metrics

Select

Poisoning attack toward visual classification model

Jie LIANG, Xiaoyan HAO, Yongle CHEN

Journal of Computer Applications 2023, 43 (2): 467-473. DOI: 10.11772/j.issn.1001-9081.2021122068

Abstract （582）

HTML （21）

PDF （3264KB）（259）

Save

In data poisoning attacks， backdoor attackers manipulate the distribution of training data by inserting the samples with hidden triggers into the training set to make the test samples misclassified so as to change model behavior and reduce model performance. However， the drawback of the existing triggers is the sample independence， that is， no matter what trigger mode is adopted， different poisoned samples contain the same triggers. Therefore， by combining image steganography and Deep Convolutional Generative Adversarial Network （DCGAN）， an attack method based on sample was put forward to generate image texture feature maps according to the gray level co-occurrence matrix， embed target label character into the texture feature maps as a trigger by using the image steganography technology， and combine texture feature maps with trigger and clean samples into poisoned samples. Then， a large number of fake pictures with trigger were generated through DCGAN. In the training set samples， the original poisoned samples and the fake pictures generated by DCGAN were mixed together to finally achieve the effect that after the poisoner injecting a small number of poisoned samples， the attack rate was high and the effectiveness， sustainability and concealment of the trigger were ensured. Experimental results show that this method avoids the disadvantages of sample independence and has the model accuracy reached 93.78%. When the proportion of poisoned samples is 30%， data preprocessing， pruning defense and AUROR defense have the least influence on the success rate of attack， and the success rate of attack can reach about 56%.

Table and Figures | Reference | Related Articles | Metrics

Select

Image steganalysis method based on saliency detection

HUANG Siyuan, ZHANG Minqing, KE Yan, BI Xinliang

Journal of Computer Applications 2021, 41 (2): 441-448. DOI: 10.11772/j.issn.1001-9081.2020081323

Abstract （571）

PDF （1782KB）（883）

Save

Aiming at the problem that the steganalysis of images is difficult, and the existing detection models are difficult to make a targeted analysis of steganography regions of images, a method for image steganalysis based on saliency detection was proposed. In the proposed method, the saliency detection was used to guide the steganalysis model to focus on the image features of steganography regions. Firstly, the saliency detection module was used to generate saliency regions of the image. Secondly, the region filter module was used to filter the saliency images with a high degree of coincidence with the steganography regions, and image fusion technology was used to fuse them with the original images. Finally, the quality of training set was improved by replacing the error detection images with their corresponding saliency fusion images, so as to improve the training effect and detection ability of the model. The experiments were carried out on BOSSbase1.01 dataset. The dataset was embedded by image adaptive steganography algorithms in spatial domain and JPEG domain respectively, and experimental results show that the proposed method can effectively improve the the detection accuracy for deep learning-based steganalysis model by 3 percentage points at most. The mismatch test was also carried out on IStego100K dataset to further verify the generalization ability of the model and improve its application value. According to the result of the mismatch test, the proposed method has certain generalization ability.

Reference | Related Articles | Metrics

Select

Intrusion detection based on improved triplet network and K-nearest neighbor algorithm

WANG Yue, JIANG Yiming, LAN Julong

Journal of Computer Applications 2021, 41 (7): 1996-2002. DOI: 10.11772/j.issn.1001-9081.2020081217

Abstract （570）

PDF （1105KB）（388）

Save

Intrusion detection is one of the important means to ensure network security. To address the problem that it is difficult to balance detection accuracy and computational efficiency in network intrusion detection, based on the idea of deep metric learning, a network intrusion detection model combining improved Triplet Network (imTN) and K-Nearest Neighbor (KNN) was proposed, namely imTN-KNN. Firstly, a triplet network structure suitable for solving intrusion detection problems was designed to obtain the distance features that are more conducive to the subsequent classification. Secondly, due to the overfitting problem caused by removing the Batch Normalization (BN) layer from the traditional model which affected the detection precision, a Dropout layer and a Sigmoid activation layer were introduced to replace the BN layer, thus improving the model performance. Finally, the loss function of the traditional triplet network model was replaced with the multi-similarity loss function. In addition, the distance feature output of the imTN was used as the input of the KNN algorithm for retraining. Comparison experiments on the benchmark dataset IDS2018 show that compared with the Deep Neural Network based Intrusion Detection System (IDS-DNN) and Convolutional Neural Networks and Long Short Term Memory (CNN-LSTM) based detection model, the detection accuracy of imTN-KNN is improved by 2.76% and 4.68% on Sub_DS3, and the computational efficiency is improved by 69.56% and 74.31%.

Reference | Related Articles | Metrics

Select

Differential privacy clustering algorithm in horizontal federated learning

Xueran XU, Geng YANG, Yuxian HUANG

Journal of Computer Applications 2024, 44 (1): 217-222. DOI: 10.11772/j.issn.1001-9081.2023010019

Abstract （557）

HTML （16）

PDF （1418KB）（298）

Save

Clustering analysis can uncover hidden interconnections between data and segment the data according to multiple indicators， which can facilitate personalized and refined operations. However， data fragmentation and isolation caused by data islands seriously affects the effectiveness of cluster analysis applications. To solve data island problem and protect data privacy， an Equivalent Local differential privacy Federated K-means （ELFedKmeans） algorithm was proposed. A grid-based initial cluster center selection method and a privacy budget allocation scheme were designed for the horizontal federation learning model. To generate same random noise with lower communication cost， all organizations jointly negotiated random seeds， protecting local data privacy. The ELFedKmeans algorithm was demonstrated satisfying differential privacy protection through theoretical analysis， and it was also compared with Local Differential Privacy distributed K-means （LDPKmeans） algorithm and Hybrid Privacy K-means （HPKmeans） algorithm on different datasets. Experimental results show that all three algorithms increase F-measure and decrease SSE （Sum of Squares due to Error） gradually as privacy budget increases. As a whole， the F-measure values of ELFedKmeans algorithm was 1.794 5% to 57.066 3% and 21.245 2% to 132.048 8% higher than those of LDPKmeans and HPKmeans algorithms respectively； the Log（SSE） values of ELFedKmeans algorithm were 1.204 2% to 12.894 6% and 5.617 5% to 27.575 2% less than those of LDPKmeans and HPKmeans algorithms respectively. With the same privacy budget， ELFedKmeans algorithm outperforms the comparison algorithms in terms of clustering quality and utility metric.

Table and Figures | Reference | Related Articles | Metrics

Select

Energy data access control method based on blockchain

GE Jihong, SHEN Tao

Journal of Computer Applications 2021, 41 (9): 2615-2622. DOI: 10.11772/j.issn.1001-9081.2020111844

Abstract （553）

PDF （1175KB）（522）

Save

In order to solve the problems of energy data tampering, leakage and data ownership disputes in the process of data sharing between enterprises and departments of energy internet, combined with the characteristics of blockchain-traceability and hard to be tampered with, an energy data access control method based on blockchain multi-chain architecture was proposed, which can protect user privacy and realize cross-enterprise and cross-department access control of energy data at the same time. In this method, the combination of supervision chain and multi-data-chain was used to protect the privacy of data and improve the scalability. The method of storing data on the chain and storing original data under the chain alleviated the storage pressure of the blockchain.By using the outsourcing supported multi-authority attribute-based encryption technology, the fine-grained access control of energy data was realized. Experimental simulation results show that in the proposed method, the blockchain network has availability, and outsourcing supported multi-authority attribute-based encryption technology has advantages in functionality and computing cost. Therefore, the proposed method can achieve fine-grained access control of energy data while protecting user privacy.

Reference | Related Articles | Metrics

Select

Audio steganography detection model combing residual network and extreme gradient boosting

CHEN Lang, WANG Rangding, YAN Diqun, LIN Yuzhen

Journal of Computer Applications 2021, 41 (2): 449-455. DOI: 10.11772/j.issn.1001-9081.2020060775

Abstract （549）

PDF （1165KB）（748）

Save

Aiming at the problem that the current audio steganography detection methods have low accuracy in detecting audio steganography based on Syndrome-Trellis Codes (STC), and considering the advantages of Convolutional Neural Network (CNN) in extracting abstract features, a model for audio steganography detection combining Deep Residual Network (DRN) and eXtreme Gradient Boosting (XGBoost) was proposed. Firstly, a fixed-parameter High-Pass Filter (HPF) was used to preprocess the input audio, and features were extracted through three convolutional layers. Truncated Linear Unit (TLU) activation function was applied in the first convolutional layer to make the model adapt to the distribution of steganographic signals with low Signal-To-Noise Ratio (SNR). Then, the abstract features were further extracted by five-stage residual blocks and pooling operations. Finally, the extracted high-dimensional features were classified as inputs of the XGBoost model through fully connected layers and dropout layers. The STC steganography and the Least Significant Bit Matching (LSBM) steganography were detected respectively. Experimental results showed that when the embedding rates were 0.5 bps (bit per sample), 0.2 bps and 0.1 bps respectively, that is to say, the average number of bits modified for per audio sample equaled to 0.5, 0.2 and 0.1 respectively, the proposed model achieved average detection accuracies of 73.27%, 70.16% and 65.18% respectively for the STC steganography with a sub check matrix with height of 7, and the average detection accuracies of 86.58%, 76.08% and 72.82% respectively for the LSBM steganography. Compared with the traditional steganography detection methods based on extracting handcrafted features and deep learning steganography detection methods, the proposed model has the average detection accuracies for the two steganography algorithms both increased by more than 10 percent points.

Reference | Related Articles | Metrics

Select

Blockchain digital signature scheme with improved SM2 signature method

YANG Longhai, WANG Xueyuan, JIANG Hesong

Journal of Computer Applications 2021, 41 (7): 1983-1988. DOI: 10.11772/j.issn.1001-9081.2020081220

Abstract （546）

PDF （1080KB）（592）

Save

In order to improve the storage security and signature efficiency of digital signature keys in the consortium blockchain Practical Byzantine Fault Tolerance (PBFT) algorithm consensus process, considering the actual application environment of the consortium blockchain PBFT consensus algorithm, a trusted third-party proof signature scheme based on key division and Chinese encryption SM2 algorithm was proposed. In this scheme, by a trusted third-party, the key was generated and split, and the sub-split private key was distributed to the consensus nodes. In each consensus, the identity must be proved to the trusted third-party at first, and then the other half of the sub-split private key was obtained by the verification party to perform identity verification. In this signature scheme, the segmentation and preservation of the private key was realized by combining the characteristics of the consortium chain, and the modular inversion process in the traditional SM2 algorithm was eliminated by using consensus feature and hash digest. The theoretical analysis proved that the proposed scheme was resistant to data tampering and signature forgery, while Java Development Kit (JDK1.8) and TIO network framework were used to simulate the signature process in consensus. Experimental results show that compared with the traditional SM2 algorithm, the proposed scheme is more efficient, and the more consensus nodes, the more obvious the efficiency gap. When the node number reaches 30, the efficiency of the scheme is improved by 27.56%, showing that this scheme can satisfy the current application environment of the consortium blockchain PBFT consensus.

Reference | Related Articles | Metrics

Select

Detection method of domains generated by dictionary-based domain generation algorithm

ZHANG Yongbin, CHANG Wenxin, SUN Lianshan, ZHANG Hang

Journal of Computer Applications 2021, 41 (9): 2609-2614. DOI: 10.11772/j.issn.1001-9081.2020111837

Abstract （539）

PDF （893KB）（501）

Save

The composition of domain names generated by the dictionary-based Domain Generation Algorithm (DGA) is very similar to that of benign domain names and it is difficult to effectively detect them with the existing technology. To solve this problem, a detection model was proposed, namely CL (Convolutional Neural Network (CNN) and Long Short-Term Memory (LSTM) network). The model includes three parts:character embedding layer, feature extraction layer and fully connected layer. Firstly, the characters of the input domain name were encoded by the character embedding layer. Then, the features of the domain name were extracted by connecting CNN and LSTM in serial way through the feature extraction layer. The n-grams features of the domain name were extracted by CNN and the extracted result were sent to LSTM to learn the context features between n-grams. Meanwhile, different combinations of CNNs and LSTMs were used to learn the features of n-grams with different lengths. Finally, the dictionary-based DGA domain names were classified and predicted by the fully connected layer according to the extracted features. Experimental results show that when the CNNs select the convolution kernel sizes of 3 and 4, the proposed model achives the best performance. In the four dictionary-based DGA family experiments, the accuracy of the CL model is improved by 2.20% compared with that of the CNN model. And with the increase of the number of sample families, the CL network model has a better stability.

Reference | Related Articles | Metrics

Project Articles