Interpretation method based on selective Softmax gradient for layer-wise relevance propagation

doi:10.11772/j.issn.1001-9081.2025070906

Journal of Computer Applications

Received: 2025-08-11 Revised: 2025-11-06 Online: 2025-12-22 Published: 2025-12-22
Contact: Elvis Chen

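Chong CHEN, Hongyang ZOU, Jing CAO, Wenjing JIANG, Jie CHEN, Fumin GAO

China University of Petroleum (Beijing)

Corresponding author: Chong CHEN
Supported by:
Full life-cycle damage identification and health management of non-metallic flexible pipes for deep-sea mining; Research on the safety risk assessment technology system for deepwater dry-tree oil and gas production and processing platforms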

Abstract

In recent years, neural networks have been widely applied in fields such as healthcare, communication, and security, driving the scaling and industrial application of deep learning. However, the inherent "black-box" nature and opaque learning process of neural network models limit understanding of their internal logic and behavior, making it difficult to fully trust model results. To address these issues, the interpretability of image classification models such as VGG16 and ResNet50 was studied on the basis of Layer-wise Relevance Propagation (LRP), and an interpretation method named Selective Softmax Gradient for Layer-wise Relevance Propagation (SSGLRP) was proposed. By introducing the activation values of the positive gradients of output neurons and modifying the initial relevance values of non-target classes, the proposed method addresses two problems of LRP heatmaps: they contain noise and they lack class discrimination. In particular, adjusting the initial relevance values of non-target classes allows SSGLRP to remove non-target class objects from the heatmap and produce class-discriminative heatmaps. With classic LRP, Selective LRP (SLRP), and Softmax Gradient LRP (SGLRP) as baselines, the method was evaluated quantitatively through maximum patch masking and pointing game experiments. In the maximum patch masking experiment, the average change in model prediction caused by perturbing input pixels according to SSGLRP was 77.0%, 62.6%, and 33.5% higher than that obtained with LRP, SLRP, and SGLRP, respectively. The experimental results demonstrate that SSGLRP has stronger class discrimination and less noise, and that it performs better in interpreting the VGG16 and ResNet50 models.
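
The abstract does not give the SSGLRP formulas, so the following is only a minimal sketch of the idea it builds on: starting relevance propagation from the gradient of the target softmax output with respect to the logits, which is how the SGLRP baseline is commonly described. The function names `softmax` and `softmax_gradient_relevance` are hypothetical, and the selective positive-gradient step and the non-target correction specific to SSGLRP are not reproduced here.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def softmax_gradient_relevance(logits, target):
    """Initial output-layer relevance set to d y_target / d z_n.

    Assumption (not taken from the abstract): SGLRP-style initialization.
      target class t:      y_t * (1 - y_t)   (positive)
      non-target class n: -y_t * y_n         (negative)
    """
    y = softmax(logits)
    y_t = y[target]
    return [
        y_t * (1.0 - y_t) if n == target else -y_t * y_n
        for n, y_n in enumerate(y)
    ]


# Illustrative logits for a three-class model; class 0 is the target.
logits = [2.0, 1.0, 0.1]
relevance = softmax_gradient_relevance(logits, target=0)
print(relevance)
```

In this sketch the target class receives positive initial relevance and every other class receives negative relevance, with the values summing to zero. That sign structure is what lets evidence for competing classes be subtracted from the heatmap; how SSGLRP modifies the non-target values beyond this is described only in the full paper.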

Keywords: neural network, interpretation method, class discrimination, Layer-wise Relevance Propagation (LRP), image classification

CLC Number: TP181

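Chong CHEN, Hongyang ZOU, Jing CAO, Wenjing JIANG, Jie CHEN, Fumin GAO. Interpretation method based on selective Softmax gradient for layer-wise relevance propagation [J]. Journal of Computer Applications, DOI: 10.11772/j.issn.1001-9081.2025070906.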
