Journal of Computer Applications ›› 2023, Vol. 43 ›› Issue (3): 685-691. DOI: 10.11772/j.issn.1001-9081.2022010032

Special Issue: Artificial Intelligence

• Artificial intelligence

Improved method of convolutional neural network based on matrix decomposition

Zhenliang LI¹, Bo LI²

  1. Faculty of Electronics and Information Engineering, Xi’an Jiaotong University, Xi’an Shaanxi 710049, China
  2. Computer Teaching & Experiment Center, Xi’an Jiaotong University, Xi’an Shaanxi 710049, China
  • Received: 2022-01-11 Revised: 2022-03-13 Accepted: 2022-03-22 Online: 2022-04-11 Published: 2023-03-10
  • Contact: Bo LI
  • About author: LI Zhenliang (李振亮), born in 1997, a native of Xuchang, Henan, M. S. candidate. His research interests include deep learning and object detection.
    LI Bo (李波), born in 1968, professor. His research interests include computer simulation and artificial intelligence.


To address the difficulty of optimizing a traditional Convolutional Neural Network (CNN) during training, an improved CNN method based on matrix decomposition was proposed. Firstly, during training, the convolution kernel parameter tensor of the model's convolutional layers was converted through matrix decomposition into the product of multiple parameter matrices, which overparameterized the model. Secondly, these additional linear parameters took part in the back propagation of the network and were updated synchronously with the other model parameters to improve the optimization process of gradient descent. After training, the matrix product was restored to the standard convolution kernel parameters, so that the computational complexity of forward propagation during inference was the same as before the improvement. With thin QR decomposition and reduced Singular Value Decomposition (SVD) applied, classification experiments were carried out on the CIFAR-10 (Canadian Institute For Advanced Research, 10 classes) dataset, and further generalization experiments were carried out with different image classification datasets and different initialization methods. Experimental results show that the classification accuracies of 7 Visual Geometry Group (VGG) and Residual Network (ResNet) models of different depths trained with matrix decomposition are higher than those of the original CNN models. This indicates that the matrix decomposition method enables a CNN to achieve higher classification accuracy and eventually converge to a better local optimum.
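
The decompose-then-merge idea in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function names `factorize_kernel` and `merge_factors`, the example kernel shape, and the choice to flatten the kernel into an `(out_ch, in_ch*kh*kw)` matrix are assumptions made for illustration, and the training step in which the factors would be updated by gradient descent is omitted.

```python
import numpy as np


def factorize_kernel(kernel, method="qr"):
    """Split a conv kernel of shape (out_ch, in_ch, kh, kw) into matrix factors."""
    out_ch = kernel.shape[0]
    # Assumed flattening: one row per output channel.
    mat = kernel.reshape(out_ch, -1)
    if method == "qr":
        q, r = np.linalg.qr(mat, mode="reduced")  # thin QR: mat = q @ r
        return [q, r]
    if method == "svd":
        u, s, vh = np.linalg.svd(mat, full_matrices=False)  # reduced SVD
        return [u, np.diag(s), vh]  # mat = u @ diag(s) @ vh
    raise ValueError(f"unknown method: {method}")


def merge_factors(factors, kernel_shape):
    """Multiply the factors back into a single standard conv kernel."""
    mat = factors[0]
    for factor in factors[1:]:
        mat = mat @ factor
    return mat.reshape(kernel_shape)


rng = np.random.default_rng(0)
kernel = rng.standard_normal((16, 8, 3, 3))  # hypothetical layer: 8 -> 16 channels, 3x3

qr_factors = factorize_kernel(kernel, "qr")
svd_factors = factorize_kernel(kernel, "svd")

# Before any training step, merging the factors recovers the original kernel.
qr_restored = merge_factors(qr_factors, kernel.shape)
svd_restored = merge_factors(svd_factors, kernel.shape)

# The factored form holds more trainable values than the single kernel.
n_original = kernel.size
n_qr = sum(f.size for f in qr_factors)
n_svd = sum(f.size for f in svd_factors)
```

In this sketch the factors together hold more values than the original kernel (`n_qr` and `n_svd` exceed `n_original`), which is the overparameterization the abstract refers to, while `merge_factors` returns a tensor of the original shape, so an inference-time convolution would cost the same as before.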

Key words: Convolutional Neural Network (CNN), matrix decomposition, Singular Value Decomposition (SVD), overparameterization, image classification



