Journal of Computer Applications ›› 2022, Vol. 42 ›› Issue (6): 1683-1688.DOI: 10.11772/j.issn.1001-9081.2021081417

• National Open Distributed and Parallel Computing Conference 2021 (DPCS 2021) • Previous Articles    

Multimodal sequential recommendation algorithm based on contrastive learning

Tengyue HAN1, Shaozhang NIU1(), Wen ZHANG2   

  1. 1.School of Computer Science,Beijing University of Posts and Telecommunications,Beijing 100876,China
    2.Southeast Digital Economic Development Institute,Quzhou Zhejiang 324000,China
  • Received:2021-08-06 Revised:2021-10-15 Accepted:2021-10-29 Online:2022-06-22 Published:2022-06-10
  • Contact: Shaozhang NIU
  • About author:HAN Tengyue, born in 1990, Ph. D. candidate. Her research interests include recommendation algorithm, data mining.
    ZHANG Wen, born in 1981, Ph. D. His research interests include intelligent big data analysis, mobile Internet security.
  • Supported by:
    National Natural Science Foundation of China(U1536121)

基于对比学习的多模态序列推荐算法

韩滕跃1, 牛少彰1(), 张文2   

  1. 1.北京邮电大学 计算机学院,北京 100876
    2.东南数字经济发展研究院,浙江 衢州 324000
  • 通讯作者: 牛少彰
  • 作者简介:韩滕跃(1990—),女,河北衡水人,博士研究生,主要研究方向:推荐算法、数据挖掘
    张文(1981—),四川内江人,博士,主要研究方向:大数据智能分析、移动互联网安全。
  • 基金资助:
    国家自然科学基金资助项目(U1536121)

Abstract:

A multimodal sequential recommendation algorithm based on contrastive learning technology was proposed to improve the accuracy of sequential recommendation algorithm by using multimodal information of commodities. Firstly, to obtain the visual representations such as the color and shape of the product, the visual modal information of the product was extracted by utilizing the contrastive learning framework, where the data enhancement was performed by changing the color and intercepting the center area of the product. Secondly, the textual information of each commodity was embedded into a low-dimensional space, so that the complete multimodal representation of each commodity could be obtained. Finally, a Recurrent Neural Network (RNN) was used for modeling the sequential interactions of multimodal information according to the time sequence of the product, then the preference representation of user was obtained and used for commodity recommendation. The proposed algorithm was tested on two public datasets and compared with the existing sequential recommendation algorithm LESSR. Experimental results prove that the ranking performance of the proposed algorithm is improved, and the recommendation performance remains basically unchanged after the feature dimension value reaches 50.

Key words: contrastive learning, multimodal, neural network, sequential recommendation, feature interaction

摘要:

针对如何利用商品的多模态信息提高序列推荐算法准确性的问题,提出一种基于对比学习技术的多模态序列推荐算法。该算法首先通过改变商品颜色和截取商品图片中心区域等手段进行数据增强,并把增强后的数据与原数据进行对比学习,以提取到商品的颜色和形状等视觉模态信息;其次对商品的文本模态信息进行低维空间嵌入,从而得到商品多模态信息的完整表达;最后根据商品的时序性,采用循环神经网络(RNN)建模多模态信息的序列交互特征,得到用户的偏好表达,从而进行商品推荐。在两个公开的数据集上进行实验测试的结果表明,与现有的序列推荐算法LESSR相比,所提算法排序性能有所提升,且该算法在特征维度值到达50后,推荐性能基本保持不变。

关键词: 对比学习, 多模态, 神经网络, 序列推荐, 特征交互

CLC Number: