Fine-grained cross-modal molecular retrieval method based on reinforcement learning

doi:10.11772/j.issn.1001-9081.2026010020

Journal of Computer Applications

Received:2026-01-15 Revised:2026-03-19 Online:2026-05-13 Published:2026-05-13

基于强化学习的细粒度跨模态分子检索方法

周栋¹,刘婷¹,王晨旭¹,林江豪¹,周咏梅²

1. 广东外语外贸大学
2. 广东外语外贸大学思科信息学院，广州510006

通讯作者: 周栋

Abstract

Abstract: To address the over-reliance on global features and insufficient task adaptability of molecular representations in cross-modal molecular retrieval, a reinforcement learning-based fine-grained cross-modal molecular retrieval method was proposed. For the text end, the pre-trained SciBERT model was used to extract token-level and sentence-level representations. For the molecular end, Graph Convolutional Network (GCN) was adopted to obtain group-level and molecule-level representations. The method introduced the Proximal Policy Optimization (PPO) algorithm in reinforcement learning to achieve accurate alignment between tokens and groups, and dynamically generates token representations matching the semantics of groups. Progressive strategy was applied in overall training, and retrieval learning was conducted by maximizing the similarity of matched text-molecule pairs through the contrastive loss function. Experimental results on the ChEBI-20 dataset show that this method outperforms current mainstream methods in metrics such as mean reciprocal rank and Hits@1, and providing a new solution for cross-modal molecular retrieval tasks.

Key words: Proximal Policy Optimization &, #40

摘要： 针对跨模态分子检索中过度依赖全局特征和分子表征缺乏任务自适应能力的问题，提出一种基于强化学习的细粒度跨模态分子检索方法。该方法文本端采用预训练SciBERT模型提取Token级和句子级表征，分子端采用图卷积网络(GCN)获取基团级和分子级表征。方法引入强化学习中的近端策略优化(PPO)算法实现Token与基团的精准对齐，动态生成与基团语义相匹配的Token表征。整体训练采用渐进式策略，通过对比损失函数最大化匹配文本-分子对的相似度进行检索学习。在数据集ChEBI-20上的实验结果表明，该方法在平均互逆排名和Hits@1等指标均优于目前的主流方法，为跨模态分子检索任务提供了新的解决方案。

CLC Number:

TP391.3

周栋刘婷王晨旭林江豪周咏梅. 基于强化学习的细粒度跨模态分子检索方法[J]. 《计算机应用》唯一官方网站, DOI: 10.11772/j.issn.1001-9081.2026010020.

[1]	SHANG Yimeng, CHI Zenglin, ZHANG Hongming, LIU Bin, HU Guoqiang, NIU Dangdang. Agricultural user identification recognition method integrating image-text and large language model [J]. Journal of Computer Applications, 0, (): 0-0.
[2]	Yancui SHI, Haozhe QIN. Recommendation method integrating user behaviors and improved long-tail algorithm [J]. Journal of Computer Applications, 2026, 46(1): 95-103.
[3]	Xingyao YANG, Zheng QI, Jiong YU, Zulian ZHANG, Shuai MA, Hongtao SHEN. Session-based recommendation model based on time-aware and space-enhanced dual channel graph neural network [J]. Journal of Computer Applications, 2026, 46(1): 104-112.
[4]	Xinran XIE, Zhe CUI, Rui CHEN, Tailai PENG, Dekun LIN. Zero-shot re-ranking method by large language model with hierarchical filtering and label semantic extension [J]. Journal of Computer Applications, 2026, 46(1): 60-68.
[5]	. Graph recommendation model based on dynamic embedding enhancement and hybrid sampling fusion [J]. Journal of Computer Applications, 0, (): 0-0.
[6]	. Text-ID sequential recommendation model via multi-strategy contrastive learning and adaptive label smoothing [J]. Journal of Computer Applications, 0, (): 0-0.
[7]	Ke GAN, Xiaofei ZHU, Jiawei CHENG. Recommendation method based on multi-perspective relation-enhanced knowledge graph [J]. Journal of Computer Applications, 2025, 45(11): 3519-3528.
[8]	. User-oriented multi-behavior reinforcement learning model for learning path recommendation [J]. Journal of Computer Applications, 0, (): 0-0.
[9]	Yi WANG, Yinglong MA. Multi-task social item recommendation method based on dynamic adaptive generation of item graph [J]. Journal of Computer Applications, 2025, 45(8): 2592-2599.
[10]	Yimeng XI, Zhen DENG, Qian LIU, Libo LIU. Cross-modal information fusion for video-text retrieval [J]. Journal of Computer Applications, 2025, 45(8): 2448-2456.
[11]	Panpan GUO, Gang ZHOU, Jicang LU, Zhufeng LI, Taojie ZHU. Paper recommendation method with mixed information enhancement [J]. Journal of Computer Applications, 2025, 45(6): 1879-1887.
[12]	Zonghang WU, Dong ZHANG, Guanyu LI. Multimodal fusion recommendation algorithm based on joint self-supervised learning [J]. Journal of Computer Applications, 2025, 45(6): 1858-1868.
[13]	Sijie NIU, Yuliang LIU. Auxiliary diagnostic method for retinopathy based on dual-branch structure with knowledge distillation [J]. Journal of Computer Applications, 2025, 45(5): 1410-1414.
[14]	Cong WANG, Yancui SHI. Group recommendation model by graph neural network based on multi-perspective learning [J]. Journal of Computer Applications, 2025, 45(4): 1205-1212.
[15]	Renjie TIAN, Mingli JING, Long JIAO, Fei WANG. Recommendation algorithm of graph contrastive learning based on hybrid negative sampling [J]. Journal of Computer Applications, 2025, 45(4): 1053-1060.