考虑多粒度反馈的多轮对话强化学习推荐算法
姚华勇, 叶东毅, 陈昭炯
Multi-round conversational reinforcement learning recommendation algorithm via multi-granularity feedback
YAO Huayong, YE Dongyi, CHEN Zhaojiong
《计算机应用》唯一官方网站 . 2023, (1): 15 -21 .  DOI: 10.11772/j.issn.1001-9081.2021111875