TREA: Tree-structure reasoning schema for conversational recommendation W Li, W Wei, X Qu, XL Mao, Y Yuan, W Xie, D Chen arXiv preprint arXiv:2307.10543, 2023 | 6 | 2023 |
Towards hierarchical policy learning for conversational recommendation with hypergraph-based reinforcement learning S Zhao, W Wei, Y Liu, Z Wang, W Li, XL Mao, S Zhu, M Yang, Z Wen arXiv preprint arXiv:2305.02575, 2023 | 4 | 2023 |
Reinforcement Learning with Token-level Feedback for Controllable Text Generation W Li, W Wei, K Xu, W Xie, D Chen, Y Cheng arXiv preprint arXiv:2403.11558, 2024 | 1 | 2024 |
Position Debiasing Fine-Tuning for Causal Perception in Long-Term Dialogue S Fan, W Wei, W Li, XL Mao, W Xie, D Chen arXiv preprint arXiv:2406.02002, 2024 | | 2024 |