Relation-aware transformer for portfolio policy learning K Xu, Y Zhang, D Ye, P Zhao, M Tan Proceedings of the twenty-ninth international conference on international …, 2021 | 44 | 2021 |
Value penalized q-learning for recommender systems C Gao, K Xu, K Zhou, L Li, X Wang, B Yuan, P Zhao Proceedings of the 45th International ACM SIGIR Conference on Research and …, 2022 | 16 | 2022 |
Multi-scale attention flow for probabilistic time series forecasting S Feng, C Miao, K Xu, J Wu, P Wu, Y Zhang, P Zhao IEEE Transactions on Knowledge and Data Engineering, 2023 | 12 | 2023 |
Deploying Offline Reinforcement Learning with Human Feedback Z Li, K Xu, L Liu, L Li, D Ye, P Zhao arXiv preprint arXiv:2303.07046, 2023 | | 2023 |
Robust Offline Reinforcement Learning with Gradient Penalty and Constraint Relaxation C Gao, K Xu, L Liu, D Ye, P Zhao, Z Xu arXiv preprint arXiv:2210.10469, 2022 | | 2022 |
Taming Policy Constrained Offline Reinforcement Learning for Non-expert Demonstrations C Gao, K Xu, L Liu, D Ye, P Zhao | | 2022 |
Quantized Adaptive Subgradient Algorithms and Their Applications K Xu, J Wangni, Y Zhang, D Ye, J Wu, P Zhao arXiv preprint arXiv:2208.05631, 2022 | | 2022 |