关注
ke xu
ke xu
Tencent
没有经过验证的电子邮件地址
标题
引用次数
引用次数
年份
Relation-aware transformer for portfolio policy learning
K Xu, Y Zhang, D Ye, P Zhao, M Tan
Proceedings of the twenty-ninth international conference on international …, 2021
442021
Value penalized q-learning for recommender systems
C Gao, K Xu, K Zhou, L Li, X Wang, B Yuan, P Zhao
Proceedings of the 45th International ACM SIGIR Conference on Research and …, 2022
162022
Multi-scale attention flow for probabilistic time series forecasting
S Feng, C Miao, K Xu, J Wu, P Wu, Y Zhang, P Zhao
IEEE Transactions on Knowledge and Data Engineering, 2023
122023
Deploying Offline Reinforcement Learning with Human Feedback
Z Li, K Xu, L Liu, L Li, D Ye, P Zhao
arXiv preprint arXiv:2303.07046, 2023
2023
Robust Offline Reinforcement Learning with Gradient Penalty and Constraint Relaxation
C Gao, K Xu, L Liu, D Ye, P Zhao, Z Xu
arXiv preprint arXiv:2210.10469, 2022
2022
Taming Policy Constrained Offline Reinforcement Learning for Non-expert Demonstrations
C Gao, K Xu, L Liu, D Ye, P Zhao
2022
Quantized Adaptive Subgradient Algorithms and Their Applications
K Xu, J Wangni, Y Zhang, D Ye, J Wu, P Zhao
arXiv preprint arXiv:2208.05631, 2022
2022
系统目前无法执行此操作,请稍后再试。
文章 1–7