Follow
Bo Liu (Benjamin Liu)
Title
Cited by
Cited by
Year
DeepSeek-LLM: Scaling open-source language models with longtermism
X Bi, D Chen, G Chen, S Chen, D Dai, C Deng, H Ding, K Dong, Q Du, ...
arXiv preprint arXiv:2401.02954, 2024
1342024
DeepSeek-VL: Towards real-world vision-language understanding
H Lu, W Liu, B Zhang, B Wang, K Dong, B Liu, J Sun, T Ren, Z Li, Y Sun, ...
arXiv preprint arXiv:2403.05525, 2024
862024
Learning correlated communication topology in multi-agent reinforcement learning
Y Du, B Liu, V Moens, Z Liu, Z Ren, J Wang, X Chen, H Zhang
Twentieth International Conference on Autonomous Agents and MultiAgent …, 2021
682021
Envpool: A highly parallel reinforcement learning environment execution engine
J Weng, M Lin, S Huang, B Liu, D Makoviichuk, V Makoviychuk, Z Liu, ...
Thirty-sixth Conference on Neural Information Processing Systems (NeurIPS 2022), 2022
532022
Neural auto-curricula in two-player zero-sum games
X Feng, O Slumbers, Z Wan, B Liu, S McAleer, Y Wen, J Wang, Y Yang
Thirty-fifth Conference on Neural Information Processing Systems (NeurIPS 2021), 2021
48*2021
DeepSeek-V2: A strong, economical, and efficient mixture-of-experts language model
A Liu, B Feng, B Wang, B Wang, B Liu, C Zhao, C Dengr, C Ruan, D Dai, ...
arXiv preprint arXiv:2405.04434, 2024
17*2024
A theoretical understanding of gradient bias in meta-reinforcement learning
B Liu*, X Feng*, J Ren, L Mai, R Zhu, H Zhang, J Wang, Y Yang
Thirty-sixth Conference on Neural Information Processing Systems (NeurIPS 2022), 2022
13*2022
DeepSeek-Prover-V1. 5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search
H Xin, ZZ Ren, J Song, Z Shao, W Zhao, H Wang, B Liu, L Zhang, X Lu, ...
arXiv preprint arXiv:2408.08152, 2024
11*2024
DeepSeek-Prover: Advancing Theorem Proving in LLMs through Large-Scale Synthetic Data
H Xin, D Guo, Z Shao, Z Ren, Q Zhu, B Liu, C Ruan, W Li, X Liang
arXiv preprint arXiv:2405.14333, 2024
112024
Grasp multiple objects with one hand
Y Li, B Liu, Y Geng, P Li, Y Yang, Y Zhu, T Liu, S Huang
IEEE Robotics and Automation Letters (RA-L), 2024
102024
Torchopt: An efficient library for differentiable optimization
J Ren*, X Feng*, B Liu*, X Pan*, Y Fu, L Mai, Y Yang
Journal of Machine Learning Research (JMLR), 2023
102023
The system can't perform the operation now. Try again later.
Articles 1–11