Xiong Jun Wu 熊君武
AntGroup
Verified email at antgroup.com
Title
Cited by
Year
COCA: Constructing optimal clustering architecture to maximize sensor network lifetime
H Li, Y Liu, W Chen, W Jia, B Li, J Xiong
Computer Communications 36 (3), 256-268, 2013
Cited by: 128; Year: 2013
Value propagation for decentralized networked deep multi-agent reinforcement learning
C Qu, S Mannor, H Xu, Y Qi, L Song, J Xiong
Advances in Neural Information Processing Systems 32, 2019
Cited by: 52; Year: 2019
Cost-effective incentive allocation via structured counterfactual inference
R Lopez, C Li, X Yan, J Xiong, M Jordan, Y Qi, L Song
Proceedings of the AAAI Conference on Artificial Intelligence 34 (04), 4997-5004, 2020
Cited by: 23; Year: 2020
Intention propagation for multi-agent reinforcement learning
C Qu, H Li, C Liu, J Xiong, W Chu, W Wang, Y Qi, L Song
Cited by: 15; Year: 2020
Constructing optimal clustering architecture for maximizing lifetime in large scale wireless sensor networks
H Li, J Cao, J Xiong
2009 15th International Conference on Parallel and Distributed Systems, 182-189, 2009
Cited by: 15; Year: 2009
Recommendation method and device
J Xiong, Z Liu, H Wei
US Patent 10,489,471, 2019
Cited by: 12; Year: 2019
Risk taxonomy, mitigation, and assessment benchmarks of large language model systems
T Cui, Y Wang, C Fu, Y Xiao, S Li, X Deng, Y Liu, Q Zhang, Z Qiu, P Li, ...
arXiv preprint arXiv:2401.05778, 2024
Cited by: 11; Year: 2024
Latent dirichlet allocation for internet price war
C Li, X Yan, X Deng, Y Qi, W Chu, L Song, J Qiao, J He, J Xiong
Proceedings of the AAAI Conference on Artificial Intelligence 33 (01), 639-646, 2019
Cited by: 8; Year: 2019
Reinforcement learning for uplift modeling
C Li, X Yan, X Deng, Y Qi, W Chu, L Song, J Qiao, J He, J Xiong
arXiv preprint arXiv:1811.10158, 2018
Cited by: 8; Year: 2018
Unit ball model for embedding hierarchical structures in the complex hyperbolic space
H Xiao, C Jiang, Y Song, J Zhang, J Xiong
arXiv preprint arXiv:2105.03966, 2021
Cited by: 1; Year: 2021
Variational policy propagation for multi-agent reinforcement learning
C Qu, H Li, C Liu, J Xiong, J Zhang, W Chu, W Wang, Y Qi, L Song
arXiv preprint arXiv:2004.08883, 2020
Cited by: 1; Year: 2020
Model-Based Off-Policy Deep Reinforcement Learning With Model-Embedding
X Tan, C Qu, J Xiong, J Zhang, X Qiu, Y Jin
IEEE Transactions on Emerging Topics in Computational Intelligence, 2024
2024
Digital Human Interactive Recommendation Decision-Making Based on Reinforcement Learning
X Junwu, X Feng, YZ Shi, J Zhang, Z Zhao, W Zhou
arXiv preprint arXiv:2210.10638, 2022
2022
Model Embedding Model-Based Reinforcement Learning
X Tan, C Qu, J Xiong, J Zhang
arXiv preprint arXiv:2006.09234, 2020
2020
S2VG: Soft Stochastic Value Gradient method
X Tan, C Qu, J Xiong, J Zhang
2019
A Policy Gradient Method with Variance Reduction for Uplift Modeling
C Li, X Yan, X Deng, Y Qi, W Chu, L Song, J Qiao, J He, J Xiong
arXiv preprint arXiv:1811.10158, 2018
2018
Personalized Behavior Prediction with Encoder-to-Decoder Structure
T Yin, X Deng, Y Qi, W Chu, J Pan, X Yan, J Xiong
2018 IEEE International Conference on Networking, Architecture and Storage …, 2018
2018