Title | Authors | Venue | Cited by | Year
COIN: A large-scale dataset for comprehensive instructional video analysis | Y Tang, D Ding, Y Rao, Y Zheng, D Zhang, L Zhao, J Lu, J Zhou | Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019 | 266 | 2019
Uncertainty-aware score distribution learning for action quality assessment | Y Tang, Z Ni, J Zhou, D Zhang, J Lu, Y Wu, J Zhou | Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2020 | 112 | 2020
WebSRC: A dataset for web-based structural reading comprehension | X Chen, Z Zhao, L Chen, D Zhang, J Ji, A Luo, Y Xiong, K Yu | arXiv preprint arXiv:2101.09465, 2021 | 47 | 2021
Rotation-robust intersection over union for 3D object detection | Y Zheng, D Zhang, S Xie, J Lu, J Zhou | European Conference on Computer Vision, 464-480, 2020 | 35 | 2020
Large Language Models Are Semi-Parametric Reinforcement Learning Agents | D Zhang, L Chen, S Zhang, H Xu, Z Zhao, K Yu | Advances in Neural Information Processing Systems 36, 2024 | 17 | 2024
Learning from temporal spatial cubism for cross-dataset skeleton-based action recognition | Y Tang, X Liu, X Yu, D Zhang, J Lu, J Zhou | ACM Transactions on Multimedia Computing, Communications, and Applications …, 2022 | 10 | 2022
Mobile-Env: A Universal Platform for Training and Evaluation of Mobile Interaction | D Zhang, L Chen, K Yu | arXiv preprint arXiv:2305.08144, 2023 | 3 | 2023
OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments | T Xie, D Zhang, J Chen, X Li, S Zhao, R Cao, TJ Hua, Z Cheng, D Shin, ... | arXiv preprint arXiv:2404.07972, 2024 | 1 | 2024