AShapeFormer: Semantics-guided object-level active shape encoding for 3D object detection via transformers | Z Li, H Yu, Z Yang, T Chen, N Akhtar | Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 3 | 2023 |
OST: Refining Text Knowledge with Optimal Spatio-Temporal Descriptor for General Video Recognition | T Chen, H Yu, Z Yang, Z Li, W Sun, C Chen | arXiv preprint arXiv:2312.00096, 2023 | 1 | 2023 |
First Place Solution to the CVPR'2023 AQTC Challenge: A Function-Interaction Centric Approach with Spatiotemporal Visual-Language Alignment | TT Chen, H Yu, Z Yang, M Li, Z Li, J Wang, W Miao, W Sun, C Chen | arXiv preprint arXiv:2306.13380, 2023 | 1 | 2023 |
EFRNet-VL: An end-to-end feature refinement network for monocular visual localization in dynamic environments | J Wang, H Yu, X Lin, Z Li, W Sun, N Akhtar | Expert Systems with Applications 243, 122755, 2024 | | 2024 |