Follow
Zhixi Cai
Zhixi Cai
PhD Student at Monash University
Verified email at monash.edu
Title
Cited by
Cited by
Year
MARLIN: Masked Autoencoder for Facial Video Representation LearnINg
Z Cai, S Ghosh, K Stefanov, A Dhall, J Cai, H Rezatofighi, R Haffari, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
282023
Do you really mean that? Content driven audio-visual deepfake dataset and multimodal method for temporal forgery localization
Z Cai, K Stefanov, A Dhall, M Hayat
2022 International Conference on Digital Image Computing: Techniques and …, 2022
222022
Glitch in the matrix: A large scale benchmark for content driven audio–visual forgery detection and localization
Z Cai, S Ghosh, A Dhall, T Gedeon, K Stefanov, M Hayat
Computer Vision and Image Understanding 236, 103818, 2023
42023
AV-Deepfake1M: A Large-Scale LLM-Driven Audio-Visual Deepfake Dataset
Z Cai, S Ghosh, AP Adatia, M Hayat, A Dhall, K Stefanov
arXiv preprint arXiv:2311.15308, 2023
12023
Emolysis: A Multimodal Open-Source Group Emotion Analysis and Visualization Toolkit
S Ghosh, Z Cai, P Gupta, G Sharma, A Dhall, M Hayat, T Gedeon
arXiv preprint arXiv:2305.05255, 2023
12023
JRDB-Social: A Multifaceted Robotic Dataset for Understanding of Context and Dynamics of Human Interactions Within Social Groups
S Jahangard, Z Cai, S Wen, H Rezatofighi
arXiv preprint arXiv:2404.04458, 2024
2024
HYDRA: A Hyper Agent for Dynamic Compositional Visual Reasoning
F Ke, Z Cai, S Jahangard, W Wang, PD Haghighi, H Rezatofighi
arXiv preprint arXiv:2403.12884, 2024
2024
Pavlok-Nudge: A Feedback Mechanism for Atomic Behaviour Modification with Snoring Usecase
S Ghosh, R Hasan, P Agrawal, Z Cai, S Soon, A Dhall, T Gedeon
arXiv preprint arXiv:2305.06110, 2023
2023
The system can't perform the operation now. Try again later.
Articles 1–8