Rohit Girdhar

Cited by

	All	Since 2019
Citations	8340	7984
h-index	26	26
i10-index	28	28

Citations per year

Year	Citations
2017	69
2018	247
2019	434
2020	548
2021	686
2022	1178
2023	2618
2024	2505

Public access (based on funding mandates)

11 articles available
0 articles not available

Co-authors

Ishan Misra	Research Scientist, Facebook AI Research	Verified email at fb.com
Armand Joulin	Google DeepMind	Verified email at google.com
Deva Ramanan	Professor, Robotics Institute, Carnegie Mellon University	Verified email at cs.cmu.edu
Abhinav Gupta	Professor, Robotics Institute, Carnegie Mellon University	Verified email at cs.cmu.edu
Mannat Singh	FAIR, Meta AI	Verified email at fb.com
Lorenzo Torresani	Meta, Fundamental AI Research (FAIR)	Verified email at meta.com
Alexander Schwing	University of Illinois at Urbana-Champaign	Verified email at illinois.edu
Kristen Grauman	Professor of Computer Science, University of Texas at Austin	Verified email at cs.utexas.edu
Andrew Zisserman	University of Oxford	Verified email at robots.ox.ac.uk
Alexander Kirillov	OpenAI	Verified email at openai.com
Carl Doersch	Google DeepMind	Verified email at google.com
João Carreira	Google DeepMind	Verified email at google.com
David Fouhey	New York University	Verified email at nyu.edu
Bowen Cheng	University of Illinois at Urbana-Champaign	Verified email at illinois.edu
Du Tran	Google	Verified email at google.com
Philipp Krähenbühl	UT Austin	Verified email at cs.utexas.edu
Josef Sivic	Czech Technical University, CIIRC, ELLIS Unit Prague	Verified email at cvut.cz
Bryan Russell	Researcher, Adobe	Verified email at adobe.com
Alaaeldin El-Nouby	Research Scientist, Apple	Verified email at apple.com
Zhuang Liu	Research Scientist, FAIR, Meta	Verified email at berkeley.edu

Rohit Girdhar

Research Scientist, Fundamental AI Research (FAIR), Meta

Verified email at fb.com

Computer Vision, Machine Learning


Title	Cited by	Year
Masked-attention mask transformer for universal image segmentation; B Cheng, I Misra, AG Schwing, A Kirillov, R Girdhar; Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022	1573	2022
Video Action Transformer Network; R Girdhar, J Carreira, C Doersch, A Zisserman; Conference on Computer Vision and Pattern Recognition (CVPR), 2019	832	2019
Learning a Predictable and Generative Vector Representation for Objects; R Girdhar, DF Fouhey, M Rodriguez, A Gupta; European Conference on Computer Vision (ECCV), 2016	831	2016
Ego4D: Around the world in 3,000 hours of egocentric video; K Grauman, A Westbury, E Byrne, Z Chavis, A Furnari, R Girdhar, ...; Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022	689	2022
ActionVLAD: Learning spatio-temporal aggregation for action classification; R Girdhar, D Ramanan, A Gupta, J Sivic, B Russell; Conference on Computer Vision and Pattern Recognition (CVPR), 2017	572	2017
ImageBind: One embedding space to bind them all; R Girdhar, A El-Nouby, Z Liu, M Singh, KV Alwala, A Joulin, I Misra; Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023	526	2023
Detecting twenty-thousand classes using image-level supervision; X Zhou, R Girdhar, A Joulin, P Krähenbühl, I Misra; European Conference on Computer Vision, 350-368, 2022	476	2022
An end-to-end transformer model for 3D object detection; I Misra, R Girdhar, A Joulin; Proceedings of the IEEE/CVF international conference on computer vision …, 2021	442	2021
Attentional pooling for action recognition; R Girdhar, D Ramanan; Advances in Neural Information Processing Systems (NeurIPS), 2017	416	2017
Detect-and-Track: Efficient Pose Estimation in Videos; R Girdhar, G Gkioxari, L Torresani, M Paluri, D Tran; Conference on Computer Vision and Pattern Recognition (CVPR), 2018	297	2018
Self-supervised pretraining of 3D features on any point-cloud; Z Zhang, R Girdhar, A Joulin, I Misra; Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021	249	2021
Anticipative Video Transformer; R Girdhar, K Grauman; IEEE/CVF International Conference on Computer Vision (ICCV), 2021	206	2021
Omnivore: A single model for many visual modalities; R Girdhar, M Singh, N Ravi, L Van Der Maaten, A Joulin, I Misra; Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022	189	2022
CATER: A diagnostic dataset for Compositional Actions and TEmporal Reasoning; R Girdhar, D Ramanan; International Conference on Learning Representations (ICLR), 2020	170	2020
Cut and learn for unsupervised object detection and instance segmentation; X Wang, R Girdhar, SX Yu, I Misra; Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023	119	2023
Learning video representations from large language models; Y Zhao, I Misra, P Krähenbühl, R Girdhar; Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023	109	2023
Binge Watching: Scaling Affordance Learning from Sitcoms; X Wang, R Girdhar, A Gupta; Conference on Computer Vision and Pattern Recognition (CVPR), 2017	87	2017
OmniMAE: Single model masked pretraining on images and videos; R Girdhar, A El-Nouby, M Singh, KV Alwala, A Joulin, I Misra; Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023	82*	2023
Emu Video: Factorizing text-to-video generation by explicit image conditioning; R Girdhar, M Singh, A Brown, Q Duval, S Azadi, SS Rambhatla, A Shah, ...; arXiv preprint arXiv:2311.10709, 2023	77*	2023
DistInit: Learning Video Representations without a Single Labeled Video; R Girdhar, D Tran, L Torresani, D Ramanan; International Conference on Computer Vision (ICCV), 2019	73	2019
