Follow
Jing Yu Koh
Title
Cited by
Cited by
Year
Scaling autoregressive models for content-rich text-to-image generation
J Yu, Y Xu, JY Koh, T Luong, G Baid, Z Wang, V Vasudevan, A Ku, Y Yang, ...
TMLR, 2022
8122022
Cross-Modal Contrastive Learning for Text-to-Image Generation
H Zhang*, JY Koh*, J Baldridge, H Lee, Y Yang
CVPR, 2021
3442021
Vector-quantized image modeling with improved vqgan
J Yu, X Li, JY Koh, H Zhang, R Pang, J Qin, A Ku, Y Xu, J Baldridge, Y Wu
ICLR, 2021
2852021
Generating images with multimodal language models
JY Koh, D Fried, R Salakhutdinov
NeurIPS, 2023
1362023
Grounding Language Models to Images for Multimodal Inputs and Outputs
JY Koh, R Salakhutdinov, D Fried
ICML, 2023
1292023
Text-to-image generation grounded by fine-grained user attention
JY Koh, J Baldridge, H Lee, Y Yang
WACV, 2021
682021
Pathdreamer: A World Model for Indoor Navigation
JY Koh, H Lee, Y Yang, J Baldridge, P Anderson
ICCV, 2021
642021
A New Path: Scaling Vision-and-Language Navigation with Synthetic Instructions and Imitation Learning
A Kamath, P Anderson, S Wang, JY Koh, A Ku, A Waters, Y Yang, ...
CVPR, 2022
37*2022
Visualwebarena: Evaluating multimodal agents on realistic visual web tasks
JY Koh, R Lo, L Jang, V Duvvur, MC Lim, PY Huang, G Neubig, S Zhou, ...
ACL, 2024
322024
Revisiting hierarchical approach for persistent long-term video prediction
W Lee, W Jung, H Zhang, T Chen, JY Koh, T Huang, H Yoon, H Lee, ...
ICLR, 2021
282021
Improving Customer Satisfaction in Bike Sharing Systems through Dynamic Repositioning
S Ghosh*, JY Koh*, P Jaillet
IJCAI, 2019
272019
Simple and Effective Synthesis of Indoor 3D Scenes
JY Koh*, H Agrawal*, D Batra, R Tucker, A Waters, H Lee, Y Yang, ...
AAAI, 2022
232022
Vq3d: Learning a 3d-aware generative model on imagenet
K Sargent, JY Koh, H Zhang, H Chang, C Herrmann, P Srinivasan, J Wu, ...
ICCV, 2023
212023
Urban Zoning Using Higher-Order Markov Random Fields on Multi-View Imagery Data
T Feng, QT Truong, T Nguyen, JY Koh, LF Yu, SK Yeung, A Binder
ECCV, 2018
202018
Twitter-informed crowd flow prediction
G Goh, JY Koh, Y Zhang
ICDM Workshops, 2018
172018
SideInfNet: A Deep Neural Network for Semi-Automatic Semantic Segmentation with Side Information
JY Koh, DT Nguyen, QT Truong, SK Yeung, A Binder
ECCV, 2020
62020
Multimodal graph learning for generative tasks
M Yoon, JY Koh, B Hooi, R Salakhutdinov
arXiv preprint arXiv:2310.07478, 2023
42023
Systems And Methods For Generating Predicted Visual Observations Of An Environment Using Machine Learned Models
JY Koh, H Lee, Y Yang, JM Baldridge, PJ Anderson
US Patent App. 17/409,249, 2023
42023
Object boundary detection and classification with image-level labels
JY Koh, W Samek, KR Müller, A Binder
GCPR, 2017
42017
Adversarial Attacks on Multimodal Agents
CH Wu, JY Koh, R Salakhutdinov, D Fried, A Raghunathan
arXiv preprint arXiv:2406.12814, 2024
22024
The system can't perform the operation now. Try again later.
Articles 1–20