Follow
Karl Cobbe
Karl Cobbe
Research Scientist, OpenAI
Verified email at openai.com
Title
Cited by
Cited by
Year
Training verifiers to solve math word problems
K Cobbe, V Kosaraju, M Bavarian, M Chen, H Jun, L Kaiser, M Plappert, ...
arXiv preprint arXiv:2110.14168, 2021
10132021
Quantifying generalization in reinforcement learning
K Cobbe, O Klimov, C Hesse, T Kim, J Schulman
International conference on machine learning, 1282-1289, 2019
6532019
Webgpt: Browser-assisted question-answering with human feedback
R Nakano, J Hilton, S Balaji, J Wu, L Ouyang, C Kim, C Hesse, S Jain, ...
arXiv preprint arXiv:2112.09332, 2021
6512021
Leveraging procedural generation to benchmark reinforcement learning
K Cobbe, C Hesse, J Hilton, J Schulman
International conference on machine learning, 2048-2056, 2020
4922020
System and method for activity management presentation
Y Shoham, JE Bank, K Cobbe, A Matta, M Rubin, ZI Weiner, KT Toft
US Patent App. 14/076,046, 2015
2122015
Let's Verify Step by Step
H Lightman, V Kosaraju, Y Burda, H Edwards, B Baker, T Lee, J Leike, ...
arXiv preprint arXiv:2305.20050, 2023
1782023
Phasic policy gradient
KW Cobbe, J Hilton, O Klimov, J Schulman
International Conference on Machine Learning, 2020-2027, 2021
1462021
Training verifiers to solve math word problems, 2021
K Cobbe, V Kosaraju, M Bavarian, M Chen, H Jun, L Kaiser, M Plappert, ...
URL https://arxiv. org/abs/2110.14168, 2021
372021
Measuring sample efficiency and generalization in reinforcement learning benchmarks: Neurips 2020 procgen benchmark
S Mohanty, J Poonganam, A Gaidon, A Kolobov, B Wulfe, D Chakraborty, ...
arXiv preprint arXiv:2103.15332, 2021
182021
Batch size-invariance for policy optimization
J Hilton, K Cobbe, J Schulman
Advances in Neural Information Processing Systems 35, 17086-17098, 2022
112022
Event scheduling presentation in a graphical user interface environment
Y Shoham, JE Bank, K Cobbe, A Matta, M Rubin, ZI Weiner, KT Toft
US Patent 10,088,973, 2018
12018
The system can't perform the operation now. Try again later.
Articles 1–11