Silviu Pitis
University of Toronto, Vector Institute
Verified email at cs.toronto.edu
Title · Cited by · Year
Large language models are human-level prompt engineers
Y Zhou, AI Muresanu, Z Han, K Paster, S Pitis, H Chan, J Ba
International Conference on Learning Representations (ICLR 2023), 2023
Cited by 425 · 2023
Maximum Entropy Gain Exploration for Long Horizon Multi-goal Reinforcement Learning
S Pitis, H Chan, S Zhao, B Stadie, J Ba
International Conference on Machine Learning (ICML 2020), 2020
Cited by 111 · 2020
Counterfactual data augmentation using locally factored dynamics
S Pitis, E Creager, A Garg
Neural Information Processing Systems (NeurIPS 2020), 2020
Cited by 70 · 2020
Rethinking the Discount Factor in Reinforcement Learning: A Decision Theoretic Approach
S Pitis
The Thirty-Third AAAI Conference on Artificial Intelligence (AAAI-19), 2019
Cited by 47 · 2019
Fixed-Horizon Temporal Difference Methods for Stable Reinforcement Learning
K De Asis, A Chan, S Pitis, RS Sutton, D Graves
The Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI-20), 2020
Cited by 30 · 2020
Boosted prompt ensembles for large language models
S Pitis, MR Zhang, A Wang, J Ba
arXiv preprint arXiv:2304.05970, 2023
Cited by 24 · 2023
An Inductive Bias for Distances: Neural Nets that Respect the Triangle Inequality
S Pitis, H Chan, K Jamali, J Ba
Eighth International Conference on Learning Representations (ICLR 2020), 2020
Cited by 21 · 2020
Source Traces for Temporal Difference Learning
S Pitis
The Thirty-Second AAAI Conference on Artificial Intelligence (AAAI-18), 2018
Cited by 19 · 2018
MoCoDA: Model-based Counterfactual Data Augmentation
S Pitis, E Creager, A Mandlekar, A Garg
Neural Information Processing Systems (NeurIPS 2022), 2022
Cited by 18 · 2022
Large language models are human-level prompt engineers (2022)
Y Zhou, AI Muresanu, Z Han, K Paster, S Pitis, H Chan, J Ba
arXiv preprint arXiv:2211.01910, 2022
Cited by 16 · 2022
Identifying the Risks of LM Agents with an LM-Emulated Sandbox
Y Ruan, H Dong, A Wang, S Pitis, Y Zhou, J Ba, Y Dubois, CJ Maddison, ...
arXiv preprint arXiv:2309.15817, 2023
Cited by 15 · 2023
Failure Modes of Learning Reward Models for LLMs and Other Sequence Models
S Pitis
ICML 2023 Workshop The Many Facets of Preference-Based Learning, 2023
Cited by 6 · 2023
Consistent Aggregation of Objectives with Diverse Time Preferences Requires Non-Markovian Rewards
S Pitis
Neural Information Processing Systems (NeurIPS 2023), 2023
Cited by 5* · 2023
Calibrating language models via augmented prompt ensembles
M Jiang, Y Ruan, S Huang, S Liao, S Pitis, RB Grosse, J Ba
Cited by 4 · 2023
Return augmentation gives supervised RL temporal compositionality
K Paster, S Pitis, SA McIlraith, J Ba
Deep Reinforcement Learning Workshop NeurIPS 2022, 2022
Cited by 4 · 2022
Steering large language models using APE
Y Zhou, AI Muresanu, Z Han, K Paster, S Pitis, H Chan, J Ba
NeurIPS ML Safety Workshop, 2022
Cited by 3 · 2022
Objective Social Choice: Using Auxiliary Information to Improve Voting Outcomes
S Pitis, MR Zhang
International Conference on Autonomous Agents and Multi-Agent Systems 2020, 2020
Cited by 3 · 2020
ProtoGE: Prototype Goal Encodings for Multi-goal Reinforcement Learning
S Pitis, H Chan, J Ba
The 4th Multidisciplinary Conference on Reinforcement Learning and Decision …, 2019
Cited by 3 · 2019
Methods for retrieving alternative contract language using a prototype
S Pitis
The Sixteenth International Conference on Law and Artificial Intelligence …, 2017
Cited by 3 · 2017
CSC 311: Introduction to machine learning
R Grosse, C Maddison, J Bae, S Pitis
University of Toronto, Fall 2020
Cited by 2 · 2020
Articles 1–20