Scale-free adversarial multi armed bandits SR Putta, S Agrawal International Conference on Algorithmic Learning Theory, 910-930, 2022 | 12 | 2022 |
Pure Exploration in Episodic Fixed-Horizon Markov Decision Processes. SR Putta, T Tulabandhula AAMAS, 1703-1704, 2017 | 6 | 2017 |
Multi Armed Bandits and Exploration Strategies S Raja Multi Armed Bandits and Exploration Strategies–Sudeep Raja–MS/Phd Student at …, 2016 | 5 | 2016 |
A Derivation of Backpropagation in Matrix Form S Raja https://sudeepraja.github.io/Neural/, 2017 | 3 | 2017 |
Regret Bounds for Optimistic Follow The Leader: Applications in Portfolio Selection and Linear Regression SR Putta, S Agrawal OPT 2023: Optimization for Machine Learning, 2023 | | 2023 |
Exponential Weights on the Hypercube in Polynomial Time SR Putta, A Shetty Proceedings of the 22nd International Conference on Artificial Intelligence …, 2019 | | 2019 |
Efficient Reinforcement Learning via Initial Pure Exploration SR Putta, T Tulabandhula Reinforcement Learning and Decision Making 2017, 2017 | | 2017 |