Accelerating training in pommerman with imitation and reinforcement learning H Meisheri, O Shelke, R Verma, H Khadilkar arXiv preprint arXiv:1911.04947, 2019 | 9 | 2019 |
Anticipatory decisions in retail e-commerce warehouses using reinforcement learning O Shelke, V Baniwal, H Khadilkar Proceedings of the 3rd ACM India Joint International Conference on Data …, 2021 | 2 | 2021 |
Identifying efficient curricula for reinforcement learning in complex environments with a fixed computational budget O Shelke, H Meisheri, H Khadilkar Proceedings of the 5th Joint International Conference on Data Science …, 2022 | 1 | 2022 |
Method and system for reinforcement learning and dual channel action embedding based robotic navigation H Khadilkar, HB Meisheri, OD Shelke, D Kalwar, P Pathakota US Patent App. 18/355,099, 2024 | | 2024 |
A Learning Approach for Discovering Cost-Efficient Integrated Sourcing and Routing Strategies in E-Commerce O Shelke, P Pathakota, A Chauhan, H Meisheri, H Khadilkar, B Ravindran Proceedings of the 7th Joint International Conference on Data Science …, 2024 | | 2024 |
Multi-Agent Learning of Efficient Fulfilment and Routing Strategies in E-Commerce O Shelke, P Pathakota, A Chauhan, H Khadilkar, H Meisheri, B Ravindran arXiv preprint arXiv:2311.16171, 2023 | | 2023 |
Using General Value Functions to Learn Domain-Backed Inventory Management Policies D Kalwar, O Shelke, H Khadilkar arXiv preprint arXiv:2311.02125, 2023 | | 2023 |
Follow your Nose: Using General Value Functions for Directed Exploration in Reinforcement Learning D Kalwar, O Shelke, S Nath, H Meisheri, H Khadilkar arXiv preprint arXiv:2203.00874, 2022 | | 2022 |
School of hard knocks: Curriculum analysis for Pommerman with a fixed computational budget O Shelke, H Meisheri, H Khadilkar arXiv preprint arXiv:2102.11762, 2021 | | 2021 |