Actor prioritized experience replay B Saglam, FB Mutlu, DC Cicek, SS Kozat Journal of Artificial Intelligence Research 78, 639-672, 2023 | 13 | 2023 |
Off-policy correction for deep deterministic policy gradient algorithms via batch prioritized experience replay DC Cicek, E Duran, B Saglam, FB Mutlu, SS Kozat 2021 IEEE 33rd International Conference on Tools with Artificial …, 2021 | 13 | 2021 |
Awd3: Dynamic reduction of the estimation bias DC Cicek, E Duran, B Saglam, K Kaya, F Mutlu, SS Kozat 2021 IEEE 33rd international conference on tools with artificial …, 2021 | 11 | 2021 |
Estimation error correction in deep reinforcement learning for deterministic actor-critic methods B Saglam, E Duran, DC Cicek, FB Mutlu, SS Kozat 2021 IEEE 33rd international conference on tools with artificial …, 2021 | 10 | 2021 |
Off-Policy correction for actor-critic algorithms in deep reinforcement learning B Saglam, DC Cicek, FB Mutlu, SS Kozat arXiv preprint arXiv:2208.00755, 2022 | 4 | 2022 |
Parameter-free deterministic reduction of the estimation bias in continuous control B Saglam, E Duran, DC Cicek, FB Mutlu, SS Kozat arXiv preprint arXiv:2109.11788, 2021 | 3 | 2021 |
Off-Policy Correction for Actor-Critic Methods without Importance Sampling B Saglam, DC Cicek, FB Mutlu, SS Kozat arXiv preprint arxiv:2208.00755, 2022 | 2 | 2022 |
Parameter-free reduction of the estimation bias in deep reinforcement learning for deterministic policy gradients B Saglam, FB Mutlu, DC Cicek, SS Kozat Neural Processing Letters 56 (2), 80, 2024 | 1 | 2024 |
Actor Prioritized Experience Replay (Abstract Reprint) B Saglam, F Mutlu, D Cicek, S Kozat Proceedings of the AAAI Conference on Artificial Intelligence 38 (20), 22710 …, 2024 | | 2024 |
Mitigating Off-Policy Bias in Actor-Critic Methods with One-Step Q-learning: A Novel Correction Approach B Saglam, DC Cicek, FB Mutlu, SS Kozat arXiv preprint arXiv:2208.00755, 2022 | | 2022 |
Safe and Robust Experience Sharing for Deterministic Policy Gradient Algorithms B Saglam, DC Cicek, FB Mutlu, SS Kozat Responsible Decision Making in Dynamic Environments Workshop in the 39th …, 2022 | | 2022 |
Novel Experience Replay Mechanisms to Improve the Performance of the Deep Deterministic Policy Gradients Algorithms DC Çiçek PQDT-Global, 2022 | | 2022 |