Prabhat Nagarajan
PhD Student | The University of Alberta
Verified email at ualberta.ca
Title
Cited by
Year
Extrapolating beyond suboptimal demonstrations via inverse reinforcement learning from observations
D Brown, W Goo, P Nagarajan, S Niekum
International Conference on Machine Learning, 783-792, 2019
Cited by 339, 2019
ChainerRL: A Deep Reinforcement Learning Library
Y Fujita, P Nagarajan, T Kataoka, T Ishikawa
Journal of Machine Learning Research 22 (77), 1-14, 2021
Cited by 126, 2021
Deterministic Implementations for Reproducibility in Deep Reinforcement Learning
P Nagarajan, G Warnell, P Stone
AAAI 2019 Workshop on Reproducible AI, 2019
Cited by 57, 2019
The Impact of Nondeterminism on Reproducibility in Deep Reinforcement Learning
P Nagarajan, G Warnell, P Stone
2nd Reproducibility in Machine Learning Workshop at ICML 2018, Stockholm, Sweden, 2018
Cited by 31, 2018
Distributed Reinforcement Learning of Targeted Grasping with Active Vision for Mobile Manipulators
Y Fujita, K Uenishi, A Ummadisingu, P Nagarajan, S Masuda, MY Castro
2020 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2020
Cited by 22, 2020
Periodic Intra-Ensemble Knowledge Distillation for Reinforcement Learning
ZW Hong, P Nagarajan, G Maeda
European Conference on Machine Learning and Principles and Practice of …, 2021
Cited by 5, 2021
Learning Latent State Spaces for Planning through Reward Prediction
A Havens, Y Ouyang, P Nagarajan, Y Fujita
Workshop on Deep Reinforcement Learning at the 33rd Conference on Neural …, 2019
Cited by 4, 2019
Reconnaissance for Reinforcement Learning with Safety Constraints
S Maeda, H Watahiki, Y Ouyang, S Okada, M Koyama, P Nagarajan
European Conference on Machine Learning and Principles and Practice of …, 2021
Cited by 2, 2021
When is Offline Policy Selection Sample Efficient for Reinforcement Learning?
V Liu, P Nagarajan, A Patterson, M White
arXiv preprint arXiv:2312.02355, 2023
2023
Swarm-inspired Reinforcement Learning via Collaborative Inter-agent Knowledge Distillation
ZW Hong, P Nagarajan, G Maeda
Workshop on Deep Reinforcement Learning at the 33rd Conference on Neural …, 2019
2019
Nondeterminism as a Reproducibility Challenge for Deep Reinforcement Learning
PM Nagarajan
The University of Texas at Austin, 2018
2018