PaLM-E: An embodied multimodal language model D Driess, F Xia, MSM Sajjadi, C Lynch, A Chowdhery, B Ichter, A Wahid, ... arXiv preprint arXiv:2303.03378, 2023 | 1236 | 2023 |
RT-2: Vision-language-action models transfer web knowledge to robotic control A Brohan, N Brown, J Carbajal, Y Chebotar, X Chen, K Choromanski, ... arXiv preprint arXiv:2307.15818, 2023 | 524 | 2023 |
Transporter networks: Rearranging the visual world for robotic manipulation A Zeng, P Florence, J Tompson, S Welker, J Chien, M Attarian, ... Conference on Robot Learning, 726-747, 2021 | 435 | 2021 |
Implicit behavioral cloning P Florence, C Lynch, A Zeng, OA Ramirez, A Wahid, L Downs, A Wong, ... Conference on Robot Learning, 158-168, 2022 | 327 | 2022 |
Visual representations for semantic target driven navigation A Mousavian, A Toshev, M Fišer, J Košecká, A Wahid, J Davidson 2019 International Conference on Robotics and Automation (ICRA), 8846-8852, 2019 | 237 | 2019 |
Open X-Embodiment: Robotic learning datasets and RT-X models A Padalkar, A Pooley, A Jain, A Bewley, A Herzog, A Irpan, A Khazatsky, ... arXiv preprint arXiv:2310.08864, 2023 | 203 | 2023 |
Interactive language: Talking to robots in real time C Lynch, A Wahid, J Tompson, T Ding, J Betker, R Baruch, T Armstrong, ... IEEE Robotics and Automation Letters, 2023 | 165 | 2023 |
RT-2: Vision-language-action models transfer web knowledge to robotic control B Zitkovich, T Yu, S Xu, P Xu, T Xiao, F Xia, J Wu, P Wohlhart, S Welker, ... Conference on Robot Learning, 2165-2183, 2023 | 122 | 2023 |
Robotic skill acquisition via instruction augmentation with vision-language models T Xiao, H Chan, P Sermanet, A Wahid, A Brohan, K Hausman, S Levine, ... arXiv preprint arXiv:2211.11736, 2022 | 58 | 2022 |
Open X-Embodiment: Robotic learning datasets and RT-X models OXE Collaboration, A Padalkar, A Pooley, A Jain, A Bewley, A Herzog, ... arXiv preprint arXiv:2310.08864, 2023 | 41 | 2023 |
Video language planning Y Du, M Yang, P Florence, F Xia, A Wahid, B Ichter, P Sermanet, T Yu, ... arXiv preprint arXiv:2310.10625, 2023 | 36 | 2023 |
PIVOT: Iterative visual prompting elicits actionable knowledge for VLMs S Nasiriany, F Xia, W Yu, T Xiao, J Liang, I Dasgupta, A Xie, D Driess, ... arXiv preprint arXiv:2402.07872, 2024 | 34 | 2024 |
Long range neural navigation policies for the real world A Wahid, A Toshev, M Fiser, TWE Lee 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2019 | 21 | 2019 |
Learning object-conditioned exploration using distributed soft actor critic A Wahid, A Stone, K Chen, B Ichter, A Toshev Conference on Robot Learning, 2020 | 19 | 2020 |
Open X-Embodiment: Robotic learning datasets and RT-X models Q Vuong, S Levine, HR Walke, K Pertsch, A Singh, R Doshi, C Xu, J Luo, ... Towards Generalist Robots: Learning Paradigms for Scalable Skill Acquisition …, 2023 | 17 | 2023 |
Visuomotor control in multi-object scenes using object-aware representations N Heravi, A Wahid, C Lynch, P Florence, T Armstrong, J Tompson, ... 2023 IEEE International Conference on Robotics and Automation (ICRA), 9515-9522, 2023 | 15 | 2023 |
Open X-Embodiment: Robotic Learning Datasets and RT-X Models: Open X-Embodiment Collaboration A O’Neill, A Rehman, A Maddukuri, A Gupta, A Padalkar, A Lee, A Pooley, ... 2024 IEEE International Conference on Robotics and Automation (ICRA), 6892-6903, 2024 | 10 | 2024 |
Learning to learn faster from human feedback with language model predictive control J Liang, F Xia, W Yu, A Zeng, MG Arenas, M Attarian, M Bauza, M Bennice, ... arXiv preprint arXiv:2402.11450, 2024 | 9 | 2024 |
Vid2Robot: End-to-end video-conditioned policy learning with cross-attention transformers V Jain, M Attarian, NJ Joshi, A Wahid, D Driess, Q Vuong, PR Sanketi, ... arXiv preprint arXiv:2403.12943, 2024 | 6 | 2024 |
PyReach - Python client SDK for robot remote control A Wong, A Zeng, A Bose, A Wahid, D Kalashnikov, I Krasin, J Varley, ... | 6 | 2022 |