Language to rewards for robotic skill synthesis W Yu, N Gileadi, C Fu, S Kirmani, KH Lee, MG Arenas, HTL Chiang, ... arXiv preprint arXiv:2306.08647, 2023 | 187 | 2023 |
Meta reinforcement learning as task inference J Humplik, A Galashov, L Hasenclever, PA Ortega, YW Teh, N Heess arXiv preprint arXiv:1905.06424, 2019 | 140 | 2019 |
Learning agile soccer skills for a bipedal robot with deep reinforcement learning T Haarnoja, B Moran, G Lever, SH Huang, D Tirumala, J Humplik, ... Science Robotics 9 (89), eadi8022, 2024 | 79 | 2024 |
Imitate and repurpose: Learning reusable robot movement skills from human and animal behaviors S Bohez, S Tunyasuvunakool, P Brakel, F Sadeghi, L Hasenclever, ... arXiv preprint arXiv:2203.17138, 2022 | 38 | 2022 |
Nerf2real: Sim2real transfer of vision-guided bipedal motion skills using neural radiance fields A Byravan, J Humplik, L Hasenclever, A Brussee, F Nori, T Haarnoja, ... 2023 IEEE International Conference on Robotics and Automation (ICRA), 9362-9369, 2023 | 34 | 2023 |
Probabilistic models for neural populations that naturally capture global coupling and criticality J Humplik, G Tkačik PLoS computational biology 13 (9), e1005763, 2017 | 34 | 2017 |
Towards real robot learning in the wild: A case study in bipedal locomotion M Bloesch, J Humplik, V Patraucean, R Hafner, T Haarnoja, A Byravan, ... Conference on Robot Learning, 1502-1511, 2022 | 23 | 2022 |
Neural belief states for partially observed domains P Moreno, J Humplik, G Papamakarios, BA Pires, L Buesing, N Heess, ... NeurIPS 2018 workshop on reinforcement learning under partial observability, 2018 | 20 | 2018 |
Evolutionary dynamics of infectious diseases in finite populations J Humplik, AL Hill, MA Nowak Journal of theoretical biology 360, 149-162, 2014 | 19 | 2014 |
Learning to learn faster from human feedback with language model predictive control J Liang, F Xia, W Yu, A Zeng, MG Arenas, M Attarian, M Bauza, M Bennice, ... arXiv preprint arXiv:2402.11450, 2024 | 9 | 2024 |
Forgetting and imbalance in robot lifelong learning with off-policy data W Zhou, S Bohez, J Humplik, N Heess, A Abdolmaleki, D Rao, ... Conference on Lifelong Learning Agents, 294-309, 2022 | 6 | 2022 |
Skills: Adaptive skill sequencing for efficient temporally-extended exploration G Vezzani, D Tirumala, M Wulfmeier, D Rao, A Abdolmaleki, B Moran, ... arXiv preprint arXiv:2211.13743, 2022 | 5 | 2022 |
Inferring couplings in networks across order-disorder phase transitions V Ngampruetikorn, V Sachdeva, J Torrence, J Humplik, DJ Schwab, ... Physical review research 4 (2), 023240, 2022 | 5 | 2022 |
Importance weighted policy learning and adaptation A Galashov, J Sygnowski, G Desjardins, J Humplik, L Hasenclever, ... arXiv preprint arXiv:2009.04875, 2020 | 4 | 2020 |
Semiparametric energy-based probabilistic models J Humplik, G Tkačik arXiv preprint arXiv:1605.07371, 2016 | 4 | 2016 |
Offline Distillation for Robot Lifelong Learning with Imbalanced Experience W Zhou, S Bohez, J Humplik, A Abdolmaleki, D Rao, M Wulfmeier, ... CoRR abs/2204.05893, 2022 | 1 | 2022 |
Diffusion Augmented Agents: A Framework for Efficient Exploration and Transfer Learning N Di Palo, L Hasenclever, J Humplik, A Byravan arXiv preprint arXiv:2407.20798, 2024 | | 2024 |
Learning Robot Soccer from Egocentric Vision with Deep Reinforcement Learning D Tirumala, M Wulfmeier, B Moran, S Huang, J Humplik, G Lever, ... arXiv preprint arXiv:2405.02425, 2024 | | 2024 |