Jacob Beck
University of Oxford
A survey of meta-reinforcement learning
J Beck, R Vuorio, EZ Liu, Z Xiong, L Zintgraf, C Finn, S Whiteson
arXiv preprint arXiv:2301.08028, 2023
Hypernetworks in Meta-Reinforcement Learning
J Beck, MT Jackson, R Vuorio, S Whiteson
6th Annual Conference on Robot Learning, 2022
AMRL: Aggregated memory for reinforcement learning
J Beck, K Ciosek, S Devlin, S Tschiatschek, C Zhang, K Hofmann
International Conference on Learning Representations, 2019
Monotonic Improvement Guarantees under Non-stationarity for Decentralized PPO
M Sun, S Devlin, J Beck, K Hofmann, S Whiteson
arXiv preprint arXiv:2202.00082, 2022
On the practical consistency of meta-reinforcement learning algorithms
Z Xiong, L Zintgraf, J Beck, R Vuorio, S Whiteson
arXiv preprint arXiv:2112.00478, 2021
Stackelberg punishment and bully-proofing autonomous vehicles
M Cooper, JK Lee, J Beck, JD Fishman, M Gillett, Z Papakipos, A Zhang, ...
Social Robotics: 11th International Conference, ICSR 2019, Madrid, Spain …, 2019
Trust region bounds for decentralized PPO under non-stationarity
M Sun, S Devlin, J Beck, K Hofmann, S Whiteson
arXiv preprint arXiv:2202.00082, 2022
No DICE: An investigation of the bias-variance tradeoff in meta-gradients
R Vuorio, JA Beck, G Farquhar, JN Foerster, S Whiteson
Deep RL Workshop NeurIPS 2021, 2021
Recurrent Hypernetworks are Surprisingly Strong in Meta-RL
J Beck, R Vuorio, Z Xiong, S Whiteson
Advances in Neural Information Processing Systems 36, 2024
Universal morphology control via contextual modulation
Z Xiong, J Beck, S Whiteson
International Conference on Machine Learning, 38286-38300, 2023
ReNeg and Backseat Driver: Learning from demonstration with continuous human feedback
J Beck, Z Papakipos, M Littman
arXiv preprint arXiv:1901.05101, 2019
SplAgger: Split Aggregation for Meta-Reinforcement Learning
J Beck, M Jackson, R Vuorio, Z Xiong, S Whiteson
arXiv preprint arXiv:2403.03020, 2024
Distilling Morphology-Conditioned Hypernetworks for Efficient Universal Morphology Control
Z Xiong, R Vuorio, J Beck, M Zimmer, K Shao, S Whiteson
arXiv preprint arXiv:2402.06570, 2024
An Investigation of the Bias-Variance Tradeoff in Meta-Gradients
R Vuorio, J Beck, S Whiteson, J Foerster, G Farquhar
arXiv preprint arXiv:2209.11303, 2022
Human-Actor Human-Critic
J Beck, N Srinivasan, A Shah, J Roy
ReNeg and Backseat Driver: Learning from demonstration with continuous human feedback
Z Papakipos, J Beck, M Littman
Neural Mesh: Introducing a Notion of Space and Conservation of Energy to Neural Networks
J Beck, Z Papakipos
arXiv preprint arXiv:1807.11121, 2018
Hypernetworks in Meta-Reinforcement Learning Supplementary Materials
J Beck, M Jackson, R Vuorio, S Whiteson
Collaboration in Deep MARL
J Beck