Grounding large language models in interactive environments with online reinforcement learning T Carta, C Romac, T Wolf, S Lamprier, O Sigaud, PY Oudeyer International Conference on Machine Learning, 3676-3713, 2023 | 81 | 2023 |
Teachmyagent: a benchmark for automatic curriculum learning in deep rl C Romac, R Portelas, K Hofmann, PY Oudeyer International Conference on Machine Learning, 9052-9063, 2021 | 28 | 2021 |
Meta automatic curriculum learning R Portelas, C Romac, K Hofmann, PY Oudeyer arXiv preprint arXiv:2011.08463, 2020 | 8 | 2020 |
Deep Recurrent Q-Learning vs Deep Q-Learning on a simple partially observable Markov decision process with Minecraft C Romac, V Béraud arXiv preprint arXiv:1903.04311, 2019 | 8 | 2019 |
Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent Q Gallouédec, E Beeching, C Romac, E Dellandréa arXiv preprint arXiv:2402.09844, 2024 | | 2024 |