Transformers are meta-reinforcement learners LC Melo international conference on machine learning, 15340-15359, 2022 | 76 | 2022 |
Housing prices prediction with a deep learning and random forest ensemble B Afonso, L Melo, W Oliveira, S Sousa, L Berton Anais do XVI Encontro Nacional de Inteligência Artificial e Computacional …, 2019 | 64 | 2019 |
Learning humanoid robot running skills through proximal policy optimization LC Melo, MROA Máximo 2019 Latin american robotics symposium (LARS), 2019 Brazilian symposium on …, 2019 | 39 | 2019 |
Robótica Educacional: Experiências Inovadoras na Educação Brasileira R Barbosa, P Blikstein Penso Editora, 2018 | 29 | 2018 |
Learning humanoid robot running motions with symmetry incentive through proximal policy optimization LC Melo, DC Melo, MROA Maximo Journal of Intelligent & Robotic Systems 102 (3), 54, 2021 | 27 | 2021 |
MARS-Gym: Offline Reinforcement Learning for Recommender Systems in Marketplaces MRO Santana*, LC Melo*, FHF Camargo*, B Brandão, A Soares, ... Workshop on Challenges of Real-World Reinforcement Learning at the 34th …, 2020 | 20* | 2020 |
Contextual Meta-Bandit for Recommender Systems Selection MRO Santana*, LC Melo*, FHF Camargo*, B Brandão, A Soares, ... Fourteenth ACM Conference on Recommender Systems, 444-449, 2020 | 17 | 2020 |
Multiagent reinforcement learning for strategic decision making and control in robotic soccer through self-play B Brandão, TW De Lima, A Soares, L Melo, MROA Maximo IEEE Access 10, 72628-72642, 2022 | 16 | 2022 |
Learning humanoid robot motions through deep neural networks LC Melo, MROA Maximo, AM da Cunha arXiv preprint arXiv:1901.00270, 2019 | 9 | 2019 |
Bottom-Up Meta-Policy Search LC Melo, MROA Maximo, AM da Cunha Workshop on Deep Reinforcement Learning at the 33rd Conference on Neural …, 2019 | 8 | 2019 |
Sliding Puzzles Gym: A Scalable Benchmark for State Representation in Visual Reinforcement Learning BLM de Oliveira, ML da Luz, B Brandão, LGB Martins, TWL Soares, ... Workshop on Open-World Agents at the 38th Conference on Neural Information …, 2024 | 2 | 2024 |
Deep Bayesian Active Learning for Preference Modeling in Large Language Models LC Melo, P Tigas, A Abate, Y Gal Advances in Neural Information Processing Systems, 2024, 2024 | 2 | 2024 |
ITAndroids Soccer3D Team Description Paper 2018 D Melo, EE Soares, G Nahum, H Lopes, J Freire, L Maia, L Melo, ... | 1 | 2018 |
ITAndroids Soccer3D Team Description Paper 2016 A Muzio, D Melo, E Henrique, F Muniz, I Marzzo, JL Saraiva, L Melo, ... | 1 | 2016 |
Temporal-Difference Variational Continual Learning LC Melo, A Abate, Y Gal arXiv preprint arXiv:2410.07812, 2024 | | 2024 |
PulseRL: Enabling Offline Reinforcement Learning for Digital Marketing Systems via Conservative Q-Learning L Melo*, L Martins*, B Oliveira*, B Brandao*, DW Soares, T Lima 2nd Offline Reinforcement Learning Workshop at Neural Information Processing …, 2021 | | 2021 |
ITAndroids Soccer3D Team Description Paper 2019 B Bertucci, D Melo, D Fidalgo, G Oliveira, I Ferreira, J Freire, L Gameiro, ... | | 2019 |
Improving Diagnosis in Health Care Systems L Melo, VU Pugliese, DA da Silva, F Rocha, RM de Barros Santana, ... | | 2018 |