Understanding the Eluder Dimension. G Li, P Kamath, DJ Foster, N Srebro. arXiv preprint arXiv:2104.06970, 2021 | 25* | 2021 |
Pessimism for Offline Linear Contextual Bandits using Confidence Sets. G Li, C Ma, N Srebro. Advances in Neural Information Processing Systems 35, 20974-20987, 2022 | 17 | 2022 |
Dueling optimization with a monotone adversary. A Blum, M Gupta, G Li, NS Manoj, A Saha, Y Yang. International Conference on Algorithmic Learning Theory, 221-243, 2024 | 3 | 2024 |
Exponential family model-based reinforcement learning via score matching. G Li, J Li, A Kabra, N Srebro, Z Wang, Z Yang. Advances in Neural Information Processing Systems 35, 28474-28487, 2022 | 3 | 2022 |
When is Agnostic Reinforcement Learning Statistically Tractable? Z Jia, G Li, A Rakhlin, A Sekhari, N Srebro. Advances in Neural Information Processing Systems 36, 2024 | 1 | 2024 |