Follow
Lior Shani
Lior Shani
Verified email at google.com
Title
Cited by
Cited by
Year
Adaptive trust region policy optimization: Global convergence and faster rates for regularized mdps
L Shani, Y Efroni, S Mannor
Thirty-Fourth AAAI Conference on Artificial Intelligence, 5668-5675, 2020
1722020
Optimistic Policy Optimization with Bandit Feedback
Y Efroni, L Shani, A Rosenberg, S Mannor
Proceedings of the 37th International Conference on Machine Learning 119 …, 2020
832020
Mirror Descent Policy Optimization
M Tomar, L Shani, Y Efroni, M Ghavamzadeh
The Tenth International Conference on Learning Representations, 2020
592020
Factually consistent summarization via reinforcement learning with textual entailment feedback
P Roit, J Ferret, L Shani, R Aharoni, G Cideron, R Dadashi, M Geist, ...
arXiv preprint arXiv:2306.00186, 2023
262023
Online apprenticeship learning
L Shani, T Zahavy, S Mannor
Proceedings of the AAAI conference on artificial intelligence 36 (8), 8240-8248, 2022
222022
Exploration Conscious Reinforcement Learning Revisited
L Shani, Y Efroni, S Mannor
Proceedings of the 36th International Conference on Machine Learning, 5680--5689, 2019
17*2019
Reinforcement Learning with History Dependent Dynamic Contexts
G Tennenholtz, N Merlis, L Shani, M Mladenov, C Boutilier
International Conference on Machine Learning, 34011-34053, 2023
32023
Reinforcement learning with a terminator
G Tennenholtz, N Merlis, L Shani, S Mannor, U Shalit, G Chechik, ...
Advances in Neural Information Processing Systems 35, 35696-35709, 2022
22022
Demystifying Embedding Spaces using Large Language Models
G Tennenholtz, Y Chow, CW Hsu, J Jeong, L Shani, A Tulepbergenov, ...
arXiv preprint arXiv:2310.04475, 2023
12023
Multi instance learning for unbalanced data
M Kozdoba, E Moroshko, L Shani, T Takagi, T Katoh, S Mannor, ...
arXiv preprint arXiv:1812.07010, 2018
12018
The system can't perform the operation now. Try again later.
Articles 1–10