Follow
Tom Zahavy
Tom Zahavy
Senior Research Scientist, DeepMind
Verified email at deepmind.com - Homepage
Title
Cited by
Cited by
Year
A deep hierarchical approach to lifelong learning in minecraft
C Tessler, S Givony, T Zahavy, DJ Mankowitz, S Mannor
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence …, 2016
3322016
Graying the black box: Understanding dqns
T Zahavy, N Ben-Zrihem, S Mannor
International Conference on Machine Learning (ICML) 2016, 1899-1908, 2016
2112016
Learn What Not to Learn: Action Elimination with Deep Reinforcement Learning
T Zahavy, M Haroush, N Merlis, DJ Mankowitz, S Mannor
Advances in Neural Information Processing Systems (NeurIPS) 2018, 2018
1582018
Deep learning reconstruction of ultrashort pulses
T Zahavy, A Dikopoltsev, D Moss, GI Haham, O Cohen, S Mannor, ...
Optica 5 (5), 666-673, 2018
932018
Is a picture worth a thousand words? A deep multi-modal architecture for product classification in e-commerce
T Zahavy, A Krishnan, A Magnani, S Mannor
Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018
91*2018
A self-tuning actor-critic algorithm
T Zahavy, Z Xu, V Veeriah, M Hessel, J Oh, HP van Hasselt, D Silver, ...
Advances in Neural Information Processing Systems 33, 2020
43*2020
Shallow updates for deep reinforcement learning
N Levine, T Zahavy, DJ Mankowitz, A Tamar, S Mannor
Advances in Neural Information Processing Systems (NeurIPS) 2017, 3135-3145, 2017
412017
Online Limited Memory Neural-Linear Bandits with Likelihood Matching
O Nabati, T Zahavy, S Mannor
International Conference on Machine Learning (ICML) 2021, 2021
25*2021
Ensemble Robustness and Generalization of Stochastic Deep Learning Algorithms
T Zahavy, B Kang, A Sivak, J Feng, H Xu, S Mannor
International Conference on Learning Representations Workshop (ICLRW'18), 2016
25*2016
Action assembly: Sparse imitation learning for text based games with combinatorial action spaces
C Tessler, T Zahavy, D Cohen, DJ Mankowitz, S Mannor
RLDM 2019: The Multi-disciplinary Conference on Reinforcement Learning and …, 2019
16*2019
Deep learning reconstruction of ultrashort pulses from 2D spatial intensity patterns recorded by an all-in-line system in a single-shot
R Ziv, A Dikopoltsev, T Zahavy, I Rubinstein, P Sidorenko, O Cohen, ...
Optics express 28 (5), 7528-7538, 2020
152020
Discovery of Options via Meta-Learned Subgoals
V Veeriah, T Zahavy, M Hessel, Z Xu, J Oh, I Kemaev, H van Hasselt, ...
Advances in Neural Information Processing Systems (NeurIPS) 2021, 2021
132021
Visualizing Dynamics: from t-SNE to SEMI-MDPs
NB Zrihem, T Zahavy, S Mannor
ICML Workshop on Human Interpretability in Machine Learning (WHI 2016),, 2016
13*2016
Balancing Constraints and Rewards with Meta-Gradient D4PG
DA Calian, DJ Mankowitz, T Zahavy, Z Xu, J Oh, N Levine, T Mann
International Conference on Learning Representations (ICLR) 2021, 2021
112021
Unknown mixing times in apprenticeship and reinforcement learning
T Zahavy, A Cohen, H Kaplan, Y Mansour
Conference on Uncertainty in Artificial Intelligence (UAI), 2020, 2020
11*2020
Reward is enough for convex MDPs
T Zahavy, B O'Donoghue, G Desjardins, S Singh
Advances in Neural Information Processing Systems (NeurIPS) 2021, 2021
102021
Discovering a Set of Policies for the Worst Case Reward
T Zahavy, A Barreto, DJ Mankowitz, S Hou, B O’Donoghue, I Kemaev, ...
International Conference on Learning Representations (ICLR) 2021, 2021
92021
Deep neural networks in single-shot ptychography
O Wengrowicz, O Peleg, T Zahavy, B Loevsky, O Cohen
Optics Express 28 (12), 17511-17520, 2020
92020
Apprenticeship learning via frank-wolfe
T Zahavy, A Cohen, H Kaplan, Y Mansour
Proceedings of the AAAI Conference on Artificial Intelligence 34 (04), 6720-6728, 2020
92020
Sub-Nyquist sampling of OFDM signals for cognitive radios
T Zahavy, O Shayer, D Cohen, A Tolmachev, YC Eldar
2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014
92014
The system can't perform the operation now. Try again later.
Articles 1–20