Harsh Mehta

Cited by

	All	Since 2019
Citations	1818	1817
h-index	13	13
i10-index	13	13

Citations per year
Year	2019	2020	2021	2022	2023	2024
Citations	8	34	65	196	704	804

Public access (based on funding mandates)

Available	1 article
Not available	0 articles

Harsh Mehta

Staff Engineer, Google Research

Verified email at google.com

Natural Language Processing, Reinforcement Learning, Artificial Intelligence


Title	Authors	Venue	Cited by	Year
Beyond the imitation game: Quantifying and extrapolating the capabilities of language models	A Srivastava, A Rastogi, A Rao, AAM Shoeb, A Abid, A Fisch, AR Brown, ...	arXiv preprint arXiv:2206.04615, 2022	729	2022
Gemini: a family of highly capable multimodal models	G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ...	arXiv preprint arXiv:2312.11805, 2023	442	2023
Transformer memory as a differentiable search index	Y Tay, VQ Tran, M Dehghani, J Ni, D Bahri, H Mehta, Z Qin, K Hui, Z Zhao, ...	Advances in Neural Information Processing Systems, 2022	148	2022
Momentum Improves Normalized SGD	A Cutkosky, H Mehta	International Conference on Machine Learning, 2020	98	2020
Long range language modeling via gated state spaces	H Mehta, A Gupta, A Cutkosky, B Neyshabur	International Conference on Learning Representations, 2022	88	2022
Transferable representation learning in vision-and-language navigation	H Huang, V Jain, H Mehta, A Ku, G Magalhaes, J Baldridge, E Ie	Proceedings of the IEEE/CVF international conference on computer vision …, 2019	86	2019
High-probability Bounds for Non-Convex Stochastic Optimization with Heavy Tails	A Cutkosky, H Mehta	Advances in Neural Information Processing Systems, 2021	41	2021
Large scale transfer learning for differentially private image classification	H Mehta, A Thakurta, A Kurakin, A Cutkosky	Transactions on Machine Learning Research, 2022	39	2022
Retouchdown: Adding touchdown to streetlearn as a shareable resource for language grounding tasks in street view	H Mehta, Y Artzi, J Baldridge, E Ie, P Mirowski	arXiv preprint arXiv:2001.03671, 2020	33	2020
Multi-modal discriminative model for vision-and-language navigation	H Huang, V Jain, H Mehta, J Baldridge, E Ie	arXiv preprint arXiv:1905.13358, 2019	25	2019
Optimal Stochastic Non-smooth Non-convex Optimization through Online-to-Non-convex Conversion	A Cutkosky, H Mehta, F Orabona	International Conference on Machine Learning, 2023	21	2023
Extreme Memorization via Scale of Initialization	H Mehta, A Cutkosky, B Neyshabur	International Conference on Learning Representations, 2021	17	2021
Simplifying and understanding state space models with diagonal linear rnns	A Gupta, H Mehta, J Berant	arXiv preprint arXiv:2212.00768, 2022	13	2022
VALAN: vision and language agent navigation	L Lansing, V Jain, H Mehta, H Huang, E Ie	arXiv preprint arXiv:1912.03241, 2019	8	2019
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context	M Reid, N Savinov, D Teplyashin, D Lepikhin, T Lillicrap, J Alayrac, ...	arXiv preprint arXiv:2403.05530, 2024	6	2024
Differentially Private Image Classification from Features	H Mehta, W Krichene, A Thakurta, A Kurakin, A Cutkosky	Transactions on Machine Learning Research, 2022	5	2022
ALX: Large scale matrix factorization on TPUs	H Mehta, S Rendle, W Krichene, L Zhang	arXiv preprint arXiv:2112.02194, 2021	5	2021
Mechanic: A Learning Rate Tuner	A Cutkosky, A Defazio, H Mehta	Advances in Neural Information Processing Systems, 2023	4	2023
Convexifying transformers: Improving optimization and understanding of transformer networks	T Ergen, B Neyshabur, H Mehta	arXiv preprint arXiv:2211.11052, 2022	4	2022
Towards large scale transfer learning for differentially private image classification	H Mehta, AG Thakurta, A Kurakin, A Cutkosky	Transactions on Machine Learning Research, 2022	4	2022
