Łukasz Kaiser

Cited by

	All	Since 2019
Citations	197788	185074
h-index	57	53
i10-index	89	76

53000

26500

13250

39750

2016201720182019202020212022202320241031 3410 6967 11774 18529 27602 37733 52618 36774

Public access

View all

6 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Noam ShazeerCharacter.aiVerified email at character.ai
Jakob UszkoreitInceptiveVerified email at uszkoreit.net
Ashish VaswaniStartupVerified email at fastmail.com
Aidan GomezCohereVerified email at cohere.ai
Illia PolosukhinNEARVerified email at near.ai
Oriol VinyalsResearch Scientist at Google DeepMindVerified email at google.com
Samy BengioSenior Director, AI and Machine Learning Research, AppleVerified email at apple.com
Stephan GouwsSenior Research Scientist, Google DeepMindVerified email at google.com
Ilya SutskeverCo-Founder and Chief Scientist of OpenAIVerified email at openai.com
Henryk MichalewskiGoogleVerified email at google.com
Anselm LevskayaResearch Scientist, GoogleVerified email at google.com
Ben D GoodrichGoogleVerified email at google.com
Mohammad SalehGoogle BrainVerified email at google.com
Étienne PotGoogleVerified email at epfl.ch
George TuckerGoogle BrainVerified email at google.com
Quoc V. LeResearch Scientist, GoogleVerified email at stanford.edu
François CholletGoogleVerified email at google.com
Mostafa DehghaniResearch Scientist, Google DeepMindVerified email at google.com
Piotr KozakowskiUniversity of WarsawVerified email at mimuw.edu.pl
Geoffrey HintonEmeritus Prof. Computer Science, University of TorontoVerified email at cs.toronto.edu

Łukasz Kaiser

OpenAI & CNRS

Verified email at openai.com - Homepage

Machine Learning & Logic in Computer Science


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Attention is all you need A Vaswani, N Shazeer, N Parmar, J Uszkoreit, L Jones, AN Gomez, ... Advances in neural information processing systems 30, 2017	126589	2017
TensorFlow: Large-scale machine learning on heterogeneous systems M Abadi, A Agarwal, P Barham, E Brevdo, Z Chen, C Citro, GS Corrado, ...	30685*	2015
Google's neural machine translation system: Bridging the gap between human and machine translation Y Wu, M Schuster, Z Chen, QV Le, M Norouzi, W Macherey, M Krikun, ... arXiv preprint arXiv:1609.08144, 2016	8718	2016
Gpt-4 technical report J Achiam, S Adler, S Agarwal, L Ahmad, I Akkaya, FL Aleman, D Almeida, ... arXiv preprint arXiv:2303.08774, 2023	2524	2023
Reformer: The efficient transformer N Kitaev, Ł Kaiser, A Levskaya arXiv preprint arXiv:2001.04451, 2020	2494	2020
Evaluating large language models trained on code M Chen, J Tworek, H Jun, Q Yuan, HPDO Pinto, J Kaplan, H Edwards, ... arXiv preprint arXiv:2107.03374, 2021	2471	2021
Image transformer N Parmar, A Vaswani, J Uszkoreit, L Kaiser, N Shazeer, A Ku, D Tran International conference on machine learning, 4055-4064, 2018	1914	2018
Advances in neural information processing systems A Vaswani, N Shazeer, N Parmar, J Uszkoreit, L Jones, AN Gomez, ... Attention is all you need, 2017	1846	2017
Attention is all you need V Ashish Advances in neural information processing systems 30, I, 2017	1702	2017
Rethinking attention with performers K Choromanski, V Likhosherstov, D Dohan, X Song, A Gane, T Sarlos, ... arXiv preprint arXiv:2009.14794, 2020	1451	2020
Training verifiers to solve math word problems K Cobbe, V Kosaraju, M Bavarian, M Chen, H Jun, L Kaiser, M Plappert, ... arXiv preprint arXiv:2110.14168, 2021	1450	2021
Attention Is All You Need.(Nips), 2017 A Vaswani, N Shazeer, N Parmar, J Uszkoreit, L Jones, AN Gomez, ... arXiv preprint arXiv:1706.03762 10, S0140525X16001837, 2017	1306	2017
Regularizing neural networks by penalizing confident output distributions G Pereyra, G Tucker, J Chorowski, Ł Kaiser, G Hinton arXiv preprint arXiv:1701.06548, 2017	1224	2017
Grammar as a foreign language O Vinyals, Ł Kaiser, T Koo, S Petrov, I Sutskever, G Hinton Advances in neural information processing systems 28, 2015	1128	2015
Generating wikipedia by summarizing long sequences PJ Liu, M Saleh, E Pot, B Goodrich, R Sepassi, L Kaiser, N Shazeer arXiv preprint arXiv:1801.10198, 2018	950	2018
Multi-task sequence to sequence learning MT Luong, QV Le, I Sutskever, O Vinyals, L Kaiser arXiv preprint arXiv:1511.06114, 2015	948	2015
Universal transformers M Dehghani, S Gouws, O Vinyals, J Uszkoreit, Ł Kaiser arXiv preprint arXiv:1807.03819, 2018	936	2018
Model-based reinforcement learning for atari L Kaiser, M Babaeizadeh, P Milos, B Osinski, RH Campbell, ... arXiv preprint arXiv:1903.00374, 2019	927	2019
Tensor2tensor for neural machine translation A Vaswani, S Bengio, E Brevdo, F Chollet, AN Gomez, S Gouws, L Jones, ... arXiv preprint arXiv:1803.07416, 2018	627	2018
Adding gradient noise improves learning for very deep networks A Neelakantan, L Vilnis, QV Le, I Sutskever, L Kaiser, K Kurach, J Martens arXiv preprint arXiv:1511.06807, 2015	593	2015

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors