James Martens
James Martens
Research Scientist, DeepMind
Verified email at google.com - Homepage
Title
Cited by
Cited by
Year
On the importance of initialization and momentum in deep learning
I Sutskever, J Martens, G Dahl, G Hinton
International conference on machine learning, 1139-1147, 2013
26032013
Generating text with recurrent neural networks
I Sutskever, J Martens, GE Hinton
Proceedings of the 28th international conference on machine learning (ICML …, 2011
11032011
Deep learning via hessian-free optimization.
J Martens
ICML 27, 735-742, 2010
7592010
Learning recurrent neural networks with hessian-free optimization
J Martens, I Sutskever
Proceedings of the 28th international conference on machine learning (ICML …, 2011
5742011
Optimizing neural networks with kronecker-factored approximate curvature
J Martens, R Grosse
International conference on machine learning, 2408-2417, 2015
2832015
Adding gradient noise improves learning for very deep networks
A Neelakantan, L Vilnis, QV Le, I Sutskever, L Kaiser, K Kurach, J Martens
arXiv preprint arXiv:1511.06807, 2015
2452015
New insights and perspectives on the natural gradient method
J Martens
arXiv preprint arXiv:1412.1193, 2014
1452014
Training deep and recurrent networks with hessian-free optimization
J Martens, I Sutskever
Neural networks: Tricks of the trade, 479-535, 2012
1412012
The mechanics of n-player differentiable games
D Balduzzi, S Racaniere, J Martens, J Foerster, K Tuyls, T Graepel
arXiv preprint arXiv:1802.05642, 2018
872018
A kronecker-factored approximate fisher matrix for convolution layers
R Grosse, J Martens
International Conference on Machine Learning, 573-582, 2016
852016
On the representational efficiency of restricted boltzmann machines
J Martens, A Chattopadhya, T Pitassi, R Zemel
Advances in Neural Information Processing Systems, 2877-2885, 2013
522013
Estimating the hessian by back-propagating curvature
J Martens, I Sutskever, K Swersky
arXiv preprint arXiv:1206.6464, 2012
392012
On the expressive efficiency of sum product networks
J Martens, V Medabalimi
arXiv preprint arXiv:1411.7717, 2014
352014
Distributed second-order optimization using Kronecker-factored approximations
J Ba, R Grosse, J Martens
312016
Adversarial robustness through local linearization
C Qin, J Martens, S Gowal, D Krishnan, K Dvijotham, A Fawzi, S De, ...
Advances in Neural Information Processing Systems, 13824-13833, 2019
242019
Second-order optimization for neural networks
J Martens
University of Toronto (Canada), 2016
212016
Parallelizable sampling of markov random fields
J Martens, I Sutskever
Proceedings of the Thirteenth International Conference on Artificial …, 2010
202010
Learning the Linear Dynamical System with ASOS.
J Martens
ICML, 743-750, 2010
192010
Weighted gradient method and system for diagnosing disease
M Graovac, J Martens, Z Pavlovic, J Ironstone
US Patent 8,103,337, 2012
152012
Kronecker-factored curvature approximations for recurrent neural networks
J Martens, J Ba, M Johnson
132018
The system can't perform the operation now. Try again later.
Articles 1–20