Follow
Jincheng Mei
Jincheng Mei
Research Scientist, Google Brain
Verified email at ualberta.ca - Homepage
Title
Cited by
Cited by
Year
On the global convergence rates of softmax policy gradient methods
J Mei, C Xiao, C Szepesvari, D Schuurmans
International Conference on Machine Learning, 6820-6829, 2020
1252020
Locality preserving hashing
K Zhao, H Lu, J Mei
Proceedings of the AAAI Conference on Artificial Intelligence 28 (1), 2014
552014
Escaping the Gravitational Pull of Softmax
J Mei, C Xiao, B Dai, L Li, C Szepesvári, D Schuurmans
Advances in Neural Information Processing Systems 33, 2020
232020
Memory-Augmented Monte Carlo Tree Search
C Xiao, J Mei, M Müller
AAAI, 1455-1462, 2018
212018
Leveraging non-uniformity in first-order non-convex optimization
J Mei, Y Gao, B Dai, C Szepesvari, D Schuurmans
International Conference on Machine Learning, 7555-7564, 2021
192021
Maximum entropy monte-carlo planning
C Xiao, R Huang, J Mei, D Schuurmans, M Müller
Advances in Neural Information Processing Systems, 9520-9528, 2019
192019
On principled entropy exploration in policy optimization
J Mei, C Xiao, R Huang, D Schuurmans, M Müller
Proceedings of the 28th International Joint Conference on Artificial …, 2019
152019
Identifying and Tracking Sentiments and Topics from Social Media Texts during Natural Disasters
M Yang, J Mei, H Ji, W Zhao, Z Zhao, X Chen
Proceedings of the 2017 Conference on Empirical Methods in Natural Language …, 2017
132017
Discovering author interest evolution in topic modeling
M Yang, J Mei, F Xu, W Tu, Z Lu
Proceedings of the 39th International ACM SIGIR conference on Research and …, 2016
132016
On the optimality of batch policy optimization algorithms
C Xiao, Y Wu, J Mei, B Dai, T Lattimore, L Li, C Szepesvari, ...
International Conference on Machine Learning, 11362-11371, 2021
112021
Frequency-based Search-control in Dyna
Y Pan, J Mei, A Farahmand
arXiv preprint arXiv:2002.05822, 2020
92020
On unconstrained quasi-submodular function optimization
J Mei, K Zhao, BL Lu
Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015
52015
On the Effect of Log-Barrier Regularization in Decentralized Softmax Gradient Play in Multiagent Systems
R Zhang, J Mei, B Dai, D Schuurmans, N Li
arXiv preprint arXiv:2202.00872, 2022
42022
Understanding the effect of stochasticity in policy optimization
J Mei, B Dai, C Xiao, C Szepesvari, D Schuurmans
Advances in Neural Information Processing Systems 34, 19339-19351, 2021
32021
Understanding and Leveraging Overparameterization in Recursive Value Estimation
C Xiao, B Dai, J Mei, OA Ramirez, R Gummadi, C Harris, D Schuurmans
International Conference on Learning Representations, 2021
22021
Beyond Prioritized Replay: Sampling States in Model-Based RL via Simulated Priorities
J Mei, Y Pan, M White, A Farahmand, H Yao
12020
On the Reducibility of Submodular Functions
J Mei, H Zhang, BL Lu
Artificial Intelligence and Statistics, 186-194, 2016
12016
Understanding and mitigating the limitations of prioritized experience replay
Y Pan, J Mei, A Farahmand, M White, H Yao, M Rohani, J Luo
Uncertainty in Artificial Intelligence, 1561-1571, 2022
2022
KL-Entropy-Regularized RL with a Generative Model is Minimax Optimal
T Kozuno, W Yang, N Vieillard, T Kitamura, Y Tang, J Mei, P Ménard, ...
arXiv preprint arXiv:2205.14211, 2022
2022
Understanding and Mitigating the Limitations of Prioritized Replay
Y Pan, J Mei, A Farahmand, M White, H Yao, M Rohani, J Luo
The 38th Conference on Uncertainty in Artificial Intelligence, 2022
2022
The system can't perform the operation now. Try again later.
Articles 1–20