Yanping Huang

Cited by

	All	Since 2019
Citations	14443	13988
h-index	25	23
i10-index	31	29

5000

2500

1250

3750

201520162017201820192020202120222023202453 85 70 140 494 958 1419 1771 4371 4946

Public access

View all

4 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Quoc V. LeResearch Scientist, GoogleVerified email at stanford.edu
Zhifeng ChenGoogle Inc.Verified email at google.com
Orhan FiratGoogle AIVerified email at google.com
Esteban RealGoogle BrainVerified email at google.com
Ankur BapnaSoftware Engineer, Google DeepmindVerified email at google.com
Yonghui WuGoogle BrainVerified email at google.com
NAN DUAI and Machine Learning Research, AppleVerified email at apple.com
Yuanzhong XuGoogle DeepMindVerified email at utexas.edu
Rajesh P. N. RaoComputer Science and Engineering, University of WashingtonVerified email at cs.washington.edu
Noam ShazeerCharacter.aiVerified email at character.ai
Andrew DaiGoogle DeepMindVerified email at google.com
Yanqi ZhouGoogleVerified email at google.com
William FedusOpenAIVerified email at openai.com
Minh-Thang LuongSenior Staff Research Scientist at GoogleVerified email at google.com
Jeff DeanGoogle Chief Scientist, Google Research and Google DeepMindVerified email at google.com
Zhuohan LiUC BerkeleyVerified email at berkeley.edu
Ion StoicaProfessor of Computer Science, UC BerkeleyVerified email at cs.berkeley.edu
Hanxiao LiuMicrosoft AIVerified email at microsoft.com

Yanping Huang

Google Brain

Verified email at google.com

Artificial Intelligence Deep Learning Machine Learning Systems Computational Neuroscience


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Regularized evolution for image classifier architecture search E Real, A Aggarwal, Y Huang, QV Le Proceedings of the aaai conference on artificial intelligence 33 (01), 4780-4789, 2019	3333	2019
Scaling instruction-finetuned language models HW Chung, L Hou, S Longpre, B Zoph, Y Tay, W Fedus, Y Li, X Wang, ... Journal of Machine Learning Research 25 (70), 1-53, 2024	2247	2024
GPipe: Efficient Training of Giant Neural Networks using Pipeline Parallelism Y Huang, Y Cheng, A Bapna, O Firat, MX Chen, D Chen, HJ Lee, J Ngiam, ... Advances in Neural Information Processing Systems 32, 103--112, 2018	1580	2018
Lamda: Language models for dialog applications R Thoppilan, D De Freitas, J Hall, N Shazeer, A Kulshreshtha, HT Cheng, ... arXiv preprint arXiv:2201.08239, 2022	1365	2022
Palm 2 technical report R Anil, AM Dai, O Firat, M Johnson, D Lepikhin, A Passos, S Shakeri, ... arXiv preprint arXiv:2305.10403, 2023	1056	2023
Gemini: a family of highly capable multimodal models G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ... arXiv preprint arXiv:2312.11805, 2023	1042	2023
Gshard: Scaling giant models with conditional computation and automatic sharding D Lepikhin, HJ Lee, Y Xu, D Chen, O Firat, Y Huang, M Krikun, N Shazeer, ... International Conference on Learning Representations (ICLR), 2020	805	2020
Predictive coding Y Huang, RPN Rao Wiley Interdisciplinary Reviews: Cognitive Science 2 (5), 580-593, 2011	698	2011
Glam: Efficient scaling of language models with mixture-of-experts N Du, Y Huang, AM Dai, S Tong, D Lepikhin, Y Xu, M Krikun, Y Zhou, ... International Conference on Machine Learning, 5547-5569, 2022	544*	2022
Alpa: Automating Inter-and Intra-Operator Parallelism for Distributed Deep Learning L Zheng, Z Li, H Zhang, Y Zhuang, Z Chen, Y Huang, Y Wang, Y Xu, ... 16th USENIX Symposium on Operating Systems Design and Implementation (OSDI …, 2022	239	2022
Lingvo: a modular and scalable framework for sequence-to-sequence modeling J Shen, P Nguyen, Y Wu, Z Chen, MX Chen, Y Jia, A Kannan, T Sainath, ... arXiv preprint arXiv:1902.08295, 2019	203	2019
Just pick a sign: Optimizing deep multitask models with gradient sign dropout Z Chen, J Ngiam, Y Huang, T Luong, H Kretzschmar, Y Chai, D Anguelov Advances in Neural Information Processing Systems 33, 2039-2050, 2020	173	2020
Mixture-of-experts with expert choice routing Y Zhou, T Lei, H Liu, N Du, Y Huang, V Zhao, AM Dai, QV Le, J Laudon Advances in Neural Information Processing Systems 35, 7103-7114, 2022	163	2022
Bigssl: Exploring the frontier of large-scale semi-supervised learning for automatic speech recognition Y Zhang, DS Park, W Han, J Qin, A Gulati, J Shor, A Jansen, Y Xu, ... IEEE Journal of Selected Topics in Signal Processing 16 (6), 1519-1532, 2022	160	2022
Gspmd: general and scalable parallelization for ml computation graphs Y Xu, HJ Lee, D Chen, B Hechtman, Y Huang, R Joshi, M Krikun, ... arXiv preprint arXiv:2105.04663, 2021	102	2021
Designing effective sparse expert models B Zoph, I Bello, S Kumar, N Du, Y Huang, J Dean, N Shazeer, W Fedus arXiv preprint arXiv:2202.08906 2 (3), 17, 2022	84	2022
Beyond distillation: Task-level mixture-of-experts for efficient inference S Kudugunta, Y Huang, A Bapna, M Krikun, D Lepikhin, MT Luong, O Firat arXiv preprint arXiv:2110.03742, 2021	83	2021
{AlpaServe}: Statistical multiplexing with model parallelism for deep learning serving Z Li, L Zheng, Y Zhong, V Liu, Y Sheng, X Jin, Y Huang, Z Chen, H Zhang, ... 17th USENIX Symposium on Operating Systems Design and Implementation (OSDI …, 2023	80	2023
St-moe: Designing stable and transferable sparse expert models B Zoph, I Bello, S Kumar, N Du, Y Huang, J Dean, N Shazeer, W Fedus arXiv preprint arXiv:2202.08906, 2022	77	2022
Building machine translation systems for the next thousand languages A Bapna, I Caswell, J Kreutzer, O Firat, D van Esch, A Siddhant, M Niu, ... arXiv preprint arXiv:2205.03983, 2022	66	2022

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors