PaLM: Scaling Language Modeling with Pathways A Chowdhery, S Narang, J Devlin, M Bosma, G Mishra, A Roberts, ... arXiv preprint arXiv:2204.02311, 2022 | 382 | 2022 |
Finetuned language models are zero-shot learners J Wei, M Bosma, VY Zhao, K Guu, AW Yu, B Lester, N Du, AM Dai, QV Le ICLR 2022, 2021 | 283 | 2021 |
Chain of thought prompting elicits reasoning in large language models J Wei, X Wang, D Schuurmans, M Bosma, E Chi, Q Le, D Zhou NeurIPS 2022, 2022 | 232* | 2022 |
Lamda: Language models for dialog applications R Thoppilan, D De Freitas, J Hall, N Shazeer, A Kulshreshtha, HT Cheng, ... arXiv preprint arXiv:2201.08239, 2022 | 190* | 2022 |
Program synthesis with large language models J Austin, A Odena, M Nye, M Bosma, H Michalewski, D Dohan, E Jiang, ... arXiv preprint arXiv:2108.07732, 2021 | 143 | 2021 |
GLaM: Efficient scaling of language models with mixture-of-experts N Du, Y Huang, AM Dai, S Tong, D Lepikhin, Y Xu, M Krikun, Y Zhou, ... International Conference on Machine Learning, 5547-5569, 2022 | 95* | 2022 |
Show your work: Scratchpads for intermediate computation with language models M Nye, AJ Andreassen, G Gur-Ari, H Michalewski, J Austin, D Bieber, ... ICLR 2022 Workshop DL4C, 2021 | 87 | 2021 |
Emergent abilities of large language models J Wei, Y Tay, R Bommasani, C Raffel, B Zoph, S Borgeaud, D Yogatama, ... Transactions on Machine Learning Research, 2022b, 2022 | 76* | 2022 |
Beyond the imitation game: Quantifying and extrapolating the capabilities of language models A Srivastava, A Rastogi, A Rao, AAM Shoeb, A Abid, A Fisch, AR Brown, ... arXiv preprint arXiv:2206.04615, 2022 | 69 | 2022 |
A framework for unsupervised spam detection in social networking sites M Bosma, E Meij, W Weerkamp European Conference on Information Retrieval, 364-375, 2012 | 52 | 2012 |
Scaling up models and data with t5x and seqio A Roberts, HW Chung, A Levskaya, G Mishra, J Bradbury, D Andor, ... arXiv preprint arXiv:2203.17189 13, 2022 | 29 | 2022 |
System and method for automatically selecting images to accompany text M Heyward, M Bosma, S Brotherton, C DePue III, MEG Contreras, ... US Patent 9,075,812, 2015 | 6 | 2015 |