Bloom: A 176b-parameter open-access multilingual language model BS Workshop, TL Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow, ... arXiv preprint arXiv:2211.05100, 2022 | 700 | 2022 |
Prompting gpt-3 to be reliable C Si, Z Gan, Z Yang, S Wang, J Wang, J Boyd-Graber, L Wang arXiv preprint arXiv:2210.09150, 2022 | 83 | 2022 |
CharBERT: character-aware pre-trained language model W Ma, Y Cui, C Si, T Liu, S Wang, G Hu arXiv preprint arXiv:2011.01513, 2020 | 81 | 2020 |
Between words and characters: a brief history of open-vocabulary modeling and tokenization in nlp SJ Mielke, Z Alyafeai, E Salesky, C Raffel, M Dey, M Gallé, A Raja, C Si, ... arXiv preprint arXiv:2112.10508, 2021 | 72* | 2021 |
Better robustness by more coverage: Adversarial and mixup data augmentation for robust finetuning C Si, Z Zhang, F Qi, Z Liu, Y Wang, Q Liu, M Sun Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021 …, 2021 | 65* | 2021 |
What does bert learn from multiple-choice reading comprehension datasets? C Si, S Wang, MY Kan, J Jiang arXiv preprint arXiv:1910.12391, 2019 | 43 | 2019 |
Benchmarking robustness of machine reading comprehension models C Si, Z Yang, Y Cui, W Ma, T Liu, S Wang arXiv preprint arXiv:2004.14004, 2020 | 28 | 2020 |
What's in a Name? Answer Equivalence For Open-Domain Question Answering C Si, C Zhao, J Boyd-Graber arXiv preprint arXiv:2109.05289, 2021 | 13 | 2021 |
Measuring Inductive Biases of In-Context Learning with Underspecified Demonstrations C Si, D Friedman, N Joshi, S Feng, D Chen, H He arXiv preprint arXiv:2305.13299, 2023 | 12* | 2023 |
Sentiment aware neural machine translation C Si, K Wu, A Aw, MY Kan Proceedings of the 6th Workshop on Asian Translation, 200-206, 2019 | 12 | 2019 |
Re-Examining Calibration: The Case of Question Answering C Si, C Zhao, S Min, J Boyd-Graber Findings of the Association for Computational Linguistics: EMNLP 2022, 2814-2829, 2022 | 10* | 2022 |
Sub-character tokenization for chinese pretrained language models C Si, Z Zhang, Y Chen, F Qi, X Wang, Z Liu, Y Wang, Q Liu, M Sun Transactions of the Association for Computational Linguistics 11, 469-487, 2023 | 9* | 2023 |
Dataset mention extraction and classification A Prasad, C Si, MY Kan Proceedings of the Workshop on Extracting Structured Knowledge from …, 2019 | 9 | 2019 |
Large Language Models Help Humans Verify Truthfulness--Except When They Are Convincingly Wrong C Si, N Goyal, ST Wu, C Zhao, S Feng, H Daumé III, J Boyd-Graber arXiv preprint arXiv:2310.12558, 2023 | | 2023 |
Mixture of Prompt Experts for Generalizable and Interpretable Question Answering C Si, W Shi, C Zhao, L Zettlemoyer, J Boyd-Graber arXiv preprint arXiv:2305.14628, 2023 | | 2023 |
READIN: A Chinese Multi-Task Benchmark with Realistic and Diverse Input Noises C Si, Z Zhang, Y Chen, X Wang, Z Liu, M Sun arXiv preprint arXiv:2302.07324, 2023 | | 2023 |
Ignore This Title and HackAPrompt: Exposing Systemic Vulnerabilities of LLMs through a Global Scale Prompt Hacking Competition S Schulhoff, J Pinto, A Khan, LF Bouchard, C Si, S Anati, V Tagliabue, ... | | |