Weilin Zhao

Cited by

	All	Since 2019
Citations	1252	1252
h-index	7	7
i10-index	6	6

540

270

135

405

202120222023202425 194 506 524

Public access

View all

5 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Maosong SunProfessor of Computer Science and Technology, Tsinghua UniversityVerified email at tsinghua.edu.cn
Zhiyuan LiuAssociate Professor, Tsinghua UniversityVerified email at tsinghua.edu.cn
Xu HanTsinghua UniversityVerified email at mails.tsinghua.edu.cn
Shengding HuTsinghua UniversityVerified email at mails.tsinghua.edu.cn

Weilin Zhao

Tsinghua University

Verified email at mails.tsinghua.edu.cn - Homepage

Efficient LLM


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Ptr: Prompt tuning with rules for text classification X Han, W Zhao, N Ding, Z Liu, M Sun AI Open 3, 182-192, 2022	429	2022
Parameter-efficient fine-tuning of large-scale pre-trained language models N Ding, Y Qin, G Yang, F Wei, Z Yang, Y Su, S Hu, Y Chen, CM Chan, ... Nature Machine Intelligence 5 (3), 220-235, 2023	315	2023
Openprompt: An open-source framework for prompt-learning N Ding, S Hu, W Zhao, Y Chen, Z Liu, HT Zheng, M Sun arXiv preprint arXiv:2111.01998, 2021	242	2021
Delta Tuning: A Comprehensive Study of Parameter Efficient Methods for Pre-trained Language Models N Ding, Y Qin, G Yang, F Wei, Z Yang, Y Su, S Hu, Y Chen, CM Chan, ... arXiv preprint arXiv:2203.06904, 2022	184	2022
MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies S Hu, Y Tu, X Han, C He, G Cui, X Long, Z Zheng, Y Fang, Y Huang, ... arXiv preprint arXiv:2404.06395, 2024	30	2024
Moderate-fitting as a Natural Backdoor Defender for Pre-trained Language Models B Zhu, Y Qin, G Cui, Y Chen, W Zhao, C Fu, Y Deng, Z Liu, J Wang, W Wu, ... Advances in Neural Information Processing Systems 35, 1086-1099, 2022	17	2022
OpenDelta: A Plug-and-play Library for Parameter-efficient Adaptation of Pre-trained Models S Hu, N Ding, W Zhao, X Lv, Z Zhang, Z Liu, M Sun arXiv preprint arXiv:2307.03084, 2023	8	2023
Delta tuning: A comprehensive study of parameter efficient methods for pre-trained language models. CoRR, abs/2203.06904, 2022. doi: 10.48550 N Ding, Y Qin, G Yang, F Wei, Z Yang, Y Su, S Hu, Y Chen, CM Chan, ... arXiv preprint arXiv.2203.06904, 0	7
BMCook: A task-agnostic compression toolkit for big models Z Zhang, B Gong, Y Chen, X Han, G Zeng, W Zhao, Y Chen, Z Liu, M Sun Proceedings of the 2022 Conference on Empirical Methods in Natural Language …, 2022	6	2022
BMInf: An Efficient Toolkit for Big Model Inference and Tuning X Han, G Zeng, W Zhao, Z Liu, Z Zhang, J Zhou, J Zhang, J Chao, M Sun Proceedings of the 60th Annual Meeting of the Association for Computational …, 2022	5	2022
CPET: Effective Parameter-Efficient Tuning for Compressed Large Language Models W Zhao, Y Huang, X Han, Z Liu, Z Zhang, M Sun arXiv preprint arXiv:2307.07705, 2023	4	2023
Ouroboros: Speculative Decoding with Large Model Enhanced Drafting W Zhao, Y Huang, X Han, C Xiao, Z Liu, M Sun arXiv preprint arXiv:2402.13720, 2024	2	2024
Unlock Predictable Scaling from Emergent Abilities S Hu, X Liu, X Han, X Zhang, C He, W Zhao, Y Lin, N Ding, Z Ou, G Zeng, ... arXiv preprint arXiv:2310.03262, 2023	2	2023
BurstAttention: An Efficient Distributed Attention Framework for Extremely Long Sequences S Ao, W Zhao, X Han, C Yang, Z Liu, C Shi, M Sun, S Wang, T Su arXiv preprint arXiv:2403.09347, 2024	1	2024
Seq1F1B: Efficient Sequence-Level Pipeline Parallelism for Large Language Model Training S Ao, W Zhao, X Han, C Yang, Z Liu, C Shi, M Sun arXiv preprint arXiv:2406.03488, 2024		2024
H3T: efficient integration of memory optimization and parallelism for high-throughput transformer training Y Wang, X Han, W Zhao, G Zeng, Z Liu, M Sun Proceedings of the 37th International Conference on Neural Information …, 2023		2023
Predicting Emergent Abilities with Infinite Resolution Evaluation S Hu, X Liu, X Han, X Zhang, C He, W Zhao, Y Lin, N Ding, Z Ou, G Zeng, ... The Twelfth International Conference on Learning Representations, 2023		2023
Tool learning with foundation models Y Qin, S Hu, Y Lin, W Chen, N Ding, G Cui, Z Zeng, Y Huang, C Xiao, ... arXiv preprint arXiv:2304.08354, 2023		2023
Optimal Rope Extension Via Bayesian Optimization for Training-Free Length Generalization X Zhang, S Hu, W Zhao, H Wang, X Han, C He, Z Liu, M Sun Available at SSRN 4765732, 0

The system can't perform the operation now. Try again later.

Articles 1–19

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors