Colossal-AI: A unified deep learning system for large-scale parallel training S Li, H Liu, Z Bian, J Fang, H Huang, Y Liu, B Wang, Y You Proceedings of the 52nd International Conference on Parallel Processing, 766-775, 2023 | 104 | 2023 |
Sequence Parallelism: Long Sequence Training from System Perspective S Li, F Xue, C Baranwal, Y Li, Y You Proceedings of the 61st Annual Meeting of the Association for Computational …, 2023 | 59* | 2023 |
Parallel training of pre-trained models via chunk-based dynamic memory management J Fang, Z Zhu, S Li, H Su, Y Yu, J Zhou, Y You IEEE Transactions on Parallel and Distributed Systems 34 (1), 304-315, 2022 | 31 | 2022 |
Online evolutionary batch size orchestration for scheduling deep learning workloads in GPU clusters Z Bian, S Li, W Wang, Y You Proceedings of the International Conference for High Performance Computing …, 2021 | 24 | 2021 |
Glide with a cape: A low-hassle method to accelerate speculative decoding C Du, J Jiang, X Yuanchen, J Wu, S Yu, Y Li, S Li, K Xu, L Nie, Z Tu, ... arXiv preprint arXiv:2402.02082, 2024 | 3 | 2024 |
Critique of” MemXCT: memory-centric X-ray CT reconstruction with massive parallelization” by SCC Team from Nanyang Technological University S Li, BS Lee IEEE Transactions on Parallel and Distributed Systems 33 (9), 2058-2061, 2021 | | 2021 |