PointCLIP: Point cloud understanding by CLIP R Zhang, Z Guo, W Zhang, K Li, X Miao, B Cui, Y Qiao, P Gao, H Li Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022 | 425 | 2022 |
SpecInfer: Accelerating large language model serving with tree-based speculative inference and verification X Miao, G Oliaro, Z Zhang, X Cheng, Z Wang, Z Zhang, RYY Wong, A Zhu, ... Proceedings of the 29th ACM International Conference on Architectural …, 2024 | 164 | 2024 |
CALIP: Zero-shot enhancement of CLIP with parameter-free attention Z Guo, R Zhang, L Qiu, X Ma, X Miao, X He, B Cui Proceedings of the AAAI Conference on Artificial Intelligence 37 (1), 746-754, 2023 | 99 | 2023 |
Reliable data distillation on graph convolutional network W Zhang, X Miao, Y Shao, J Jiang, L Chen, O Ruas, B Cui Proceedings of the 2020 ACM SIGMOD international conference on management of …, 2020 | 86 | 2020 |
Towards efficient generative large language model serving: A survey from algorithms to systems X Miao, G Oliaro, Z Zhang, X Cheng, H Jin, T Chen, Z Jia arXiv preprint arXiv:2312.15234, 2023 | 59 | 2023 |
Heterogeneity-aware distributed machine learning training via partial reduce X Miao, X Nie, Y Shao, Z Yang, J Jiang, L Ma, B Cui Proceedings of the 2021 International Conference on Management of Data, 2262 …, 2021 | 57 | 2021 |
HET: Scaling out huge embedding model training via cache-enabled distributed framework X Miao, H Zhang, Y Shi, X Nie, Z Yang, Y Tao, B Cui Proceedings of the VLDB Endowment 15 (2), 312-320, 2021 | 53 | 2021 |
Galvatron: Efficient transformer training over multiple GPUs using automatic parallelism X Miao, Y Wang, Y Jiang, C Shi, X Nie, H Zhang, B Cui arXiv preprint arXiv:2211.13878, 2022 | 50 | 2022 |
Distributed graph neural network training: A survey Y Shao, H Li, X Gu, H Yin, Y Li, X Miao, W Zhang, B Cui, L Chen ACM Computing Surveys 56 (8), 1-39, 2024 | 42 | 2024 |
DeGNN: Improving graph neural networks with graph decomposition X Miao, NM Gürel, W Zhang, Z Han, B Li, W Min, SX Rao, H Ren, Y Shan, ... Proceedings of the 27th ACM SIGKDD conference on knowledge discovery & data …, 2021 | 41* | 2021 |
SpotServe: Serving generative large language models on preemptible instances X Miao, C Shi, J Duan, X Xi, D Lin, B Cui, Z Jia Proceedings of the 29th ACM International Conference on Architectural …, 2024 | 37 | 2024 |
Lasagne: A multi-layer graph convolutional network framework via node-aware deep architecture X Miao, W Zhang, Y Shao, B Cui, L Chen, C Zhang, J Jiang IEEE Transactions on Knowledge and Data Engineering 35 (2), 1721-1733, 2021 | 37 | 2021 |
FlexMoE: Scaling large-scale sparse pre-trained model training via dynamic device placement X Nie, X Miao, Z Wang, Z Yang, J Xue, L Ma, G Cao, B Cui Proceedings of the ACM on Management of Data 1 (1), 1-19, 2023 | 33 | 2023 |
EvoMoE: An evolutional mixture-of-experts training framework via dense-to-sparse gate X Nie, X Miao, S Cao, L Ma, Q Liu, J Xue, Y Miao, Y Liu, Z Yang, B Cui arXiv preprint arXiv:2112.14397, 2021 | 33 | 2021 |
PS2: Parameter server on Spark Z Zhang, B Cui, Y Shao, L Yu, J Jiang, X Miao Proceedings of the 2019 International Conference on Management of Data, 376-388, 2019 | 29 | 2019 |
ROD: reception-aware online distillation for sparse graphs W Zhang, Y Jiang, Y Li, Z Sheng, Y Shen, X Miao, L Wang, Z Yang, B Cui Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data …, 2021 | 27 | 2021 |
Dense-to-sparse gate for mixture-of-experts X Nie, S Cao, X Miao, L Ma, J Xue, Y Miao, Z Yang, Z Yang, B Cui | 27 | 2021 |
PSGraph: How Tencent trains extremely large-scale graphs with Spark? J Jiang, P Xiao, L Yu, X Li, J Cheng, X Miao, Z Zhang, B Cui 2020 IEEE 36th International Conference on Data Engineering (ICDE), 1549-1557, 2020 | 26 | 2020 |
Towards communication-efficient vertical federated learning training via cache-enabled local updates F Fu, X Miao, J Jiang, H Xue, B Cui arXiv preprint arXiv:2207.14628, 2022 | 25 | 2022 |
HetuMoE: An efficient trillion-scale mixture-of-expert distributed training system X Nie, P Zhao, X Miao, T Zhao, B Cui arXiv preprint arXiv:2203.14685, 2022 | 25 | 2022 |