Shuxin Zheng
Senior Researcher, Microsoft Research Asia
Verified email at microsoft.com
Title
Cited by
Year
Asynchronous stochastic gradient descent with delay compensation
S Zheng, Q Meng, T Wang, W Chen, N Yu, ZM Ma, TY Liu
Proceedings of the 34th International Conference on Machine Learning, PMLR …, 2017
Cited by: 175* | Year: 2017
On layer normalization in the transformer architecture
R Xiong, Y Yang, D He, K Zheng, S Zheng, C Xing, H Zhang, Y Lan, ...
Proceedings of the 37th International Conference on Machine Learning, 2020, 2020
Cited by: 123 | Year: 2020
Invertible Image Rescaling
M Xiao, S Zheng, C Liu, Y Wang, D He, G Ke, J Bian, Z Lin, TY Liu
European Conference on Computer Vision (ECCV) 2020, 126-144, 2020
Cited by: 44 | Year: 2020
Cross-Iteration Batch Normalization
Z Yao, Y Cao, S Zheng, G Huang, S Lin
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021, 2020
Cited by: 29 | Year: 2020
Deep learning for prediction of the air quality response to emission changes
J Xing, S Zheng, D Ding, JT Kelly, S Wang, S Li, T Qin, M Ma, Z Dong, ...
Environmental science & technology 54 (14), 8589-8600, 2020
Cited by: 18 | Year: 2020
G-SGD: Optimizing ReLU Neural Networks in its Positively Scale-Invariant Space
Q Meng, S Zheng, H Zhang, W Chen, Q Ye, ZM Ma, TY Liu
Proceedings of the 7th International Conference on Learning Representations …, 2018
Cited by: 17 | Year: 2018
Capacity control of relu neural networks by basis-path norm
S Zheng, Q Meng, H Zhang, W Chen, N Yu, TY Liu
Proceedings of the 33rd AAAI Conference on Artificial Intelligence, 2019, 2018
Cited by: 13 | Year: 2018
Do Transformers Really Perform Bad for Graph Representation?
C Ying, T Cai, S Luo, S Zheng, G Ke, D He, Y Shen, TY Liu
Advances in Neural Information Processing Systems, 2021 (NeurIPS 2021), 2021
Cited by: 11 | Year: 2021
Modeling Lost Information in Lossy Image Compression
Y Wang, M Xiao, C Liu, S Zheng, TY Liu
arXiv preprint arXiv:2006.11999, 2020
Cited by: 8 | Year: 2020
Mc-bert: Efficient language pre-training via a meta controller
Z Xu, L Gong, G Ke, D He, S Zheng, L Wang, J Bian, TY Liu
arXiv preprint arXiv:2006.05744, 2020
Cited by: 7 | Year: 2020
OptQuant: Distributed Training of Neural Networks with Optimized Quantization Mechanisms
L He, S Zheng, W Chen, ZM Ma, TY Liu
Neurocomputing 340, Pages 233-244, 2019
Cited by: 4 | Year: 2019
Stable, Fast and Accurate: Kernelized Attention with Relative Positional Encoding
S Luo, S Li, T Cai, D He, D Peng, S Zheng, G Ke, L Wang, TY Liu
Advances in Neural Information Processing Systems, 2021 (NeurIPS 2021), 2021
Cited by: 2 | Year: 2021
First Place Solution of KDD Cup 2021 & OGB Large-Scale Challenge Graph Prediction Track
C Ying, M Yang, S Zheng, G Ke, S Luo, T Cai, C Wu, Y Wang, Y Shen, ...
arXiv preprint arXiv:2106.08279, 2021
Cited by: 1 | Year: 2021
Revisiting Language Encoding in Learning Multilingual Representations
S Luo, K Gao, S Zheng, G Ke, D He, L Wang, TY Liu
arXiv preprint arXiv:2102.08357, 2021
Cited by: 1 | Year: 2021
Mimicking atmospheric photochemical modeling with a deep neural network
J Xing, S Zheng, S Li, L Huang, X Wang, JT Kelly, S Wang, C Liu, C Jang, ...
Atmospheric Research, 105919, 2021
Year: 2021
How could Neural Networks understand Programs?
D Peng, S Zheng, Y Li, G Ke, D He, TY Liu
Proceedings of International Conference on Machine Learning, 2021 (ICML 2021 …, 2021
Year: 2021