Learning to incentivize information acquisition: Proper scoring rules meet principal-agent model | S Chen, J Wu, Y Wu, Z Yang | International Conference on Machine Learning, 5194-5218, 2023 | 5 | 2023 |
Adaptive model design for Markov decision process | S Chen, D Yang, J Li, S Wang, Z Yang, Z Wang | International Conference on Machine Learning, 3679-3700, 2022 | 5 | 2022 |
Wasserstein flow meets replicator dynamics: A mean-field analysis of representation learning in actor-critic | Y Zhang, S Chen, Z Yang, M Jordan, Z Wang | Advances in Neural Information Processing Systems 34, 15993-16006, 2021 | 4 | 2021 |
A unified framework of policy learning for contextual bandit with confounding bias and missing observations | S Chen, Y Wang, Z Wang, Z Yang | arXiv preprint arXiv:2303.11187, 2023 | 3 | 2023 |
Training Dynamics of Multi-Head Softmax Attention for In-Context Learning: Emergence, Convergence, and Optimality | S Chen, H Sheen, T Wang, Z Yang | arXiv preprint arXiv:2402.19442, 2024 | 2 | 2024 |
Actions Speak What You Want: Provably Sample-Efficient Reinforcement Learning of the Quantal Stackelberg Equilibrium from Strategic Feedbacks | S Chen, M Wang, Z Yang | arXiv preprint arXiv:2307.14085, 2023 | 1 | 2023 |
Implicit Regularization of Gradient Flow on One-Layer Softmax Attention | H Sheen, S Chen, T Wang, HH Zhou | arXiv preprint arXiv:2403.08699, 2024 | | 2024 |