Towards understanding the spectral bias of deep learning Y Cao, Z Fang, Y Wu, DX Zhou, Q Gu arXiv preprint arXiv:1912.01198, 2019 | 234 | 2019 |
A Finite Time Analysis of Two Time-Scale Actor Critic Methods Y Wu, W Zhang, P Xu, Q Gu NeurIPS 2020, 2020 | 156 | 2020 |
Towards understanding learning representations: To what extent do different neural networks learn the same representation L Wang, L Hu, J Gu, Z Hu, Y Wu, K He, J Hopcroft Advances in neural information processing systems 31, 2018 | 122 | 2018 |
DNA-GPT: Divergent N-Gram Analysis for Training-Free Detection of GPT-Generated Text X Yang, W Cheng, Y Wu, L Petzold, WY Wang, H Chen arXiv preprint arXiv:2305.17359, 2023 | 68 | 2023 |
Towards understanding the mixture-of-experts layer in deep learning Z Chen, Y Deng, Y Wu, Q Gu, Y Li Advances in neural information processing systems 35, 23049-23062, 2022 | 59 | 2022 |
Self-play preference optimization for language model alignment Y Wu, Z Sun, H Yuan, K Ji, Y Yang, Q Gu arXiv preprint arXiv:2405.00675, 2024 | 56 | 2024 |
Personalized Federated Learning under Mixture of Distributions Y Wu, S Zhang, W Yu, Y Liu, Q Gu, D Zhou, H Chen, W Cheng Fortieth International Conference on Machine Learning (ICML 2023), 2023 | 39 | 2023 |
Nearly minimax optimal regret for learning infinite-horizon average-reward mdps with linear function approximation Y Wu, D Zhou, Q Gu International Conference on Artificial Intelligence and Statistics, 3883-3913, 2022 | 24 | 2022 |
Protein conformation generation via force-guided se (3) diffusion models Y Wang, L Wang, Y Shen, Y Wang, H Yuan, Y Wu, Q Gu arXiv preprint arXiv:2403.14088, 2024 | 11 | 2024 |
Variance-aware regret bounds for stochastic contextual dueling bandits Q Di, T Jin, Y Wu, H Zhao, F Farnoud, Q Gu arXiv preprint arXiv:2310.00968, 2023 | 9 | 2023 |
Borda regret minimization for generalized linear dueling bandits Y Wu, T Jin, H Lou, F Farnoud, Q Gu arXiv preprint arXiv:2303.08816, 2023 | 8 | 2023 |
Active ranking without strong stochastic transitivity H Lou, T Jin, Y Wu, P Xu, Q Gu, F Farnoud Advances in neural information processing systems 35, 297-309, 2022 | 7 | 2022 |
Adaptive sampling for heterogeneous rank aggregation from noisy pairwise comparisons Y Wu, T Jin, H Lou, P Xu, F Farnoud, Q Gu International Conference on Artificial Intelligence and Statistics, 11014-11036, 2022 | 6 | 2022 |
Uniform-PAC guarantees for model-based RL with bounded eluder dimension Y Wu, J He, Q Gu Uncertainty in Artificial Intelligence, 2304-2313, 2023 | 4 | 2023 |
TreeBoN: Enhancing Inference-Time Alignment with Speculative Tree-Search and Best-of-N Sampling J Qiu, Y Lu, Y Zeng, J Guo, J Geng, H Wang, K Huang, Y Wu, M Wang arXiv preprint arXiv:2410.16033, 2024 | | 2024 |
A Common Pitfall of Margin-based Language Model Alignment: Gradient Entanglement H Yuan, Y Zeng, Y Wu, H Wang, M Wang, L Leqi arXiv preprint arXiv:2410.13828, 2024 | | 2024 |