Zhendong Wang
Title / Cited by / Year
Diffusion policies as an expressive policy class for offline reinforcement learning
Z Wang, JJ Hunt, M Zhou
arXiv preprint arXiv:2208.06193, 2022
Cited by: 241; Year: 2022
Diffusion-GAN: Training GANs with diffusion
Z Wang, H Zheng, P He, W Chen, M Zhou
arXiv preprint arXiv:2206.02262, 2022
Cited by: 191; Year: 2022
Patch diffusion: Faster and more data-efficient training of diffusion models
Z Wang, Y Jiang, H Zheng, P Wang, P He, Z Wang, W Chen, M Zhou
Advances in neural information processing systems 36, 2024
Cited by: 114; Year: 2024
In-context learning unlocked for diffusion models
Z Wang, Y Jiang, Y Lu, P He, W Chen, Z Wang, M Zhou
Advances in Neural Information Processing Systems 36, 8542-8562, 2023
Cited by: 42; Year: 2023
Thompson sampling via local uncertainty
Z Wang, M Zhou
International Conference on Machine Learning, 10115-10125, 2020
Cited by: 24; Year: 2020
Implicit Distributional Reinforcement Learning
Y Yue, Z Wang, M Zhou
Advances in Neural Information Processing Systems 33, 7135-7147, 2020
Cited by: 16; Year: 2020
A Behavior Regularized Implicit Policy for Offline Reinforcement Learning
S Yang, Z Wang, H Zheng, Y Feng, M Zhou
arXiv preprint arXiv:2202.09673, 2022
Cited by: 15; Year: 2022
Probabilistic conformal prediction using conditional random samples
Z Wang, R Gao, M Yin, M Zhou, DM Blei
arXiv preprint arXiv:2206.06584, 2022
Cited by: 14; Year: 2022
Score identity distillation: Exponentially fast distillation of pretrained diffusion models for one-step generation
M Zhou, H Zheng, Z Wang, M Yin, H Huang
Forty-first International Conference on Machine Learning, 2024
Cited by: 8; Year: 2024
Beta diffusion
M Zhou, T Chen, Z Wang, H Zheng
Advances in Neural Information Processing Systems 36, 2024
Cited by: 6; Year: 2024
Relative preference optimization: Enhancing LLM alignment through contrasting responses across identical and diverse prompts
Y Yin, Z Wang, Y Gu, H Huang, W Chen, M Zhou
arXiv preprint arXiv:2402.10958, 2024
Cited by: 6; Year: 2024
Adaptive Correlated Monte Carlo for Contextual Categorical Sequence Generation
X Fan, Y Zhang, Z Wang, M Zhou
International Conference on Learning Representations 2020, 2019
Cited by: 4; Year: 2019
Improving in-context learning in diffusion models with visual context-modulated prompts
T Chen, Y Liu, Z Wang, J Yuan, Q You, H Yang, M Zhou
arXiv preprint arXiv:2312.01408, 2023
Cited by: 3; Year: 2023
Long and Short Guidance in Score identity Distillation for One-Step Text-to-Image Generation
M Zhou, Z Wang, H Zheng, H Huang
arXiv preprint arXiv:2406.01561, 2024
Cited by: 2; Year: 2024
Learning stackable and skippable LEGO bricks for efficient, reconfigurable, and variable-resolution diffusion modeling
H Zheng, Z Wang, J Yuan, G Ning, P He, Q You, H Yang, M Zhou
The Twelfth International Conference on Learning Representations, 2023
Cited by: 1; Year: 2023
Diffusion-RPO: Aligning Diffusion Models through Relative Preference Optimization
Y Gu, Z Wang, Y Yin, Y Xie, M Zhou
arXiv preprint arXiv:2406.06382, 2024
Year: 2024
Self-Augmented Preference Optimization: Off-Policy Paradigms for Language Model Alignment
Y Yin, Z Wang, Y Xie, W Chen, M Zhou
arXiv preprint arXiv:2405.20830, 2024
Year: 2024
Diffusion Policies creating a Trust Region for Offline Reinforcement Learning
T Chen, Z Wang, M Zhou
arXiv preprint arXiv:2405.19690, 2024
Year: 2024