Follow
Jiashi Feng
Jiashi Feng
ByteDance Inc.
Verified email at bytedance.com - Homepage
Title
Cited by
Year
Magic-Me: Identity-Specific Video Customized Diffusion
Z Ma, D Zhou, CH Yeh, XS Wang, X Li, H Yang, Z Dong, K Keutzer, ...
arXiv preprint arXiv:2402.09368, 2024
12024
Xagen: 3d expressive human avatars generation
Z Xu, J Zhang, JH Liew, J Feng, MZ Shou
Advances in Neural Information Processing Systems 36, 2024
32024
Expanding small-scale datasets with guided imagination
Y Zhang, D Zhou, B Hooi, K Wang, J Feng
Advances in Neural Information Processing Systems 36, 2024
202024
Depth anything: Unleashing the power of large-scale unlabeled data
L Yang, B Kang, Z Huang, X Xu, J Feng, H Zhao
Computer Vision and Pattern Recognition (CVPR), 2024, 2024
312024
MagicVideo-V2: Multi-Stage High-Aesthetic Video Generation
W Wang, J Liu, Z Lin, J Yan, S Chen, C Low, T Hoang, J Wu, JH Liew, ...
arXiv preprint arXiv:2401.04468, 2024
12024
Harnessing Diffusion Models for Visual Perception with Meta Prompts
Q Wan, Z Huang, B Kang, J Feng, L Zhang
arXiv preprint arXiv:2312.14733, 2023
22023
DREAM-Talk: Diffusion-based Realistic Emotional Audio-driven Method for Single Image Talking Face Generation
C Zhang, C Wang, J Zhang, H Xu, G Song, Y Xie, L Luo, Y Tian, X Guo, ...
arXiv preprint arXiv:2312.13578, 2023
12023
Video Recognition in Portrait Mode
M Han, L Yang, X Jin, J Feng, X Chang, H Wang
Computer Vision and Pattern Recognition (CVPR), 2024, 2023
2023
Towards Accurate Guided Diffusion Sampling through Symplectic Adjoint Method
J Pan, H Yan, JH Liew, J Feng, VYF Tan
arXiv preprint arXiv:2312.12030, 2023
12023
Vista-LLaMA: Reliable Video Narrator via Equal Distance to Visual Tokens
F Ma, X Jin, H Wang, Y Xian, J Feng, Y Yang
Computer Vision and Pattern Recognition (CVPR), 2024, 2023
72023
PixelLM: Pixel Reasoning with Large Multimodal Model
Z Ren, Z Huang, Y Wei, Y Zhao, D Fu, J Feng, X Jin
Computer Vision and Pattern Recognition (CVPR), 2024, 2023
52023
Avatarstudio: High-fidelity and animatable 3d avatar creation from text
J Zhang, X Zhang, H Zhang, JH Liew, C Zhang, Y Yang, J Feng
arXiv preprint arXiv:2311.17917, 2023
42023
Contrastive masked autoencoders are stronger vision learners
Z Huang, X Jin, C Lu, Q Hou, MM Cheng, D Fu, X Shen, J Feng
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023
792023
Magicanimate: Temporally consistent human image animation using diffusion model
Z Xu, J Zhang, JH Liew, H Yan, JW Liu, C Zhang, J Feng, MZ Shou
Computer Vision and Pattern Recognition (CVPR), 2024, 2023
142023
MAgIC: Benchmarking Large Language Model Powered Multi-Agent in Cognition, Adaptability, Rationality and Collaboration
L Xu, Z Hu, D Zhou, H Ren, Z Dong, K Keutzer, SK Ng, J Feng
arXiv preprint arXiv:2311.08562, 2023
2023
ChatAnything: Facetime Chat with LLM-Enhanced Personas
Y Zhao, X Yuan, S Gao, Z Lin, Q Hou, J Feng, D Zhou
arXiv preprint arXiv:2311.06772, 2023
2023
EPIM: Efficient Processing-In-Memory Accelerators based on Epitome
C Wang, Z Dong, D Zhou, Z Zhu, Y Wang, J Feng, K Keutzer
arXiv preprint arXiv:2311.07620, 2023
2023
Metaformer baselines for vision
W Yu, C Si, P Zhou, M Luo, Y Zhou, J Feng, S Yan, X Wang
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023
582023
Low-Resolution Self-Attention for Semantic Segmentation
YH Wu, SC Zhang, Y Liu, L Zhang, X Zhan, D Zhou, J Feng, MM Cheng, ...
arXiv preprint arXiv:2310.05026, 2023
12023
Maskdiffusion: Boosting text-to-image consistency with conditional mask
Y Zhou, D Zhou, ZL Zhu, Y Wang, Q Hou, J Feng
arXiv preprint arXiv:2309.04399, 2023
22023
The system can't perform the operation now. Try again later.
Articles 1–20