yatai ji
Verified email at mails.tsinghua.edu.cn
Title
Cited by
Year
Control-A-Video: Controllable Text-to-Video Generation with Diffusion Models
W Chen, Y Ji, J Wu, H Wu, P Xie, J Li, X Xia, X Xiao, L Lin
arXiv preprint arXiv:2305.13840, 2023
Cited by: 44; Year: 2023
MAP: Multimodal Uncertainty-Aware Vision-Language Pre-training Model
Y Ji, J Wang, Y Gong, L Zhang, Y Zhu, H Wang, J Zhang, T Sakai, Y Yang
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
Cited by: 15*; Year: 2023
MIRTT: Learning multimodal interaction representations from trilinear transformers for visual question answering
J Wang, Y Ji, J Sun, Y Yang, T Sakai
Findings of the Association for Computational Linguistics: EMNLP 2021, 2280-2292, 2021
Cited by: 12; Year: 2021
Seeing What You Miss: Vision-Language Pre-training with Semantic Completion Learning
Y Ji, R Tu, J Jiang, W Kong, C Cai, W Zhao, H Wang, Y Yang, W Liu
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
Cited by: 10; Year: 2023
Multimodal prototype-enhanced network for few-shot action recognition
X Ni, Y Liu, H Wen, Y Ji, J Xiao, Y Yang
arXiv preprint arXiv:2212.04873, 2022
Cited by: 7; Year: 2022
Bridging the Gap: A Unified Video Comprehension Framework for Moment Retrieval and Highlight Detection
Y Xiao, Z Luo, Y Liu, Y Ma, H Bian, Y Ji, Y Yang, X Li
arXiv preprint arXiv:2311.16464, 2023
Cited by: 4; Year: 2023
3D face reconstruction system from a single photo based on regression neural network
Y Ji, K Li, H Wu, G Xiong, Z Shen, X Shang, B Xi
IFAC-PapersOnLine 53 (5), 71-76, 2020
Cited by: 2; Year: 2020
Modeling Multimodal Uncertainties via Probability Distribution Encoders Included Vision-Language Models
J Wang, Y Ji, Y Zhang, Y Zhu, T Sakai
IEEE Access, 2023
Year: 2023
Global and Local Semantic Completion Learning for Vision-Language Pre-training
RC Tu, Y Ji, J Jiang, W Kong, C Cai, W Zhao, H Wang, Y Yang, W Liu
arXiv preprint arXiv:2306.07096, 2023
Year: 2023