Yiwu Zhong
Title
Cited by
Year
Grounded Language-Image Pre-training
LH Li, P Zhang, H Zhang, J Yang, C Li, Y Zhong, L Wang, L Yuan, ...
Computer Vision and Pattern Recognition (CVPR), 2022
Cited by: 660; Year: 2022
RegionCLIP: Region-based Language-Image Pretraining
Y Zhong, J Yang, P Zhang, C Li, N Codella, LH Li, L Zhou, X Dai, L Yuan, ...
Computer Vision and Pattern Recognition (CVPR), 2022
Cited by: 338; Year: 2022
Comprehensive Image Captioning via Scene Graph Decomposition
Y Zhong, L Wang, J Chen, D Yu, Y Li
European Conference on Computer Vision (ECCV), 2020
Cited by: 121; Year: 2020
Learning to Generate Scene Graph from Natural Language Supervision
Y Zhong, J Shi, J Yang, C Xu, Y Li
International Conference on Computer Vision (ICCV), 2021
Cited by: 66; Year: 2021
A Simple Baseline for Weakly-Supervised Scene Graph Generation
J Shi, Y Zhong, N Xu, Y Li, C Xu
International Conference on Computer Vision (ICCV), 2021
Cited by: 27; Year: 2021
GPT-4V in Wonderland: Large Multimodal Models for Zero-Shot Smartphone GUI Navigation
A Yan, Z Yang, W Zhu, K Lin, L Li, J Wang, J Yang, Y Zhong, J McAuley, ...
arXiv preprint arXiv:2311.07562, 2023
Cited by: 21; Year: 2023
Learning Concise and Descriptive Attributes for Visual Recognition
A Yan*, Y Wang*, Y Zhong*, C Dong, Z He, Y Lu, W Wang, J Shang, ...
International Conference on Computer Vision (ICCV), 2023
Cited by: 21; Year: 2023
Learning Procedure-Aware Video Representation From Instructional Videos and Their Narrations
Y Zhong, L Yu, Y Bai, S Li, X Yan, Y Li
Computer Vision and Pattern Recognition (CVPR), 2023
Cited by: 9; Year: 2023
Robust and Interpretable Medical Image Classifiers via Concept Bottleneck Models
A Yan, Y Wang, Y Zhong, Z He, P Karypis, Z Wang, C Dong, A Gentili, ...
arXiv preprint arXiv:2310.03182, 2023
Cited by: 6; Year: 2023
Beyond Embeddings: The Promise of Visual Table in Multi-Modal Models
Y Zhong, ZY Hu, MR Lyu, L Wang
arXiv preprint arXiv:2403.18252, 2024
2024
Towards Learning a Generalist Model for Embodied Navigation
D Zheng, S Huang, L Zhao, Y Zhong, L Wang
arXiv preprint arXiv:2312.02010, 2023
2023
Learning Visual Knowledge From Natural Language Supervision
Y Zhong
The University of Wisconsin-Madison, 2023
2023
Supplementary Materials for Learning Procedure-aware Video Representation from Instructional Videos and Their Narrations
Y Zhong, L Yu, Y Bai, S Li, X Yan, Y Li
2023