A survey of large language models WX Zhao, K Zhou, J Li, T Tang, X Wang, Y Hou, Y Min, B Zhang, J Zhang, ... arXiv preprint arXiv:2303.18223, 2023 | 1681* | 2023 |
Evaluating object hallucination in large vision-language models Y Li, Y Du, K Zhou, J Wang, WX Zhao, JR Wen arXiv preprint arXiv:2305.10355, 2023 | 205 | 2023 |
A survey of vision-language pre-trained models Y Du, Z Liu, J Li, WX Zhao IJCAI 2022, 2022 | 145 | 2022 |
Learning to imagine: Visually-augmented natural language generation T Tang, Y Chen, Y Du, J Li, WX Zhao, JR Wen arXiv preprint arXiv:2305.16944, 2023 | 6 | 2023 |
What makes for good visual instructions? synthesizing complex visual reasoning instructions for visual instruction tuning Y Du, H Guo, K Zhou, WX Zhao, J Wang, C Wang, M Cai, R Song, JR Wen arXiv preprint arXiv:2311.01487, 2023 | 3 | 2023 |
Zero-shot visual question answering with language model feedback Y Du, J Li, T Tang, WX Zhao, JR Wen arXiv preprint arXiv:2305.17006, 2023 | 3 | 2023 |