Videofactory: Swap attention in spatiotemporal diffusions for text-to-video generation W Wang, H Yang, Z Tuo, H He, J Zhu, J Fu, J Liu arXiv preprint arXiv:2305.10874, 2023 | 45 | 2023 |
Moviefactory: Automatic movie creation from text using large generative models for language and images J Zhu, H Yang, H He, W Wang, Z Tuo, WH Cheng, L Gao, J Song, J Fu Proceedings of the 31st ACM International Conference on Multimedia, 9313-9319, 2023 | 18 | 2023 |
Label-guided generative adversarial network for realistic image synthesis J Zhu, L Gao, J Song, YF Li, F Zheng, X Li, HT Shen IEEE Transactions on Pattern Analysis and Machine Intelligence 45 (3), 3311-3328, 2022 | 17 | 2022 |
Fully functional image manipulation using scene graphs in a bounding-box free way S Su, L Gao, J Zhu, J Shao, J Song Proceedings of the 29th ACM International Conference on Multimedia, 1784-1792, 2021 | 13 | 2021 |
Lab2Pix: label-adaptive generative adversarial network for unsupervised image synthesis L Gao, J Zhu, J Song, F Zheng, HT Shen Proceedings of the 28th ACM International Conference on Multimedia, 3734-3742, 2020 | 13 | 2020 |
Mobilevidfactory: Automatic diffusion-based social media video generation for mobile devices from text J Zhu, H Yang, W Wang, H He, Z Tuo, Y Yu, WH Cheng, L Gao, J Song, ... Proceedings of the 31st ACM International Conference on Multimedia, 9371-9373, 2023 | 4 | 2023 |
Utilizing Greedy Nature for Multimodal Conditional Image Synthesis in Transformers S Su, J Zhu, L Gao, J Song IEEE Transactions on Multimedia, 2023 | 3 | 2023 |
From external to internal: Structuring image for text-to-image attributes manipulation L Gao, Q Zhao, J Zhu, S Su, L Cheng, L Zhao IEEE Transactions on Multimedia, 2022 | 3 | 2022 |
Towards Unsupervised Deformable-Instances Image-to-Image Translation. S Su, J Song, L Gao, J Zhu IJCAI, 1004-1010, 2021 | 2 | 2021 |
EchoReel: Enhancing Action Generation of Existing Video Diffusion Models J Zhu, L Gao, J Song arXiv preprint arXiv:2403.11535, 2024 | | 2024 |
CoIN: A Benchmark of Continual Instruction tuNing for Multimodel Large Language Model C Chen, J Zhu, X Luo, H Shen, L Gao, J Song arXiv preprint arXiv:2403.08350, 2024 | | 2024 |
Training-Free Semantic Video Composition via Pre-trained Diffusion Model J Guo, S Su, J Zhu, L Gao, J Song arXiv preprint arXiv:2401.09195, 2024 | | 2024 |
Allowing Supervision in Unsupervised Deformable-Instances Image-to-Image Translation Y Liu, S Su, J Zhu, F Zheng, L Gao, J Song IEEE Transactions on Circuits and Systems for Video Technology, 2023 | | 2023 |
CUCL: Codebook for Unsupervised Continual Learning C Cheng, J Song, X Zhu, J Zhu, L Gao, H Shen Proceedings of the 31st ACM International Conference on Multimedia, 1729-1737, 2023 | | 2023 |