Attngan: Fine-grained text to image generation with attentional generative adversarial networks T Xu, P Zhang, Q Huang, H Zhang, Z Gan, X Huang, X He Proceedings of the IEEE conference on computer vision and pattern …, 2018 | 1007 | 2018 |
Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks X Li, X Yin, C Li, P Zhang, X Hu, L Zhang, L Wang, H Hu, L Dong, F Wei, ... European Conference on Computer Vision, 121-137, 2020 | 638 | 2020 |
Provably robust deep learning via adversarially trained smoothed classifiers H Salman, J Li, I Razenshteyn, P Zhang, H Zhang, S Bubeck, G Yang Advances in Neural Information Processing Systems 32, 2019 | 308 | 2019 |
Scaling vision transformers X Zhai, A Kolesnikov, N Houlsby, L Beyer Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 282* | 2022 |
Vinvl: Revisiting visual representations in vision-language models P Zhang, X Li, X Hu, J Yang, L Zhang, L Wang, Y Choi, J Gao Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021 | 209 | 2021 |
Object-driven text-to-image synthesis via adversarial training W Li, P Zhang, L Zhang, Q Huang, X He, S Lyu, J Gao Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019 | 201 | 2019 |
A convex relaxation barrier to tight robustness verification of neural networks H Salman, G Yang, H Zhang, CJ Hsieh, P Zhang Advances in Neural Information Processing Systems 32, 2019 | 135 | 2019 |
Multi-scale vision longformer: A new vision transformer for high-resolution image encoding P Zhang, X Dai, J Yang, B Xiao, L Yuan, L Zhang, J Gao Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021 | 97 | 2021 |
On the discrimination-generalization tradeoff in GANs P Zhang, Q Liu, D Zhou, T Xu, X He arXiv preprint arXiv:1711.02771, 2017 | 92 | 2017 |
Vinvl: Making visual representations matter in vision-language models P Zhang, X Li, X Hu, J Yang, L Zhang, L Wang, Y Choi, J Gao | 66 | 2021 |
Understanding the role of momentum in stochastic gradient methods I Gitman, H Lang, P Zhang, L Xiao Advances in Neural Information Processing Systems 32, 2019 | 58 | 2019 |
Florence: A new foundation model for computer vision L Yuan, D Chen, YL Chen, N Codella, X Dai, J Gao, H Hu, X Huang, B Li, ... arXiv preprint arXiv:2111.11432, 2021 | 55 | 2021 |
Efficient self-supervised vision transformers for representation learning C Li, J Yang, P Zhang, M Gao, B Xiao, X Dai, L Yuan, J Gao arXiv preprint arXiv:2106.09785, 2021 | 52 | 2021 |
Recurjac: An efficient recursive algorithm for bounding jacobian matrix of neural networks and its applications H Zhang, P Zhang, CJ Hsieh Proceedings of the AAAI Conference on Artificial Intelligence 33 (01), 5757-5764, 2019 | 48 | 2019 |
Dynamic detr: End-to-end object detection with dynamic attention X Dai, Y Chen, J Yang, P Zhang, L Yuan, L Zhang Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021 | 36 | 2021 |
Object-centric image generation from layouts T Sylvain, P Zhang, Y Bengio, RD Hjelm, S Sharma Proceedings of the AAAI Conference on Artificial Intelligence 35 (3), 2647-2655, 2021 | 29 | 2021 |
An empirical study of training end-to-end vision-and-language transformers ZY Dou, Y Xu, Z Gan, J Wang, S Wang, L Wang, C Zhu, P Zhang, L Yuan, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 27 | 2022 |
A convex relaxation barrier to tight robust verification of neural networks H Salman, G Yang, H Zhang, CJ Hsieh, P Zhang arXiv preprint arXiv:1902.08722, 2019 | 25 | 2019 |
Tiger: Text-to-image grounding for image caption evaluation M Jiang, Q Huang, L Zhang, X Wang, P Zhang, Z Gan, J Diesner, J Gao arXiv preprint arXiv:1909.02050, 2019 | 23 | 2019 |
Turbo learning for captionbot and drawingbot Q Huang, P Zhang, D Wu, L Zhang Advances in neural information processing systems 31, 2018 | 23 | 2018 |