Synthesized classifiers for zero-shot learning S Changpinyo, WL Chao, B Gong, F Sha IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 5327-5336, 2016 | 905 | 2016 |
Conceptual 12M: Pushing Web-Scale Image-Text Pre-Training To Recognize Long-Tail Visual Concepts S Changpinyo, P Sharma, N Ding, R Soricut IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021 | 703 | 2021 |
An empirical study and analysis of generalized zero-shot learning for object recognition in the wild WL Chao, S Changpinyo, B Gong, F Sha European Conference on Computer Vision (ECCV), 52-68, 2016 | 629 | 2016 |
Gemini: A family of highly capable multimodal models G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ... arXiv preprint arXiv:2312.11805, 2023 | 539 | 2023 |
PaLI: A Jointly-Scaled Multilingual Language-Image Model X Chen, X Wang, S Changpinyo, AJ Piergiovanni, P Padlewski, D Salz, ... International Conference on Learning Representations (ICLR), 2023 | 405 | 2023 |
Predicting visual exemplars of unseen classes for zero-shot learning S Changpinyo, WL Chao, F Sha IEEE International Conference on Computer Vision (ICCV), 3496-3505, 2017 | 205 | 2017 |
Connecting vision and language with localized narratives J Pont-Tuset, J Uijlings, S Changpinyo, R Soricut, V Ferrari European Conference on Computer Vision (ECCV), 647-664, 2020 | 200 | 2020 |
The power of sparsity in convolutional neural networks S Changpinyo, M Sandler, A Zhmoginov arXiv preprint arXiv:1702.06257, 2017 | 159 | 2017 |
PaLI-X: On Scaling up a Multilingual Vision and Language Model X Chen, J Djolonga, P Padlewski, B Mustafa, S Changpinyo, J Wu, ... IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024 | 85 | 2024 |
Multi-Task Learning for Sequence Tagging: An Empirical Study S Changpinyo, H Hu, F Sha 27th International Conference on Computational Linguistics (COLING), 2965–2977, 2018 | 82 | 2018 |
All You May Need for VQA are Image Captions S Changpinyo, D Kukliansky, I Szpektor, X Chen, N Ding, R Soricut Conference of the North American Chapter of the Association for …, 2022 | 50 | 2022 |
Classifier and exemplar synthesis for zero-shot learning S Changpinyo, WL Chao, B Gong, F Sha International Journal of Computer Vision (IJCV) 128 (1), 166-201, 2020 | 49 | 2020 |
On Model Calibration for Long-Tailed Object Detection and Instance Segmentation TY Pan, C Zhang, Y Li, H Hu, D Xuan, S Changpinyo, B Gong, WL Chao Advances in Neural Information Processing Systems (NeurIPS), 2021 | 40 | 2021 |
MosaicOS: A Simple and Effective Use of Object-Centric Images for Long-Tailed Object Detection C Zhang, TY Pan, Y Li, H Hu, D Xuan, S Changpinyo, B Gong, WL Chao IEEE/CVF International Conference on Computer Vision (ICCV), 2021 | 37 | 2021 |
Similarity component analysis S Changpinyo, K Liu, F Sha Advances in Neural Information Processing Systems (NIPS), 1511-1519, 2013 | 30 | 2013 |
Can Pre-trained Vision and Language Models Answer Visual Information-Seeking Questions? Y Chen, H Hu, Y Luan, H Sun, S Changpinyo, A Ritter, MW Chang Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023 | 27 | 2023 |
What You See is What You Read? Improving Text-Image Alignment Evaluation M Yarom, Y Bitton, S Changpinyo, R Aharoni, J Herzig, O Lang, E Ofek, ... Advances in Neural Information Processing Systems (NeurIPS), 2023 | 26 | 2023 |
Robust Visual Reasoning via Language Guided Neural Module Networks AR Akula, V Jampani, S Changpinyo, SC Zhu Advances in Neural Information Processing Systems (NeurIPS), 2021 | 24 | 2021 |
CrossVQA: Scalably Generating Benchmarks for Systematically Testing VQA Generalization A Akula, S Changpinyo, B Gong, P Sharma, S Zhu, R Soricut Conference on Empirical Methods in Natural Language Processing (EMNLP), 2148 …, 2021 | 22 | 2021 |
PreSTU: Pre-Training for Scene-Text Understanding J Kil, S Changpinyo, X Chen, H Hu, S Goodman, WL Chao, R Soricut IEEE/CVF International Conference on Computer Vision (ICCV), 2023 | 20 | 2023 |