Kevin Lin

Cited by

	All	Since 2019
Citations	5831	5189
h-index	25	25
i10-index	34	33

1700

850

425

1275

201520162017201820192020202120222023202415 76 190 340 381 408 433 660 1602 1700

Public access

View all

2 articles

1 article

available

not available

Based on funding mandates

Co-authors

Lijuan WangMicrosoft GenAIVerified email at microsoft.com
Zicheng LiuMicrosoftVerified email at microsoft.com
Linjie (Lindsey) LiSenior Researcher, MicrosoftVerified email at microsoft.com
Zhengyuan YangResearcher, MicrosoftVerified email at microsoft.com
Chu-Song ChenNational Taiwan UniversityVerified email at csie.ntu.edu.tw
Jianfeng WangMicrosoftVerified email at microsoft.com
Chung-Ching LinMicrosoftVerified email at microsoft.com
Zhe GanResearch Scientist, AppleVerified email at apple.com
Huei-Fang YangNational Sun Yat-sen UniversityVerified email at mis.nsysu.edu.tw
Ce LiuAI Research Scientist Director, Meta GenAI; IEEE FellowVerified email at meta.com
Ming-Ting SunProfessor of Electrical Engineering, University of WashingtonVerified email at ee.washington.edu
Yi-Ping HungNational Taiwan UniversityVerified email at csie.ntu.edu.tw
Faisal Ahmed, PhDMicrosoftVerified email at microsoft.com
Jenhao HsiaoPrincipal AI Architect, OPPO US Research CenterVerified email at oppo.com
Jiwen Lu (鲁继文)Department of Automation, Tsinghua UniversityVerified email at tsinghua.edu.cn
Jianfeng GaoMicrosoft Research, RedmondVerified email at microsoft.com
William Yang WangMellichamp Chair Professor, University of California, Santa BarbaraVerified email at cs.ucsb.edu
Tsu-Jui FuUC Santa BarbaraVerified email at ucsb.edu
Fuxiao LiuUniversity of Maryland, College ParkVerified email at umd.edu
Yaser YacoobUniversity of Maryland, College ParkVerified email at umd.edu

Kevin Lin

Microsoft

Verified email at microsoft.com - Homepage

Computer Vision Vision and Language Multimodal


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Deep learning of binary hash codes for fast image retrieval K Lin, HF Yang, JH Hsiao, CS Chen IEEE Conference on Computer Vision and Pattern Recognition Workshops, 27-35, 2015	754	2015
End-to-end human pose and mesh reconstruction with transformers K Lin, L Wang, Z Liu IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 1954-1963, 2021	644	2021
Adversarial ranking for language generation K Lin, D Li, X He, Z Zhang, MT Sun Advances in Neural Information Processing Systems (NeurIPS), 3158-3168, 2017	421	2017
GIT: A generative image-to-text transformer for vision and language J Wang, Z Yang, X Hu, L Li, K Lin, Z Gan, Z Liu, C Liu, L Wang Transactions on Machine Learning Research (TMLR), 2022	419	2022
Learning compact binary descriptors with unsupervised deep neural networks K Lin, J Lu, CS Chen, J Zhou IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 1183-1192, 2016	418	2016
Supervised learning of semantics-preserving hash via deep convolutional neural networks HF Yang, K Lin, CS Chen IEEE Transactions on Pattern Analysis and Machine Intelligence 40 (2), 437-451, 2018	394	2018
The dawn of lmms: Preliminary explorations with gpt-4v (ision) Z Yang, L Li, K Lin, J Wang, CC Lin, Z Liu, L Wang arXiv preprint arXiv:2309.17421, 2023	332	2023
Mesh graphormer K Lin, L Wang, Z Liu IEEE/CVF International Conference on Computer Vision (ICCV), 12939-12948, 2021	303	2021
MM-REACT: Prompting ChatGPT for Multimodal Reasoning and Action Z Yang, L Li, J Wang, K Lin, E Azarnasab, F Ahmed, Z Liu, C Liu, M Zeng, ... arXiv preprint arXiv:2303.11381, 2023	240	2023
SwinBERT: End-to-end transformers with sparse attention for video captioning K Lin, L Li, CC Lin, F Ahmed, Z Gan, Z Liu, Y Lu, L Wang IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 17949 …, 2022	223	2022
Mm-vet: Evaluating large multimodal models for integrated capabilities W Yu, Z Yang, L Li, J Wang, K Lin, Z Liu, X Wang, L Wang ICML 2024, 2024	208	2024
Mitigating hallucination in large multi-modal models via robust instruction tuning F Liu, K Lin, L Li, J Wang, Y Yacoob, L Wang ICLR 2024, 2024	204*	2024
VIOLET: End-to-end video-language transformers with masked visual-token modeling TJ Fu, L Li, Z Gan, K Lin, WY Wang, L Wang, Z Liu arXiv preprint arXiv:2111.12681, 2021	189	2021
Abandoned object detection via temporal consistency modeling and back-tracing verification for visual surveillance K Lin, SC Chen, CS Chen, DTD Lin, YP Hung IEEE Transactions on Information Forensic and Security 10 (7), 1359-1370, 2015	112	2015
Vivo: Visual vocabulary pre-training for novel object captioning X Hu, X Yin, K Lin, L Zhang, J Gao, L Wang, Z Liu Proceedings of the AAAI Conference on Artificial Intelligence, 1575-1583, 2021	107*	2021
Cross-domain complementary learning using pose for multi-person part segmentation K Lin, L Wang, K Luo, Y Chen, Z Liu, MT Sun IEEE Transactions on Circuits and Systems for Video Technology 31 (3), 1066 …, 2020	93	2020
Rapid clothing retrieval via deep learning of binary codes and hierarchical search K Lin, HF Yang, KH Liu, JH Hsiao, CS Chen ACM International Conference on Multimedia Retrieval (ICMR), 499–502, 2015	91	2015
Reco: Region-controlled text-to-image generation Z Yang, J Wang, Z Gan, L Li, K Lin, C Wu, N Duan, Z Liu, C Liu, M Zeng, ... IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 14246 …, 2023	82	2023
Unsupervised deep learning of compact binary descriptors K Lin, J Lu, CS Chen, J Zhou, MT Sun IEEE Transactions on Pattern Analysis and Machine Intelligence 41 (6), 1501-1514, 2019	77	2019
Lavender: Unifying video-language understanding as masked language modeling L Li, Z Gan, K Lin, CC Lin, Z Liu, C Liu, L Wang IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 23119 …, 2023	70	2023

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors