Haotian Zhang

Cited by

	All	Since 2019
Citations	1500	1499
h-index	12	12
i10-index	13	13

720

360

180

540

20192020202120222023202424 62 113 200 714 380

Public access

View all

2 articles

1 article

available

not available

Based on funding mandates

Co-authors

Jenq-Neng HwangUniversity of WashingtonVerified email at u.washington.edu
Yizhou WangNVIDIA; University of WashingtonVerified email at nvidia.com
Jianfeng GaoMicrosoft Research, RedmondVerified email at microsoft.com
Pengchuan ZhangMeta AIVerified email at fb.com
Lijuan WangMicrosoft GenAIVerified email at microsoft.com
Liunian Harold LiUniversity of California, Los AngelesVerified email at cs.ucla.edu
Gaoang WangZhejiang University / University of Illinois Urbana-Champaign InstituteVerified email at intl.zju.edu.cn
Yinfei YangAppleVerified email at apple.com
Zhe GanResearch Scientist, AppleVerified email at apple.com
Lei ZhangInternational Digital Economy Academy (IDEA)Verified email at idea.edu.cn
Jianwei YangPrincipal Researcher, Microsoft Research, RedmondVerified email at microsoft.com
Chunyuan LiMicrosoft Research, RedmondVerified email at microsoft.com
Haoxuan YouColumbia UniversityVerified email at columbia.edu

Haotian Zhang

Research Scientist, Apple

Verified email at apple.com - Homepage

Deep Learning Computer Vision Vision + Language


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Grounded language-image pre-training LH Li, P Zhang, H Zhang*, J Yang, C Li, Y Zhong, L Wang, L Yuan, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022	660	2022
Exploit the connectivity: Multi-object tracking with trackletnet G Wang, Y Wang, H Zhang, R Gu, JN Hwang Proceedings of the 27th ACM international conference on multimedia, 482-490, 2019	208	2019
Glipv2: Unifying localization and vision-language understanding H Zhang, P Zhang, X Hu, YC Chen, LH Li, X Dai, L Wang, L Yuan, ... arXiv preprint arXiv:2206.05836, 2022	192	2022
Transmvsnet: Global context-aware multi-view stereo network with transformers Y Ding, W Yuan, Q Zhu, H Zhang, X Liu, Y Wang, X Liu Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022	119	2022
Ferret: Refer and ground anything anywhere at any granularity H You, H Zhang, Z Gan, X Du, B Zhang, Z Wang, L Cao, SF Chang, ... arXiv preprint arXiv:2310.07704, 2023	66	2023
Eye in the sky: Drone-based object tracking and 3d localization H Zhang, G Wang, Z Lei, JN Hwang Proceedings of the 27th ACM international conference on multimedia, 899-907, 2019	65	2019
VisDrone-SOT2019: The vision meets drone single object tracking challenge results D Du, P Zhu, L Wen, X Bian, H Ling, Q Hu, J Zheng, T Peng, X Wang, ... Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2019	41	2019
Visdrone-mot2019: The vision meets drone multiple object tracking challenge results L Wen, P Zhu, D Du, X Bian, H Ling, Q Hu, J Zheng, T Peng, X Wang, ... Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2019	26	2019
Bundle adjustment for monocular visual odometry based on detections of traffic signs Y Zhang, H Zhang, G Wang, J Yang, JN Hwang IEEE transactions on vehicular technology 69 (1), 151-162, 2019	17	2019
Rod2021 challenge: A summary for radar object detection challenge for autonomous driving applications Y Wang, JN Hwang, G Wang, H Liu, KJ Kim, HM Hsu, J Cai, H Zhang, ... Proceedings of the 2021 International Conference on Multimedia Retrieval …, 2021	13	2021
Ia-mot: Instance-aware multi-object tracking with motion consistency J Cai, Y Wang, H Zhang, HM Hsu, C Ma, JN Hwang arXiv preprint arXiv:2006.13458, 2020	13	2020
From scarcity to efficiency: Improving clip training via visual-enriched captions Z Lai, H Zhang, W Wu, H Bai, A Timofeev, X Du, Z Gan, J Shan, ... arXiv preprint arXiv:2310.07699, 2023	12	2023
Lifts: Lidar and monocular image fusion for multi-object tracking and segmentation H Zhang, Y Wang, J Cai, HM Hsu, H Ji, JN Hwang BMTT Challenge Workshop, IEEE Conference on Computer Vision and Pattern …, 2020	12	2020
MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training B McKinzie, Z Gan, JP Fauconnier, S Dodge, B Zhang, P Dufter, D Shah, ... arXiv preprint arXiv:2403.09611, 2024	9	2024
Megloc: A robust and accurate visual localization pipeline S Peng, Z He, H Zhang, R Yan, C Wang, Q Zhu, X Liu arXiv preprint arXiv:2111.13063, 2021	9	2021
Bundle adjustment for monocular visual odometry based on detected traffic sign features Y Zhang, J Yang, H Zhang, JN Hwang 2019 IEEE International Conference on Image Processing (ICIP), 4350-4354, 2019	9	2019
Monocular 3D localization of vehicles in road scenes H Zhang, H Ji, A Zheng, JN Hwang, RH Hwang Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021	8	2021
How Easy is It to Fool Your Multimodal LLMs? An Empirical Analysis on Deceptive Prompts Y Qian, H Zhang, Y Yang, Z Gan arXiv preprint arXiv:2402.13220, 2024	4	2024
Dior: Distill observations to representations for multi-object tracking and segmentation J Cai, Y Wang, HM Hsu, H Zhang, JN Hwang Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2022	4	2022
U3d-molts: Unified 3d monocular object localization, tracking and segmentation H Zhang, Y Wang, Z Jiang, CY Yang, J Mei, J Cai, JN Hwang, KJ Kim, ... ICCV Segmenting and Tracking Every Point and Pixel: 6th Workshop on …, 2021	4	2021

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors