Follow
Jian Ding
Jian Ding
Postdoc researcher, KAUST
Verified email at whu.edu.cn - Homepage
Title
Cited by
Year
Goldfish: Vision-Language Understanding of Arbitrarily Long Videos
K Ataallah, X Shen, E Abdelrahman, E Sleiman, M Zhuge, J Ding, D Zhu, ...
arXiv preprint arXiv:2407.12679, 2024
2024
InfiniBench: A Comprehensive Benchmark for Large Multimodal Models in Very Long Video Understanding
K Ataallah, C Gou, E Abdelrahman, K Pahwa, J Ding, M Elhoseiny
arXiv preprint arXiv:2406.19875, 2024
2024
Towards generic and controllable attacks against object detection
G Li, Y Xu, J Ding, GS Xia
IEEE Transactions on Geoscience and Remote Sensing, 2024
42024
VRSBench: A Versatile Vision-Language Benchmark Dataset for Remote Sensing Image Understanding
X Li, J Ding, M Elhoseiny
arXiv preprint arXiv:2406.12384, 2024
2024
iMotion-LLM: Motion Prediction Instruction Tuning
A Felemban, EM Bakr, X Shen, J Ding, A Mohamed, M Elhoseiny
arXiv preprint arXiv:2406.06211, 2024
2024
Kestrel: Point Grounding Multimodal LLM for Part-Aware 3D Vision-Language Understanding
J Fei, M Ahmed, J Ding, EM Bakr, M Elhoseiny
arXiv preprint arXiv:2405.18937, 2024
2024
When LLMs step into the 3D World: A Survey and Meta-Analysis of 3D Tasks via Multi-modal Large Language Models
X Ma, Y Bhalgat, B Smart, S Chen, X Li, J Ding, J Gu, DZ Chen, S Peng, ...
arXiv preprint arXiv:2405.10255, 2024
12024
Minigpt4-video: Advancing multimodal llms for video understanding with interleaved visual-textual tokens
K Ataallah, X Shen, E Abdelrahman, E Sleiman, D Zhu, J Ding, ...
arXiv preprint arXiv:2404.03413, 2024
92024
Prompting segmentation with sound is generalizable audio-visual source localizer
Y Wang, W Liu, G Li, J Ding, D Hu, X Li
Proceedings of the AAAI Conference on Artificial Intelligence 38 (6), 5669-5677, 2024
72024
FreePoint: unsupervised point cloud instance segmentation
Z Zhang, J Ding, L Jiang, D Dai, G Xia
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
42024
Uni3DL: Unified Model for 3D and Language Understanding
X Li, J Ding, Z Chen, M Elhoseiny
arXiv, 2023
22023
On the robustness of object detection models in aerial images
H He, J Ding, GS Xia
arXiv preprint arXiv:2308.15378, 2023
42023
Few-shot object detection via variational feature aggregation
J Han, Y Ren, J Ding, K Yan, GS Xia
Proceedings of the AAAI Conference on Artificial Intelligence 37 (1), 755-763, 2023
272023
Detecting building changes with off-nadir aerial images
C Pang, J Wu, J Ding, C Song, GS Xia
Science China Information Sciences 66 (4), 140306, 2023
192023
Hgformer: Hierarchical grouping transformer for domain generalized semantic segmentation
J Ding, N Xue, GS Xia, B Schiele, D Dai
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
162023
Dynamic coarse-to-fine learning for oriented tiny object detection
C Xu, J Ding, J Wang, W Yang, H Yu, L Yu, GS Xia
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
332023
Deeply unsupervised patch re-identification for pre-training object detectors
J Ding, E Xie, H Xu, C Jiang, Z Li, P Luo, GS Xia
IEEE Transactions on Pattern Analysis and Machine Intelligence 46 (3), 1348-1361, 2022
41*2022
Expanding low-density latent regions for open-set object detection
J Han, Y Ren, J Ding, X Pan, K Yan, GS Xia
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
572022
Decoupling zero-shot semantic segmentation
J Ding, N Xue, GS Xia, D Dai
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
1942022
Object detection in aerial images: A large-scale benchmark and challenges
J Ding, N Xue, GS Xia, X Bai, W Yang, MY Yang, S Belongie, J Luo, ...
IEEE transactions on pattern analysis and machine intelligence 44 (11), 7778 …, 2021
3302021
The system can't perform the operation now. Try again later.
Articles 1–20