Sibo Song
Alibaba
Verified email at alibaba-inc.com
Title
Cited by
Year
On classification of distorted images with deep convolutional neural networks
Y Zhou, S Song, NM Cheung
Acoustics, Speech and Signal Processing (ICASSP), 2017 IEEE International …, 2017
Cited by 141 · 2017
Multimodal multi-stream deep learning for egocentric activity recognition
S Song, V Chandrasekhar, B Mandal, L Li, JH Lim, G Sateesh Babu, ...
Proceedings of the IEEE conference on computer vision and pattern …, 2016
Cited by 93 · 2016
Egocentric activity recognition with multimodal fisher vector
S Song, NM Cheung, V Chandrasekhar, B Mandal, J Lin
2016 IEEE International conference on acoustics, speech and signal …, 2016
Cited by 47 · 2016
Activity recognition in egocentric life-logging videos
S Song, V Chandrasekhar, NM Cheung, S Narayan, L Li, JH Lim
Computer Vision-ACCV 2014 Workshops: Singapore, Singapore, November 1-2 …, 2015
Cited by 45 · 2015
Defense against adversarial attacks with saak transform
S Song, Y Chen, NM Cheung, CCJ Kuo
arXiv preprint arXiv:1808.01785, 2018
Cited by 28 · 2018
Truly multi-modal youtube-8m video classification with video, audio, and text
Z Wang, K Kuan, M Ravaut, G Manek, S Song, Y Fang, S Kim, N Chen, ...
arXiv preprint arXiv:1706.05461, 2017
Cited by 26 · 2017
Saak transform-based machine learning for light-sheet imaging of cardiac trabeculation
Y Ding, V Gudapati, R Lin, Y Fei, RRS Packard, S Song, CC Chang, ...
IEEE Transactions on Biomedical Engineering 68 (1), 225-235, 2020
Cited by 19 · 2020
Vision-language pre-training for boosting scene text detectors
S Song, J Wan, Z Yang, J Tang, W Cheng, X Bai, C Yao
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
Cited by 18 · 2022
Deep Adaptive Temporal Pooling for Activity Recognition
S Song, NM Cheung, V Chandrasekhar, B Mandal
2018 ACM Multimedia Conference, 1829-1837, 2018
Cited by 11 · 2018
Modeling entities as semantic points for visual information extraction in the wild
Z Yang, R Long, P Wang, S Song, H Zhong, W Cheng, X Bai, C Yao
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
Cited by 5 · 2023
OmniParser: A Unified Framework for Text Spotting, Key Information Extraction and Table Recognition
J Wan, S Song, W Yu, Y Liu, W Cheng, F Huang, X Bai, C Yao, Z Yang
arXiv preprint arXiv:2403.19128, 2024
2024
ICDAR 2023 Competition on Born Digital Video Text Question Answering
Z Yang, X Song, S Song, T Lu, X Bai, CL Liu, F Huang, C Yao
International Conference on Document Analysis and Recognition, 508-521, 2023
2023
Towards Multimodal and Secure Deep Learning for Human Activity Recognition from Multiple Views
S Song
Singapore University of Technology and Design, 2018
2018