Real-world acoustic event detection X Zhuang, X Zhou, MA Hasegawa-Johnson, TS Huang Pattern Recognition Letters 31 (12), 1543-1551, 2010 | 220 | 2010 |
Multimodal feature fusion for robust event detection in web videos P Natarajan, S Wu, S Vitaladevuni, X Zhuang, S Tsakalidis, U Park, ... Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on …, 2012 | 215 | 2012 |
Acoustic fall detection using Gaussian mixture models and GMM supervectors X Zhuang, J Huang, G Potamianos, M Hasegawa-Johnson Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE …, 2009 | 208 | 2009 |
Zero-shot event detection using multi-modal fusion of weakly supervised concepts S Wu, S Bondugula, F Luisier, X Zhuang, P Natarajan Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2014 | 152 | 2014 |
Sift-bag kernel for video event analysis X Zhou, X Zhuang, S Yan, SF Chang, M Hasegawa-Johnson, TS Huang Proceedings of the 16th ACM international conference on Multimedia, 229-238, 2008 | 139 | 2008 |
Feature analysis and selection for acoustic event detection X Zhuang, X Zhou, TS Huang, M Hasegawa-Johnson Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE …, 2008 | 98 | 2008 |
HMM-based acoustic event detection with AdaBoost feature selection X Zhou, X Zhuang, M Liu, H Tang, M Hasegawa-Johnson, T Huang Multimodal Technologies for Perception of Humans, 345-353, 2008 | 93 | 2008 |
Face age estimation using patch-based hidden markov model supervectors X Zhuang, X Zhou, M Hasegawa-Johnson, T Huang Pattern Recognition, 2008. ICPR 2008. 19th International Conference on, 1-4, 2008 | 82 | 2008 |
Compact unsupervised eeg response representation for emotion recognition X Zhuang, V Rozgic, M Crystal Biomedical and Health Informatics (BHI), 2014 IEEE-EMBS International …, 2014 | 54 | 2014 |
BBN VISER TRECVID 2011 multimedia event detection system P Natarajan, P Natarajan, V Manohar, S Wu, S Tsakalidis, ... Proc. of NIST TRECVID Workshop, 2011 | 48 | 2011 |
SNDCNN: Self-normalizing deep CNNs with scaled exponential linear units for speech recognition Z Huang, T Ng, L Liu, H Mason, X Zhuang, D Liu arXiv preprint arXiv:1910.01992, 2019 | 47 | 2019 |
Bbn viser trecvid 2012 multimedia event detection and multimedia event recounting systems P Natarajan, P Natarajan, S Wu, X Zhuang, A Vazquez-Reina, ... TRECVID 2013 Workshop 1 (5), 6, 2013 | 40 | 2013 |
BBN VISER TRECVID 2012 Multimedia Event Detection and Multimedia Event Recounting Systems et. al. Pradeep Natarajan Trecvid 2012 Workshop, 2012 | 40* | 2012 |
Novel Gaussianized vector representation for improved natural scene categorization X Zhou, X Zhuang, H Tang, M Hasegawa-Johnson, TS Huang Pattern Recognition Letters 31 (8), 702-708, 2010 | 37 | 2010 |
Compact Audio Representation for Event Detection in Consumer Media X Zhuang, S Tsakalidis, S Wu, P Natarajan, R Prasad, P Natarajan Interspeech 2012, 2012 | 35 | 2012 |
Improving faster-than-real-time human acoustic event detection by saliency-maximized audio visualization KH Lin, X Zhuang, C Goudeseune, S King, M Hasegawa-Johnson, ... Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International …, 2012 | 23 | 2012 |
The entropy of the articulatory phonological code: recognizing gestures from tract variables. X Zhuang, H Nam, M Hasegawa-Johnson, LM Goldstein, E Saltzman Interspeech, 1489-1492, 2008 | 23 | 2008 |
Text detection and recognition in natural scenes and consumer videos A Jain, X Peng, X Zhuang, P Natarajan, H Cao Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International …, 2014 | 22 | 2014 |
Voice quality dependent speech recognition TJ Yoon, X Zhuang, J Cole, M Hasegawa-Johnson International Symposium on Linguistic Patterns in Spontaneous Speech, 2006 | 22 | 2006 |
A Minimum Converted Trajectory Error (MCTE) Approach to High Quality Speech-to-Lips Conversion X Zhuang, L Wang, FK Soong, M Hasegawa-Johnson Eleventh Annual Conference of the International Speech Communication Association, 2010 | 21 | 2010 |