Xiaodan Zhuang

Cited by

	All	Since 2019
Citations	1767	460
h-index	20	11
i10-index	28	11

Citations per year

Year	2008	2009	2010	2011	2012	2013	2014	2015	2016	2017	2018	2019	2020	2021	2022	2023	2024
Citations	7	32	64	85	126	183	154	191	177	143	115	126	88	80	94	56	16

Public access

2 articles

0 articles

available

not available

Based on funding mandates

Xiaodan Zhuang

Apple Siri; BBN, University of Illinois, IBM, Microsoft

Verified email at apple.com

pattern recognition, speech recognition, computer vision, multimedia information retrieval


Title	Authors	Publication	Cited by	Year
Real-world acoustic event detection	X Zhuang, X Zhou, MA Hasegawa-Johnson, TS Huang	Pattern Recognition Letters 31 (12), 1543-1551, 2010	216	2010
Multimodal feature fusion for robust event detection in web videos	P Natarajan, S Wu, S Vitaladevuni, X Zhuang, S Tsakalidis, U Park, ...	Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on …, 2012	213	2012
Acoustic fall detection using Gaussian mixture models and GMM supervectors	X Zhuang, J Huang, G Potamianos, M Hasegawa-Johnson	Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE …, 2009	198	2009
Zero-shot event detection using multi-modal fusion of weakly supervised concepts	S Wu, S Bondugula, F Luisier, X Zhuang, P Natarajan	Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2014	147	2014
SIFT-bag kernel for video event analysis	X Zhou, X Zhuang, S Yan, SF Chang, M Hasegawa-Johnson, TS Huang	Proceedings of the 16th ACM international conference on Multimedia, 229-238, 2008	139	2008
Feature analysis and selection for acoustic event detection	X Zhuang, X Zhou, TS Huang, M Hasegawa-Johnson	Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE …, 2008	98	2008
HMM-based acoustic event detection with AdaBoost feature selection	X Zhou, X Zhuang, M Liu, H Tang, M Hasegawa-Johnson, T Huang	Multimodal Technologies for Perception of Humans, 345-353, 2008	92	2008
Face age estimation using patch-based hidden Markov model supervectors	X Zhuang, X Zhou, M Hasegawa-Johnson, T Huang	Pattern Recognition, 2008. ICPR 2008. 19th International Conference on, 1-4, 2008	81	2008
Compact unsupervised EEG response representation for emotion recognition	X Zhuang, V Rozgic, M Crystal	Biomedical and Health Informatics (BHI), 2014 IEEE-EMBS International …, 2014	53	2014
BBN VISER TRECVID 2011 multimedia event detection system	P Natarajan, P Natarajan, V Manohar, S Wu, S Tsakalidis, ...	Proc. of NIST TRECVID Workshop, 2011	48	2011
SNDCNN: Self-normalizing deep CNNs with scaled exponential linear units for speech recognition	Z Huang, T Ng, L Liu, H Mason, X Zhuang, D Liu	arXiv preprint arXiv:1910.01992, 2019	43	2019
BBN VISER TRECVID 2012 multimedia event detection and multimedia event recounting systems	P Natarajan, P Natarajan, S Wu, X Zhuang, A Vazquez-Reina, ...	TRECVID 2013 Workshop 1 (5), 6, 2013	39	2013
BBN VISER TRECVID 2012 Multimedia Event Detection and Multimedia Event Recounting Systems	Pradeep Natarajan et al.	TRECVID 2012 Workshop, 2012	39*	2012
Novel Gaussianized vector representation for improved natural scene categorization	X Zhou, X Zhuang, H Tang, M Hasegawa-Johnson, TS Huang	Pattern Recognition Letters 31 (8), 702-708, 2010	37	2010
Compact Audio Representation for Event Detection in Consumer Media	X Zhuang, S Tsakalidis, S Wu, P Natarajan, R Prasad, P Natarajan	Interspeech 2012, 2012	35	2012
Improving faster-than-real-time human acoustic event detection by saliency-maximized audio visualization	KH Lin, X Zhuang, C Goudeseune, S King, M Hasegawa-Johnson, ...	Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International …, 2012	24	2012
The entropy of the articulatory phonological code: recognizing gestures from tract variables.	X Zhuang, H Nam, M Hasegawa-Johnson, LM Goldstein, E Saltzman	Interspeech, 1489-1492, 2008	23	2008
Text detection and recognition in natural scenes and consumer videos	A Jain, X Peng, X Zhuang, P Natarajan, H Cao	Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International …, 2014	22	2014
A Minimum Converted Trajectory Error (MCTE) Approach to High Quality Speech-to-Lips Conversion	X Zhuang, L Wang, FK Soong, M Hasegawa-Johnson	Eleventh Annual Conference of the International Speech Communication Association, 2010	21	2010
Voice quality dependent speech recognition	TJ Yoon, X Zhuang, J Cole, M Hasegawa-Johnson	International Symposium on Linguistic Patterns in Spontaneous Speech, 2006	21	2006
