He Huang

Cited by

	All	Since 2019
Citations	636	625
h-index	8	8
i10-index	7	7

Citations per year: 2018: 9, 2019: 48, 2020: 109, 2021: 137, 2022: 113, 2023: 110, 2024: 106

Public access

3 articles

available

not available

Based on funding mandates

Co-authors

Philip S. Yu	Professor of Computer Science, University of Illinois at Chicago	Verified email at cs.uic.edu
Boris Ginsburg	NVIDIA	Verified email at nvidia.com
Changhu Wang (王长虎)	ByteDance AI Lab (字节跳动人工智能实验室)	Verified email at bytedance.com
Chang-Dong Wang	IEEE Senior Member and CCF Distinguished Member, Sun Yat-sen University	Verified email at mail.sysu.edu.cn
Bokai Cao	Meta	Verified email at fb.com
Alex Leow MDPhD	University of Illinois	Verified email at uic.edu
Wei Tang	University of Illinois Chicago	Verified email at uic.edu
Jiayu Han	Amazon	Verified email at amazon.com
Lei Zheng	Pinterest Inc.	Verified email at pinterest.com
Hainan Xu	NVIDIA	Verified email at nvidia.com

He Huang

NVIDIA

Verified email at nvidia.com

Conversational AI, Computer Vision


Title	Cited by	Year
Generative dual adversarial network for generalized zero-shot learning. H Huang, C Wang, PS Yu, CD Wang. Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2019	260	2019
An introduction to image synthesis with generative adversarial nets. H Huang, PS Yu, C Wang	218	2018
Fast conformer with linearly scalable attention for efficient speech recognition. D Rekesh, NR Koluguri, S Kriman, S Majumdar, V Noroozi, H Huang, ... 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023	37	2023
Deep latent factor model with hierarchical similarity measure for recommender systems. J Han, L Zheng, H Huang, Y Xu, PS Yu, W Zuo. Information Sciences 503, 521-532, 2019	28	2019
dpMood: exploiting local and periodic typing dynamics for personalized mood prediction. H Huang, B Cao, PS Yu, CD Wang, AD Leow. 2018 IEEE International Conference on Data Mining (ICDM), 157-166, 2018	28	2018
SALM: Speech-augmented language model with in-context learning for speech recognition and translation. Z Chen, H Huang, A Andrusenko, O Hrinchuk, KC Puvvada, J Li, S Ghosh, ... ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024	12	2024
Passive sensing of affective and cognitive functioning in mood disorders by analyzing keystroke kinematics and speech dynamics. F Hussain, JP Stange, SA Langenecker, MG McInnis, J Zulueta, ... Digital phenotyping and mobile sensing: New developments in …, 2019	10	2019
Efficient sequence transduction by jointly predicting tokens and durations. H Xu, F Jia, S Majumdar, H Huang, S Watanabe, B Ginsburg. International Conference on Machine Learning, 38462-38484, 2023	8	2023
Multi-label Zero-shot Classification by Learning to Transfer from External Knowledge. H Huang, Y Chen, W Tang, W Zheng, QG Chen, P Yu. BMVC'2020 (oral), 2020	8	2020
Addressing Class Imbalance in Scene Graph Parsing by Learning to Contrast and Score. H Huang, S Saito, Y Kikuchi, E Matsumoto, W Tang, PS Yu. ACCV'2020, 2020	7	2020
Passive sensing of affective and cognitive functioning in mood disorders by Analyzing keystroke kinematics and speech dynamics. F Hussain, JP Stange, SA Langenecker, MG McInnis, J Zulueta, ... Digital Phenotyping and Mobile Sensing: New Developments in …, 2022	4	2022
The CHiME-7 Challenge: System Description and Performance of NeMo Team's DASR System. TJ Park, H Huang, A Jukic, K Dhawan, KC Puvvada, N Koluguri, N Karpov, ... arXiv preprint arXiv:2310.12378, 2023	3	2023
Property-aware multi-speaker data simulation: A probabilistic modelling technique for synthetic data generation. TJ Park, H Huang, C Hooper, N Koluguri, K Dhawan, A Jukic, J Balam, ... arXiv preprint arXiv:2310.12371, 2023	3	2023
Leveraging pretrained ASR encoders for effective and efficient end-to-end speech intent classification and slot filling. H Huang, J Balam, B Ginsburg. arXiv preprint arXiv:2307.07057, 2023	3	2023
Translational concept embedding for generalized compositional zero-shot learning. H Huang, W Tang, J Zhang, PS Yu. arXiv preprint arXiv:2112.10871, 2021	3	2021
Dapred: Dynamic attention location prediction with long-short term movement regularity. J Liu, Q Yuan, C Yang, H Huang, C Zhang, P Yu	3	2019
DeSTA: Enhancing Speech Language Models through Descriptive Speech-Text Alignment. KH Lu, Z Chen, SW Fu, H Huang, B Ginsburg, YCF Wang, H Lee. arXiv preprint arXiv:2406.18871, 2024	1	2024
Less is More: Accurate Speech Recognition & Translation without Web-Scale Data. KC Puvvada, P Żelasko, H Huang, O Hrinchuk, NR Koluguri, K Dhawan, ... arXiv preprint arXiv:2406.19674, 2024		2024
BESTOW: Efficient and Streamable Speech Language Model with the Best of Two Worlds in GPT and T5. Z Chen, H Huang, O Hrinchuk, KC Puvvada, NR Koluguri, P Żelasko, ... arXiv preprint arXiv:2406.19954, 2024		2024

Articles 1–19
