Generative dual adversarial network for generalized zero-shot learning H Huang, C Wang, PS Yu, CD Wang Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2019 | 274 | 2019 |
An introduction to image synthesis with generative adversarial nets H Huang, PS Yu, C Wang | 232 | 2018 |
Fast conformer with linearly scalable attention for efficient speech recognition D Rekesh, NR Koluguri, S Kriman, S Majumdar, V Noroozi, H Huang, ... 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023 | 70 | 2023 |
dpMood: exploiting local and periodic typing dynamics for personalized mood prediction H Huang, B Cao, SY Phillip, CD Wang, AD Leow 2018 IEEE International Conference on Data Mining (ICDM), 157-166, 2018 | 29 | 2018 |
Salm: Speech-augmented language model with in-context learning for speech recognition and translation Z Chen, H Huang, A Andrusenko, O Hrinchuk, KC Puvvada, J Li, S Ghosh, ... ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 28 | 2024 |
Deep latent factor model with hierarchical similarity measure for recommender systems J Han, L Zheng, H Huang, Y Xu, SY Philip, W Zuo Information Sciences 503, 521-532, 2019 | 28 | 2019 |
Efficient sequence transduction by jointly predicting tokens and durations H Xu, F Jia, S Majumdar, H Huang, S Watanabe, B Ginsburg International Conference on Machine Learning, 38462-38484, 2023 | 11 | 2023 |
Passive sensing of affective and cognitive functioning in mood disorders by analyzing keystroke kinematics and speech dynamics F Hussain, JP Stange, SA Langenecker, MG McInnis, J Zulueta, ... Digital phenotyping and mobile sensing: New developments in …, 2019 | 11 | 2019 |
Multi-label Zero-shot Classification by Learning to Transfer from External Knowledge H Huang, Y Chen, W Tang, W Zheng, QG Chen, P Yu BMVC'2020 (oral), 2020 | 8 | 2020 |
Property-aware multi-speaker data simulation: A probabilistic modelling technique for synthetic data generation TJ Park, H Huang, C Hooper, N Koluguri, K Dhawan, A Jukic, J Balam, ... arXiv preprint arXiv:2310.12371, 2023 | 7 | 2023 |
Addressing Class Imbalance in Scene Graph Parsing by Learning to Contrast and Score H Huang, S Saito, Y Kikuchi, E Matsumoto, W Tang, PS Yu ACCV'2020, 2020 | 7 | 2020 |
Less is more: Accurate speech recognition & translation without web-scale data KC Puvvada, P Żelasko, H Huang, O Hrinchuk, NR Koluguri, K Dhawan, ... arXiv preprint arXiv:2406.19674, 2024 | 6 | 2024 |
Desta: Enhancing speech language models through descriptive speech-text alignment KH Lu, Z Chen, SW Fu, H Huang, B Ginsburg, YCF Wang, H Lee arXiv preprint arXiv:2406.18871, 2024 | 5 | 2024 |
The CHiME-7 Challenge: System Description and Performance of NeMo Team's DASR System TJ Park, H Huang, A Jukic, K Dhawan, KC Puvvada, N Koluguri, N Karpov, ... arXiv preprint arXiv:2310.12378, 2023 | 5 | 2023 |
Leveraging pretrained asr encoders for effective and efficient end-to-end speech intent classification and slot filling H Huang, J Balam, B Ginsburg arXiv preprint arXiv:2307.07057, 2023 | 5 | 2023 |
Passive sensing of affective and cognitive functioning in mood disorders by Analyzing keystroke kinematics and speech dynamics F Hussain, JP Stange, SA Langenecker, MG McInnis, J Zulueta, ... Digital Phenotyping and Mobile Sensing: New Developments in …, 2022 | 4 | 2022 |
Translational concept embedding for generalized compositional zero-shot learning H Huang, W Tang, J Zhang, PS Yu arXiv preprint arXiv:2112.10871, 2021 | 3 | 2021 |
Dapred: Dynamic attention location prediction with long-short term movement regularity J Liu, Q Yuan, C Yang, H Huang, C Zhang, P Yu | 3 | 2019 |
Sortformer: Seamless integration of speaker diarization and asr by bridging timestamps and tokens T Park, I Medennikov, K Dhawan, W Wang, H Huang, NR Koluguri, ... arXiv preprint arXiv:2409.06656, 2024 | 2 | 2024 |
Bestow: Efficient and streamable speech language model with the best of two worlds in gpt and t5 Z Chen, H Huang, O Hrinchuk, KC Puvvada, NR Koluguri, P Żelasko, ... arXiv preprint arXiv:2406.19954, 2024 | 2 | 2024 |