WavLM: Large-scale self-supervised pre-training for full stack speech processing S Chen, C Wang, Z Chen, Y Wu, S Liu, Z Chen, J Li, N Kanda, T Yoshioka, ... IEEE Journal of Selected Topics in Signal Processing 16 (6), 1505-1518, 2022 | 1496 | 2022 |
TTS synthesis with bidirectional LSTM based recurrent neural networks Y Fan, Y Qian, FL Xie, FK Soong Fifteenth annual conference of the international speech communication …, 2014 | 626 | 2014 |
On the training aspects of deep neural network (DNN) for parametric TTS synthesis Y Qian, Y Fan, W Hu, FK Soong 2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014 | 276 | 2014 |
Improved mispronunciation detection with deep neural network trained acoustic models and transfer learning based logistic regression classifiers W Hu, Y Qian, FK Soong, Y Wang Speech Communication 67, 154-166, 2015 | 262 | 2015 |
SpeechT5: Unified-modal encoder-decoder pre-training for spoken language processing J Ao, R Wang, L Zhou, C Wang, S Ren, Y Wu, S Liu, T Ko, Q Li, Y Zhang, ... arXiv preprint arXiv:2110.07205, 2021 | 209 | 2021 |
Part-of-speech tagging with bidirectional long short-term memory recurrent neural network P Wang, Y Qian, FK Soong, L He, H Zhao arXiv preprint arXiv:1510.06168, 2015 | 160 | 2015 |
Multi-speaker modeling and speaker adaptation for DNN-based TTS synthesis Y Fan, Y Qian, FK Soong, L He 2015 IEEE international conference on acoustics, speech and signal …, 2015 | 158 | 2015 |
Using bidirectional LSTM recurrent neural networks to learn high-level abstractions of sequential features for automated scoring of non-native spontaneous speech Z Yu, V Ramanarayanan, D Suendermann-Oeft, X Wang, K Zechner, ... 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU …, 2015 | 135 | 2015 |
Large-scale self-supervised speech representation learning for automatic speaker verification Z Chen, S Chen, Y Wu, Y Qian, C Wang, S Liu, Y Qian, M Zeng ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 125 | 2022 |
UniSpeech: Unified speech representation learning with labeled and unlabeled data C Wang, Y Wu, Y Qian, K Kumatani, S Liu, F Wei, M Zeng, X Huang International Conference on Machine Learning, 10937-10947, 2021 | 125 | 2021 |
A report on the 2017 native language identification shared task S Malmasi, K Evanini, A Cahill, J Tetreault, R Pugh, C Hamill, ... Proceedings of the 12th Workshop on Innovative Use of NLP for Building …, 2017 | 120 | 2017 |
A unified tagging solution: Bidirectional LSTM recurrent neural network with word embedding P Wang, Y Qian, FK Soong, L He, H Zhao arXiv preprint arXiv:1511.00215, 2015 | 119 | 2015 |
A new DNN-based high quality pronunciation evaluation for computer-aided language learning (CALL). W Hu, Y Qian, FK Soong Interspeech, 1886-1890, 2013 | 110 | 2013 |
Locating boundaries for prosodic constituents in unrestricted Mandarin texts M Chu, Y Qian International Journal of Computational Linguistics & Chinese Language …, 2001 | 107 | 2001 |
End-to-end neural network based automated speech scoring L Chen, J Tao, S Ghaffarzadegan, Y Qian 2018 IEEE international conference on acoustics, speech and signal …, 2018 | 91 | 2018 |
UniSpeech-SAT: Universal speech representation learning with speaker aware pre-training S Chen, Y Wu, C Wang, Z Chen, Z Chen, S Liu, J Wu, Y Qian, F Wei, J Li, ... ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 87 | 2022 |
A cross-language state sharing and mapping approach to bilingual (Mandarin–English) TTS Y Qian, H Liang, FK Soong IEEE Transactions on Audio, Speech, and Language Processing 17 (6), 1231-1239, 2009 | 86 | 2009 |
Exploring ASR-free end-to-end modeling to improve spoken language understanding in a cloud-based dialog system Y Qian, R Ubale, V Ramanarayanan, P Lange, D Suendermann-Oeft, ... 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2017 | 80 | 2017 |
Word embedding for recurrent neural network based TTS synthesis P Wang, Y Qian, FK Soong, L He, H Zhao 2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015 | 75 | 2015 |
An HMM-based Mandarin Chinese text-to-speech system Y Qian, F Soong, Y Chen, M Chu Chinese Spoken Language Processing: 5th International Symposium, ISCSLP 2006 …, 2006 | 74 | 2006 |