Yao Qian

Cited by

	All	Since 2019
Citations	6550	4260
h-index	36	28
i10-index	106	60

Citations per year

Year	Citations
2003	25
2004	30
2005	21
2006	36
2007	49
2008	53
2009	65
2010	79
2011	81
2012	118
2013	108
2014	109
2015	231
2016	391
2017	365
2018	469
2019	501
2020	420
2021	469
2022	722
2023	1284
2024	854

Public access

2 articles, based on funding mandates

Co-authors

Tan Lee 李丹 - Department of Electronic Engineering, The Chinese University of Hong Kong - Verified email at ee.cuhk.edu.hk
Zhizheng Wu - Chinese University of Hong Kong, Shenzhen, Mel Lab - Verified email at cuhk.edu.cn
Boyang Gao - GR - Verified email at geometryrobot.com
Pinyan Lu - ITCS, Shanghai University of Finance and Economics - Verified email at mail.shufe.edu.cn
Wenping Hu - Professor of Chemistry, Institute of Chemistry, CAS - Verified email at iccas.ac.cn
Hui LIANG - NIO GmbH

Yao Qian

Microsoft

Verified email at microsoft.com

Deep Learning - Spoken Language Processing - Computer Aided Language Learning


Title	Cited by	Year
WavLM: Large-scale self-supervised pre-training for full stack speech processing. S Chen, C Wang, Z Chen, Y Wu, S Liu, Z Chen, J Li, N Kanda, T Yoshioka, ... IEEE Journal of Selected Topics in Signal Processing 16 (6), 1505-1518, 2022	1209	2022
TTS synthesis with bidirectional LSTM based recurrent neural networks. Y Fan, Y Qian, FL Xie, FK Soong. Fifteenth annual conference of the international speech communication …, 2014	611	2014
On the training aspects of deep neural network (DNN) for parametric TTS synthesis. Y Qian, Y Fan, W Hu, FK Soong. 2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014	267	2014
Improved mispronunciation detection with deep neural network trained acoustic models and transfer learning based logistic regression classifiers. W Hu, Y Qian, FK Soong, Y Wang. Speech Communication 67, 154-166, 2015	250	2015
SpeechT5: Unified-modal encoder-decoder pre-training for spoken language processing. J Ao, R Wang, L Zhou, C Wang, S Ren, Y Wu, S Liu, T Ko, Q Li, Y Zhang, ... arXiv preprint arXiv:2110.07205, 2021	185	2021
Part-of-speech tagging with bidirectional long short-term memory recurrent neural network. P Wang, Y Qian, FK Soong, L He, H Zhao. arXiv preprint arXiv:1510.06168, 2015	160	2015
Multi-speaker modeling and speaker adaptation for DNN-based TTS synthesis. Y Fan, Y Qian, FK Soong, L He. 2015 IEEE international conference on acoustics, speech and signal …, 2015	155	2015
Using bidirectional LSTM recurrent neural networks to learn high-level abstractions of sequential features for automated scoring of non-native spontaneous speech. Z Yu, V Ramanarayanan, D Suendermann-Oeft, X Wang, K Zechner, ... 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU …, 2015	133	2015
A unified tagging solution: Bidirectional LSTM recurrent neural network with word embedding. P Wang, Y Qian, FK Soong, L He, H Zhao. arXiv preprint arXiv:1511.00215, 2015	119	2015
A report on the 2017 native language identification shared task. S Malmasi, K Evanini, A Cahill, J Tetreault, R Pugh, C Hamill, ... Proceedings of the 12th Workshop on Innovative Use of NLP for Building …, 2017	118	2017
UniSpeech: Unified speech representation learning with labeled and unlabeled data. C Wang, Y Wu, Y Qian, K Kumatani, S Liu, F Wei, M Zeng, X Huang. International Conference on Machine Learning, 10937-10947, 2021	117	2021
Large-scale self-supervised speech representation learning for automatic speaker verification. Z Chen, S Chen, Y Wu, Y Qian, C Wang, S Liu, Y Qian, M Zeng. ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	108	2022
A new DNN-based high quality pronunciation evaluation for computer-aided language learning (CALL). W Hu, Y Qian, FK Soong. Interspeech, 1886-1890, 2013	108	2013
Locating boundaries for prosodic constituents in unrestricted Mandarin texts. M Chu, Y Qian. International Journal of Computational Linguistics & Chinese Language …, 2001	107	2001
End-to-end neural network based automated speech scoring. L Chen, J Tao, S Ghaffarzadegan, Y Qian. 2018 IEEE international conference on acoustics, speech and signal …, 2018	89	2018
A cross-language state sharing and mapping approach to bilingual (Mandarin–English) TTS. Y Qian, H Liang, FK Soong. IEEE Transactions on Audio, Speech, and Language Processing 17 (6), 1231-1239, 2009	85	2009
Exploring ASR-free end-to-end modeling to improve spoken language understanding in a cloud-based dialog system. Y Qian, R Ubale, V Ramanarayanan, P Lange, D Suendermann-Oeft, ... 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2017	79	2017
Word embedding for recurrent neural network based TTS synthesis. P Wang, Y Qian, FK Soong, L He, H Zhao. 2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015	74	2015
An HMM-based Mandarin Chinese text-to-speech system. Y Qian, F Soong, Y Chen, M Chu. Chinese Spoken Language Processing: 5th International Symposium, ISCSLP 2006 …, 2006	73	2006
UniSpeech-SAT: Universal speech representation learning with speaker aware pre-training. S Chen, Y Wu, C Wang, Z Chen, Z Chen, S Liu, J Wu, Y Qian, F Wei, J Li, ... ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	69	2022
