Yashesh Gaur

Cited by

	All	Since 2019
Citations	1574	1476
h-index	21	21
i10-index	37	33

440

220

110

330

2016201720182019202020212022202320248 19 63 90 103 298 356 437 190

Public access

View all

1 article

0 articles

available

not available

Based on funding mandates

Co-authors

Jinyu LiPartner Applied Science Manager, MicrosoftVerified email at microsoft.com
Zhong MengGoogleVerified email at google.com
Naoyuki KandaMicrosoftVerified email at microsoft.com
Yifan GongPrincipal Science Manager, Microsoft Corp.Verified email at microsoft.com
Anuroop SriramMeta FAIRVerified email at alumni.cmu.edu
Sanjeev SatheeshStanford UniversityVerified email at stanford.edu
Eric BattenbergGoogle ResearchVerified email at google.com
Adam CoatesPreviously Apple, Khosla Ventures, Baidu SVAIL, Stanford PhDVerified email at cs.stanford.edu
Jeffrey P. BighamCarnegie Mellon University & AppleVerified email at cs.cmu.edu
Florian MetzeCarnegie Mellon University; Meta AIVerified email at andrew.cmu.edu
Yajie MiaoCarnegie Mellon UniversityVerified email at cs.cmu.edu

Yashesh Gaur

Meta AI

Verified email at cs.cmu.edu

Machine Learning Speech & Language


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Exploring neural transducers for end-to-end speech recognition E Battenberg, J Chen, R Child, A Coates, YGY Li, H Liu, S Satheesh, ... 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2017	269*	2017
On the comparison of popular end-to-end models for large scale speech recognition J Li, Y Wu, Y Gaur, C Wang, R Zhao, S Liu arXiv preprint arXiv:2005.14327, 2020	142	2020
Internal language model estimation for domain-adaptive end-to-end speech recognition Z Meng, S Parthasarathy, E Sun, Y Gaur, N Kanda, L Lu, X Chen, R Zhao, ... 2021 IEEE Spoken Language Technology Workshop (SLT), 243-250, 2021	95	2021
Serialized output training for end-to-end overlapped speech recognition N Kanda, Y Gaur, X Wang, Z Meng, T Yoshioka arXiv preprint arXiv:2003.12687, 2020	94	2020
Joint speaker counting, speech recognition, and speaker identification for overlapped speech of any number of speakers N Kanda, Y Gaur, X Wang, Z Meng, Z Chen, T Zhou, T Yoshioka arXiv preprint arXiv:2006.10930, 2020	70	2020
Robust speech recognition using generative adversarial networks A Sriram, H Jun, Y Gaur, S Satheesh 2018 IEEE international conference on acoustics, speech and signal …, 2018	69	2018
The effects of automatic speech recognition quality on human transcription latency Y Gaur, WS Lasecki, F Metze, JP Bigham Proceedings of the 13th International Web for All Conference, 1-8, 2016	56	2016
Minimum latency training strategies for streaming sequence-to-sequence ASR H Inaguma, Y Gaur, L Lu, J Li, Y Gong ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	54	2020
Domain adaptation via teacher-student learning for end-to-end speech recognition Z Meng, J Li, Y Gaur, Y Gong 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019	50	2019
Internal language model training for domain-adaptive end-to-end speech recognition Z Meng, N Kanda, Y Gaur, S Parthasarathy, E Sun, L Lu, X Chen, J Li, ... ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	47	2021
Investigation of end-to-end speaker-attributed ASR for continuous multi-talker recordings N Kanda, X Chang, Y Gaur, X Wang, Z Meng, Z Chen, T Yoshioka 2021 IEEE Spoken Language Technology Workshop (SLT), 809-816, 2021	41	2021
Speaker adaptation for attention-based end-to-end speech recognition Z Meng, Y Gaur, J Li, Y Gong arXiv preprint arXiv:1911.03762, 2019	41	2019
Streaming multi-talker ASR with token-level serialized output training N Kanda, J Wu, Y Wu, X Xiao, Z Meng, X Wang, Y Gaur, Z Chen, J Li, ... arXiv preprint arXiv:2202.00842, 2022	39	2022
A Federated Approach in Training Acoustic Models. D Dimitriadis, RG Ken'ichi Kumatani, R Gmyr, Y Gaur, SE Eskimez Interspeech, 981-985, 2020	39	2020
Large-scale pre-training of end-to-end multi-talker ASR for meeting transcription with single distant microphone N Kanda, G Ye, Y Wu, Y Gaur, X Wang, Z Meng, Z Chen, T Yoshioka arXiv preprint arXiv:2103.16776, 2021	34	2021
Viola: Unified codec language models for speech recognition, synthesis, and translation T Wang, L Zhou, Z Zhang, Y Wu, S Liu, Y Gaur, Z Chen, J Li, F Wei arXiv preprint arXiv:2305.16107, 2023	33	2023
End-to-end speaker-attributed ASR with transformer N Kanda, G Ye, Y Gaur, X Wang, Z Meng, Z Chen, T Yoshioka arXiv preprint arXiv:2104.02128, 2021	33	2021
Internal language model adaptation with text-only data for end-to-end speech recognition Z Meng, Y Gaur, N Kanda, J Li, X Chen, Y Wu, Y Gong arXiv preprint arXiv:2110.05354, 2021	24	2021
On decoder-only architecture for speech-to-text and large language model integration J Wu, Y Gaur, Z Chen, L Zhou, Y Zhu, T Wang, J Li, S Liu, B Ren, L Liu, ... 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023	23	2023
Transcribe-to-diarize: Neural speaker diarization for unlimited number of speakers using end-to-end speaker-attributed ASR N Kanda, X Xiao, Y Gaur, X Wang, Z Meng, Z Chen, T Yoshioka ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	21	2022

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors