Hiroshi Sato

Cited by

	All	Since 2019
Citations	257	244
h-index	10	9
i10-index	10	9

Year	2016	2017	2018	2019	2020	2021	2022	2023	2024
Citations	1	3	1	2	5	21	49	130	32

Co-authors

Ryo Masumura	Distinguished Research Scientist, NTT Computer and Data Science Laboratories, NTT Corporation	Verified email at lab.ntt.co.jp
Marc Delcroix	NTT Communication Science Laboratories	Verified email at ieee.org
Tomohiro Tanaka	NTT Computer & Data Science Laboratories	Verified email at hco.ntt.co.jp
Takanori Ashihara	NTT	Verified email at ntt.com
Yusuke Shinohara	LY Corporation	Verified email at lycorp.co.jp
Keisuke Kinoshita	Research Scientist at Google	Verified email at ieee.org
Atsushi Ando	NTT Corporation	Verified email at hco.ntt.co.jp
Mana Ihori	NTT Computer & Data Science Laboratories	Verified email at hco.ntt.co.jp
Shoko Araki	NTT Communication Science Laboratories	Verified email at ieee.org
Kohei Matsuura	NTT Corporation	Verified email at ntt.com
Tomohiro Nakatani	NTT Communication Science Laboratories	Verified email at ieee.org
Rintaro Ikeshita (池下林太郎)	NTT	Verified email at ieee.org
Yusuke Ijima	NTT Corporation	Verified email at lab.ntt.co.jp
Shigeru Katagiri		Verified email at mbj.ocn.ne.jp
Takahiro Shinozaki	Tokyo Institute of Technology	Verified email at ict.e.titech.ac.jp
Nobukatsu Hojo	NTT Human Informatics Laboratories	Verified email at ntt.com
Taichi Asami	NTT Corporation	Verified email at ntt.com
Shigeki Karita	Google	Verified email at google.com
Atsunori Ogawa	NTT Communication Science Laboratories	Verified email at ieee.org
Kateřina Žmolíková	Research scientist, Meta	Verified email at meta.com

Hiroshi Sato

NTT Corporation

Verified email at ntt.com

speech enhancement, speech recognition


Title	Authors	Venue	Cited by	Year
How bad are artifacts?: Analyzing the impact of speech enhancement errors on ASR	K Iwamoto, T Ochiai, M Delcroix, R Ikeshita, H Sato, S Araki, S Katagiri	arXiv preprint arXiv:2201.06685, 2022	51	2022
Learning to enhance or not: Neural network-based switching of enhanced and observed signals for overlapping speech recognition	H Sato, T Ochiai, M Delcroix, K Kinoshita, N Kamo, T Moriya	ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	24	2022
Self-Distillation for Improving CTC-Transformer-Based ASR Systems.	T Moriya, T Ochiai, S Karita, H Sato, T Tanaka, T Ashihara, R Masumura, ...	INTERSPEECH, 546-550, 2020	23	2020
Multimodal attention fusion for target speaker extraction	H Sato, T Ochiai, K Kinoshita, M Delcroix, T Nakatani, S Araki	2021 IEEE Spoken Language Technology Workshop (SLT), 778-784, 2021	22	2021
Should we always separate?: Switching between enhanced and observed signals for overlapping speech recognition	H Sato, T Ochiai, M Delcroix, K Kinoshita, T Moriya, N Kamo	arXiv preprint arXiv:2106.00949, 2021	20	2021
Streaming target-speaker ASR with neural transducer	T Moriya, H Sato, T Ochiai, M Delcroix, T Shinozaki	arXiv preprint arXiv:2209.04175, 2022	13	2022
Distilling attention weights for CTC-based ASR systems	T Moriya, H Sato, T Tanaka, T Ashihara, R Masumura, Y Shinohara	ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	13	2020
Listen only to me! How well can target speech extraction handle false alarms?	M Delcroix, K Kinoshita, T Ochiai, K Zmolikova, H Sato, T Nakatani	arXiv preprint arXiv:2204.04811, 2022	11	2022
A case report of comorbid eating disorder and factitious disorder	I Mizuta, T Fukunaga, H Sato, M Ogasawara, M Takeda, Y Inoue	Psychiatry and clinical neurosciences 54 (5), 603-606, 2000	11	2000
Neural Whispered Speech Detection with Imbalanced Learning.	T Ashihara, Y Shinohara, H Sato, T Moriya, K Matsui, T Fukutomi, ...	INTERSPEECH, 3352-3356, 2019	10	2019
Speech emotion recognition based on listener adaptive models	A Ando, R Masumura, H Sato, T Moriya, T Ashihara, Y Ijima, T Toda	ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	8	2021
SimpleFlat: A simple whole-network pre-training approach for RNN transducer-based end-to-end speech recognition	T Moriya, T Ashihara, T Tanaka, T Ochiai, H Sato, A Ando, Y Ijima, ...	ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	8	2021
Strategies to improve robustness of target speech extraction to enrollment variations	H Sato, T Ochiai, M Delcroix, K Kinoshita, T Moriya, N Makishima, M Ihori, ...	arXiv preprint arXiv:2206.08174, 2022	7	2022
Streaming End-to-End Speech Recognition for Hybrid RNN-T/Attention Architecture.	T Moriya, T Tanaka, T Ashihara, T Ochiai, H Sato, A Ando, R Masumura, ...	Interspeech, 1787-1791, 2021	7	2021
Hybrid RNN-T/Attention-based streaming ASR with triggered chunkwise attention and dual internal language model integration	T Moriya, T Ashihara, A Ando, H Sato, T Tanaka, K Matsuura, ...	ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	6	2022
World's first bio-degradable actuator for removal-free implantable MEMS	H Sato, Y Inoue, M Ikeuchi, K Ikuta	2017 IEEE 30th International Conference on Micro Electro Mechanical Systems …, 2017	4	2017
End-to-End Automatic Speech Recognition with a Reconstruction Criterion Using Speech-to-Text and Text-to-Speech Encoder-Decoders.	R Masumura, H Sato, T Tanaka, T Moriya, Y Ijima, T Oba	INTERSPEECH, 1606-1610, 2019	3	2019
Improving scheduled sampling for neural transducer-based ASR	T Moriya, T Ashihara, H Sato, K Matsuura, T Tanaka, R Masumura	ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	2	2023
Downstream task agnostic speech enhancement with self-supervised representation loss	H Sato, R Masumura, T Ochiai, M Delcroix, T Moriya, T Ashihara, ...	arXiv preprint arXiv:2305.14723, 2023	2	2023
Transcribing speech as spoken and written dual text using an autoregressive model	M Ihori, H Sato, T Tanaka, R Masumura, S Mizuno, N Hojo	Proc. INTERSPEECH 2023, 461-465, 2023	2	2023
