Follow
Hiroshi Sato
Hiroshi Sato
NTT Corporation
Verified email at ntt.com
Title
Cited by
Cited by
Year
How bad are artifacts?: Analyzing the impact of speech enhancement errors on ASR
K Iwamoto, T Ochiai, M Delcroix, R Ikeshita, H Sato, S Araki, S Katagiri
arXiv preprint arXiv:2201.06685, 2022
512022
Learning to enhance or not: Neural network-based switching of enhanced and observed signals for overlapping speech recognition
H Sato, T Ochiai, M Delcroix, K Kinoshita, N Kamo, T Moriya
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
242022
Self-Distillation for Improving CTC-Transformer-Based ASR Systems.
T Moriya, T Ochiai, S Karita, H Sato, T Tanaka, T Ashihara, R Masumura, ...
INTERSPEECH, 546-550, 2020
232020
Multimodal attention fusion for target speaker extraction
H Sato, T Ochiai, K Kinoshita, M Delcroix, T Nakatani, S Araki
2021 IEEE Spoken Language Technology Workshop (SLT), 778-784, 2021
222021
Should we always separate?: Switching between enhanced and observed signals for overlapping speech recognition
H Sato, T Ochiai, M Delcroix, K Kinoshita, T Moriya, N Kamo
arXiv preprint arXiv:2106.00949, 2021
202021
Streaming target-speaker ASR with neural transducer
T Moriya, H Sato, T Ochiai, M Delcroix, T Shinozaki
arXiv preprint arXiv:2209.04175, 2022
132022
Distilling attention weights for CTC-based ASR systems
T Moriya, H Sato, T Tanaka, T Ashihara, R Masumura, Y Shinohara
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
132020
Listen only to me! How well can target speech extraction handle false alarms?
M Delcroix, K Kinoshita, T Ochiai, K Zmolikova, H Sato, T Nakatani
arXiv preprint arXiv:2204.04811, 2022
112022
A case report of comorbid eating disorder and factitious disorder
I Mizuta, T Fukunaga, H Sato, M Ogasawara, M Takeda, Y Inoue
Psychiatry and clinical neurosciences 54 (5), 603-606, 2000
112000
Neural Whispered Speech Detection with Imbalanced Learning.
T Ashihara, Y Shinohara, H Sato, T Moriya, K Matsui, T Fukutomi, ...
INTERSPEECH, 3352-3356, 2019
102019
Speech emotion recognition based on listener adaptive models
A Ando, R Masumura, H Sato, T Moriya, T Ashihara, Y Ijima, T Toda
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
82021
SimpleFlat: A simple whole-network pre-training approach for RNN transducer-based end-to-end speech recognition
T Moriya, T Ashihara, T Tanaka, T Ochiai, H Sato, A Ando, Y Ijima, ...
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
82021
Strategies to improve robustness of target speech extraction to enrollment variations
H Sato, T Ochiai, M Delcroix, K Kinoshita, T Moriya, N Makishima, M Ihori, ...
arXiv preprint arXiv:2206.08174, 2022
72022
Streaming End-to-End Speech Recognition for Hybrid RNN-T/Attention Architecture.
T Moriya, T Tanaka, T Ashihara, T Ochiai, H Sato, A Ando, R Masumura, ...
Interspeech, 1787-1791, 2021
72021
Hybrid RNN-T/Attention-based streaming ASR with triggered chunkwise attention and dual internal language model integration
T Moriya, T Ashihara, A Ando, H Sato, T Tanaka, K Matsuura, ...
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
62022
World's first bio-degradable actuator for removal-free implantable MEMS
H Sato, Y Inoue, M Ikeuchi, K Ikuta
2017 IEEE 30th International Conference on Micro Electro Mechanical Systems …, 2017
42017
End-to-End Automatic Speech Recognition with a Reconstruction Criterion Using Speech-to-Text and Text-to-Speech Encoder-Decoders.
R Masumura, H Sato, T Tanaka, T Moriya, Y Ijima, T Oba
INTERSPEECH, 1606-1610, 2019
32019
Improving scheduled sampling for neural transducer-based ASR
T Moriya, T Ashihara, H Sato, K Matsuura, T Tanaka, R Masumura
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
22023
Downstream task agnostic speech enhancement with self-supervised representation loss
H Sato, R Masumura, T Ochiai, M Delcroix, T Moriya, T Ashihara, ...
arXiv preprint arXiv:2305.14723, 2023
22023
Transcribing speech as spoken and written dual text using an autoregressive model
M Ihori, H Sato, T Tanaka, R Masumura, S Mizuno, N Hojo
Proc. INTERSPEECH 2023, 461-465, 2023
22023
The system can't perform the operation now. Try again later.
Articles 1–20