Xin Wang
Title
Cited by
Year
ASVspoof 2019: Future horizons in spoofed and fake audio detection
M Todisco, X Wang, V Vestman, M Sahidullah, H Delgado, A Nautsch, ...
Proc. Interspeech, 1008-1012, 2019
Cited by: 677; Year: 2019
Investigating RNN-based speech enhancement methods for noise-robust Text-to-Speech.
C Valentini-Botinhao, X Wang, S Takaki, J Yamagishi
SSW, 146-152, 2016
Cited by: 441; Year: 2016
ASVspoof 2019: a large-scale public database of synthesized, converted and replayed speech
X Wang, J Yamagishi, M Todisco, H Delgado, A Nautsch, N Evans, ...
Computer Speech & Language, 101114, 2020
Cited by: 387; Year: 2020
ASVspoof 2021: accelerating progress in spoofed and deepfake speech detection
J Yamagishi, X Wang, M Todisco, M Sahidullah, J Patino, A Nautsch, ...
Proc. 2021 Edition of the Automatic Speaker Verification and Spoofing …, 2021
Cited by: 333; Year: 2021
Zero-shot multi-speaker text-to-speech with state-of-the-art neural speaker embeddings
E Cooper, CI Lai, Y Yasuda, F Fang, X Wang, N Chen, J Yamagishi
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
Cited by: 211; Year: 2020
A comparative study on recent neural spoofing countermeasures for synthetic speech detection
X Wang, J Yamagishi
Proc. Interspeech, 4259-4263, 2021
Cited by: 179; Year: 2021
ASVspoof 2019: spoofing countermeasures for the detection of synthesized, converted and replayed speech
A Nautsch, X Wang, N Evans, TH Kinnunen, V Vestman, M Todisco, ...
IEEE Transactions on Biometrics, Behavior, and Identity Science 3 (2), 252-265, 2021
Cited by: 174; Year: 2021
Neural source-filter-based waveform model for statistical parametric speech synthesis
X Wang, S Takaki, J Yamagishi
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
Cited by: 172; Year: 2019
Neural source-filter waveform models for statistical parametric speech synthesis
X Wang, S Takaki, J Yamagishi
IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 402-415, 2019
Cited by: 163; Year: 2019
Speaker anonymization using x-vector and neural waveform models
F Fang, X Wang, J Yamagishi, I Echizen, M Todisco, N Evans, ...
Proc. SSW, 155-160, 2019
Cited by: 156; Year: 2019
Automatic speaker verification spoofing and deepfake detection using wav2vec 2.0 and data augmentation
H Tak, M Todisco, X Wang, J Jung, J Yamagishi, N Evans
Proc. Odyssey, 2022
Cited by: 147; Year: 2022
ASVspoof 2021: Towards spoofed and deepfake speech detection in the wild
X Liu, X Wang, M Sahidullah, J Patino, H Delgado, T Kinnunen, ...
IEEE/ACM Transactions on Audio, Speech, and Language Processing (accepted), 2022
Cited by: 144; Year: 2022
Speech Enhancement for a Noise-Robust Text-to-Speech Synthesis System Using Deep Recurrent Neural Networks.
C Valentini-Botinhao, X Wang, S Takaki, J Yamagishi
Interspeech, 352-356, 2016
Cited by: 135; Year: 2016
Introducing the VoicePrivacy initiative
N Tomashenko, BML Srivastava, X Wang, E Vincent, A Nautsch, ...
Proc. Interspeech, 1693-1697, 2020
Cited by: 132; Year: 2020
Investigating self-supervised front ends for speech spoofing countermeasures
X Wang, J Yamagishi
Proc. Odyssey, 100-106, 2022
Cited by: 117; Year: 2022
The VoicePrivacy 2020 Challenge: Results and findings
N Tomashenko, X Wang, E Vincent, J Patino, BML Srivastava, PG Noé, ...
Computer Speech & Language 74, 101362, 2022
Cited by: 110; Year: 2022
Tandem assessment of spoofing countermeasures and automatic speaker verification: Fundamentals
T Kinnunen, H Delgado, N Evans, KA Lee, V Vestman, A Nautsch, ...
IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 2195-2210, 2020
Cited by: 110; Year: 2020
Investigation of enhanced Tacotron text-to-speech synthesis systems with self-attention for pitch accent language
Y Yasuda, X Wang, S Takaki, J Yamagishi
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
Cited by: 110; Year: 2019
Can we steal your vocal identity from the Internet?: Initial investigation of cloning Obama's voice using GAN, WaveNet and low-quality found data
J Lorenzo-Trueba, F Fang, X Wang, I Echizen, J Yamagishi, T Kinnunen
Proc. Speaker Odyssey, 240-247, 2018
Cited by: 88; Year: 2018
A comparison of recent waveform generation and acoustic modeling methods for neural-network-based speech synthesis
X Wang, J Lorenzo-Trueba, S Takaki, L Juvela, J Yamagishi
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
Cited by: 79; Year: 2018