Adam Polyak
Adam Polyak
Facebook AI Research
Verified email at
Cited by
Cited by
Unsupervised cross-domain image generation
Y Taigman, A Polyak, L Wolf
arXiv preprint arXiv:1611.02200, 2016
VoiceLoop: Voice Fitting and Synthesis via a Phonological Loop
Y Taigman, L Wolf, A Polyak, E Nachmani
6th International Conference on Learning Representations, 2017
Channel-level acceleration of deep face representations
A Polyak, L Wolf
IEEE Access 3, 2163-2175, 2015
Make-a-video: Text-to-video generation without text-video data
U Singer, A Polyak, T Hayes, X Yin, J An, S Zhang, Q Hu, H Yang, ...
arXiv preprint arXiv:2209.14792, 2022
Make-a-scene: Scene-based text-to-image generation with human priors
O Gafni, A Polyak, O Ashual, S Sheynin, D Parikh, Y Taigman
Computer Vision–ECCV 2022: 17th European Conference, Tel Aviv, Israel …, 2022
A Universal Music Translation Network
N Mor, L Wolf, A Polyak, Y Taigman
7th International Conference on Learning Representations, 2019
On generative spoken language modeling from raw audio
K Lakhotia, E Kharitonov, WN Hsu, Y Adi, A Polyak, B Bolte, TA Nguyen, ...
Transactions of the Association for Computational Linguistics 9, 1336-1354, 2021
Speech resynthesis from discrete disentangled self-supervised representations
A Polyak, Y Adi, J Copet, E Kharitonov, K Lakhotia, WN Hsu, A Mohamed, ...
arXiv preprint arXiv:2104.00355, 2021
Fitting New Speakers Based on a Short Untranscribed Sample
E Nachmani, A Polyak, Y Taigman, L Wolf
Proceedings of the 35th International Conference on Machine Learning, 2018
Direct speech-to-speech translation with discrete units
A Lee, PJ Chen, C Wang, J Gu, S Popuri, X Ma, A Polyak, Y Adi, Q He, ...
arXiv preprint arXiv:2107.05604, 2021
Unsupervised creation of parameterized avatars
L Wolf, Y Taigman, A Polyak
Proceedings of the IEEE International Conference on Computer Vision, 1530-1538, 2017
Text-free prosody-aware generative spoken language modeling
E Kharitonov, A Lee, A Polyak, Y Adi, J Copet, K Lakhotia, TA Nguyen, ...
arXiv preprint arXiv:2109.03264, 2021
Audiogen: Textually guided audio generation
F Kreuk, G Synnaeve, A Polyak, U Singer, A Défossez, J Copet, D Parikh, ...
arXiv preprint arXiv:2209.15352, 2022
Knn-diffusion: Image generation via large-scale retrieval
S Sheynin, O Ashual, A Polyak, U Singer, O Gafni, E Nachmani, ...
arXiv preprint arXiv:2204.02849, 2022
Unsupervised cross-domain singing voice conversion
A Polyak, L Wolf, Y Adi, Y Taigman
arXiv preprint arXiv:2008.02830, 2020
TTS skins: Speaker conversion via ASR
A Polyak, L Wolf, Y Taigman
arXiv preprint arXiv:1904.08983, 2019
Textless speech emotion conversion using decomposed and discrete representations
F Kreuk, A Polyak, J Copet, E Kharitonov, TA Nguyen, M Rivière, WN Hsu, ...
arXiv preprint arXiv:2111.07402, 2021
Attention-based wavenet autoencoder for universal voice conversion
A Polyak, L Wolf
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
High fidelity speech regeneration with application to speech enhancement
A Polyak, L Wolf, Y Adi, O Kabeli, Y Taigman
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
fairseq S^ 2: A Scalable and Integrable Speech Synthesis Toolkit
C Wang, WN Hsu, Y Adi, A Polyak, A Lee, PJ Chen, J Gu, J Pino
arXiv preprint arXiv:2109.06912, 2021
The system can't perform the operation now. Try again later.
Articles 1–20