Unsupervised cross-domain image generation Y Taigman, A Polyak, L Wolf arXiv preprint arXiv:1611.02200, 2016 | 1088 | 2016 |
VoiceLoop: Voice Fitting and Synthesis via a Phonological Loop Y Taigman, L Wolf, A Polyak, E Nachmani 6th International Conference on Learning Representations, 2017 | 188 | 2017 |
Channel-level acceleration of deep face representations A Polyak, L Wolf IEEE Access 3, 2163-2175, 2015 | 178 | 2015 |
Make-A-Video: Text-to-video generation without text-video data U Singer, A Polyak, T Hayes, X Yin, J An, S Zhang, Q Hu, H Yang, ... arXiv preprint arXiv:2209.14792, 2022 | 146 | 2022 |
Make-A-Scene: Scene-based text-to-image generation with human priors O Gafni, A Polyak, O Ashual, S Sheynin, D Parikh, Y Taigman Computer Vision–ECCV 2022: 17th European Conference, Tel Aviv, Israel …, 2022 | 142 | 2022 |
A Universal Music Translation Network N Mor, L Wolf, A Polyak, Y Taigman 7th International Conference on Learning Representations, 2019 | 137 | 2019 |
On generative spoken language modeling from raw audio K Lakhotia, E Kharitonov, WN Hsu, Y Adi, A Polyak, B Bolte, TA Nguyen, ... Transactions of the Association for Computational Linguistics 9, 1336-1354, 2021 | 126 | 2021 |
Speech resynthesis from discrete disentangled self-supervised representations A Polyak, Y Adi, J Copet, E Kharitonov, K Lakhotia, WN Hsu, A Mohamed, ... arXiv preprint arXiv:2104.00355, 2021 | 120 | 2021 |
Fitting New Speakers Based on a Short Untranscribed Sample E Nachmani, A Polyak, Y Taigman, L Wolf Proceedings of the 35th International Conference on Machine Learning, 2018 | 91 | 2018 |
Direct speech-to-speech translation with discrete units A Lee, PJ Chen, C Wang, J Gu, S Popuri, X Ma, A Polyak, Y Adi, Q He, ... arXiv preprint arXiv:2107.05604, 2021 | 63 | 2021 |
Unsupervised creation of parameterized avatars L Wolf, Y Taigman, A Polyak Proceedings of the IEEE International Conference on Computer Vision, 1530-1538, 2017 | 50 | 2017 |
Text-free prosody-aware generative spoken language modeling E Kharitonov, A Lee, A Polyak, Y Adi, J Copet, K Lakhotia, TA Nguyen, ... arXiv preprint arXiv:2109.03264, 2021 | 46 | 2021 |
AudioGen: Textually guided audio generation F Kreuk, G Synnaeve, A Polyak, U Singer, A Défossez, J Copet, D Parikh, ... arXiv preprint arXiv:2209.15352, 2022 | 37 | 2022 |
KNN-Diffusion: Image generation via large-scale retrieval S Sheynin, O Ashual, A Polyak, U Singer, O Gafni, E Nachmani, ... arXiv preprint arXiv:2204.02849, 2022 | 37 | 2022 |
Unsupervised cross-domain singing voice conversion A Polyak, L Wolf, Y Adi, Y Taigman arXiv preprint arXiv:2008.02830, 2020 | 36 | 2020 |
TTS skins: Speaker conversion via ASR A Polyak, L Wolf, Y Taigman arXiv preprint arXiv:1904.08983, 2019 | 33 | 2019 |
Textless speech emotion conversion using decomposed and discrete representations F Kreuk, A Polyak, J Copet, E Kharitonov, TA Nguyen, M Rivière, WN Hsu, ... arXiv preprint arXiv:2111.07402, 2021 | 27 | 2021 |
Attention-based WaveNet autoencoder for universal voice conversion A Polyak, L Wolf ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 26 | 2019 |
High fidelity speech regeneration with application to speech enhancement A Polyak, L Wolf, Y Adi, O Kabeli, Y Taigman ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 16 | 2021 |
fairseq S^2: A Scalable and Integrable Speech Synthesis Toolkit C Wang, WN Hsu, Y Adi, A Polyak, A Lee, PJ Chen, J Gu, J Pino arXiv preprint arXiv:2109.06912, 2021 | 12 | 2021 |