Analysis of CNN-based Speech Recognition System using Raw Speech as Input D Palaz, M Magimai-Doss, R Collobert Proceedings of Interspeech, 11-15, 2015 | 377 | 2015 |
Estimating phoneme class conditional probabilities from raw speech signal using convolutional neural networks D Palaz, R Collobert, M Magimai Doss Interspeech, 1766-1770, 2013 | 255 | 2013 |
Convolutional neural networks-based continuous speech recognition using raw speech signal D Palaz, M Magimai-Doss, R Collobert Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International …, 2015 | 234 | 2015 |
End-to-end acoustic modeling using convolutional neural networks for HMM-based automatic speech recognition D Palaz, M Magimai-Doss, R Collobert Speech Communication 108, 15-32, 2019 | 170 | 2019 |
Analysis of Deep Learning Architectures for Cross-Corpus Speech Emotion Recognition. J Parry, D Palaz, G Clarke, P Lecomte, R Mead, M Berger, G Hofer Interspeech, 1656-1660, 2019 | 110 | 2019 |
Jointly Learning to Locate and Classify Words Using Convolutional Networks. D Palaz, G Synnaeve, R Collobert Interspeech, 2741-2745, 2016 | 38 | 2016 |
End-to-end phoneme sequence recognition using convolutional neural networks D Palaz, R Collobert, MM Doss arXiv preprint arXiv:1312.2137, 2013 | 38 | 2013 |
Magimai D Palaz, R Collobert Doss, M.: End-to-end phoneme sequence recognition using convolutional neural …, 2013 | 8 | 2013 |
Estimating phoneme class conditional probabilities from raw speech signal using convolutional neural networks. arXiv 2013 D Palaz, R Collobert, MM Doss arXiv preprint arXiv:1304.1018, 0 | 8 | |
Speech Emotion Recognition in the Wild using Multi-task and Adversarial Learning. J Parry, E DeMattos, A Klementiev, A Ind, D Morse-Kopp, G Clarke, ... INTERSPEECH, 1158-1162, 2022 | 7 | 2022 |
Non-Verbal Vocalisation and Laughter Detection Using Sequence-to-Sequence Models and Multi-Label Training. S Condron, G Clarke, A Klementiev, D Morse-Kopp, J Parry, D Palaz Interspeech, 2506-2510, 2021 | 7 | 2021 |
Towards end-to-end speech recognition D Palaz EPFL, 2016 | 7 | 2016 |
Learning linearly separable features for speech recognition using convolutional neural networks D Palaz, MM Doss, R Collobert arXiv preprint arXiv:1412.7110, 2014 | 7 | 2014 |
Sparse stereo image coding with learned dictionaries D Palaz, I Tošić, P Frossard 2011 18th IEEE International Conference on Image Processing, 133-136, 2011 | 5 | 2011 |
Joint phoneme segmentation inference and classification using CRFs D Palaz, M Magimai-Doss, R Collobert 2014 IEEE Global Conference on Signal and Information Processing (GlobalSIP …, 2014 | 4 | 2014 |
End-to-end acoustic modeling using convolutional neural networks for automatic speech recognition D Palaz, R Collobert Idiap, 2016 | 3 | 2016 |
Emotion Label Encoding using Word Embeddings for Speech Emotion Recognition E Stanley, E DeMattos, A Klementiev, P Ozimek, G Clarke, M Berger, ... | 3 | |
Raw speech signal-based continuous speech recognition using convolutional neural networks D Palaz, R Collobert Idiap, 2014 | 2 | 2014 |
Type of publication: Idiap-RR Citation: Palaz_Idiap-RR-24-2015 Number: Idiap-RR-24-2015 Year: 2015 Month: 6 D Palaz | | 2015 |
Type of publication: Idiap-RR Citation: Palaz_Idiap-RR-23-2015 Number: Idiap-RR-23-2015 Year: 2015 Month: 6 D Palaz | | 2015 |