A review of deep learning techniques for speech processing A Mehrish, N Majumder, R Bharadwaj, R Mihalcea, S Poria Information Fusion 99, 101869, 2023 | 174 | 2023 |
Text-to-audio generation using instruction-tuned LLM and latent diffusion model D Ghosal, N Majumder, A Mehrish, S Poria arXiv preprint arXiv:2304.13731, 2023 | 130 | 2023 |
Text-to-audio generation using instruction guided latent diffusion model D Ghosal, N Majumder, A Mehrish, S Poria Proceedings of the 31st ACM International Conference on Multimedia, 3590-3598, 2023 | 30 | 2023 |
Speaker embeddings for diarization of broadcast data in the ALLIES challenge A Larcher, A Mehrish, M Tahon, S Meignier, J Carrive, D Doukhan, ... ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 17 | 2021 |
Evaluating parameter-efficient transfer learning approaches on SURE benchmark for speech understanding Y Li, A Mehrish, R Bhardwaj, N Majumder, B Cheng, S Zhao, A Zadeh, ... ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 16 | 2023 |
Robust PRNU estimation from probabilistic raw measurements A Mehrish, AV Subramanyam, S Emmanuel Signal Processing: Image Communication 66, 30-41, 2018 | 16 | 2018 |
Sensor pattern noise estimation using probabilistically estimated RAW values A Mehrish, AV Subramanyam, S Emmanuel IEEE Signal Processing Letters 23 (5), 693-697, 2016 | 14 | 2016 |
Joint spatial and discrete cosine transform domain-based counter forensics for adaptive contrast enhancement A Mehrish, AV Subramanyam, S Emmanuel IEEE Access 7, 27183-27195, 2019 | 13 | 2019 |
Multimedia signatures for vehicle forensics A Mehrish, AV Subramanyam, M Kankanhalli 2017 IEEE International Conference on Multimedia and Expo (ICME), 685-690, 2017 | 13 | 2017 |
Anti-forensic technique for median filtering using L1-L2 TV model S Sharma, AV Subramanyam, M Jain, A Mehrish, S Emmanuel 2016 IEEE International Workshop on Information Forensics and Security (WIFS …, 2016 | 11 | 2016 |
AdapterMix: Exploring the efficacy of mixture of adapters for low-resource TTS adaptation A Mehrish, AR Kashyap, L Yingting, N Majumder, S Poria arXiv preprint arXiv:2305.18028, 2023 | 7 | 2023 |
Learning accent representation with multi-level VAE towards controllable speech synthesis J Melechovsky, A Mehrish, D Herremans, B Sisman 2022 IEEE Spoken Language Technology Workshop (SLT), 928-935, 2023 | 6 | 2023 |
Egocentric analysis of dash-cam videos for vehicle forensics A Mehrish, P Singh, P Jain, AV Subramanyam, M Kankanhalli IEEE Transactions on Circuits and Systems for Video Technology 30 (9), 3000-3014, 2019 | 6 | 2019 |
Improving text-to-audio models with synthetic captions Z Kong, S Lee, D Ghosal, N Majumder, A Mehrish, R Valle, S Poria, ... arXiv preprint arXiv:2406.15487, 2024 | 5 | 2024 |
CM-TTS: Enhancing Real Time Text-to-Speech Synthesis Efficiency through Weighted Samplers and Consistency Models X Li, F Bu, A Mehrish, Y Li, J Han, B Cheng, S Poria arXiv preprint arXiv:2404.00569, 2024 | 5 | 2024 |
Accented text-to-speech synthesis with a conditional variational autoencoder J Melechovsky, A Mehrish, B Sisman, D Herremans arXiv preprint arXiv:2211.03316, 2022 | 5 | 2022 |
Towards lifelong human assisted speaker diarization M Shamsi, A Larcher, L Barrault, S Meignier, Y Prokopalo, M Tahon, ... Computer Speech & Language 77, 101437, 2023 | 4 | 2023 |
Precoding based on signal-to-leakage and noise ratio to reduce ICI in MIMO-OFDM systems A Mehrish, H Kumar, A Goswami International Journal of Computer Applications 975, 8887, 2014 | 3 | 2014 |
Accent Conversion in Text-To-Speech Using Multi-Level VAE and Adversarial Training J Melechovsky, A Mehrish, B Sisman, D Herremans arXiv preprint arXiv:2406.01018, 2024 | 1 | 2024 |
HyperTTS: Parameter Efficient Adaptation in Text to Speech using Hypernetworks Y Li, R Bhardwaj, A Mehrish, B Cheng, S Poria arXiv preprint arXiv:2404.04645, 2024 | 1 | 2024 |