Ambuj Mehrish
Title
Cited by
Year
A review of deep learning techniques for speech processing
A Mehrish, N Majumder, R Bhardwaj, R Mihalcea, S Poria
Information Fusion 99, 101869, 2023
Cited by: 174, Year: 2023
Text-to-audio generation using instruction-tuned LLM and latent diffusion model
D Ghosal, N Majumder, A Mehrish, S Poria
arXiv preprint arXiv:2304.13731, 2023
Cited by: 130, Year: 2023
Text-to-audio generation using instruction guided latent diffusion model
D Ghosal, N Majumder, A Mehrish, S Poria
Proceedings of the 31st ACM International Conference on Multimedia, 3590-3598, 2023
Cited by: 30, Year: 2023
Speaker embeddings for diarization of broadcast data in the ALLIES challenge
A Larcher, A Mehrish, M Tahon, S Meignier, J Carrive, D Doukhan, ...
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Cited by: 17, Year: 2021
Evaluating parameter-efficient transfer learning approaches on SURE benchmark for speech understanding
Y Li, A Mehrish, R Bhardwaj, N Majumder, B Cheng, S Zhao, A Zadeh, ...
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by: 16, Year: 2023
Robust PRNU estimation from probabilistic raw measurements
A Mehrish, AV Subramanyam, S Emmanuel
Signal Processing: Image Communication 66, 30-41, 2018
Cited by: 16, Year: 2018
Sensor pattern noise estimation using probabilistically estimated RAW values
A Mehrish, AV Subramanyam, S Emmanuel
IEEE Signal Processing Letters 23 (5), 693-697, 2016
Cited by: 14, Year: 2016
Joint spatial and discrete cosine transform domain-based counter forensics for adaptive contrast enhancement
A Mehrish, AV Subramanyam, S Emmanuel
IEEE Access 7, 27183-27195, 2019
Cited by: 13, Year: 2019
Multimedia signatures for vehicle forensics
A Mehrish, AV Subramanyam, M Kankanhalli
2017 IEEE International Conference on Multimedia and Expo (ICME), 685-690, 2017
Cited by: 13, Year: 2017
Anti-forensic technique for median filtering using L1-L2 TV model
S Sharma, AV Subramanyam, M Jain, A Mehrish, S Emmanuel
2016 IEEE International Workshop on Information Forensics and Security (WIFS …, 2016
Cited by: 11, Year: 2016
AdapterMix: Exploring the efficacy of mixture of adapters for low-resource TTS adaptation
A Mehrish, AR Kashyap, L Yingting, N Majumder, S Poria
arXiv preprint arXiv:2305.18028, 2023
Cited by: 7, Year: 2023
Learning accent representation with multi-level VAE towards controllable speech synthesis
J Melechovsky, A Mehrish, D Herremans, B Sisman
2022 IEEE Spoken Language Technology Workshop (SLT), 928-935, 2023
Cited by: 6, Year: 2023
Egocentric analysis of dash-cam videos for vehicle forensics
A Mehrish, P Singh, P Jain, AV Subramanyam, M Kankanhalli
IEEE Transactions on Circuits and Systems for Video Technology 30 (9), 3000-3014, 2019
Cited by: 6, Year: 2019
Improving text-to-audio models with synthetic captions
Z Kong, S Lee, D Ghosal, N Majumder, A Mehrish, R Valle, S Poria, ...
arXiv preprint arXiv:2406.15487, 2024
Cited by: 5, Year: 2024
CM-TTS: Enhancing Real Time Text-to-Speech Synthesis Efficiency through Weighted Samplers and Consistency Models
X Li, F Bu, A Mehrish, Y Li, J Han, B Cheng, S Poria
arXiv preprint arXiv:2404.00569, 2024
Cited by: 5, Year: 2024
Accented text-to-speech synthesis with a conditional variational autoencoder
J Melechovsky, A Mehrish, B Sisman, D Herremans
arXiv preprint arXiv:2211.03316, 2022
Cited by: 5, Year: 2022
Towards lifelong human assisted speaker diarization
M Shamsi, A Larcher, L Barrault, S Meignier, Y Prokopalo, M Tahon, ...
Computer Speech & Language 77, 101437, 2023
Cited by: 4, Year: 2023
Precoding based on signal-to-leakage and noise ratio to reduce ICI in MIMO-OFDM systems
A Mehrish, H Kumar, A Goswami
International Journal of Computer Applications 975, 8887, 2014
Cited by: 3, Year: 2014
Accent Conversion in Text-To-Speech Using Multi-Level VAE and Adversarial Training
J Melechovsky, A Mehrish, B Sisman, D Herremans
arXiv preprint arXiv:2406.01018, 2024
Cited by: 1, Year: 2024
HYPERTTS: Parameter Efficient Adaptation in Text to Speech using Hypernetworks
Y Li, R Bhardwaj, A Mehrish, B Cheng, S Poria
arXiv preprint arXiv:2404.04645, 2024
Cited by: 1, Year: 2024