Ambuj Mehrish

Cited by

	All	Since 2020
Citations	654	627
h-index	11	9
i10-index	11	9

400

200

100

300

20162017201820192020202120222023202420253 4 6 13 12 12 14 79 383 124

Public access

View all

6 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Soujanya PoriaAssociate Professor, Singapore University of Technology and DesignVerified email at sutd.edu.sg
Navonil MajumderSingapore University of Technology and DesignVerified email at sutd.edu.sg
A V SubramanyamProfessorVerified email at iiitd.ac.in
Rishabh BhardwajSingapore University of Technology and DesignVerified email at mymail.sutd.edu.sg
Sabu EmmanuelSingapore Institute of Technology (SIT), SingaporeVerified email at singaporetech.edu.sg
marie tahonLIUM / Le Mans UniversitéVerified email at univ-lemans.fr
Anthony LarcherProfessor Le Mans UniversitéVerified email at univ-lemans.fr
Mohan KankanhalliProfessor of Computer Science, National University of SingaporeVerified email at comp.nus.edu.sg
Shishir SharmaMcGill UniversityVerified email at mail.mcgill.ca

Ambuj Mehrish

Research Fellow, Singapore University of Technology and Design, Singapore

Verified email at sutd.edu.sg

Signal Processing Multimedia Forensics Speech and Language Processing Deep Learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
A review of deep learning techniques for speech processing A Mehrish, N Majumder, R Bharadwaj, R Mihalcea, S Poria Information Fusion 99, 101869, 2023	266	2023
Text-to-audio generation using instruction guided latent diffusion model D Ghosal, N Majumder, A Mehrish, S Poria Proceedings of the 31st ACM International Conference on Multimedia, 3590-3598, 2023	215	2023
Evaluating parameter-efficient transfer learning approaches on sure benchmark for speech understanding Y Li, A Mehrish, R Bhardwaj, N Majumder, B Cheng, S Zhao, A Zadeh, ... ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	21	2023
Speaker embeddings for diarization of broadcast data in the allies challenge A Larcher, A Mehrish, M Tahon, S Meignier, J Carrive, D Doukhan, ... ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	19	2021
Robust PRNU estimation from probabilistic raw measurements A Mehrish, AV Subramanyam, S Emmanuel Signal Processing: Image Communication 66, 30-41, 2018	16	2018
Joint spatial and discrete cosine transform domain-based counter forensics for adaptive contrast enhancement A Mehrish, AV Subramanyam, S Emmanuel IEEE access 7, 27183-27195, 2019	14	2019
Multimedia signatures for vehicle forensics A Mehrish, AV Subramanyam, M Kankanhalli 2017 IEEE International Conference on Multimedia and Expo (ICME), 685-690, 2017	14	2017
Sensor pattern noise estimation using probabilistically estimated RAW values A Mehrish, AV Subramanyam, S Emmanuel IEEE Signal Processing Letters 23 (5), 693-697, 2016	14	2016
Improving text-to-audio models with synthetic captions Z Kong, S Lee, D Ghosal, N Majumder, A Mehrish, R Valle, S Poria, ... arXiv preprint arXiv:2406.15487, 2024	13	2024
Adaptermix: Exploring the efficacy of mixture of adapters for low-resource tts adaptation A Mehrish, AR Kashyap, L Yingting, N Majumder, S Poria arXiv preprint arXiv:2305.18028, 2023	11	2023
Anti-forensic technique for median filtering using L₁-L₂ TV model S Sharma, AV Subramanyam, M Jain, A Mehrish, S Emmanuel 2016 IEEE International Workshop on Information Forensics and Security (WIFS …, 2016	11	2016
Egocentric analysis of dash-cam videos for vehicle forensics A Mehrish, P Singh, P Jain, AV Subramanyam, M Kankanhalli IEEE Transactions on Circuits and Systems for Video Technology 30 (9), 3000-3014, 2019	8	2019
Accented text-to-speech synthesis with a conditional variational autoencoder J Melechovsky, A Mehrish, B Sisman, D Herremans TENCON 2024-2024 IEEE Region 10 Conference (TENCON), 343-346, 2024	6	2024
Cm-tts: Enhancing real time text-to-speech synthesis efficiency through weighted samplers and consistency models X Li, F Bu, A Mehrish, Y Li, J Han, B Cheng, S Poria arXiv preprint arXiv:2404.00569, 2024	6	2024
Learning accent representation with multi-level vae towards controllable speech synthesis J Melechovsky, A Mehrish, D Herremans, B Sisman 2022 IEEE Spoken Language Technology Workshop (SLT), 928-935, 2023	6	2023
Towards lifelong human assisted speaker diarization M Shamsi, A Larcher, L Barrault, S Meignier, Y Prokopalo, M Tahon, ... Computer Speech & Language 77, 101437, 2023	4	2023
Precoding based on signal-to-leakage and noise ratio to reduce ICI in MIMO-OFDM systems A Mehrish, H Kumar, A Goswami International Journal of Computer Applications 975, 8887, 2014	3	2014
Accent Conversion in Text-To-Speech Using Multi-Level VAE and Adversarial Training J Melechovsky, A Mehrish, B Sisman, D Herremans arXiv preprint arXiv:2406.01018, 2024	2	2024
HyperTTS: Parameter efficient adaptation in text to speech using hypernetworks Y Li, R Bhardwaj, A Mehrish, B Cheng, S Poria arXiv preprint arXiv:2404.04645, 2024	2	2024
Text-to-audio generation using instruction-tuned llm and latent diffusion model G Deepanway, M Navonil, M Ambuj, P Soujanya arXiv preprint arXiv:2304.13731, 2023	2	2023

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors