A review of speaker diarization: Recent advances with deep learning TJ Park, N Kanda, D Dimitriadis, KJ Han, S Watanabe, S Narayanan Computer Speech & Language 72, 101317, 2022 | 396 | 2022 |
Auto-tuning spectral clustering for speaker diarization using normalized maximum eigengap TJ Park, KJ Han, M Kumar, S Narayanan IEEE Signal Processing Letters 27, 381-385, 2019 | 141 | 2019 |
Titanet: Neural model for speaker representation with 1d depth-wise separable convolutions and global context NR Koluguri, T Park, B Ginsburg ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 115 | 2022 |
Binaural rendering method and apparatus for decoding multi channel audio YJ Lee, JI Seo, JH Yoo, SK Beack, JM Sung, TJ Lee, KO Kang, JW Kim, ... US Patent 9,319,819, 2016 | 50 | 2016 |
Musical instrument sound classification with deep convolutional neural network using feature fusion approach T Park, T Lee arXiv preprint arXiv:1512.07370, 2015 | 49 | 2015 |
Multimodal speaker segmentation and diarization using lexical and acoustic cues via sequence to sequence neural networks TJ Park, P Georgiou arXiv preprint arXiv:1805.10731, 2018 | 45 | 2018 |
Speaker diarization with lexical information TJ Park, KJ Han, J Huang, X He, B Zhou, P Georgiou, S Narayanan arXiv preprint arXiv:2004.06756, 2020 | 39 | 2020 |
Multi-scale speaker diarization with dynamic scale weighting TJ Park, NR Koluguri, J Balam, B Ginsburg arXiv preprint arXiv:2203.15974, 2022 | 29 | 2022 |
Speaker diarization using latent space clustering in generative adversarial network M Pal, M Kumar, R Peri, TJ Park, SH Kim, C Lord, S Bishop, S Narayanan ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 26 | 2020 |
Meta-learning with latent space clustering in generative adversarial network for speaker diarization M Pal, M Kumar, R Peri, TJ Park, SH Kim, C Lord, S Bishop, S Narayanan IEEE/ACM transactions on audio, speech, and language processing 29, 1204-1219, 2021 | 24 | 2021 |
Tackling dynamics in federated incremental learning with variational embedding rehearsal TJ Park, K Kumatani, D Dimitriadis arXiv preprint arXiv:2110.09695, 2021 | 21 | 2021 |
Multi-scale speaker diarization with neural affinity score fusion TJ Park, M Kumar, S Narayanan ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 19 | 2021 |
Enhancing speaker diarization with large language models: A contextual beam search approach TJ Park, K Dhawan, N Koluguri, J Balam ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 15 | 2024 |
Automatic prediction of suicidal risk in military couples using multimodal interaction cues from couples conversations SN Chakravarthula, M Nasir, SY Tseng, H Li, TJ Park, B Baucom, ... ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 15 | 2020 |
Multi-Task Discriminative Training of Hybrid DNN-TVM Model for Speaker Verification with Noisy and Far-Field Speech. A Jati, R Peri, M Pal, TJ Park, N Kumar, R Travadi, PG Georgiou, ... Interspeech, 2463-2467, 2019 | 15 | 2019 |
The Second DIHARD Challenge: System Description for USC-SAIL Team. TJ Park, M Kumar, N Flemotomos, M Pal, R Peri, R Lahiri, PG Georgiou, ... INTERSPEECH, 998-1002, 2019 | 11 | 2019 |
Encoding/decoding apparatus for processing channel signal and method therefor JI Seo, SK Beack, DY Jang, KO Kang, TJ Park, YJ Lee, KW Choi, JW Kim US Patent 10,068,579, 2018 | 11 | 2018 |
Robust multi-channel speech recognition using frequency aligned network T Park, K Kumatani, M Wu, S Sundaram ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 8 | 2020 |
Binaural rendering method and apparatus for decoding multi channel audio YJ Lee, JI Seo, JH Yoo, SK Beack, JM Sung, TJ Lee, KO Kang, JW Kim, ... US Patent 10,199,045, 2019 | 8 | 2019 |
Apparatus for processing audio signal for sound bar and method therefor JI Seo, DY Jang, TJ Park, KW Choi, KO Kang, JW Kim US Patent App. 14/760,770, 2015 | 8 | 2015 |