Dongchao Yang

引用次数

	总计	2019 年至今
引用	663	662
h 指数	13	13
i10 指数	14	14

360

180

270

202020212022202320242 15 53 351 235

开放获取的出版物数量

查看全部

7 篇文章

0 篇文章

可查看的文章

无法查看的文章

根据资助方的强制性开放获取政策

合著作者

Yuexian ZouPeking University Shenzhen Graduate School在 pku.edu.cn 的电子邮件经过验证
Helin WangJohns Hopkins University在 jh.edu 的电子邮件经过验证
Rongjie HuangZhejiang University在 zju.edu.cn 的电子邮件经过验证
Songxiang LiumiHoYo在 mihoyo.com 的电子邮件经过验证
Zhongjie YePeking University在 stu.pku.edu.cn 的电子邮件经过验证
Jinchuan TianLanguage Technologies Institute, Carnegie Mellon University在 andrew.cmu.edu 的电子邮件经过验证
Xu TanPrincipal Researcher and Research Manager, Microsoft在 microsoft.com 的电子邮件经过验证
Nuo ChenHong Kong University of Science and Technology在 connect.ust.hk 的电子邮件经过验证

关注

Dongchao Yang

The Chinese University of HongKong

在 se.cuhk.edu.hk 的电子邮件经过验证 - 首页

TTS Multi-modal Audio Fundation Models


标题按引用次数排序按年份排序按标题排序	引用次数引用次数	年份
Diffsound: Discrete diffusion model for text-to-sound generation D Yang, J Yu, H Wang, W Wang, C Weng, Y Zou, D Yu IEEE Transactions on Audio, Speech and Language Processing (TASLP)., 2023	155	2023
Make-an-audio: Text-to-audio generation with prompt-enhanced diffusion models R Huang, J Huang, D Yang*, Y Ren, L Liu, M Li, Z Ye, J Liu, X Yin, ... ICML 2023, 2023	126	2023
AudioGPT: Understanding and generating speech, music, sound, and talking head R Huang, M Li, D Yang, J Shi, X Chang, Z Ye, Y Wu, Z Hong, J Huang, ... AAAI demo, 2024, 2023	92	2023
Instructtts: Modelling expressive tts in discrete latent space with natural language style prompt D Yang, S Liu, R Huang, C Weng, H Meng arXiv preprint arXiv:2301.13662, 2023	36	2023
Towards data distillation for end-to-end spoken conversational question answering C You, N Chen, F Liu, D Yang, Y Zou arXiv preprint arXiv:2010.08923, 2020	34	2020
A Mutual learning framework for Few-shot Sound Event Detection D Yang, H Wang, Y Zou, Z Ye, W Wang ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	28*	2022
Improving the Performance of Automated Audio Captioning via Integrating the Acoustic and Semantic Information Z Ye, H Wang, D Yang, Y Zou Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE), 2021	27	2021
Hifi-codec: Group-residual vector quantization for high fidelity audio codec D Yang, S Liu, R Huang, J Tian, C Weng, Y Zou arXiv preprint arXiv:2305.02765, 2023	26	2023
UniAudio: An Audio Foundation Model Toward Universal Audio Generation D Yang, J Tian, X Tan, R Huang, S Liu, X Chang, J Shi, S Zhao, J Bian, ... arXiv preprint arXiv:2310.00704, 2023	19	2023
Improving Text-Audio Retrieval by Text-aware Attention Pooling and Prior Matrix Revised Loss Y Xin, D Yang, Y Zou ICASSP2023, 2023	15	2023
Make-a-voice: Unified voice synthesis with discrete representation R Huang, C Zhang, Y Wang, D Yang, L Liu, Z Ye, Z Jiang, C Weng, ... arXiv preprint arXiv:2305.19269, 2023	14	2023
Norespeech: Knowledge distillation based conditional diffusion model for noise-robust expressive tts D Yang, S Liu, J Yu, H Wang, C Weng, Y Zou Interspeech2023, 2022	14	2022
Audio Pyramid Transformer with Domain Adaption for Weakly Supervised Sound Event Detection and Audio Classification Y Xin, D Yang, Y Zou Proc. Interspeech 2022, 1546-1550, 2022	13	2022
Make-an-audio 2: Temporal-enhanced text-to-audio generation J Huang, Y Ren, R Huang, D Yang, Z Ye, C Zhang, J Liu, X Yin, Z Ma, ... arXiv preprint arXiv:2305.18474, 2023	11	2023
Detect what you want: Target sound detection D Yang, H Wang, Y Zou, F Cui, Y Wang Workshop on Detection and Classification of Acoustic Scenes and Events …, 2022	8	2022
Target Confusion in End-to-end Speaker Extraction: Analysis and Approaches Z Zhao, D Yang, R Gu, H Zhang, Y Zou Interspeech2022, 2022	8	2022
Prompttts 2: Describing and generating voices with text prompt Y Leng, Z Guo, K Shen, X Tan, Z Ju, Y Liu, Y Liu, D Yang, L Zhang, ... ICLR 2024, 2023	6	2023
Unsupervised multi-target domain adaptation for acoustic scene classification D Yang, H Wang, Y Zou Interspeech2021, 2021	6	2021
Improving Weakly Supervised Sound Event Detection with Causal Intervention Y Xin, D Yang, F Cui, Y Wang, Y Zou ICASSP2023, 2023	5	2023
Improving Target Sound Extraction with Timestamp Information D Yang, H Wang, C Weng, J Yu, Y Zou Proc. Interspeech 2022, 2022	4	2022

系统目前无法执行此操作，请稍后再试。

文章 1–20

每年引用数

重复的引用

合并的引用

添加合著者合著作者

关注

引用次数

合著作者