Follow
Dong Zhang
Title
Cited by
Cited by
Year
Speechgpt: Empowering large language models with intrinsic cross-modal conversational abilities
D Zhang, S Li, X Zhang, J Zhan, P Wang, Y Zhou, X Qiu
EMNLP 2023 (Findings), 2023
772023
SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models
X Zhang*, D Zhang*, S Li, Y Zhou, X Qiu
ICLR 2024, 2023
132023
DUB: Discrete Unit Back-translation for Speech Translation
D Zhang, R Ye, T Ko, M Wang, Y Zhou
ACL 2023 (Findings), 2023
132023
SeqXGPT: Sentence-Level AI-Generated Text Detection
P Wang, L Li, K Ren, B Jiang, D Zhang, X Qiu
EMNLP 2023, 2023
102023
AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling
J Zhan, J Dai, J Ye, Y Zhou, D Zhang, Z Liu, X Zhang, R Yuan, G Zhang, ...
arXiv preprint arXiv:2402.12226, 2024
52024
Inferaligner: Inference-time alignment for harmlessness through cross-model guidance
P Wang, D Zhang, L Li, C Tan, X Wang, K Ren, B Jiang, X Qiu
arXiv preprint arXiv:2401.11206, 2024
42024
GroundingGPT: Language Enhanced Multi-modal Grounding Model
Z Li, Q Xu, D Zhang, H Song, Y Cai, Q Qi, R Zhou, J Pan, Z Li, VT Vu, ...
arXiv preprint arXiv:2401.06071, 2024
4*2024
SpeechAlign: Aligning Speech Generation to Human Preferences
D Zhang, Z Li, S Li, X Zhang, P Wang, Y Zhou, X Qiu
arXiv preprint arXiv:2404.05600, 2024
2024
GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators
Y Hu, C Chen, CHH Yang, R Li, D Zhang, Z Chen, ES Chng
arXiv preprint arXiv:2402.06894, 2024
2024
SpeechGPT-Gen: Scaling Chain-of-Information Speech Generation
D Zhang, X Zhang, J Zhan, S Li, Y Zhou, X Qiu
arXiv preprint arXiv:2401.13527, 2024
2024
SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent Systems
D Zhang, Z Li, P Wang, X Zhang, Y Zhou, X Qiu
arXiv preprint arXiv:2401.03945, 2024
2024
SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models
X Zhang, D Zhang, S Li, Y Zhou, X Qiu
The system can't perform the operation now. Try again later.
Articles 1–12