Publications

Conference

[C1] Leying Zhang, Yao Qian, Long Zhou, Shujie Liu, Dongmei Wang, Xiaofei Wang, Midia Yousefi, Yanmin Qian, Jinyu Li, Lei He, Sheng Zhao, Michael Zeng. “CoVoMix: Advancing Zero-Shot Speech Generation for Human-like Multi-talker Conversations”. 38th Annual Conference on Neural Information Processing Systems (NeurIPS), Dec. 2024 [Paper]

[C2] Leying Zhang, Yao Qian, Linfeng Yu, Heming Wang, Hemin Yang, Shujie Liu, Long Zhou, Yanmin Qian. “DDTSE: Discriminative Diffusion Model for Target Speech Extraction”. IEEE Spoken Language Technology Workshop (SLT), Dec. 2024 [Paper]

[C3] Leying Zhang, Zhengyang Chen and Yanmin Qian. “Adaptive Large Margin Fine-tuning for Speaker Verification”. 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), June 2023 [Paper]

[C4] Leying Zhang, Zhengyang Chen* and Yanmin Qian. “Enroll-Aware Attentive Statistics Pooling for Target Speaker Verification”. 23rd Annual Conference of the International Speech Communication Association (InterSpeech), Sep. 2022 [Paper]

[C5] Leying Zhang, Zhengyang Chen and Yanmin Qian. “Knowledge Distillation from Multi-Modality to Single-Modality for Person Verification”. 22nd Annual Conference of the International Speech Communication Association (InterSpeech), Sep. 2021 [Paper]

[C6] Yichong Leng, Zhifang Guo, Kai Shen, Xu Tan, Zeqian Ju, Yanqing Liu, Yufei Liu, Dongchao Yang, Leying Zhang, Kaitao Song, Lei He, Xiang-Yang Li, Sheng Zhao, Tao Qin and Jiang Bian. “PromptTTS 2: Describing and Generating Voices with Text Prompt”. International Conference on Learning Representations (ICLR), 2024 [Paper]

[C7] Linfeng Yu, Wangyou Zhang, Chenpeng Du, Leying Zhang, Zheng Liang and Yanmin Qian. “Generation-Based Target Speech Extraction with Speech Discretization and Vocoder”. 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), April 2024 [Paper]

[C8] Zhengyang Chen, Bei Liu, Bing Han, Leying Zhang and Yanmin Qian. “The SJTU X-LANCE Lab System for CNSRC 2022”. arXiv preprint, May 2023 [Paper]