Home
People
Events
Research
Publications
Contact
News
1
Task-Agnostic Decision Transformer for Multi-Type Agent Control with Federated Split Training
With the rapid advancements in artificial intelligence, the development of knowledgeable and personalized agents has become …
Zhiyuan Wang
,
Bokui Chen
,
Xiaoyang Qu
,
Zhenhou Hong
,
Jing Xiao
,
Jianzong Wang
Cite
arXiv
IEEE
From Quantity to Quality: Boosting LLM Performance with Self-Guided Data Selection for Instruction Tuning
In the realm of Large Language Models, the balance between instruction data quality and quantity has become a focal point. Recognizing …
Ming Li
,
Yong Zhang
,
Zhitao Li
,
Jiuhai Chen
,
Lichang Chen
,
Ning Cheng
,
Jianzong Wang
,
Tianyi Zhou
,
Jing Xiao
Cite
Code
arXiv
Medical Speech Symptoms Classification via Disentangled Representation
Intent is defined for understanding spoken language in existing works. Both textual features and acoustic features involved in medical …
Jianzong Wang
,
Pengcheng Li
,
Xulong Zhang
,
Ning Cheng
,
Jing Xiao
Cite
arXiv
IEEE
Gecko: Resource-Efficient and Accurate Queries in Real-Time Video Streams at the Edge
Surveillance cameras are ubiquitous nowadays and users’ increasing needs for accessing real-world information (e.g., finding abandoned …
Liang Wang
,
Xiaoyang Qu
,
Jianzong Wang
,
Guokuan Li
,
Jiguang Wan
,
Nan Zhang
,
Song Guo
,
Jing Xiao
PDF
Cite
EmoTalker: Emotionally Editable Talking Face Generation via Diffusion Model
In recent years, the field of talking faces generation has attracted considerable attention, with certain methods adept at generating …
Bingyuan Zhang
,
Xulong Zhang
,
Ning Cheng
,
Jun Yu
,
Jing Xiao
,
Jianzong Wang
Cite
arXiv
Dataset
IEEE
ED-TTS: Multi-Scale Emotion Modeling Using Cross-Domain Emotion Diarization for Emotional Speech Synthesis
Existing emotional speech synthesis methods often utilize an utterance-level style embedding extracted from reference audio, neglecting …
Haobin Tang
,
Xulong Zhang
,
Ning Cheng
,
Jing Xiao
,
Jianzong Wang
Cite
arXiv
IEEE
Learning Disentangled Speech Representations with Contrastive Learning and Time-Invariant Retrieval
Voice conversion refers to transferring speaker identity with well-preserved content. Better disentanglement of speech representations …
Yimin Deng
,
Huaizhen Tang
,
Xulong Zhang
,
Ning Cheng
,
Jing Xiao
,
Jianzong Wang
Cite
arXiv
IEEE
INCPrompt: Task-Aware Incremental Prompting for Rehearsal-Free Class-incremental Learning
This paper introduces INCPrompt, an innovative continual learning solution that effectively addresses catastrophic forgetting. …
Zhiyuan Wang
,
Xiaoyang Qu
,
Jing Xiao
,
Bokui Chen
,
Jianzong Wang
Cite
arXiv
IEEE
Leveraging Biases in Large Language Models: bias-kNN for Effective Few-Shot Learning
Large Language Models (LLMs) have shown significant promise in various applications, including zero-shot and few-shot learning. …
Yong Zhang
,
Hanzhang Li
,
Zhitao Li
,
Ning Cheng
,
Ming Li
,
Jing Xiao
,
Jianzong Wang
Cite
arXiv
IEEE
P2DT: Mitigating Forgetting in Task-Incremental Learning with Progressive Prompt Decision Transformer
Catastrophic forgetting poses a substantial challenge for managing intelligent agents controlled by a large model, causing performance …
Zhiyuan Wang
,
Xiaoyang Qu
,
Jing Xiao
,
Bokui Chen
,
Jianzong Wang
Cite
arXiv
IEEE
«
»
Cite
×