Attention-weighted Centered Kernel Alignment for Knowledge Distillation in Large Audio-Language Models Applied to Speech Emotion Recognition
TBD
Qingran Yang, Botao Zhao, Zuheng Kang, Xue Li, Yayun He, Chuhang Liu, Xulong Zhang, Xiaoyang Qu, Junqing Peng, Jianzong Wang
CARE: Multi-Task Pretraining for Latent Continuous Action Representation in Robot Control
TBD
Jiaqi Shi, Xulong Zhang, Xiaoyang Qu, Jianzong Wang
From Knowing to Doing Precisely: A General Self-Correction and Termination Framework for VLA Models
TBD
Wentao Zhang, Wentao Mo, Aolan Sun, Xiaoyang Qu, Yuxin Zheng, Jianzong Wang
Head-Aware Visual Cropping: Enhancing Fine-Grained VQA with Attention-Guided Subimage
TBD
Junfei Xie, Peng Pan, Xulong Zhang
MirrorTalk: Forging Personalized Avatars via Disentangled Style and Hierarchical Motion Control
TBD
Renjie Lu, Xulong Zhang, Xiaoyang Qu, Jianzong Wang, Shangfei Wang
Mita: A Hierarchical Multi-Agent Collaboration Framework with Memory-Integrated and Task Allocation
TBD
Xiaojie Zhang, Jianhan Wu, Xiaoyang Qu, Jianzong Wang
Triage: Hierarchical Visual Budgeting for Efficient Video Reasoning in Vision-Language Models
TBD
Anmin Wang, Nan Zhang, Wei Tao, Xiaoyang Qu, Guokuan Li, Jiguang Wan, Jianzong Wang
Vista: Scene-Aware Optimization for Streaming Video Question Answering under Post-Hoc Queries
TBD
Haocheng Lu, Nan Zhang, Wei Tao, Xiaoyang Qu, Guokuan Li, Jiguang Wan, Jianzong Wang
Turbo-TTS: Enhancing Diffusion Model TTS with an Improved ODE Solver
This paper introduces Turbo-TTS, a novel diffusion-based model for text-to-speech (TTS) synthesis. Diffusion models leverage stochastic …
Xulong Zhang, Jiashu Wang, Xiaoyang Qu, Hui Tian, Jianzong Wang
Springer
EMO-RL: Emotion-Rule-Based Reinforcement Learning Enhanced Audio-Language Model for Generalized Speech Emotion Recognition
Although Large Audio-Language Models (LALMs) have exhibited outstanding performance in auditory understanding, their performance in …
Pengcheng Li, Botao Zhao, Zuheng Kang, Junqing Peng, Xiaoyang Qu, Yayun He, Jianzong Wang
arXiv
ACL