Home
People
Events
Research
Publications
Contact
News
1
ESARM: 3D Emotional Speech-To-Animation via Reward Model From Automatically-Ranked Demonstrations
TBD
Xulong Zhang
,
Xiaoyang Qu
,
Haoxiang Shi
,
Chunguang Xiao
,
Jianzong Wang
Cite
Incremental Label Distribution Learning With Scalable Graph Convolutional Networks
TBD
Ziqi Jia
,
Xiaoyang Qu
,
Chenghao Liu
,
Jianzong Wang
Cite
IDEAW: Robust Neural Audio Watermarking with Invertible Dual-Embedding
The audio watermarking technique embeds messages into audio and accurately extracts messages from the watermarked audio. Traditional …
Pengcheng Li
,
Xulong Zhang
,
Jing Xiao
,
Jianzong Wang
Cite
Code
arXiv
DEMO
FormerReckoning: Physics Inspired Transformer for Accurate Inertial Navigation
TBD
Jiaqi Li
,
Chenyu Zhao
,
Yuzhu Mao
,
Xinlei Chen
,
Wenbo Ding
,
Xiaoyang Qu
,
Jianzong Wang
Cite
Beyond Aggregation: Efficient Federated Model Consolidation with Heterogeneity-Adaptive Weights Diffusion
TBD
Jiaqi Li
,
Xiaoyang Qu
,
Wenbo Ding
,
Zihao Zhao
,
Jianzong Wang
Cite
Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning
Instruction tuning is critical to improve LLMs but usually suffers from low-quality and redundant data. Data filtering for instruction …
Ming Li
,
Yong Zhang
,
Shwai He
,
Zhitao Li
,
Hongyu Zhao
,
Jianzong Wang
,
Ning Cheng
,
Tianyi Zhou
Cite
Code
arXiv
Enhancing Emotion Prediction and Recognition in Conversation through Fine-Grained Emotional Cue Analysis and Cross-Modal Fusion
The purpose of emotion recognition in conversation (ERC) is to identify the emotion category of an utterance based on contextual …
Haoxiang Shi
,
Xulong Zhang
,
Ning Cheng
,
Yong Zhang
,
Jun Yu
,
Jing Xiao
,
Jianzong Wang
Cite
arXiv
Springer
RREH: Reconstruction Relations Embedded Hashing for Semi-Paired Cross-Modal Retrieval
Known for efficient computation and easy storage, hashing has been extensively explored in cross-modal retrieval. The majority of …
Jianzong Wang
,
Haoxiang Shi
,
Kaiyi Luo
,
Xulong Zhang
,
Ning Cheng
,
Jing Xiao
Cite
arXiv
Springer
RSET: Remapping-based Sorting Method for Emotion Transfer Speech Synthesis
Although current Text-To-Speech (TTS) models are able to generate high-quality speech samples, there are still challenges in developing …
Haoxiang Shi
,
Jianzong Wang
,
Xulong Zhang
,
Ning Cheng
,
Jun Yu
,
Jing Xiao
Cite
arXiv
Springer
Retrieval-Augmented Audio Deepfake Detection
With recent advances in speech synthesis including text-to-speech (TTS) and voice conversion (VC) systems enabling the generation of …
Zuheng Kang
,
Yayun He
,
Botao Zhao
,
Xiaoyang Qu
,
Junqing Peng
,
Jing Xiao
,
Jianzong Wang
Cite
arXiv
ACM
»
Cite
×