Home
People
Events
Research
Publications
Contact
News
1
Incremental Label Distribution Learning With Scalable Graph Convolutional Networks
Label Distribution Learning (LDL) is an effective approach for handling label ambiguity, as it can analyze all labels at once and …
Ziqi Jia
,
Xiaoyang Qu
,
Chenghao Liu
,
Jianzong Wang
Cite
arXiv
IDEAW: Robust Neural Audio Watermarking with Invertible Dual-Embedding
The audio watermarking technique embeds messages into audio and accurately extracts messages from the watermarked audio. Traditional …
Pengcheng Li
,
Xulong Zhang
,
Jing Xiao
,
Jianzong Wang
Cite
Code
arXiv
ACL
DEMO
FormerReckoning: Physics Inspired Transformer for Accurate Inertial Navigation
Although modern localization methods have achieved remarkable accuracy with various sensors, there are still some circumstances where …
Jiaqi Li
,
Chenyu Zhao
,
Yuzhu Mao
,
Xinlei Chen
,
Wenbo Ding
,
Xiaoyang Qu
,
Jianzong Wang
PDF
Cite
ACM
Beyond Aggregation: Efficient Federated Model Consolidation with Heterogeneity-Adaptive Weights Diffusion
As the Internet of Things (IoT) evolves, the need for enhanced data-sharing to improve edge device performance has led to the adoption …
Jiaqi Li
,
Xiaoyang Qu
,
Wenbo Ding
,
Zihao Zhao
,
Jianzong Wang
Cite
ACM
Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning
Instruction tuning is critical to improve LLMs but usually suffers from low-quality and redundant data. Data filtering for instruction …
Ming Li
,
Yong Zhang
,
Shwai He
,
Zhitao Li
,
Hongyu Zhao
,
Jianzong Wang
,
Ning Cheng
,
Tianyi Zhou
Cite
Code
arXiv
Enhancing Emotion Prediction and Recognition in Conversation through Fine-Grained Emotional Cue Analysis and Cross-Modal Fusion
The purpose of emotion recognition in conversation (ERC) is to identify the emotion category of an utterance based on contextual …
Haoxiang Shi
,
Xulong Zhang
,
Ning Cheng
,
Yong Zhang
,
Jun Yu
,
Jing Xiao
,
Jianzong Wang
Cite
arXiv
Springer
RREH: Reconstruction Relations Embedded Hashing for Semi-Paired Cross-Modal Retrieval
Known for efficient computation and easy storage, hashing has been extensively explored in cross-modal retrieval. The majority of …
Jianzong Wang
,
Haoxiang Shi
,
Kaiyi Luo
,
Xulong Zhang
,
Ning Cheng
,
Jing Xiao
Cite
arXiv
Springer
RSET: Remapping-based Sorting Method for Emotion Transfer Speech Synthesis
Although current Text-To-Speech (TTS) models are able to generate high-quality speech samples, there are still challenges in developing …
Haoxiang Shi
,
Jianzong Wang
,
Xulong Zhang
,
Ning Cheng
,
Jun Yu
,
Jing Xiao
Cite
arXiv
Springer
Retrieval-Augmented Audio Deepfake Detection
With recent advances in speech synthesis including text-to-speech (TTS) and voice conversion (VC) systems enabling the generation of …
Zuheng Kang
,
Yayun He
,
Botao Zhao
,
Xiaoyang Qu
,
Junqing Peng
,
Jing Xiao
,
Jianzong Wang
Cite
arXiv
ACM
CONTUNER: Singing Voice Beautifying with Pitch and Expressiveness Condition
Singing voice beautifying is a novel task that has application value in people’s daily life, aiming to correct the pitch of the …
Jianzong Wang
,
Pengcheng Li
,
Xulong Zhang
,
Ning Cheng
,
Jing Xiao
Cite
arXiv
DEMO
IEEE
«
»
Cite
×