Home
People
Events
Research
Publications
Contact
News
1
Cocktait:Chunk-AdaptiveMixed-PrecisionOuanization for Long-Context LLM inference
TBD
Wei Tao
,
Bin Zhang
,
Xiaoyang Qu
,
Jiguang Wan
,
Jianzong Wang
Cite
Incremental Label Distribution Learning With Scalable Graph Convolutional Networks
Label Distribution Learning (LDL) is an effective approach for handling label ambiguity, as it can analyze all labels at once and …
Ziqi Jia
,
Xiaoyang Qu
,
Chenghao Liu
,
Jianzong Wang
Cite
arXiv
IDEAW: Robust Neural Audio Watermarking with Invertible Dual-Embedding
The audio watermarking technique embeds messages into audio and accurately extracts messages from the watermarked audio. Traditional …
Pengcheng Li
,
Xulong Zhang
,
Jing Xiao
,
Jianzong Wang
Cite
Code
arXiv
ACL
DEMO
FormerReckoning: Physics Inspired Transformer for Accurate Inertial Navigation
Although modern localization methods have achieved remarkable accuracy with various sensors, there are still some circumstances where …
Jiaqi Li
,
Chenyu Zhao
,
Yuzhu Mao
,
Xinlei Chen
,
Wenbo Ding
,
Xiaoyang Qu
,
Jianzong Wang
PDF
Cite
ACM
Beyond Aggregation: Efficient Federated Model Consolidation with Heterogeneity-Adaptive Weights Diffusion
TBD
Jiaqi Li
,
Xiaoyang Qu
,
Wenbo Ding
,
Zihao Zhao
,
Jianzong Wang
Cite
Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning
Instruction tuning is critical to improve LLMs but usually suffers from low-quality and redundant data. Data filtering for instruction …
Ming Li
,
Yong Zhang
,
Shwai He
,
Zhitao Li
,
Hongyu Zhao
,
Jianzong Wang
,
Ning Cheng
,
Tianyi Zhou
Cite
Code
arXiv
Enhancing Emotion Prediction and Recognition in Conversation through Fine-Grained Emotional Cue Analysis and Cross-Modal Fusion
The purpose of emotion recognition in conversation (ERC) is to identify the emotion category of an utterance based on contextual …
Haoxiang Shi
,
Xulong Zhang
,
Ning Cheng
,
Yong Zhang
,
Jun Yu
,
Jing Xiao
,
Jianzong Wang
Cite
arXiv
Springer
RREH: Reconstruction Relations Embedded Hashing for Semi-Paired Cross-Modal Retrieval
Known for efficient computation and easy storage, hashing has been extensively explored in cross-modal retrieval. The majority of …
Jianzong Wang
,
Haoxiang Shi
,
Kaiyi Luo
,
Xulong Zhang
,
Ning Cheng
,
Jing Xiao
Cite
arXiv
Springer
RSET: Remapping-based Sorting Method for Emotion Transfer Speech Synthesis
Although current Text-To-Speech (TTS) models are able to generate high-quality speech samples, there are still challenges in developing …
Haoxiang Shi
,
Jianzong Wang
,
Xulong Zhang
,
Ning Cheng
,
Jun Yu
,
Jing Xiao
Cite
arXiv
Springer
Retrieval-Augmented Audio Deepfake Detection
With recent advances in speech synthesis including text-to-speech (TTS) and voice conversion (VC) systems enabling the generation of …
Zuheng Kang
,
Yayun He
,
Botao Zhao
,
Xiaoyang Qu
,
Junqing Peng
,
Jing Xiao
,
Jianzong Wang
Cite
arXiv
ACM
»
Cite
×