1

CP-EB: Talking Face Generation with Controllable Pose and Eye Blinking Embedding

This paper proposes a talking face generation method named “CP-EB” that takes an audio signal as input and a person image as reference, …

Jianzong Wang, Yimin Deng, Ziqi Liang, Xulong Zhang, Ning Cheng, Jing Xiao

DQR-TTS: Semi-supervised Text-to-speech Synthesis with Dynamic Quantized Representation

Most existing neural-based text-to-speech methods rely on extensive datasets and face challenges under low-resource condition. In this …

Jianzong Wang, Pengcheng Li, Xulong Zhang, Ning Cheng, Jing Xiao

DQR-TTS: Semi-supervised Text-to-speech Synthesis with Dynamic Quantized Representation

PRCA: Fitting Black-Box Large Language Models for Retrieval Question Answering via Pluggable Reward-Driven Contextual Adapter

The Retrieval Question Answering (ReQA) task employs the retrieval-augmented framework, composed of a retriever and generator. The …

Haoyan Yang, Zhitao Li, Yong Zhang, Jianzong Wang, Ning Cheng, Ming Li, Jing Xiao

PRCA: Fitting Black-Box Large Language Models for Retrieval Question Answering via Pluggable Reward-Driven Contextual Adapter

VoiceExtender: Short-utterance Text-independent Speaker Verification with Guided Diffusion Model

Speaker Verification (SV) performance gets worse as utterances get shorter. To this end, we propose a new architecture called …

Yayun He, Zuheng Kang, Jianzong Wang, Junqing Peng, Jing Xiao

AOSR-Net: All-in-One Sandstorm Removal Network

Most existing sandstorm image enhancement methods are based on traditional theory and prior knowledge, which often limit their …

Yazhong Si, Xulong Zhang, Fan Yang, Jianzong Wang, Ning Cheng, Jing Xiao

Contrastive Latent Space Reconstruction Learning for Audio-Text Retrieval

Cross-modal retrieval (CMR) has been widely applied in a wide range of applications, such as multimedia search engines, recommendation …

Kaiyi Luo, Xulong Zhang, Jianzong Wang, Huaxiong Li, Ning Cheng, Jing Xiao

Contrastive Latent Space Reconstruction Learning for Audio-Text Retrieval

FastGraphTTS: An Ultrafast Syntax-Aware Speech Synthesis Framework

This paper integrates graph-to-sequence into an end-to-end text-to-speech framework for syntax-aware modelling with syntactic …

Jianzong Wang, Xulong Zhang, Aolan Sun, Ning Cheng, Jing Xiao

FastGraphTTS: An Ultrafast Syntax-Aware Speech Synthesis Framework

A Hierarchy-based Analysis Approach for Blended Learning: A Case Study with Chinese Students

Blended learning is generally defined as the combination of traditional face-to-face learning and online learning. This learning mode …

Yu Ye, Gongjin Zhang, Hongbiao Si, Liang Xu, Shenghua Hu, Yong Li, Xulong Zhang, Kaiyu Hu, Fangzhou Ye

An Empirical Study of Attention Networks for Semantic Segmentation

Semantic segmentation is a vital problem in computer vision. Recently, a common solution to semantic segmentation is the end-to-end …

Hao Guo, Hongbiao Si, Guilin Jiang, Wei Zhang, Zhiyan Liu, Xuanyi Zhu, Xulong Zhang, Yang Liu

Research on the Impact of Executive Shareholding on New Investment in Enterprises Based on Multivariable Linear Regression Model

Based on principal-agent theory and optimal contract theory, companies use the method of increasing executives’ shareholding to …

Shanyi Zhou, Ning Yan, Zhijun Li, Mo Geng, Xulong Zhang, Hongbiao Si, Lihua Tang, Wenyuan Sun, Longda Zhang, Yi Cao