Speech

Variational Information Bottleneck for Effective Low-Resource Audio Classification

Large-scale deep neural networks (DNNs) such as convolutional neural networks (CNNs) have achieved impressive performance in audio …

Shijing Si, Jianzong Wang, Huiming Sun, Jianhan Wu, Chuanyao Zhang, Xiaoyang Qu, Ning Cheng, Lei Chen, Jing Xiao

End-To-End Silent Speech Recognition with Acoustic Sensing

Silent speech interfaces (SSI) has been an exciting area of recent interest. In this paper, we present a non-invasive silent speech …

Jian Luo, Jianzong Wang, Ning Cheng, Guilin Jiang, Jing Xiao

End-To-End Silent Speech Recognition with Acoustic Sensing

Communication-Memory-Efficient Decentralized Learning For Audio Representation

Smartphones and wearable devices produce a wealth of audio data, which cannot be accumulated in a centralized repository for learning …

Leilai Li, Jianzong Wang, Xiaoyang Qu, Jing Xiao

Contrastive Learning for improving End-to-end Speaker Verification

Speaker verification involves examining the speech signal to authenticate the claim of a speaker as true or false. Deep neural networks …

Yanxi Tang, Jianzong Wang, Xiaoyang Qu, Jing Xiao

Effective Phase Encoding for End-To-End Speaker Verification

Click the Cite button above to demo the feature to enable visitors to import publication metadata into their reference management software.

Junyi Peng, Xiaoyang Qu, Rongzhi Gu, Jianzong Wang, Jing Xiao, Lukás Burget, Jan Cernocký

ICSpk: Interpretable Complex Speaker Embedding Extractor from Raw Waveform

Click the Cite button above to demo the feature to enable visitors to import publication metadata into their reference management software.

Junyi Peng, Xiaoyang Qu, Jianzong Wang, Rongzhi Gu, Jing Xiao, Lukás Burget, Jan Cernocký

When Hearing the Voice, Who Will Come to Your Mind

Speech is a carrier containing rich biological information, such as speaker identity information including age, gender, race. In this …

Zhenhou Hong, Jianzong Wang, Wenqi Wei, Jie Liu, Xiaoyang Qu, Bo Chen, Zihang Wei, Jing Xiao

A Real-Time Robot-Based Auxiliary System for Risk Evaluation of COVID-19 Infection

In this paper, we propose a real-time robot-based auxiliary system for risk evaluation of COVID-19 infection. It combines real-time …

Wenqi Wei, Jianzong Wang, Jiteng Ma, Ning Cheng, Jing Xiao

A Real-Time Robot-Based Auxiliary System for Risk Evaluation of COVID-19 Infection

Large-Scale Transfer Learning for Low-Resource Spoken Language Understanding

End-to-end Spoken Language Understanding (SLU) models are made increasingly large and complex to achieve the state-of-the-art accuracy. …

Xueli Jia, Jianzong Wang, Zhiyong Zhang, Ning Cheng, Jing Xiao

Large-Scale Transfer Learning for Low-Resource Spoken Language Understanding

MLNET: An Adaptive Multiple Receptive-Field Attention Neural Network for Voice Activity Detection

Voice activity detection (VAD) makes a distinction between speech and non-speech and its performance is of crucial importance for …

Zhenpeng Zheng, Jianzong Wang, Ning Cheng, Jian Luo, Jing Xiao

MLNET: An Adaptive Multiple Receptive-Field Attention Neural Network for Voice Activity Detection