Speech
MLNET: An Adaptive Multiple Receptive-Field Attention Neural Network for Voice Activity Detection
Voice activity detection (VAD) distinguishes speech from non-speech, and its performance is of crucial importance for …
Zhenpeng Zheng, Jianzong Wang, Ning Cheng, Jian Luo, Jing Xiao
arXiv
ISCA
A Robust Speaker Clustering Method Based on Discrete Tied Variational Autoencoder
Chen Feng, Jianzong Wang, Tongxu Li, Junqing Peng, Jing Xiao
IEEE
Evolutionary Algorithm Enhanced Neural Architecture Search for Text-Independent Speaker Verification
State-of-the-art speaker verification models are based on deep learning techniques, which heavily depend on the hand-designed neural …
Xiaoyang Qu, Jianzong Wang, Jing Xiao
arXiv
ISCA
A Noise-Robust Self-Adaptive Multitarget Speaker Detection System
Siqi Zheng, Jianzong Wang, Jing Xiao, Wei-Ning Hsu, James R. Glass
Prioritized Grid Highway Long Short-Term Memory-Based Universal Background Model for Speaker Verification
Prioritized grid long short-term memory (pGLSTM) has been shown to improve automatic speech recognition efficiently. In this paper, we …
Jianzong Wang, Hui Guo, Jing Xiao
Springer