Audio

Research on Singing Voice Detection Based on a Long-Term Recurrent Convolutional Network with Vocal Separation and Temporal Smoothing

Singing voice detection or vocal detection is a classification task that determines whether a given audio segment contains singing …

Xulong Zhang, Yi Yu, Yongwei Gao, Xi Chen, Wei Li

Aligntts: Efficient Feed-Forward Text-to-Speech System Without Explicit Alignment

Targeting at both high efficiency and performance, we propose AlignTTS to predict the mel-spectrum in parallel. AlignTTS is based on a …

Zhen Zeng, Jianzong Wang, Ning Cheng, Tian Xia, Jing Xiao

Aligntts: Efficient Feed-Forward Text-to-Speech System Without Explicit Alignment

GraphTTS: Graph-to-Sequence Modelling in Neural Text-to-Speech

This paper leverages the graph-to-sequence method in neural text-to-speech (GraphTTS), which maps the graph embedding of the input …

Aolan Sun, Jianzong Wang, Ning Cheng, Huayi Peng, Zhen Zeng, Jing Xiao

GraphTTS: Graph-to-Sequence Modelling in Neural Text-to-Speech

A Robust Speaker Clustering Method Based on Discrete Tied Variational Autoencoder

Click the Cite button above to demo the feature to enable visitors to import publication metadata into their reference management software.

Chen Feng, Jianzong Wang, Tongxu Li, Junqing Peng, Jing Xiao

Evolutionary Algorithm Enhanced Neural Architecture Search for Text-Independent Speaker Verification

State-of-the-art speaker verification models are based on deep learning techniques, which heavily depend on the handdesigned neural …

Xiaoyang Qu, Jianzong Wang, Jing Xiao

Singing Voice Detection Using Multi-Feature Deep Fusion with CNN

The problem of singing voice detection is to segment a song into vocal and non-vocal parts. Commonly used methods usually train a model …

Xulong Zhang, Shengchen Li, Zijin Li, Shizhe Chen, Yongwei Gao, Wei Li

Transfer Learning for Music Classification and Regression Tasks Using Artist Tags

In this paper, a transfer learning method that exploits artist tags for general-purpose music feature vector extraction is presented. …

Lei Wang, Hongning Zhu, Xulong Zhang, Shengchen Li, Wei Li

Transfer Learning for Music Classification and Regression Tasks Using Artist Tags

Composer4Everyone: Automatic Music Generation with Audio Motif

Click the Cite button above to demo the feature to enable visitors to import publication metadata into their reference management software.

Aozhi Liu, Jianzong Wang, Junqing Peng, Yiwen Wang, Yaqi Mei, Xiaojing Liang, Zimin Xia, Jing Xiao

A Novel Singer Identification Method Using GMM-UBM

This paper presents a novel method for singer identification from polyphonic music audio signals. It is based on the universal …

Xulong Zhang, Yiliang Jiang, Jin Deng, Juanjuan Li, Mi Tian, Wei Li

A Novel Singer Identification Method Using GMM-UBM

A Practical Singing Voice Detection System Based on GRU-RNN

In this paper, we present a practical three-step approach for singing voice detection based on a gated recurrent unit (GRU) recurrent …

Zhigao Chen, Xulong Zhang, Jin Deng, Juanjuan Li, Yiliang Jiang, Wei Li