Speaker recognition with hybrid features from a deep belief network

Ali, H; Tran, Son; Benetos, E; d'Avila Garcez, AS

File(s) under permanent embargo

journal contribution

posted on 2023-05-20, 17:25, authored by Ali, H; Tran, Son; Benetos, E; d'Avila Garcez, AS

Representations learned from audio data have shown advantages over handcrafted features such as mel-frequency cepstral coefficients (MFCCs) in many audio applications. Most representation learning approaches use connectionist systems to learn and extract latent features from fixed-length data. In this paper, we propose an approach that combines learned features with MFCC features for the speaker recognition task and that can be applied to audio scripts of different lengths. In particular, we study the use of features from different levels of a deep belief network (DBN) for quantizing the audio data into vectors of audio word counts. These vectors represent audio scripts of different lengths in a form that makes it easier to train a classifier. Our experiments show that audio word count vectors generated from a mixture of DBN features at different layers perform better than the MFCC features. Combining the audio word count vectors with the MFCC features yields a further improvement.
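The quantization step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes that frame-level learned features are already available as a NumPy array (the DBN itself is not modelled here), that the codebook of audio words is built with plain k-means, and that the MFCC frames are summarized by their mean before concatenation. The function names `fit_codebook`, `assign_words`, `word_count_vector` and `hybrid_vector` are hypothetical, and the abstract specifies none of these choices.

```python
import numpy as np


def assign_words(frames, codebook):
    """Return the index of the nearest codeword (Euclidean) for each frame."""
    dists = ((frames[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=2)
    return dists.argmin(axis=1)


def fit_codebook(frames, n_words, n_iter=20, seed=0):
    """Build a codebook with plain k-means (an assumed choice of quantizer)."""
    rng = np.random.default_rng(seed)
    # Initialise the codewords from randomly chosen distinct frames.
    start = rng.choice(len(frames), size=n_words, replace=False)
    codebook = frames[start].copy()
    for _ in range(n_iter):
        labels = assign_words(frames, codebook)
        for k in range(n_words):
            members = frames[labels == k]
            if len(members) > 0:  # an empty cluster keeps its old codeword
                codebook[k] = members.mean(axis=0)
    return codebook


def word_count_vector(frames, codebook):
    """Count how often each audio word occurs in one recording.

    The result has one entry per codeword, whatever the number of frames.
    """
    labels = assign_words(frames, codebook)
    return np.bincount(labels, minlength=len(codebook)).astype(float)


def hybrid_vector(learned_frames, mfcc_frames, codebook):
    """Concatenate normalised word counts with mean-pooled MFCCs (assumed)."""
    counts = word_count_vector(learned_frames, codebook)
    counts /= max(counts.sum(), 1.0)
    return np.concatenate([counts, mfcc_frames.mean(axis=0)])
```

With a codebook of 16 words, a recording of 50 frames and a recording of 400 frames both map to a 16-dimensional count vector, so a standard classifier can be trained on recordings of different lengths.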

History

Publication title

Neural Computing and Applications

Volume

29

Issue

6

Pagination

13-19

ISSN

0941-0643

Department/School

School of Information and Communication Technology

Publisher

Springer-Verlag

Place of publication

175 Fifth Ave, New York, NY 10010, USA

Repository Status

Restricted

Socio-economic Objectives

The media

Keywords

deep belief networks; deep learning; mel-frequency cepstral coefficients; speaker recognition

Licence

In Copyright
