About: On Complementarity of State-of-the-art Speaker Recognition Systems


An Entity of Type: http://linked.opendata.cz/ontology/domain/vavai/Vysledek, within Data Space: linked.opendata.cz, associated with source document(s)

Attributes	Values
rdf:type	skos:Concept http://linked.opendata.cz/ontology/domain/vavai/Vysledek
Description	In this paper, recent methods used in the task of Speaker Recognition (SR) are reviewed and their complementarity is analysed. First, methods based on Supervectors (SVs) related to Gaussian Mixture Models (GMMs) and Support Vector Machines (SVMs) used as a discriminative model are described, along with the Nuisance Attribute Projection (NAP). NAP was proposed to suppress the undesirable influence of high channel variability between several sessions of a speaker. Next, recent methods focusing on the extraction of so-called i-vectors (low-dimensional representations of GMM-based SVs) are discussed. The space in which i-vectors lie is denoted the Total Variability Space (TVS), since it contains both between-speaker and session/channel variabilities. Once i-vectors have been extracted, a Probabilistic Linear Discriminant Analysis (PLDA) model is trained in the TVS. In the training phase of PLDA, the TVS is decomposed into a channel subspace and a speaker subspace; hence each i-vector is assumed to be composed of a speaker identity component and a channel component. The complementarity of PLDA-based and SVM-based modelling techniques is examined using linear logistic regression as a fusion tool to combine the verification scores of the individual systems, leading to significant reductions in the error rates of the SR system. The results are presented on the NIST SRE 2008 and NIST SRE 2010 corpora. (en)
Title	On Complementarity of State-of-the-art Speaker Recognition Systems (en)
skos:prefLabel	On Complementarity of State-of-the-art Speaker Recognition Systems (en)
skos:notation	RIV/49777513:23520/12:43916022!RIV13-GA0-23520___
http://linked.open...avai/predkladatel	Fakulta aplikovaných věd
http://linked.open...avai/riv/aktivita	P
http://linked.open...avai/riv/aktivity	P(GBP103/12/G084)
http://linked.open...vai/riv/dodaniDat	2013
http://linked.open...aciTvurceVysledku	Zajíc, Zbyněk; Müller, Luděk; Machlica, Lukáš
http://linked.open.../riv/druhVysledku	D - Article in proceedings
http://linked.open...iv/duvernostUdaju	S - Complete and true data not subject to protection under special legal regulations
http://linked.open...titaPredkladatele	Západočeská univerzita v Plzni / Fakulta aplikovaných věd
http://linked.open...dnocenehoVysledku	156193
http://linked.open...ai/riv/idVysledku	RIV/49777513:23520/12:43916022
http://linked.open...riv/jazykVysledku	eng - English
http://linked.open.../riv/klicovaSlova	speaker recognition; supervector; fusion; PLDA; i-vector; NAP; SVM (en)
http://linked.open.../riv/klicoveSlovo	SVM; speaker recognition; i-vector; PLDA; supervector; NAP; fusion
http://linked.open...ontrolniKodProRIV	[02AB6DA54232]
http://linked.open...v/mistoKonaniAkce	Vietnam, Ho Chi Minh City
http://linked.open...i/riv/mistoVydani	Not stated
http://linked.open...i/riv/nazevZdroje	IEEE International Symposium on Signal Processing and Information Technology
http://linked.open...in/vavai/riv/obor	JD
http://linked.open...ichTvurcuVysledku	3 (xsd:int)
http://linked.open...cetTvurcuVysledku	3 (xsd:int)
http://linked.open...vavai/riv/projekt	Center for Large Scale Multi-modal Data Interpretation
http://linked.open...UplatneniVysledku	2012
http://linked.open...iv/tvurceVysledku	Machlica, Lukáš; Zajíc, Zbyněk; Müller, Luděk
http://linked.open...vavai/riv/typAkce	WRD - Worldwide
http://linked.open.../riv/zahajeniAkce	2012-12-12 (xsd:date)
number of pages	6 (xsd:int)
http://purl.org/ne...btex#hasPublisher	Institute of Electrical and Electronics Engineers ( IEEE )
https://schema.org/isbn	978-1-4673-5604-6
http://localhost/t...ganizacniJednotka	23520
is http://linked.open...avai/riv/vysledek of	On Complementarity of State-of-the-art Speaker Recognition Systems
