About: Online Speaker Adaptation of an Acoustic Model Using Face Recognition

An Entity of Type : http://linked.opendata.cz/ontology/domain/vavai/Vysledek, within Data Space : linked.opendata.cz associated with source document(s)

Attributes	Values
rdf:type	skos:Concept http://linked.opendata.cz/ontology/domain/vavai/Vysledek
rdfs:seeAlso	http://dx.doi.org/10.1007/978-3-642-40585-3_48
Description	We have proposed and evaluated a novel approach for online speaker adaptation of an acoustic model based on face recognition. Instead of the traditionally used audio-based speaker identification, we investigated the video modality for the task of speaker detection. A simulated online transcription created by a Large-Vocabulary Continuous Speech Recognition (LVCSR) system for online subtitling is evaluated using speaker-independent acoustic models, gender-dependent models, and models of particular speakers. In the experiment, the speaker-dependent acoustic models were trained offline and switched online based on the decision of the face recognizer, which reduced the Word Error Rate (WER) by 12% relative to the speaker-independent baseline system. (en)
Title	Online Speaker Adaptation of an Acoustic Model Using Face Recognition (en)
skos:prefLabel	Online Speaker Adaptation of an Acoustic Model Using Face Recognition (en)
skos:notation	RIV/68407700:21230/13:00212560!RIV14-GA0-21230___
http://linked.open...avai/riv/aktivita	P
http://linked.open...avai/riv/aktivity	P(GBP103/12/G084)
http://linked.open...vai/riv/dodaniDat	2014
http://linked.open...aciTvurceVysledku	Campr, Pavel
http://linked.open.../riv/druhVysledku	D - Article in proceedings
http://linked.open...iv/duvernostUdaju	S - Complete and truthful data not subject to protection under special legal regulations
http://linked.open...titaPredkladatele	České vysoké učení technické v Praze / Fakulta elektrotechnická
http://linked.open...dnocenehoVysledku	94055
http://linked.open...ai/riv/idVysledku	RIV/68407700:21230/13:00212560
http://linked.open...riv/jazykVysledku	eng - English
http://linked.open.../riv/klicovaSlova	acoustic model; face recognition; speaker adaptation; multimodal processing; automatic speech recognition (en)
http://linked.open.../riv/klicoveSlovo	automatic speech recognition; face recognition; acoustic model; speaker adaptation; multimodal processing
http://linked.open...ontrolniKodProRIV	[EC8FAB5D72FB]
http://linked.open...v/mistoKonaniAkce	Pilsen
http://linked.open...i/riv/mistoVydani	Heidelberg
http://linked.open...i/riv/nazevZdroje	Text, Speech, and Dialogue: 16th International Conference, TSD 2013
http://linked.open...in/vavai/riv/obor	JD
http://linked.open...ichTvurcuVysledku	1 (xsd:int)
http://linked.open...cetTvurcuVysledku	4 (xsd:int)
http://linked.open...vavai/riv/projekt	Center for Large Scale Multi-modal Data Interpretation
http://linked.open...UplatneniVysledku	2013
http://linked.open...iv/tvurceVysledku	Campr, Pavel; Psutka, J.; Pražák, A.
http://linked.open...vavai/riv/typAkce	EUR - European
http://linked.open.../riv/zahajeniAkce	2013-09-01 (xsd:date)
issn	0302-9743
number of pages	8 (xsd:int)
http://bibframe.org/vocab/doi	10.1007/978-3-642-40585-3_48
http://purl.org/ne...btex#hasPublisher	Springer-Verlag
https://schema.org/isbn	978-3-642-40584-6
http://localhost/t...ganizacniJednotka	21230
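
The Description above says that speaker-dependent acoustic models are trained offline and switched online according to the face recognizer's decision, with WER reduced by 12% relative to the speaker-independent baseline. The following is only a minimal sketch of that switching idea and of how a relative WER reduction is computed. It is not the authors' implementation: the model identifiers, the confidence threshold, the gender fallback, and the function names are all hypothetical, and the WER figures in the usage note are made up for illustration.

```python
from typing import Optional

# Hypothetical model identifiers; the record does not name the actual models.
SPEAKER_INDEPENDENT = "speaker_independent"
GENDER_MODELS = {"female": "gender_female", "male": "gender_male"}
SPEAKER_MODELS = {"speaker_a": "sd_speaker_a", "speaker_b": "sd_speaker_b"}


def select_acoustic_model(face_id: Optional[str],
                          confidence: float,
                          gender: Optional[str] = None,
                          threshold: float = 0.8) -> str:
    """Pick an acoustic model from a face recognizer decision.

    Assumed fallback order: speaker-dependent model, then gender-dependent
    model, then the speaker-independent baseline. The threshold value and
    the fallback order are assumptions, not taken from the record.
    """
    # Use the speaker's own model only when the face is known and trusted.
    if face_id in SPEAKER_MODELS and confidence >= threshold:
        return SPEAKER_MODELS[face_id]
    # Otherwise fall back to a gender-dependent model if gender is known.
    if gender in GENDER_MODELS:
        return GENDER_MODELS[gender]
    # Last resort: the speaker-independent baseline model.
    return SPEAKER_INDEPENDENT


def relative_wer_reduction(baseline_wer: float, adapted_wer: float) -> float:
    """Return the WER reduction as a fraction of the baseline WER."""
    if baseline_wer <= 0:
        raise ValueError("baseline_wer must be positive")
    return (baseline_wer - adapted_wer) / baseline_wer
```

With illustrative numbers, a baseline WER of 25.0 and an adapted WER of 22.0 give relative_wer_reduction(25.0, 22.0) of 0.12, that is, a 12% relative reduction even though the absolute difference is 3 percentage points.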