This HTML5 document contains 47 embedded RDF statements represented using HTML+Microdata notation.

The embedded RDF content will be recognized by any processor of HTML5 Microdata.

Namespace Prefixes

PrefixIRI
n20http://linked.opendata.cz/ontology/domain/vavai/riv/typAkce/
dctermshttp://purl.org/dc/terms/
n18http://localhost/temp/predkladatel/
n13http://purl.org/net/nknouf/ns/bibtex#
n22http://linked.opendata.cz/resource/domain/vavai/projekt/
n15http://linked.opendata.cz/resource/domain/vavai/riv/tvurce/
n17http://linked.opendata.cz/resource/domain/vavai/subjekt/
n12http://linked.opendata.cz/ontology/domain/vavai/
n9https://schema.org/
shttp://schema.org/
skoshttp://www.w3.org/2004/02/skos/core#
rdfshttp://www.w3.org/2000/01/rdf-schema#
n3http://linked.opendata.cz/ontology/domain/vavai/riv/
n16http://bibframe.org/vocab/
n2http://linked.opendata.cz/resource/domain/vavai/vysledek/
rdfhttp://www.w3.org/1999/02/22-rdf-syntax-ns#
n8http://linked.opendata.cz/ontology/domain/vavai/riv/klicoveSlovo/
n24http://linked.opendata.cz/ontology/domain/vavai/riv/duvernostUdaju/
xsdhhttp://www.w3.org/2001/XMLSchema#
n21http://linked.opendata.cz/ontology/domain/vavai/riv/aktivita/
n14http://linked.opendata.cz/ontology/domain/vavai/riv/jazykVysledku/
n7http://linked.opendata.cz/ontology/domain/vavai/riv/obor/
n19http://linked.opendata.cz/ontology/domain/vavai/riv/druhVysledku/
n4http://linked.opendata.cz/resource/domain/vavai/vysledek/RIV%2F68407700%3A21230%2F13%3A00212560%21RIV14-GA0-21230___/
n6http://reference.data.gov.uk/id/gregorian-year/

Statements

Subject Item
n2:RIV%2F68407700%3A21230%2F13%3A00212560%21RIV14-GA0-21230___
rdf:type
n12:Vysledek skos:Concept
rdfs:seeAlso
http://dx.doi.org/10.1007/978-3-642-40585-3_48
dcterms:description
We have proposed and evaluated a novel approach for online speaker adaptation of an acoustic model based on face recognition. Instead of traditionally used audio-based speaker identification we investigated video modality for the task of speaker detection. A simulated on-line transcription created by a Large-Vocabulary Continuous Speech Recognition (LVCSR) system for online subtitling is evaluated utilizing speaker independent acoustic models, gender dependent models and models of particular speakers. In the experiment, the speaker dependent acoustic models were trained offline, and are switched online based on the decision of the face recognizer, which reduced Word Error Rate (WER) by 12% relatively compared to speaker independent baseline system. We have proposed and evaluated a novel approach for online speaker adaptation of an acoustic model based on face recognition. Instead of traditionally used audio-based speaker identification we investigated video modality for the task of speaker detection. A simulated on-line transcription created by a Large-Vocabulary Continuous Speech Recognition (LVCSR) system for online subtitling is evaluated utilizing speaker independent acoustic models, gender dependent models and models of particular speakers. In the experiment, the speaker dependent acoustic models were trained offline, and are switched online based on the decision of the face recognizer, which reduced Word Error Rate (WER) by 12% relatively compared to speaker independent baseline system.
dcterms:title
Online Speaker Adaptation of an Acoustic Model Using Face Recognition Online Speaker Adaptation of an Acoustic Model Using Face Recognition
skos:prefLabel
Online Speaker Adaptation of an Acoustic Model Using Face Recognition Online Speaker Adaptation of an Acoustic Model Using Face Recognition
skos:notation
RIV/68407700:21230/13:00212560!RIV14-GA0-21230___
n12:predkladatel
n17:orjk%3A21230
n3:aktivita
n21:P
n3:aktivity
P(GBP103/12/G084)
n3:dodaniDat
n6:2014
n3:domaciTvurceVysledku
n15:4051351
n3:druhVysledku
n19:D
n3:duvernostUdaju
n24:S
n3:entitaPredkladatele
n4:predkladatel
n3:idSjednocenehoVysledku
94055
n3:idVysledku
RIV/68407700:21230/13:00212560
n3:jazykVysledku
n14:eng
n3:klicovaSlova
acoustic model; face recognition; speaker adaptation; multimodal processing; automatic speech recognition
n3:klicoveSlovo
n8:face%20recognition n8:automatic%20speech%20recognition n8:multimodal%20processing n8:acoustic%20model n8:speaker%20adaptation
n3:kontrolniKodProRIV
[EC8FAB5D72FB]
n3:mistoKonaniAkce
Pilsen
n3:mistoVydani
Heidelberg
n3:nazevZdroje
Text, Speech, and Dialogue: 16th International Conference, TSD 2013
n3:obor
n7:JD
n3:pocetDomacichTvurcuVysledku
1
n3:pocetTvurcuVysledku
4
n3:projekt
n22:GBP103%2F12%2FG084
n3:rokUplatneniVysledku
n6:2013
n3:tvurceVysledku
Pražák, A. Campr, Pavel Psutka, J.
n3:typAkce
n20:EUR
n3:zahajeniAkce
2013-09-01+02:00
s:issn
0302-9743
s:numberOfPages
8
n16:doi
10.1007/978-3-642-40585-3_48
n13:hasPublisher
Springer-Verlag
n9:isbn
978-3-642-40584-6
n18:organizacniJednotka
21230