HTML Microdata document

This HTML5 document contains 47 embedded RDF statements represented using HTML+Microdata notation.

The embedded RDF content will be recognized by any processor of HTML5 Microdata.

Namespace Prefixes

Prefix	IRI
n20	http://linked.opendata.cz/ontology/domain/vavai/riv/typAkce/
dcterms	http://purl.org/dc/terms/
n18	http://localhost/temp/predkladatel/
n13	http://purl.org/net/nknouf/ns/bibtex#
n22	http://linked.opendata.cz/resource/domain/vavai/projekt/
n15	http://linked.opendata.cz/resource/domain/vavai/riv/tvurce/
n17	http://linked.opendata.cz/resource/domain/vavai/subjekt/
n12	http://linked.opendata.cz/ontology/domain/vavai/
n9	https://schema.org/
s	http://schema.org/
skos	http://www.w3.org/2004/02/skos/core#
rdfs	http://www.w3.org/2000/01/rdf-schema#
n3	http://linked.opendata.cz/ontology/domain/vavai/riv/
n16	http://bibframe.org/vocab/
n2	http://linked.opendata.cz/resource/domain/vavai/vysledek/
rdf	http://www.w3.org/1999/02/22-rdf-syntax-ns#
n8	http://linked.opendata.cz/ontology/domain/vavai/riv/klicoveSlovo/
n24	http://linked.opendata.cz/ontology/domain/vavai/riv/duvernostUdaju/
xsdh	http://www.w3.org/2001/XMLSchema#
n21	http://linked.opendata.cz/ontology/domain/vavai/riv/aktivita/
n14	http://linked.opendata.cz/ontology/domain/vavai/riv/jazykVysledku/
n7	http://linked.opendata.cz/ontology/domain/vavai/riv/obor/
n19	http://linked.opendata.cz/ontology/domain/vavai/riv/druhVysledku/
n4	http://linked.opendata.cz/resource/domain/vavai/vysledek/RIV%2F68407700%3A21230%2F13%3A00212560%21RIV14-GA0-21230___/
n6	http://reference.data.gov.uk/id/gregorian-year/

Statements

Subject Item: n2:RIV%2F68407700%3A21230%2F13%3A00212560%21RIV14-GA0-21230___
rdf:type: n12:Vysledek skos:Concept
rdfs:seeAlso: http://dx.doi.org/10.1007/978-3-642-40585-3_48
dcterms:description: We have proposed and evaluated a novel approach for online speaker adaptation of an acoustic model based on face recognition. Instead of traditionally used audio-based speaker identification we investigated video modality for the task of speaker detection. A simulated on-line transcription created by a Large-Vocabulary Continuous Speech Recognition (LVCSR) system for online subtitling is evaluated utilizing speaker independent acoustic models, gender dependent models and models of particular speakers. In the experiment, the speaker dependent acoustic models were trained offline, and are switched online based on the decision of the face recognizer, which reduced Word Error Rate (WER) by 12% relatively compared to speaker independent baseline system. We have proposed and evaluated a novel approach for online speaker adaptation of an acoustic model based on face recognition. Instead of traditionally used audio-based speaker identification we investigated video modality for the task of speaker detection. A simulated on-line transcription created by a Large-Vocabulary Continuous Speech Recognition (LVCSR) system for online subtitling is evaluated utilizing speaker independent acoustic models, gender dependent models and models of particular speakers. In the experiment, the speaker dependent acoustic models were trained offline, and are switched online based on the decision of the face recognizer, which reduced Word Error Rate (WER) by 12% relatively compared to speaker independent baseline system.
dcterms:title: Online Speaker Adaptation of an Acoustic Model Using Face Recognition Online Speaker Adaptation of an Acoustic Model Using Face Recognition
skos:prefLabel: Online Speaker Adaptation of an Acoustic Model Using Face Recognition Online Speaker Adaptation of an Acoustic Model Using Face Recognition
skos:notation: RIV/68407700:21230/13:00212560!RIV14-GA0-21230___
n12:predkladatel: n17:orjk%3A21230
n3:aktivita: n21:P
n3:aktivity: P(GBP103/12/G084)
n3:dodaniDat: n6:2014
n3:domaciTvurceVysledku: n15:4051351
n3:druhVysledku: n19:D
n3:duvernostUdaju: n24:S
n3:entitaPredkladatele: n4:predkladatel
n3:idSjednocenehoVysledku: 94055
n3:idVysledku: RIV/68407700:21230/13:00212560
n3:jazykVysledku: n14:eng
n3:klicovaSlova: acoustic model; face recognition; speaker adaptation; multimodal processing; automatic speech recognition
n3:klicoveSlovo: n8:face%20recognition n8:automatic%20speech%20recognition n8:multimodal%20processing n8:acoustic%20model n8:speaker%20adaptation
n3:kontrolniKodProRIV: [EC8FAB5D72FB]
n3:mistoKonaniAkce: Pilsen
n3:mistoVydani: Heidelberg
n3:nazevZdroje: Text, Speech, and Dialogue: 16th International Conference, TSD 2013
n3:obor: n7:JD
n3:pocetDomacichTvurcuVysledku: 1
n3:pocetTvurcuVysledku: 4
n3:projekt: n22:GBP103%2F12%2FG084
n3:rokUplatneniVysledku: n6:2013
n3:tvurceVysledku: Pražák, A. Campr, Pavel Psutka, J.
n3:typAkce: n20:EUR
n3:zahajeniAkce: 2013-09-01+02:00
s:issn: 0302-9743
s:numberOfPages: 8
n16:doi: 10.1007/978-3-642-40585-3_48
n13:hasPublisher: Springer-Verlag
n9:isbn: 978-3-642-40584-6
n18:organizacniJednotka: 21230