About: Speaker-clustered Acoustic Models Evaluated on GPU for on-line Subtitling of Parliament Meetings

Facets (new session)
Description
Metadata
Settings
- owl:sameAs
- Inference Rule:

About: Speaker-clustered Acoustic Models Evaluated on GPU for on-line Subtitling of Parliament Meetings Goto Sponge NotDistinct Permalink

An Entity of Type : http://linked.opendata.cz/ontology/domain/vavai/Vysledek, within Data Space : linked.opendata.cz associated with source document(s)

Attributes	Values
rdf:type	skos:Concept http://linked.opendata.cz/ontology/domain/vavai/Vysledek
Description	This paper describes the effort with building speaker-clustered acoustic models as a part of the real-time LVCSR system that is used more than one year by the Czech TV for automatic subtitling of parliament meetings broadcasted on the channel Cˇ T24. Speaker-clustered acoustic models are more acoustically homogeneous and therefore give better recognition performance than single gender-independent model or even gender-dependent models. Frequent changes of speakers and a direct connection of the LVCSR system to the audio channel require an automatic switching/fusion of models as quickly as possible. An important part of the solution is real time likelihood evaluations of all clustered acoustic models, taking advantage of a fast GPU(Graphic Processing Unit). The proposed method achieved a WER reduction to the baseline gender-independent model over 2.34% relatively with more than 2M Gaussian mixtures evaluated in real-time. This paper describes the effort with building speaker-clustered acoustic models as a part of the real-time LVCSR system that is used more than one year by the Czech TV for automatic subtitling of parliament meetings broadcasted on the channel Cˇ T24. Speaker-clustered acoustic models are more acoustically homogeneous and therefore give better recognition performance than single gender-independent model or even gender-dependent models. Frequent changes of speakers and a direct connection of the LVCSR system to the audio channel require an automatic switching/fusion of models as quickly as possible. An important part of the solution is real time likelihood evaluations of all clustered acoustic models, taking advantage of a fast GPU(Graphic Processing Unit). The proposed method achieved a WER reduction to the baseline gender-independent model over 2.34% relatively with more than 2M Gaussian mixtures evaluated in real-time. (en)
Title	Speaker-clustered Acoustic Models Evaluated on GPU for on-line Subtitling of Parliament Meetings Speaker-clustered Acoustic Models Evaluated on GPU for on-line Subtitling of Parliament Meetings (en)
skos:prefLabel	Speaker-clustered Acoustic Models Evaluated on GPU for on-line Subtitling of Parliament Meetings Speaker-clustered Acoustic Models Evaluated on GPU for on-line Subtitling of Parliament Meetings (en)
skos:notation	RIV/49777513:23520/11:43898287!RIV12-TA0-23520___
http://linked.open...avai/riv/aktivita	P
http://linked.open...avai/riv/aktivity	P(TA01011264)
http://linked.open...iv/cisloPeriodika	6836
http://linked.open...vai/riv/dodaniDat	2012
http://linked.open...aciTvurceVysledku	Vaněk, Jan Psutka, Josef Psutka jr., Josef
http://linked.open.../riv/druhVysledku	J - Článek v odborném periodiku
http://linked.open...iv/duvernostUdaju	S - Úplné a pravdivé údaje nepodléhající ochraně podle zvláštních právních předpisů
http://linked.open...titaPredkladatele	Západočeská univerzita v Plzni / Fakulta aplikovaných věd
http://linked.open...dnocenehoVysledku	231214
http://linked.open...ai/riv/idVysledku	RIV/49777513:23520/11:43898287
http://linked.open...riv/jazykVysledku	eng - angličtina
http://linked.open.../riv/klicovaSlova	speech recognition, speaker-clustered acoustic models (en)
http://linked.open.../riv/klicoveSlovo	speaker-clustered acoustic models speech recognition
http://linked.open...odStatuVydavatele	DE - Spolková republika Německo
http://linked.open...ontrolniKodProRIV	[83099AC758E3]
http://linked.open...i/riv/nazevZdroje	Lecture Notes in Computer Science
http://linked.open...in/vavai/riv/obor	JD
http://linked.open...ichTvurcuVysledku	3 (xsd:int)
http://linked.open...cetTvurcuVysledku	3 (xsd:int)
http://linked.open...vavai/riv/projekt	Elimination of the language barriers faced by the handicapped watchers of the Czech Television II
http://linked.open...UplatneniVysledku	2011
http://linked.open...v/svazekPeriodika	2011
http://linked.open...iv/tvurceVysledku	Psutka, Josef Vaněk, Jan
issn	0302-9743
number of pages	7 (xsd:int)
http://bibframe.org/vocab/doi	10.1007/978-3-642-23538-2_36
http://localhost/t...ganizacniJednotka	23520

Faceted Search & Find service v1.16.118 as of Jun 21 2024

Alternative Linked Data Documents: ODE Content Formats:

RDF

ODATA

Microdata

About

OpenLink Virtuoso version 07.20.3240 as of Jun 21 2024, on Linux (x86_64-pc-linux-gnu), Single-Server Edition (126 GB total memory, 77 GB memory in use)
Data on this page belongs to its respective rights holders.
Virtuoso Faceted Browser Copyright © 2009-2024 OpenLink Software