About: Optimized Acoustic Likelihoods Computation for NVIDIA and ATI/AMD Graphics Processors

Facets (new session)
Description
Metadata
Settings
- owl:sameAs
- Inference Rule:

About: Optimized Acoustic Likelihoods Computation for NVIDIA and ATI/AMD Graphics Processors Goto Sponge NotDistinct Permalink

An Entity of Type : http://linked.opendata.cz/ontology/domain/vavai/Vysledek, within Data Space : linked.opendata.cz associated with source document(s)

Attributes	Values
rdf:type	skos:Concept http://linked.opendata.cz/ontology/domain/vavai/Vysledek
rdfs:seeAlso	http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6169951
Description	In this paper, we describe an optimized version of a Gaussian-mixture-based acoustic model likelihood evaluation algorithm for graphical processing units (GPUs). The evaluation of these likelihoods is one of the most computationally intensive parts of automatic speech recognizers, but it can be parallelized and offloaded to GPU devices. Our approach offers a significant speed-up over the recently published approaches, because it utilizes the GPU architecture in a more effective manner. All the recent implementations have been intended only for NVIDIA graphics processors, programmed either in CUDA or OpenCL GPU programming frameworks. We present results for both CUDA and OpenCL. Further, we have developed an OpenCL implementation optimized for ATI/AMD GPUs. Results suggest that even very large acoustic models can be used in real-time speech recognition engines on computers equipped with a low-end GPU or laptops. In addition, the completely asynchronous GPU management provides additional CPU resources for the decoder part of the LVCSR. The optimized implementation enables us to apply fusion techniques together with evaluating many (10 or even more) speaker-specific acoustic models. We apply this technique to a real-time parliamentary speech recognition system where the speaker changes frequently. In this paper, we describe an optimized version of a Gaussian-mixture-based acoustic model likelihood evaluation algorithm for graphical processing units (GPUs). The evaluation of these likelihoods is one of the most computationally intensive parts of automatic speech recognizers, but it can be parallelized and offloaded to GPU devices. Our approach offers a significant speed-up over the recently published approaches, because it utilizes the GPU architecture in a more effective manner. All the recent implementations have been intended only for NVIDIA graphics processors, programmed either in CUDA or OpenCL GPU programming frameworks. We present results for both CUDA and OpenCL. Further, we have developed an OpenCL implementation optimized for ATI/AMD GPUs. Results suggest that even very large acoustic models can be used in real-time speech recognition engines on computers equipped with a low-end GPU or laptops. In addition, the completely asynchronous GPU management provides additional CPU resources for the decoder part of the LVCSR. The optimized implementation enables us to apply fusion techniques together with evaluating many (10 or even more) speaker-specific acoustic models. We apply this technique to a real-time parliamentary speech recognition system where the speaker changes frequently. (en)
Title	Optimized Acoustic Likelihoods Computation for NVIDIA and ATI/AMD Graphics Processors Optimized Acoustic Likelihoods Computation for NVIDIA and ATI/AMD Graphics Processors (en)
skos:prefLabel	Optimized Acoustic Likelihoods Computation for NVIDIA and ATI/AMD Graphics Processors Optimized Acoustic Likelihoods Computation for NVIDIA and ATI/AMD Graphics Processors (en)
skos:notation	RIV/49777513:23520/12:43921648!RIV14-GA0-23520___
http://linked.open...avai/riv/aktivita	P
http://linked.open...avai/riv/aktivity	P(GBP103/12/G084)
http://linked.open...iv/cisloPeriodika	6
http://linked.open...vai/riv/dodaniDat	2014
http://linked.open...aciTvurceVysledku	Vaněk, Jan Psutka, Josef Psutka jr., Josef Trmal, Jan
http://linked.open.../riv/druhVysledku	J - Článek v odborném periodiku
http://linked.open...iv/duvernostUdaju	S - Úplné a pravdivé údaje nepodléhající ochraně podle zvláštních právních předpisů
http://linked.open...titaPredkladatele	Západočeská univerzita v Plzni / Fakulta aplikovaných věd
http://linked.open...dnocenehoVysledku	157001
http://linked.open...ai/riv/idVysledku	RIV/49777513:23520/12:43921648
http://linked.open...riv/jazykVysledku	eng - angličtina
http://linked.open.../riv/klicovaSlova	OpenCL; CUDA; GPU; software performance; parallel architectures; parallel algorithms; Automatic speech recognition (en)
http://linked.open.../riv/klicoveSlovo	OpenCL Automatic speech recognition CUDA parallel algorithms GPU parallel architectures software performance
http://linked.open...odStatuVydavatele	US - Spojené státy americké
http://linked.open...ontrolniKodProRIV	[3429813237E4]
http://linked.open...i/riv/nazevZdroje	IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING
http://linked.open...in/vavai/riv/obor	JD
http://linked.open...ichTvurcuVysledku	4 (xsd:int)
http://linked.open...cetTvurcuVysledku	4 (xsd:int)
http://linked.open...vavai/riv/projekt	Center for Large Scale Multi-modal Data Interpretation
http://linked.open...UplatneniVysledku	2012
http://linked.open...v/svazekPeriodika	20
http://linked.open...iv/tvurceVysledku	Psutka jr., Josef Psutka, Josef Vaněk, Jan Trmal, Jan
http://linked.open...ain/vavai/riv/wos	000302742400001
issn	1558-7916
number of pages	11 (xsd:int)
http://bibframe.org/vocab/doi	10.1109/TASL.2012.2190928
http://localhost/t...ganizacniJednotka	23520

Faceted Search & Find service v1.16.118 as of Jun 21 2024

Alternative Linked Data Documents: ODE Content Formats:

RDF

ODATA

Microdata

About

OpenLink Virtuoso version 07.20.3240 as of Jun 21 2024, on Linux (x86_64-pc-linux-gnu), Single-Server Edition (126 GB total memory, 58 GB memory in use)
Data on this page belongs to its respective rights holders.
Virtuoso Faceted Browser Copyright © 2009-2024 OpenLink Software