About: Audio-Visual Speech Recognition in Noisy Audio Environments

Facets (new session)
Description
Metadata
Settings
- owl:sameAs
- Inference Rule:

About: Audio-Visual Speech Recognition in Noisy Audio Environments Goto Sponge NotDistinct Permalink

An Entity of Type : http://linked.opendata.cz/ontology/domain/vavai/Vysledek, within Data Space : linked.opendata.cz associated with source document(s)

Attributes	Values
rdf:type	skos:Concept http://linked.opendata.cz/ontology/domain/vavai/Vysledek
Description	It is a well-known fact that the visual part of speech can improve the resulting recognition rate mainly in noisy conditions. Main goal of this work is to find a set of visual features which would be possible to use in our audio-visual speech recognition systems. Discrete Cosine Transform (DCT) and Active Appearance Model (AAM) based visual features are extracted from visual speech signals, enhanced by a simplified variant of Hierarchical Linear Discriminant Analysis (HiLDA) and normalized across speakers. The visual features are then combined with standard MFCC audio features by the middle fusion method. The results from audio-visual speech recognition are compared with the results from experiments where the log-spectra minimum mean square error and multiband spectral subtraction methods for reducing additive noise in the audio signal are used. It is a well-known fact that the visual part of speech can improve the resulting recognition rate mainly in noisy conditions. Main goal of this work is to find a set of visual features which would be possible to use in our audio-visual speech recognition systems. Discrete Cosine Transform (DCT) and Active Appearance Model (AAM) based visual features are extracted from visual speech signals, enhanced by a simplified variant of Hierarchical Linear Discriminant Analysis (HiLDA) and normalized across speakers. The visual features are then combined with standard MFCC audio features by the middle fusion method. The results from audio-visual speech recognition are compared with the results from experiments where the log-spectra minimum mean square error and multiband spectral subtraction methods for reducing additive noise in the audio signal are used. (en)
Title	Audio-Visual Speech Recognition in Noisy Audio Environments Audio-Visual Speech Recognition in Noisy Audio Environments (en)
skos:prefLabel	Audio-Visual Speech Recognition in Noisy Audio Environments Audio-Visual Speech Recognition in Noisy Audio Environments (en)
skos:notation	RIV/46747885:24220/13:#0002802!RIV14-MSM-24220___
http://linked.open...avai/predkladatel	Fakulta mechatroniky, informatiky a mezioborových studií
http://linked.open...avai/riv/aktivita	S
http://linked.open...avai/riv/aktivity	S
http://linked.open...vai/riv/dodaniDat	2014
http://linked.open...aciTvurceVysledku	Chaloupka, Josef Paleček, Karel
http://linked.open.../riv/druhVysledku	D - Článek ve sborníku
http://linked.open...iv/duvernostUdaju	S - Úplné a pravdivé údaje nepodléhající ochraně podle zvláštních právních předpisů
http://linked.open...titaPredkladatele	Technická univerzita v Liberci / Fakulta mechatroniky, informatiky a mezioborových studií
http://linked.open...dnocenehoVysledku	62506
http://linked.open...ai/riv/idVysledku	RIV/46747885:24220/13:#0002802
http://linked.open...riv/jazykVysledku	eng - angličtina
http://linked.open.../riv/klicovaSlova	AAM; Audio visual speech recognition; Discrete Cosine Transform(DCT); HiLDA; Linear discriminant analysis; Minimum mean square errors; Multi-band spectral subtractions; Visual speech features (en)
http://linked.open.../riv/klicoveSlovo	AAM Audio visual speech recognition Discrete Cosine Transform(DCT) HiLDA Linear discriminant analysis Minimum mean square errors Multi-band spectral subtractions Visual speech features
http://linked.open...ontrolniKodProRIV	[AD2CBFB0FAEE]
http://linked.open...v/mistoKonaniAkce	Itálie
http://linked.open...i/riv/nazevZdroje	Proc. of 36th International Conference on Telecommunications and Signal Processing (TSP 2013)
http://linked.open...in/vavai/riv/obor	JC
http://linked.open...ichTvurcuVysledku	2 (xsd:int)
http://linked.open...cetTvurcuVysledku	2 (xsd:int)
http://linked.open...UplatneniVysledku	2013
http://linked.open...iv/tvurceVysledku	Chaloupka, Josef Paleček, Karel
http://linked.open...vavai/riv/typAkce	WRD - Světová
http://linked.open.../riv/zahajeniAkce	2013-01-01 (xsd:date)
number of pages	4 (xsd:int)
http://bibframe.org/vocab/doi	10.1109/TSP.2013.6613979
http://purl.org/ne...btex#hasPublisher	Neuveden
https://schema.org/isbn	9781479904044
http://localhost/t...ganizacniJednotka	24220

Faceted Search & Find service v1.16.116 as of Feb 22 2024

Alternative Linked Data Documents: ODE Content Formats:

RDF

ODATA

Microdata

About

OpenLink Virtuoso version 07.20.3239 as of Feb 22 2024, on Linux (x86_64-pc-linux-gnu), Single-Server Edition (126 GB total memory, 82 GB memory in use)
Data on this page belongs to its respective rights holders.
Virtuoso Faceted Browser Copyright © 2009-2024 OpenLink Software