About: Visual Speech Segmentation and Speaker Recognition for Transcription of TV News

Facets (new session)
Description
Metadata
Settings
- owl:sameAs
- Inference Rule:

About: Visual Speech Segmentation and Speaker Recognition for Transcription of TV News Goto Sponge NotDistinct Permalink

An Entity of Type : http://linked.opendata.cz/ontology/domain/vavai/Vysledek, within Data Space : linked.opendata.cz associated with source document(s)

Attributes	Values
rdf:type	skos:Concept http://linked.opendata.cz/ontology/domain/vavai/Vysledek
Description	This paper is about a method for visual segmentation of TV news. The TV news shows are segmented according to the visual stream from the video TV recordings in this method. Human faces are found in the single visual segments with the help of the fast algorithm for face detection. The found faces are compared with the visual GMMs, that have been trained from the video picture of the single broadcasters (anchors) from the TV news. The single visual segments, where the faces of the broadcasters have been found and recognized, have been compared with the acoustic segments from the acoustic segmentation. The speaker adapted HMMs have been used for speech recognition of these acoustic segments. The recognition rate is better for the use of this speaker-adapted HMMs compared to the use of the speaker independent HMMs. This paper is about a method for visual segmentation of TV news. The TV news shows are segmented according to the visual stream from the video TV recordings in this method. Human faces are found in the single visual segments with the help of the fast algorithm for face detection. The found faces are compared with the visual GMMs, that have been trained from the video picture of the single broadcasters (anchors) from the TV news. The single visual segments, where the faces of the broadcasters have been found and recognized, have been compared with the acoustic segments from the acoustic segmentation. The speaker adapted HMMs have been used for speech recognition of these acoustic segments. The recognition rate is better for the use of this speaker-adapted HMMs compared to the use of the speaker independent HMMs. (en)
Title	Visual Speech Segmentation and Speaker Recognition for Transcription of TV News Visual Speech Segmentation and Speaker Recognition for Transcription of TV News (en)
skos:prefLabel	Visual Speech Segmentation and Speaker Recognition for Transcription of TV News Visual Speech Segmentation and Speaker Recognition for Transcription of TV News (en)
skos:notation	RIV/46747885:24220/06:#0001343!RIV10-GA0-24220___
http://linked.open...avai/riv/aktivita	P
http://linked.open...avai/riv/aktivity	P(GA102/05/0278)
http://linked.open...vai/riv/dodaniDat	2010
http://linked.open...aciTvurceVysledku	Chaloupka, Josef
http://linked.open.../riv/druhVysledku	D - Článek ve sborníku
http://linked.open...iv/duvernostUdaju	S - Úplné a pravdivé údaje nepodléhající ochraně podle zvláštních právních předpisů
http://linked.open...titaPredkladatele	Technická univerzita v Liberci / Fakulta mechatroniky, informatiky a mezioborových studií
http://linked.open...dnocenehoVysledku	506183
http://linked.open...ai/riv/idVysledku	RIV/46747885:24220/06:#0001343
http://linked.open...riv/jazykVysledku	eng - angličtina
http://linked.open.../riv/klicovaSlova	Visual speech segmentation (en)
http://linked.open.../riv/klicoveSlovo	Visual speech segmentation
http://linked.open...ontrolniKodProRIV	[6387CAD5EE5A]
http://linked.open...v/mistoKonaniAkce	Pittsburgh, USA
http://linked.open...i/riv/mistoVydani	Pittsburgh, USA
http://linked.open...i/riv/nazevZdroje	INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING
http://linked.open...in/vavai/riv/obor	JD
http://linked.open...ichTvurcuVysledku	1 (xsd:int)
http://linked.open...cetTvurcuVysledku	1 (xsd:int)
http://linked.open...vavai/riv/projekt	New trends in research and application of voice technology
http://linked.open...UplatneniVysledku	2006
http://linked.open...iv/tvurceVysledku	Chaloupka, Josef
http://linked.open...vavai/riv/typAkce	WRD - Světová
http://linked.open.../riv/zahajeniAkce	2006-01-01 (xsd:date)
number of pages	4 (xsd:int)
http://purl.org/ne...btex#hasPublisher	ISCA-INST SPEECH COMMUNICATION ASSOC
https://schema.org/isbn	978-1-60423-449-7
http://localhost/t...ganizacniJednotka	24220

Faceted Search & Find service v1.16.118 as of Jun 21 2024

Alternative Linked Data Documents: ODE Content Formats:

RDF

ODATA

Microdata

About

OpenLink Virtuoso version 07.20.3240 as of Jun 21 2024, on Linux (x86_64-pc-linux-gnu), Single-Server Edition (126 GB total memory, 48 GB memory in use)
Data on this page belongs to its respective rights holders.
Virtuoso Faceted Browser Copyright © 2009-2024 OpenLink Software