About: 3D lip-tracking for audio-visual speech recognition in real applications     Goto   Sponge   NotDistinct   Permalink

An Entity of Type : http://linked.opendata.cz/ontology/domain/vavai/Vysledek, within Data Space : linked.opendata.cz associated with source document(s)

AttributesValues
rdf:type
Description
  • Článek se zabývá problémem sledování 3D tvaru rtů z 2D obrazu. Procej je zaměřen na sledování rtů v audiovizuálních nahrávkách z korpusu Czech in-vehicle audio-visual speech corpus (CIVAVC). Ve skutčném prostředí automobilu se hlava pohybuje v 3D prostoru. Změna orientace rtů v 3D prostoru zhoršuje výsledky rozpoznávání řeči. Proto jsme vytvořili algoritmus, který využívá 3D model rtů pro sledování jejich tvaru a 3D prostoru. (cs)
  • In this paper, we present a solution to the problem of tracking 3D information about the shape of lips from 2D picture of a speaker. We focus on lip-tracking of audio-visual speech recordings from the Czech in-vehicle audio-visual speech corpus (CIVAVC). The corpus consists of 4 h 40 min records of audiovisual speech of driver recorded in a car during driving in an usual traffic. In real conditions a head of a speaker (a car driver) can move and turn in various directions. To cope with this movements and to avoid recognition errors caused by changing 3D position of lips, our algorithm utilizes a 3Dmodel- based approach to the lip-tracking process.
  • In this paper, we present a solution to the problem of tracking 3D information about the shape of lips from 2D picture of a speaker. We focus on lip-tracking of audio-visual speech recordings from the Czech in-vehicle audio-visual speech corpus (CIVAVC). The corpus consists of 4 h 40 min records of audiovisual speech of driver recorded in a car during driving in an usual traffic. In real conditions a head of a speaker (a car driver) can move and turn in various directions. To cope with this movements and to avoid recognition errors caused by changing 3D position of lips, our algorithm utilizes a 3Dmodel- based approach to the lip-tracking process. (en)
Title
  • 3D lip-tracking for audio-visual speech recognition in real applications
  • 3D lip-tracking for audio-visual speech recognition in real applications (en)
  • 3D sledování rtů pro audio-vizuální rozpoznávání řeči v reálných aplikacích (cs)
skos:prefLabel
  • 3D lip-tracking for audio-visual speech recognition in real applications
  • 3D lip-tracking for audio-visual speech recognition in real applications (en)
  • 3D sledování rtů pro audio-vizuální rozpoznávání řeči v reálných aplikacích (cs)
skos:notation
  • RIV/49777513:23520/04:00000149!RIV07-MSM-23520___
http://linked.open.../vavai/riv/strany
  • 2521
http://linked.open...avai/riv/aktivita
http://linked.open...avai/riv/aktivity
  • Z(MSM 235200004)
http://linked.open...iv/cisloPeriodika
  • 0
http://linked.open...vai/riv/dodaniDat
http://linked.open...aciTvurceVysledku
http://linked.open.../riv/druhVysledku
http://linked.open...iv/duvernostUdaju
http://linked.open...titaPredkladatele
http://linked.open...dnocenehoVysledku
  • 596649
http://linked.open...ai/riv/idVysledku
  • RIV/49777513:23520/04:00000149
http://linked.open...riv/jazykVysledku
http://linked.open.../riv/klicovaSlova
  • liptracking; 3D; speech recognition, templates, 3D tracking (en)
http://linked.open.../riv/klicoveSlovo
http://linked.open...odStatuVydavatele
  • KR - Korejská republika
http://linked.open...ontrolniKodProRIV
  • [C164EA247570]
http://linked.open...i/riv/nazevZdroje
  • Journal of the Acoustical Society of Korea
http://linked.open...in/vavai/riv/obor
http://linked.open...ichTvurcuVysledku
http://linked.open...cetTvurcuVysledku
http://linked.open...UplatneniVysledku
http://linked.open...v/svazekPeriodika
  • 2004
http://linked.open...iv/tvurceVysledku
  • Císař, Petr
  • Krňoul, Zdeněk
  • Železný, Miloš
http://linked.open...n/vavai/riv/zamer
issn
  • 1225-441X
number of pages
http://localhost/t...ganizacniJednotka
  • 23520
Faceted Search & Find service v1.16.118 as of Jun 21 2024


Alternative Linked Data Documents: ODE     Content Formats:   [cxml] [csv]     RDF   [text] [turtle] [ld+json] [rdf+json] [rdf+xml]     ODATA   [atom+xml] [odata+json]     Microdata   [microdata+json] [html]    About   
This material is Open Knowledge   W3C Semantic Web Technology [RDF Data] Valid XHTML + RDFa
OpenLink Virtuoso version 07.20.3240 as of Jun 21 2024, on Linux (x86_64-pc-linux-gnu), Single-Server Edition (126 GB total memory, 48 GB memory in use)
Data on this page belongs to its respective rights holders.
Virtuoso Faceted Browser Copyright © 2009-2024 OpenLink Software