About: On Minimizing the Size of Speech Unit Database in Concatenative Speech Synthesis     Goto   Sponge   NotDistinct   Permalink

An Entity of Type : http://linked.opendata.cz/ontology/domain/vavai/Vysledek, within Data Space : linked.opendata.cz associated with source document(s)

AttributesValues
rdf:type
Description
  • V článku se zkoumají možnosti minimalizace databáze řečových jednotek za účelem získání kompaktní databáze řečových jednotek, která bude poskytovat syntetickou řeč %22rozumné kvality%22 také pro zařízení s menšími systémovými zdroji. Zaměřili jsme se zejména na přípravu databáze řečových jednotek s využitím HMM, plně automatický proces, který připravuje soubor kontextově závislých fonů (trifonů) pomocí modelování HMM, shlukování založeného na CART a segmentace s využitím HMM V článku jsou popsány tři experimenty: první experiment se týká velikosti zdrojového řečového korpusu, druhý experiment se zabývá procesem shlukování trifo (cs)
  • In this paper, minimization of speech unit database is researched in order to have a compact speech unit database yielding a %22good enough%22 synthetic speech usable also for low-resource devices. We focused mainly on HMM-based speech unit database preparation, a process which prepares a set of context-dependent phones (triphones) by means of HMM modelling, CART-based clustering, and HMM-based segmentation in a fully automatic way. Three experiments are described in the paper: the first one concerns the size of the source speech corpus, the second one deals with the triphone clustering process, and the last one concerns the modelling of the cross-word dependencies. The final minimised system exploits techniques used in all three experiments. The size of the resulting speech unit database decreased from 28.1 to 1.6 MB. The resulting synthetic speech was then judged by means of CCR listening tests and evaluated as %22slightly worse%22 than speech generated by the baseline system.
  • In this paper, minimization of speech unit database is researched in order to have a compact speech unit database yielding a %22good enough%22 synthetic speech usable also for low-resource devices. We focused mainly on HMM-based speech unit database preparation, a process which prepares a set of context-dependent phones (triphones) by means of HMM modelling, CART-based clustering, and HMM-based segmentation in a fully automatic way. Three experiments are described in the paper: the first one concerns the size of the source speech corpus, the second one deals with the triphone clustering process, and the last one concerns the modelling of the cross-word dependencies. The final minimised system exploits techniques used in all three experiments. The size of the resulting speech unit database decreased from 28.1 to 1.6 MB. The resulting synthetic speech was then judged by means of CCR listening tests and evaluated as %22slightly worse%22 than speech generated by the baseline system. (en)
Title
  • On Minimizing the Size of Speech Unit Database in Concatenative Speech Synthesis
  • On Minimizing the Size of Speech Unit Database in Concatenative Speech Synthesis (en)
  • Minimalizace velikosti databáze řečových jednotek v úloze konkatenační syntézy řeči (cs)
skos:prefLabel
  • On Minimizing the Size of Speech Unit Database in Concatenative Speech Synthesis
  • On Minimizing the Size of Speech Unit Database in Concatenative Speech Synthesis (en)
  • Minimalizace velikosti databáze řečových jednotek v úloze konkatenační syntézy řeči (cs)
skos:notation
  • RIV/49777513:23520/06:00000105!RIV07-GA0-23520___
http://linked.open.../vavai/riv/strany
  • 70-76
http://linked.open...avai/riv/aktivita
http://linked.open...avai/riv/aktivity
  • P(2C06020), P(GA102/05/0278)
http://linked.open...vai/riv/dodaniDat
http://linked.open...aciTvurceVysledku
http://linked.open.../riv/druhVysledku
http://linked.open...iv/duvernostUdaju
http://linked.open...titaPredkladatele
http://linked.open...dnocenehoVysledku
  • 490359
http://linked.open...ai/riv/idVysledku
  • RIV/49777513:23520/06:00000105
http://linked.open...riv/jazykVysledku
http://linked.open.../riv/klicovaSlova
  • speech synthesis; minimization of speech unit database; HMM modelling; HMM-based segmentation; CART clustering (en)
http://linked.open.../riv/klicoveSlovo
http://linked.open...ontrolniKodProRIV
  • [34E69137A357]
http://linked.open...v/mistoKonaniAkce
  • Praha
http://linked.open...i/riv/mistoVydani
  • Prague
http://linked.open...i/riv/nazevZdroje
  • Speech Processing
http://linked.open...in/vavai/riv/obor
http://linked.open...ichTvurcuVysledku
http://linked.open...cetTvurcuVysledku
http://linked.open...vavai/riv/projekt
http://linked.open...UplatneniVysledku
http://linked.open...iv/tvurceVysledku
  • Matoušek, Jindřich
http://linked.open...vavai/riv/typAkce
http://linked.open.../riv/zahajeniAkce
number of pages
http://purl.org/ne...btex#hasPublisher
  • Ústav radiotechniky a elektroniky AV ČR
https://schema.org/isbn
  • 80-86269-15-9
http://localhost/t...ganizacniJednotka
  • 23520
Faceted Search & Find service v1.16.118 as of Jun 21 2024


Alternative Linked Data Documents: ODE     Content Formats:   [cxml] [csv]     RDF   [text] [turtle] [ld+json] [rdf+json] [rdf+xml]     ODATA   [atom+xml] [odata+json]     Microdata   [microdata+json] [html]    About   
This material is Open Knowledge   W3C Semantic Web Technology [RDF Data] Valid XHTML + RDFa
OpenLink Virtuoso version 07.20.3240 as of Jun 21 2024, on Linux (x86_64-pc-linux-gnu), Single-Server Edition (126 GB total memory, 48 GB memory in use)
Data on this page belongs to its respective rights holders.
Virtuoso Faceted Browser Copyright © 2009-2024 OpenLink Software