About: Speech-To-Text Technology to Transcribe and Disclose 100,000+ Hours of Bilingual Documents from Historical Czech and Czechoslovak Radio Archive

Facets (new session)
Description
Metadata
Settings
- owl:sameAs
- Inference Rule:

About: Speech-To-Text Technology to Transcribe and Disclose 100,000+ Hours of Bilingual Documents from Historical Czech and Czechoslovak Radio Archive Goto Sponge NotDistinct Permalink

An Entity of Type : http://linked.opendata.cz/ontology/domain/vavai/Vysledek, within Data Space : linked.opendata.cz associated with source document(s)

Attributes	Values
rdf:type	skos:Concept http://linked.opendata.cz/ontology/domain/vavai/Vysledek
Description	In this paper, we present the outcome of a 4-year project whose ultimate goal is to develop a complex platform that can transcribe, index and make searchable the historical archive of Czech and Czechoslovak Radio. The archive covers 90 years of public broadcasting and contains hundreds of thousands audio documents. The developed modular platform employs our LVCSR system that has to cope with 2 related languages: Czech and Slovak. Furthermore, it must deal with audio files of varying quality (e.g. recordings originally stored on matrices or tapes, data passed through analog and digital telephone lines, speech recorded during parliament or court sessions, etc.) The system includes speaker and language identification modules, a narrow-band signal detector, a music/song detector, and several other components to enhance transcription accuracy and provide support for multi-optional search. We evaluate the performance on broadcast news test sets grouped according to decades. We show that after acoustic and language model adaptation WER values are in range 8-14% and do not differ much since 1960s to present. We report also results achieved on other types of documents (e.g. talk shows, political debates, public speeches, etc), where the WER is higher but still acceptable for most search tasks. In this paper, we present the outcome of a 4-year project whose ultimate goal is to develop a complex platform that can transcribe, index and make searchable the historical archive of Czech and Czechoslovak Radio. The archive covers 90 years of public broadcasting and contains hundreds of thousands audio documents. The developed modular platform employs our LVCSR system that has to cope with 2 related languages: Czech and Slovak. Furthermore, it must deal with audio files of varying quality (e.g. recordings originally stored on matrices or tapes, data passed through analog and digital telephone lines, speech recorded during parliament or court sessions, etc.) The system includes speaker and language identification modules, a narrow-band signal detector, a music/song detector, and several other components to enhance transcription accuracy and provide support for multi-optional search. We evaluate the performance on broadcast news test sets grouped according to decades. We show that after acoustic and language model adaptation WER values are in range 8-14% and do not differ much since 1960s to present. We report also results achieved on other types of documents (e.g. talk shows, political debates, public speeches, etc), where the WER is higher but still acceptable for most search tasks. (en)
Title	Speech-To-Text Technology to Transcribe and Disclose 100,000+ Hours of Bilingual Documents from Historical Czech and Czechoslovak Radio Archive Speech-To-Text Technology to Transcribe and Disclose 100,000+ Hours of Bilingual Documents from Historical Czech and Czechoslovak Radio Archive (en)
skos:prefLabel	Speech-To-Text Technology to Transcribe and Disclose 100,000+ Hours of Bilingual Documents from Historical Czech and Czechoslovak Radio Archive Speech-To-Text Technology to Transcribe and Disclose 100,000+ Hours of Bilingual Documents from Historical Czech and Czechoslovak Radio Archive (en)
skos:notation	RIV/46747885:24220/14:#0003002!RIV15-MK0-24220___
http://linked.open...avai/riv/aktivita	P
http://linked.open...avai/riv/aktivity	P(DF11P01OVV013)
http://linked.open...vai/riv/dodaniDat	2015
http://linked.open...aciTvurceVysledku	Rott, Michal Chaloupka, Josef Blavka, Karel Boháč, Marek Žďánský, Jindřich Červa, Petr Nouza, Jan Málek, J. Silovský, Jan Kuchařová, Michaela Šeps, Ladislav
http://linked.open.../riv/druhVysledku	D - Článek ve sborníku
http://linked.open...iv/duvernostUdaju	S - Úplné a pravdivé údaje nepodléhající ochraně podle zvláštních právních předpisů
http://linked.open...titaPredkladatele	Technická univerzita v Liberci / Fakulta mechatroniky, informatiky a mezioborových studií
http://linked.open...dnocenehoVysledku	46605
http://linked.open...ai/riv/idVysledku	RIV/46747885:24220/14:#0003002
http://linked.open...riv/jazykVysledku	eng - angličtina
http://linked.open.../riv/klicovaSlova	spoken archive; speech recognition; speaker recognition; anguage identification; spoken term search (en)
http://linked.open.../riv/klicoveSlovo	speaker recognition speech recognition anguage identification spoken archive spoken term search
http://linked.open...ontrolniKodProRIV	[8E72AA22347B]
http://linked.open...v/mistoKonaniAkce	Singapore
http://linked.open...i/riv/mistoVydani	Singapore
http://linked.open...i/riv/nazevZdroje	Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
http://linked.open...in/vavai/riv/obor	JC
http://linked.open...ichTvurcuVysledku	11 (xsd:int)
http://linked.open...cetTvurcuVysledku	11 (xsd:int)
http://linked.open...vavai/riv/projekt	Disclosure of the Czech Radio archive for sophisticated search
http://linked.open...UplatneniVysledku	2014
http://linked.open...iv/tvurceVysledku	Blavka, Karel Boháč, Marek Chaloupka, Josef Málek, Jiří Nouza, Jan Silovský, Jan Červa, Petr Žďánský, Jindřich Kuchařová, Michaela Šeps, Ladislav Rott, Michal
http://linked.open...vavai/riv/typAkce	WRD - Světová
http://linked.open.../riv/zahajeniAkce	2014-01-01 (xsd:date)
issn	2308-457X
number of pages	5 (xsd:int)
http://purl.org/ne...btex#hasPublisher	ISCA
http://localhost/t...ganizacniJednotka	24220

Faceted Search & Find service v1.16.118 as of Jun 21 2024

Alternative Linked Data Documents: ODE Content Formats:

RDF

ODATA

Microdata

About

OpenLink Virtuoso version 07.20.3240 as of Jun 21 2024, on Linux (x86_64-pc-linux-gnu), Single-Server Edition (126 GB total memory, 58 GB memory in use)
Data on this page belongs to its respective rights holders.
Virtuoso Faceted Browser Copyright © 2009-2024 OpenLink Software