HTML Microdata document

This HTML5 document contains 48 embedded RDF statements represented using HTML+Microdata notation.

The embedded RDF content will be recognized by any processor of HTML5 Microdata.
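
As a rough illustration of what a Microdata consumer does with such a page, the sketch below collects `itemprop` name/value pairs using only the Python standard library. The `SAMPLE` markup and the class name `ItempropCollector` are hypothetical, since the actual HTML of this page is not shown here, and the code is deliberately simplified: it ignores `itemscope` nesting and `itemtype`, so it is not a conforming Microdata processor.

```python
from html.parser import HTMLParser

# Hypothetical markup; the real page's HTML is an assumption.
SAMPLE = """
<div itemscope itemid="http://example.org/item/1">
  <span itemprop="http://purl.org/dc/terms/title">Compression of Semistructured Documents</span>
  <span itemprop="http://schema.org/issn">1305-2403</span>
  <link itemprop="http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
        href="http://www.w3.org/2004/02/skos/core#Concept">
</div>
"""


class ItempropCollector(HTMLParser):
    """Collect (property, value) pairs from itemprop attributes."""

    def __init__(self):
        super().__init__()
        self.props = []        # collected (property IRI, value) pairs
        self._pending = None   # property waiting for its text content

    def handle_starttag(self, tag, attrs):
        attributes = dict(attrs)
        prop = attributes.get("itemprop")
        if prop is None:
            return
        if "href" in attributes:
            # Link-like elements carry their value in href.
            self.props.append((prop, attributes["href"]))
        else:
            # Otherwise the value is the element's text content.
            self._pending = prop

    def handle_data(self, data):
        text = data.strip()
        if self._pending is not None and text:
            self.props.append((self._pending, text))
            self._pending = None


collector = ItempropCollector()
collector.feed(SAMPLE)
for prop, value in collector.props:
    print(prop, "->", value)
```
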

Namespace Prefixes

Prefix	IRI
dcterms	http://purl.org/dc/terms/
n15	http://localhost/temp/predkladatel/
n7	http://linked.opendata.cz/resource/domain/vavai/projekt/
n6	http://linked.opendata.cz/resource/domain/vavai/riv/tvurce/
n13	http://linked.opendata.cz/ontology/domain/vavai/
n17	http://linked.opendata.cz/resource/domain/vavai/zamer/
s	http://schema.org/
skos	http://www.w3.org/2004/02/skos/core#
n4	http://linked.opendata.cz/ontology/domain/vavai/riv/
n14	http://linked.opendata.cz/resource/domain/vavai/vysledek/RIV%2F00216208%3A11320%2F07%3A00005175%21RIV08-AV0-11320___/
n2	http://linked.opendata.cz/resource/domain/vavai/vysledek/
rdf	http://www.w3.org/1999/02/22-rdf-syntax-ns#
n8	http://linked.opendata.cz/ontology/domain/vavai/riv/klicoveSlovo/
n19	http://linked.opendata.cz/ontology/domain/vavai/riv/duvernostUdaju/
xsdh	http://www.w3.org/2001/XMLSchema#
n12	http://linked.opendata.cz/ontology/domain/vavai/riv/jazykVysledku/
n10	http://linked.opendata.cz/ontology/domain/vavai/riv/aktivita/
n18	http://linked.opendata.cz/ontology/domain/vavai/riv/obor/
n16	http://linked.opendata.cz/ontology/domain/vavai/riv/druhVysledku/
n9	http://reference.data.gov.uk/id/gregorian-year/
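
The statements that follow use prefixed names such as `skos:Concept`; each one stands for the prefix's IRI from the table above with the local part appended. A minimal sketch of that expansion, using a subset of the table (the helper name `expand` is hypothetical):

```python
# Subset of the prefix table above.
PREFIXES = {
    "dcterms": "http://purl.org/dc/terms/",
    "skos": "http://www.w3.org/2004/02/skos/core#",
    "n4": "http://linked.opendata.cz/ontology/domain/vavai/riv/",
    "n9": "http://reference.data.gov.uk/id/gregorian-year/",
}


def expand(prefixed_name: str) -> str:
    """Expand a prefixed name such as 'skos:Concept' into a full IRI."""
    prefix, sep, local = prefixed_name.partition(":")
    if not sep or prefix not in PREFIXES:
        raise KeyError(f"unknown prefix in {prefixed_name!r}")
    return PREFIXES[prefix] + local


print(expand("skos:Concept"))  # http://www.w3.org/2004/02/skos/core#Concept
print(expand("n9:2007"))       # http://reference.data.gov.uk/id/gregorian-year/2007
```
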

Statements

Subject Item: n2:RIV%2F00216208%3A11320%2F07%3A00005175%21RIV08-AV0-11320___
rdf:type: skos:Concept n13:Vysledek
dcterms:description: EGOTHOR is a search engine that indexes the Web and lets users search Web documents. Its hit list contains the URL and title of each hit, together with a snippet that tries to briefly show the match. The snippet can almost always be assembled by an algorithm that has full knowledge of the original document (mostly an HTML page). This implies that the search engine must store the full text of every document as part of the index. That requirement calls for a suitable compression algorithm to reduce the space demand. One solution is to use common compression methods such as gzip or bzip2, but it might be preferable to develop a new method that takes advantage of the document structure or of the textual character of the documents. Specialized text compression algorithms and methods for compressing XML documents already exist. The aim of this paper is to integrate the two approaches to achieve an optimal compression ratio.
dcterms:title: Compression of Semistructured Documents (Czech title: Komprese semistrukturovaných dokumentů)
skos:prefLabel: Compression of Semistructured Documents (Czech label: Komprese semistrukturovaných dokumentů)
skos:notation: RIV/00216208:11320/07:00005175!RIV08-AV0-11320___
n4:strany: 11;17
n4:aktivita: n10:Z n10:P
n4:aktivity: P(1ET100300419), P(1ET100300517), Z(MSM0021620838)
n4:cisloPeriodika: 1
n4:dodaniDat: n9:2008
n4:domaciTvurceVysledku: n6:8082391 n6:5522633 n6:2321084
n4:druhVysledku: n16:J
n4:duvernostUdaju: n19:S
n4:entitaPredkladatele: n14:predkladatel
n4:idSjednocenehoVysledku: 414579
n4:idVysledku: RIV/00216208:11320/07:00005175
n4:jazykVysledku: n12:eng
n4:klicovaSlova: Compression; Semistructured; Documents
n4:klicoveSlovo: n8:Compression n8:Documents n8:Semistructured
n4:kodStatuVydavatele: GB - United Kingdom of Great Britain and Northern Ireland
n4:kontrolniKodProRIV: [8AB77E37AFD1]
n4:nazevZdroje: International Journal of Information Technology
n4:obor: n18:JC
n4:pocetDomacichTvurcuVysledku: 3
n4:pocetTvurcuVysledku: 4
n4:projekt: n7:1ET100300419 n7:1ET100300517
n4:rokUplatneniVysledku: n9:2007
n4:svazekPeriodika: 4
n4:tvurceVysledku: Žemlička, Michal; Galamboš, Leo; Lánský, Jan
n4:zamer: n17:MSM0021620838
s:issn: 1305-2403
s:numberOfPages: 7
n15:organizacniJednotka: 11320
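
The local part of the subject IRI is the percent-encoded form of the `skos:notation` value. The sketch below checks that relationship with the standard library; the variable names are illustrative, and the choice of `safe=""` when re-encoding is an assumption that happens to reproduce the encoding seen in this record.

```python
from urllib.parse import quote, unquote

# IRI bound to the n2 prefix in the table above.
N2 = "http://linked.opendata.cz/resource/domain/vavai/vysledek/"

# Local part of the subject, exactly as it appears in "Subject Item".
local_name = "RIV%2F00216208%3A11320%2F07%3A00005175%21RIV08-AV0-11320___"

# Value of skos:notation from the statements.
notation = "RIV/00216208:11320/07:00005175!RIV08-AV0-11320___"

# Decoding the local part yields the notation.
decoded = unquote(local_name)
print(decoded == notation)

# Re-encoding with no "safe" characters escapes '/', ':' and '!' again.
subject_iri = N2 + quote(notation, safe="")
print(subject_iri)
```
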