This HTML5 document contains 44 embedded RDF statements represented using HTML+Microdata notation.

The embedded RDF content will be recognized by any processor of HTML5 Microdata.

Namespace Prefixes

Prefix   IRI
n20      http://linked.opendata.cz/ontology/domain/vavai/riv/typAkce/
dcterms  http://purl.org/dc/terms/
n7       http://localhost/temp/predkladatel/
n6       http://purl.org/net/nknouf/ns/bibtex#
n12      http://linked.opendata.cz/resource/domain/vavai/riv/tvurce/
n8       http://linked.opendata.cz/resource/domain/vavai/projekt/
n19      http://linked.opendata.cz/ontology/domain/vavai/
n15      https://schema.org/
s        http://schema.org/
skos     http://www.w3.org/2004/02/skos/core#
n3       http://linked.opendata.cz/ontology/domain/vavai/riv/
n2       http://linked.opendata.cz/resource/domain/vavai/vysledek/
rdf      http://www.w3.org/1999/02/22-rdf-syntax-ns#
n4       http://linked.opendata.cz/ontology/domain/vavai/riv/klicoveSlovo/
n13      http://linked.opendata.cz/ontology/domain/vavai/riv/duvernostUdaju/
xsdh     http://www.w3.org/2001/XMLSchema#
n21      http://linked.opendata.cz/ontology/domain/vavai/riv/jazykVysledku/
n18      http://linked.opendata.cz/ontology/domain/vavai/riv/aktivita/
n10      http://linked.opendata.cz/ontology/domain/vavai/riv/obor/
n5       http://linked.opendata.cz/ontology/domain/vavai/riv/druhVysledku/
n16      http://linked.opendata.cz/resource/domain/vavai/vysledek/RIV%2F00216224%3A14330%2F10%3A00045835%21RIV11-GA0-14330___/
n14      http://reference.data.gov.uk/id/gregorian-year/
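The prefixed names used in the Statements section can be expanded to full IRIs by concatenating the prefix's IRI with the local part. The following is a minimal sketch of that expansion, assuming a plain dictionary holding a subset of the prefix table; the names PREFIXES and expand are hypothetical helpers and not part of any RDF library.

```python
# Subset of the prefix table from this document; extend as needed.
PREFIXES = {
    "n2": "http://linked.opendata.cz/resource/domain/vavai/vysledek/",
    "n3": "http://linked.opendata.cz/ontology/domain/vavai/riv/",
    "n14": "http://reference.data.gov.uk/id/gregorian-year/",
    "dcterms": "http://purl.org/dc/terms/",
    "skos": "http://www.w3.org/2004/02/skos/core#",
}


def expand(curie, prefixes=PREFIXES):
    """Expand a 'prefix:local' name to a full IRI (hypothetical helper)."""
    prefix, sep, local = curie.partition(":")
    if not sep or prefix not in prefixes:
        raise KeyError("unknown prefix in %r" % curie)
    # The full IRI is the namespace IRI followed by the local part, unchanged.
    return prefixes[prefix] + local


if __name__ == "__main__":
    for name in ("dcterms:title", "n3:obor", "n14:2010"):
        print(name, "->", expand(name))
```

The local part is appended unchanged, so percent-encoded local names such as the subject item stay encoded in the resulting IRI.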

Statements

Subject Item
n2:RIV%2F00216224%3A14330%2F10%3A00045835%21RIV11-GA0-14330___
rdf:type
skos:Concept n19:Vysledek
dcterms:description
This paper presents a multiword expression (MWE) database recently built for Czech that at the moment contains approx. 160,000 items (lexical units). It was compiled from various resources such as encyclopedias and dictionaries, public databases of proper names and toponyms, collocations obtained from Czech WordNet, lists of botanical and zoological terms and others. We compare the built MWEs database with the corpus data from Czech National Corpus (approx. 100 mil. tokens) and internet-based CZES corpus (approx. 1 bil. tokens) and present results of this comparison in the paper. To obtain a more reliable and complete list of MWEs we have proposed and used a technique exploiting the Word Sketch Engine, which allows us to work with statistical parameters such as frequency of MWEs and their components as well as the salience for the whole MWEs. The list of bigrams and n-grams obtained via Word Sketch Engine was further analyzed and compared with the MWE database mentioned above.
dcterms:title
Multiword Expressions in Czech (a case study)
skos:prefLabel
Multiword Expressions in Czech (a case study)
skos:notation
RIV/00216224:14330/10:00045835!RIV11-GA0-14330___
n3:aktivita
n18:P
n3:aktivity
P(2C06009), P(GA407/07/0679), P(GAP401/10/0792), P(LC536)
n3:dodaniDat
n14:2011
n3:domaciTvurceVysledku
n12:1322451 n12:6076939
n3:druhVysledku
n5:D
n3:duvernostUdaju
n13:S
n3:entitaPredkladatele
n16:predkladatel
n3:idSjednocenehoVysledku
273222
n3:idVysledku
RIV/00216224:14330/10:00045835
n3:jazykVysledku
n21:eng
n3:klicovaSlova
Czech Multiword Expressions; Word Sketches; n-grams
n3:klicoveSlovo
n4:n-grams n4:Word%20Sketches n4:Czech%20Multiword%20Expressions
n3:kontrolniKodProRIV
[5F1C7ACCD2F0]
n3:mistoKonaniAkce
Brno
n3:mistoVydani
Brno
n3:nazevZdroje
Karlík a továrna na lingvistiku. Prof. Petru Karlíkovi k šedesátým narozeninám
n3:obor
n10:IN
n3:pocetDomacichTvurcuVysledku
2
n3:pocetTvurcuVysledku
2
n3:projekt
n8:LC536 n8:GAP401%2F10%2F0792 n8:GA407%2F07%2F0679 n8:2C06009
n3:rokUplatneniVysledku
n14:2010
n3:tvurceVysledku
Pala, Karel; Šmerk, Pavel
n3:typAkce
n20:CST
n3:zahajeniAkce
2010-01-01+01:00
s:numberOfPages
14
n6:hasPublisher
Host
n15:isbn
978-80-7294-412-5
n7:organizacniJednotka
14330
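The local name of the subject item appears to be the percent-encoded form of the skos:notation value, and the n3:klicoveSlovo local names appear to be percent-encoded forms of the entries in n3:klicovaSlova. The sketch below illustrates that relationship using Python's standard urllib.parse module; the function names are hypothetical, and the encoding rule is inferred from the values listed above rather than from any documentation of the dataset.

```python
from urllib.parse import quote, unquote


def notation_from_local_name(local_name):
    """Decode a percent-encoded local name back to a readable identifier."""
    return unquote(local_name)


def local_name_from_notation(notation):
    """Percent-encode an identifier; safe='' also encodes '/', ':' and '!'."""
    return quote(notation, safe="")


def keyword_local_names(keywords):
    """Split a ';'-separated keyword string and encode each keyword."""
    return [quote(k.strip(), safe="") for k in keywords.split(";") if k.strip()]


if __name__ == "__main__":
    local = "RIV%2F00216224%3A14330%2F10%3A00045835%21RIV11-GA0-14330___"
    print(notation_from_local_name(local))
    print(keyword_local_names("Czech Multiword Expressions; Word Sketches; n-grams"))
```

Passing safe="" matters here because quote leaves "/" unencoded by default, which would not reproduce the %2F sequences seen in the subject IRI.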