This HTML5 document contains 44 embedded RDF statements represented using HTML+Microdata notation.

The embedded RDF content will be recognized by any processor of HTML5 Microdata.

Namespace Prefixes

PrefixIRI
n14http://linked.opendata.cz/ontology/domain/vavai/riv/typAkce/
dctermshttp://purl.org/dc/terms/
n9http://localhost/temp/predkladatel/
n7http://purl.org/net/nknouf/ns/bibtex#
n18http://linked.opendata.cz/resource/domain/vavai/riv/tvurce/
n12http://linked.opendata.cz/resource/domain/vavai/projekt/
n19http://linked.opendata.cz/ontology/domain/vavai/
n21https://schema.org/
shttp://schema.org/
skoshttp://www.w3.org/2004/02/skos/core#
n3http://linked.opendata.cz/ontology/domain/vavai/riv/
n2http://linked.opendata.cz/resource/domain/vavai/vysledek/
rdfhttp://www.w3.org/1999/02/22-rdf-syntax-ns#
n16http://linked.opendata.cz/resource/domain/vavai/vysledek/RIV%2F00216208%3A11320%2F14%3A10289378%21RIV15-MSM-11320___/
n15http://linked.opendata.cz/ontology/domain/vavai/riv/klicoveSlovo/
n11http://linked.opendata.cz/ontology/domain/vavai/riv/duvernostUdaju/
xsdhhttp://www.w3.org/2001/XMLSchema#
n17http://linked.opendata.cz/ontology/domain/vavai/riv/jazykVysledku/
n4http://linked.opendata.cz/ontology/domain/vavai/riv/aktivita/
n20http://linked.opendata.cz/ontology/domain/vavai/riv/druhVysledku/
n10http://linked.opendata.cz/ontology/domain/vavai/riv/obor/
n6http://reference.data.gov.uk/id/gregorian-year/

Statements

Subject Item
n2:RIV%2F00216208%3A11320%2F14%3A10289378%21RIV15-MSM-11320___
rdf:type
skos:Concept n19:Vysledek
dcterms:description
In this paper, we describe a release of a sizeable monolingual Urdu corpus automatically tagged with part-of-speech tags. We extend the work of Jawaid and Bojar (2012) who use three different taggers and then apply a voting scheme to disambiguate among the different choices suggested by each tagger. We run this complex ensemble on a large monolingual corpus and release the tagged corpus. Additionally, we use this data to train a single standalone tagger which will hopefully significantly simplify Urdu processing. The standalone tagger obtains the accuracy of 88.74% on test data. In this paper, we describe a release of a sizeable monolingual Urdu corpus automatically tagged with part-of-speech tags. We extend the work of Jawaid and Bojar (2012) who use three different taggers and then apply a voting scheme to disambiguate among the different choices suggested by each tagger. We run this complex ensemble on a large monolingual corpus and release the tagged corpus. Additionally, we use this data to train a single standalone tagger which will hopefully significantly simplify Urdu processing. The standalone tagger obtains the accuracy of 88.74% on test data.
dcterms:title
A Tagged Corpus and a Tagger for Urdu A Tagged Corpus and a Tagger for Urdu
skos:prefLabel
A Tagged Corpus and a Tagger for Urdu A Tagged Corpus and a Tagger for Urdu
skos:notation
RIV/00216208:11320/14:10289378!RIV15-MSM-11320___
n3:aktivita
n4:P
n3:aktivity
P(LM2010013)
n3:dodaniDat
n6:2015
n3:domaciTvurceVysledku
Jawaid, Bushra Kamran, Amir n18:2630176
n3:druhVysledku
n20:D
n3:duvernostUdaju
n11:S
n3:entitaPredkladatele
n16:predkladatel
n3:idSjednocenehoVysledku
1233
n3:idVysledku
RIV/00216208:11320/14:10289378
n3:jazykVysledku
n17:eng
n3:klicovaSlova
urdu; tagger; corpus; tagged
n3:klicoveSlovo
n15:corpus n15:tagger n15:tagged n15:urdu
n3:kontrolniKodProRIV
[0D7E1753A179]
n3:mistoKonaniAkce
Reykjavík, Iceland
n3:mistoVydani
Reykjavík, Iceland
n3:nazevZdroje
Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC 2014)
n3:obor
n10:IN
n3:pocetDomacichTvurcuVysledku
3
n3:pocetTvurcuVysledku
3
n3:projekt
n12:LM2010013
n3:rokUplatneniVysledku
n6:2014
n3:tvurceVysledku
Jawaid, Bushra Bojar, Ondřej Kamran, Amir
n3:typAkce
n14:WRD
n3:zahajeniAkce
2014-05-26+02:00
s:numberOfPages
6
n7:hasPublisher
European Language Resources Association
n21:isbn
978-2-9517408-8-4
n9:organizacniJednotka
11320