This HTML5 document contains 47 embedded RDF statements represented using HTML+Microdata notation.

The embedded RDF content will be recognized by any processor of HTML5 Microdata.

Namespace Prefixes

PrefixIRI
n20http://linked.opendata.cz/ontology/domain/vavai/riv/typAkce/
dctermshttp://purl.org/dc/terms/
n16http://purl.org/net/nknouf/ns/bibtex#
n15http://localhost/temp/predkladatel/
n13http://linked.opendata.cz/resource/domain/vavai/riv/tvurce/
n10http://linked.opendata.cz/resource/domain/vavai/projekt/
n8http://linked.opendata.cz/ontology/domain/vavai/
n6http://linked.opendata.cz/resource/domain/vavai/vysledek/RIV%2F00216224%3A14330%2F14%3A00077319%21RIV15-MSM-14330___/
shttp://schema.org/
rdfshttp://www.w3.org/2000/01/rdf-schema#
skoshttp://www.w3.org/2004/02/skos/core#
n3http://linked.opendata.cz/ontology/domain/vavai/riv/
n2http://linked.opendata.cz/resource/domain/vavai/vysledek/
rdfhttp://www.w3.org/1999/02/22-rdf-syntax-ns#
n5http://linked.opendata.cz/ontology/domain/vavai/riv/klicoveSlovo/
n12http://linked.opendata.cz/ontology/domain/vavai/riv/duvernostUdaju/
xsdhhttp://www.w3.org/2001/XMLSchema#
n19http://linked.opendata.cz/ontology/domain/vavai/riv/jazykVysledku/
n4http://linked.opendata.cz/ontology/domain/vavai/riv/aktivita/
n21http://linked.opendata.cz/ontology/domain/vavai/riv/druhVysledku/
n9http://linked.opendata.cz/ontology/domain/vavai/riv/obor/
n11http://reference.data.gov.uk/id/gregorian-year/

Statements

Subject Item
n2:RIV%2F00216224%3A14330%2F14%3A00077319%21RIV15-MSM-14330___
rdf:type
n8:Vysledek skos:Concept
rdfs:seeAlso
http://ceur-ws.org/Vol-1180/
dcterms:description
This paper describes our approaches for the Plagiarism Detection – Source Retrieval task of PAN 2014. We combined and improved methodology used at PAN 2012 and PAN 2013. Our system combines three types of queries: The keywords-based queries; the paragraph-based queries; and the headers-based queries. The queries are distinguished also by other properties such as the phrase query or the positional query. The queries are submitted to two search engines – Chatnoir and Indri – according to their properties. The query’s position serves for the search control, minimization of the total number of executed queries is the system’s priority. Downloaded documents are textually compared with the suspicious document and if a similarity is found, the downloaded document is reported. This paper describes our approaches for the Plagiarism Detection – Source Retrieval task of PAN 2014. We combined and improved methodology used at PAN 2012 and PAN 2013. Our system combines three types of queries: The keywords-based queries; the paragraph-based queries; and the headers-based queries. The queries are distinguished also by other properties such as the phrase query or the positional query. The queries are submitted to two search engines – Chatnoir and Indri – according to their properties. The query’s position serves for the search control, minimization of the total number of executed queries is the system’s priority. Downloaded documents are textually compared with the suspicious document and if a similarity is found, the downloaded document is reported.
dcterms:title
Heterogeneous Queries for Synoptic and Phrasal Search Heterogeneous Queries for Synoptic and Phrasal Search
skos:prefLabel
Heterogeneous Queries for Synoptic and Phrasal Search Heterogeneous Queries for Synoptic and Phrasal Search
skos:notation
RIV/00216224:14330/14:00077319!RIV15-MSM-14330___
n3:aktivita
n4:I n4:P
n3:aktivity
I, P(LG13010)
n3:dodaniDat
n11:2015
n3:domaciTvurceVysledku
n13:5800951 n13:7837445
n3:druhVysledku
n21:D
n3:duvernostUdaju
n12:S
n3:entitaPredkladatele
n6:predkladatel
n3:idSjednocenehoVysledku
18998
n3:idVysledku
RIV/00216224:14330/14:00077319
n3:jazykVysledku
n19:eng
n3:klicovaSlova
suspicious document; plagiarism detection; search engine; source retrieval; stop word; text alignment; snippet similarity
n3:klicoveSlovo
n5:search%20engine n5:suspicious%20document n5:source%20retrieval n5:text%20alignment n5:snippet%20similarity n5:plagiarism%20detection n5:stop%20word
n3:kontrolniKodProRIV
[C4777787ECE2]
n3:mistoKonaniAkce
Sheffield, UK
n3:mistoVydani
Sheffield, UK
n3:nazevZdroje
CLEF2014 Working Notes
n3:obor
n9:IN
n3:pocetDomacichTvurcuVysledku
2
n3:pocetTvurcuVysledku
2
n3:projekt
n10:LG13010
n3:rokUplatneniVysledku
n11:2014
n3:tvurceVysledku
Suchomel, Šimon Brandejs, Michal
n3:typAkce
n20:CST
n3:zahajeniAkce
2014-01-01+01:00
s:issn
1613-0073
s:numberOfPages
4
n16:hasPublisher
CEUR, Aachen University
n15:organizacniJednotka
14330