About: Building a multilingual parallel corpus for human users     Goto   Sponge   NotDistinct   Permalink

An Entity of Type : http://linked.opendata.cz/ontology/domain/vavai/Vysledek, within Data Space : linked.opendata.cz associated with source document(s)

AttributesValues
rdf:type
rdfs:seeAlso
Description
  • We present the architecture and the current state of InterCorp, a multilingual parallel corpus centered around Czech, intended primarily for human users and consisting of written texts with a focus on fiction. Following an outline of its recent development and a comparison with some other multilingual parallel corpora we give an overview of the data collection procedure that covers text selection criteria, data format, conversion, alignment, lemmatization and tagging. Finally, we discuss challenges and prospects of the project.
  • We present the architecture and the current state of InterCorp, a multilingual parallel corpus centered around Czech, intended primarily for human users and consisting of written texts with a focus on fiction. Following an outline of its recent development and a comparison with some other multilingual parallel corpora we give an overview of the data collection procedure that covers text selection criteria, data format, conversion, alignment, lemmatization and tagging. Finally, we discuss challenges and prospects of the project. (en)
Title
  • Building a multilingual parallel corpus for human users
  • Building a multilingual parallel corpus for human users (en)
skos:prefLabel
  • Building a multilingual parallel corpus for human users
  • Building a multilingual parallel corpus for human users (en)
skos:notation
  • RIV/00216208:11210/12:10132275!RIV15-MSM-11210___
http://linked.open...avai/riv/aktivita
http://linked.open...avai/riv/aktivity
  • P(LM2011023), Z(MSM0021620823)
http://linked.open...vai/riv/dodaniDat
http://linked.open...aciTvurceVysledku
http://linked.open.../riv/druhVysledku
http://linked.open...iv/duvernostUdaju
http://linked.open...titaPredkladatele
http://linked.open...dnocenehoVysledku
  • 125659
http://linked.open...ai/riv/idVysledku
  • RIV/00216208:11210/12:10132275
http://linked.open...riv/jazykVysledku
http://linked.open.../riv/klicovaSlova
  • Czech; multilingual; parallel corpora (en)
http://linked.open.../riv/klicoveSlovo
http://linked.open...ontrolniKodProRIV
  • [7F825958C7E8]
http://linked.open...v/mistoKonaniAkce
  • Istanbul
http://linked.open...i/riv/mistoVydani
  • Istanbul, Turkey
http://linked.open...i/riv/nazevZdroje
  • Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC'12)
http://linked.open...in/vavai/riv/obor
http://linked.open...ichTvurcuVysledku
http://linked.open...cetTvurcuVysledku
http://linked.open...vavai/riv/projekt
http://linked.open...UplatneniVysledku
http://linked.open...iv/tvurceVysledku
  • Rosen, Alexandr
  • Vavřín, Martin
http://linked.open...vavai/riv/typAkce
http://linked.open...ain/vavai/riv/wos
  • 000323927702085
http://linked.open.../riv/zahajeniAkce
http://linked.open...n/vavai/riv/zamer
number of pages
http://purl.org/ne...btex#hasPublisher
  • European Language Resources Association (ELRA)
https://schema.org/isbn
  • 978-2-9517408-7-7
http://localhost/t...ganizacniJednotka
  • 11210
Faceted Search & Find service v1.16.118 as of Jun 21 2024


Alternative Linked Data Documents: ODE     Content Formats:   [cxml] [csv]     RDF   [text] [turtle] [ld+json] [rdf+json] [rdf+xml]     ODATA   [atom+xml] [odata+json]     Microdata   [microdata+json] [html]    About   
This material is Open Knowledge   W3C Semantic Web Technology [RDF Data] Valid XHTML + RDFa
OpenLink Virtuoso version 07.20.3240 as of Jun 21 2024, on Linux (x86_64-pc-linux-gnu), Single-Server Edition (126 GB total memory, 98 GB memory in use)
Data on this page belongs to its respective rights holders.
Virtuoso Faceted Browser Copyright © 2009-2024 OpenLink Software