. "0"^^ . "corpus; linguisitc data"@en . . "2014-04-30+02:00"^^ . "http://www.isvav.cz/projectDetail.do?rowId=LM2011023"^^ . . . . "Czech National Corpus"@en . . "The Czech National Corpus (CNC) strives for extensive and continuous data coverage of the Czech language (and other languages in comparison with Czech) aiming thus to build up a foundation for basic and applied research. The CNC is the only project of its kind in the Czech Republic and due to its current results (set of corpora containing more than 1.3 billion tokens in total), it ranks among the foremost corpus research centres in the world. The CNC objective is mainly continuous development and building of language corpora of various types as representative, linguistically processed textual bases for empirical and exact research of the Czech language; these are primarily corpora covering Czech in its present state (synchronic corpora of written and spoken language), in its historical development (diachronic corpus), and in translation comparison with other languages (parallel corpora). This is closely related to versatile, continually developed and improved structural and specialized linguistic annotation of these corpora. Upon request, the CNC will also cater for comprehensive processing of other corpora created at different institutes in the Czech Republic and abroad, as well as maintaining public access to them. An integral part of the project is providing free and open public service of internet user access to all corpora through specialized corpus tools, including related administration, user service and development of these tools. This is connected to providing of data packages (i.e. processed and annotated collections of language data) to other institutions and individual users in the Czech Republic as well as abroad, in various forms and formats according to the users\u2019 needs, with applications ranging from linguistic research to natural language processing."@en . . "2012-01-01+01:00"^^ . "\u010Cesk\u00FD n\u00E1rodn\u00ED korpus (\u010CNK) usiluje o extenz\u00EDvn\u00ED a kontinu\u00E1ln\u00ED datov\u00E9 pokr\u00FDv\u00E1n\u00ED \u010De\u0161tiny (a dal\u0161\u00EDch jazyk\u016F ve srovn\u00E1n\u00ED s n\u00ED) a c\u00EDlen\u011B tak buduje b\u00E1zi pro z\u00E1kladn\u00ED i aplikovan\u00FD v\u00FDzkum. \u010CNK p\u0159edstavuje jedin\u00FD projekt sv\u00E9ho druhu v \u010Cesk\u00E9 republice a sv\u00FDmi dosavadn\u00EDmi v\u00FDsledky (nab\u00EDdka korpus\u016F o celkov\u00E9m rozsahu v\u00EDce 1,3 miliardy textov\u00FDch slov) se \u0159ad\u00ED k p\u0159edn\u00EDm korpusov\u00FDm pracovi\u0161t\u00EDm i ve sv\u011Btov\u00E9m m\u011B\u0159\u00EDtku. C\u00EDlem \u010Dinnosti \u010CNK je p\u0159edev\u0161\u00EDm kontinu\u00E1ln\u00ED rozvoj a budov\u00E1n\u00ED jazykov\u00FDch korpus\u016F r\u016Fzn\u00FDch typ\u016F jako reprezentativn\u00ED lingvisticky zpracovan\u00E9 datov\u00E9 z\u00E1kladny pro empirick\u00FD a exaktn\u00ED v\u00FDzkum \u010Desk\u00E9ho jazyka; jde p\u0159edev\u0161\u00EDm o korpusy zachycuj\u00EDc\u00ED \u010De\u0161tinu v jej\u00EDm sou\u010Dasn\u00E9m stavu (synchronn\u00ED korpusy psan\u00E9ho a mluven\u00E9ho jazyka), v jej\u00EDm historick\u00E9m v\u00FDvoji (diachronn\u00ED korpus) a v p\u0159ekladov\u00E9m srovn\u00E1n\u00ED s jin\u00FDmi jazyky (paraleln\u00ED korpusy). S t\u00EDm \u00FAzce souvis\u00ED i mnohostrann\u00E1, trvale rozv\u00EDjen\u00E1 a zdokonalovan\u00E1 strukturn\u00ED a lingvistick\u00E1 anotace t\u011Bchto korpus\u016F. \u010CNK bude na po\u017E\u00E1d\u00E1n\u00ED zaji\u0161\u0165ovat tak\u00E9 komplexn\u00ED zpracov\u00E1n\u00ED dal\u0161\u00EDch korpus\u016F vznikl\u00FDch na jin\u00FDch pracovi\u0161t\u00EDch v \u010CR i v zahrani\u010D\u00ED a ve\u0159ejn\u00FD p\u0159\u00EDstup k nim. Ned\u00EDlnou sou\u010D\u00E1st\u00ED projektu je bezplatn\u00E1 a otev\u0159en\u00E1 ve\u0159ejn\u00E1 slu\u017Eba poskytov\u00E1n\u00ED internetov\u00E9ho u\u017Eivatelsk\u00E9ho p\u0159\u00EDstupu ke v\u0161em korpus\u016Fm pomoc\u00ED specializovan\u00FDch korpusov\u00FDch n\u00E1stroj\u016F, v\u010Detn\u011B souvisej\u00EDc\u00ED spr\u00E1vy, u\u017Eivatelsk\u00E9ho servisu a v\u00FDvoje t\u011Bchto n\u00E1stroj\u016F. S t\u00EDm je spojeno tak\u00E9 poskytov\u00E1n\u00ED datov\u00FDch bal\u00ED\u010Dk\u016F (tj. zpracovan\u00FDch a anotovan\u00FDch soubor\u016F jazykov\u00FDch dat) dal\u0161\u00EDm instituc\u00EDm i individu\u00E1ln\u00EDm u\u017Eivatel\u016Fm v \u010CR i v zahrani\u010D\u00ED, v r\u016Fzn\u00FDch podob\u00E1ch a form\u00E1tech podle pot\u0159eb t\u011Bchto u\u017Eivatel\u016F, s vyu\u017Eit\u00EDm zejm\u00E9na pro jazykov\u011Bdn\u00FD v\u00FDzkum a po\u010D\u00EDta\u010Dov\u00E9 zpracov\u00E1n\u00ED p\u0159irozen\u00E9ho jazyka." . "46"^^ . . "46"^^ . "corpus" . "\u010Cesk\u00FD n\u00E1rodn\u00ED korpus" . "0"^^ . "2015-02-16+01:00"^^ . "LM2011023" . . . . . . "2016-12-31+01:00"^^ . . "1"^^ . . .