. . . "243;249" . . "7"^^ . "Praha" . . . "1"^^ . . "From the Corpus as Open Source for Investigation to Commercial Products"@en . "RIV/00216208:11260/07:00003319" . "\u00DAstav pro jazyk \u010Desk\u00FD AV \u010CR" . "1"^^ . . "422677" . "Korpusy se vyvinuly z \u010Dist\u00FDch text\u016F v sofistikovan\u00E9, zna\u010Dkovan\u00E9 n\u00E1stroje. Poskytuj\u00ED rychl\u00E9 a zd\u00E1nliv\u011B nenapadnuteln\u00E9 odpov\u011Bdi na komplikovan\u00E9 dotazy o jazyku. P\u0159\u00EDsp\u011Bvek p\u0159edkl\u00E1d\u00E1 d\u016Fvody, pro kter\u00E9 tyto odpov\u011Bdi ne v\u017Edy popisuj\u00ED jazyk; p\u0159\u00EDklady jsou z \u010Desk\u00E9ho reprezentativn\u00EDho korpusu SYN2000: (1) Texty jsou \u010Dasto reprezentov\u00E1ny v takov\u00E9 podob\u011B, jak (by) je nikdo nepublikoval. (2) Zna\u010Dkov\u00E1n\u00ED jazykov\u00FDch jev\u016F m\u00E1 daleko k plauzibilit\u011B, a to jak co do koncepce, tak co do implementace zna\u010Dek. (3) Ani tehdy, kdy\u017E statistick\u00E1 data jsou spolehliv\u00E1, explanaci jazykov\u00FDch jev\u016F nelze derivovat z dat; z\u00EDsk\u00E1v\u00E1 se nad daty, interpretac\u00ED dat. (4) V\u011Bdeck\u00E9mu v\u00FDzkumu jazyka je sp\u00ED\u0161 na p\u0159ek\u00E1\u017Eku ne\u017E k pomoci rozvoj takov\u00FDch korpusov\u00FDch n\u00E1stroj\u016F, u nich\u017E badatel nem\u016F\u017Ee nastavit pracovn\u00ED parametry (p\u0159\u00EDpad WordSketches). A tak \u010D\u00EDm sofistikovan\u011Bj\u0161\u00ED jsou korpusov\u00E9 n\u00E1stroje, t\u00EDm ni\u017E\u0161\u00ED je z\u00E1ruka v\u011Bdecky plauzibiln\u00EDch v\u00FDsledk\u016F, resp. t\u00EDm v\u00EDc starost\u00ED je s t\u00EDm, jak v\u00FDsledky u\u010Dinit plauzibiln\u00EDmi. To, co nab\u00EDz\u00ED tzv. korpusov\u00E1 lingvistika,"@cs . "Gramatika a korpus 2005 = Grammar & Corpora 2005" . . "\u0160imandl, Josef" . . "Corpora have developed from pure texts into sophisticated tagged tools. They provide quick and seemingly indisputable answers to complicated questions about language. 
Reasons as to why those answers do not necessarily describe the language are proposed, using examples from the Czech representative corpus SYN2000: (1) Texts are often represented in a way that nobody has (or could have) written/published them. (2) The tagging is far from plausible for labelling language phenomena, both as conception and as implementation. (3) Even if statistical data were proof enough, the explanation of language cannot be derived by the data themselves, but must be obtained through data and interpreting data. (4) Linguistic inquiry is impeded rather than supported by the development of corpus-based tools, as a researcher cannot modify/test how appropriate their setting is for each single case (so by the use of WordSketches). Thus, the more sophisticated the corpus tools, the less the guarantee of scientifically plausib"@en . "[136C4F4628B2]" . . "From the Corpus as Open Source for Investigation to Commercial Products" . . "11260" . "Corpora have developed from pure texts into sophisticated tagged tools. They provide quick and seemingly indisputable answers to complicated questions about language. Reasons as to why those answers do not necessarily describe the language are proposed, using examples from the Czech representative corpus SYN2000: (1) Texts are often represented in a way that nobody has (or could have) written/published them. (2) The tagging is far from plausible for labelling language phenomena, both as conception and as implementation. (3) Even if statistical data were proof enough, the explanation of language cannot be derived by the data themselves, but must be obtained through data and interpreting data. (4) Linguistic inquiry is impeded rather than supported by the development of corpus-based tools, as a researcher cannot modify/test how appropriate their setting is for each single case (so by the use of WordSketches). Thus, the more sophisticated the corpus tools, the less the guarantee of scientifically plausib" . 
"Od korpusu jako otev\u0159en\u00E9ho zdroje pro b\u00E1d\u00E1n\u00ED ke komer\u010Dn\u00EDm produkt\u016Fm"@cs . "978-80-86496-32-0" . "RIV/00216208:11260/07:00003319!RIV08-GA0-11260___" . . . "Corpus; Source; Investigation; Commercial; Products"@en . "P(GA405/06/1057)" . "From the Corpus as Open Source for Investigation to Commercial Products"@en . . . "From the Corpus as Open Source for Investigation to Commercial Products" . . "Od korpusu jako otev\u0159en\u00E9ho zdroje pro b\u00E1d\u00E1n\u00ED ke komer\u010Dn\u00EDm produkt\u016Fm"@cs .