. . . "Magistersk\u00E1 diplomov\u00E1 pr\u00E1ce. V pr\u00E1ci vych\u00E1z\u00EDme z \u0159ady osv\u011Bd\u010Den\u00FDch postup\u016F pro ur\u010Dov\u00E1n\u00ED autorstv\u00ED anonymn\u00EDch dokument\u016F a vytv\u00E1\u0159\u00EDme nov\u00E9. Ji\u017E existuj\u00EDc\u00ED a pou\u017E\u00EDvan\u00E9 techniky kombinujeme, optimalizujeme a inovujeme pro t\u0159i hlavn\u00ED \u00FAlohy: Automatick\u00E9 p\u0159i\u0159azen\u00ED autora podle dan\u00E9 mno\u017Einy autorsk\u00FDch dokument\u016F, Verifikace autorstv\u00ED dan\u00E9ho dokumentu vybran\u00FDm autorem, Shlukov\u00E1n\u00ED dokument\u016F podle autorstv\u00ED. N\u00E1mi implementovan\u00E9 algoritmy jsou testov\u00E1ny na \u010De\u0161tin\u011B, syst\u00E9m je v\u0161ak navr\u017Een modul\u00E1rn\u011B a pokud vypust\u00EDme \u010Di nahrad\u00EDme n\u011Bkolik jazykov\u011B z\u00E1visl\u00FDch komponent, lze v tuto chv\u00EDli pracovat s dokumenty napsan\u00FDmi v libovoln\u00E9m jazyce. V\u0161e je naprogramov\u00E1no ve skriptovac\u00EDm jazyce Python. Sou\u010D\u00E1st\u00ED syst\u00E9mu jsou i n\u00E1stroje pro p\u0159edzpracov\u00E1n\u00ED vstupn\u00EDch dat pro \u010De\u0161tinu a jejich spr\u00E1vu v datab\u00E1zi PostgreSQL. Dal\u0161\u00EDm p\u0159\u00EDnosem pr\u00E1ce krom\u011B v\u00FDvoje syst\u00E9mu pro \u0159e\u0161en\u00ED t\u0159\u00ED zm\u00EDn\u011Bn\u00FDch \u00FAloh jsou empiricky podlo\u017Een\u00E1 pozorov\u00E1n\u00ED, jak se chovaj\u00ED nejpou\u017E\u00EDvan\u011Bj\u0161\u00ED algoritmy na ur\u010Dov\u00E1n\u00ED autorstv\u00ED dokument\u016F na dokumentech v \u010De\u0161tin\u011B." . "RIV/00216224:14330/11:00073205!RIV15-MV0-14330___" . "Ur\u010Dov\u00E1n\u00ED autorstv\u00ED anonymn\u00EDch text\u016F na z\u00E1klad\u011B automaticky nalezen\u00FDch charakteristick\u00FDch znak\u016F" . "P(LC536), P(VF20102014003), S" . . "Rygl, Jan" . . "Master's thesis. The work is based on the most successful methods for determining authorship of anonymous documents. 
We combine, optimize and revise these methods and create new techniques for three main tasks: automatic assignment of authorship given a set of documents by known authors, verification that a given document was written by a selected author, and clustering of documents by authorship. The implemented algorithms are tested on Czech documents, but the system is modular: if we remove or replace a few language-dependent components, it can process documents written in any language. Everything is implemented in Python. The system includes tools for preprocessing Czech input data and for managing the stored documents in a PostgreSQL database. The thesis also contributes empirical observations of how the most widely used authorship-attribution algorithms perform on Czech documents."@en . "Determining Authorship of Anonymous Texts Based on Automatically Discovered Characteristic Features"@en . "Determining Authorship of Anonymous Texts Based on Automatically Discovered Characteristic Features"@en . . . "14330" . "1"^^ . . . "anonymous document; author's writeprint; authorship attribution; clustering; machine learning"@en . "Magistersk\u00E1 diplomov\u00E1 pr\u00E1ce. V pr\u00E1ci vych\u00E1z\u00EDme z \u0159ady osv\u011Bd\u010Den\u00FDch postup\u016F pro ur\u010Dov\u00E1n\u00ED autorstv\u00ED anonymn\u00EDch dokument\u016F a vytv\u00E1\u0159\u00EDme nov\u00E9. Ji\u017E existuj\u00EDc\u00ED a pou\u017E\u00EDvan\u00E9 techniky kombinujeme, optimalizujeme a inovujeme pro t\u0159i hlavn\u00ED \u00FAlohy: Automatick\u00E9 p\u0159i\u0159azen\u00ED autora podle dan\u00E9 mno\u017Einy autorsk\u00FDch dokument\u016F, Verifikace autorstv\u00ED dan\u00E9ho dokumentu vybran\u00FDm autorem, Shlukov\u00E1n\u00ED dokument\u016F podle autorstv\u00ED. 
N\u00E1mi implementovan\u00E9 algoritmy jsou testov\u00E1ny na \u010De\u0161tin\u011B, syst\u00E9m je v\u0161ak navr\u017Een modul\u00E1rn\u011B a pokud vypust\u00EDme \u010Di nahrad\u00EDme n\u011Bkolik jazykov\u011B z\u00E1visl\u00FDch komponent, lze v tuto chv\u00EDli pracovat s dokumenty napsan\u00FDmi v libovoln\u00E9m jazyce. V\u0161e je naprogramov\u00E1no ve skriptovac\u00EDm jazyce Python. Sou\u010D\u00E1st\u00ED syst\u00E9mu jsou i n\u00E1stroje pro p\u0159edzpracov\u00E1n\u00ED vstupn\u00EDch dat pro \u010De\u0161tinu a jejich spr\u00E1vu v datab\u00E1zi PostgreSQL. Dal\u0161\u00EDm p\u0159\u00EDnosem pr\u00E1ce krom\u011B v\u00FDvoje syst\u00E9mu pro \u0159e\u0161en\u00ED t\u0159\u00ED zm\u00EDn\u011Bn\u00FDch \u00FAloh jsou empiricky podlo\u017Een\u00E1 pozorov\u00E1n\u00ED, jak se chovaj\u00ED nejpou\u017E\u00EDvan\u011Bj\u0161\u00ED algoritmy na ur\u010Dov\u00E1n\u00ED autorstv\u00ED dokument\u016F na dokumentech v \u010De\u0161tin\u011B."@cs . "1"^^ . . . . . . . . "237051" . . . . "Ur\u010Dov\u00E1n\u00ED autorstv\u00ED anonymn\u00EDch text\u016F na z\u00E1klad\u011B automaticky nalezen\u00FDch charakteristick\u00FDch znak\u016F" . "Ur\u010Dov\u00E1n\u00ED autorstv\u00ED anonymn\u00EDch text\u016F na z\u00E1klad\u011B automaticky nalezen\u00FDch charakteristick\u00FDch znak\u016F"@cs . "[A2B1DC965EF7]" . "RIV/00216224:14330/11:00073205" . "Ur\u010Dov\u00E1n\u00ED autorstv\u00ED anonymn\u00EDch text\u016F na z\u00E1klad\u011B automaticky nalezen\u00FDch charakteristick\u00FDch znak\u016F"@cs .