. "N&SDiT - Detektor k\u0159estn\u00EDch jmen a p\u0159\u00EDjmen\u00ED v textu"@cs . . "N" . "tagging; word detection; text analysis; data mining"@en . "Software tool automatically locates and tags male and female names and surnames in a common text. The names are located and tagged in any form including academic degrees, in addition a grammatical case based on grammatical form of the name is determined and marked. In the case of grammatical case ambiguity under the same grammatical forms an identification using heuristic approach is done based on broader word context of the located name. An expert approach based on 56 712 rules allows to find the vast majority of target phrases. Software tool is written in Python and allows an application of rules in specific order on any text in electronic form. Software tool can be used in a text processing during creation of statistical class-based language models or for text preprocessing during POS tagging."@en . "N&SDiT - Detektor k\u0159estn\u00EDch jmen a p\u0159\u00EDjmen\u00ED v textu" . "Automatick\u00E1 lokalizace a ozna\u010Dov\u00E1n\u00ED k\u0159estn\u00EDch jmen a p\u0159\u00EDjmen\u00ED osob mu\u017Esk\u00E9ho a \u017Eensk\u00E9ho pohlav\u00ED (v\u010Detn\u011B jejich gramatick\u00E9ho p\u00E1du) v b\u011B\u017En\u00E9m textu jako n\u00E1hrada manu\u00E1ln\u00ED editace textu. Softwarov\u00FD n\u00E1stroj je mo\u017Eno vyu\u017E\u00EDt p\u0159i zpracov\u00E1n\u00ED text\u016F pro tvorbu t\u0159\u00EDdov\u00FDch statistick\u00FDch jazykov\u00FDch model\u016F nebo pro p\u0159edzpracov\u00E1n\u00ED text\u016F p\u0159i POS taggingu." . . "RIV/49777513:23520/11:43898601" . "N&SDiT - Detektor k\u0159estn\u00EDch jmen a p\u0159\u00EDjmen\u00ED v textu" . "[3EA6023594E2]" . . . . . "N&SDiT - Names and Surnames Detector in Text"@en . . "23520" . "Psutka, Josef" . "Software je v\u00FDsledkem smluvn\u00EDho v\u00FDzkumu, lokalizace Z\u00E1pado\u010Desk\u00E1 univerzita v Plzni (http://www.kky.zcu.cz/cs/sw/NSDiT) a SpeechTech, s.r.o. (http://www.speechtech.cz)." . "Softwarov\u00FD n\u00E1stroj umo\u017E\u0148uje automaticky lokalizovat a ozna\u010Dit v b\u011B\u017En\u00E9m textu k\u0159estn\u00ED jm\u00E9na a p\u0159\u00EDjmen\u00ED osob mu\u017Esk\u00E9ho a \u017Eensk\u00E9ho pohlav\u00ED. Jm\u00E9na a p\u0159\u00EDjmen\u00ED jsou nalezena a ozna\u010Dena v pln\u00E9m zn\u011Bn\u00ED, tedy v\u010Detn\u011B titul\u016F a hodnost\u00ED p\u0159ed jm\u00E9nem a za jm\u00E9nem, v jak\u00E9mkoliv tvaru, p\u0159i\u010Dem\u017E je zji\u0161t\u011Bn a ozna\u010Den i gramatick\u00FD p\u00E1d, ve kter\u00E9m jsou jm\u00E9na a p\u0159\u00EDjmen\u00ED uvedena, podle jejich gramatick\u00E9ho tvaru. V p\u0159\u00EDpad\u011B nejednozna\u010Dnosti gramatick\u00E9ho p\u00E1du na z\u00E1klad\u011B stejn\u00FDch gramatick\u00FDch tvar\u016F je pomoc\u00ED heuristick\u00E9ho p\u0159\u00EDstupu provedeno up\u0159esn\u011Bn\u00ED identifikac\u00ED z \u0161ir\u0161\u00EDho slovn\u00EDho kontextu, ve kter\u00E9m se jm\u00E9na a p\u0159\u00EDjmen\u00ED nach\u00E1z\u00ED. Odli\u0161ena jsou jm\u00E9na a p\u0159\u00EDjmen\u00ED v n\u00E1zvech ulic, jm\u00E9na svat\u00FDch apod. Vyu\u017Eit je expertn\u00ED p\u0159\u00EDstup, kter\u00FD na z\u00E1klad\u011B 56712 pravidel umo\u017E\u0148uje naj\u00EDt velkou v\u011Bt\u0161inu c\u00EDlov\u00FDch v\u00FDraz\u016F. Softwarov\u00FD n\u00E1stroj je naps\u00E1n v jazyce Python, p\u0159i\u010Dem\u017E umo\u017E\u0148uje aplikaci pravidel ve specifick\u00E9m po\u0159ad\u00ED na jak\u00FDkoliv text v elektronick\u00E9 form\u011B. Softwarov\u00FD n\u00E1stroj je mo\u017Eno vyu\u017E\u00EDt p\u0159i zpracov\u00E1n\u00ED text\u016F pro tvorbu t\u0159\u00EDdov\u00FDch statistick\u00FDch jazykov\u00FDch model\u016F nebo pro p\u0159edzpracov\u00E1n\u00ED text\u016F p\u0159i POS taggingu."@cs . . "214740" . . . "Pra\u017E\u00E1k, Ale\u0161" . . "Software je v\u00FDsledkem smluvn\u00EDho v\u00FDzkumu, Smlouva o d\u00EDlo mezi Z\u00E1pado\u010Deskou univerzitou v Plzni a SpeechTech, s.r.o. byla podeps\u00E1na dne 31.10.2011. Bli\u017E\u0161\u00ED informace k technick\u00FDm parametr\u016Fm SW pod\u00E1 Ale\u0161 Pra\u017E\u00E1k, aprazak@kky.zcu.cz, tel.: 377632573, d\u00E1le t\u00E9\u017E http://www.kky.zcu.cz/cs/sw/NSDiT. Informace k licen\u010Dn\u00ED politice pod\u00E1 Ji\u0159\u00ED Zahradil, jiri.zahradil@speechtech.cz." . . . "N&SDiT - Detektor k\u0159estn\u00EDch jmen a p\u0159\u00EDjmen\u00ED v textu"@cs . "RIV/49777513:23520/11:43898601!RIV12-MSM-23520___" . . . "N&SDiT - Names and Surnames Detector in Text"@en . "N&SDiT" . . . . . "3"^^ . . "3"^^ . "Radov\u00E1, Vlasta" . "Softwarov\u00FD n\u00E1stroj umo\u017E\u0148uje automaticky lokalizovat a ozna\u010Dit v b\u011B\u017En\u00E9m textu k\u0159estn\u00ED jm\u00E9na a p\u0159\u00EDjmen\u00ED osob mu\u017Esk\u00E9ho a \u017Eensk\u00E9ho pohlav\u00ED. Jm\u00E9na a p\u0159\u00EDjmen\u00ED jsou nalezena a ozna\u010Dena v pln\u00E9m zn\u011Bn\u00ED, tedy v\u010Detn\u011B titul\u016F a hodnost\u00ED p\u0159ed jm\u00E9nem a za jm\u00E9nem, v jak\u00E9mkoliv tvaru, p\u0159i\u010Dem\u017E je zji\u0161t\u011Bn a ozna\u010Den i gramatick\u00FD p\u00E1d, ve kter\u00E9m jsou jm\u00E9na a p\u0159\u00EDjmen\u00ED uvedena, podle jejich gramatick\u00E9ho tvaru. V p\u0159\u00EDpad\u011B nejednozna\u010Dnosti gramatick\u00E9ho p\u00E1du na z\u00E1klad\u011B stejn\u00FDch gramatick\u00FDch tvar\u016F je pomoc\u00ED heuristick\u00E9ho p\u0159\u00EDstupu provedeno up\u0159esn\u011Bn\u00ED identifikac\u00ED z \u0161ir\u0161\u00EDho slovn\u00EDho kontextu, ve kter\u00E9m se jm\u00E9na a p\u0159\u00EDjmen\u00ED nach\u00E1z\u00ED. Odli\u0161ena jsou jm\u00E9na a p\u0159\u00EDjmen\u00ED v n\u00E1zvech ulic, jm\u00E9na svat\u00FDch apod. Vyu\u017Eit je expertn\u00ED p\u0159\u00EDstup, kter\u00FD na z\u00E1klad\u011B 56712 pravidel umo\u017E\u0148uje naj\u00EDt velkou v\u011Bt\u0161inu c\u00EDlov\u00FDch v\u00FDraz\u016F. Softwarov\u00FD n\u00E1stroj je naps\u00E1n v jazyce Python, p\u0159i\u010Dem\u017E umo\u017E\u0148uje aplikaci pravidel ve specifick\u00E9m po\u0159ad\u00ED na jak\u00FDkoliv text v elektronick\u00E9 form\u011B. Softwarov\u00FD n\u00E1stroj je mo\u017Eno vyu\u017E\u00EDt p\u0159i zpracov\u00E1n\u00ED text\u016F pro tvorbu t\u0159\u00EDdov\u00FDch statistick\u00FDch jazykov\u00FDch model\u016F nebo pro p\u0159edzpracov\u00E1n\u00ED text\u016F p\u0159i POS taggingu." .