"8"^^ . . "Character-based Language Model"@en . . "Character-based Language Model" . . "2336-4289" . . . . "2014-01-01+01:00"^^ . . . . "Brno" . "Brno" . . "Character-based Language Model"@en . "1"^^ . "P(LM2010013), S" . . "Tribun EU" . "1"^^ . . . "Eighth Workshop on Recent Advances in Slavonic Natural Language Processing" . "language model; suffix array; LCP; trie; character-based; random text generator; corpus"@en . "Language modelling and also other natural language processing tasks are usually based on words. I present here a more general yet simpler approach to language modelling using much smaller units of text data: character-based language model (CBLM). In this paper I describe the underlying data structure of the model, evaluate the model using standard measures (entropy, perplexity). As a proof-of-concept and an extrinsic evaluation I present also a random sentence generator based on this model." . "Baisa, V\u00EDt" . "6881" . . . "Language modelling and also other natural language processing tasks are usually based on words. I present here a more general yet simpler approach to language modelling using much smaller units of text data: character-based language model (CBLM). In this paper I describe the underlying data structure of the model, evaluate the model using standard measures (entropy, perplexity). As a proof-of-concept and an extrinsic evaluation I present also a random sentence generator based on this model."@en . . . . . "RIV/00216224:14330/14:00077506" . "14330" . . "RIV/00216224:14330/14:00077506!RIV15-MSM-14330___" . "Character-based Language Model" . . "[7933649456AC]" .