About: Text Mining-based Formation of Dictionaries Expressing Opinions in Natural Languages     Goto   Sponge   NotDistinct   Permalink

An Entity of Type : http://linked.opendata.cz/ontology/domain/vavai/Vysledek, within Data Space : linked.opendata.cz associated with source document(s)

AttributesValues
rdf:type
Description
  • Automatic formation of dictionaries containing words significant for expressing different customers' opinions written in natural languages is demonstrated. The research used very large real-world data concerning the hotel accommodation booking via the Internet. The hotel companies could be interested in characteristic words expressing positive and negative opinions because it could help improve the offered service. The suggested method uses unstructured plain text reviews of many customers from different countries. The data is transformed into vectors using the bag-of-words procedure with the word representation by their frequencies in the reviews. Significant words are selected as relevant attributes for the classification to given categories using trained decision trees. Each tree branch leading to a leaf represents a subset of significant words for a category. The individual word importance is weighted by the word frequency in all the branches combined with their occurrence in branches leading to specific categories. As a result, the generated dictionaries contain only a fraction of the original huge vocabulary. The selected words express very well the positive and negative meaning, which is demonstrated for several different languages using the same processing procedure.
  • Automatic formation of dictionaries containing words significant for expressing different customers' opinions written in natural languages is demonstrated. The research used very large real-world data concerning the hotel accommodation booking via the Internet. The hotel companies could be interested in characteristic words expressing positive and negative opinions because it could help improve the offered service. The suggested method uses unstructured plain text reviews of many customers from different countries. The data is transformed into vectors using the bag-of-words procedure with the word representation by their frequencies in the reviews. Significant words are selected as relevant attributes for the classification to given categories using trained decision trees. Each tree branch leading to a leaf represents a subset of significant words for a category. The individual word importance is weighted by the word frequency in all the branches combined with their occurrence in branches leading to specific categories. As a result, the generated dictionaries contain only a fraction of the original huge vocabulary. The selected words express very well the positive and negative meaning, which is demonstrated for several different languages using the same processing procedure. (en)
Title
  • Text Mining-based Formation of Dictionaries Expressing Opinions in Natural Languages
  • Text Mining-based Formation of Dictionaries Expressing Opinions in Natural Languages (en)
skos:prefLabel
  • Text Mining-based Formation of Dictionaries Expressing Opinions in Natural Languages
  • Text Mining-based Formation of Dictionaries Expressing Opinions in Natural Languages (en)
skos:notation
  • RIV/62156489:43110/11:00215946!RIV14-MSM-43110___
http://linked.open...avai/riv/aktivita
http://linked.open...avai/riv/aktivity
  • Z(MSM6215648904)
http://linked.open...vai/riv/dodaniDat
http://linked.open...aciTvurceVysledku
http://linked.open.../riv/druhVysledku
http://linked.open...iv/duvernostUdaju
http://linked.open...titaPredkladatele
http://linked.open...dnocenehoVysledku
  • 235026
http://linked.open...ai/riv/idVysledku
  • RIV/62156489:43110/11:00215946
http://linked.open...riv/jazykVysledku
http://linked.open.../riv/klicovaSlova
  • natural languages; text mining; opinion analysis; significant words; machine learning; decision tree (en)
http://linked.open.../riv/klicoveSlovo
http://linked.open...ontrolniKodProRIV
  • [CA8DAFF86CD3]
http://linked.open...v/mistoKonaniAkce
  • Brno
http://linked.open...i/riv/mistoVydani
  • Brno
http://linked.open...i/riv/nazevZdroje
  • Mendel 2011: 17th International Conference on Soft Computing
http://linked.open...in/vavai/riv/obor
http://linked.open...ichTvurcuVysledku
http://linked.open...cetTvurcuVysledku
http://linked.open...UplatneniVysledku
http://linked.open...iv/tvurceVysledku
  • Žižka, Jan
  • Dařena, František
http://linked.open...vavai/riv/typAkce
http://linked.open...ain/vavai/riv/wos
  • 302647900059
http://linked.open.../riv/zahajeniAkce
http://linked.open...n/vavai/riv/zamer
number of pages
http://purl.org/ne...btex#hasPublisher
  • Vysoké učení technické v Brně
https://schema.org/isbn
  • 978-80-214-4302-0
http://localhost/t...ganizacniJednotka
  • 43110
Faceted Search & Find service v1.16.118 as of Jun 21 2024


Alternative Linked Data Documents: ODE     Content Formats:   [cxml] [csv]     RDF   [text] [turtle] [ld+json] [rdf+json] [rdf+xml]     ODATA   [atom+xml] [odata+json]     Microdata   [microdata+json] [html]    About   
This material is Open Knowledge   W3C Semantic Web Technology [RDF Data] Valid XHTML + RDFa
OpenLink Virtuoso version 07.20.3240 as of Jun 21 2024, on Linux (x86_64-pc-linux-gnu), Single-Server Edition (126 GB total memory, 58 GB memory in use)
Data on this page belongs to its respective rights holders.
Virtuoso Faceted Browser Copyright © 2009-2024 OpenLink Software