About: Approaches to Samples Selection for Machine Learning Based Classification of Textual Data     Goto   Sponge   NotDistinct   Permalink

An Entity of Type : http://linked.opendata.cz/ontology/domain/vavai/Vysledek, within Data Space : linked.opendata.cz associated with source document(s)

AttributesValues
rdf:type
Description
  • The paper focuses on the process of selecting representative sample documents written in a natural language that can be used as the basis for automatic selection or classification of textual documents. A method of selecting the examples from a larger set of candidate examples, called automatic biased sample selection, is compared to random and manual selection. The methods are evaluated by experiments carried out with real world data consisting of customer reviews, with different document representations and similarity measures. The presented approach, that provided satisfactory results, faces problems related to processing user created content and huge computational complexity and can be used as an alternative to manual selection and evaluation of textual samples.
  • The paper focuses on the process of selecting representative sample documents written in a natural language that can be used as the basis for automatic selection or classification of textual documents. A method of selecting the examples from a larger set of candidate examples, called automatic biased sample selection, is compared to random and manual selection. The methods are evaluated by experiments carried out with real world data consisting of customer reviews, with different document representations and similarity measures. The presented approach, that provided satisfactory results, faces problems related to processing user created content and huge computational complexity and can be used as an alternative to manual selection and evaluation of textual samples. (en)
Title
  • Approaches to Samples Selection for Machine Learning Based Classification of Textual Data
  • Approaches to Samples Selection for Machine Learning Based Classification of Textual Data (en)
skos:prefLabel
  • Approaches to Samples Selection for Machine Learning Based Classification of Textual Data
  • Approaches to Samples Selection for Machine Learning Based Classification of Textual Data (en)
skos:notation
  • RIV/62156489:43110/13:00208671!RIV14-MSM-43110___
http://linked.open...avai/predkladatel
http://linked.open...avai/riv/aktivita
http://linked.open...avai/riv/aktivity
  • Z(MSM6215648904)
http://linked.open...iv/cisloPeriodika
  • 5
http://linked.open...vai/riv/dodaniDat
http://linked.open...aciTvurceVysledku
http://linked.open.../riv/druhVysledku
http://linked.open...iv/duvernostUdaju
http://linked.open...titaPredkladatele
http://linked.open...dnocenehoVysledku
  • 61878
http://linked.open...ai/riv/idVysledku
  • RIV/62156489:43110/13:00208671
http://linked.open...riv/jazykVysledku
http://linked.open.../riv/klicovaSlova
  • machine learning; text similarity; natural language processing; textual patterns; information retrieval; text classification (en)
http://linked.open.../riv/klicoveSlovo
http://linked.open...odStatuVydavatele
  • SK - Slovenská republika
http://linked.open...ontrolniKodProRIV
  • [63B4A35C1305]
http://linked.open...i/riv/nazevZdroje
  • Computing and Informatics
http://linked.open...in/vavai/riv/obor
http://linked.open...ichTvurcuVysledku
http://linked.open...cetTvurcuVysledku
http://linked.open...UplatneniVysledku
http://linked.open...v/svazekPeriodika
  • 32
http://linked.open...iv/tvurceVysledku
  • Žižka, Jan
  • Dařena, František
http://linked.open...ain/vavai/riv/wos
  • 327410900003
http://linked.open...n/vavai/riv/zamer
issn
  • 1335-9150
number of pages
http://localhost/t...ganizacniJednotka
  • 43110
Faceted Search & Find service v1.16.116 as of Feb 22 2024


Alternative Linked Data Documents: ODE     Content Formats:   [cxml] [csv]     RDF   [text] [turtle] [ld+json] [rdf+json] [rdf+xml]     ODATA   [atom+xml] [odata+json]     Microdata   [microdata+json] [html]    About   
This material is Open Knowledge   W3C Semantic Web Technology [RDF Data] Valid XHTML + RDFa
OpenLink Virtuoso version 07.20.3239 as of Feb 22 2024, on Linux (x86_64-pc-linux-gnu), Single-Server Edition (126 GB total memory, 68 GB memory in use)
Data on this page belongs to its respective rights holders.
Virtuoso Faceted Browser Copyright © 2009-2024 OpenLink Software