
<oai_dc:dc xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/">
  <dc:format>application/pdf</dc:format>
  <dc:format>185071 bytes</dc:format>
  <dc:description xml:lang="eng">Abstract: Given the growing need to quickly process texts and extract information from the data for various purposes, correct normalization that will contribute to better and faster processing is of great importance. The paper presents the comparison of different methods of short text (tweet) normalization. The comparison is illustrated by the example of text sentiment analysis. The results of an application of different normalizations are presented, taking into account time complexity and sentiment algorithm classification accuracy. It has been shown that using cutting to n-gram normalization, better or similar results are obtained compared to language-dependent normalizations. Including the time complexity, it is concluded that the application of this language independent normalization gives optimal results in the classification of short informal texts.</dc:description>
  <dc:source>Facta Universitatis, Series: Mathematics and Informatics 33(5)</dc:source>
  <dc:title xml:lang="eng">Comparison of the influence of different normalization methods on tweet sentiment analysis in the Serbian language</dc:title>
  <dc:date>2018</dc:date>
  <dc:creator id="https://orcid.org/0000-0001-7326-059X">Ljajić, Adela</dc:creator>
  <dc:creator id="https://orcid.org/0000-0001-7232-3755 https://plus.cobiss.net/cobiss/sr/sr/conor/86849033">Marovac, Ulfeta</dc:creator>
  <dc:creator>Stanković, Milena</dc:creator>
  <dc:identifier>https://phaidrabg.bg.ac.rs/o:30278</dc:identifier>
  <dc:identifier>doi:10.22190/FUMI1805683L</dc:identifier>
  <dc:identifier>ISSN: 0352-9665</dc:identifier>
  <dc:subject xml:lang="eng">Keywords: sentiment analysis, normalization, Serbian language</dc:subject>
  <dc:rights>All rights reserved</dc:rights>
  <dc:type>info:eu-repo/semantics/article</dc:type>
  <dc:language>eng</dc:language>
</oai_dc:dc>
