首页 | 本学科首页   官方微博 | 高级检索  
     


Sentiment analysis leveraging emotions and word embeddings
Affiliation:1. Department of Informatics, Aristotle University of Thessaloniki, Thessaloniki GR 54124, Greece;2. Department of Information Technology, Technological Education Institute of Thessaloniki, Sindos GR 57400, Greece;3. mSensis S.A., VEPE Technopolis, Bld C2, P.O. Box 60756, GR-57001 Thessaloniki, Greece;1. Department of Informatics, Aristotle University of Thessaloniki, Thessaloniki GR 54124, Greece;2. Department of Information Technology, Technological Education Institute of Thessaloniki, Sindos GR 57400, Greece;3. mSensis S.A., VEPE Technopolis, Bld C2, P.O. Box 60756, GR-57001 Thessaloniki, Greece;1. School of Computing Science and Digital Media, Robert Gordon University, Aberdeen and AB10 7QB, UK;2. School of Electronics Electrical Engineering and Computer Science, Queens University, Belfast and BT7 1NN, UK;1. Dept. of Computer Science and Engineering, POSTECH, South Korea;2. University of Illinois at Urbana-Champaign, USA;3. NAVER Corporation, Seongnam, South Korea
Abstract:Sentiment analysis and opinion mining are valuable for extraction of useful subjective information out of text documents. These tasks have become of great importance, especially for business and marketing professionals, since online posted products and services reviews impact markets and consumers shifts. This work is motivated by the fact that automating retrieval and detection of sentiments expressed for certain products and services embeds complex processes and pose research challenges, due to the textual phenomena and the language specific expression variations. This paper proposes a fast, flexible, generic methodology for sentiment detection out of textual snippets which express people’s opinions in different languages. The proposed methodology adopts a machine learning approach with which textual documents are represented by vectors and are used for training a polarity classification model. Several documents’ vector representation approaches have been studied, including lexicon-based, word embedding-based and hybrid vectorizations. The competence of these feature representations for the sentiment classification task is assessed through experiments on four datasets containing online user reviews in both Greek and English languages, in order to represent high and weak inflection language groups. The proposed methodology requires minimal computational resources, thus, it might have impact in real world scenarios where limited resources is the case.
Keywords:
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号