首页 | 本学科首页   官方微博 | 高级检索  
     


Multilingual Sentiment Mining System to Prognosticate Governance
Authors:Muhammad Shahid Bhatti  Saman Azhar  Abid Sohail  Mohammad Hijji  Hamna Ayemen  Areesha Ramzan
Affiliation:1.COMSATS University Islamabad, Lahore Campus, Lahore, Pakistan2 Computer Science Department, University of Tabuk, Tabuk, Saudi Arabia
Abstract:In the age of the internet, social media are connecting us all at the tip of our fingers. People are linkedthrough different social media. The social network, Twitter, allows people to tweet their thoughts on any particular event or a specific political body which provides us with a diverse range of political insights. This paper serves the purpose of text processing of a multilingual dataset including Urdu, English, and Roman Urdu. Explore machine learning solutions for sentiment analysis and train models, collect the data on government from Twitter, apply sentiment analysis, and provide a python library that classifies text sentiment. Training data contained tweets in three languages: English: 200k, Urdu: 200k and Roman Urdu: 11k. Five different classification models are applied to determine sentiments, and eventually, the use of ensemble technique to move forward with the acquired results is explored. The Logistic Regression model performed best with an accuracy of 75%, followed by the Linear Support Vector classifier and Stochastic Gradient Descent model, both having 74% accuracy. Lastly, Multinomial Naïve Bayes and Complement Naïve Bayes models both achieved 73% accuracy.
Keywords:Multilingual NLP  artificial intelligence  government  sentiment analysis  NLP  NLTK  ensemble technique  multilingual  twitter  data science
点击此处可从《计算机、材料和连续体(英文)》浏览原始摘要信息
点击此处可从《计算机、材料和连续体(英文)》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号