首页 | 本学科首页   官方微博 | 高级检索  
     


Sentiment visualization and classification via semi-supervised nonlinear dimensionality reduction
Affiliation:1. Department of Industrial and Management Engineering, POSTECH, 790-784 Pohang, Kyungbuk, South Korea;2. Department of Industrial Engineering, Seoul National University, 151-744 Seoul, Republic of Korea;1. Universidad Autónoma de Aguascalientes, Department of Computer Science, Av. Universidad 940, Col. Ciudad Universitaria, Aguascalientes 20131, Aguascalientes, México;1. State Key Laboratory of Software Development Environment, Beihang University, Beijing 100191, China;2. Department of Electrical Engineering, Columbia University, New York, NY 10027, USA;3. Facebook, 1601 Willow Rd, Menlo Park, CA 94025, USA;1. Applied Math and Analisis Dept, University of Barcelona, Gran Via de les Corts Catalanes. 585, 08007 Barcelona, Spain;2. Computer Vision Center, Campus UAB, Edifici O, 08193 Bellaterra, Spain;3. Computer Science, Multimedia, and Telecommunications Dept, Universitat Oberta de Catalunya, Rambla del Poblenou 156, 08018 Barcelona, Spain;1. ACTLab, Signal Processing Group, TU Eindhoven, Den Dolech 2, Eindhoven, 5612AZ, The Netherlands;2. Department de Matemàtica Aplicada i Anàlisi, Universitat de Barcelona, Gran Via 585, Barcelona 08007, Spain
Abstract:Sentiment analysis, which detects the subjectivity or polarity of documents, is one of the fundamental tasks in text data analytics. Recently, the number of documents available online and offline is increasing dramatically, and preprocessed text data have more features. This development makes analysis more complex to be analyzed effectively. This paper proposes a novel semi-supervised Laplacian eigenmap (SS-LE). The SS-LE removes redundant features effectively by decreasing detection errors of sentiments. Moreover, it enables visualization of documents in perceptible low dimensional embedded space to provide a useful tool for text analytics. The proposed method is evaluated using multi-domain review data set in sentiment visualization and classification by comparing other dimensionality reduction methods. SS-LE provides a better similarity measure in the visualization result by separating positive and negative documents properly. Sentiment classification models trained over reduced data by SS-LE show higher accuracy. Overall, experimental results suggest that SS-LE has the potential to be used to visualize documents for the ease of analysis and to train a predictive model in sentiment analysis. SS-LE can also be applied to any other partially annotated text data sets.
Keywords:Text visualization  Semi-supervised dimensionality reduction  Laplacian eigenmaps  Sentiment classification
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号