首页 | 本学科首页   官方微博 | 高级检索  
     


Important citation identification using sentiment analysis of in-text citations
Affiliation:1. Department of Computer Sciences, College of Computer and Information Sciences, Princess Nourah Bint Abdulrahman University, Riyadh, Saudi Arabia;2. Department of Computer Science, National Textile University, Faisalabad, Pakistan;3. Department of Computer Science, Namal Institute, Mianwali, Pakistan;1. College of Business and Social Sciences, Antalya Bilim University, Turkey;2. Faculty of Business, Al-Ahliyya Amman University, Jordan;1. Institute of Philosophy and Institute of Sociology, University of Zürich, Switzerland;2. Institute of Sociology, University of Zürich, Switzerland;1. SciTech Strategies, Inc., 105 Rolling Road, Bala Cynwyd, PA 19004, USA;2. National Institute of Arthritis and Musculoskeletal and Skin Diseases, National Institutes of Health, 6701 Democracy Boulevard, Bethesda, MD 20892, USA;3. SciTech Strategies, Inc., 58 Russell Street, Keene, NH 03431, USA;1. Dept. of Digital Media, School of Arts and Communication, Beijing Normal University, Beijing 100875, China;2. Dept. of Sociology, Tsinghua University, Beijing 100081, China;3. Department of Communication, Michigan State University, 404 Wilson Road, East Lansing, MI 48824, United States
Abstract:Citation represents the relationship between the cited and the citing document and vice versa. Citations are widely used to measure the different aspects of knowledge-based achievements such as institutional ranking, author ranking, the impact factor of the journal, research grants, and peer judgments. A fair evaluation of research required a quantitative and qualitative assessment of citations. To perform the qualitative analysis of citations, researchers tried to classify the citations into binary classes (i.e., important and non-important). To perform this task, researchers used metadata, content, citations count, cue words or phrases, sentiment analysis, keywords, and machine learning approaches for citation classification. However, the state-of-the-art results of binary classification are inadequate for the calculation of different aspects of the researcher and their work. Therefore, this research proposed an in-text citation sentiment analysis-based approach for binary classification which effectively enhanced the results of the state-of-the-art. In this research, different machine learning-based models are evaluated to determine the in-text citations sentiments. These sentiment results are further used for positive-negative, and neutral citation counts. Furthermore, the scores of cosine similarity between paper citation pairs are also calculated and used as a feature. This sentiment and cosine similarity scores are further used as features in binary classification. The classification is performed through SVM, KLR, and Random Forest. The proposed approach is evaluated and compared with two state-of-the-art approaches on the benchmark dataset. The proposed approach can achieve 0.83 f-measure with the improvement of 13.6% for dataset 1 and 0.67 with an improvement of 8% for dataset two with a random forest classification model.
Keywords:Sentiment analysis  Cosine similarity  In-text citation  Linear SVC  Multinomial Naïve Bayes  KNN  Logistic regression  Bernoulli NB  Citation classification
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号