首页 | 本学科首页   官方微博 | 高级检索  
     

基于计量风格学的小说质量分析
引用本文:李艳丽,李宛蓉,廖欣,李静娟,汤露,刘喜平.基于计量风格学的小说质量分析[J].计算机与现代化,2019,0(5):19-24,107.
作者姓名:李艳丽  李宛蓉  廖欣  李静娟  汤露  刘喜平
作者单位:江西财经大学信息管理学院,江西 南昌,330013;江西财经大学信息管理学院,江西 南昌,330013;江西财经大学信息管理学院,江西 南昌,330013;江西财经大学信息管理学院,江西 南昌,330013;江西财经大学信息管理学院,江西 南昌,330013;江西财经大学信息管理学院,江西 南昌,330013
基金项目:国家自然科学基金资助项目(61462037)
摘    要:从计量风格学的角度来对小说文本进行比较研究。目前对小说文本的研究以定性为主,很少有定量的;以主观分析的居多,客观实证分析的较少。采集涉及网络小说和经典小说的225部小说作品,分成3个作品集,分别对应"优秀"、"良好"和"较差"的作品。对于每个作品,提取篇幅、词性、节奏、词汇量等方面的特征,基于这些特征,构造决策树、神经网络、贝叶斯等分类模型,由此来发现3个作品集之间的关键差异。研究发现,3个作品集在计量风格统计特征上有着较为明显的区别;对于不同的作品集,不同的特征具有不同的区分度。

关 键 词:风格计量学  文本分析  小说文本
收稿时间:2019-05-14

Stylometry-based Analysis of Literature Texts
LI Yan-li,LI Wan-rong,LIAO Xin,LI Jing-juan,TANG Lu,LIU Xi-ping.Stylometry-based Analysis of Literature Texts[J].Computer and Modernization,2019,0(5):19-24,107.
Authors:LI Yan-li  LI Wan-rong  LIAO Xin  LI Jing-juan  TANG Lu  LIU Xi-ping
Affiliation:(School of Information Technology,Jiangxi University of Finance and Economics,Nanchang 330013,China)
Abstract:This study compares literary works from the perspective of stylometry. At present, the research on literature is mainly qualitative and subjective analysis, and there are few quantitative studies and empirical analysis. A total number of 225 literary works are collected in the study, including Internet literary works and classical literary works, which are divided into three subsets, corresponding to the “excellent”, “good” and “poor”. For each work, a lot of features regarding article length, part of speech, rhythm, vocabulary, etc. are extracted. Based on these features, classifiers such as decision trees, neural networks and Bayesian are constructed. The models are utilized to find the key differences among the three datasets. The study found that the three datasets have obvious differences in stylometry statistics, and for different pair of datasets, the features have different discriminative power.
Keywords:stylometry  text analysis  literature text  
本文献已被 维普 万方数据 等数据库收录!
点击此处可从《计算机与现代化》浏览原始摘要信息
点击此处可从《计算机与现代化》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号