首页 | 本学科首页   官方微博 | 高级检索  
     

唐代以来汉语文学作品中的字频演变
引用本文:刘宇凡,郭金忠,陈清华.唐代以来汉语文学作品中的字频演变[J].中文信息学报,2011,25(3):93-98.
作者姓名:刘宇凡  郭金忠  陈清华
作者单位:1. 石家庄经济学院 人文社科学院,河北 石家庄 050031;2. 北京师范大学 管理学院,北京 100875
基金项目:北京师范大学青年教师科研基金
摘    要:研究历史上各个时期汉语文学作品中的字频分布具有重要意义,可以帮助我们更加深入研究汉语言的历史演变,但这在以前的语言统计工作中是缺乏的。该文对唐代以来的文学作品按不同时期进行分类建立语料库,字频分析的结果表明自唐代以来人们使用汉字的习惯处于不断变化之中,时期越相近,汉字的使用习惯就更具一致性。从分布上看,不同时期的字频都可以用一个指数截断的幂律函数进行很好的拟合,随着历史的发展,幂律性质不断衰减而指数性质不断增强。

关 键 词:汉语文学作品  字频分布  指数截断的幂律  

The Evolution of Character Using Frequency in Chinese Literature Since the Tang Dynasty
LIU Yufan,GUO Jinzhong,CHEN Qinghua.The Evolution of Character Using Frequency in Chinese Literature Since the Tang Dynasty[J].Journal of Chinese Information Processing,2011,25(3):93-98.
Authors:LIU Yufan  GUO Jinzhong  CHEN Qinghua
Affiliation:1. School of Humanities and Social Sciences, Shijiazhuang University of Economics, Shijiazhuang, Hebei 050031, China;
2. School of Management, Beijing Normal University, Beijing 100875, China
Abstract:It is meaningful to investigate character frequency distribution among Chinese literatures across different periods since it could help us to know more about how Chinese language evolves over time. This paper presents the change of Chinese character frequency distribution since Tang Dynasty, by counting the character frequencies of 5 classical as well as modern Chinese literatures. It is clear that two character frequency distributions are more similar when they are derived from closer periods, and all the distributions could be well fitted by exponential power law functions. And the exponential property is increasing while the power law feature is decreasing over time.
Key wordsChinese literature; character frequency distribution; exponential truncated power law
Keywords:Chinese literature  character frequency distribution  exponential truncated power law  
本文献已被 万方数据 等数据库收录!
点击此处可从《中文信息学报》浏览原始摘要信息
点击此处可从《中文信息学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号