
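Analysis of Part-of-Speech Correspondence Between 《现汉》 (Dictionary of Contemporary Chinese) and 《语法信息词典》 (Grammatical Knowledge-Base Dictionary)
Cite this article:QIU Likun, ZHAO Hui, YU Shiwen, ZHU Xuefeng. Analysis of Part-of-Speech Correspondence Between 《现汉》 and 《语法信息词典》[J]. Journal of Chinese Information Processing (中文信息学报), 2017, 31(5): 1.
Authors:QIU Likun (邱立坤)  ZHAO Hui (赵慧)  YU Shiwen (俞士汶)  ZHU Xuefeng (朱学锋)
Affiliations:1. School of Chinese Language and Literature, Ludong University, Yantai, Shandong 264025;
2. Key Laboratory of Computational Linguistics (Ministry of Education), Peking University, Beijing 100871;
3. Collaborative Innovation Center for Language Ability, Xuzhou, Jiangsu 221009
Funding:National Natural Science Foundation of China (61572245); National Basic Research Program of China (2014CB340504); National Social Science Fund of China (15BYY094)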
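Abstract:Part-of-speech annotation has long been a shared concern of Chinese information processing, Chinese grammar research, and lexicology. Scholars have proposed many part-of-speech tagging schemes that differ considerably from one another, but so far no one has systematically compared large-scale part-of-speech annotation projects. This paper takes two large dictionary annotation projects as its objects of comparison: 《现代汉语词典》 (Dictionary of Contemporary Chinese), 5th edition, and 《现代汉语语法信息词典》 (Grammatical Knowledge-Base Dictionary). Using the proposed part-of-speech correspondence algorithm, it automatically identifies the differences in part-of-speech annotation between the two dictionaries and then analyzes what causes them. The results show that the two dictionaries agree closely (83.5% completely identical) and that the differences come down to three main causes: part-of-speech shift; inconsistent criteria for determining part of speech; and differences in the senses included.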

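Keywords:《现代汉语词典》 (Dictionary of Contemporary Chinese)  《现代汉语语法信息词典》 (Grammatical Knowledge-Base Dictionary)  part-of-speech annotation  part-of-speech correspondence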

Analysis of Parts-of-speech Correspondence Between DCC and GKB
QIU Likun,ZHAO Hui,YU Shiwen,ZHU Xuefeng.Analysis of Parts-of-speech Correspondence Between DCC and GKB[J].Journal of Chinese Information Processing,2017,31(5):1.
Authors:QIU Likun  ZHAO Hui  YU Shiwen  ZHU Xuefeng
Affiliation:1.School of Chinese Language and Literature, Ludong University, Yantai, Shandong 264025, China;
2.Key Laboratory of Computational Linguistics at Peking University, Ministry of Education, Beijing 100871, China;
3.Collaborative Innovation Center for Language Ability, Xuzhou, Jiangsu 221009, China
Abstract:Part-of-speech annotation has long attracted attention from Chinese information processing, Chinese grammar studies, and Chinese lexicography. Multiple part-of-speech systems have been proposed, and they differ significantly from one another. So far, however, little research has systematically compared large-scale part-of-speech annotation projects. Taking the part-of-speech annotations in the Dictionary of Contemporary Chinese (DCC) and the Grammatical Knowledge-Base Dictionary (GKB) as the objects of comparison, this paper proposes a mapping algorithm that automatically detects part-of-speech differences between the two dictionaries, and then analyzes the causes of those differences. The analysis leads to two conclusions: 1) the two dictionaries are highly consistent, with about 83.5% of the part-of-speech annotations identical; and 2) the differences can be attributed to three main causes: part-of-speech shifting, inconsistent part-of-speech annotation standards, and differences in the senses included.
Keywords:Dictionary of Contemporary Chinese  Grammatical Knowledge-Base Dictionary  part-of-speech annotation  part-of-speech correspondence  
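
The abstract does not spell out the mapping algorithm, so the following is only a minimal sketch of the general idea: map one dictionary's tags onto the other's tagset, then compare the tag sets of each shared headword. The tag mapping, the toy entries, and the function names `compare_pos` and `identical_ratio` are all illustrative assumptions, not the authors' method or real dictionary data.

```python
from typing import Dict, Set

# Hypothetical mapping from one tagset to the other (an assumption,
# not the mapping used in the paper).
TAG_MAP: Dict[str, str] = {"名": "n", "动": "v", "形": "a", "副": "d"}

# Made-up toy entries: headword -> set of part-of-speech tags.
DCC_TOY: Dict[str, Set[str]] = {
    "研究": {"动"},
    "快": {"形", "副"},
    "突然": {"形"},
    "桌子": {"名"},
    "跑": {"动"},  # present in only one dictionary, so it is skipped
}
GKB_TOY: Dict[str, Set[str]] = {
    "研究": {"v", "n"},
    "快": {"a", "d"},
    "突然": {"d"},
    "桌子": {"n"},
}


def compare_pos(
    dcc: Dict[str, Set[str]],
    gkb: Dict[str, Set[str]],
    tag_map: Dict[str, str],
) -> Dict[str, str]:
    """Label each headword found in both dictionaries.

    'identical': the mapped tag sets are equal
    'partial'  : the tag sets overlap but are not equal
    'disjoint' : the tag sets share no tag
    """
    labels: Dict[str, str] = {}
    for word in sorted(dcc.keys() & gkb.keys()):
        # Translate the first dictionary's tags; unknown tags pass through.
        mapped = {tag_map.get(tag, tag) for tag in dcc[word]}
        other = gkb[word]
        if mapped == other:
            labels[word] = "identical"
        elif mapped & other:
            labels[word] = "partial"
        else:
            labels[word] = "disjoint"
    return labels


def identical_ratio(labels: Dict[str, str]) -> float:
    """Share of compared headwords whose tag sets are identical."""
    if not labels:
        return 0.0
    same = sum(1 for label in labels.values() if label == "identical")
    return same / len(labels)


labels = compare_pos(DCC_TOY, GKB_TOY, TAG_MAP)
print(identical_ratio(labels))  # 0.5 on the toy data above
```

On real data, the 'partial' and 'disjoint' headwords would be the ones to inspect by hand for the three causes named in the abstract.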