首页 | 本学科首页   官方微博 | 高级检索  
     

中文信息处理的词法问题——以句本位语法图解树库构建为背景
引用本文:彭炜明,宋继华,俞士汶.中文信息处理的词法问题——以句本位语法图解树库构建为背景[J].中文信息学报,2014,28(2):1-7.
作者姓名:彭炜明  宋继华  俞士汶
作者单位:1. 北京大学 计算语言学教育部重点实验室 北京大学 计算语言学研究所,北京 100871;
2. 北京师范大学 信息科学与技术学院,北京 100875
基金项目:国家社科重大项目(12&ZD227);中国博士后科学基金面上资助项目(2013M530455
摘    要:该文对比了句本位语法图解树库与中文信息处理现行词法规范在分词单位和词类标注两方面的差异,指出目前自动词法分析与句法分析的若干脱节之处,梳理了图解树库中关于临时造词、惯用语等特殊结构的标注策略和语言学理据,并探讨了“依句辨品”和“指称化”等汉语词类相关理论在中文信息处理中的实现方式。

关 键 词:中文信息处理  临时造词  句本位语法  图解树库  

Lexical Issues in Chinese Information Processing:in the Background of Sentence-based Diagram Treebank Construction
PENG Weiming,SONG Jihua,YU Shiwen.Lexical Issues in Chinese Information Processing:in the Background of Sentence-based Diagram Treebank Construction[J].Journal of Chinese Information Processing,2014,28(2):1-7.
Authors:PENG Weiming  SONG Jihua  YU Shiwen
Affiliation:1. MOE Key Laboratory of Computational Linguistics (Peking University), Institute of Computational Linguistics, Peking University, Beijing 100871, China;
2. College of Information Science and Technology, Beijing Normal University, Beijing 100875, China
Abstract:This paper compares the Sentence-based DiagramTreebank with existing lexical specification in the aspect of word segmentation unit and POStagging, revealing the disjunction between automatic lexical analysis and parsing in the current Chinese information processing.It describes the parsing strategy of some special structures such as nonce formation and idiomsin the Diagram Treebank as well as their linguistics rationale. It also explores the implementation of the Chinese word classtheories such as “For All Words,the Word-class Is Based on the Sentence” and “Referentiality” in Chinese information processing.
Keywords:Chinese information processing  nonce formation  sentence-based grammar  diagram treebank  
本文献已被 CNKI 等数据库收录!
点击此处可从《中文信息学报》浏览原始摘要信息
点击此处可从《中文信息学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号