首页 | 本学科首页   官方微博 | 高级检索  
     

基于SVM的汉语句子片段划分
引用本文:马金山,刘挺,李生.基于SVM的汉语句子片段划分[J].哈尔滨工业大学学报,2009,41(5):52-55.
作者姓名:马金山  刘挺  李生
作者单位:哈尔滨工业大学计算机学院信息检索研究室,哈尔滨,150001  
基金项目:国家自然科学基金资助项目 
摘    要:针对长句子引起句法分析性能下降的问题,本文提出了一种基于SVM的句子片段划分方法:先根据语法结构将句子划分为多个片段,识别出每个片段的类别;然后根据片段的类别将句子分割为几个部分,每个部分作为句法分析的基本单元;最后将句法分析之后的各个部分进行合并,形成完整的分析结果.该方法减小了句法分析的复杂度,提高了分析的准确率.

关 键 词:依存句法分析  句子片段  依存关系  支持向量机
收稿时间:7/17/2007 3:12:12 PM

Chinese Sentence Segmentation based on SVM method
MA Jin-shan,LIU Ting,LI Sheng.Chinese Sentence Segmentation based on SVM method[J].Journal of Harbin Institute of Technology,2009,41(5):52-55.
Authors:MA Jin-shan  LIU Ting  LI Sheng
Affiliation:Information Retrieval Laboratory, Computer Science and Technology School, Harbin Institute of Technology,,
Abstract:The sentence length has a great influence on the parsing. The complexity of searching algorithm and the amount of ambiguous structures will grow rapidly with the increase of the sentence length. A method of identifying the segments based on the SVM classifier is presented to solve the problem of sentence length in the parsing. Firstly, a sentence is divided into different segments, each of which is assigned a label to indicate its syntactic type. Then the sentence is parsed based on the segments. Finally, all the segments are linked through the dependency relations and the parsing of the whole dependency tree is completed. Experiments show that the identification of segments decreases the complexity of parsing and improves the accuracy of Chinese dependency parsing.
Keywords:dependency parsing  segment  dependency relation  SVM
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《哈尔滨工业大学学报》浏览原始摘要信息
点击此处可从《哈尔滨工业大学学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号