
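Combined Multiple Classifiers Based on a Stacking Algorithm and Their Application to Chinese Text Chunking
Cite this article: Li Heng, Zhu Jingbo, Yao Tianshun. Combined Multiple Classifiers Based on a Stacking Algorithm and Their Application to Chinese Text Chunking[J]. Journal of Computer Research and Development, 2005, 42(5): 844-848
Authors: Li Heng, Zhu Jingbo, Yao Tianshun
Affiliation (all three authors): Institute of Computer Software and Theory, Northeastern University, Shenyang 110004
Funding: Key Project of Science and Technology Research of the Ministry of Education of China (104065); Microsoft Research Asia joint funding project (60203019)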
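Abstract: In contrast to combined classifiers based on voting, this paper proposes a multiple-classifier combination method based on the stacking algorithm. A two-layer stacked framework is constructed to combine four classifiers (fnTBL, SNoW, SVM, MBL), and the available contextual information is fused into the input feature vectors of the classifiers at each layer, which gives good results in Chinese chunk recognition. Experimental results show that the combined classifier improves both precision and recall, reaching F = 93.64 when tested on the Harbin Institute of Technology treebank corpus.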

Keywords: stacking; multiple classifiers; text chunking

Combined Multiple Classifiers Based on a Stacking Algorithm and Their Application to Chinese Text Chunking
Li Heng, Zhu Jingbo, Yao Tianshun. Combined Multiple Classifiers Based on a Stacking Algorithm and Their Application to Chinese Text Chunking[J]. Journal of Computer Research and Development, 2005, 42(5): 844-848
Authors: Li Heng, Zhu Jingbo, Yao Tianshun
Abstract: In contrast to combined multiple classifiers based on a voting algorithm, a two-layer classifier-combination framework is presented for Chinese text chunking, in which four diverse classifiers (transformation-based learning, sparse network of winnows, support vector machine, and memory-based learning) are combined with a stacking algorithm. The relevant contextual information is incorporated into the two-layer framework as input feature vectors to construct more complete contextual models. The chunking experiments are carried out on the HIT Chinese Treebank Corpus. Experimental results show that the approach is effective, achieving an F score of 93.64.
Keywords: stacking; multiple classifiers; text chunking
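
The difference between voting and two-layer stacking described in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the base learners are toy most-frequent-label lookup tables standing in for the four classifiers named in the paper, the feature "views", the tag set, and the function names are hypothetical, and training the second layer on held-out first-layer predictions is a common stacking practice rather than a detail confirmed by the abstract.

```python
from collections import Counter, defaultdict

def train_lookup(examples):
    """Toy learner: map each feature to its most frequent label.
    examples is a list of (hashable_feature, label) pairs."""
    counts = defaultdict(Counter)
    for feat, label in examples:
        counts[feat][label] += 1
    # Fallback label for features never seen in training.
    overall = Counter(label for _, label in examples).most_common(1)[0][0]
    table = {f: c.most_common(1)[0][0] for f, c in counts.items()}
    return lambda feat: table.get(feat, overall)

def vote(predictions):
    """Voting baseline: the majority label among base-classifier outputs."""
    return Counter(predictions).most_common(1)[0][0]

def train_stacked(data, views, folds=2):
    """data: list of (x, label), x being a dict of token features.
    views: one function per base classifier, mapping x to a hashable feature."""
    # Layer 1 outputs used to train layer 2 come from held-out folds,
    # so the meta-classifier sees realistic base-classifier errors.
    meta_examples = []
    for k in range(folds):
        train = [d for i, d in enumerate(data) if i % folds != k]
        held = [d for i, d in enumerate(data) if i % folds == k]
        bases = [train_lookup([(v(x), y) for x, y in train]) for v in views]
        for x, y in held:
            outputs = tuple(b(v(x)) for b, v in zip(bases, views))
            meta_examples.append((outputs, y))
    meta = train_lookup(meta_examples)
    # Retrain the base classifiers on all data for use at prediction time.
    bases = [train_lookup([(v(x), y) for x, y in data]) for v in views]

    def predict(x):
        outputs = tuple(b(v(x)) for b, v in zip(bases, views))
        return meta(outputs)  # layer 2 decides from the layer 1 outputs
    return predict

# Hypothetical toy data: part-of-speech tag and previous tag per token.
data = [
    ({"pos": "DT", "prev": "<s>"}, "B-NP"),
    ({"pos": "NN", "prev": "DT"}, "I-NP"),
    ({"pos": "VB", "prev": "NN"}, "O"),
    ({"pos": "DT", "prev": "VB"}, "B-NP"),
    ({"pos": "NN", "prev": "DT"}, "I-NP"),
    ({"pos": "VB", "prev": "NN"}, "O"),
]
views = [
    lambda x: x["pos"],
    lambda x: x["prev"],
    lambda x: (x["prev"], x["pos"]),
]
predict = train_stacked(data, views)
print(predict({"pos": "NN", "prev": "DT"}))
```

Voting treats every base classifier as equally reliable, whereas the second layer here learns which combinations of first-layer outputs map to which chunk tag. In this sketch the second layer sees only the first-layer outputs; the abstract indicates that the paper also feeds contextual information into each layer.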
This article is indexed in CNKI, VIP, Wanfang Data, and other databases.