首页 | 本学科首页   官方微博 | 高级检索  
     


Tagging Icelandic text: an experiment with integrations and combinations of taggers
Authors:Hrafn Loftsson
Affiliation:(1) Department of Computer Science, University of Sheffield, 211 Regent Court, Portobello Street, S1 4DP Sheffield, UK;(2) Department of Computer Science, Reykjavik University, Kringlan 1, 103 Reykjavik, Iceland
Abstract:We use integrations and combinations of taggers to improve the tagging accuracy of Icelandic text. The accuracy of the best performing integrated tagger, which consists of our linguistic rule-based tagger for initial disambiguation and a trigram tagger for full disambiguation, is 91.80%. Combining five different taggers, using simple voting, results in 93.34% accuracy. By adding two linguistically motivated rules to the combined tagger, we obtain an accuracy of 93.48%. This method reduces the error rate by 20.5%, with respect to the best performing tagger in the combination pool.
Keywords:Combination of taggers  Integration of taggers  Linguistically motivated rules  Simple voting  Tagging accuracy
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号