Tagging Icelandic text: an experiment with integrations and combinations of taggers |
| |
Authors: | Hrafn Loftsson |
| |
Affiliation: | (1) Department of Computer Science, University of Sheffield, 211 Regent Court, Portobello Street, S1 4DP Sheffield, UK;(2) Department of Computer Science, Reykjavik University, Kringlan 1, 103 Reykjavik, Iceland |
| |
Abstract: | We use integrations and combinations of taggers to improve the tagging accuracy of Icelandic text. The accuracy of the best
performing integrated tagger, which consists of our linguistic rule-based tagger for initial disambiguation and a trigram
tagger for full disambiguation, is 91.80%. Combining five different taggers, using simple voting, results in 93.34% accuracy.
By adding two linguistically motivated rules to the combined tagger, we obtain an accuracy of 93.48%. This method reduces
the error rate by 20.5%, with respect to the best performing tagger in the combination pool. |
| |
Keywords: | Combination of taggers Integration of taggers Linguistically motivated rules Simple voting Tagging accuracy |
本文献已被 SpringerLink 等数据库收录! |
|