BoosTexter: A Boosting-based System for Text Categorization |
| |
Authors: | Schapire Robert E Singer Yoram |
| |
Affiliation: | (1) Shannon Laboratory, AT&T Labs, 180 Park Avenue, Room A279, Florham Park, NJ 07932-0971, USA;(2) School of Computer Science & Engineering, The Hebrew University, Jerusalem, 91904, Israel |
| |
Abstract: | This work focuses on algorithms which learn from examples to perform multiclass text and speech categorization tasks. Our approach is based on a new and improved family of boosting algorithms. We describe in detail an implementation, called BoosTexter, of the new boosting algorithms for text categorization tasks. We present results comparing the performance of BoosTexter and a number of other text-categorization algorithms on a variety of tasks. We conclude by describing the application of our system to automatic call-type identification from unconstrained spoken customer responses. |
| |
Keywords: | text and speech categorization multiclass classification problems boosting algorithms |
本文献已被 SpringerLink 等数据库收录! |
|