Multi-domain learning by confidence-weighted parameter combination
Authors: Mark Dredze, Alex Kulesza, Koby Crammer
Affiliations:
1. Human Language Technology Center of Excellence, Johns Hopkins University, Baltimore, MD, 21211, USA
2. Department of Computer and Information Science, University of Pennsylvania, Philadelphia, PA, 19104, USA
3. Department of Electrical Engineering, The Technion, Haifa, 32000, Israel
Abstract: State-of-the-art statistical NLP systems for a variety of tasks learn from labeled training data that is often domain specific. However, there may be multiple domains or sources of interest on which the system must perform. For example, a spam filtering system must give high quality predictions for many users, each of whom receives emails from different sources and may make slightly different decisions about what is or is not spam. Rather than learning separate models for each domain, we explore systems that learn across multiple domains. We develop a new multi-domain online learning framework based on parameter combination from multiple classifiers. Our algorithms draw from multi-task learning and domain adaptation to adapt multiple source domain classifiers to a new target domain, learn across multiple similar domains, and learn across a large number of disparate domains. We evaluate our algorithms on two popular NLP domain adaptation tasks: sentiment classification and spam filtering.
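The abstract does not spell out the combination rule, so the following is only a minimal sketch of what "confidence-weighted parameter combination" could look like. It assumes each domain classifier is a linear model with a per-feature mean weight and a per-feature variance (lower variance meaning higher confidence), and it combines them with a precision-weighted average. The function names `combine_parameters` and `predict`, and the averaging rule itself, are illustrative assumptions and not necessarily the authors' method.

```python
from typing import List, Sequence


def combine_parameters(
    means: Sequence[Sequence[float]],
    variances: Sequence[Sequence[float]],
) -> List[float]:
    """Combine per-domain linear classifiers into one weight vector.

    Assumption (not from the abstract): each classifier k provides a mean
    weight means[k][j] and a positive variance variances[k][j] for feature j.
    Each feature is combined by a precision-weighted average, so classifiers
    that are more confident (lower variance) about a feature count for more.
    """
    n_features = len(means[0])
    combined = []
    for j in range(n_features):
        # Precision is the inverse of variance: higher means more confident.
        precisions = [1.0 / var[j] for var in variances]
        weighted_sum = sum(p * m[j] for p, m in zip(precisions, means))
        combined.append(weighted_sum / sum(precisions))
    return combined


def predict(weights: Sequence[float], x: Sequence[float]) -> int:
    """Binary prediction (+1 or -1) from the sign of the linear score."""
    score = sum(w * xi for w, xi in zip(weights, x))
    return 1 if score >= 0 else -1


# Two hypothetical domain classifiers over two features.
domain_means = [[1.0, -2.0], [3.0, 2.0]]
domain_variances = [[1.0, 4.0], [1.0, 1.0]]

combined_weights = combine_parameters(domain_means, domain_variances)
label = predict(combined_weights, [1.0, 1.0])
```

In this toy setup the two classifiers disagree on the second feature, and the combined weight leans toward the second classifier because its variance on that feature is lower.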
This article is indexed in SpringerLink and other databases.