融合迁移学习的TranCo-Training分类模型 An Enhanced TranCo-Training Categorization Model with Transfer Learning期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

融合迁移学习的TranCo-Training分类模型

引用本文：	唐焕玲,于立萍,鲁明羽.融合迁移学习的TranCo-Training分类模型[J].模式识别与人工智能,2013,26(5):432-439.

作者姓名：	唐焕玲于立萍鲁明羽

作者单位：	1.山东省高校智能信息处理重点实验室山东工商学院烟台264005 2.大连海事大学信息科学技术学院大连116026

基金项目：	国家自然科学基金资助项目(No.61073133,61175053,61272369,61272244)

摘要：	半监督学习中当未标注样本与标注样本分布不同时，将导致分类器偏离目标数据的主题，降低分类器的正确性.文中采用迁移学习技术，提出一种TranCo-Training分类模型.每次迭代，根据每个未标注样本与其近邻标注样本的分类一致性计算其迁移能力，并根据迁移能力从辅助数据集向目标数据集迁移实例.理论分析表明，辅助样本的迁移能力与其训练错误损失成反比，该方法能将训练错误损失最小化，避免负迁移，从而解决半监督学习中的主题偏离问题.实验表明，TranCo-Training优于随机选择未标注样本的RdCo-Training算法，尤其是给定少量的标注目标样本和大量的辅助未标注样本时.
关键词：	迁移学习半监督学习协同训练朴素贝叶斯文本分类
收稿时间：	2012-11-29
An Enhanced TranCo-Training Categorization Model with Transfer Learning

TANG Huan-Ling,YU Li-Ping,LU Ming-Yu.An Enhanced TranCo-Training Categorization Model with Transfer Learning[J].Pattern Recognition and Artificial Intelligence,2013,26(5):432-439.

Authors:	TANG Huan-Ling YU Li-Ping LU Ming-Yu

Affiliation:	1. Key Laboratory of Intelligent Information Processing in Universities of Shandong Shandong Institute of Business and Technology,Yantai 264005 2. Information Science and Technology College,Dalian Maritime University,Dalian 116026

Abstract:	When unlabeled data draw from different distributions compared with labeled data in semi-supervise learning,the topic biases the target domain and the performance of semi-supervised classifier decreases. The transfer technique is applied to improve the performance of semi-supervised learning in this paper. An enhanced categorization model called TranCo-training is studied which combines transfer learning techniques with co-training methods. The transferability of each unlabeled instance is computed by an important component of TranCo-training according to the consistency with its labeled neighbors. At each iteration,unlabeled instances are transferred from auxiliary dataset according to their transfer ability. Theoretical analysis indicates that transfer ability of an unlabeled instance is inversely proportional to its training error,which minimizes the training error and avoids negative transfer. Thereby,the problem of topic bias in semi-supervised learning is solved. The experimental results show that TranCo-training algorithm achieves better performance than the RdCo-training algorithm when a few labeled data on target domain and abundant unlabeled data on auxiliary domain are provided.

Keywords:	Transfer Learning Semi-Supervised Learning Co-Training Naive Bayesian Text Categorization
本文献已被 CNKI 等数据库收录！
	点击此处可从《模式识别与人工智能》浏览原始摘要信息
	点击此处可从《模式识别与人工智能》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏