Transfer active learning by querying committee
Cite this article: Hao SHAO, Feng TAO, Rui XU. Transfer active learning by querying committee[J]. Journal of Zhejiang University-Science C (Computers and Electronics), 2014, 15(2): 107-118. DOI: 10.1631/jzus.C1300167
Authors: Hao SHAO, Feng TAO, Rui XU
Funding: Project supported by the Humanity and Social Science Youth Foundation of Ministry of Education of China (No. 13YJC630126), the 085 Foundation of SUIBE (Nos. Z085YYJ13014 and 085LXPT13020), the Fundamental Research Funds for the Central Universities (No. WK0110000032), the National Natural Science Foundation of China (Nos. 71171184, 71201059, 71201151, 71090401, and 71090400), and the Funds for the Creative Research Group of China (No. 70821001)
Abstract: In real applications of inductive learning for classification, labeled instances are often deficient, and labeling them by an oracle is often expensive and time-consuming. Active learning on a single task aims to select only informative unlabeled instances for querying, to improve the classification accuracy while decreasing the querying cost. However, an inevitable problem in active learning is that the informative measures for selecting queries are commonly based on the initial hypotheses sampled from only a few labeled instances. In such a circumstance, the initial hypotheses are not reliable and may deviate from the true distribution underlying the target task. Consequently, the informative measures may select irrelevant instances. A promising way to compensate for this problem is to borrow useful knowledge from other sources with abundant labeled information, which is called transfer learning. However, a significant challenge in transfer learning is how to measure the similarity between the source and the target tasks. One needs to be aware of different distributions or label assignments from unrelated source tasks; otherwise, they will lead to degraded performance while transferring. Also, how to design an effective strategy to avoid selecting irrelevant samples to query is still an open question. To tackle these issues, we propose a hybrid algorithm for active learning with the help of transfer learning, adopting a divergence measure to alleviate the negative transfer caused by distribution differences. To avoid querying irrelevant instances, we also present an adaptive strategy that can eliminate unnecessary instances in the input space and unnecessary models in the model space. Extensive experiments on both synthetic and real data sets show that the proposed algorithm is able to query fewer instances with a higher accuracy and that it converges faster than the state-of-the-art methods.

Keywords: Active learning; Transfer learning; Classification

Affiliation:1. School of WTO Research & Education, Shanghai University of International Business and Economics, Shanghai, 200336, China
2. School of Business, East China University of Science and Technology, Shanghai, 200237, China
3. School of Computer Science and Technology, University of Science and Technology of China, Hefei, 230026, China