基于自适应稀疏邻域重构的无监督主动学习算法 Unsupervised Active Learning Based on Adaptive Sparse Neighbors Reconstruction期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于自适应稀疏邻域重构的无监督主动学习算法

引用本文：	吕巨建,赵慧民,陈荣军,李键红.基于自适应稀疏邻域重构的无监督主动学习算法[J].计算机科学,2018,45(6):251-258.

作者姓名：	吕巨建赵慧民陈荣军李键红

作者单位：	广东技术师范学院广州510665;广州数字内容处理及其安全性技术重点实验室广州510665,广东技术师范学院广州510665;广州数字内容处理及其安全性技术重点实验室广州510665,广东技术师范学院广州510665,广东外语外贸大学语言工程与计算实验室广州510006

基金项目：	本文受国家自然科学基金(61672008),广东省自然科学基金重点项目(2016A030311013),广东省普通高校国际合作重大项目(2015KGJHZ021),广东省自然科学基金(2016A030310335)资助

摘要：	在很多信息处理任务中,人们容易获得大量的无标签样本,但对样本进行标注是非常费时和费力的。作为机器学习领域中一种重要的学习方法,主动学习通过选择最有信息量的样本进行标注,减少了人工标注的代价。然而,现有的大多数主动学习算法都是基于分类器的监督学习方法,这类算法并不适用于无任何标签信息的样本选择。针对这个问题,借鉴最优实验设计的算法思想,结合自适应稀疏邻域重构理论,提出基于自适应稀疏邻域重构的主动学习算法。该算法可以根据数据集各区域的不同分布自适应地选择邻域规模,同步完成邻域点的搜寻和重构系数的计算,能在无任何标签信息的情况下较好地选择最能代表样本集分布结构的样本。基于人工合成数据集和真实数据集的实验表明,在同等标注代价下,基于自适应稀疏邻域重构的主动学习算法在分类精度和鲁棒性上具有较高的性能。
关键词：	主动学习稀疏重构优化实验设计直推式实验设计局部线性重构
收稿时间：	2017/1/11 0:00:00
修稿时间：	2017/3/18 0:00:00
Unsupervised Active Learning Based on Adaptive Sparse Neighbors Reconstruction

LV Ju-jian,ZHAO Hui-min,CHEN Rong-jun and LI Jian-hong.Unsupervised Active Learning Based on Adaptive Sparse Neighbors Reconstruction[J].Computer Science,2018,45(6):251-258.

Authors:	LV Ju-jian ZHAO Hui-min CHEN Rong-jun and LI Jian-hong

Affiliation:	Guangdong Polytechnic Normal University,Guangzhou 510665,China;Key Laboratory of Guangzhou Digital Content Processing and Security Technology,Guangzhou 510665,China,Guangdong Polytechnic Normal University,Guangzhou 510665,China;Key Laboratory of Guangzhou Digital Content Processing and Security Technology,Guangzhou 510665,China,Guangdong Polytechnic Normal University,Guangzhou 510665,China and Language Engineering and Computing Laboratory,Guangdong University of Foreign Studies,Guangzhou 510006,China

Abstract:	In many information processing tasks,individuals are easy to get a lot of unlabeled data,but labeling the unlabeled data is quite time-consuming and usually expensive.As an important learning method in the field of machine lear-ning,active learning reduces the cost of labeling data by selecting the most information data points to label.However,most of the existing active learning algorithms are supervised method based on the classifier,not suitable for the sample selection problem without any label information.Aiming at this problem,a novel unsupervised active learning algorithm was proposed,called active learning based on adaptive sparse neighbors reconstruction,by learning from the optimal experiment design and combining the adaptive sparse neighbors reconstruction.The proposed algorithm adaptively selects the neighborhood scale according to different regional distribution of dataset,searches the sparse neighbors and calculates the reconstruct coefficients simultaneously,and can choose the most representative data points of the distribution structure of dataset without any label information.Empirical results on both synthetic and real-world data sets show that the proposed algorithm has high performance in classification accuracy and robustness under the same labeling cost.

Keywords:	Active learning Sparse reconstruction Optimal experimental design Transductive experimental design Local linear reconstruction

	点击此处可从《计算机科学》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏