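Training Sample Selection Method Based on Boundary Samples
Cite this article: HU Lan-lan, YANG Yi-xian. Training sample selection method based on boundary samples [J]. Journal of Beijing University of Posts and Telecommunications, 2006, 29(4): 77-80.
Authors: HU Lan-lan, YANG Yi-xian
Affiliations: 1. School of Information Engineering, Beijing University of Posts and Telecommunications, Beijing 100876; 2. School of Information, University of International Business and Economics, Beijing 100029
Abstract: Taking classifier design in intrusion detection systems as an example, the problem of selecting training samples for a classifier is studied. A training sample selection method for large-scale data sets is proposed. First, clustering partitions the training data into subsets, which narrows the search range. Samples are then selected according to the within-cluster scatter and the coverage area of each sample: the boundary samples of each cluster are retained and the interior samples are deleted. Typical samples are thus kept while the number of training samples is reduced, so classifier performance is preserved and training efficiency is high.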

Keywords: sample selection; scatter; coverage area; boundary samples
Article ID: 1007-5321(2006)04-0077-04
Received: 2005-03-29
Revised: 2005-03-29

Training Sample Selection Method Based on Boundary Samples
HU Lan-lan, YANG Yi-xian. Training Sample Selection Method Based on Boundary Samples[J]. Journal of Beijing University of Posts and Telecommunications, 2006, 29(4): 77-80.
Authors: HU Lan-lan, YANG Yi-xian
Affiliation:1. School of Information Engineering, Beijing University of Posts and Telecommunications, Beijing 100876, China;
2. School of Information Technology and Management Engineering, University of International Business and Economics, Beijing 100029, China
Abstract: Taking classifier design for an intrusion detection system as an example, the selection of training samples for a classifier is studied. A new sample selection method for large data sets is proposed. First, clustering reduces the size of the selection problem. Samples are then selected according to the within-cluster scatter value and the coverage area of each sample, so that the boundary samples of each cluster are retained and most of the interior ones are discarded. Experimental results show that, because typical samples are retained while the number of training samples is reduced, the generalization performance of the classifier is preserved and training is efficient.
Keywords:sample selection  scatter  coverage area  boundary samples
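
The two-stage idea in the abstract (cluster first, then keep each cluster's boundary samples) can be illustrated with a minimal sketch. The abstract does not give the exact selection criterion, so everything below is an assumption made for illustration: a plain k-means stands in for the clustering step, the within-cluster scatter is taken as the mean squared distance to the cluster centroid, and a sample counts as a "boundary" sample when its distance to the centroid is at least `ratio` times the root of that scatter. The coverage-area test is not modeled, and the names `kmeans`, `select_boundary`, and `ratio` are hypothetical, not the authors'.

```python
import math
import random


def kmeans(points, k, iters=20, seed=0):
    """Plain Lloyd's k-means (an assumed stand-in for the paper's clustering step).

    Returns (labels, centers), where labels[i] is the cluster index of points[i].
    """
    rng = random.Random(seed)
    centers = rng.sample(points, k)
    labels = [0] * len(points)
    for _ in range(iters):
        # Assign each point to its nearest center.
        labels = [min(range(k), key=lambda c: math.dist(p, centers[c]))
                  for p in points]
        # Move each center to the mean of its current members.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centers[c] = tuple(sum(x) / len(members) for x in zip(*members))
    return labels, centers


def select_boundary(points, labels, k, ratio=1.0):
    """Keep samples far from their cluster centroid; drop interior ones.

    Assumed criterion (not from the paper): keep a sample when its distance
    to the centroid is >= ratio * sqrt(within-cluster scatter), where the
    scatter is the mean squared distance of cluster members to the centroid.
    """
    kept = []
    for c in range(k):
        members = [p for p, lab in zip(points, labels) if lab == c]
        if not members:
            continue
        centroid = tuple(sum(x) / len(members) for x in zip(*members))
        dists = [math.dist(p, centroid) for p in members]
        scatter = sum(d * d for d in dists) / len(members)
        threshold = ratio * math.sqrt(scatter)
        kept.extend(p for p, d in zip(members, dists) if d >= threshold)
    return kept


if __name__ == "__main__":
    # Synthetic two-blob data, used only to exercise the sketch.
    rng = random.Random(42)
    data = [(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(200)]
    data += [(rng.gauss(6, 1), rng.gauss(6, 1)) for _ in range(200)]
    labels, _ = kmeans(data, k=2)
    reduced = select_boundary(data, labels, k=2)
    print(f"kept {len(reduced)} of {len(data)} training samples")
```

Raising `ratio` discards more of each cluster's interior and shrinks the training set further. How aggressive the pruning can be before classifier accuracy suffers depends on the data and would have to be checked empirically.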