首页 | 本学科首页   官方微博 | 高级检索  
     

基于主曲线的微阵列数据分类
引用本文:祁云篙,孙怀江.基于主曲线的微阵列数据分类[J].计算机科学,2010,37(12):203-205.
作者姓名:祁云篙  孙怀江
作者单位:1. 江苏科技大学计算机学院,镇江,212003;南京理工大学计算机学院,南京,210094
2. 南京理工大学计算机学院,南京,210094
基金项目:本文受国家自然科学基金(60773172)资助。
摘    要:提出了一种基于主曲线(principal curves)的微阵列数据分类方法(PC)。主曲线是第一主成分的非线性推广,它是数据集合的“骨架”,数据集合是主曲线的“云”。基于主曲线的微阵列数据分类方法,首先利用专门设计的算法在训练数据集上计算出每类样本的主曲线,然后根据测试样本与各类样本主曲线距离的期望方差来确定测试样本所属的类别。实验结果表明,该分类方法在进行小样本微阵列数据分类时性能优于现有的方法。

关 键 词:基因微阵列,主曲线,模式分类

Microarray Data Classification Based on Principal Curves
QI Yun-song,SUN Huai-jiang.Microarray Data Classification Based on Principal Curves[J].Computer Science,2010,37(12):203-205.
Authors:QI Yun-song  SUN Huai-jiang
Affiliation:(School of Computer Science and Engineering,Jiangsu University of Science and Technology,Zhenjiang 212003,China);(School of Computer Science and Technology, Nanjing University of Science and Technology, Nanjing 210094,China)
Abstract:In this paper, a novel classifier was proposed to classify microarray data using principal curves. Principal curves are the non-linear generalization of principal components. Intuitively, a principal curve `passes through the middle of the data cloud'. As a kind of new classification technique,Principal Curve-based classifier (PC) involves a novel way of computing a principal curve for each class using the training data. A test sample is the class-label of the principal curve that is closest to it according to Expected Sctuared Error. Experimental results illustrate the performance of the PC is better than other existing approaches when a very small sample size of a microarray set is concerned.
Keywords:Microarray data  Principal curve  Pattern classification
本文献已被 万方数据 等数据库收录!
点击此处可从《计算机科学》浏览原始摘要信息
点击此处可从《计算机科学》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号