基于语义的高维数据聚类技术 Clustering Technology for High Dimensional Data Based on Semantics期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于语义的高维数据聚类技术

引用本文：	刘铭,王晓龙,刘远超. 基于语义的高维数据聚类技术[J]. 电子学报, 2009, 37(5): 925-929

作者姓名：	刘铭王晓龙刘远超

作者单位：	哈尔滨工业大学计算机科学与技术学院,黑龙江哈尔滨 150001

基金项目：	国家自然科学基金重点项目，国家高技术研究发展计划(863计划)

摘要：	本文提出一种有效处理高维数据的聚类算法,算法首先通过构造特征链将文档集合划分为多个类别,同时在相似度计算及权值调整时考虑相似特征的影响以凝聚语义相似的文档,并动态调整文档权重使分布不平衡的文档得到充分训练.实验表明:该算法在高维空间能够获得较好的聚类结果,类内相似度高,类间区分性好,迭代次数较少.
关键词：	语义相似度计算自组织映射特征链权值调整
收稿时间：	2008-04-29
Clustering Technology for High Dimensional Data Based on Semantics

LIU Ming,WANG Xiao-long,LIU Yuan-chao. Clustering Technology for High Dimensional Data Based on Semantics[J]. Acta Electronica Sinica, 2009, 37(5): 925-929

Authors:	LIU Ming WANG Xiao-long LIU Yuan-chao

Affiliation:	School of Computer Science and Technology,Harbin Institute of Technology,Harbin,Heilongjiang 150001,China

Abstract:	A novel clustering algorithm for high dimensional data is proposed in this paper.This algorithm first partitions input document set into some clusters by constructing feature chains.Simultaneously it also considers the effects of similar features in similarity computation and weight adjustment to agglomerate documents with semantic similarities,and dynamically adjusts weights of documents to make unbalanced documents well trained.Experiment results demonstrate that it can obtain relatively better clustering...

Keywords:	semantic similarity computation self organizing mapping feature chain weight adjustment
本文献已被 CNKI 维普万方数据等数据库收录！
	点击此处可从《电子学报》浏览原始摘要信息
	点击此处可从《电子学报》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏