基于概念格的Web文本聚类 Web text clustering based on concept lattice期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于概念格的Web文本聚类

引用本文：	李云,田素方,李拓,徐涛.基于概念格的Web文本聚类[J].计算机工程与应用,2008,44(23):169-171.

作者姓名：	李云田素方李拓徐涛

作者单位：	扬州大学,信息工程学院,江苏,扬州,225009;扬州大学,信息工程学院,江苏,扬州,225009;扬州大学,信息工程学院,江苏,扬州,225009;扬州大学,信息工程学院,江苏,扬州,225009

基金项目：	国家自然科学基金 , 江苏省自然科学基金

摘要：	Web文本聚类大多是基于空间向量文本表示模型的，它没有考虑特征词之间的语义关系，并且特征词的维数非常高，造成文本语义信息的损失和时间复杂度的增加。把文本作为对象，文本中的特征词作为对应的属性，形成了基于文本的形式背景，从中提取概念来表示文本并度量文本之间的相似度，从而降低了特征词的维数，减少了计算的复杂度，取得了良好的聚类结果。
关键词：	Web文档聚类概念格约简
收稿时间：	2007-10-9
修稿时间：	2007-12-17
Web text clustering based on concept lattice

LI Yun,TIAN Su-fang,LI Tuo,XU Tao.Web text clustering based on concept lattice[J].Computer Engineering and Applications,2008,44(23):169-171.

Authors:	LI Yun TIAN Su-fang LI Tuo XU Tao

Affiliation:	Institute of Information Engineering，Yangzhou University，Yangzhou，Jiangsu 225009，China

Abstract:	Web text clustering are mostly based on space vector text express model,the semantics relation of the terms in the text is not considered in this method and the dimension of the terms is very high,which results in the losing of text semantics and the increase of time complexity.The text is considered as object in this paper,and the term of text is abstract as the corresponding attribute.Therefore,a formal context is formed based on text.To express text and measure the similarity the authors extract the concept from formal context.Thus,the dimension of term is reduced,and the complexity of computation is decreased too.Theoretical analysis shows that the method of clustering is effective.

Keywords:	Web document clustering concept lattice reduce
本文献已被 CNKI 万方数据等数据库收录！
	点击此处可从《计算机工程与应用》浏览原始摘要信息
	点击此处可从《计算机工程与应用》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏