基于统计语义方法的文本分类 Text Categorization Based on Statistic Semantic Theory期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于统计语义方法的文本分类

引用本文：	钱民,郭祥文. 基于统计语义方法的文本分类[J]. 昆明冶金高等专科学校学报, 2004, 20(4): 11-15

作者姓名：	钱民郭祥文

作者单位：	1. 昆明冶金高等专科学校,计算机与信息工程系,云南,昆明,650033 2. 中国农业银行云南省分行,科学技术处,云南,昆明,653100

摘要：	本分类是指在给定的分类体系下，根据本的内容自动地确定本所属的类别。与当前的本分类技术相比，统计语义方法描述了语义元的相互关系，定义了语义元间的亲和力、语义元集的松散度等。基于上述定义，给出了一种选取关键词集的方法，并用所获得的关键词集构造了关键词集树，完成了映射类别未知的本的词集到关键词集树的分类过程。
关键词：	文本分类统计语义方法亲和力松散度关键词集关键词集树
文章编号：	1009-0479(2004)04-0011-05
修稿时间：	2004-05-10
Text Categorization Based on Statistic Semantic Theory

QIAN Min,GUO Xiang-wen. Text Categorization Based on Statistic Semantic Theory[J]. Journal of Kunming Metallurgy College, 2004, 20(4): 11-15

Authors:	QIAN Min GUO Xiang-wen

Affiliation:	QIAN Min1,GUO Xiang-wen2

Abstract:	Text Categorization means under a defined classifying system, texts are classified automatically according to the contents. Compare with the traditional technologies of text categorization, the statistic semantic theory describes the relationship between semantic elements and defines the affinity between semantic elements and the loose degree of the semantic density. Based on the above definition, a way of selecting a key-word group, therefore a key-word group tree is presented, leading to a classifying process reflecting the word group whose category is unknown to a key-word group tree.

Keywords:	text categorization statistic semantic theory affinity loose degree key-word group tree
本文献已被 CNKI 维普万方数据等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏