首页 | 本学科首页   官方微博 | 高级检索  
     

自组织中文语义映射网络的优化特征编码方法
引用本文:张敏,马青,马少平. 自组织中文语义映射网络的优化特征编码方法[J]. 中文信息学报, 2003, 17(3): 28-34
作者姓名:张敏  马青  马少平
作者单位:1.清华大学计算机科学与技术系智能技术与系统国家重点实验室2.Communications Research Laboratory
基金项目:国家重点基础研究 ( 973) (G19980 30 5 0 9),自然科学基金资助项目 ( 60 2 2 30 0 4 ),国家 863高科技资助项目 ( 2 0 0 1AA1140 82 )
摘    要:本文介绍自组织中文语义映射网络,并分别基于集合论、代数理论和概率论研究和提出六种不同的特征编码方法,这对自组织语义映射效果有很重要的影响。通过性能评价得出如下结论:使用TFIDF修正的频率密度编码能得到最佳效果,其语义映射的精确度和召回率分别为94.4%和90.7% ,而基于向量模型的方法则都不适用于中文自组织语义映射。文中给出结果分析。另外比较实验结果表明文中的最好方法其系统性能好于目前广泛采用的分层聚类技术,并远好于多元统计分析技术,例如主成分分析的特征降维编码。

关 键 词:计算机应用  中文信息处理  中文语义映射  自组织映射  特征编码  相似度计算  Kohonen网络  
文章编号:1003-0077(2003)03-0027-07
修稿时间:2002-11-04

Optimizing Feature Encoding for Self- Organizing Chinese Semantic Maps
ZHANG Min ,MA Qing ,MA Shao-ping. Optimizing Feature Encoding for Self- Organizing Chinese Semantic Maps[J]. Journal of Chinese Information Processing, 2003, 17(3): 28-34
Authors:ZHANG Min   MA Qing   MA Shao-ping
Affiliation:1.State Key Lab. of Intelligent Tech. and Sys., CST Dept., Tsinghua Uni.2.Communications Research Laboratory
Abstract:In this paper, we introduce self-organizing Chinese semantic map, then study and propose six different approaches of feature encoding which is crucial to the performance of a SOM. The approaches are based on set theory, algebra, and probabilistic theory respectively. We conclude from the evaluation results that the method of combining frequency density approach and TFIDF approach has the best performance with 94.4% of precision and 90.7% of recall on semantic mapping, and vector space oriented approaches are not suitable for the task. Analyses of results are also given. Comparative experiments show that the best approach in this paper is better than conventional hierarchy clustering technique, and much better than multivariate statistical analyses such as principle component analyses on dimension reduction based feature encoding.
Keywords:computer application  Chinese information processing  Chinese semantic map  self-organizing map  feature encoding  kohonen network  
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《中文信息学报》浏览原始摘要信息
点击此处可从《中文信息学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号