首页 | 本学科首页   官方微博 | 高级检索  
     

基于柔性匹配的中文文本特征提取方法
引用本文:帅正化,周学广. 基于柔性匹配的中文文本特征提取方法[J]. 计算机工程, 2010, 36(16): 63-64
作者姓名:帅正化  周学广
作者单位:海军工程大学电子工程学院,武汉,430033
摘    要:针对含有变形关键词的不良信息过滤问题,提出一种基于柔性匹配的中文文本特征信息提取方法。该方法采用柔性匹配技术识别和提取变形关键词,改进向量空间模型中特征项权重的计算方法,对具有变形形式的关键词赋予较高权重,从而提高特征信息的提取效率。实验结果表明,该方法可在保证过滤准确率的前提下,获得较高的召回率。

关 键 词:柔性匹配  特征信息提取  变形关键词  特征项权重

Feature Extraction Method in Chinese Text Based on Flexible Matching
SHUAI Zheng-hua,ZHOU Xue-guang. Feature Extraction Method in Chinese Text Based on Flexible Matching[J]. Computer Engineering, 2010, 36(16): 63-64
Authors:SHUAI Zheng-hua  ZHOU Xue-guang
Affiliation:(College of Electronic Engineering, Naval University of Engineering, Wuhan 430033)
Abstract:Aiming at the problem of filtering malicious information which contains transformed keyword, this paper presents a feature extraction method in Chinese text based on flexible matching. The method adopts flexible matching technology to identify transformed keyword, improves the computational method of feature term weight in Vector Space Model(VSM). The keyword which has transmutative form is endowed high weight to enhance extraction efficiency for feature information. Experimental result shows that the method of feature information extraction for filtering has high recall in the condition of ensuring precision.
Keywords:flexible matching  feature information extraction  transmutative keyword  feature item weight
本文献已被 维普 万方数据 等数据库收录!
点击此处可从《计算机工程》浏览原始摘要信息
点击此处可从《计算机工程》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号