首页 | 本学科首页   官方微博 | 高级检索  
     

基于词性的文本挖掘算法在IDS日志中的应用
引用本文:胡军光,刘力,车奇. 基于词性的文本挖掘算法在IDS日志中的应用[J]. 计算机与数字工程, 2010, 38(2): 90-93
作者姓名:胡军光  刘力  车奇
作者单位:1. 空军驻深圳地区军事代表室,深圳,518026
2. 南京航空航天大学信息科学与技术学院,南京,210016
摘    要:提出一种以词性为参考值的文本挖掘算法,能有效挖掘与种子词有关的关联规则。基于Bootstrapping算法思想,既减少了预处理阶段对于词根还原的依赖,能处理日志中出现的中文词汇。增加了对于日志文本上下的理解,提高了关联规则的有效性,并应用与IDS日志挖掘之中,有效改善挖掘效率,为规则库提供关联规则。

关 键 词:词性  ICTCLAS  Bootstrapping  入侵检测系统  数据挖掘

A Text Mining Algorithm Based on Part of Speech Used in IDS Logs
Hu Junguang Liu Li Che Qi. A Text Mining Algorithm Based on Part of Speech Used in IDS Logs[J]. Computer and Digital Engineering, 2010, 38(2): 90-93
Authors:Hu Junguang Liu Li Che Qi
Affiliation:Military Deputation of Air Force in Shenzhen1;College of Information Science and Technology/a>;Nanjing University of Aeronautics and Astronautics2
Abstract:We made a text mining algorithm using part of speech(POS) as its argument,which can effectively mine the seed-related rules.Based on the idea of Bootstrapping algorithm,it can reduce the dependence of root-restoring on the pre-processing stage,process Chinese vocabulary appear in the log,increase the understanding of context,enhance the effectiveness of rule-relating.When applied in IDS log mining,it will significantly improve the mining efficiency and provide rule library with rules.
Keywords:part of speech  ICTCLAS  Bootstrapping  IDS  data mining  
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号