首页 | 本学科首页   官方微博 | 高级检索  
     

Web文本挖掘系统及其关键技术研究
引用本文:钟艳花,余伟红,余永权.Web文本挖掘系统及其关键技术研究[J].计算机工程与应用,2003,39(34):167-169,196.
作者姓名:钟艳花  余伟红  余永权
作者单位:1. 广东工业大学计算机学院,广州,510090;广东江门教育学院计算机系,广东,江门,529020
2. 广东江门电子技术研究所,广东,江门,529000
3. 广东工业大学计算机学院,广州,510090
摘    要:随着网络信息的迅猛发展,信息量日益增加,怎样从海量的Internet上获取有用信息,WEB文本挖掘系统是挖掘技术的重要应用方向,它是指在给定的分类体系下,根据网页的内容自动判别内容类别的过程,论文对文本中所涉及的关键技术,包括K-最近邻参照法模型、基于隐马尔科夫模型(HMM)的信息抽取、机器学习方法,进行了研究和探讨,并且给出了基于信息抽取的文本挖掘系统的设计实现和下一步的研究重点。

关 键 词:Web文本挖掘  K-最近邻参照法  信息抽取  隐马尔科夫模型(HMM)
文章编号:1002-8331-(2003)34-0167-03

The Key Technical Research of Text Mining System Based on Web
Zhong Yanhua Yu Weihong Yu Yongquan.The Key Technical Research of Text Mining System Based on Web[J].Computer Engineering and Applications,2003,39(34):167-169,196.
Authors:Zhong Yanhua Yu Weihong Yu Yongquan
Abstract:With the development of network technology,the spread of internet become more and more quick.There are many types of complicated data in the information ocean.How to acquire useful knowledge quickly from the information ocean is the very difficult.The Text Mining based on Web is a new research field which can solve the problem effectively .This paper gives a research to several key techniques about Text Mining,including K-Nearest Neighbor Model, Information Extraction (IE) based on Hide in Markov Model (HMM), Machine Learning.It also describes a text mining model based on IE,and gives the results.
Keywords:Text mining based on Web  K-Nearest Neighbor  Information Extraction  Hide in Markov Model (HMM)
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号