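Research on Web Text Mining Techniques (Web文本挖掘技术研究)
Cite this article: WANG Ji-Cheng, PAN Jin-Gui, ZHANG Fu-Yan. Research on Web Text Mining Techniques [J]. Journal of Computer Research and Development (计算机研究与发展), 2000, 37(5): 513-520.
Authors: WANG Ji-Cheng, PAN Jin-Gui, ZHANG Fu-Yan
Affiliations: Department of Computer Science and Technology, Nanjing University, Nanjing 210093; State Key Laboratory for Novel Software Technology, Nanjing University, Nanjing 210093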
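Abstract: Web mining, an effective technique for discovering latent and valuable knowledge in the vast information resources of the Web, is quietly emerging and attracting considerable attention. Research on Web mining has not yet reached unified conclusions, and scholars at home and abroad need to discuss its theory further; at the same time, the development of Web mining systems will greatly advance this research. The paper first examines the theory of Web mining, covering its definition, the relationship between Web mining and Web information retrieval, and the classification and functions of Web mining tasks. It then focuses on analyzing …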

Keywords: text mining; text categorization; text clustering; information retrieval; Web

RESEARCH ON WEB TEXT MINING
WANG Ji-Cheng, PAN Jin-Gui, ZHANG Fu-Yan. RESEARCH ON WEB TEXT MINING [J]. Journal of Computer Research and Development, 2000, 37(5): 513-520.
Authors: WANG Ji-Cheng, PAN Jin-Gui, ZHANG Fu-Yan
Abstract: With the flood of information on the Web, Web mining has become a new research topic that draws great interest from many communities. There is currently no agreement on what Web mining is, and more discussion among researchers is needed to define it precisely. Meanwhile, the development of Web mining systems will in turn promote research in the field. This paper presents a systematic discussion of the principles of Web mining, including its definition, the relationship between information mining and information retrieval on the Web, and its taxonomy and functions. The methods of text mining on the Web are then discussed in detail, and WebMiner, a prototype Web text mining system, is introduced. WebMiner is a multi-agent system that combines text mining with multi-dimension text analysis to help users mine HTML documents on the Web efficiently and effectively.
Keywords: Web mining; text mining; text categorization; text clustering; multi-dimension text analysis
This article is indexed in CNKI, VIP (维普), Wanfang Data (万方数据), and other databases.