首页 | 本学科首页   官方微博 | 高级检索  
     


Web document clustering using a hybrid neural network
Authors:M Shamim Khan  Sebastian W Khor
Affiliation:School of Information Technology, Murdoch University, South Street, Murdoch, WA 6150, Australia
Abstract:The list of documents returned by Internet search engines in response to a query these days can be quite overwhelming. There is an increasing need for organising this information and presenting it in a more compact and efficient manner. This paper describes a method developed for the automatic clustering of World Wide Web documents, according to their relevance to the user’s information needs, by using a hybrid neural network. The objective is to reduce the time and effort the user has to spend to find the information sought after. Clustering documents by features representative of their contents—in this case, key words and phrases—increases the effectiveness and efficiency of the search process. It is shown that a two-dimensional visual presentation of information on retrieved documents, instead of the traditional linear listing, can create a more user-friendly interface between a search engine and the user.
Keywords:Hybrid neural network  PCA  ART  Web document clustering  Information retrieval  Document features extraction
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号