Web document clustering using a hybrid neural network

Authors:	M Shamim Khan, Sebastian W Khor

Affiliation:	School of Information Technology, Murdoch University, South Street, Murdoch, WA 6150, Australia

Abstract:	The list of documents returned by Internet search engines in response to a query can be overwhelming. There is an increasing need to organise this information and present it in a more compact and efficient manner. This paper describes a method for automatically clustering World Wide Web documents according to their relevance to the user’s information needs, using a hybrid neural network. The objective is to reduce the time and effort the user must spend to find the information sought. Clustering documents by features representative of their contents (in this case, key words and phrases) increases the effectiveness and efficiency of the search process. It is shown that a two-dimensional visual presentation of information on retrieved documents, instead of the traditional linear listing, can create a more user-friendly interface between a search engine and the user.

Keywords:	Hybrid neural network; PCA; ART; Web document clustering; Information retrieval; Document features extraction
This article is indexed in ScienceDirect and other databases.