共查询到20条相似文献,搜索用时 140 毫秒
1.
2.
杜勇强 《计算机与数字工程》2006,34(10):46-48
首先介绍了搜索引擎的基本概念,并对各种类别的搜索引擎作了简单的介绍;接着介绍了全文搜索引擎的各方面技术以及排序技术对于全文搜索引擎的重要性,然后分别介绍了当前两种重要的排序算法:PageRank排序和HilITop排序,并根据这两种排序算法提出基于行为统计的排序算法,最后对基于行为统计的排序算法作出评价和总结。 相似文献
3.
在分析了传统搜索引擎以及现有元搜索引擎不足之后,提出了基于Agent的个性化元搜索引擎模型,与传统的元搜索引擎系统相比改进了系统的运行效率,同时给出了基于用户反馈的个性化QR-加权重排序算法,将检索结果的排序质量以及文档的相关度在重排序中加以考虑,与传统的线性重排序算法相比提高了查询的准确程度。 相似文献
4.
基于本体的元搜索引擎的设计与实现 总被引:1,自引:0,他引:1
与荚丈元搜索引擎相比,中文元搜索引擎还存在一定的差距,现有的中文元搜索引擎在实现关键词的扩展方面有待提高.通过对本体及元搜索引擎技术的研究,提出并实现了基于本体的元搜索引擎系统,介绍了系统的工作原理,通过对现有的元搜索引擎排序算法的分析,对摘要排序算法进行了改进.最终对系统进行测试并对其结果进行分析,该系统实现了对关键词的同义词和英丈扩展查询,有效地提高了系统的查全率和查准率. 相似文献
5.
6.
7.
8.
基于本体的元搜索引擎结果排序算法研究 总被引:1,自引:1,他引:1
介绍了本体及元搜索引擎技术,认为将本体概念引入到元搜索引擎系统中,能够实现基于同义词、近义词的查询.给出了一个基于本体的元搜索引擎系统体系结构,介绍了系统的工作原理,提出了一种改进的摘要排序算法,并重点介绍了该算法的实现. 相似文献
9.
10.
为改进元搜索引擎查询速度慢、独立性差的缺点,本文设计了一个元搜索引擎的结果处理模型。该模型结合元搜索引擎的特点设计了一种4级结果集的结构,提高了元搜索引擎结果处理的效率。在结果提取部分提出了根据反馈信息自动调整权重的算法(FBWM),在没有人工干预的情况下自动监视各独立搜索引擎的性能变化并随之动态调整其权重。在结果排序部分,提出了改进的位置/全文排序法(IPFTS),在算法中引入了词条匹配等级的概念,不但能提高搜索结果和查询串相关度的精度,还能保证排名在前的搜索结果的URL的有效性。 相似文献
11.
Rank aggregation mechanisms have been used in solving problems from various domains such as bioinformatics, natural language processing, information retrieval, etc. Metasearch is one such application where a user gives a query to the metasearch engine, and the metasearch engine forwards the query to multiple individual search engines. Results or rankings returned by these individual search engines are combined using rank aggregation algorithms to produce the final result to be displayed to the user. We identify few aspects that should be kept in mind for designing any rank aggregation algorithm for metasearch. For example, generally equal importance is given to the input rankings while performing the aggregation. However, depending on the indexed set of web pages, features considered for ranking, ranking functions used etc. by the individual search engines, the individual rankings may be of different qualities. So, the aggregation algorithm should give more weight to the better rankings while giving less weight to others. Also, since the aggregation is performed when the user is waiting for response, the operations performed in the algorithm need to be light weight. Moreover, getting supervised data for rank aggregation problem is often difficult. In this paper, we present an unsupervised rank aggregation algorithm that is suitable for metasearch and addresses the aspects mentioned above.We also perform detailed experimental evaluation of the proposed algorithm on four different benchmark datasets having ground truth information. Apart from the unsupervised Kendall-Tau distance measure, several supervised evaluation measures are used for performance comparison. Experimental results demonstrate the efficacy of the proposed algorithm over baseline methods in terms of supervised evaluation metrics. Through these experiments we also show that Kendall-Tau distance metric may not be suitable for evaluating rank aggregation algorithms for metasearch. 相似文献
12.
Leonidas AkritidisAuthor VitaeDimitrios KatsarosAuthor Vitae Panayiotis BozanisAuthor Vitae 《Journal of Systems and Software》2011,84(1):130-143
Nowadays, mashup services and especially metasearch engines play an increasingly important role on the Web. Most of users use them directly or indirectly to access and aggregate information from more than one data sources. Similarly to the rest of the search systems, the effectiveness of a metasearch engine is mainly determined by the quality of the results it returns in response to user queries. Since these services do not maintain their own document index, they exploit multiple search engines using a rank aggregation method in order to classify the collected results. However, the rank aggregation methods which have been proposed until now, utilize a very limited set of parameters regarding these results, such as the total number of the exploited resources and the rankings they receive from each individual resource. In this paper we present QuadRank, a new rank aggregation method, which takes into consideration additional information regarding the query terms, the collected results and the data correlated to each of these results (title, textual snippet, URL, individual ranking and others). We have implemented and tested QuadRank in a real-world metasearch engine, QuadSearch, a system developed as a testbed for algorithms related to the wide problem of metasearching. The name QuadSearch is related to the current number of the exploited engines (four). We have exhaustively tested QuadRank for both effectiveness and efficiency in the real-world search environment of QuadSearch and also, using a task from the recent TREC-2009 conference. The results we present in our experiments reveal that in most cases QuadRank outperformed all component engines, another metasearch engine (Dogpile) and two successful rank aggregation methods, Borda Count and the Outranking Approach. 相似文献
13.
Internet上有关元搜索引擎的研究与设计 总被引:1,自引:0,他引:1
概述了Internet上一种新形式——元搜索引擎的概念、构成及分类,对元搜索引擎和一般的搜索引擎进行了比较,联系当前国内外元搜索引擎的现状,分析了其意义,提出了基于Agent的元搜索引擎的设计方案,指出了有自己独特的检索方式和处理方式,突破了传统搜索的局限性,并且该形式尚有许多重要问题亟待解决。 相似文献
14.
15.
16.
集成搜索引擎的文本数据库选择 总被引:8,自引:0,他引:8
用户需要检索的信息往往分散存储在多个搜索多个搜索引擎各自的数据库里,对普通用户而言,访问多个搜索引擎并从返回的结果中分辨出确实有网页是一件费时费力的工作,集成搜索引擎则可以提供给用户一个同时记问多个搜索引擎人集成环境,集成搜索引擎能将其接收到的用户查询提交给底层的多个搜索引擎进行搜索,作为一种搜索工具,集成搜索引擎具有如WEB查询覆盖面比传统引擎更大,引警有更好的可扩展性等优点,讨论了解决集成搜索引擎的数据库选择问题的多种技术,针对用户提交的查询要求,通过数据库选择可以选定最有可能返回有用信息的底层搜索引擎。 相似文献
17.
A neural network-based intelligent metasearch engine 总被引:12,自引:0,他引:12
Determining the relevancy of web pages to a query term is basic to the working of any search engine. In this paper we present a neural network based algorithm to classify the relevancy of search results on a metasearch engine. The fast learning neural network technology used by us enables the metasearch engine to handle a query term in a reasonably short time and return the search results with high accuracy. 相似文献
18.
《Journal of Systems and Software》2004,73(1):159-168
Metasearch engines offer better coverage and are more fault-tolerant and expandable than single search engines. A metasearch engine is required to post queries with and obtain retrieval results from several other Internet search engines. In this paper, we describe the use of the extensible style language (XSL) to support metasearches. We show how XSL can transform a query, expressed in XML, into different forms for different search engines. We show how the retrieval results could be transformed into a standard format so that the metasearch engine can interpret the retrieved data, filtering the irrelevant information (e.g. advertisement). The proposed structure treats the metasearch engine and the individual search engines as separate modules with a clearly defined communication structure through XSL. Thus, the system is more extensible than coding the structure and syntactic transformation processes. It allows other new search engines to be included just through plug-and-play, requiring only that the new transformation of XML for this search engine be included in the XSL. 相似文献
19.
基于弱实时系统的元搜索引擎合成策略 总被引:2,自引:0,他引:2
本文根据弱实时系统理论提出一种新的元搜索引擎结果合成策略,与现有系统相比可以较好把握系统响应时间与准确率的平衡,提高元搜索引擎的搜索速度和性能,并具有进一步深入研究的价值。 相似文献
20.
In this paper, we present a system LESSON for lecture notes searching and sharing, which is dedicated to both instructors and students for effectively supporting their Web-based teaching and learning activities. The LESSON system employs a metasearch engine for lecture notes searching from Web and a peer-to-peer (P2P) overlay network for lecture notes sharing among the users. A metasearch engine provides an unified access to multiple existing component search engines and has better performance than general-purpose search engines. With the help of a P2P overlay network, all computers used by instructors and students can be connected into a virtual society over the Internet and communicate directly with each other for lecture notes sharing, without any centralized server and manipulation. In order to merge results from multiple component search engines into a single ranked list, we design the RSF strategy that takes rank, similarity and features of lecture notes into account. To implement query routing decision for effectively supporting lecture notes sharing, we propose a novel query routing mechanism. Experimental results indicate that the LESSON system has better performance in lecture notes searching from Web than some popular general-purpose search engines and some existing metasearch schemes; while processing queries within the system, it outperforms some typical routing methods. Concretely, it can achieve relatively high query hit rate with low bandwidth consumption in different types of network topologies. 相似文献