Similar literature
Found 20 similar documents (search time: 109 ms)
1.
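This paper proposes a model for an Internet-based Chinese search engine and discusses its data organization, search strategy, and implementation algorithms.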

2.
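Research and Implementation of a Chinese Search Engine Based on Nutch   Total citations: 1, self-citations: 0, citations by others: 1
The paper focuses on search engine principles and the implementation architecture of a Nutch-based search engine, and studies and analyzes the web page crawling process in depth. Finally, it gives a solution for a Nutch-based Chinese search engine.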

3.
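Chinese Search Engines: Current State and Prospects   Total citations: 19, self-citations: 0, citations by others: 19
This paper describes the current state of development of Chinese search engines, analyzes their problems and the gap between them and advanced foreign search engines, and proposes directions for their future development.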

4.
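Design and Implementation of a Character-Table-Based Word Segmentation System for Chinese Search Engines   Total citations: 9, self-citations: 0, citations by others: 9
Ding Cheng (丁承), Shao Zhiqing (邵志清). Computer Engineering (《计算机工程》), 2001, 27(2): 191-192, F003
The paper analyzes the shortcomings of common dictionary-based Chinese word segmentation methods when used in Chinese search engine development, proposes a character-table-based segmentation system for Chinese search engines, and describes its design and implementation with respect to indexing, querying, and disambiguation.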

5.
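Zhang Min (张敏). Fujian Computer (《福建电脑》), 2010, 26(6): 102-102, 122
By analyzing the working principles and main technologies of vertical search engines, this paper presents a scheme for implementing a Chinese vertical search engine on top of the open-source Nutch.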

6.
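Zhang Yun (张贇), Li Zheng (李政). Fujian Computer (《福建电脑》), 2006, (11): 27-27, 31
This paper briefly introduces the system architecture and implementation principles of Chinese search engine technology, as well as the indexing and search processes of a search engine.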

7.
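An Automatic Chinese Text Classification System Based on the Vector Space Model   Total citations: 33, self-citations: 2, citations by others: 33
The paper introduces an automatic Chinese text classification system based on the vector space model, focusing on how feature extraction, dimensionality reduction, hierarchical classification, and classifier training are implemented. In practice, the system achieves high average recall and average precision on text classification.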
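The vector space model named in this abstract can be illustrated with a small sketch. This is not the system described in the paper: it assumes documents are already segmented into tokens, uses raw term frequencies with no feature selection or dimensionality reduction, and classifies by cosine similarity to a per-class centroid. All function names and the toy data are hypothetical.

```python
import math
from collections import Counter

def tf_vector(tokens):
    """Map a tokenized document to a term-frequency vector (term -> count)."""
    return Counter(tokens)

def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * b.get(t, 0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def train_centroids(labeled_docs):
    """Sum the term counts of each class into one centroid vector per label."""
    centroids = {}
    for label, tokens in labeled_docs:
        centroids.setdefault(label, Counter()).update(tokens)
    return centroids

def classify(tokens, centroids):
    """Return the label whose centroid is most similar to the document."""
    vec = tf_vector(tokens)
    return max(centroids, key=lambda label: cosine(vec, centroids[label]))

# Toy training set (hypothetical, pre-segmented tokens).
training = [
    ("sports", ["football", "match", "goal", "team"]),
    ("sports", ["team", "coach", "match"]),
    ("finance", ["stock", "market", "bank", "rate"]),
    ("finance", ["bank", "loan", "rate"]),
]
centroids = train_centroids(training)
label = classify(["team", "goal", "coach"], centroids)
```

A real Chinese text classifier would add a word segmentation step before `tf_vector` and typically replace raw counts with weighted, reduced features.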

8.
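To improve the precision of search engines and help users quickly locate the web pages that interest them, automatic classification of Chinese web pages can be applied to build a fast and accurate search engine system with high precision.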

9.
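This paper studies and discusses the characteristics of web search engines and the methods and techniques for using them. It compares Chinese search engines on the Internet in order to explore the similarities and differences in their retrieval methods and the strengths and weaknesses of their retrieval technologies, with the aim of helping users.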

10.
11.
In this article, we show the existence of a formal convergence between the matrix models of biological memories and the vector space models designed to extract information from large collections of documents. We first show that, formally, the term-by-document matrix (a mathematical representation of a set of codified documents) can be interpreted as an associative memory. In this framework, the dimensionality reduction of the term-by-document matrices produced by the latent semantic analysis (LSA) has a common factor with the matrix biological memories. This factor consists in the generation of a statistical ‘conceptualisation’ of data using little dispersed weighted averages. Then, we present a class of matrix memory that built up thematic blocks using multiplicative contexts. The thematic memories define modular networks that can be acceded using contexts as passwords. This mathematical structure emphasises the contacts between LSA and matrix memory models and invites to interpret LSA, and similar procedures, as a reverse engineering applied on context-deprived cognitive products, or on biological objects (e.g. genomes) selected during large evolutionary processes.

12.
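A Study of Online Information User Behavior Based on Chinese Search Engines*   Total citations: 1, self-citations: 0, citations by others: 1
To better understand the retrieval behavior of Chinese search users, a search engine selection platform is first built, mainly to generate the log files needed for the study. The log files are then studied in depth from the perspective of differences between the search behavior of Chinese and English users, including a comparison of the usage rates of the various Chinese search engines and some regularities in how Chinese users enter queries. The results offer useful guidance both for accurately evaluating search engine retrieval effectiveness and for improving the design of future Chinese search engines.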

13.
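The paper proposes an overall system design for building a topic-specific search engine for digital libraries. A preprocessing system is used to select high-quality seed sites wherever possible, producing Web topic definition data. Coordinated by a system controller, the topic crawlers synchronously collect the Web resources recommended by the crawlers and perform text classification and topic identification on the downloaded resources. Downloaded Web resources are stored by discipline in a Web topic resource repository, indexed through a global information base, and exposed through a common interface for topic-based retrieval. Drawing on the characteristics of digital libraries, the paper proposes a design supporting multi-threaded topic crawlers and a novel URL topic-relevance pruning algorithm, EPR, providing an important design for implementing a prototype digital library topic search engine. Based on the open-source Lucene platform…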

14.
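When a metasearch engine merges the results of its multiple member search engines, the ranking of the results largely determines its quality of service. To merge results effectively, current techniques mainly combine factors such as the query, document content, initial ranking, and/or weights assigned to the member search engines. When engine weights are used, they are usually assigned subjectively, based on the experience of users and technical staff, and so cannot reflect users' real search preferences. The paper therefore proposes dynamically assigning weights to the member search engines by mining users' search and browsing behavior. From users' browsing and click/download behavior, the degree of match between user search browsing and the returned results is obtained, and the feasibility and effectiveness of the method are demonstrated.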
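The idea of weighting member engines from observed user behavior can be sketched as follows. The abstract does not give the paper's formulas, so everything here is an assumption made for illustration: weights are taken as each engine's share of clicks, and results are merged with a weighted reciprocal-rank score. The function names, engine names, and data are hypothetical.

```python
from collections import defaultdict

def weights_from_clicks(click_counts):
    """Turn per-engine click counts into weights that sum to 1.

    Assumption: an engine's weight is its share of all observed clicks.
    With no clicks at all, every engine gets the same weight.
    """
    total = sum(click_counts.values())
    if total == 0:
        n = len(click_counts)
        return {engine: 1.0 / n for engine in click_counts}
    return {engine: count / total for engine, count in click_counts.items()}

def merge_results(ranked_lists, engine_weights):
    """Merge ranked URL lists using a weighted reciprocal-rank score."""
    scores = defaultdict(float)
    for engine, urls in ranked_lists.items():
        weight = engine_weights.get(engine, 0.0)
        for rank, url in enumerate(urls, start=1):
            scores[url] += weight / rank
    # Highest score first; ties are broken by URL for a stable order.
    return sorted(scores, key=lambda url: (-scores[url], url))

# Hypothetical click log summary and result lists.
clicks = {"engine_a": 30, "engine_b": 10}
weights = weights_from_clicks(clicks)
ranked = {
    "engine_a": ["u1", "u2"],
    "engine_b": ["u2", "u3"],
}
merged = merge_results(ranked, weights)
```

Recomputing `weights` as new clicks arrive is what makes the weighting dynamic rather than hand-assigned.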

15.
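With the rapid development of the Internet economy, POI (Point of Interest) search has become one of the core technologies for the spatial information services industry. Improving user satisfaction is undoubtedly the ultimate goal of a POI search engine. By mining user access logs and building a feedback similarity model, the accuracy of search results can be improved and the POI search engine optimized. Theoretical analysis shows that the method improves search result accuracy without increasing computation time. Finally, the method is applied to Tongtu (www.tongmap.cn), a map search engine developed in-house by the Geographic Information Center of the Institute of Computing Technology, Chinese Academy of Sciences; tests on real data show that the method is effective for optimizing POI search engines.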

16.
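A Personalized Search Engine Based on Ajax and the Vector Space Model   Total citations: 1, self-citations: 0, citations by others: 1
Addressing three key problems of personalized search (collecting user information, dynamically updating the user information base, and the personalized retrieval algorithm), the paper tentatively proposes an Ajax-based scheme for tracking user behavior, a strategy of dynamically updating the user behavior information base per session, and a vector space retrieval model that incorporates a user document. On this basis, an experimental personalized search engine system is designed and implemented.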

17.
In this article we illustrate a methodology for building a cross-language search engine. A synergistic approach between thesaurus-based approach and corpus-based approach is proposed. First, a bilingual ontology thesaurus is designed with respect to two languages: English and Spanish, where a simple bilingual listing of terms, phrases, concepts, and subconcepts is built. Second, term vector translation is used: a statistical multilingual text retrieval technique that maps statistical information about term use between languages (Ontology co-learning). These techniques map sets of tf-idf term weights from one language to another. We also applied a query translation method to retrieve multilingual documents with an expansion technique for phrasal translation. Finally, we present our findings.

18.
This study presents an analysis of users' queries directed at different search engines to investigate trends and suggest better search engine capabilities. The query distribution among search engines that includes spawning of queries, number of terms per query and query lengths is discussed to highlight the principal factors affecting a user's choice of search engines and evaluate the reasons of varying the length of queries. The results could be used to develop long to short term business plans for search engine service providers to determine whether or not to opt for more focused topic specific search offerings to gain better market share.

19.
While small-scale search engines in specific domains and languages are increasingly used by Web users, most existing search engine development tools do not support the development of search engines in languages other than English, cannot be integrated with other applications, or rely on proprietary software. A tool that supports search engine creation in multiple languages is thus highly desired. To study the research issues involved, we review related literature and suggest the criteria for an ideal search tool. We present the design of a toolkit, called SpidersRUs, developed for multilingual search engine creation. The design and implementation of the tool, consisting of a Spider module, an Indexer module, an Index Structure, a Search module, and a Graphical User Interface module, are discussed in detail. A sample user session and a case study on using the tool to develop a medical search engine in Chinese are also presented. The technical issues involved and the lessons learned in the project are then discussed. This study demonstrates that the proposed architecture is feasible in developing search engines easily in different languages such as Chinese, Spanish, Japanese, and Arabic.

20.
The affective component has been acknowledged as critical to understand information search behavior and user-computer interactions. There is a lack of studies that analyze the emotions that the user feels when searching for information about products with search engines. The present study analyzes the emotional outcomes of the online search process, taking into account the user’s (a) perceptions of success and effort exerted on the search process, (b) initial affective state, and (c) emotions felt during the search process. In addition, we identify profiles of online searchers based on the emotional outcomes of the search process, which allow us to differentiate the emotional processes and behavioral patterns that lead to such emotions. The results of the study stress the importance of the affective component of the online search behavior, given that these emotional outcomes are likely to influence all the subsequent actions that users perform on the Web.

