期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

In this article, we show the existence of a formal convergence between the matrix models of biological memories and the vector space models designed to extract information from large collections of documents. We first show that, formally, the term-by-document matrix (a mathematical representation of a set of codified documents) can be interpreted as an associative memory. In this framework, the dimensionality reduction of the term-by-document matrices produced by the latent semantic analysis (LSA) has a common factor with the matrix biological memories. This factor consists in the generation of a statistical ‘conceptualisation’ of data using little dispersed weighted averages. Then, we present a class of matrix memory that built up thematic blocks using multiplicative contexts. The thematic memories define modular networks that can be acceded using contexts as passwords. This mathematical structure emphasises the contacts between LSA and matrix memory models and invites to interpret LSA, and similar procedures, as a reverse engineering applied on context-deprived cognitive products, or on biological objects (e.g. genomes) selected during large evolutionary processes. 相似文献

12.

基于中文搜索引擎网络信息用户行为研究* 总被引：1，自引：0，他引：1

王浩姚长利郭琳艾国庆《计算机应用研究》2009,26(12):4665-4668

为了更好地理解中文搜索用户的检索行为,首先建立一个搜索引擎选择平台,主要是用来生成研究中所需的日志文件;然后从中英文用户的搜索行为差异的角度出发,对日志文件进行深入研究,包括各中文搜索引擎使用率比较以及中文用户输入查询行为的一些规律等。研究结果表明,对准确地评测搜索引擎检索的效果以及未来中文搜索引擎设计的改进都有较好的指导意义。相似文献

13.

数字图书馆主题搜索引擎的设计与实现*

林其东陈传波郑乐丹张一曼b 《计算机应用研究》2009,26(8):2952-2955

提出构建数字图书馆主题搜索引擎的总体系统设计。利用一个预处理系统尽量选择高质量的种子站点,从而产生Web主题定义数据;在系统控制器的协调下,各主题爬行器同步地采集爬行器所推荐的Web资源,对下载的资源进行文本分类与主题识别;将已经下载的Web资源按学科分类存储在Web主题资源库中,通过全局信息库建立索引,接入通用接口进行依主题检索。依赖数字图书馆各方面特点,提出支持多线程主题爬行器的设计,并提出一种新颖的URL主题相关性剪切算法EPR,为实现数字图书馆主题搜索引擎原型提供重要的设计。基于开源Lucene平相似文献

14.

多搜索引擎权重计算及搜索结果排序质量评估

李超谢坤武《计算机工程与应用》2014,50(12):21-25

搜索引擎在多成员搜索引擎搜索结果的整合过程中,搜索结果的排序在很大程度上决定着元搜索引擎的服务质量。为了实现搜索结果的有效整合,目前技术主要结合查询请求、文档内容、初始排序或（和）赋予搜索成员搜索引擎权重等因素。其中采用赋予搜索引擎权重时,往往根据用户和技术人员经验,主观地进行赋值,不能体现真实的用户搜索偏好。为此,提出了通过挖掘用户搜索及遍历情况,动态地赋予各成员搜索引擎权重的方法。通过用户遍历及点击下载情况,得到了用户搜索遍历与返回结果的匹配度,论证了该方法的可行性和有效性。相似文献

15.

基于用户反馈的POI搜索引擎优化研究

下载免费PDF全文

潘明远方金云章立生《计算机工程与应用》2010,46(32):112-115

推荐 CAJ下载PDF下载不支持迅雷等加速下载工具,请取消加速工具后下载。随着互联网经济的迅猛发展,PO（IPoint Of Interest）搜索成为空间信息服务业发展的核心技术之一。提高用户满意度无疑是POI搜索引擎的最终目标。通过挖掘用户访问日志,建立反馈相似度模型,可提高搜索结果准确度,优化POI搜索引擎。通过理论分析,该方法在不增加计算时间的基础上提高了搜索结果的准确性。最后将该方法应用于中国科学院计算技术研究所地理信息中心自主研发的通图（www.tongmap.cn）地图搜索引擎中,结合实际数据测试,说明该方法在优化POI搜索引擎方面是行之有效的。相似文献

16.

基于Ajax与向量空间模型的个性化搜索引擎 总被引：1，自引：0，他引：1

下载免费PDF全文

李蕾周国民《计算机工程与应用》2007,43(19):89-91

针对个性化搜索的三个关键问题：用户信息搜集,用户信息库的动态更新与个性化检索算法,探索性地提出了基于Ajax用户行为跟踪方案,以会话为单位动态更新用户行为信息库策略与加入用户文档的向量空间检索模型,在此基础上设计并实现了个性化搜索引擎实验系统。相似文献

17.

A synergistic strategy for combining thesaurus-based and corpus-based approaches in building ontology for multilingual search engines

《Computers in human behavior》2015

In this article we illustrate a methodology for building cross-language search engine. A synergistic approach between thesaurus-based approach and corpus-based approach is proposed. First, a bilingual ontology thesaurus is designed with respect to two languages: English and Spanish, where a simple bilingual listing of terms, phrases, concepts, and subconcepts is built. Second, term vector translation is used – a statistical multilingual text retrieval techniques that maps statistical information about term use between languages (Ontology co-learning). These techniques map sets of t f id f term weights from one language to another. We also applied a query translation method to retrieve multilingual documents with an expansion technique for phrasal translation. Finally, we present our findings. 相似文献

18.

An analysis of web proxy logs with query distribution pattern approach for search engines

Mona TaghaviAuthor Vitae Nikita Schmidt^{Author Vitae} 《Computer Standards & Interfaces》2012,34(1):162-170

This study presents an analysis of users' queries directed at different search engines to investigate trends and suggest better search engine capabilities. The query distribution among search engines that includes spawning of queries, number of terms per query and query lengths is discussed to highlight the principal factors affecting a user's choice of search engines and evaluate the reasons of varying the length of queries. The results could be used to develop long to short term business plans for search engine service providers to determine whether or not to opt for more focused topic specific search offerings to gain better market share. 相似文献

19.

SpidersRUs: Creating specialized search engines in multiple languages

Michael Jialun Yilu Chunju Hsinchun 《Decision Support Systems》2008,45(3):621

While small-scale search engines in specific domains and languages are increasingly used by Web users, most existing search engine development tools do not support the development of search engines in languages other than English, cannot be integrated with other applications, or rely on proprietary software. A tool that supports search engine creation in multiple languages is thus highly desired. To study the research issues involved, we review related literature and suggest the criteria for an ideal search tool. We present the design of a toolkit, called SpidersRUs, developed for multilingual search engine creation. The design and implementation of the tool, consisting of a Spider module, an Indexer module, an Index Structure, a Search module, and a Graphical User Interface module, are discussed in detail. A sample user session and a case study on using the tool to develop a medical search engine in Chinese are also presented. The technical issues involved and the lessons learned in the project are then discussed. This study demonstrates that the proposed architecture is feasible in developing search engines easily in different languages such as Chinese, Spanish, Japanese, and Arabic. 相似文献

20.

Analyzing the emotional outcomes of the online search behavior with search engines

Carlos Flavián-Blanco Raquel Gurrea-Sarasa Carlos Orús-Sanclemente 《Computers in human behavior》2011,27(1):540-551

The affective component has been acknowledged as critical to understand information search behavior and user-computer interactions. There is a lack of studies that analyze the emotions that the user feels when searching for information about products with search engines. The present study analyzes the emotional outcomes of the online search process, taking into account the user’s (a) perceptions of success and effort exerted on the search process, (b) initial affective state, and (c) emotions felt during the search process. In addition, we identify profiles of online searchers based on the emotional outcomes of the search process, which allow us to differentiate the emotional processes and behavioral patterns that lead to such emotions. The results of the study stress the importance of the affective component of the online search behavior, given that these emotional outcomes are likely to influence all the subsequent actions that users perform on the Web. 相似文献