Similar articles
20 similar articles found.
1.
Tirri  H. 《Computer》2003,36(1):115-116
Search - originally a simple keyword-lookup functionality for files - has become a fundamental Internet service. Yet its failings are well known. Easy Web access to billions of pages of information has a downside. The pages are titled mostly according to their authors' whims and use subtly different terminology that can fool a simple keyword search - sometimes intentionally, sometimes not. In addition, even the best general purpose search engines do not reach the "invisible Web" of back-end databases. Subject-specific search sites can help this situation but take time to maintain, rarely include a sophisticated interface, and seldom provide good coverage even in their topic area. Citeseer (http://citeseer.nj.nec.com/cs), a reference source for computer science research papers, is one successful exception. It is argued that the next generation of Internet search facilities must support more complex vehicles for interaction than keywords.

2.
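Search engines are the backbone of Internet information services, and search engine design is an important part of building any website. This paper introduces the categories of search engines and how each category works. On that basis, it points out that a spider program consists of two parts, web page downloading and web page content analysis with information extraction, and gives source code examples for both parts using C++ Builder as the development tool. Finally, it discusses issues to watch for when designing a spider program.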
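
The two-part spider structure described in this abstract can be illustrated with a minimal sketch. It is written in Python with the standard library rather than the C++ Builder used in the paper, and the names `download`, `analyze`, and `PageExtractor` are hypothetical, chosen only for illustration; the paper's own code is not reproduced here.

```python
from html.parser import HTMLParser
from urllib.request import urlopen


class PageExtractor(HTMLParser):
    """Collects the page title and outgoing links from an HTML document."""

    def __init__(self):
        super().__init__()
        self.title = ""
        self.links = []
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        if tag == "title":
            self._in_title = True
        elif tag == "a":
            # Only anchors that carry an href are treated as links.
            href = dict(attrs).get("href")
            if href:
                self.links.append(href)

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title:
            self.title += data


def download(url, timeout=10):
    """Part 1: fetch a page and return its text (assumes UTF-8 content)."""
    with urlopen(url, timeout=timeout) as response:
        return response.read().decode("utf-8", errors="replace")


def analyze(html):
    """Part 2: parse the page and extract its title and links."""
    parser = PageExtractor()
    parser.feed(html)
    return parser.title.strip(), parser.links
```

Keeping the two parts separate means `analyze` can be exercised on stored HTML without network access, and a real spider would feed the extracted links back into a queue of pages for `download`.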

3.
王立杰  李萌  蔡斯博  李戈  谢冰  杨芙清 《软件学报》2012,23(6):1335-1349
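As Web service technology matures, a large number of public Web services have appeared on the Internet. When developing software systems with Web services, textual descriptions (such as introductions and usage instructions) help service consumers identify, understand, and use Web services directly and effectively. Most existing work focuses on obtaining such information from a Web service's WSDL file for service discovery or retrieval; however, a survey found that the WSDL files of most Web services on the Internet generally lack such information or contain none at all. To address this, the paper proposes an approach, based on Web information search, that enriches the textual descriptions of Web services from information sources other than WSDL files. The approach collects from the Internet the web pages that contain identifying features of the target Web service, extracts information fragments from those pages, uses information retrieval techniques to compute the relevance between each fragment and the target Web service, and selects the most relevant text fragments to enrich the service's textual description. Experiments on real data from the Internet show that relevant web pages can be obtained for about 51% of the Web services on the Internet, and that textual descriptions can be enriched for about 88% of those services. The collected Web services and their textual descriptions have been publicly released.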
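
The relevance computation between information fragments and a target Web service can be sketched as follows. The abstract does not say which information retrieval technique the authors use, so TF-IDF weighting with cosine similarity is an assumption here, and `rank_snippets` and the other function names are hypothetical.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase whitespace tokenization (a deliberately simple assumption)."""
    return text.lower().split()


def tfidf_vectors(docs):
    """Build one sparse TF-IDF vector (term -> weight) per document."""
    tokenized = [tokenize(d) for d in docs]
    n = len(tokenized)
    df = Counter(term for tokens in tokenized for term in set(tokens))
    # Smoothed IDF keeps weights positive even for terms found in every document.
    idf = {term: math.log((1 + n) / (1 + count)) + 1 for term, count in df.items()}
    return [
        {term: tf * idf[term] for term, tf in Counter(tokens).items()}
        for tokens in tokenized
    ]


def cosine(a, b):
    """Cosine similarity between two sparse vectors."""
    dot = sum(weight * b.get(term, 0.0) for term, weight in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def rank_snippets(service_profile, snippets):
    """Score each snippet against the service profile, most relevant first."""
    vectors = tfidf_vectors([service_profile] + snippets)
    target = vectors[0]
    scored = [(cosine(target, vec), text) for vec, text in zip(vectors[1:], snippets)]
    return sorted(scored, key=lambda pair: pair[0], reverse=True)
```

In this sketch the service profile stands in for the identifying features of the target service (for example its name and operation names), and the top-scoring snippets would be the candidates for the enriched description.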

4.
SUMMARY

As libraries expand their services into the world of federated searching, librarians need to work with users to discover what their expectations are and how the library can customize the software to meet users' expectations. This article describes the user testing performed at Texas A&M University during 2005 as the libraries implemented a new federated search service called Search Now (ExLibris' MetaLib). Over fifty volunteers, including undergraduates, graduate students, faculty, and library faculty and staff, helped to test the new system and offered suggestions for improvements. Problems were noted and, where possible, modifications were made to improve results. These modifications were then tested again. Major issues noted during the usability testing included: user expectations of search performance; information included in and the layout of the search results; availability of advanced search options; and lack of ability to limit by format, scholarly nature of journal, date and full-text availability. Suggestions for further development are also presented.

5.
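The Internet is developing rapidly, and search engines are becoming dominant among network applications. AJAX is representative of next-generation Web technology. Because pages built with AJAX have many new properties, traditional search engines cannot find them. This paper analyzes the concept of an AJAX search engine, starting from the principles of search engines and of AJAX, compares it with traditional search engines that adopt AJAX, and studies the characteristics of AJAX search engines.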

6.
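Research on AJAX search engines   Total citations: 1, self-citations: 0, citations by others: 1
The Internet is developing rapidly, and search engines are becoming dominant among network applications. AJAX is representative of next-generation Web technology. Because pages built with AJAX have many new properties, traditional search engines cannot find them. This paper analyzes the concept of an AJAX search engine, starting from the principles of search engines and of AJAX, compares it with traditional search engines that adopt AJAX, and studies the characteristics of AJAX search engines.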

7.
Most users assume that their use of Internet services is implicitly private and anonymous, so it can be quite eye-opening to find out how much about ourselves and our companies we reveal by seemingly innocuous words we use to search, the maps we view, and the other "free" services we use on the Internet. The Internet has become one of the most central aspects of our world, and we react to both the mundane and important events in our personal and professional lives by turning to it. Unfortunately, these events, great or small, continue to exist for an indeterminately long time period on the service providers' servers. Providers of free Web-based applications aren't simply offering their tools as a public service. However altruistic they might be in some regards, these companies have legal obligations to their shareholders to make profits. Although various business models exist for advertising in connection with "free" services, the consistent bottom line is that Web-based companies depend on being able to convince advertisers that it's worth their money to have their ads presented on Web pages and emails. Free Web-based services aren't really free: users pay for them with micropayments of information that add up to a significant sum.

8.
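With the spread of the Internet and the rapid growth in the number of web pages, search engines have become the tool of choice for obtaining information online. However, when responding to a user's query, today's mainstream search engines usually present results as a long one-dimensional list split across pages, and users must patiently browse the list to find what they need. To improve the efficiency and quality of information access and reduce users' effort, researchers have proposed re-mining and reorganizing search results, and clustering is one of the active research topics in this area. After analyzing the problems of existing search result clustering algorithms, this paper proposes a label-driven clustering algorithm based on query relevance analysis. The algorithm analyzes how strongly phrases are associated with the query terms and extracts phrases as candidate cluster labels, then uses these labels to determine the candidate clusters to which each page snippet belongs, and finally filters and merges clusters based on an evaluation of the candidate clusters and labels, yielding the clustering result and a label for each cluster. Comparative experiments under the same conditions show that the proposed algorithm outperforms related work and requires fewer information resources.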

9.
《IT Professional》2001,3(3):60-62
Advances in Internet search engine technology may not help you blast Klingons into outer space, but they should help you find them more quickly on the Web. The whole arena for Internet searching has become rather interesting. Search engines appear poised to make some serious breakthroughs in relevancy ranking and personalization that promise to increase the accuracy and reliability of search. On the other hand, data suggests that users are becoming increasingly disenchanted with search engines that don't actually search the Web, but rather search records of the Web sites their robots have visited. Some online merchants (Victoria's Secret, for example) don't even enable keyword searches on their sites. The Web's increasingly dynamic nature complicates searching. New pages created on the fly using personalization information, and even static content, with dynamically inserted sidebars, navigation bars, advertising and commentary, can present a rapidly changing picture for any robot to discover. And as indexes grow larger, search system performance becomes a significant problem.

10.
Hawking  D. 《Computer》2006,39(6):86-88
In this article, we go behind the scenes and explain how this data processing "miracle" is possible. We focus on whole-of-Web search but note that enterprise search tools and portal search interfaces use many of the same data structures and algorithms. Search engines cannot and should not index every page on the Web. After all, thanks to dynamic Web page generators such as automatic calendars, the number of pages is infinite. To provide a useful and cost-effective service, search engines must reject as much low-value automated content as possible. In addition, they can ignore huge volumes of Web-accessible data, such as ocean temperatures and astrophysical observations, without harm to search effectiveness. Finally, Web search engines have no access to restricted content, such as pages on corporate intranets. What follows is not an inside view of any particular commercial engine - whose precise details are jealously guarded secrets - but a characterization of the problems that whole-of-Web search services face and an explanation of the techniques available to solve these problems.

11.
An interactive agent-based system for concept-based web search   Total citations: 1, self-citations: 0, citations by others: 1
Search engines are useful tools for finding information on the Internet. However, because it is difficult to specify appropriate queries and because of the problems with the keyword-based similarity ranking that search engines currently use, general users are still not satisfied with the results retrieved. To remedy these difficulties, in this paper we present a multi-agent framework in which an interactive approach is proposed to iteratively collect a user's feedback from the pages he has identified. By analyzing the pages gathered, the system can then gradually formulate queries to efficiently describe the content a user is looking for. In our framework, evolution strategies are employed to evolve critical feature words for concept modeling in query formulation. The experimental results show that the framework developed is efficient and useful for enhancing the quality of web search, and that concept-based semantic search can thus be achieved.

12.
Nowadays, searches for webpages of a person with a given name constitute a notable fraction of queries to web search engines. Such a query would normally return webpages related to several namesakes who happen to have the queried name, leaving the burden of disambiguating and collecting pages relevant to a particular person (from among the namesakes) on the user. In this article we develop a Web People Search approach that clusters webpages based on their association with different people. Our method exploits a variety of semantic information extracted from Web pages, such as named entities and hyperlinks, to disambiguate among namesakes referred to on the Web pages. We demonstrate the effectiveness of our approach by testing the efficacy of the disambiguation algorithms and their impact on person search.

13.
Context: Software has become an innovative solution nowadays for many applications and methods in science and engineering. Ensuring the quality and correctness of software is challenging because each program has different configurations and input domains. To ensure the quality of software, all possible configurations and input combinations need to be evaluated against their expected outputs. However, this exhaustive test is impractical because of time and resource constraints due to the large domain of input and configurations. Thus, different sampling techniques have been used to sample these input domains and configurations. Objective: Combinatorial testing can be used to effectively detect faults in software-under-test. This technique uses combinatorial optimization concepts to systematically minimize the number of test cases by considering the combinations of inputs. This paper proposes a new strategy to generate combinatorial test suite by using Cuckoo Search concepts. Method: Cuckoo Search is used in the design and implementation of a strategy to construct optimized combinatorial sets. The strategy consists of different algorithms for construction. These algorithms are combined to serve the Cuckoo Search. Results: The efficiency and performance of the new technique were proven through different experiment sets. The effectiveness of the strategy is assessed by applying the generated test suites on a real-world case study for the purpose of functional testing. Conclusion: Results show that the generated test suites can detect faults effectively. In addition, the strategy also opens a new direction for the application of Cuckoo Search in the context of software engineering.
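
The idea behind combinatorial testing, covering every combination of values for each pair of inputs with far fewer tests than exhaustive enumeration, can be made concrete with a small coverage check. This sketch only measures 2-way (pairwise) coverage of a given suite; it does not implement the paper's Cuckoo Search strategy, and the function names are hypothetical.

```python
from itertools import combinations, product


def required_pairs(domains):
    """All (param_i, value_i, param_j, value_j) interactions needed for 2-way coverage."""
    pairs = set()
    for i, j in combinations(range(len(domains)), 2):
        for vi, vj in product(domains[i], domains[j]):
            pairs.add((i, vi, j, vj))
    return pairs


def covered_pairs(test):
    """The 2-way interactions exercised by a single test case."""
    return {(i, test[i], j, test[j]) for i, j in combinations(range(len(test)), 2)}


def uncovered_pairs(domains, suite):
    """Interactions that no test case in the suite exercises."""
    remaining = required_pairs(domains)
    for test in suite:
        remaining -= covered_pairs(test)
    return remaining


# Three boolean parameters: exhaustive testing needs 8 cases,
# but these 4 cases already cover all 12 pairwise interactions.
domains = [[0, 1], [0, 1], [0, 1]]
suite = [(0, 0, 0), (0, 1, 1), (1, 0, 1), (1, 1, 0)]
```

A search-based generator such as the one the abstract describes would use a measure like the size of `uncovered_pairs` as the quantity to drive down while keeping the suite small.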

14.
田莉霞 《软件》2020,(4):67-71
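With the arrival of the information society, all kinds of Internet technologies have emerged, and digital information has become a valuable resource that businesses compete for. Helping users accurately filter useful information out of this mass of digital information is a major challenge for today's search engines. Traditional Internet search is based only on text links: it simply returns the web pages that contain the search terms and leaves users to find the answer in those pages. This retrieval method is time-consuming and laborious, and it cannot accurately give users the answers they want. Google therefore took the lead in proposing a search engine built on the Knowledge Graph, a major change in the search engine field. A knowledge graph represents the concepts and entities of the real world and the relationships among them in the form of a graph, and it is now widely used in intelligent services such as semantic search, intelligent question answering, and decision support. This paper explains in detail what a knowledge graph is, how knowledge graphs are represented and constructed, and what their main applications are, in the hope that more readers will come to understand knowledge graphs and their great contribution to the development of artificial intelligence.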

15.
Design and implementation of an intelligent meta-search engine   Total citations: 13, self-citations: 0, citations by others: 13
刘丽  孙燕唐 《计算机工程》2003,29(6):118-120,133
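This paper studies existing meta-search engine technology and proposes an intelligent meta-search engine model that uses data mining to dynamically generate the meta-search engine's scheduling strategy from records of how the independent search engines have performed. After comparing data mining methods, it selects decision tree induction for classification to generate the invocation strategy, and describes in detail the processing of the scheduling strategy, the establishment of system evaluation metrics, and a concrete implementation of the data mining using Microsoft's then recently released OLE DB for DM general-purpose data mining interface.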

16.
陈浩  陶传奇  文万志 《计算机科学》2017,44(11):125-133
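With the rapid development of big data, big data applications keep emerging. Typical applications such as online retail platforms, face recognition systems, intelligent decision systems, self-service customer support, and medical guidance systems make people's lives increasingly convenient. Search systems are among the most commonly used big data applications. However, search systems on different platforms emphasize different functions, their standards are still incomplete, and search quality is uneven and cannot be guaranteed. Compared with ordinary text search engines, the search engines of online shopping platforms add distinctive features such as category search and filtering, which makes evaluating and assuring their quality more complex. By studying the search function of online retail platforms, this paper proposes quality factors for evaluating that function, defines a set of evaluation metrics for those factors together with corresponding algorithms, and demonstrates the effectiveness of the metrics through experiments.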

17.
A web service may evolve autonomously, making peer web services in the same service composition uncertain as to whether the evolved behaviors are compatible with its original collaborative agreement. Although peer services may wish to conduct regression testing to verify the agreed collaboration, the source code of the former service may be inaccessible to them. Owing to the black-box nature of peer services, traditional code-based approaches to regression testing are inapplicable. In addition, traditional techniques assume that a regression test suite for verifying a web service is available. The location to store a regression test suite is also a problem. On the other hand, we note that the rich interface specifications of a web service provide peer services with a means to formulate black-box testing strategies. In this paper, we provide a strategy for black-box service-oriented testing. We also formulate new test case prioritization strategies using tags embedded in XML messages to reorder regression test cases, and reveal how the test cases use the interface specifications of web services. We experimentally evaluate the effectiveness of these black-box strategies in revealing regression faults in modified WS-BPEL programs. The results show that the new techniques can have a high chance of outperforming random ordering. Moreover, our experiment shows that prioritizing test cases based on WSDL tag coverage can achieve a smaller variance than that based on the number of tags in XML messages in regression test cases, even though their overall fault detection rates are similar.
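
Prioritizing regression test cases by tag coverage can be sketched with a greedy ordering. The abstract does not spell out the exact ordering rule, so the "additional coverage" heuristic below (always pick the test that adds the most not-yet-covered tags) is an assumption, as are the function name and the tie-breaking by test name.

```python
def prioritize_by_tag_coverage(tag_sets):
    """Order test cases so that each next test adds the most uncovered tags.

    tag_sets maps a test case name to the set of WSDL tags that appear
    in the XML messages of that test case.
    """
    remaining = dict(tag_sets)
    covered = set()
    order = []
    while remaining:
        # Largest gain in new tags first; ties are broken by test name
        # so that the ordering is deterministic.
        best = min(remaining, key=lambda name: (-len(remaining[name] - covered), name))
        order.append(best)
        covered |= remaining.pop(best)
    return order


tag_sets = {
    "t1": {"order", "item"},
    "t2": {"order", "item", "price", "currency"},
    "t3": {"customer"},
    "t4": {"item", "price"},
}
order = prioritize_by_tag_coverage(tag_sets)
```

Because only the tags visible in messages and interface specifications are used, an ordering of this kind needs no access to the source code of the peer service, which matches the black-box setting the abstract describes.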

18.
A survey of Web service composition testing   Total citations: 1, self-citations: 0, citations by others: 1
丁志军  周泽霞 《软件学报》2018,29(2):299-319
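As service-oriented and cloud computing technologies mature, and especially as service-oriented architecture (SOA) is refined and adopted, Web services, its main building block, have come into wide use. To make full use of Web services and overcome the limited functionality of any single service, industry composes multiple atomic Web services according to rules and business logic to provide more numerous and more powerful services, adding value to Web services and enabling their reuse. Ensuring the quality of a Web service composition requires comprehensive and thorough testing. However, because Web service compositions are dynamic and distributed, their testing techniques and methods differ greatly from those of traditional software testing and pose many challenges. This paper systematically summarizes and analyzes recent research on Web service composition testing, covering test case generation techniques, regression testing techniques, and test execution and measurement methods. It also analyzes open problems and future directions in Web service composition testing.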

19.
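Research and implementation of search technology based on NuSOAP and the Google Search API   Total citations: 1, self-citations: 0, citations by others: 1
Web Services have become the mainstream technology for building distributed systems. Building on Web Services, the Google search engine offers developers the Google Search API, which lets applications obtain search services by calling Google's Web service. This paper describes methods and techniques for building a search system in a PHP development environment by using the NuSOAP component to access the Google Search API.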

20.
Next generation heterogeneous wireless networks are expected to interwork with Internet Protocol (IP)-based infrastructures. Conventional network services operate like silos in that a specific set of services are offered over a specific type of access network. It is desirable for users to be able to roam between fixed and mobile networks that employ different access technologies. Therefore, mobility management with quality of service (QoS) support is of particular importance and one of the driving forces of convergence. Since service providers often provide more than one service to their subscribers, it is important to facilitate convergence of network charging architecture through a common charging framework. One of the main issues of IP-based convergence is security and privacy. This requires coordination of different security policies in diverse networks that have different security levels and capabilities. The business case for migration to an IP-based platform motivates operators to deliver more powerful services for customers as well as a better user experience. This paper provides an overview of converged mobile Internet architectures and their implications on QoS, charging/billing and security, as well as emerging business models for telecommunication services.
