Found 18 similar documents; search took 125 ms
1.
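Research on a Distributed Model for Unstructured Product Information Total citations: 9, self-citations: 0, citations by others: 9
Advanced manufacturing technologies, represented by CIMS, require the support of a suitable product information model. Product information takes both structured and unstructured forms. Unstructured product models are often built on heterogeneous environments and cannot be abstracted into a single database schema. At present there is no suitable representation model, or corresponding operation model, for unstructured product information. This paper proposes a distributed model for unstructured product information in an enterprise-level heterogeneous environment. It represents unstructured product information with the SGML/XML standards, and its operation model can be used in an ordinary Web/Internet environment to conveniently access infor...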
2.
3.
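A Constraint-Based Method for Extracting Semistructured Information Total citations: 1, self-citations: 0, citations by others: 1
To integrate and query irregular, dynamic information on the Web in the manner of a database, this paper uses the Object Exchange Model (OEM) to build an information model of the Web. To represent each part of a page as a corresponding OEM object, the paper (1) designs an extraction algorithm for semistructured information; (2) defines a data extraction format that satisfies the constraints, and designs a candidate algorithm that outputs the correct extraction format; and (3) presents test results. The method can extract both structured and semistructured information and is more general than existing extraction methods.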
4.
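This paper analyzes, from the perspective of their data models, the strengths and weaknesses of WebSQL and WebOQL, query languages based on Web document content and Web document structure. It also describes the support that Web query languages provide for extracting and integrating Web information, restructuring Web sites, and handling semistructured data.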
5.
6.
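A VRML-Based Method for Generating Realistic 3D Terrain Total citations: 4, self-citations: 1, citations by others: 3
This paper first processes DEM data to obtain an LOD model; it then combines VRML and Java to build, on a Client/Server architecture, VRML files with LOD capability; finally, it implements interactive browsing of 3D terrain scenes on the Web.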
7.
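《中国图象图形学报》 (Journal of Image and Graphics), 1997, (7)
Intranet/Internet intelligent media data streaming software: WebFORCE MediaBase. SGI's WebFORCE MediaBase defines a new type of scalable, reliable, high-performance server for Web sites and applications. WebFORCE MediaBase targets enterprise...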
8.
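To meet the requirements of information integration in a CIMS environment, this paper designs I-VIEW, an object-oriented view model with integration capabilities, for an information integration platform. I-VIEW extends the OO model by defining the concepts of virtual attributes and virtual objects. It introduces import and hide mechanisms and a class derivation mechanism, which allow the state and behavior of objects to be refined, and it handles a variety of integration problems well, such as schema mapping, semantic conflicts, and schema merging and restructuring.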
9.
10.
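Research on Key Technologies for Web/CORBA-Based Network Management Total citations: 5, self-citations: 0, citations by others: 5
Building on an analysis of the architecture of traditional Web-based network management systems and of CORBA technology, this paper proposes applying CORBA to Web-based network management systems. It argues for the feasibility of this approach and discusses key technical issues, including building the network management information model with distributed object technology, applying Web object computing in WBM systems, and using the CORBA event service in the event-handling module of network management.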
11.
Yakov Kogan David Michaeli Yehoshua Sagiv Oded Shmueli 《Data & Knowledge Engineering》1998,28(3):655-275
Current query languages for the Web (e.g., W3QL, WebLog and WebSQL) explore the structure of the Web. However, usually, the structure of the Web has little to do with the semantics of the data. Therefore, it is practically difficult to pose database queries over the Web. We introduce a new type of tags for denoting the semantics of data stored in HTML pages. These semantic tags (implemented as HTML comments) superimpose on HTML pages semistructured objects in the style of the OEM model. The paper discusses two implemented tools for fully utilizing the semantics. The first is a visualization tool for displaying both the HTML reading of Web pages and the OEM reading of Web pages. The second tool is a query language, similar to LOREL, that can query the HTML structure and/or the OEM reading. The above formalism and tools provide data-modeling capabilities for the Web that fit its heterogeneous nature. Real database queries, taking the OEM point of view, can be formulated, including queries about the schema as well as queries about the HTML structure of Web pages. Therefore, the query language is not restricted to portions of the Web in which semantic tags are used.
12.
Data mining for Web intelligence Total citations: 2, self-citations: 0, citations by others: 2
Searching, comprehending, and using the semistructured HTML, XML, and database-service-engine information stored on the Web poses a significant challenge. This data is more sophisticated and dynamic than the information commercial database systems store. To supplement keyword-based indexing, researchers have applied data mining to Web-page ranking. In this context, data mining helps Web search engines find high-quality Web pages and enhances Web click stream analysis. For the Web to reach its full potential, however, we must improve its services, make it more comprehensible, and increase its usability. As researchers continue to develop data mining techniques, the authors believe this technology will play an increasingly important role in meeting the challenges of developing the intelligent Web. Ultimately, data mining for Web intelligence will make the Web a richer, friendlier, and more intelligent resource that we can all share and explore. The paper considers how data mining holds the key to uncovering and cataloging the authoritative links, traversal patterns, and semantic structures that will bring intelligence and direction to our Web interactions.
13.
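In recent years the widespread use of the World Wide Web has given people an open way to access a large number of data sources, but a main obstacle to accessing Web data is that information both across and within Web pages lacks structure. To retrieve Web data more effectively, Web pages need to be managed in a structured way. The structured management of Web pages proposed in this paper has two steps: (1) converting Hypertext Markup Language (HTML) into Extensible Markup Language (XML); (2) hierarchical navigational retrieval.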
14.
Chia-Hui Chang Shih-Chien Kuo 《Intelligent Systems, IEEE》2004,19(6):56-64
Olera is a semisupervised information-extraction system that produces extraction rules from semistructured Web documents without requiring detailed annotation of the training documents. It performs well for program-generated Web pages with few training pages and limited user intervention.
15.
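Application of Self-Organizing Maps to Web Structure Mining Total citations: 1, self-citations: 0, citations by others: 1
This paper discusses the basic method of using self-organizing maps for Web structure mining. A SOM can represent data similarity intuitively and perform classification, makes data cluster analysis convenient, and can find useful information such as authoritative pages in Web mining.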
16.
17.
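This paper proposes a mechanism for retrieving information on the World Wide Web and builds a relational database. The problem is solved in three steps: handling the difficulties of HTML-based Web pages; extracting specified information from Web pages and consolidating it into structured documents; and giving an algorithm that converts the structured documents into corresponding relational tables. This lets users buy goods that suit them at minimum cost and in the shortest time.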
18.
Generating finite-state transducers for semi-structured data extraction from the Web Total citations: 13, self-citations: 0, citations by others: 13
Integrating a large number of Web information sources may significantly increase the utility of the World-Wide Web. A promising solution to the integration is through the use of a Web Information mediator that provides seamless, transparent access for the clients. Information mediators need wrappers to access a Web source as a structured database, but building wrappers by hand is impractical. Previous work on wrapper induction is too restrictive to handle a large number of Web pages that contain tuples with missing attributes, multiple values, variant attribute permutations, exceptions and typos. This paper presents SoftMealy, a novel wrapper representation formalism. This representation is based on a finite-state transducer (FST) and contextual rules. This approach can wrap a wide range of semistructured Web pages because FSTs can encode each different attribute permutation as a path. A SoftMealy wrapper can be induced from a handful of labeled examples using our generalization algorithm. We have implemented this approach into a prototype system and tested it on real Web pages. The performance statistics show that the sizes of the induced wrappers as well as the required training effort are linear with regard to the structural variance of the test pages. Our experiment also shows that the induced wrappers can generalize over unseen pages.
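
The last abstract notes that a finite-state transducer can encode each attribute permutation as a separate path. The following is a minimal sketch of that idea only, not SoftMealy itself: the transition table is written by hand rather than induced from labeled examples, and the contextual rules are reduced to exact matches on label tokens. The names `TRANSITIONS` and `extract`, and the labels `Name:` and `Email:`, are hypothetical and chosen purely for illustration.

```python
# Illustrative only: a hand-written transducer, not the SoftMealy
# induction algorithm. All names and label tokens are hypothetical.
from typing import Dict, List, Tuple

# Transition table: (current state, separator token) -> next state.
# "name then email" and "email then name" are both valid paths,
# and a path may stop early when an attribute is missing.
TRANSITIONS: Dict[Tuple[str, str], str] = {
    ("start", "Name:"): "name",
    ("start", "Email:"): "email",
    ("name", "Email:"): "email",
    ("email", "Name:"): "name",
}


def extract(tokens: List[str]) -> Dict[str, str]:
    """Run the transducer over tokens and collect one value per attribute."""
    state = "start"
    values: Dict[str, List[str]] = {}
    for token in tokens:
        next_state = TRANSITIONS.get((state, token))
        if next_state is not None:
            # Separator token: move to the state for the next attribute.
            state = next_state
            values.setdefault(state, [])
        elif state != "start":
            # Ordinary token: emit it as part of the current attribute.
            values[state].append(token)
        # Tokens seen before any separator are skipped.
    return {attr: " ".join(parts) for attr, parts in values.items()}
```

Calling `extract("Name: Ada Lovelace Email: ada@example.org".split())` and calling it on the same record with the two fields swapped yield the same attribute dictionary, because each ordering follows its own path through the table.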