共查询到16条相似文献,搜索用时 64 毫秒
1.
杨容 《数字社区&智能家居》2007,2(7):32-35
XML为在Web上发布和交换异质数据提供了相当的灵活性。但由于这种语言自身就有冗余的特性所以XML文档在体积都大于有相同数据内容的其他类型的文档。随着XML在Web上应用的扩展其数据大小自然也会随之而增加,而这实质上增加了数据的存储量、处理量和交换量,因此XML文档的体积问题也阻碍了XML的应用,特别是阻碍了XML在具有带宽和内存容量限制的应用上如移动通讯应用。在这篇文章中,我们将大致的介绍一下最近提出的几种针对XML的压缩算法并分析它们在解决XML文档体积问题上的技术和效能。 相似文献
2.
3.
4.
近年来,XML已成为Web上信息交流和资源共享的主要载体。但XML自身的自冗余特性限制了它的普遍应用。目前,已经有研究成果提出了XML的压缩方法。压缩的XML文档能够有效利用存储空间,节省网络带宽。在实际应用中,经常需要对压缩存储的XML文档进行更新。对于大的压缩文档,如果先解压再更新,会消耗大量时间,因此,高效的更新方法应该避免解压缩文档,在压缩的XML文档上直接进行更新操作。本文针对压缩XML文档中的数值类型(包括整型和浮点型)数据,研究了在保持压缩状态条件下如何进行有效的数值更新,提出了基于XPRESS实现的Naive数值更新方法,以及修改XPRESS编码方法实现的更为高效的Pivot数值更新方法。通过大量的实验证明,Pivot数值更新方法不仅能够提供高效的更新处理,而且保持了XPRESS的高压缩率。 相似文献
5.
6.
7.
蔡可训 《数字社区&智能家居》2009,5(12):9640-9643
当愈来愈多的数据资料以XML为标准格式进行存储时,由于其格式的不同而导致传统的数据库及查询语法无法适用,该文分析了一种全新的XML查询语言XQuery,并对其在相关领域的应用作了介绍和比较。最后,对XQuery的芡展前景作出展望。 相似文献
8.
9.
蔡可训 《数字社区&智能家居》2009,(34)
当愈来愈多的数据资料以XML为标准格式进行存储时,由于其格式的不同而导致传统的数据库及查询语法无法适用,该文分析了一种全新的XML查询语言XQuery,并对其在相关领域的应用作了介绍和比较。最后,对XQuery的发展前景作出展望。 相似文献
10.
XML(eXtemible Markup Language)是一种可扩展的元置标语言,XML技术的出现,给互联网带来了无限的商机。可以说,XML已经渗透到Intemet的各个角落。文章分别介绍了两种标记语言:超文本标记语言(HTML)和可扩展标记语言(XMC),并对其作简单的比较分析。 相似文献
11.
Comparative Analysis of XML Compression Technologies 总被引:1,自引:0,他引:1
XML provides flexibility in publishing and exchanging heterogeneous data on the Web. However, the language is by nature verbose
and thus XML documents are usually larger in size than other specifications containing the same data content. It is natural
to expect that the data size will continue to grow as XML data proliferates on the Web. The size problem of XML documents hinders the applications of XML, since it substantially increases the costs of storing, processing and exchanging
the data. The hindrance is more apparent in bandwidth- and memory-limited settings such as those applications related to mobile
communication.
In this paper, we survey a range of recently proposed XML specific compression technologies and study their efforts and capabilities
to overcome the size problem. First, by categorizing XML compression technologies into queriable and unqueriable compressors,
we explain the efforts in the representative technologies that aim at utilizing the exposed structure information from the
input XML documents. Second, we discuss the importance of queriable XML compressors and assess whether the compressed XML
documents generated from these technologies are able to support direct querying on XML data. Finally, we present a comparative
analysis of the state-of-the-art XML conscious compression technologies in terms of compression ratio, compression and decompression
times, memory consumption, and query performance. 相似文献
12.
XML已成为各种网络应用中数据存储和数据交换的标准.XML数据管理面临的最大困难在于结构与数据混合存储导致大量数据冗余,这极大地增加了XML数据存储、交换和处理的代价.对XML文档进行压缩可以在一定程度上解决这个问题.但现有XML压缩方法大都仅压缩单文档中的冗余信息.利用XML文档间的相似性,提出一种支持查询的多XML文档压缩存储方法XCluster.XCluster先利用XML有根有序标签树上改进的pq-gram近似距离对XML文档集进行层次聚类;然后合并每个聚类结果子集中文档的结构得到结构代表并进行字典编码压缩;同时合并不同文档内同一标签下的值内容,并根据其数据类型进行相应编码压缩.实验结果表明,在真实和生成的XML多文档数据集上,XCluster比XGrind和XQilla具有更好的压缩效果和查询效率. 相似文献
13.
14.
15.
We propose a novel partition path-based (PPB) grouping strategy to store compressed XML data in a stream of blocks. In addition,
we employ a minimal indexing scheme called block statistic signature (BSS) on the compressed data, which is a simple but effective
technique to support evaluation of selection and aggregate XPath queries of the compressed data. We present a formal analysis
and empirical study of these techniques. The BSS indexing is first extended into effective cluster statistic signature (CSS)
and multiple-cluster statistic signature (MSS) indexing by establishing more layers of indexes. We analyze how the response
time is affected by various parameters involved in our compression strategy such as the data stream block size, the number
of cluster layers, and the query selectivity. We also gain further insight about the compression and querying performance
by studying the optimal block size in a stream, which leads to the minimum processing cost for queries. The cost model analysis
provides a solid foundation for predicting the querying performance. Finally, we demonstrate that our PPB grouping and indexing
strategies are not only efficient enough to support path-based selection and aggregate queries of the compressed XML data,
but they also require relatively low computation time and storage space when compared with other state-of-the-art compression
strategies. 相似文献
16.
XQuery, the standard query language for XML, is increasingly popular among computer scientists with SQL background, since
XQuery and SQL require comparable skills. However, these experts are limited in number, and the availability of easier XQuery
“dialects” could be extremely valuable. With this motivation in mind, we designed XQBE, a visual dialect of XQuery inspired
by the QBE language (Query by Example). Coherent with the hierarchical nature of XML, XQBE uses one or more hierarchical structures
to denote the input documents and one structure to denote the document produced in output. These structures are annotated
to express selection predicates; explicit binding edges connecting the nodes of these structures visualize the input/output
mappings. This paper presents XQBE through several examples and describes the main features of our implementation of the language,
a visual editor coupled with an XQBE-to-XQuery translator. Indeed, the XQBE front-end is a general purpose user-friendly visual
query interface, capable of providing access to any data storage system that exposes XQuery APIs. Available schema information
can be exploited to guide users in querying data sets they are not familiar with. Also, switching between the visual and textual
versions of the same query could be helpful for XQuery learners. 相似文献