1.
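Scalable Vector Graphics (SVG) (total citations: 6; self-citations: 1; citations by others: 5)
SVG is an XML application standard developed by the W3C for describing graphics as vectors. It is extensible, dynamic, and interactive. SVG supports sophisticated event handling and can be embedded in XHTML documents to produce fully integrated, high-quality Web pages. This paper introduces the concept and features of SVG, describes the basic structure of an SVG document and the basic SVG shapes, and finally presents a method for embedding SVG files in XHTML along with a complete document example.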
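As an illustration of the embedding idea summarized in this abstract, the following is a minimal sketch, not the example from the paper itself. It assumes inline embedding through XML namespaces (one of several ways to place SVG in XHTML); the page content, the shape attributes, and the helper name `svg_shapes` are hypothetical.

```python
import xml.etree.ElementTree as ET

# Standard namespace URIs for XHTML and SVG.
XHTML_NS = "http://www.w3.org/1999/xhtml"
SVG_NS = "http://www.w3.org/2000/svg"

# A small XHTML page with an inline SVG fragment containing two basic shapes.
# The content is illustrative only.
PAGE = f"""<html xmlns="{XHTML_NS}">
  <head><title>Inline SVG sketch</title></head>
  <body>
    <p>A rectangle and a circle drawn with SVG:</p>
    <svg xmlns="{SVG_NS}" width="200" height="100" viewBox="0 0 200 100">
      <rect x="10" y="10" width="80" height="80" fill="steelblue"/>
      <circle cx="150" cy="50" r="40" fill="tomato"/>
    </svg>
  </body>
</html>"""


def svg_shapes(page: str) -> list[str]:
    """Parse the page as XML and return the local names of the SVG children.

    Parsing succeeds only if the page is well-formed XML, which XHTML requires.
    """
    root = ET.fromstring(page)
    svg = root.find(f".//{{{SVG_NS}}}svg")
    if svg is None:
        return []
    # Tags look like "{namespace}name"; keep only the local name.
    return [child.tag.split("}")[1] for child in svg]


if __name__ == "__main__":
    print(svg_shapes(PAGE))
```

Because the SVG elements live in their own namespace, an XML-aware consumer can tell them apart from the surrounding XHTML elements without any special markers.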
2.
Major components of a complete text reading system (total citations: 1; self-citations: 0; citations by others: 1)
Tsujimoto S. Asada H. 《Proceedings of the IEEE. Institute of Electrical and Electronics Engineers》1992,80(7):1133-1149
The document image processes used in a recently developed text reading system are described. The system consists of three major components: document analysis, document understanding, and character segmentation/recognition. The document analysis component extracts lines of text from a page for recognition. The document understanding component extracts logical relationships between the document constituents. The character segmentation/recognition component extracts characters from a text line and recognizes them. Experiments on more than a hundred documents have shown that the proposed approaches to document analysis and document understanding are robust even for multicolumned and multiarticle documents containing graphics and photographs, and that the proposed character segmentation/recognition method is robust enough to cope with omnifont characters, which frequently touch each other.
3.
Tak-Shing Wong Bouman C.A. Pollak I. Zhigang Fan 《IEEE transactions on image processing》2009,18(11):2518-2535
The JPEG standard is one of the most prevalent image compression schemes in use today. While JPEG was designed for use with natural images, it is also widely used for the encoding of raster documents. Unfortunately, JPEG's characteristic blocking and ringing artifacts can severely degrade the quality of text and graphics in complex documents. We propose a JPEG decompression algorithm which is designed to produce substantially higher quality images from the same standard JPEG encodings. The method works by incorporating a document image model into the decoding process which accounts for the wide variety of content in modern complex color documents. The method first segments the JPEG encoded document into regions corresponding to background, text, and picture content. The regions corresponding to text and background are then decoded using maximum a posteriori (MAP) estimation. Most importantly, the MAP reconstruction of the text regions uses a model which accounts for the spatial characteristics of text and graphics. Our experimental comparisons to the baseline JPEG decoding as well as to three other decoding schemes demonstrate that our method substantially improves the quality of decoded images, both visually and as measured by PSNR.
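The abstract above reports quality in terms of PSNR. As a reminder of what that metric computes, here is a minimal sketch using the standard definition, PSNR = 10 log10(peak^2 / MSE). It is not the authors' evaluation code; the function name `psnr`, the flat-list pixel representation, and the default peak of 255 (8-bit samples) are assumptions made for illustration.

```python
import math
from typing import Sequence


def psnr(reference: Sequence[float], decoded: Sequence[float], peak: float = 255.0) -> float:
    """Peak signal-to-noise ratio in dB between two equally sized pixel sequences.

    `peak` is the maximum possible sample value (255 for 8-bit images).
    Returns infinity when the two sequences are identical.
    """
    if len(reference) != len(decoded) or len(reference) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    # Mean squared error over all samples.
    mse = sum((r - d) ** 2 for r, d in zip(reference, decoded)) / len(reference)
    if mse == 0:
        return math.inf
    return 10.0 * math.log10(peak ** 2 / mse)


if __name__ == "__main__":
    original = [10, 20, 30, 40]
    reconstructed = [11, 19, 30, 42]
    print(f"PSNR: {psnr(original, reconstructed):.2f} dB")
```

A higher PSNR means the decoded samples are closer to the reference, so a decoder that suppresses blocking and ringing would be expected to score higher on the same encoded input.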
4.
Models for storing and presenting multimedia documents (total citations: 1; self-citations: 0; citations by others: 1)
This paper deals with models for storing and presenting multimedia documents integrating large amounts of data of different types. Usually, multimedia systems integrate a variety of data such as voice, graphics, text, video, and other types of images into a single document. Most of these data are not structured and therefore require a huge storage capacity. Such a requirement causes many problems for traditional database management systems, which were not originally designed to manipulate data other than text. The presentation of multimedia documents incorporates documentary research that depends upon the storage model and mechanisms. In this paper, we present two data storage models and several other associated models for the presentation of multimedia documents. The analysis of these models reveals their capacity to synchronize various temporal scenarios while allowing users simultaneous and secure access to multimedia systems.
5.
Robust detection of skew in document images (total citations: 4; self-citations: 0; citations by others: 4)
6.
Blythe D. 《Proceedings of the IEEE. Institute of Electrical and Electronics Engineers》2008,96(5):761-778
The modern graphics processing unit (GPU) is the result of 40 years of evolution of hardware to accelerate graphics processing operations. It represents the convergence of support for multiple market segments: computer-aided design, medical imaging, digital content creation, document and presentation applications, and entertainment applications. The exceptional performance characteristics of the GPU make it an attractive target for other application domains. We examine some of this evolution, look at the structure of a modern GPU, and discuss how graphics processing exploits this structure and how nongraphical applications can take advantage of this capability. We discuss some of the technical and market issues around broader adoption of this technology.
7.
8.
9.
Exploiting the Information Web (total citations: 1; self-citations: 0; citations by others: 1)
Gregg D.G. Walczak S. 《IEEE transactions on systems, man and cybernetics. Part C, Applications and reviews》2007,37(1):109-125
The World Wide Web is an increasingly important data source for business decision making; however, extracting information from the Web remains one of the challenging issues related to Web business intelligence applications. To use heterogeneous Web data for decision making, documents containing relevant data must be located, and the data of interest within the documents must be identified and extracted. Currently, most automatic information extraction systems can only cope with a limited set of document formats or do not adapt well to changes in document structure; as a result, many real-world data sources with complex document structures cannot be consistently interpreted using a single information extraction system. This paper presents an adaptive information extraction system prototype that combines multiple information extraction approaches to allow more accurate and resilient data extraction for a wide variety of Web sources. The Amorphic Web information extraction system prototype can locate data of interest based on domain knowledge or page structure, can automatically generate a wrapper for a data source, and can detect when the structure of a Web-based resource has changed and then search the updated resource to locate the desired data. The prototype Amorphic information extraction system demonstrated improved information extraction accuracy for the four different extraction scenarios examined when compared with traditional data extraction approaches.
10.
Developers needing to realize high-level multimedia applications are essentially left on their own. Only a few programming tools allow the creation of multimedia effects based on a more general model than multimedia documents. No currently available ISO standard encompasses these needs. A standard in this area should focus more on the presentation aspects of multimedia and less on the coding, transfer, or hypermedia document aspects, which are covered by other standards. It should also concentrate on programming tools rather than multimedia document format. These are exactly the main concerns of the Premo (Presentation Environment for Multimedia Objects) standard, the subject of the article. Premo's major features can be briefly summarized as follows: Premo is a presentation environment that aims to provide a standard programming environment in a very general sense, one that helps promote portable multimedia applications; Premo targets multimedia presentation, whereas earlier SC24 standards concentrated either on synthetic graphics or image-processing systems; Premo is object oriented. This means that, through standard object-oriented techniques, a Premo implementation becomes extensible and configurable. Object-oriented technology also provides a framework to describe distribution in a consistent manner.
11.
When document designers localize documents for readers in another country, they often assume that the organization of the material used with its original audience will be effective for the new readers. Whether this assumption is sound depends on what organizational structure readers in other countries are accustomed to seeing. The study examines the organizational structure of 17 Japanese expository texts with the goal of determining what expository structure Japanese readers are most accustomed to reading. The results lead to implications for document designers and readers in both Japan and native English speaking countries.
12.
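This paper proposes a Web-based scheme for online document browsing that supports online viewing of various document types. It mainly describes how to efficiently convert document files to SWF files and how to use the open-source FlexPaper platform to implement in-browser viewing. The system implements document conversion, document transfer, and online document browsing, with satisfactory results.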
13.
When document designers localize documents for readers in other cultures, they should consider what text organization will best suit those readers. The study presented examines American and Japanese readers' comprehension of and preference for expository text that contains a thesis and is organized either inductively or deductively. The results revealed that while Americans performed equally well with either organizational structure, Japanese readers recalled more information from inductively organized text. The implications for document designers in English and Japanese speaking countries are discussed.
14.
Searching an extensive document database for documents that are locally similar to a given query document, and subsequently detecting similar regions between such documents, is considered an essential task in the fields of information retrieval and data management. In this paper, we present a framework for such a task. The proposed framework employs the method of short-read mapping, which is used in bioinformatics to reveal similarities between genomic sequences. In this paper, documents are considered biological objects; consequently, edit operations between locally similar documents are viewed as an evolutionary process. Accordingly, we are able to apply the method of evolution tracing in the detection of similar regions between documents. In addition, we propose heuristic methods to address issues associated with the different stages of the proposed framework, for example, a frequency-based fragment ordering method and a locality-aware interval aggregation method. Extensive experiments covering various scenarios related to the search of an extensive document database for documents that are locally similar to a given query document are considered, and the results indicate that the proposed framework outperforms existing methods.
15.
杜椿楣 《电子工业专用设备》2013,(10):58-62
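While consulting and processing documents and materials for the IC (integrated circuit) packaging and testing industry, the author found that document preparation standards in the industry are far from uniform: different companies use different standards, and even within one company documents are often prepared under different rules. This causes considerable difficulty for users of these documents when searching for and reviewing them. A formally controlled technical document should not consider content alone; it must also carry a document title, document number, document revision, and so on, so that it can serve as a basis for traceability and for judging whether a product meets processing requirements. An arbitrary passage of text, a conversation, an email, a phone call, or a set of meeting minutes is often unsuitable as the written basis for guiding product processing. The paper gives basic requirements for document preparation rules suited to this industry, which can also serve as a reference for document preparation in other industries.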
16.
17.
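To address the document tampering and ownership attribution problems caused by multi-channel information dissemination on the Internet, this paper proposes a new text-format-based algorithm derived from an analysis of PDF document structure and content stream characteristics. The design uses Kerckhoffs's security principle and a digital signature algorithm to resolve copyright attribution and to identify document tampering effectively. An invisible watermark is embedded in the PDF document as a portable digital signature: exploiting the modifiable font rendering mode and font saturation attributes of content streams in the PDF document structure, the relevant attributes are selectively modified and added so that the watermark text is embedded without damaging the document structure. The authenticity and copyright ownership of a document are determined by checking the watermark extracted from the PDF document for consistency with the original watermark. Experimental results show that the algorithm has good transparency and robustness and a relatively large watermark capacity, and that it can detect tampering.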
18.
19.
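Based on the text lines and vertical stroke boundaries of document images, a hierarchical rectification algorithm based on mathematical morphology is proposed that can effectively recover the fronto-parallel view of a document image. First, nearest-neighbor chains (NNC) are used to construct locally adaptive linear structuring elements; connected components (CCs) are clustered into text lines by a direction-smoothed closing operation (DSCA), and the center lines of the text lines are used to locate the horizontal vanishing point precisely. Second, based on the average height of CCs in a local neighborhood, an objective function based on a run-length opening operation (RLOA) is proposed, and the vertical vanishing point is located by estimating the skew direction of vertical stroke boundaries. Finally, to avoid the influence of the vertical vanishing point on the horizontal vanishing point, a new hierarchical rectification method is proposed to remove the perspective distortion of document images. Experiments on various types of document images show that the method markedly improves the accuracy of vanishing point localization and is highly robust.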
20.
Chang C.-S. Chen A.L.P. 《IEEE transactions on systems, man and cybernetics. Part C, Applications and reviews》1998,28(2):300-308
A document retrieval system mainly consists of three components: document representation, user queries, and document evaluation. Each component may involve some uncertainties. Fuzzy set theory is a natural approach to coping with the representation of documents, queries, and the relevance of documents to a given query. The authors propose a fuzzy document retrieval model on the World Wide Web (WWW) environment to support conceptual queries. A flexible query expression is proposed to support different semantics of the queries. A concept network is adopted as the knowledge base to represent the relevance of the concepts. The concept network is explored from the WWW. Moreover, they also support neighborhood queries, which retrieve documents relevant to a document specified by a user. A system is currently being implemented to achieve these functions.
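To make the fuzzy-set view of retrieval in this abstract concrete, the sketch below scores documents against a Boolean-style query using the classical fuzzy operators (AND as minimum, OR as maximum). It is not the authors' model: the concept network and neighborhood queries are omitted, and the document collection, membership degrees, and the names `membership`, `score`, and `rank` are hypothetical.

```python
from typing import Dict, List, Tuple

# A document is represented as a fuzzy set: concept -> membership degree in [0, 1].
FuzzyDoc = Dict[str, float]

# Hypothetical collection with made-up membership degrees.
DOCS: Dict[str, FuzzyDoc] = {
    "d1": {"svg": 0.9, "xml": 0.7},
    "d2": {"svg": 0.2, "jpeg": 0.8},
}


def membership(doc: FuzzyDoc, concept: str) -> float:
    """Degree to which the document is about the concept (0.0 if absent)."""
    return doc.get(concept, 0.0)


def score(doc: FuzzyDoc) -> float:
    """Evaluate the example query: svg AND (xml OR jpeg).

    Classical fuzzy operators: AND is min, OR is max.
    """
    either = max(membership(doc, "xml"), membership(doc, "jpeg"))
    return min(membership(doc, "svg"), either)


def rank(docs: Dict[str, FuzzyDoc]) -> List[Tuple[str, float]]:
    """Return (document id, relevance degree) pairs, most relevant first."""
    scored = [(doc_id, score(doc)) for doc_id, doc in docs.items()]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    for doc_id, degree in rank(DOCS):
        print(doc_id, degree)
```

Unlike strict Boolean retrieval, every document receives a graded relevance degree, so partially matching documents are ranked rather than discarded.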