首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 598 毫秒
1.
可伸缩矢量图形(SVG)   总被引:6,自引:1,他引:5  
SVG是由W3C制定的用矢量描述图形的XML应用标准.他具有可扩充性、动态的、可交互性等优点.SVG支持复杂的事件处理能力,并可以嵌入在XHTML文档中,生成充分集成的高质量的Web页面.介绍了SVG的概念和特点,并描述了SVG文档的基本结构以及基本的SVG形状,最后给出了在XHTML中嵌入SVG文件的方法以及一个完整的文档实例.  相似文献   

2.
Major components of a complete text reading system   总被引:1,自引:0,他引:1  
The document image processes used in a recently developed text reading system are described. The system consists of three major components: document analysis, document understanding, and character segmentation/recognition. The document analysis component extracts lines of text from a page for recognition. The document understanding component extracts logical relationships between the document constituents. The character segmentation/recognition component extracts characters from a text line and recognizes them. Experiments on more than a hundred documents have proved that the proposed approaches to document analysis and document understanding are robust even for multicolumned and multiarticle documents containing graphics and photographs, and that the proposed character segmentation/recognition method is robust enough to cope with omnifont characters which frequently touch each other  相似文献   

3.
The JPEG standard is one of the most prevalent image compression schemes in use today. While JPEG was designed for use with natural images, it is also widely used for the encoding of raster documents. Unfortunately, JPEG's characteristic blocking and ringing artifacts can severely degrade the quality of text and graphics in complex documents. We propose a JPEG decompression algorithm which is designed to produce substantially higher quality images from the same standard JPEG encodings. The method works by incorporating a document image model into the decoding process which accounts for the wide variety of content in modern complex color documents. The method works by first segmenting the JPEG encoded document into regions corresponding to background, text, and picture content. The regions corresponding to text and background are then decoded using maximum a posteriori (MAP) estimation. Most importantly, the MAP reconstruction of the text regions uses a model which accounts for the spatial characteristics of text and graphics. Our experimental comparisons to the baseline JPEG decoding as well as to three other decoding schemes, demonstrate that our method substantially improves the quality of decoded images, both visually and as measured by PSNR.  相似文献   

4.
Models for storing and presenting multimedia documents   总被引:1,自引:0,他引:1  
This paper deals with models for storing and presenting multimedia documents integrating large amounts of data of different types. Usually, multimedia systems integrate a variety of data such as voice, graphics, text video and other types of images into a single document. Most of these data are not structured and therefore require a huge storage capacity. Such a requirement causes many problems for traditional database management systems which were not originally designed to manipulate data other than text. The presentation of multimedia documents incorporates documentary research that depends upon storage model and mechanisms. In this paper, we present two data storage models and several other associated models for the presentation of multimedia documents. The analysis of these models reveals their capacity to synchronize various temporal scenarios while allowing users simultaneous and secure access to multimedia systems.  相似文献   

5.
Robust detection of skew in document images   总被引:4,自引:0,他引:4  
  相似文献   

6.
The modern graphics processing unit (GPU) is the result of 40 years of evolution of hardware to accelerate graphics processing operations. It represents the convergence of support for multiple market segments: computer-aided design, medical imaging, digital content creation, document and presentation applications, and entertainment applications. The exceptional performance characteristics of the GPU make it an attractive target for other application domains. We examine some of this evolution, look at the structure of a modern GPU, and discuss how graphics processing exploits this structure and how nongraphical applications can take advantage of this capability. We discuss some of the technical and market issues around broader adoption of this technology.  相似文献   

7.
随着因特网的发展,以地域分布的方式来存储和管理企业文件成为一种很普遍的现象。跨地域的企业需要一种比较便捷的方式来共享这些分布在不同地方的文件或者协同完成它们的设计。针对在开发协同文件管理系统的构建模块时可能会遇到的一些基本问题,研究了文件协同的管理,并为用户上下文、文件上下文和协同上下文这3个正交的构建模块提供了一套比较完整的解决方案,包括用户群组管理、文件安全管理、文件关系管理以及文件流程控制。  相似文献   

8.
9.
Exploiting the Information Web   总被引:1,自引:0,他引:1  
The World Wide Web is an increasingly important data source for business decision making; however, extracting information from the Web remains one of the challenging issues related to Web business intelligence applications. To use heterogeneous Web data for decision making, documents containing relevant data must be located, and the data of interest within the documents must be identified and extracted. Currently, most automatic information extraction systems can only cope with a limited set of document formats or do not adapt well to changes in document structure, as a result, many real-world data sources with complex document structures cannot be consistently interpreted using a single information extraction system. This paper presents an adaptive information extraction system prototype that combines multiple information extraction approaches to allow more accurate and resilient data extraction for a wide variety of Web sources. The Amorphic Web information extraction system prototype can locate data of interest based on domain knowledge or page structure, can automatically generate a wrapper for a data source, and can detect when the structure of a Web-based resource has changed and act on this to search the updated resource to locate the desired data. The prototype Amorphic information extraction system demonstrated improved information extraction accuracy for the four different extraction scenarios examined when compared with traditional data extraction approaches  相似文献   

10.
Developers needing to realize high-level multimedia applications are essentially left on their own. Only a few programming tools allow the creation of multimedia effects based on a more general model than multimedia documents. No currently available ISO standard encompasses these needs. A standard in this area should focus more on the presentation aspects of multimedia and less on the coding, transfer, or hypermedia document aspects, which are covered other standards. It should also concentrate on programming tools rather than multimedia document format. These are exactly the main concerns of the Premo (Presentation Environment for Multimedia Objects) standard, the subject of the article. Premo's major features can be briefly summarized as follows: Premo is a presentation environment that aims to provide a standard programming environment in a very general sense, one that helps promote portable multimedia applications; Premo targets multimedia presentation, whereas earlier SC24 standards concentrated either on synthetic graphics or image-processing systems; Premo is object oriented. This means that, through standard object-oriented techniques, a Premo implementation becomes extensible and configurable. Object-oriented technology also provides a framework to describe distribution in a consistent manner  相似文献   

11.
When document designers localize documents for readers in another country, they often assume that the organization of the material used with its original audience will be effective for readers in another country. Whether this assumption is sound depends on what organizational structure readers in other countries are accustomed to seeing. The study examines the organizational structure of 17 Japanese expository texts with the goal of determining what expository structure Japanese readers are most accustomed to reading. The results lead to implications for document designers and readers in both Japan and native English speaking countries  相似文献   

12.
陈引迪  李隐峰 《电子科技》2014,27(5):107-109
提出基于Web的文档在线浏览实现方案。该方案能够实现各种文档的在线浏览功能。文中主要讲述如何高效地将文档文件转换为swf文件,并利用FlexPaper开源平台实现浏览器在线浏览过程。系统主要实现了文档转换、文档传输和文档在线浏览功能,效果理想。  相似文献   

13.
When document designers localize documents for readers in other cultures, they should consider what text organization will best suit those readers. The study presented examines American and Japanese readers' comprehension of and preference for expository text that contains a thesis and is organized either inductively or deductively. The results revealed that while Americans performed equally well with either organizational structure, Japanese readers recalled more information from inductively organized text. The implications for document designers in English and Japanese speaking countries are discussed  相似文献   

14.
The searching of an extensive document database for documents that are locally similar to a given query document, and the subsequent detection of similar regions between such documents, is considered as an essential task in the fields of information retrieval and data management. In this paper, we present a framework for such a task. The proposed framework employs the method of short‐read mapping, which is used in bioinformatics to reveal similarities between genomic sequences. In this paper, documents are considered biological objects; consequently, edit operations between locally similar documents are viewed as an evolutionary process. Accordingly, we are able to apply the method of evolution tracing in the detection of similar regions between documents. In addition, we propose heuristic methods to address issues associated with the different stages of the proposed framework, for example, a frequency‐based fragment ordering method and a locality‐aware interval aggregation method. Extensive experiments covering various scenarios related to the search of an extensive document database for documents that are locally similar to a given query document are considered, and the results indicate that the proposed framework outperforms existing methods.  相似文献   

15.
在查阅、处理IC(集成电路)封装、测试行业相关文件、资料的过程中,发现该行业的文件编制标准很不统一:不同的公司使用不同的标准,同一公司的文件编制也往往使用不同的规则。这给该行业的文件使用者在查找和审核带来很大困扰。编制一份正式受控的技术类型的文件不应只考虑内容,还需附含文件名称、文件编号、文件版号等等.这样才能作为追溯和判别产品是否符合加工要求的依据。随便一段文字、一次谈话、一个邮件、一次电话、一个会议纪要往往不适合作为指导产品加工的文字依据。给出适合该行业文件编制规则的基本要求。其他行业的文件编制也可用作参考。  相似文献   

16.
层次化中文文档分类   总被引:5,自引:0,他引:5  
对层次化中文文档分类进行了研究。采用两种途径进行层次化中文文档分类:第一是按照文档类别的层次结构,将一个大的分类问题分化成若干个小的分类问题,用Bayes分类方法分别建立分类器;第二是将层次化分类问题看成是一个更普遍的多类别、多标注分类问题进行求解,分别利用KNN和基于Boosting的分类方法进行层次化中文文档分类。测试结果显示层次化分类比平面分类能够取得更好的分类效果。  相似文献   

17.
针对互联网中多渠道信息传播导致的文档篡改和所有权归属等问题。通过分析PDF文档结构及内容流特性,文中提出一种基于文本格式的新型算法。设计使用Kerckhoffs安全性准则和数字签名算法,以解决版权归属难题,并能有效甄别文档篡改。文中将不可见水印作为便携式数字签名嵌入PDF文档中,依据PDF文档结构中内容流的字体渲染模式和字体饱和度的可修改特性,通过选择性的修改和添加相关属性,在不损坏文档结构的基础上达到嵌入水印文本的目的。通过从PDF文档中提取出的水印,与原始水印的一致性检测,来辨别文档的真伪及版权归属。实验结果表明,该算法具有良好的透明性和鲁棒性,以及较大地水印容量,能进行篡改判别。  相似文献   

18.
办公文档与固定版式文档格式关系探讨   总被引:1,自引:0,他引:1       下载免费PDF全文
李宁  田英爱  侯霞  梁琦 《电子学报》2008,36(Z1):128-132
 从文档承载信息的抽象程度,提出了文档分层的思想,分析了以流式办公文档和固定版式文档为主的不同层次文档之间的关系.利用Tagged PDF,成功尝试了在固定版式文档中蕴含和提取办公文档信息,说明固定版式文档中容纳结构化办公文档格式的可行性,指出文档格式标准应贯通两种文档格式,形成完整的标准体系.  相似文献   

19.
根据文档图像的文本行和竖直笔画边界,提出了一种基于数学形态学的分层校正算法,可以有效地恢复文档图像的正面平行视图。首先,利用最近邻链(NNC)构造局部自适应的线状结构元,通过方向平滑的闭运算(DSCA)把连通体(CCs)聚类为文本行,并利用文本行的中心线精确地定位水平消隐点。其次,根据局部邻域内CCs的平均高度,提出了一种基于行程开运算(RLOA)的目标函数,通过估计竖直笔画边界的偏斜方向定位竖直消隐点。最后,为了避免竖直消隐点对水平消隐点的影响,提出了一种新的分层校正方法以消除文档图像的透视失真。各种类型文档图像的实验表明,该方法明显改进了消隐点的定位精度,并具有较高的鲁棒性。  相似文献   

20.
A document retrieval system mainly consists of three components: document representation, user queries, and document evaluation. Each component may involve some uncertainties. Fuzzy set theory is a natural approach to coping with the representation of documents, queries, and the relevance of documents to a given query. The authors propose a fuzzy document retrieval model on the World Wide Web (WWW) environment to support conceptual queries. A flexible query expression is proposed to support different semantics of the queries. A concept network is adopted as the knowledge base to represent the relevance of the concepts. The concept network is explored from the WWW. Moreover, they also support neighborhood queries, which retrieve documents relevant to a document specified by a user. A system is currently being implemented to achieve these functions  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号