Found 20 similar documents; search took 31 ms
1.
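This paper proposes a fast DCT-domain quasi-convolution downsampling algorithm for MPEG-2 to MPEG-1 transcoding. During transcoding, the header information is kept unchanged, and every four adjacent MPEG-2 macroblocks are converted into one downsampled MPEG-1 macroblock: the motion vector of the downsampled macroblock is determined by a max-min distance method, its macroblock type by a simple majority rule, and its quantization parameter by a weighted-average scheme. The sources of transcoding distortion in downsampled video are also analyzed. Experimental results show that, under the same conditions and compared with a cascaded pixel-domain transcoder (TM5), the proposed scheme reduces computational complexity by 67.6%, improves PSNR by 0.1 dB, and has very small bit-rate control error.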
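The three merging rules named in this abstract can be illustrated with a minimal sketch. The abstract does not give the details, so several points here are assumptions: the dictionary keys are hypothetical, the weights of the quantizer average are left to the caller, and the "max-min distance" rule is read as picking the candidate vector whose largest distance to the other three is smallest, then halving it for the reduced resolution.

```python
from collections import Counter

def merge_macroblocks(mbs):
    """Merge four full-resolution macroblocks into one downsampled macroblock.

    Each macroblock is a dict with hypothetical keys:
      'type'   : coding type, e.g. 'intra' or 'inter'
      'mv'     : (dx, dy) motion vector at full resolution
      'quant'  : quantizer scale
      'weight' : weight used in the quantizer average (an assumption)
    """
    assert len(mbs) == 4

    # Macroblock type: simple majority; ties go to the type seen first.
    mb_type = Counter(mb['type'] for mb in mbs).most_common(1)[0][0]

    # Quantizer: weighted average, falling back to a plain mean
    # when all weights are zero.
    total_w = sum(mb['weight'] for mb in mbs)
    if total_w == 0:
        quant = round(sum(mb['quant'] for mb in mbs) / len(mbs))
    else:
        quant = round(sum(mb['quant'] * mb['weight'] for mb in mbs) / total_w)

    # Motion vector (assumed reading of "max-min distance"): choose the
    # candidate whose worst-case squared distance to the others is smallest.
    mvs = [mb['mv'] for mb in mbs]

    def worst_distance(v):
        return max((v[0] - o[0]) ** 2 + (v[1] - o[1]) ** 2 for o in mvs)

    best = min(mvs, key=worst_distance)
    # Halve the vector because both dimensions shrink by a factor of two.
    mv = (best[0] / 2, best[1] / 2)

    return {'type': mb_type, 'mv': mv, 'quant': quant}
```

Choosing an existing candidate vector, rather than averaging all four, keeps one outlier from dragging the result toward a vector that matches none of the original macroblocks.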
2.
3.
4.
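A recovery algorithm for mismatched macroblocks in compressed-domain MPEG-2 to MPEG-4 video transcoding  Cited by: 3 (self-citations: 0, citations by others: 3)
As one of the most promising advanced compression coding standards, MPEG-4 will soon be applied in many fields such as video transmission, storage, and editing. However, MPEG-2 and MPEG-4, both excellent video systems, will certainly coexist for a long time. This paper proposes a transcoding method based on the MC-DCT domain, focusing on key problems caused by mismatched macroblock coding modes between the video streams of the two standards. Simulation results show that the algorithm not only performs MPEG-2 to MPEG-4 transcoding effectively, meeting real-time and low-latency requirements, but also conceals or recovers macroblocks with mismatched coding modes, noticeably improving visual quality.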
5.
Xingang Liu, Kook-Yeol Yoo 《Journal of Visual Communication and Image Representation》2010,21(2):155-166
The latest video coding standard, H.264/AVC, is widely used as the mobile and low-bitrate video codec in various multimedia terminals. On the other hand, the MPEG-2 MP@HL codec has become the center of digital video content since it is the standard codec for Digital TV (DTV). To bridge MPEG-2 content and mobile terminals, transcoding MPEG-2 content into H.264/AVC format is an inevitable technology in the digital video market. The main bottleneck in the process lies in the computational complexity. In H.264/AVC, variable block size (VBS) mode decision (MD) is used in the Interframe to improve the performance of motion compensated prediction. A macroblock (MB) that cannot be accurately predicted with one motion vector (MV) is partitioned into smaller blocks and predicted with different MVs. In addition, SKIP and Intra modes are also permitted in the Interframe MD of H.264/AVC to further improve the encoding performance. With the VBS MD technology, the Inter prediction accuracy can be improved significantly. However, the side effect is high computational complexity. In this paper, we propose a fast Interframe MD algorithm for MPEG-2 to H.264/AVC transcoding. The relationships between SKIP and Intra modes are detected first, to map these two kinds of modes directly from MPEG-2 to H.264/AVC. The MB activity is then scaled by the residual DCT energy obtained from the MPEG-2 decoding process to estimate the block sizes of the MB mode for H.264/AVC Interframe MD. In the proposed method, redundant candidate modes are eliminated effectively, reducing the computational complexity. It reduces Rate-to-Distortion Cost (RDCost) computation by about 85% and total processing time by about 45% compared with the well-known cascaded transcoder, while maintaining the video quality.
6.
Sandro Moiron, Sérgio Faria, António Navarro, Vitor Silva, Pedro Assunção 《Signal Processing: Image Communication》2009,24(8):637-650
This paper addresses video transcoding from H.264/AVC into MPEG-2 with reduced complexity and high rate-distortion efficiency. While the overall concept is based on a cascaded decoder–encoder, the novel adaptation methods developed in this work have the advantage of providing very good performance in H.264/AVC to MPEG-2 transcoding. The proposed approach exploits the similarities between the coding tools used in both standards, with the objective of obtaining a computationally efficient transcoder without penalising the signal quality. Fast and efficient methods are devised for conversion of macroblock coding modes and translation of motion information in order to compute the MPEG-2 coding format with a reduced number of operations, by reusing the corresponding data embedded in the incoming H.264/AVC coded stream. In comparison with a cascaded decoder–encoder, the fast transcoder achieves computational complexity savings up to 60% with slightly better peak signal-to-noise ratio (PSNR) at the same bitrate.
7.
8.
For a conventional downscaling video transcoder, a video server first has to decompress the video, perform downscaling operations in the pixel domain, and then recompress it. This is computationally intensive. However, it is difficult to perform video downscaling in the discrete cosine transform (DCT) domain since the prediction errors of each frame are computed from its immediate past higher resolution frames. Recently, a fast algorithm for DCT domain image downsampling has been proposed to obtain the downsampled version of DCT coefficients with low computational complexity. However, there is a mismatch between the downsampled version of DCT coefficients and the resampled motion vectors. In other words, significant quality degradation is introduced when the deviation between the original motion vectors and the resampled motion vector is large. In this paper, we propose a new architecture to obtain resampled DCT coefficients in the DCT domain by using the split and merge technique. Using our proposed video transcoder architecture, a macroblock is split into two regions: the dominant region and the boundary region. The dominant region of the macroblock can be transcoded in the DCT domain with low computational complexity and re-encoding error can be avoided. By transcoding the boundary region adaptively, low computational complexity can also be achieved. More importantly, the re-encoding error introduced in the boundary region can be controlled more dynamically. Experimental results show that our proposed video downscaling transcoder can lead to significant computational savings as well as videos with high quality as compared with the conventional approach. The proposed video transcoder is useful for video servers that provide quality service in real-time for heterogeneous clients.
9.
10.
《Signal Processing: Image Communication》2003,18(9):837-853
Many encoder architectures, notably for scalable systems such as MPEG-2 SNR scalability and stream morphing, require DCT-domain techniques for the estimation of the rate-distortion characteristics of a video frame for use in the rate control process. This paper develops a CBR rate control framework that can be used for single-layer MPEG-4 Simple Profile as well as two MPEG-4-based scalable systems, one following the structure described in MPEG-2 and also stream morphing. The framework supports perceptual quantization and also low bit rate operation by modelling macroblock overhead as well as the bits required to code non-zero DCT coefficients. Experimental results demonstrate the performance of stream morphing, in both objective and subjective measures, is in almost all cases superior to the MPEG-2 SNR scalability and MPEG-4 Fine Granularity Scalability.
11.
12.
13.
This paper discusses the problem of reduced-resolution transcoding of compressed video bitstreams. An analysis of drift errors is provided to identify the sources of quality degradation when transcoding to a lower spatial resolution. Two types of drift error are considered: a reference picture error, which has been identified in previous works, and error due to the noncommutative property of motion compensation and down-sampling, which is unique to this work. To overcome these sources of error, four novel architectures are presented. One architecture attempts to compensate for the reference picture error in the reduced resolution, while another architecture attempts to do the same in the original resolution. We present a third architecture that attempts to eliminate the second type of drift error and a final architecture that relies on an intrablock refresh method to compensate for all types of errors. In all of these architectures, a variety of macroblock level conversions are required, such as motion vector mapping and texture down-sampling. These conversions are discussed in detail. Another important issue for the transcoder is rate control. This is especially important for the intra-refresh architecture since it must find a balance between number of intrablocks used to compensate for errors and the associated rate-distortion characteristics of the low-resolution signal. The complexity and quality of the architectures are compared. Based on the results, we find that the intra-refresh architecture offers the best tradeoff between quality and complexity and is also the most flexible.
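The second drift source named in this abstract, the noncommutativity of motion compensation and down-sampling, can be seen in a toy one-dimensional example. This sketch is not the paper's analysis: it assumes integer-pel compensation with edge clamping, pair averaging as the down-sampling filter, and a truncated half vector at low resolution. All function names are hypothetical.

```python
def motion_compensate(ref, mv):
    """Predict sample i from ref[i + mv]; positions past the edges are clamped."""
    n = len(ref)
    return [ref[min(max(i + mv, 0), n - 1)] for i in range(n)]

def downsample(x):
    """Halve the resolution by averaging neighbouring pairs."""
    return [(x[i] + x[i + 1]) / 2 for i in range(0, len(x) - 1, 2)]

def drift(ref, mv):
    """Difference between the two processing orders.

    Path A: compensate at full resolution, then down-sample.
    Path B: down-sample first, then compensate with the truncated half vector.
    """
    path_a = downsample(motion_compensate(ref, mv))
    path_b = motion_compensate(downsample(ref), mv // 2)
    return [a - b for a, b in zip(path_a, path_b)]

reference = [0, 10, 20, 40, 80, 160, 320, 640]
odd_vector_drift = drift(reference, 1)   # half vector is not representable
even_vector_drift = drift(reference, 2)  # half vector is exact
```

With the odd vector every output sample differs between the two paths, while with the even vector the interior samples agree and only the clamped edge sample differs.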
14.
DU Yao-gang 《The Journal of China Universities of Posts and Telecommunications (English Edition)》2005,12(2)
1 Introduction Video transcoding in Discrete Cosine Transform (DCT) domain is highly desirable in many multimedia applications. It converts a video stream from one format to another which has a lower bit-rate and/or a different syntax to meet the requirements such as dynamical channel bandwidth or different monitors of end users. In this paper, video transcoding with spatial resolution reduction is concerned. The watermarking technique, which embeds a visible or invisible watermark into the video strea…
15.
The latest international video-coding standard H.264/AVC achieves significantly better coding performance compared to prior
video coding standards such as MPEG-2 and H.263, which have been widely used in today’s digital video applications. To provide
the interoperability between different coding standards, this paper proposes an efficient architecture for MPEG-2/H.263/H.264/AVC
to H.264/AVC intra frame transcoding, using the original information such as discrete cosine transform (DCT) coefficients
and coded mode type. Low-frequency components of DCT coefficients and a novel rate distortion cost function are used to select
a set of candidate modes for rate distortion optimization (RDO) decision. For H.263 and H.264/AVC, a mode refinement scheme
is utilized to eliminate unlikely modes before RDO mode decision, based on coded mode information. The experimental results,
conducted on JM12.2 with fast C8MB mode decision, reveal that average 58%, 59% and 60% of computation (re-encoding) time can
be saved for MPEG-2, H.263, H.264/AVC to H.264/AVC intra frame transcodings respectively, while preserving good coding performance
when compared with complex cascaded pixel domain transcoding (CCPDT); or average 88% (a speed up factor of 8) when compared
with CCPDT without considering fast C8MB. The proposed algorithm for H.264/AVC homogeneous transcoding is also compared to
the simple cascaded pixel domain transcoding (with original mode reuse). The results of this comparison indicate that the
proposed algorithm significantly outperforms the mode reuse algorithm in coding performance, with only slightly higher computation.
16.
《Signal Processing: Image Communication》2006,21(1):44-58
To achieve portability between different kinds of encoding formats and network environments, heterogeneous video transcoding becomes a key technique for reducing the bitrate of a previously compressed video signal. A frame-skipping transcoder is often used to avoid an unacceptable picture quality when a high transcoding ratio is required. Due to the high computational complexity and quality degradation introduced by conventional frame-skipping transcoders, a DCT-based video frame-skipping transcoder has been proposed recently. However, the transcoding process of the motion compensated macroblocks in the DCT domain becomes the bottleneck since IDCT and DCT processes are required. In this paper, we propose a new architecture of the frame-skipping transcoder to reduce the computational complexity of motion compensated macroblocks in the frame-skipping process. The new architecture transcodes the dominant region of a motion compensated macroblock in the DCT domain by making use of the DCT coefficients of the incoming bitstream and some pre-computed shift operators. By using a shifted version of the dominant vector, the re-encoding error introduced in the dominant region can be avoided. In addition, an adaptive transcoding architecture to transcode the boundary regions of MC macroblocks and a way to perform error compensation are proposed. This architecture can further speed up the transcoding process of the motion compensated macroblocks. Half-pixel accuracy related to our proposed frame-skipping transcoder is also addressed. Experimental results show that, as compared to the conventional or DCT-based transcoders, the new architecture is more robust to noise, gives rise to fewer requantization errors, and has low computational complexity.
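The abstract does not say how the dominant region or dominant vector is chosen. One common approach in the frame-skipping literature is to take the macroblock of the skipped frame that the reference block overlaps most, and add its motion vector to the incoming one. The sketch below illustrates that overlap rule only; it is an assumption, not the paper's stated method, and it ignores frame boundaries and half-pixel vectors. All names are hypothetical.

```python
MB = 16  # macroblock size in pixels

def dominant_overlap(mb_col, mb_row, mv):
    """Return ((col, row), area) for the skipped-frame macroblock that the
    motion-compensated reference block overlaps most."""
    x = mb_col * MB + mv[0]  # top-left corner of the reference block
    y = mb_row * MB + mv[1]
    col0, row0 = x // MB, y // MB  # floor division also handles negatives
    off_x, off_y = x - col0 * MB, y - row0 * MB
    # Overlap areas with the (up to) four macroblocks under the reference block.
    candidates = {
        (col0, row0): (MB - off_x) * (MB - off_y),
        (col0 + 1, row0): off_x * (MB - off_y),
        (col0, row0 + 1): (MB - off_x) * off_y,
        (col0 + 1, row0 + 1): off_x * off_y,
    }
    best = max(candidates, key=candidates.get)
    return best, candidates[best]

def compose_vector(mb_col, mb_row, mv, skipped_mvs):
    """Bypass the skipped frame by adding the dominant macroblock's vector.

    skipped_mvs maps (col, row) to the motion vector of that macroblock in
    the skipped frame; a missing entry is treated as a zero vector.
    """
    position, _ = dominant_overlap(mb_col, mb_row, mv)
    dom = skipped_mvs.get(position, (0, 0))
    return (mv[0] + dom[0], mv[1] + dom[1])
```

The four overlap areas always sum to the macroblock area, so the dominant area indicates how much of the block can reuse incoming data and how much falls in the boundary regions.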
17.
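This paper presents the design principles of a fast, memory-efficient DCT-domain video transcoder based on MPEG-4, which converts an H.261 video stream into an MPEG-4 stream with a low bit rate and low spatial resolution for wireless video processing. Compared with existing transcoders, it greatly reduces the memory actually required and also substantially lowers the computational complexity. Experimental results show that the video quality it produces is comparable to that of pixel-domain methods.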
18.
《Signal Processing: Image Communication》2004,19(1):47-65
Video transcoding is a popular technique for adapting the bit-rate or spatial/temporal resolution of a precoded video to suit better the constraints and requirements of different transmission networks and receiving devices. To minimize computational complexity, many fast methods have been proposed to obtain the motion vectors required for transcoding a precoded video through reducing its frame size by an integral factor. In this paper, we extend the existing work by developing and comparing several fast methods of downsizing precoded videos by a rational factor. Methods that outperform others under different conditions or with different computational requirements are identified, and an application scenario that can benefit from the proposed rational downsizing video transcoding is presented. An efficient scheme is also proposed to select the proper reduced frame size for sustaining the best possible video quality at a specified lower bit-rate. The superiority of the proposed transcoding approach in comparison with the existing integral downsizing video transcoding or cascaded video re-encoding methods is evident from the experimental results shown in this paper.
19.
Requantization for transcoding of MPEG-2 intraframes  Cited by: 4 (self-citations: 0, citations by others: 4)
An investigation on requantization for transcoding of video signals is carried out. Specifically, MPEG-2 compatible discrete cosine transform (DCT) intraframe coding is addressed. The aim of this work is twofold: first, to provide a theoretical analysis of the transcoding problem, and second, to derive quantization methods for efficient transcoding based on the results of the analysis. The mean squared error (MSE) cost function is proposed for designing a quantizer with minimum distortion resulting in up to 1.3 dB gain compared with the quantizer used in the MPEG-2 reference coder TM5. However, the MSE quantizer leads in general to a larger bit rate and may therefore only be applied locally to blocks of sensitive image content. A better rate-distortion performance can be provided by the maximum a posteriori (MAP) cost function. In critical cases, the MAP quantizer gives a 0.4 dB larger signal-to-noise-ratio (SNR) at the same bit rate compared with the TM5 quantizer. The results are not limited to MPEG-2 and can be adapted to other coding schemes such as H.263 or JPEG.
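The problem this abstract analyzes can be illustrated with a minimal sketch of why requantizing an already quantized coefficient may cost more than quantizing the original directly. It assumes a plain uniform quantizer with round-half-up and no dead zone or weighting matrix, so it is neither the TM5 quantizer nor the MSE or MAP designs of the paper. All function names are hypothetical.

```python
import math

def quantize(x, step):
    """Uniform quantizer: map a coefficient to an integer level (round half up)."""
    return math.floor(x / step + 0.5)

def reconstruct(level, step):
    """Inverse quantizer: map a level back to a coefficient value."""
    return level * step

def direct_error(x, q2):
    """Error when the original coefficient is quantized once with step q2."""
    return abs(x - reconstruct(quantize(x, q2), q2))

def requantization_error(x, q1, q2):
    """Error when the coefficient is first coded with step q1 and the
    decoded value is then requantized with the coarser step q2."""
    decoded = reconstruct(quantize(x, q1), q1)
    return abs(x - reconstruct(quantize(decoded, q2), q2))

# A coefficient of 19 coded with step 8 decodes to 16, which a step of 12
# maps to 12, while direct quantization of 19 with step 12 gives 24.
example_direct = direct_error(19, 12)
example_requantized = requantization_error(19, 8, 12)
```

In this example the cascaded path ends farther from the original value than the direct path, because the first quantizer moved the coefficient across a decision threshold of the second.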