期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Video Transcoding with H.263 Bit-Streams

Jeongnam Youn Ming-Ting Sun 《Journal of Visual Communication and Image Representation》2000,11(4):385-403

Video transcoding is one of the key technologies in implementing dynamic adaptation of the bit-rate of a coded video bit-stream to the available bandwidth over various networks. Many fast transcoder architectures have been proposed to achieve fast processing. However, they suffer from quality degradation caused by the drift error. In this paper, we investigate the drift caused by the fast transcoder architectures for transcoding H.263 bit-streams. We discuss the limitations of the fast transcoder architectures and the flexibility that can be offered by a cascaded pixel-domain transcoder. Since the cascaded pixel-domain transcoder can achieve drift-free performance, we also propose methods to reduce the computational complexity of the drift-free cascaded pixel-domain transcoder. 相似文献

2.

针对移动视频终端的基于感兴趣区域的快速转换编码

贺强杨高波张兆扬《通信学报》2009,30(7):36-40

提出一种针对移动终端,基于感兴趣区域(ROI)的快速转换编码方案.首先,根据移动终端的显示尺寸,在视频服务器端利用视觉关注度模型从H.264视频流自动地检测出ROI.然后,在代理服务器端根据ROI转换编码生成适合于移动终端的视频流.此外,针对此转码体系提出了一种快速模式选择算法.仿真实验结果表明,本方案可在降低网络占用带宽的情况下,获得较好的主观视觉效果,并且计算量小. 相似文献

3.

视频转换编码的快速图标插入技术研究 总被引：1，自引：0，他引：1

刘昱李桂苓郭映王立《电路与系统学报》2005,10(1):53-57

数字视频转换编码技术为传输带宽匹配等问题提供解决方案,在许多具体应用中,要求在转换编码时在视频图像中加入图形标识,本文对转换编码中图标插入技术进行了分析和研究,给出了一种快速图标插入转换编码技术,提出了图标插入后运动矢量及宏块模式判别等问题的优化算法,实验结果表明,使用本算法进行图标插入的转换编码流质量良好. 相似文献

4.

Arbitrary-ratio image resizing using fast DCT of composite length for DCT-based transcoder.

Young Seo Park Hyun Wook Park 《IEEE transactions on image processing》2006,15(2):494-500

We propose a fast arbitrary-ratio image resizing method for transcoding of the compressed images. The downsizing process in the discrete cosine transform (DCT) domain can be implemented by truncating high-frequency coefficients, whereas the upsizing process is implemented in the DCT domain by padding zero coefficients to the high-frequency part. The proposed method combines a fast inverse and forward DCT of composite length for arbitrary-ratio upsizing or downsizing. According to the resizing ratio, truncating the high-frequency coefficients and padding zeros are appropriately considered by combining the inverse DCT and forward DCT. The proposed method shows a good peak signal-to-noise ratio and less computational complexity compared with the spatial-domain and previous DCT-domain image resizing methods. 相似文献

5.

Robust Video Watermarking of H.264/AVC

Jing Zhang Anthony T. S. Ho Gang Qiu Pina Marziliano 《Circuits and Systems II: Express Briefs, IEEE Transactions on》2007,54(2):205-209

A robust video watermarking scheme of the state-of-the-art video coding standard H.264/AVC is proposed in this brief. 2-D 8-bit watermarks such as detailed company trademarks or logos can be used as inconvertible watermark for copyright protection. A grayscale watermark pattern is first modified to accommodate the H.264/AVC computational constraints, and then embedded into video data in the compressed domain. With the proposed method, the video watermarking scheme can achieve high robustness and good visual quality without increasing the overall bit-rate. Experimental results show that our algorithm can robustly survive transcoding process and strong common signal processing attacks, such as bit-rate reduction, Gaussian filtering and contrast enhancement 相似文献

6.

On re-composition of motion compensated macroblocks for DCT-based video transcoding

《Signal Processing: Image Communication》2006,21(1):44-58

To achieve portability between different kinds of encoding formats and network environments, heterogeneous video transcoding becomes a key technique for reducing the bitrate of a previously compressed video signal. A frame-skipping transcoder is often used to avoid an unacceptable picture quality when high transcoding ratio is required. Due to high computational complexity and quality degradation introduced by conventional frame-skipping transcoders, a DCT-based video frame-skipping transcoder has been proposed recently. However, the transcoding process of the motion compensated macroblocks in the DCT domain becomes the bottleneck since IDCT and DCT processes are required. In this paper, we propose a new architecture of the frame-skipping transcoder to reduce the computational complexity of motion compensated macroblocks in the frame-skipping process. The new architecture transcodes the dominant region of a motion compensated macroblock in the DCT domain by making use of the DCT coefficients of the incoming bistream and some pre-computed shift operators. By using a shifted version of the dominant vector, the re-encoding error introduced in the dominant region can be avoided. On the other hand, an adaptive transcoding architecture to transcode the boundary regions of MC marcoblocks and a way to perform error compensation are proposed. This architecture can further speed up the transcoding process of the motion compensated macroblocks. Half pixel accuracy related to our proposed frame skipping transcoder is also addressed. Experimental results show that, as compared to the conventional or DCT-based transocders, the new architecture is more robust to noise, gives rise to fewer requantization errors, and requires simple computational complexity. 相似文献

7.

DCT域中高清电视到标清电视转码技术及软件实现

刘怀宇李延奇梁铁朱维乐《中国有线电视》2006,(Z1)

随着数字电视的发展与普及,不同编码视频的转码变得越来越重要。研究DCT域中空间分辨率下采样的关键技术,包括DCT域图像尺寸下采样、DCT域的运动补偿、运动矢量重用等,并以软件方式实现了HDTV到SDTV的转码。相似文献

8.

Framework for Fast and Efficient Cloud Video Transcoding System Using Intelligent Splitter and Hadoop MapReduce

D. Kesavaraja A. Shenbagavalli 《Wireless Personal Communications》2018,102(3):2117-2132

The way computing was done has changed a lot in recent times. Nowadays mobile devices have been the stealing the show. These devices come in different specifications, that any multimedia content that is to be played requires transcoding for better user experiences. Cloud based video services cater to the needs of the end user based on their requirements, through video transcoding. Hence video transcoding plays a very important role in today’s evolving streaming media environment. The major problem with video transcoding is that it consumes a lot of time and impacts seriously on the quality of the output. Transcoding uses the device information to transform the video into the required format and this process is done in a distributed fashion, to speed up the process. This work proposes an Intelligent Video Splitter which uses the Map Reduce algorithm to provide efficiency based on time factor. The important performance metrics including video distortion (VD), video distortion due to frame dependency (FDD) were considered. The results showed that the proposed framework perceptibly outperforms than the prevailing strategies. It provides higher video quality as a result of it introduces less video distortion. In future this method may be extended to supply associate automatic device aware video standards. 相似文献

9.

DCT域中高清电视到标清电视转码技术及软件实现

刘怀宇李延奇梁铁朱维乐《中国有线电视》2006,(3):289-292

随着数字电视的发展与普及，不同编码视频的转码变得越来越重要。研究DCT域中空间分辨率下采样的关键技术，包括DCT域图像尺寸下采样、DCT域的运动补偿、运动矢量重用等，并以软件方式实现了HDTV到SDTV的转码。相似文献

10.

MPEG-2到MPEG-4视频转编码中的快速运动矢量重估算法 总被引：1，自引：0，他引：1

鲁照华李华陈宏俞斯乐《电路与系统学报》2003,8(5):141-144

本文讨论了MPEG-2到MPEG-4视频转编码问题，提出了两种快速运动矢量重估算法，分别基于空域和时域。我们的算法利用已有运动信息和DCT变换的直流系数进行运动估计，可明显减少运算量，提高编码速度，并获得了与全搜索算法接近的图像质量，可满足网络视频传输的实时要求。相似文献

11.

Fast computation of MC-DCT for video transcoding

Assuncao P.A.A. Ghanbari M. 《Electronics letters》1997,33(4):284-286

A fast method to reduce the computational complexity of a frequency domain transcoder for bit-rate reduction of compressed video bit streams is proposed. The proposed method reduces the computational complexity of a pixel domain transcoder by 81.25% while the introduced distortion in the transcoded pictures is 0.2 dB 相似文献

12.

Video transcoding architectures and techniques: an overview 总被引：2，自引：0，他引：2

Vetro A. Christopoulos C. Huifang Sun 《Signal Processing Magazine, IEEE》2003,20(2):18-29

Throughout this article, we concentrate on the transcoding of block-based video coding schemes that use hybrid discrete cosine transform (DCT) and motion compensation (MC). In such schemes, the frames of the video sequence are divided into macroblocks (MBs), where each MB typically consists of a luminance block (e.g., of size 16 /spl times/ 16, or alternatively, four 8 /spl times/ 8 blocks) along with corresponding chrominance blocks (e.g., 8 /spl times/ 8 Cb and 8 /spl times/ 8 Cr). This article emphasizes the processing that is done on the luminance components of the video. In general, the chrominance components can be handled similarly and will not be discussed in this article. We first provide an overview of the techniques used for bit-rate reduction and the corresponding architectures that have been proposed. Then, we describe the advances regarding spatial and temporal resolution reduction techniques and architectures. Additionally, an overview of error resilient transcoding is also provided, as well as a discussion of scalable coding techniques and how they relate to video transcoding. Finally, the article ends with concluding remarks, including pointers to other works on video transcoding that have not been covered in this article, as well as some future directions. 相似文献

13.

Arbitrary Frame Rate Transcoding Through Temporal and Spatial Complexity 总被引：1，自引：0，他引：1

《Broadcasting, IEEE Transactions on》2009,55(4):767-775

In this paper, an arbitrary frame rate transcoding joint considering temporal and spatial complexity of frames in the adaptive length sliding window is proposed. The length of a sliding window can be adjusted according to bandwidth variation in order to decide the number of skipped frames. The proposed method preserves significant frames and drops non-significant ones using the complexity measurements. Moreover, the motion vector composition algorithm is proposed to reduce the computations of motion estimation process by adopting the coding feature of variable block sizes in H.264/AVC video transcoder. Experimental results show that the proposed method achieves higher visual quality compared to other existing methods. After combining with the proposed fast motion composition algorithm, our proposed algorithm reduces encoding time significantly with slight visual quality degradation. 相似文献

14.

General Architecture for MPEG-2/H.263/H.264/AVC to H.264/AVC Intra Frame Transcoding

Yinyi Lin He Han Hsu 《Journal of Signal Processing Systems》2011,65(1):89-103

The latest international video-coding standard H.264/AVC significantly achieves better coding performance compared to prior video coding standards such as MPEG-2 and H.263, which have been widely used in today’s digital video applications. To provide the interoperability between different coding standards, this paper proposes an efficient architecture for MPEG-2/H.263/H.264/AVC to H.264/AVC intra frame transcoding, using the original information such as discrete cosine transform (DCT) coefficients and coded mode type. Low-frequency components of DCT coefficients and a novel rate distortion cost function are used to select a set of candidate modes for rate distortion optimization (RDO) decision. For H.263 and H.264/AVC, a mode refinement scheme is utilized to eliminate unlikely modes before RDO mode decision, based on coded mode information. The experimental results, conducted on JM12.2 with fast C8MB mode decision, reveal that average 58%, 59% and 60% of computation (re-encoding) time can be saved for MPEG-2, H.263, H.264/AVC to H.264/AVC intra frame transcodings respectively, while preserving good coding performance when compared with complex cascaded pixel domain transcoding (CCPDT); or average 88% (a speed up factor of 8) when compared with CCPDT without considering fast C8MB. The proposed algorithm for H.264/AVC homogeneous transcoding is also compared to the simple cascaded pixel domain transcoding (with original mode reuse). The results of this comparison indicate that the proposed algorithm significantly outperforms the mode reuse algorithm in coding performance, with only slightly higher computation. 相似文献

15.

Very low bit-rate video coding with DFD segmentation

Wei Li Vasudev Bhaskaran Murat Kunt 《Signal Processing: Image Communication》1995,7(4-6):419-434

A video coding system for applications requiring very low bit-rate is presented. This coding scheme uses an intraframe coder for the initial frame in the video sequence and subsequent frames are coded using an interframe coding method. A wavelet-based technique is used for intraframe coding. For interframe coding, displaced frame differences (DFD) are computed and coded using a segmentation-based method wherein the displaced frame difference is segmented into active and inactive regions using morphological operators. To meet the very low bit-rate requirements, the motion vectors are processed so as to reduce their contribution to the overall bit-rate. To reduce coding artifacts, a post-processing technique is developed for use at the decoder. Coding performance of the proposed coding scheme is evaluated at 16 kbit/s and 32 kbit/s using luminance component of several typical test sequences at QCIF resolution with a frame rate 8.3 frame/s. 相似文献

16.

Error concealment aware rate shaping for wireless video transport

Trista Pei-chun Chen Tsuhan Chen 《Signal Processing: Image Communication》2003,18(10):889-905

Streaming of video, which is both source- and channel-coded, over wireless networks faces the challenge of time-varying packet loss rate and fluctuating bandwidth. Rate shaping (RS) has been proposed to reduce the bit-rate of a precoded video bitstream to adapt to the real-time bandwidth variation. In our earlier work, rate shaping was extended to consider not only the bandwidth but also the packet loss rate variations. Rate-distortion optimized rate adaptation is performed on the precoded video that is a scalable coded bitstream protected by forward error correction codes. In this paper, we propose an RS scheme that further takes into account the error concealment (EC) method used at the receiver. We refer to this scheme as EC aware RS (ECARS). When performing ECARS, first ECARS needs to know the benefit/gain of sending each part of the precoded video, as opposed to not sending it but reconstructing it by EC. Then given a certain packet loss probability, the expected gain can be derived and be included in the rate-distortion optimization problem formulation. Finally, ECARS performs rate-distortion optimization to adapt the rate of the precoded video. A two-stage rate-distortion optimization approach is proposed to solve the ECARS rate-distortion optimization problem. In addition to ECARS, the precoding process can be EC aware to prioritize the precoded video based on the gain. We present an example EC aware precoding process by means of macroblock prioritization. Experiment results of ECARS together with EC aware precoding are shown to have excellent performance. 相似文献

17.

Foveation embedded DCT domain video transcoding

Shizhong Liu Alan C. Bovik 《Journal of Visual Communication and Image Representation》2005,16(6):643-667

Video transcoding is a key technology to support video communications over heterogeneous networks. Although quite a bit of research effort has been made in video transcoding due to its wide applications, most video transcoding techniques proposed in the literature are optimized based on the simple mean squared error (MSE) metric which does not correlate well with the human visual perception. In this paper, foveation, a property of the HVS, is exploited in video transcoding. The proposed foveation embedded DCT domain video transcoding can reduce the bit rate without compromising visual quality or achieve better subjective quality for a given bit rate by shaping the compression distortion according to the foveated contrast sensitivity function of the HVS. In addition, fast algorithms for video foveation filtering and DCT domain inverse motion compensation are developed, which significantly improve the efficiency of video transcoding. 相似文献

18.

Hybrid model-and-object-based real-time conversational video coding

《Signal Processing: Image Communication》2015

Bandwidth-constrained real-time conversational video communications (such as mobile teleconferencing) require video codecs with good rate-distortion characteristics at low bit-rates and modest computational complexity. While target-specific object-based and model-based coding methods have been proposed for low bit-rate conversational video coding, difficulties in generalization and high computational complexity hinder their practical utilization. In this paper, we propose a low bit-rate coding method for typical conversational video by combining two-dimensional model-based coding of face regions and object-based coding of non-face head-shoulder regions, achieving high-quality face reconstruction and low overall bit-rate with real-time encoding capability. Experiments on typical conversational test sequences confirm that, compared to other conversational video codecs, our model-and-object-based coding method offers superior rate-distortion performance at low bit-rates. 相似文献

19.

DCT-based video downscaling transcoder using split and merge technique.

Kai-Tat Fung Wan-Chi Siu 《IEEE transactions on image processing》2006,15(2):394-403

For a conventional downscaling video transcoder, a video server has firstly to decompress the video, perform downscaling operations in the pixel domain, and then recompress it. This is computationally intensive. However, it is difficult to perform video downscaling in the discrete cosine transform (DCT)- domain since the prediction errors of each frame are computed from its immediate past higher resolution frames. Recently, a fast algorithm for DCT domain image downsampling has been proposed to obtain the downsampled version of DCT coefficients with low computational complexity. However, there is a mismatch between the downsampled version of DCT coefficients and the resampled motion vectors. In other words, significant quality degradation is introduced when the derivation of the original motion vectors and the resampled motion vector is large. In this paper, we propose a new architecture to obtain resampled DCT coefficients in the DCT domain by using the split and merge technique. Using our proposed video transcoder architecture, a macroblock is splitted into two regions: dominant region and the boundary region. The dominant region of the macroblock can be transcoded in the DCT domain with low computational complexity and re-encoding error can be avoided. By transcoding the boundary region adaptively, low computational complexity can also be achieved. More importantly, the re-encoding error introduced in the boundary region can be controlled more dynamically. Experimental results show that our proposed video downscaling transcoder can lead to significant computational savings as well as videos with high quality as compared with the conventional approach. The proposed video transcoder is useful for video servers that provide quality service in real-time for heterogeneous clients. 相似文献

20.

一种MPEG4标准的快速及高效视频编码器

杨永杰《现代电子技术》2004,27(19):35-37,40

提出了一个基于MPEG4的快速高效存储的DCT视频编码器，用他把一个H．261视频数据流转换为一个低比特率、低空间分辨率的MPEG4流，并进行无线视频处理。和现有编码器相比，他不仅能够大大地节省实际需要存储空间，计算复杂度也降低了许多，实验结果表明设计的编码器得到的视频质量与象素领域方法得到的视频质量相当。相似文献