共查询到18条相似文献,搜索用时 46 毫秒
1.
本文给出了视觉感知编码方法的综述。首先介绍了人类视觉系统(HVS)的特性,然后根据视觉感知编码方法所利用的视觉特性不同将其分为三类,并分别介绍各类方法的研究现状。最后展望视觉感知编码的应用前景和发展方向。 相似文献
2.
在当今这个数据爆炸增长的时代,深度学习强大的非线性建模能力进一步提高了多视点视频编码的率失真性能.对此,介绍多视点视频的特点及应用,重点对深度学习背景下现有的多视点视频编码方法进行介绍和总结,并展望多视点视频编码未来的发展趋势,旨在进一步发挥深度学习的能力,为用户提供更佳的观看体验. 相似文献
3.
4.
基于感知的多视点视频编码宏块模式选择快速算法 总被引:2,自引:2,他引:0
多视点视频编码(MVC)采取可变块模式选择技术和 多参考帧技术显著提高了编码的压缩效率,但同时带来了巨大的 编码计算复杂度。为了降低MVC的计算复杂度,提出基于感知的快速MVC宏块模式选择 算法。基于人眼视觉感知的特点,利用视觉恰可察觉失真(JND)的概念建立MVC宏块的最优模 式和JND的联系,并利用该联系确定早期结束最优宏块模式选择过程的阈值,根据当前编码 宏块的JND与阈值的 关系自适应地减少每个编码宏块的模式搜索次数,进而减少MVC的方向搜索和参考帧搜索的 次数,以降 低编码的复杂度,提高MVC速度。实验结果显示,对于不同运动特性、内容、纹理信息、相 机间距和图像 尺寸的测试序列,提出的快速算法在率失真性能几乎不变的情况下平均节约76.00% 编码时间。 相似文献
5.
6.
7.
随着虚拟现实、增强现实等沉浸式媒体技术的发展,沉浸式视频的表示、存储、传输和显示等各个环节都受到了科研及产业界的广泛关注。沉浸式视频更复杂的视频特性和庞大的数据量,对传统视频编码技术提出了挑战,新的编码技术应运而生。该文从视频自由度(DoF)出发,分别从3DoF和6DoF两个方面介绍沉浸式视频编码技术的最新成果。3DoF视频相关编码技术包括投影模型、运动估计模型和3DoF视频编码标准。6DoF视频相关编码技术包括视频表示形式、虚拟视点合成技术、6DoF视频编码技术及运动图像专家组沉浸式视频(MPEG, MIV)编码标准。最后,对沉浸式视频及其编码技术的发展进行总结和展望。 相似文献
8.
视频编码中,率失真优化是提升编码性能的重要环节。在新一代视频编码标准——多功能视频编码(Versatile Video Coding,VVC)中,率失真优化关键参数λ的确定仍是基于与量化参数(Quantization Parameter,QP)之间相对固定的映射关系模式,而人眼对于视频内容特征的感知并未得到充分考虑,因此提出了用于多功能视频编码标准VVC的感知率失真优化算法。首先提取视频帧中编码树单元(Coding Tree Unit,CTU)的纹理和运动特征,其次结合人眼对于纹理复杂度和运动程度的不同感知,对用于CTU编码的拉格朗日乘子λ进行自适应调节。在VVC标准测试软件VTM12.0平台的低延时P帧配置下,采用多尺度结构相似性和峰值信噪比作为视频质量评价指标,在保持视频质量不变的情况下,码率平均节省0.15%和0.09%,VVC编码器率失真性能得以提升。 相似文献
9.
10.
分析了视频数据的3D小波系数分布特性,提出了一种基于混合3D树型结构和HVS特性的视频可分级编码算法.首先,依据小波低、高频系数的自相关性,确定相应的树型结构来扫描和处理时间维上的低、高频系数,明显减少了用于定位重要系数的同步信息;其次,依据人类视觉系统对各频率子带敏感程度的不同,对各子带系数进行加权,使得重构视频的重要系数得以排在码流前端,从而在很大程度上提高了中低码率下视频的重构质量.对多种标准测试视频的仿真实验验证了本文算法的有效性,与非对称树型结构编码方案和单一时空方向树结构方案相比,该算法解码图像的Y、U和V 3个分量的均峰值信噪比分别高出0.65dB、1.75dB、1.77dB和0.23dB、2.11dB、1.72dB.此外,算法有效抑制了振铃效应,并获得了更好的主观效果. 相似文献
11.
In this paper, we propose perceptual stereoscopic video coding using a disparity just-noticeable-distortion (JND) model. We obtain the disparity JND model in stereo videos by disparity masking effects of the human visual system (HVS). The disparity JND model represents the maximum distortion of stereo perception that HVS cannot perceive. Based on the disparity JND model, we adjust prediction residuals to remove the perceptual redundancy of stereo videos. Thus, we achieve significant bit-rate saving while maintaining visual quality. Experimental results demonstrate that the proposed method significantly improves coding efficiency without loss of stereoscopic perceptual quality. 相似文献
12.
Phi Bang Nguyen Azeddine Beghdadi Marie Luong 《Signal Processing: Image Communication》2013,28(10):1506-1525
In this paper, a new watermarking scheme based on Human Visual System (HVS) modeling is proposed. The approach consists in building computational models which take into account the most common properties of the HVS that can be exploited for watermarking. Two schemes for embedding and controlling the transparency of the watermark are presented, namely the implicit and the explicit schemes. Both schemes are designed in the framework of the pyramidal decomposition which has been shown to be a powerful tool for analyzing image through a multi-scale representation. For the first approach, a multi-scale visibility map is used to optimize the watermark embedding process. The second approach makes use of HVS properties in an explicit and more sophisticated manner that consists in tuning the watermark strength just beneath the visual detection threshold. A new JND (Just-Noticeable-Difference) model for determining this threshold is then proposed and evaluated. The obtained results provide a strong support for this new JND model. 相似文献
14.
15.
在TM5算法的基础上,提出了一种MPEG-2码率控制算法。仿真实验表明,算法对有场景切换连续序列解码后的图象主客观质量有明显改善。 相似文献
16.
Spatial and temporal inconsistency of depth video deteriorates encoding efficiency in three dimensional video systems. A depth video processing algorithm based on human perception is presented. Firstly, a just noticeable rendering distortion (JNRD) model is formulated by combining the analyses of the influence of depth distortion on virtual view rendering with human visual perception characteristics. Then, depth video is processed based on the JNRD model from two aspects, spatial and temporal correlation enhancement. During the process of spatial correlation enhancement, depth video is segmented into edge, foreground, and background regions, and smoothened by Gaussian and mean filters. The operations of the temporal correlation enhancement include temporal–spatial transpose (TST), temporal smoothing filter and inverse TST. Finally, encoding and virtual view rendering experiments are conducted to evaluate the proposed algorithm. Experimental results show that the proposed algorithm can greatly reduce the bit rate while it maintains the quality of virtual view. 相似文献
17.
With the development of modern imaging techniques, every medical examination would result in a huge volume of image data. Analysis, storage and/or transmission of these data demands high compression without any loss of diagnostically significant data. Although, various 3-D compression techniques have been proposed, they have not been able to meet the current requirements. This paper proposes a novel method to compress 3-D medical images based on human vision model to remove visually insignificant information. The block matching algorithm applied to exploit the anatomical symmetry remove the spatial redundancies. The results obtained are compared with those of lossless compression techniques. The results show better compression without any degradation in visual quality. The rate-distortion performance of the proposed coders is compared with that of the state-of-the-art lossy coders. The subjective evaluation performed by the medical experts confirms that the visual quality of the reconstructed image is excellent. 相似文献
18.
A salience computation approach based on perceptual distinctness is proposed in this paper that predicts human eye fixation locations in images. The approach uses a novel representation of pattern in a patch around a pixel such that discrimination between patterns is in compliance with distinctness between them in-terms of a few well-known perceptual cues. Image patches of different sizes (resolutions) are considered. The representations of the value at and pattern around a pixel in different features such as lightness, color and orientation are discriminated from all those in the rest of the image to compute salience at that pixel. The effectiveness of the proposed salience computation approach in generating salience maps closer to human eye fixation density maps is demonstrated by comparing it both qualitatively and quantitatively to other existing state-of-the-art salience computation approaches. Several synthetic images and psychological patterns, and real-life images from five well-known datasets are considered for this purpose. 相似文献