首页 | 本学科首页   官方微博 | 高级检索  
     


Motion analysis in 3D DCT domain and its application to video coding
Affiliation:1. Vrije Universiteit Brussel (VUB), Department of Electronics and Informatics, Pleinlaan2, B-1050 Brussels, Belgium;2. imec, Kapeldreef 75, B-3001 Leuven, Belgium;1. Department of Mathematics, University of Jaén, Spain;2. Department of Statistics and Operations Research, University of Granada, Spain;3. Department of Mathematics Education and RINS, Gyeongsang National University, Republic of Korea;1. Department of Mathematics, UFPR, Setor de Ciências Exatas, Centro Politécnico, CP 19081, Jd das Américas, CEP 81531-990 Curitiba, Paraná, Brazil;2. Universidade Tecnológica Federal do Paraná (UTFPR), Departamento Acadêmico de Matemática, Av. Sete de Setembro, 3165, Rebouças, CEP 80230-901 Curitiba, Paraná, Brazil;3. CEMAT and Department of Mathematics, Instituto Superior Técnico, Universidade de Lisboa, Av. Rovisco Pais 1, 1049-001 Lisboa, Portugal;1. Department of Mathematics, George Washington University, Washington, DC 20052, USA;2. Institute of Computational Science, Universitá della Svizzera Italiana, CH-6904 Lugano, Switzerland;3. Department of Chemistry, George Washington University, Washington, DC 20052, USA
Abstract:Global, constant-velocity, translational motion in an image sequence induces a characteristic energy footprint in the Fourier-transform (FT) domain; spectrum is limited to a plane with orientation defined by the direction of motion. By detecting these spectral occupancy planes, methods have been proposed to estimate such global motion. Since the discrete cosine transform (DCT) is a ubiquitous tool of all video compression standards to date, we investigate in this paper properties of motion in the DCT domain. We show that global, constant-velocity, translational motion in an image sequence induces in the DCT domain spectral occupancy planes, similarly to the FT domain. Unlike in the FT case, however, these planes are subject to spectral folding. Based on this analysis, we propose a motion estimation method in the DCT domain, and we show that results comparable to standard block matching can be obtained. Moreover, by realizing that significant energy in the DCT domain concentrates around a folded plane, we propose a new approach to video compression. The approach is based on 3D DCT applied to a group of frames, followed by motion-adaptive scanning of DCT coefficients (akin to “zig-zag” scanning in MPEG coders), their adaptive quantization, and final entropy coding. We discuss the design of the complete 3D DCT coder and we carry out a performance comparison of the new coder with ubiquitous hybrid coders.
Keywords:
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号