1.
A multiview 3D modeling system based on stereo vision techniques (total citations: 9; self-citations: 0; citations by others: 9)
This paper introduces a stereo vision system to automatically generate 3D models of real objects. 3D model generation is based on the merging of multiview range images obtained from a digital stereo camera. Stereo images obtained from the camera are rectified, and a correlation-based stereo matching technique reconstructs range images from them. A turntable stage is also employed to obtain multiple range images of the objects. To register range images into a common coordinate system automatically, we introduce and calibrate a turntable coordinate system with respect to the camera coordinate system. After the registration of multiview range images, a 3D model is reconstructed using a volumetric integration technique. Error analysis on turntable calibration and 3D model reconstruction shows the accuracy of our 3D modeling system. Received: 2 August 2003, Accepted: 20 September 2004, Published online: 25 February 2005
Correspondence to: S.Y. Park
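The turntable registration step in the abstract above can be illustrated with a rotation about a calibrated axis. This is a minimal sketch, not the authors' method: it assumes the calibration has already produced a point on the turntable axis and the axis direction in camera coordinates, and the names `rotate_about_axis`, `origin`, and `axis` are hypothetical.

```python
import math

def rotate_about_axis(p, origin, axis, angle):
    """Rotate 3D point p by `angle` radians about the line through `origin`
    with unit direction `axis`, using Rodrigues' rotation formula."""
    v = [p[i] - origin[i] for i in range(3)]          # point relative to the axis
    c, s = math.cos(angle), math.sin(angle)
    cross = [axis[1] * v[2] - axis[2] * v[1],          # axis x v
             axis[2] * v[0] - axis[0] * v[2],
             axis[0] * v[1] - axis[1] * v[0]]
    dot = sum(axis[i] * v[i] for i in range(3))        # axis . v
    r = [v[i] * c + cross[i] * s + axis[i] * dot * (1.0 - c) for i in range(3)]
    return [r[i] + origin[i] for i in range(3)]

# Assumed example: a point seen after the stage turned by 90 degrees is
# brought back into the reference view by rotating it by -90 degrees.
registered = rotate_about_axis([1.0, 2.0, 0.0], [1.0, 1.0, 0.0],
                               [0.0, 0.0, 1.0], -math.pi / 2)
```

Applying the same rotation to every point of a range image, with the angle set by the stage position, places all views in one coordinate system.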
2.
Aljoscha Smolic 《Pattern recognition》2011,44(9):1958-1968
This paper gives an end-to-end overview of 3D video and free viewpoint video, which can be regarded as advanced functionalities that expand the capabilities of a 2D video. Free viewpoint video can be understood as the functionality to freely navigate within real world visual scenes, as it is known for instance from virtual worlds in computer graphics. 3D video shall be understood as the functionality that provides the user with a 3D depth impression of the observed scene, which is also known as stereo video. In that sense as functionalities, 3D video and free viewpoint video are not mutually exclusive but can very well be combined in a single system. Research in this area combines computer graphics, computer vision and visual communications. It spans the whole media processing chain from capture to display and the design of systems has to take all parts into account, which is outlined in different sections of this paper giving an end-to-end view and mapping of this broad area. The conclusion is that the necessary technology including standard media formats for 3D video and free viewpoint video is available or will be available in the future, and that there is a clear demand from industry and user for such advanced types of visual media. As a consequence we are witnessing these days how such technology enters our everyday life.
3.
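Register stacks play an important role in reducing memory accesses during procedure calls. However, using stacked registers is not always free, and the cost of a register stack overflow can sometimes be very high. To address this, the paper proposes an algorithm that targets the excessive stack overflow cost caused by heavy use of stacked registers in self-recursive functions. It improves the simplification phase of register allocation and proposes an optimization that relieves register pressure. The algorithm is implemented in ORC (Open Research Compiler, an open-source compiler for IA-64). Experimental results on IA-64 show a clear optimization effect for self-recursive functions that are executed frequently and have high register pressure.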
4.
Continuous Global Optimization in Multiview 3D Reconstruction (total citations: 1; self-citations: 0; citations by others: 1)
Kalin Kolev Maria Klodt Thomas Brox Daniel Cremers 《International Journal of Computer Vision》2009,84(1):80-96
In this article, we introduce a new global optimization method to the field of multiview 3D reconstruction. While global minimization has been proposed in a discrete formulation in form of the maxflow-mincut framework, we suggest the use of a continuous convex relaxation scheme. Specifically, we propose to cast the problem of 3D shape reconstruction as one of minimizing a spatially continuous convex functional. In qualitative and quantitative evaluation we demonstrate several advantages of the proposed continuous formulation over the discrete graph cut solution. Firstly, geometric properties such as weighted boundary length and surface area are represented in a numerically consistent manner: The continuous convex relaxation assures that the algorithm does not suffer from metrication errors in the sense that the reconstruction converges to the continuous solution as the spatial resolution is increased. Moreover, memory requirements are reduced, allowing for globally optimal reconstructions at higher resolutions. We study three different energy models for multiview reconstruction, which are based on a common variational template unifying regional volumetric terms and on-surface photoconsistency. The three models use data measurements at increasing levels of sophistication. While the first two approaches are based on a classical silhouette-based volume subdivision, the third one relies on stereo information to define regional costs. Furthermore, this scheme is exploited to compute a precise photoconsistency measure as opposed to the classical estimation. All three models are compared on standard data sets demonstrating their advantages and shortcomings. For the third one, which gives the most accurate results, a more exhaustive qualitative and quantitative evaluation is presented.
5.
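Research status and development of computer-generated integral imaging (total citations: 4; self-citations: 0; citations by others: 4)
Integral imaging (II) is an imaging technique that uses a microlens array to record and display true three-dimensional scenes. Several of its distinctive properties give it broad application prospects in 3D television (3DTV) and visualization. Because of limits imposed by hardware and by optical imaging itself, computer-generated integral imaging (CGII) has become an important tool in integral imaging research. Starting from the principles of integral imaging, this paper systematically reviews the CGII literature, analyzes in detail the current state of research and the key problems in computer-generated integral imaging, and outlines future development trends for the technique.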
6.
7.
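Optical measurement of three-dimensional attitude angles determines an object's angular attitude by processing optical images of its surface. To improve the accuracy of optical detection of a target's 3D attitude angles, a design method is proposed for a large-field-of-view 3D attitude-angle optical measurement system based on stereo vision. The system uses dual-reference parallel collimated light sources and an attitude sensor, and implements light-spot segmentation and centroid localization with an image sensor, a USB interface chip, and other components. Four software modules are designed on top of the Linux kernel: signal acquisition, data porting, serial communication, and attitude-angle analysis. A stereo vision algorithm is introduced to establish the correspondence between target points and image points, and the parallax principle is used to determine the three-dimensional coordinates of the spatial angle, achieving large-field-of-view 3D attitude-angle measurement. Experimental results show that the system has small attitude-angle calibration error and strong time synchronization, with a cumulative measurement time below 15 s, and that it offers high measurement accuracy and robustness, providing an effective technical means for dimensional measurement of precision parts in mechanical equipment.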
8.
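For the general problem of 3D recovery in "large-rotation-angle binocular vision", this paper proposes a series of least-squares methods for computing the 3D coordinates of an object, including the automatic matching problem. The method overcomes the difficulty of finding corresponding image points caused by large rotation-angle changes and occlusion, and it outperforms existing methods in data accuracy, robustness, computational complexity, guaranteed orthogonality of the rotation matrix, and the degree to which 3D motion is recovered directly. Computations on real image data give results that meet practical requirements.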
9.
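Objective: Multiview Video Coding (MVC) is the coding standard that international standards bodies developed for 3D video. Supporting real-time encoding and compatibility with the existing H.264 standard are pressing problems for the wider adoption of MVC. Method: Based on topological sorting, this paper proposes a mapping that transforms a two-dimensional prediction structure into a one-dimensional reference sequence and, using existing H.264 coding-unit technology, implements bidirectional real-time MVC-H.264 transcoding. Results: Experiments show that the method is compatible with existing H.264 codecs and supports real-time high-definition MVC encoding; its effectiveness is verified in a practical live 3D video broadcast application. Conclusion: The paper gives a mechanism for mapping prediction reference structures and converting bitstreams between the MVC and H.264 standards, achieving compatibility with both standards without a dedicated chip.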
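The idea of flattening a two-dimensional (view by time) prediction structure into a one-dimensional sequence by topological sorting can be sketched as follows. This is only an illustration under assumed inputs, not the paper's transcoder: the frame labels and the dependency graph `deps` are hypothetical.

```python
from graphlib import TopologicalSorter

# Hypothetical prediction structure: each key is a frame labeled (view, time),
# and its value is the set of frames it uses as prediction references.
deps = {
    ("V0", 0): set(),
    ("V1", 0): {("V0", 0)},              # inter-view reference
    ("V0", 1): {("V0", 0)},              # temporal reference
    ("V1", 1): {("V1", 0), ("V0", 1)},   # temporal and inter-view references
}

def coding_order(dependencies):
    """Return a 1D order in which every frame comes after all of its references."""
    return list(TopologicalSorter(dependencies).static_order())

order = coding_order(deps)
```

Any order returned this way lets a single-stream encoder process the frames one after another, because each frame's references have already been coded when it is reached.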
10.
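Research on RBF networks in stereo vision systems (total citations: 3; self-citations: 0; citations by others: 3)
Camera calibration, stereo rectification, and 3D surface reconstruction are important topics in stereo vision research. This paper exploits the function approximation and interpolation capabilities of RBF networks and applies them to all three. For camera calibration, the calibration plane is placed at several positions within the effective field of view to obtain a complete set of samples; after the RBF network is trained, the geometric imaging relationship of the stereo vision system is stored in the network. For stereo rectification, the epipolar-line property is used: the RBF network determines a set of epipolar lines in the image, the epipole position is then found by solving an extremum problem, and the rectifying transformation matrix is finally obtained by an optimization method. For 3D reconstruction, the vision model established during camera calibration is used to reconstruct a 3D surface consistent with the image information. Compared with traditional methods, the algorithm offers fast reconstruction, high computational accuracy, and a simple, clear procedure. Experiments on a real vision system demonstrate its correctness and effectiveness.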
11.
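With the rapid development of broadband networks, networked stereoscopic television will become a mainstream video service. This paper proposes a solution for a P2P networked stereoscopic television playback system with real-time error correction: video is transmitted using P2P technology, and errors are corrected with LDPC codes. Experiments show that, under Gaussian noise, the PSNR of the error-corrected stereoscopic video is on average about 15 dB higher than that of the uncorrected video. In tests on the CERNET2 network, the system fully ensured normal viewing of stereoscopic video. The system also provides authentication and is compatible with both IPv4 and IPv6 network environments, and can strongly support the future commercialization of networked stereoscopic television.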
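For readers unfamiliar with the PSNR figure quoted in the abstract above, the metric can be computed as below. This is a generic sketch of the standard definition, assuming 8-bit samples (peak value 255) and flat pixel sequences; it says nothing about how the authors measured their videos.

```python
import math

def psnr(reference, degraded, peak=255.0):
    """Peak signal-to-noise ratio in dB between two equal-length pixel sequences."""
    if len(reference) != len(degraded):
        raise ValueError("sequences must have the same length")
    # Mean squared error between corresponding samples.
    mse = sum((a - b) ** 2 for a, b in zip(reference, degraded)) / len(reference)
    if mse == 0:
        return math.inf  # identical signals
    return 10.0 * math.log10(peak ** 2 / mse)
```

Because PSNR is logarithmic in the mean squared error, a gain of 15 dB corresponds to the error shrinking by a factor of 10^1.5, roughly 31.6.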
12.
13.
14.
Chunxiao Chen Jing Wang Yupin Liu Xin Chen 《Journal of the Society for Information Display》2017,25(8):524-530
Fatigue caused by prolonged watching of 3DTV has drawn considerable attention because of concerns about viewer safety. We used regional homogeneity (ReHo) to measure the local synchronization of resting‐state functional magnetic resonance imaging signals both before and after watching 2D or 3D television. Twenty normal subjects were each measured four times: scans before watching television (3D‐Pre/2D‐Pre) and immediately after watching television for 1 h (3D‐Post/2D‐Post). The variation of ReHo was investigated in the 2D and 3D groups, and the influence of watching 2D/3D TV on the viewers was then estimated. Compared with the 3D‐Pre, the 3D‐Post showed significantly higher ReHo in the right inferior occipital gyrus (BA18/19), the right middle occipital gyrus (BA18/19), the left postcentral gyrus (BA2/3/4/7), and a small area of BA9/10 in the left frontal lobe. Additionally, regions of increased ReHo in the 2D‐Post were observed in the left medial frontal gyrus (BA9/10/32), left cingulate gyrus (BA24), and right anterior cingulate (BA32) as compared with the 2D‐Pre. For the 2D group, subjects mainly felt mental fatigue, which could be caused by prolonged attention. For the 3D group, watching TV primarily caused visual fatigue, because of the constant change of depth of focus, together with mild mental fatigue. The study indicates the adverse effects of 3DTV on visual function.
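ReHo is commonly defined as Kendall's coefficient of concordance (KCC) over the time series of a voxel and its neighbours; the abstract does not say how the authors computed it, so the following is only a sketch of that common definition. It assumes no tied values within a series, and the function names are hypothetical.

```python
def ranks(values):
    """1-based ranks of the values; assumes no ties (a simplification)."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0] * len(values)
    for rank, idx in enumerate(order, start=1):
        result[idx] = rank
    return result

def kendalls_w(series):
    """Kendall's W for k time series of n time points each (0 = none, 1 = full concordance)."""
    k, n = len(series), len(series[0])
    ranked = [ranks(s) for s in series]
    # Sum of ranks across all series at each time point.
    rank_sums = [sum(r[t] for r in ranked) for t in range(n)]
    mean = k * (n + 1) / 2
    s = sum((x - mean) ** 2 for x in rank_sums)
    return 12 * s / (k ** 2 * (n ** 3 - n))
```

A value near 1 means the neighbouring time series rise and fall together, which is what "local synchronization" refers to in the abstract.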
15.
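The use of mirror imaging to acquire 3D information about a measured object or scene is attracting growing attention from researchers. Mirror imaging arises when light rays interact with a planar or curved mirror. The reflective properties of planar mirrors can improve visual results. The ray-unfolding procedure can be applied to different planar-mirror imaging systems: replacing the mirror interactions in a 3D scene with unfolded light paths yields a virtual 3D space, so that complex ray interactions can be visualized in a virtual manner and changes of coordinate system are easy to track. Curved-mirror imaging generally lacks the perspective projection property, and the displayed space changes with the curvature of the surface. Curved mirrors often lead to catadioptric imaging, so geometric recovery algorithms must be designed for each kind of 3D stereo vision measurement and reconstruction task. From the perspectives of computer graphics and computer vision, this paper analyzes the basic principles of mirror imaging and surveys typical recent 3D measurement and reconstruction methods based on mirror imaging, together with the latest research progress.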
16.
Chufeng Xiao Chu Han Zhuming Zhang Jing Qin Tien-Tsin Wong Guoqiang Han Shengfeng He 《Computer Graphics Forum》2020,39(1):20-33
We propose a novel deep example-based image colourization method called dense encoding pyramid network. In our study, we define the colourization as a multinomial classification problem. Given a greyscale image and a reference image, the proposed network leverages large-scale data and then predicts colours by analysing the colour distribution of the reference image. We design the network as a pyramid structure in order to exploit the inherent multi-scale, pyramidal hierarchy of colour representations. Between two adjacent levels, we propose a hierarchical decoder–encoder filter to pass the colour distributions from the lower level to higher level in order to take both semantic information and fine details into account during the colourization process. Within the network, a novel parallel residual dense block is proposed to effectively extract the local–global context of the colour representations by widening the network. Several experiments, as well as a user study, are conducted to evaluate the performance of our network against state-of-the-art colourization methods. Experimental results show that our network is able to generate colourful, semantically correct and visually pleasant colour images. In addition, unlike fully automatic colourization that produces fixed colour images, the reference image of our network is flexible; both natural images and simple colour palettes can be used to guide the colourization.
17.
18.
A. Dipanda S. Woo F. Marzani J.M. Bilbault 《Pattern recognition》2003,36(9):2143-2159
The recovery of 3-D shape information (depth) using stereo vision analysis is one of the major areas in computer vision and has given rise to a great deal of literature in the recent past. The widely known stereo vision methods are the passive stereo vision approaches that use two cameras. Obtaining 3-D information involves the identification of the corresponding 2-D points between left and right images. Most existing methods tackle this matching task from singular points, i.e. finding points in both image planes with more or less the same neighborhood characteristics. One key problem we have to solve is that we are on the first instance unable to know a priori whether a point in the first image has a correspondence or not due to surface occlusion or simply because it has been projected out of the scope of the second camera. This makes the matching process very difficult and imposes a need of an a posteriori stage to remove false matching. In this paper we are concerned with the active stereo vision systems which offer an alternative to the passive stereo vision systems. In our system, a light projector that illuminates objects to be analyzed by a pyramid-shaped laser beam replaces one of the two cameras. The projections of laser rays on the objects are detected as spots in the image. In this particular case, only one image needs to be treated, and the stereo matching problem boils down to associating the laser rays and their corresponding real spots in the 2-D image. We have expressed this problem as a minimization of a global function that we propose to perform using Genetic Algorithms (GAs). We have implemented two different algorithms: in the first, GAs are performed after a deterministic search. In the second, data is partitioned into clusters and GAs are independently applied in each cluster. In our second contribution in this paper, we have described an efficient system calibration method.
Experimental results are presented to illustrate the feasibility of our approach. The proposed method yields high accuracy 3-D reconstruction even for complex objects. We conclude that GAs can effectively be applied to this matching problem.
19.
This paper describes an algorithm for 3D reconstruction of a smooth surface with a relatively dense set of self-similar point features from two calibrated views. We bypass the usual correspondence problem by triangulating a point in space from all pairs of features satisfying the epipolar constraint. The surface is then extracted from the resulting point cloud by taking advantage of the statistical and geometric properties of the point distribution on the surface. Results are presented for computer simulations and for a laboratory experiment on a silicon gel phantom used in a breast cancer screening project.
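The first step described in the abstract above, collecting every pair of features that satisfies the epipolar constraint, can be sketched as follows. This is an illustrative sketch only, not the authors' code: it assumes a known fundamental matrix `F`, homogeneous pixel coordinates, and a simple algebraic tolerance `tol`; the example `F` is a hypothetical one for a rectified pair, where the constraint reduces to matching image rows.

```python
def epipolar_residual(F, x1, x2):
    """Algebraic residual x2^T F x1 for homogeneous image points x1 and x2."""
    Fx1 = [sum(F[i][j] * x1[j] for j in range(3)) for i in range(3)]
    return sum(x2[i] * Fx1[i] for i in range(3))

def candidate_pairs(F, pts1, pts2, tol=1e-6):
    """Index pairs (i, j) whose points satisfy the epipolar constraint within tol."""
    return [(i, j)
            for i, p in enumerate(pts1)
            for j, q in enumerate(pts2)
            if abs(epipolar_residual(F, p, q)) <= tol]

# Hypothetical rectified stereo pair: the residual equals y1 - y2.
F_rectified = [[0.0, 0.0, 0.0],
               [0.0, 0.0, -1.0],
               [0.0, 1.0, 0.0]]
left = [(10.0, 5.0, 1.0), (20.0, 8.0, 1.0)]
right = [(7.0, 5.0, 1.0), (3.0, 5.0, 1.0), (15.0, 8.0, 1.0)]
pairs = candidate_pairs(F_rectified, left, right)
```

Each pair in the result would then be triangulated; ambiguous features yield several candidate 3D points, which is why the abstract relies on the statistics of the resulting point cloud to extract the surface.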