共查询到20条相似文献,搜索用时 31 毫秒
1.
Equipped with 512-bit wide SIMD inst d large numbers of computing cores, the emerging x86-based Intel(R) Many Integrated Core (MIC) Architecture ot only high floating-point performance, but also substantial off-chip memory bandwidth. The 3D FFT (three-di fast Fourier transform) is a widely-studied algorithm; however, the conventional algorithm needs to traverse the three times. In each pass, it computes multiple 1D FFTs along one of three dimensions, giving rise to plenty of rided memory accesses. In this paper, we propose a two-pass 3D FFT algorithm, which mainly aims to reduce of explicit data transfer between the memory and the on-chip cache. The main idea is to split one dimension into ensions, and then combine the transform along each sub-dimension with one of the rest dimensions respectively erence in amount of TLB misses resulting from decomposition along different dimensions is analyzed in detail. el parallelism is leveraged on the many-core system for a high degree of parallelism and better data reuse of loc On top of this, a number of optimization techniques, such as memory padding, loop transformation and vectoriz employed in our implementation to further enhance the performance. We evaluate the algorithm on the Intel(R) PhiTM coprocessor 7110P, and achieve a maximum performance of 136 Gflops with 240 threads in offload mode, which ts the vendor-specific Intel(R)MKL library by a factor of up to 2.22X. 相似文献
2.
Object contours contain important visual information which can be applied to numerous vision tasks. As recent algorithms focus on the accuracy of contour detection, the entailed time complexity is sign... 相似文献
3.
Pipeline parallelism is a popular parallel programming pattern for emerging applications. However, program-ming pipelines directly on conventional multithreaded shared memory is di?cult and error-prone... 相似文献
4.
Determinism is very useful to multithreaded programs in debugging, testing, etc. Many deterministic ap-proaches have been proposed, such as deterministic multithreading (DMT) and deterministic replay. ... 相似文献
5.
6.
1 Introduction The hull surface is an undevelopable curved surface, because of some special demand, it needs to be coated with rectangular tiles. While under the condition of permitting certain gap of 15±3mm between tiles and presenting pleasing overall appearance, these tiles must completely cover the hull surface, so layout is very necessary before tiles’ coating process. The previous method is to lay planar tiles on the whole hull surface’s flattened plane, only along frame line directio… 相似文献
7.
8.
《Computer》2010,43(1):14-16
3D content still is not widely found on the Web. Now, though, several new technologies may widen 3D's presence on the Web by transforming browsers into computing platforms powerful enough to play the content. 相似文献
9.
AFASTHIDDEN-LINEREMOVALALGORITHMFOR3-DIMENSIONALBUILDINGSQinKaihuai;TongGeliang;ZhangNan;ChenDailin;LiYungui;ShenWenduAFASTHI... 相似文献
10.
11.
In this work, we present the concept, design and implementation of a new software to visualize and segment 3-dimensional medical
data. The main goal was to create a platform that would allow trying out new approaches and ideas while staying independent
from hardware and operating system, being especially useful for interdisciplinary research groups. A special focus will be
given on fast and interactive volume visualization, and a survey on the use of Virtual Reality (VR) and especially haptic/force
feedback in medical applications will be provided. 相似文献
12.
Dong Jianjia Wang Xiaochun School of Mechanical engineering Xi'an Jiaotong University Xi'an China 《计算机辅助绘图.设计与制造(英文版)》2002,(2)
The initial point and edge are first determined, the subsequent edges on contour are searched by comparing the angles formed by vectors of the candidate edges and the reference vector, repeat this process until the searched edges are closed. The bi-directional list is adopted to store graphic data. Data structure for storing the point and graphic entity is also discussed. 相似文献
13.
The monotone line search schemes have been extensively used in the iterative methods for solving various optimization problems.
It is well known that the non-monotone line search technique can improve the likelihood of finding a global optimal solution
and the numerical performance of the methods, especially for some difficult nonlinear problems. The traditional non-monotone
line search approach requires that a maximum of recent function values decreases. In this paper, we propose a new line search
scheme which requires that a convex combination of recent function values decreases. We apply the new line search technique
to solve unconstrained optimization problems, and show the proposed algorithm possesses global convergence and R-linear convergence under suitable assumptions. We also report the numerical results of the proposed algorithm for solving
almost all the unconstrained testing problems given in CUTEr, and give numerical comparisons of the proposed algorithm with
two famous non-monotone methods. 相似文献
14.
15.
《计算机辅助设计与制造》2014,(3):1-1
AVEVA近日发布了AVEVA E3D Insight,这是一款基于Windows 8.1与微软合作开发完成的一款产品,专门为客户提供在全球范围内随时随地查看、评估和审批设计的能力。同时,AVEVA E3D Insight优化了设计评估与审批过程,为精益业务实践提供支持。它使授权用户随时随地都可以直接获取实时的AVEVA E3D设计模型,从而实现项目效益的最大化。AVEVA首席技术官兼工程设计与系统负责人Dave Wheeldon说:AVEVA E3D Insight能够帮助客户提高业务水平,实现多个项目间更加有效的合 相似文献
16.
17.
We present the first linear algorithm for the random sampling from regular languages. More precisely, for generating a uniformly random word of length n in a given regular language, our algorithm has worst-case space bit-complexity O(n) and mean time bit-complexity?O(n). The previously best algorithm, due to Denise and Zimmermann (Theor. Comp. Sci. 218(2):233–248, 1999), has worst-case space bit-complexity O(n 2) and mean time bit-complexity O(nlog?(n)). The Denise et al. algorithm was obtained by performing a floating-point optimization on the general recursive method formalized by Nijenhuis and Wilf (and further developed by Flajolet, Zimmermann and Van Cutsem). Our algorithm combines the floating-point optimization with a new divide-and-conquer scheme. 相似文献
18.
正最近亚马逊推出一款Fire Phone手机,这是一款以3D为功能卖点的手机,号称采用了创新的3D显示技术。那么Fire Phone的3D到底是一项什么样的新技术?让我们看看它和之前的3D手机有什么不同,它又是怎么实现在手机上显示3D图像的。了解手机上的3D显示技术 相似文献
19.
Automatic reconstruction of 3D objects from 2D orthographic views has been a major research issue in CAD/CAM.In this paper,two acceleratin techniques to improve the efficiency of reconstruction are presented.First,some peudo elements are removed by depth and topology information as soon as the wire-frame is constructed ,which reduces the searching space.Second.the proposed algorithm does not establish all possible surfaces in the process of generating 3D faces.The surfaces and edge loops are generated by using the relationship between the boundaries of 3D faces and their projections,This avoids the growth in combinational complexity of previous methods that have to check all possible pairs of 3D candidate edges. 相似文献