期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

钱悦《计算机与数字工程》2008,36(12)

由于图形处理器(GPU)最近几年的快速发展,基于 GPU 的通用计算已经成为一个新的研究领域.通过对nVIDIA 公司最新的通用计算 GPU 编程模型-CUDA 的研究,阐明了 CUDA 应用程序的结构和它本身特征,讨论和分析了 CUDA 编程方法与普通 CPU 编程的差别,并以 H.264 数字视频编解码中,以消除宏块边界锯齿为主要目的的去块滤波模块为实例.详细描述了 CUDA 编程的方法和特点,最后通过与 CPU 编程实现的去块滤波模块的性能比较,揭示了 CUDA 在计算能力上的优势,为进一步优化编解码器性能和 GPU 通用计算提供了新的方法和思路. 相似文献

2.

基于统一计算设备架构技术的并行图像处理研究 总被引：1，自引：0，他引：1

杨志义朱娅婷蒲勇《计算机测量与控制》2009,17(4)

对统一计算设备架构CUDA技术进行研究,分析了CUDAGPU的显著特性,总结了CUDA的通用并行程序模式,详细介绍了用CUDA实现直方图均衡化的过程,接着简要介绍了CUDA在其它图像处理算法中的应用;最后对比CPU和GPU计算256级直方图均衡化的时间,实验结果表明随着图像像素的增大,CUDA可以把计算速度提高40多倍,在其它的图像算法中,甚至可以上百倍地提高速度. 相似文献

3.

Performance analysis and optimization strategies for a D3Q19 lattice Boltzmann kernel on nVIDIA GPUs using CUDA

J. Habich T. ZeiserG. Hager G. Wellein 《Advances in Engineering Software》2011,42(5):266-272

This paper presents implementation strategies and optimization approaches for a D3Q19 lattice Boltzmann flow solver on nVIDIA graphics processing units (GPUs). Using the STREAM benchmarks we demonstrate the GPU parallelization approach and obtain an upper limit for the flow solver performance. We discuss the GPU-specific implementation of the solver with a focus on memory alignment and register shortage. The optimized code is up to an order of magnitude faster than standard two-socket x86 servers with AMD Barcelona or Intel Nehalem CPUs. We further analyze data transfer rates for the PCI-express bus to evaluate the potential benefits of multi-GPU parallelism in a cluster environment. 相似文献

4.

Parallel data mining techniques on Graphics Processing Unit with Compute Unified Device Architecture (CUDA)

Liheng Jian Cheng Wang Ying Liu Shenshen Liang Weidong Yi Yong Shi 《The Journal of supercomputing》2013,64(3):942-967

Recent development in Graphics Processing Units (GPUs) has enabled inexpensive high performance computing for general-purpose applications. Compute Unified Device Architecture (CUDA) programming model provides the programmers adequate C language like APIs to better exploit the parallel power of the GPU. Data mining is widely used and has significant applications in various domains. However, current data mining toolkits cannot meet the requirement of applications with large-scale databases in terms of speed. In this paper, we propose three techniques to speedup fundamental problems in data mining algorithms on the CUDA platform: scalable thread scheduling scheme for irregular pattern, parallel distributed top-k scheme, and parallel high dimension reduction scheme. They play a key role in our CUDA-based implementation of three representative data mining algorithms, CU-Apriori, CU-KNN, and CU-K-means. These parallel implementations outperform the other state-of-the-art implementations significantly on a HP xw8600 workstation with a Tesla C1060 GPU and a Core-quad Intel Xeon CPU. Our results have shown that GPU + CUDA parallel architecture is feasible and promising for data mining applications. 相似文献

5.

Lattice Boltzmann simulations of the motion induced by variable surface tension

S. Stensholt A. Øien 《Advances in Engineering Software》2011,42(11):944-953

Methods for implementing variable surface tension in the multiphase Lattice Boltzmann model with the color model and Shan-Chen scheme are tested by analyzing the models’ abilities to reproduce a theoretical result by Levich and Kuznetzov. If the surface tension around a droplet is asymmetrical, the droplet moves towards the side where the surface tension is lower. The droplet’s velocity is proportional to the surface tension gradient, the droplet’s radius, and the inverse of the viscosity. The model is tested to determine whether the simulated droplets move in the manner predicted by theory. Although the discreteness of the underlying lattice causes a spurious oscillation to the velocity, the numerical results concerning the average velocity show a good correspondence between theory and the model in regards to the surface tension gradient and droplet size. The color model also produces good simulations in the scenarios with different viscosities, while the diffusive properties and unknown relationships between the parameters and surface tension in the Shan-Chen model make the numerical results of that model more dubious, even though several of the results are qualitatively in agreement. 相似文献

6.

Performance of a Lattice Quantum Chromodynamics kernel on the Cell processor

J. Spray A. Trew 《Computer Physics Communications》2008,179(9):642-646

The implementation of a proof-of-concept Lattice Quantum Chromodynamics kernel on the Cell processor is described in detail, illustrating issues encountered in the porting process. The resulting code performs up to 45 GFlop/s per socket (without inter-node parallel communications), indicating that the Cell processor is likely to be a good platform for future Lattice QCD calculations. 相似文献

7.

一元化安全结构初探

陈妍妍《计算机安全》2007,(7):38-39,44

该文分析了企业网络的安全需求及其采用的安全策略的不足,阐述了什么是一元化安全结构,一元化安全结构的好处,及目前一元化安全结构的应用。相似文献

8.

Lattice Boltzmann method simulation of electroosmotic stirring in a microscale cavity

Anindya Kanti De Achintya Mukhopadhyay Ishwar K. Puri 《Microfluidics and nanofluidics》2008,4(5):463-470

The suitable surface modification of microfluidic channels can enable a neutral electrolyte solution to develop an electric double layer (EDL). The ions contained within the EDL can be moved by applying an external electric field, inducing electroosmotic flows (EOFs) that results in associated stirring. This provides a solution for the rapid mixing required for many microfluidic applications. We have investigated EOFs generated by applying a steady electric field across a square cavity that has homogenous electric potentials along its walls. The flowfield is simulated using the lattice Boltzmann method. The extent of mixing is characterized for different electrode configurations and electric field strengths. We find that rapid mixing can be achieved by using this simple configuration which increases with increasing electric field strength. The mixing time for water-soluble organic molecules can be decreased by four orders of magnitude by suitable choice of wall zeta potential and electric field. We dedicate this paper to the memory of our colleagues Professors Kevin Granata and Liviu Librescu who fell tragically on April 16, 2007 while answering their call to serve higher education. They continue to inspire us. AM gratefully acknowledges support from Jadavpur University under the World Bank funded Technical Education Quality Improvement Programme of the Government of India and the hospitality of the Virginia Tech ESM Department where he conducted a portion of this work. 相似文献

9.

The effect of the microfluidic diodicity on the efficiency of valve-less rectification micropumps using Lattice Boltzmann Method

Ahmed Fadl Zongqin Zhang Sebastian Geller Jonas Tölke Manfred Krafczyk Donna Meyer 《Microsystem Technologies》2009,15(9):1379-1387

The efficiency of the valve-less rectification micropump depends primarily on the microfluidic diodicity (the ratio of the backward pressure drop to the forward pressure drop). In this study, different rectifying structures, including the conventional structures (nozzle/diffuser and Tesla structures), were investigated at very low Reynolds numbers (between 0.2 and 60). The rectifying structures were characterized with respect to their design, and a numerical approach was illustrated to calculate the diodicity for the rectifying structures. In this study, the microfluidic diodicity was evaluated numerically for different rectifying structures including half circle, semicircle, heart, triangle, bifurcation, nozzle/diffuser, and Tesla structures. The Lattice Boltzmann Method (LBM) was utilized as a numerical method to simulate the fluid flow in the microscale. The results suggest that at very low Reynolds number flow, rectification and multifunction micropumping may be achievable by using a number of the presented structures. The results for the conventional structures agree with the reported results. 相似文献

10.

Multiplicity of steady solutions in two-dimensional lid-driven cavity flows by Lattice Boltzmann Method

D. Arumuga Perumal Anoop K. Dass 《Computers & Mathematics with Applications》2011,61(12):3711-3721

This work is concerned with the computation of two- and four-sided lid-driven square cavity flows and also two-sided rectangular cavity flows with parallel wall motion by the Lattice Boltzmann Method (LBM) to obtain multiple stable solutions. In the two-sided square cavity two of the adjacent walls move with equal velocity and in the four-sided square cavity all the four walls move in such a way that parallel walls move in opposite directions with the same velocity; in the two-sided rectangular lid-driven cavity flow the longer facing walls move in the same direction with equal velocity. Conventional numerical solutions show that the symmetric solutions exist for all Reynolds numbers for all the geometries, whereas multiplicity of stable states exist only above certain critical Reynolds numbers. Here we demonstrate that Lattice Boltzmann method can be effectively used to capture multiple steady solutions for all the aforesaid geometries. The strategy employed to obtain these solutions is also described. 相似文献

11.

Lattice Boltzmann modeling of microchannel flows in the transition flow regime

Q. Li Y. L. He G. H. Tang W. Q. Tao 《Microfluidics and nanofluidics》2011,10(3):607-618

Owing to its kinetic nature and distinctive computational features, the lattice Boltzmann method for simulating rarefied gas flows has attracted significant research interest in recent years. In this article, a lattice Boltzmann (LB) model is presented to study microchannel flows in the transition flow regime, which have gained much attention because of fundamental scientific issues and technological applications in various micro-electro-mechanical system (MEMS) devices. In the model, a Bosanquet-type effective viscosity is used to account for the rarefaction effect on gas viscosity. To match the introduced effective viscosity and to gain an accurate simulation, a modified second-order slip boundary condition with a new set of slip coefficients is proposed. Numerical investigations demonstrate that the results, including the velocity profile, the non-linear pressure distribution along the channel, and the mass flow rate, are in good agreement with the solution of the linearized Boltzmann equation, the direct simulation Monte Carlo (DSMC) results, and the experimental results over a broad range of Knudsen numbers. It is shown that taking the rarefaction effect on gas viscosity into consideration and employing an appropriate slip boundary condition can lead to a significant improvement in the modeling of rarefied gas flows with moderate Knudsen numbers in the transition flow regime. 相似文献

12.

Lattice Boltzmann method on a cluster of IBM RISC system/6000 workstations

G. Betello G. Richelu S. Succi F. Ruello 《Concurrency and Computation》1993,5(4):359-366

An implementation of the lattice Boltzmann method on a homogeneous cluster of IBM RISC System/6000 superscalar workstations is presented. 相似文献

13.

Facilitating the applications of support vector machine by using a new kernel

Rui Zhang Wenjian Wang 《Expert systems with applications》2011,38(11):14225-14230

In the last few years, the applications of support vector machine (SVM) have substantially increased due to the high generalization performance and modeling of non-linear relationships. However, whether SVM behaves well largely depends on its adopted kernel function. The most commonly used kernels include linear, polynomial inner product functions and the Radial Basis Function (RBF), etc. Since the nature of the data is usually unknown, it is very difficult to make, on beforehand, a proper choice from the mentioned kernels. Usually, more than one kernel are applied to select the one which gives the best prediction performance but with a very time-consuming optimization procedure. This paper presents a kernel function based on Lorentzian function which is well-known in the field of statistics. The presented kernel can properly deal with a large variety of mapping problems due to its flexibility to vary. The applicability, suitability, performance and robustness of the presented kernel are investigated on bi-spiral benchmark data set as well as seven data sets from the UCI benchmark repository. The experiment results demonstrate that the presented kernel is robust and has stronger mapping ability comparing with the standard kernel functions, and it can obtain better generalization performance. In general, the proposed kernel can be served as a generic alternative for the common linear, polynomial and RBF kernels. 相似文献

14.

Lattice Boltzmann simulation of viscous fingering phenomenon of immiscible fluids displacement in a channel

Bo Dong Y.Y. Yan Weizhong Li 《Computers & Fluids》2010,39(5):768-779

In this paper, the viscous fingering phenomenon of two immiscible fluids in a channel is studied by applying the lattice Boltzmann method (LBM). The fundamental physical mechanisms of a finger formation or the interface evolution between immiscible fluids are described in terms of the relative importance of viscous forces, surface tension, and gravity, which are quantifiable via the dimensionless quantities, namely, capillary number, Bond number and viscosity ratio between displaced fluid and displacing fluid. In addition, the effect of wettability on flow behaviour of fluids is investigated for the cases with and without consideration of gravity, respectively. The numerical results provide a good understanding of the mechanisms of viscous fingering phenomenon from a mesoscopic point of view and confirm that the LBM can be viewed as a promising tool for investigating fluid behaviour and other immiscible displacement problems. 相似文献

15.

Lattice Boltzmann large eddy simulation of subcritical flows around a sphere on non-uniform grids

M. Stiebler M. Krafczyk S. Freudiger M. Geier 《Computers & Mathematics with Applications》2011,61(12):3475-3484

In this work, the suitability of the lattice Boltzmann method is evaluated for the simulation of subcritical turbulent flows around a sphere. Special measures are taken to reduce the computational cost without sacrificing the accuracy of the method. A large eddy simulation turbulence model is employed to allow efficient simulation of resolved flow structures on non-uniform computational meshes. In the vicinity of solid walls, where the flow is governed by the presence of a thin boundary layer, local grid-refinement is employed in order to capture the fine structures of the flow. In the test case considered, reference values for the drag force in the Reynolds number range from 2000 to 10 000 and for the surface pressure distribution and the angle of separation at a Reynolds number of 10 000 could be quantitatively reproduced. A parallel efficiency of 80% was obtained on an Opteron cluster. 相似文献

16.

Numerical Study of the Nonlinear Combined Sine-Cosine-Gordon Equation with the Lattice Boltzmann Method

Huilin Lai Changfeng Ma 《Journal of scientific computing》2012,53(3):569-585

In this paper, a?lattice Boltzmann model is developed for solving the combined sine-cosine-Gordon equation through selecting equilibrium distribution function properly. With the Chapman-Enskog expansion, the governing evolution equation is recovered correctly from the continuous Boltzmann equation. Some problems, which have exact solutions, are validated by the present model. From the simulations, we find that the numerical results agree well with the exact solutions or better than the numerical solutions reported in previous studies. The study indicates that the present method is very effective and accurate. The present model can be used to solve more other nonlinear wave problems. 相似文献

17.

Lattice Boltzmann computations of incompressible laminar flow and heat transfer in a constricted channel

S. Gokaltun G.S. Dulikravich 《Computers & Mathematics with Applications》2010,59(7):2431-2441

A multi-population thermal lattice Boltzmann method (TLBM) is applied to simulate incompressible steady flow and heat transfer in a two-dimensional constricted channel. The method is validated for velocity and temperature profiles by comparing with a finite element method based commercial solver. The results indicate that, at various Reynolds numbers, the average flow resistance increases and the heat transfer rate decreases in a constricted channel in comparison to a straight channel. The effect of the constriction ratio is also investigated. The results show that the presented numerical model is a promising tool in analyzing simultaneous solution of fluid flow and heat transfer phenomena in complex geometries. 相似文献

18.

一种USB外设的实现方案

刘旭田捷《计算机工程与应用》2003,39(27):127-129

该文介绍了一种USB外设的实现方案。该方案采用USBN9603接口芯片和最常用的51单片机构成USB外设,来与主机进行USB通讯。由于51单片机的低廉价格以及USBN9603接口芯片的卓越性能,该系统具有良好的性价比。该方案能实现12Mbps的全速传输,支持所有的USB传输方式,具有很高的实用性和可靠性。相似文献

19.

一个P2P搜索引擎的架构和实现

郑仲伟郑有才《微型电脑应用》2007,23(6):32-34

P2P搜索技术是P2P研究中的一个重要的领域。本文介绍了一个基于P2P结构化覆盖网络的分布式搜索引擎的架构和实现。该搜索引擎采用了三层架构,良好的层次架构减少了搜索引擎核心算法与P2P覆盖网络协议和具体应用间的依赖,使得搜索引擎可以移植到不同的P2P结构化覆盖网络之上。由于P2P搜索过程中会消耗大量的网络带宽,所以该搜索引擎使用了一些优化算法,它们不仅减少搜索过程带来的带宽消耗,而且保证了系统的可伸缩性。相似文献

20.

Lattice Boltzmann simulations of vortex entrapment of particles in a microchannel with curved or flat edges

Hakan Başağaoğlu John T. Carrola Jr Christopher J. Freitas Berkay Başağaoğlu Sauro Succi 《Microfluidics and nanofluidics》2015,18(5-6):1165-1175

相似文献