首页 | 本学科首页   官方微博 | 高级检索  
     


Non-linear explicit transient finite element analysis on the Intel Delta
Affiliation:2. Argonne National Laboratory, Argonne, IL 60439, U.S.A.;3. Department of Civil Engineering, The Johns Hopkins University, Baltimore, MD 21218, U.S.A.;1. School of Civil Engineering, Chongqing University, Chongqing 400045, China;2. Key Laboratory of New Technology for Construction of Cities in Mountain Area, Ministry of Education, Chongqing 400045, China;3. Institute of Solid Mechanics, Beihang University, Beijing 100191, China;1. Structural Engineering Research Group/LGCGM, INSA de Rennes, Université Bretagne Loire, 20 avenue des Buttes de Coësmes, CS 70839, 35708 Rennes Cedex 7, France;2. Department of Civil and Architectural Engineering, KTH Royal Institute of Technology, SE-10044 Stockholm, Sweden;1. College of Water Conservancy and Civil Engineering, South China Agricultural University, Guangzhou 510642, China;2. Earthquake Engineering Research and Test Center, Guangzhou University, Guangzhou 510405, China
Abstract:Many large-scale finite element problems are intractable on current generation production supercomputers. High-performance computer architectures offer effective avenues to bridge the gap between computational needs and the power of computational hardware. The biggest challenge lies in the substitution of the key algorithms in an application program with redesigned algorithms which exploit the new architectures and use better or more appropriate numerical techniques. A methodology for implementing non-linear finite element analysis on a homogeneous distributed processing network is discussed. The method can also be extended to heterogeneous networks comprised of different machine architectures provided that they have a mutual communication interface. This unique feature has greatly facilitated the port of the code to the 8-node Intel Touchstone Gamma and then the 512-node Intel Touchstone Delta. The domain is decomposed serially in a preprocessor. Separate input files are written for each subdomain. These files are read in by local copies of the program executable operating in parallel. Communication between processors is addressed utilizing asynchronous and synchronous message passing. The basic kernel of message passing is the internal force exchange which is analogous to the computed interactions between sections of physical bodies in static stress analysis. Benchmarks for the Intel Delta are presented. Performance exceeding 1 gigaflop was attained. Results for two large-scale finite element meshes are presented.
Keywords:
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号