期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Efficient multimedia coprocessor with enhanced SIMD engines for exploiting ILP and DLP

《Parallel Computing》2013,39(10):586-602

Multimedia applications have become increasingly important in daily computing. These applications are composed of heterogeneous regions of code mixed with data-level parallelism (DLP) and instruction-level parallelism (ILP). A standard solution for a multimedia coprocessor resembles of single-instruction multiple-data (SIMD) engines into architectures exploiting ILP at compile time, such as very long instruction word (VLIW) and transport triggered architecture (TTA). However, the ILP regions fail to scale with the increased vector length to achieve high performance in the DLP regions. Furthermore, the register-to-register nature of SIMD instructions causes current SIMD engines to have limitations in handling memory alignment, data reorganization, and control flow. Many supporting instructions such as data permutations, address generations, and loop branches, are required to aid in the execution of the real SIMD computation instructions. To mitigate these problems, we propose optimized SIMD engines that have the capabilities for combining VLIW or TTA processing with a unified scalar and long vector computations as well as efficient SIMD hardware for real computation. Our new architecture is based on TTA and is called multimedia coprocessor (MCP). This architecture includes following features: (1) a simple coprocessor structure with 8-way TTA, (2) cost-effective SIMD hardware capable of performing floating-point operations, (3) long vector capabilities built upon existing SIMD hardware and a single register file and processor data path for both scalar operands and vector elements, and (4) an optimized SIMD architecture that addresses the SIMD limitations. Our experimental evaluations show that MCP can outperform conventional SIMD techniques by an average of 39% and 12% in performance for multimedia kernels and applications, respectively. 相似文献

2.

XScale架构分析及优化策略 总被引：1，自引：0，他引：1

潘伟斌徐国治张珏璐郑浩《计算机工程与应用》2006,42(18):102-104

XScale被广泛地应用于嵌入式系统,但是很多程序并没有充分利用XScale架构的各种优点,性能并不尽如人意。本文详细分析了IntelXScale架构,并提出如何基于XScale架构进行优化的策略,最后对实际应用程序进行优化,并给出测试结果。相似文献

3.

XScale微架构代码级优化策略

雷蕾成瑾《微计算机信息》2006,22(30):186-188

Intel公司推出的XScale微架构以其高性能低功耗的特点,被广泛的应用到各种嵌入式应用领域。本文深入的分析了XScale架构中与性能有关的硬件单元的特点,并以此为依据,详细论述了基于XScale的应用程序开发过程中代码级优化的策略和技术。相似文献

4.

Multiple-camera people localization in an indoor environment

Valery A. Petrushin Gang Wei Anatole V. Gershman 《Knowledge and Information Systems》2006,10(2):229-241

With the rapid proliferation of video cameras in public places, the ability to identify and track people and other objects creates tremendous opportunities for business and security applications. This paper presents the Multiple Camera Indoor Surveillance project which is devoted to using multiple cameras, agent-based technology and knowledge-based techniques to identify and track people and summarize their activities. We also describe a people localization system, which identifies and localizes people in an indoor environment. The system uses low-level color features – a color histogram and average vertical color – for building people models and the Bayesian decision-making approach for people localization. The results of a pilot experiment that used 32 h of data (4 days × 8 h) showed the average recall and precision values of 68 and 59% respectively. Augmenting the system with domain knowledge, such as location of working places in cubicles, doors and passages, increased the average recall to 87% and precision to 73%. Valery A. Petrushin is a senior researcher at the Accenture Technology Labs in Chicago, Illinois USA. He received his M.Sc. in Applied Mathematics from the Kharkov State University, Kharkov, Ukraine, and his Ph.D. in Computer Science from Glushkov Institute for Cybernetics, Kiev, Ukraine. He worked as a Director of Intelligent Tutoring Systems at Glushkov Institute for Cybernetics, Ukraine and a researcher at EduTech Institute at the Georgia Tech, GA, USA. His research interests include multimedia data mining, processing, annotation and retrieval. He is the author of two books and more than 130 publications in the fields of computer science, computer-based education, data mining and signal processing. Gang Wei received his Ph.D. in Computer Science at Wayne State University, Detroit, Michigan, in May 2001. After that, he joined the Accenture Technology Labs in Chicago, Illinois, USA. He also worked as a summer intern research staff at Phillips Research Labs in New York State in year 1999 and 2000. His research focuses on multimedia annotation and retrieval, image and video processing, and sensor intelligence. He has published two book chapters, and over 20 papers in this area. Anatole V. Gershman is the global director of technology research for Accenture Technology Labs. He received his Ph.D. degree in Computer Science from Yale University in 1979. He worked at Bell Labs, the Schlumberger Research Centre in Connecticut, Cognitive Systems Inc., and Coopers & Lybrand before he joined Accenture (Andersen Consulting) in 1989. His research interests are in the fields of artificial intelligence, intelligent sensor networks and ubiquitous computing. 相似文献

5.

Cache modeling and optimization for portable devices running MPEG-4 video decoder

Abu Asaduzzaman Imad Mahgoub 《Multimedia Tools and Applications》2006,28(1):239-256

There are increasing demands on portable communication devices to run multimedia applications. ISO (an International Organization for Standardization) standard MPEG-4 is an important and demanding multimedia application. To satisfy the growing consumer demands, more functions are added to support MPEG-4 video applications. With improved CPU speed, memory sub-system deficiency is the major barrier to improving the system performance. Studies show that there is sufficient reuse of values for caching that significantly reduce the memory bandwidth requirement for video data. Software decoding of MPEG-4 video data generates much more cache-memory traffic than required. Proper understanding of the decoding algorithm and the composition of its data set is obvious to improve the performance of such a system. The focus of this paper is cache modeling and optimization for portable communication devices running MPEG-4 video decoding algorithm. The architecture we simulate includes a digital signal processor (DSP) for running the MPEG-4 decoding algorithm and a memory system with two levels of caches. We use VisualSim and Cachegrind simulation tools to optimize cache sizes, levels of associativity, and cache levels for a portable device decoding MPEG-4 video. Abu Asaduzzaman is, currently, a PhD candidate in the department of Computer Science and Engineering (CSE), Florida Atlantic University (FAU), Boca Raton, Florida. He received his MS degree in computer engineering from FAU in 1997. Mr. Asaduzzaman worked for ECI Telecom as a software engineer from 1998 to 2001. From 2001 to 2003, he worked for BlueCross and BlueShield of Florida and SunPass (FDoT) as an IT Consultant. Currently, he is working as a research assistant at CSE Dept, FAU. His research interests include cache optimization, architecture exploration, embedded system evaluation, and networks-on-a-chip (NoC). He has published several research papers in these areas. Abu is a member of the honor society of Phi Kappa Phi, Tau Beta Pi, Upsilon Phi Epsilon, and the Association for Computing Machinery (ACM) FAU Chapter. Imad Mahgoub received the MS degree in applied mathematics and MS degree in electrical and computer engineering, both from North Carolina State University, Raleigh in 1983 and 1986 respectively and the PhD degree in computer engineering from the Pennsylvania State University, University Park, PA in 1989. Dr. Mahgoub joined Florida Atlantic University (FAU), Boca Raton, Florida in 1989. Currently he is a full professor of Computer Science and Engineering department and the director of the Mobile Computing Laboratory. His research interests include performance evaluation, mobile computing, sensor networks, and parallel and distributed processing. He has published over 80 research papers in these areas. He is the co-editor of the Mobile Computing Handbook and the Handbook of Sensor Networks. Dr. Mahgoub has served on the program committees of numerous conferences. He has been the vice-chair for the Symposium on Performance Evaluation of Computer and Telecommunication Systems (SPECTS) since 2003. He is a senior member of the IEEE. He is also a member of Tau Beta Pi, Upsilon Pi Epsilon, the IEEE Computer Society, and the ACM. 相似文献

6.

Multimedia technologies for structuring and retrieval of TV news

Yasuo Ariki 《New Generation Computing》2000,18(4):341-357

Because of the media digitization, a large amount of information such as speech, audio and video data is produced everyday. In order to retrieve data from these databases quickly and precisely, multimedia technologies for structuring and retrieving of speech, audio and video data are strongly required. In this paper, we overview the multimedia technologies such as structuring and retrieval of speech, audio and video data, speaker indexing, audio summarization and cross media retrieval existing today for TV news detabase. The main purpose of structuring is to produce tables of contents and indices from audio and video data automatically. In order to make these technologies feasible, first, processing units such as words on audio data and shots on video data are extracted. On a second step, they are meaningfully integrated into topics. Furthermore, the units extracted from different types of media are integrated for higher functions. Yasuo Ariki, Ph.D.: He is a Professor in the Department of Electronics and Informatics at the Ryukoku University. He received his B.E., M.E. and Ph.D. in information science from Kyoto University in 1974, 1976 and 1979, respectively. He had been an Assistant in Kyoto University from 1980 to 1990, and stayed at Edinburgh University as visiting academic from 1987 to 1990. His research interests are in speech and image recognition and in information retrieval and database. He is a member of IPSJ, IEICE, ASJ, Soc. Artif. Intel. and IEEE. 相似文献

7.

Cache modeling and optimization for portable devices running MPEG-4 video decoder

Abu Asaduzzaman Imad Mahgoub 《Multimedia Tools and Applications》2006,28(2):239-256

There are increasing demands on portable communication devices to run multimedia applications. ISO (an International Organization for Standardization) standard MPEG-4 is an important and demanding multimedia application. To satisfy the growing consumer demands, more functions are added to support MPEG-4 video applications. With improved CPU speed, memory sub-system deficiency is the major barrier to improving the system performance. Studies show that there is sufficient reuse of values for caching that significantly reduce the memory bandwidth requirement for video data. Software decoding of MPEG-4 video data generates much more cache-memory traffic than required. Proper understanding of the decoding algorithm and the composition of its data set is obvious to improve the performance of such a system. The focus of this paper is cache modeling and optimization for portable communication devices running MPEG-4 video decoding algorithm. The architecture we simulate includes a digital signal processor (DSP) for running the MPEG-4 decoding algorithm and a memory system with two levels of caches. We use VisualSim and Cachegrind simulation tools to optimize cache sizes, levels of associativity, and cache levels for a portable device decoding MPEG-4 video. Abu Asaduzzaman is, currently, a PhD candidate in the department of Computer Science and Engineering (CSE), Florida Atlantic University (FAU), Boca Raton, Florida. He received his MS degree in computer engineering from FAU in 1997. Mr. Asaduzzaman worked for ECI Telecom as a software engineer from 1998 to 2001. From 2001 to 2003, he worked for BlueCross and BlueShield of Florida and SunPass (FDoT) as an IT Consultant. Currently, he is working as a research assistant at CSE Dept, FAU. His research interests include cache optimization, architecture exploration, embedded system evaluation, and networks-on-a-chip (NoC). He has published several research papers in these areas. Abu is a member of the honor society of Phi Kappa Phi, Tau Beta Pi, Upsilon Phi Epsilon, and the Association for Computing Machinery (ACM) FAU Chapter. Imad Mahgoub received the MS degree in applied mathematics and MS degree in electrical and computer engineering, both from North Carolina State University, Raleigh in 1983 and 1986 respectively and the PhD degree in computer engineering from the Pennsylvania State University, University Park, PA in 1989. Dr. Mahgoub joined Florida Atlantic University (FAU), Boca Raton, Florida in 1989. Currently he is a full professor of Computer Science and Engineering department and the director of the Mobile Computing Laboratory. His research interests include performance evaluation, mobile computing, sensor networks, and parallel and distributed processing. He has published over 80 research papers in these areas. He is the co-editor of the Mobile Computing Handbook and the Handbook of Sensor Networks. Dr. Mahgoub has served on the program committees of numerous conferences. He has been the vice-chair for the Symposium on Performance Evaluation of Computer and Telecommunication Systems (SPECTS) since 2003. He is a senior member of the IEEE. He is also a member of Tau Beta Pi, Upsilon Pi Epsilon, the IEEE Computer Society, and the ACM. 相似文献

8.

一种改进的嵌入式SIMD协处理器设计 总被引：1，自引：0，他引：1

周国昌王忠车德亮冯国臣《计算机工程与应用》2004,40(31):13-16

论文介绍的SIMD协处理器是用于低层图像理解的16位定点嵌入式阵列处理器。该协处理器采用load/store体系结构,并且除SIMD固有的数据并行性外,还具有三级流水和三组指令并发执行的并行性。三组指令并发执行使数据交换操作和其它类型操作并发执行,从而实现了数据交换操作的隐含执行,大大减少了通信和I/O操作的开销。相似文献

9.

二维SIMD体系结构研究与模拟器实现

下载免费PDF全文

王晔臧斌宇朱传琪《计算机工程与应用》2007,43(25):60-62

二维SIMD结构是指一个由N×N的处理单元按一定的拓扑结构连接组成的阵列结构,其同行/列的处理单元以SIMD方式工作。二维SIMD结构作为多媒体加速部件广泛应用在各种多媒体处理的SOC中,因此其体系结构的设计是获得高性能多媒体计算的重要因素。结合多媒体应用程序的特点,研究分析不同设计参数对二维SIMD结构性能的影响,并设计实现了一个二维SIMD结构的性能模拟器。实验结果显示了二维SIMD结构对多媒体程序有很好的加速比并证实了研究分析结论。相似文献

10.

Robust Quality Adaptation for Internet Video Streaming

E.?Turhan?Tunali Email author Aylin?Kantarci Nukhet?Ozbek 《Multimedia Tools and Applications》2005,27(3):431-448

Internet video streaming is a widely popular application however, in many cases, congestion control facilities are not well integrated into such applications. In order to be fair to other users that do not stream video, rate adaptation should be performed to respond to congestion. On the other hand, the effect of rate adaptation on the viewer should be minimized and this extra mechanism should not overload the client and the server. In this paper, we develop a heuristic approach for unicast congestion control. The primary feature of our approach is the two level adaptation algorithm that utilizes packet loss rate as well as receiver buffer data to maintain satisfactory buffer levels at the receiver. This is particularly important if receiver has limited buffer such as in mobile devices. When there is no congestion, to maintain best buffer levels, fine grain adjustments are carried out at the packet level. Depending on the level of congestion and receiver buffer level, rate shaping that involves frame discard and finally rate adaptation by switching to a different pre-encoded video stream are carried out. Additive increase multiplicative decrease policy is maintained to respond to congestion in a TCP- friendly manner. The algorithm is implemented and performance results show that it has adaptation ability that is suitable for both local area and wide area networks. E. Turhan Tunali received B.Sc. Degree in Electrical Engineering from Middle East Technical University and M.Sc. Degree in Applied Statistics from Ege University, both in Turkey. He then received D.Sc. Degree in Systems Science and Mathematics from Washington University in St. Louis, U.S.A. in 1985. After his doctorate study, he joined Computer Engineering Department of Ege University as an assistant professor where he became an associate professor in 1988. During the period of 1992–1994, he worked in Department of Computer Technology of Nanyang Technological University of Singapore as a Visiting Senior Fellow. He then joined International Computer Institute of Ege University as a Professor where he is currently the director. In the period of 2000–2001 he worked in Department of Computer Science of Loyola University of Chicago as a Visiting Professor. His current research interests include adaptive video streaming and Internet performance measurements. Dr. Tunali is married with an eighteen year old son. Aylin Kantarci received B.Sc., M.Sc. and Ph.D. degrees all from Computer Engineering Department of Ege University, Izmir, Turkey, in 1992, 1994 and 2000, respectively. She then joined the same department as an assistant professor. Her current research interests include adaptive video streaming, video coding, operating systems, multimedia systems and distributed systems. Nukhet Ozbek received B.Sc. degree in Electrical and Electronics Engineering from School of Engineering and M.Sc. degree in Computer Science from International Computer Institute both in Ege University, Izmir, Turkey. From 1998 to 2003 she worked in the DVB team of Digital R&D at Vestel Corporation, Izmir-Turkey that produces telecommunication and consumer electronics devices. She is currently a Ph.D. student and a research assistant at International Computer Institute of Ege University. Her research areas include video coding and streaming, multimedia systems and set top box architectures. 相似文献

11.

Multi-Kepler GPU vs. multi-Intel MIC for spin systems simulations

M. Bernaschi M. Bisson F. Salvadore 《Computer Physics Communications》2014

We present and compare the performances of two many-core architectures: the Nvidia Kepler and the Intel MIC both in a single system and in cluster configuration for the simulation of spin systems. As a benchmark we consider the time required to update a single spin of the 3D Heisenberg spin glass model by using the Over-relaxation algorithm. We present data also for a traditional high-end multi-core architecture: the Intel Sandy Bridge. The results show that although on the two Intel architectures it is possible to use basically the same code, the performances of a Intel MIC change dramatically depending on (apparently) minor details. Another issue is that to obtain a reasonable scalability with the Intel Phi coprocessor (Phi is the coprocessor that implements the MIC architecture) in a cluster configuration it is necessary to use the so-called offload mode which reduces the performances of the single system. As to the GPU, the Kepler architecture offers a clear advantage with respect to the previous Fermi architecture maintaining exactly the same source code. Scalability of the multi-GPU implementation remains very good by using the CPU as a communication co-processor of the GPU. All source codes are provided for inspection and for double-checking the results. 相似文献

12.

Optimizing mobile multimedia using SIMD techniques

N. C. Paver M. H. Khan B. C. Aldrich 《Multimedia Tools and Applications》2006,28(2):221-238

Demand for mobile video applications is growing today in wireless handheld platforms. Optimizing instruction set architectures and employing SIMD techniques is a logical approach towards attaining higher performance in mobile multimedia applications. Intel® Wireless MMX? technology has been designed to accelerate mobile multimedia and applications processing in a power efficient manner. This paper provides an overview of Intel® Wireless MMX? technology, a 64-bit Single Instruction Multiple Data (SIMD) coprocessor for the Intel® XScale® microarchitecture, and the key features of the architecture that specifically enhance the multi-media performance. Tools and techniques for optimization are also described. 相似文献

13.

基于PXA270的嵌入式工程机械监控器设计

张延晋周慧玲李青松《计算机与现代化》2010,(2):105-108

PXA270是一种基于IntelXScale架构的高性能嵌入式处理器。本文提出一种基于PXA270的高性能工程机械监控器。通过完成硬件电路设计、硬件驱动程序设计、blob和操作系统移植,良好地实现了对监控器的预期要求。相似文献

14.

A multi-streaming SIMD multimedia computing engine

Jih-Ching Chiu Yu-Liang Chou 《Microprocessors and Microsystems》2010,34(7-8):247-258

Current multimedia extensions provide a mechanism for general-purpose processors to meet the growing performance demand of multimedia applications. However, the computing performance of these extensions is often limited for the design conceptions of the single data stream. This paper presents an architecture called “multi-streaming SIMD” that enables current multimedia extensions to simultaneously manipulate multiple data streams. To efficiently and flexibly realize the proposed architecture, an operation cell is designed by fusing the logic gates and the storage cells together. Multiple operation cells then are connected to compose a register file with the ability of performing SIMD operations called “Multimedia Operation Storage Unit (MOSU)”. Further, many MOSUs are used to compose a multi-streaming SIMD computing engine that can simultaneously manipulate multiple data streams and exploit the subword parallelisms of the elements in each data stream. This paper also designs three instruction modes (global, coupling, and isolated modes) for programmers to dynamically configure the multi-streaming SIMD computing engine at the instruction level to manipulate different amounts of data streams. Simulation results show that when the multi-streaming SIMD architecture has four 4-register MOSUs, it provides a factor of 3.3×–5.5× performance enhancement for traditional MMX extensions on 12 multimedia kernels. 相似文献

15.

Information access in multimedia databases based on feature models

Arjen P. De Vries Menzo Windhouwer Peter M. G. Apers Martin Kersten 《New Generation Computing》2000,18(4):323-339

With the increasing popularity of the WWW, the main challenge in computer science has become content-based retrieval of multimedia objects. Access to multimedia objects in databases has long been limited to the information provided in manually assigned keywords. Now, with the integration of feature-detection algorithms in database systems software, content-based retrieval can be fully integrated with query processing. We describe our experimentation platform under development, making database technology available to multimedia. Our approach is based on the new notion of feature databases. Its architecture fully integrates traditional query processing and content-based retrieval techniques. Arjen P. de Vries, Ph.D.: He received his Ph.D. in Computer Science from the University of Twente in 1999, on the integration of content management in database systems. He is especially interested in the new requirements on the design of database systems to support content-based retrieval in multimedia digital libraries. He has continued to work on multimedia database systems as a postdoc at the CWI in Amsterdam as well as University of Twente. Menzo Windhouwer: He received his MSc in Computer Science and Management from the University of Amsterdam in 1997. Currently he is working in the CWI Database Research Group on his Ph.D., which is concerned with multimedia indexing and retrieval using feature grammars. Peter M.G. Apers, Ph.D.: He is a full professor in the area of databases at the University of Twente, the Netherlands. He obtained his MSc and Ph.D. at the Free University, Amsterdam, and has been a visiting researcher at the University of California, Santa Cruz and Stanford University. His research interests are query optimization in parallel and distributed database systems to support new application domains, such as multimedia applications and WWW. He has served on the program committees of major database conferences: VLDB, SIGMOD, ICDE, EDBT. In 1996 he was the chairman of the EDBT PC. In 2001 he will, for the second time, be the chairman of the European PC of the VLDB. Currently he is coordinating Editor-in-Chief of the VLDB Journal, editor of Data & Knowledge Engineering, and editor of Distributed and Parallel Databases. Martin Kersten, Ph.D.: He received his PhD in Computer Science from the Vrije Universiteit in 1985 on research in database security, whereafter he moved to CWI to establish the Database Research Group. Since 1994 he is professor at the University of Amsterdam. Currently he is heading a department involving 60 researchers in areas covering BDMS architectures, datamining, multimedia information systems, and quantum computing. In 1995 he co-founded Data Distilleries, specialized in data mining technology, and became a non-executive board member of the software company Consultdata Nederland. He has published ca. 130 scientific papers and is member of the editorial board of VLDB journal and Parallel and Distributed Systems. He acts as a reviewer for ESPRIT projects and is a trustee of the VLDB Endowment board. 相似文献

16.

Algebraic retrieval of fragmentarily indexed video

Katsumi Tanaka Keishi Tajima Takashi Sogo Sujeet Pradhan 《New Generation Computing》2000,18(4):359-374

When dealing with long video data, the task of identifying and indexing all meaningful subintervals that become answers to some queries is infeasible. It is infeasible not only when done by hand but even when done by using latest automatic video indexing techniques. Whether manually or automatically, it is only fragmentary video intervals that we can identify in advance of any database usage. Our goal is to develop a framework for retrieving meaningful intervals from such fragmentarily indexed video data. We propose a set of algebraic operations that includes ourglue join operations, with which we can dynamically synthesize all the intervals that are conceivably relevant to a given query. In most cases, since these operations also produce irrelevant intervals, we also define variousselection operations that are useful in excluding them from the answer set. We also show the algebraic properties possessed by those operations, which establish the basis of an algebraic query optimization. Katsumi Tanaka, D. Eng.: He received his B.E., M.E., and D.Eng. degrees in information science from Kyoto University, in 1974, 1976, and 1981, respectively. Since 1994, he is a professor of the Department of Computer and Systems Engineering and since 1997, he is a professor of the Division of Information and Media Sciences, Graduate School of Science and Technology, Kobe University. His research interests include object-oriented, multimedia and historical databases abd multimedia information systems. He is a member of the ACM, IEEE Computer Society and the Information Processing Society of Japan. Keishi Tajima, D.Sci.: He received his B.S, M.S., and D.S. from the department of information science of University of Tokyo in 1991, 1993, and 1996 respectively. Since 1996, he is a Research Associate in the Department of Computer and Systems Engineering at Kobe University. His research interests include data models for non-traditional database systems and their query languages. He is a member of ACM, ACM SIGMOD, Information Processing Society of Japan (IPSJ), and Japan Society for Software Science and Technology (JSSST). Takashi Sogo, M.Eng.: He received B.E. and M.E. from the Department of Computer and Systems Engineering, Kobe University in 1998 and 2000, respectively. Currently, he is with USAC Systems Co. His research interests include video database systems. Sujeet Pradhan, D.Eng.: He received his BE in Mechanical Engineering from the University of Rajasthan, India in 1988, MS in Instrumentation Engineering in 1995 and Ph.D. in Intelligence Science in 1999 from Kobe University, Japan. Since 1999 May, he is a lecturer of the Department of Computer Science and Mathematics at Kurashiki University of Science and the Arts, Japan. A JSPS (Japan Society for the Promotion of Science) Research Fellow during the period between 1997 and 1999, his research interests include video databases, multimedia authoring, prototypebased languages and semi-structured databases. Dr. Pradhan is a member of Information Processing Society of Japan. 相似文献

17.

共享主存多SIMD结构及编译技术研究

张为华臧斌宇《计算机科学与探索》2009,3(1):18-25

目前多媒体应用已经成为各种运算平台的主要应用类型。随着多媒体应用的多样性和复杂性,共享主存多SIMD结构逐渐成为主从多核结构中多媒体加速部件的首要选择。总结了目前共享主存多SIMD结构的特征,同时深入分析了共享主存多SIMD编译优化的主要问题以及相关编译技术。相似文献

18.

An Integrated Framework for Semantic Annotation and Adaptation 总被引：1，自引：1，他引：0

M.?Bertini Email author R.?Cucchiara A.?Del?Bimbo A.?Prati 《Multimedia Tools and Applications》2005,26(3):345-363

Tools for the interpretation of significant events from video and video clip adaptation can effectively support automatic extraction and distribution of relevant content from video streams. In fact, adaptation can adjust meaningful content, previously detected and extracted, to the user/client capabilities and requirements. The integration of these two functions is increasingly important, due to the growing demand of multimedia data from remote clients with limited resources (PDAs, HCCs, Smart phones). In this paper we propose an unified framework for event-based and object-based semantic extraction from video and semantic on-line adaptation. Two cases of application, highlight detection and recognition from soccer videos and people behavior detection in domotic^* applications, are analyzed and discussed.Domotics is a neologism coming from the Latin word domus (home) and informatics.Marco Bertini has a research grant and carries out his research activity at the Department of Systems and Informatics at the University of Florence, Italy. He received a M.S. in electronic engineering from the University of Florence in 1999, and Ph.D. in 2004. His main research interest is content-based indexing and retrieval of videos. He is author of more than 25 papers in international conference proceedings and journals, and is a reviewer for international journals on multimedia and pattern recognition.Rita Cucchiara (Laurea Ingegneria Elettronica, 1989; Ph.D. in Computer Engineering, University of Bologna, Italy 1993). She is currently Full Professor in Computer Engineering at the University of Modena and Reggio Emilia (Italy). She was formerly Assistant Professor (‘93–‘98) at the University of Ferrara, Italy and Associate Professor (‘98–‘04) at the University of Modena and Reggio Emilia, Italy. She is currently in the Faculty staff of Computer Engenering where has in charges the courses of Computer Architectures and Computer Vision.Her current interests include pattern recognition, video analysis and computer vision for video surveillance, domotics, medical imaging, and computer architecture for managing image and multimedia data.Rita Cucchiara is author and co-author of more than 100 papers in international journals, and conference proceedings. She currently serves as reviewer for many international journals in computer vision and computer architecture (e.g. IEEE Trans. on PAMI, IEEE Trans. on Circuit and Systems, Trans. on SMC, Trans. on Vehicular Technology, Trans. on Medical Imaging, Image and Vision Computing, Journal of System architecture, IEEE Concurrency). She participated at scientific committees of the outstanding international conferences in computer vision and multimedia (CVPR, ICME, ICPR, ...) and symposia and organized special tracks in computer architecture for vision and image processing for traffic control. She is in the editorial board of Multimedia Tools and Applications journal. She is member of GIRPR (Italian chapter of Int. Assoc. of Pattern Recognition), AixIA (Ital. Assoc. Of Artificial Intelligence), ACM and IEEE Computer Society.Alberto Del Bimbo is Full Professor of Computer Engineering at the Università di Firenze, Italy. Since 1998 he is the Director of the Master in Multimedia of the Università di Firenze. At the present time, he is Deputy Rector of the Università di Firenze, in charge of Research and Innovation Transfer. His scientific interests are Pattern Recognition, Image Databases, Multimedia and Human Computer Interaction. Prof. Del Bimbo is the author of over 170 publications in the most distinguished international journals and conference proceedings. He is the author of the “Visual Information Retrieval” monography on content-based retrieval from image and video databases edited by Morgan Kaufman. He is Member of IEEE (Institute of Electrical and Electronic Engineers) and Fellow of IAPR (International Association for Pattern Recognition). He is presently Associate Editor of Pattern Recognition, Journal of Visual Languages and Computing, Multimedia Tools and Applications Journal, Pattern Analysis and Applications, IEEE Transactions on Multimedia, and IEEE Transactions on Pattern Analysis and Machine Intelligence. He was the Guest Editor of several special issues on Image databases in highly respected journals.Andrea Prati (Laurea in Computer Engineering, 1998; PhD in Computer Engineering, University of Modena and Reggio Emilia, 2002). He is currently an assistant professor at the University of Modena and Reggio Emilia (Italy), Faculty of Engineering, Dipartimento di Scienze e Metodi dell’Ingegneria, Reggio Emilia. During last year of his PhD studies, he has spent six months as visiting scholar at the Computer Vision and Robotics Research (CVRR) lab at University of California, San Diego (UCSD), USA, working on a research project for traffic monitoring and management through computer vision. His research interests are mainly on motion detection and analysis, shadow removal techniques, video transcoding and analysis, computer architecture for multimedia and high performance video servers, video-surveillance and domotics. He is author of more than 60 papers in international and national conference proceedings and leading journals and he serves as reviewer for many international journals in computer vision and computer architecture. He is a member of IEEE, ACM and IAPR. 相似文献

19.

Particle-in-Cell algorithms for emerging computer architectures

Viktor K. Decyk Tajendra V. Singh 《Computer Physics Communications》2014

We have designed Particle-in-Cell algorithms for emerging architectures. These algorithms share a common approach, using fine-grained tiles, but different implementations depending on the architecture. On the GPU, there were two different implementations, one with atomic operations and one with no data collisions, using CUDA C and Fortran. Speedups up to about 50 compared to a single core of the Intel i7 processor have been achieved. There was also an implementation for traditional multi-core processors using OpenMP which achieved high parallel efficiency. We believe that this approach should work for other emerging designs such as Intel Phi coprocessor from the Intel MIC architecture. 相似文献

20.

一种SIMD优化中的向量寄存器部分重用方法 总被引：1，自引：0，他引：1

下载免费PDF全文

钱兴隆臧斌宇朱传琪《计算机工程与科学》2007,29(5):141-146

SIMD架构用于多媒体加速,已经广泛应用于现代通用处理器中.SIMD架构的数据并行性可大大提高处理器的运算能力,但由于存储系统的速度远远不能与其匹配,使得应用程序的性能很难获得进一步的提高.因此,本文基于SIMD架构的访存特性,提出了一种向量寄存器部分重用的方法,以提高访存效率;并给出了相应的程序转换算法,通过数据相关性的分
分析,在应用程序向量化时,生成采用向量寄存器部分重用的优化代码.实验结果说明,该算法对多媒体应用程序的性能有显著的提高. 相似文献