共查询到20条相似文献,搜索用时 15 毫秒
1.
2.
Fast template placement for reconfigurable computing systems 总被引:3,自引:0,他引:3
This article presents fast online placement methods for dynamically reconfigurable systems, as well as offline 3D placement algorithms for statically reconfigurable architectures 相似文献
3.
Wearable computers are embedded into the mobile environment of their users. A design challenge for wearable systems is to combine the high performance required for tasks such as video decoding with the low energy consumption required to maximise battery runtimes and the flexibility demanded by the dynamics of the environment and the applications. In this paper, we demonstrate that reconfigurable hardware technology is able to answer this challenge. We present the concept and the prototype implementation of an autonomous wearable unit with reconfigurable modules (WURM). We discuss experiments that show the uses of reconfigurable hardware in WURM: ASICs-on-demand and adaptive interfaces. Finally, we present an experiment with an operating system layer for WURM. 相似文献
4.
Reconfigurability is essential for semiconductor manufacturing systems to remain competitive. Reconfigurable systems avoid costly modifications required to change and adapt to changes in product, production and services. A fully automated, collaborative, and integrated while reconfigurable manufacturing system proves cost-effective in the long term and is a promising strategy for the semiconductor manufacturing industry. However, there is a lack of computing models to facilitate the design and development of control and management systems in a truly reconfigurable manner. This paper presents an innovative computing model for reconfigurable systems and controlled manufacturing processes while allowing for the integration of modern technologies to facilitate reconfiguration, such as radio frequency identification (RFID) and reconfigurable field programmable gate array (FPGA). Shop floor manufacturing activities are modeled as processes from a business perspective. A process-driven formal method that builds on prior research on virtual production lines is proposed for the formation of a reconfigurable cross-facility manufacturing system. The trajectory of the controlled manufacturing systems is optimized for on-demand production services. Reconfigurable process controllers are introduced in support of the essential system reconfigurability of future semiconductor manufacturing systems. Implementation of this approach is also presented. 相似文献
5.
Reiner W. Hartenstein Jürgen Becker Rainer Kress Helmut Reinig 《Concurrency and Computation》1996,8(6):429-443
The paper introduces the MoM-3 as a reconfigurable accelerator for high performance computing at a moderate price. By using a new machine paradigm to trigger the operations in the MoM-3, this accelerator is especially suited to scientific algorithms, where the hardware structure can be configured to match the structure of the algorithm. The MoM-3 efficiently uses reconfigurable logic devices to provide a fine-grain parallelism, and multiple address generators to have the complete memory bandwidth free for data transfers (instead of fetching address computing instructions). Speed-up factors up to 82, compared to state-of-the-art workstations, are demonstrated by means of an Ising spin system simulation example. Adding the MoM-3 as an accelerator enables achievement of supercomputer performance from a low-cost workstation. 相似文献
6.
In embedded systems, dynamically reconfigurable computing can be partially modified at runtime without stopping the operation of the whole system. In this paper, we consider a reorganization mechanism for dynamically reconfigurable computing in embedded systems to guarantee that invariants of the design are respected. This reorganization is considered as a visual transformation of the logical configuration by the formulated rules. The invariant is recognized under the restructuring of the configuration using reconfiguration rules. 相似文献
7.
Reid Porter Jan Frigo Al Conti Neal Harvey Garrett Kenyon Maya Gokhale 《Microprocessors and Microsystems》2007,31(8):546-563
Cellular computing architectures represent an important class of computation that are characterized by simple processing elements, local interconnect and massive parallelism. These architectures are a good match for many image and video processing applications and can be substantially accelerated with Reconfigurable Computers. We present a flexible software/hardware framework for design, implementation and automatic synthesis of cellular image processing algorithms. The system provides an extremely flexible set of parallel, pipelined and time-multiplexed components which can be tailored through reconfigurable hardware for particular applications. The most novel aspects of our framework include a highly pipelined architecture for multi-scale cellular image processing as well as support for several different pattern recognition applications. In this paper, we will describe the system in detail and present our performance assessments. The system achieved speed-up of at least 100× for computationally expensive sub-problems and 10× for end-to-end applications compared to software implementations. 相似文献
8.
Susmit Bagchi 《Applied Soft Computing》2012,12(9):3023-3033
In recent time, the applications of biologically-inspired computing models into various domains of computing fields have gained attention due to a set of advantages. The bio-inspired distributed computing paradigm offers benefits such as, self-detection and self-reconfiguration capabilities of the computing systems. The large scale distributed systems suffer from the arbitrary failure of nodes and dynamic formation of network partitions at any point of time. This paper proposes a novel membrane algorithm for self-detection and self-reconfiguration of large distributed systems on the event of arbitrary node failures resulting in network partitioning. The algorithm is distributed in nature and, it is designed based on the hybridization of biological membrane computing model and cell-signaling mechanisms of biological cells. This paper presents the problem definition, design and analysis of the algorithm. The performance of the algorithm is evaluated through simulation. A detailed comparative analysis of the algorithm with respect to the other contemporary algorithms is presented. 相似文献
10.
Reconfigurable computing (RC) applications employing both microprocessors and FPGAs have potential for large speedup when compared with traditional (software) parallel applications. However, this potential is marred by the additional complexity of these dual-paradigm systems, making it difficult to identify performance bottlenecks and achieve desired performance. Performance analysis concepts and tools are well researched and widely available for traditional parallel applications but are lacking in RC, despite being of great importance due to the applications’ increased complexity. In this paper, we explore challenges and present new techniques in automated instrumentation, runtime measurement, and visualization of RC application behavior. We also present ideas for integration with conventional performance analysis tools to create a unified tool for RC applications as well as our initial framework for FPGA instrumentation and measurement. Results from a case study are provided using a prototype of this new tool. 相似文献
11.
Ideally, reconfigurable-system programmers and designers should code algorithms and write hardware accelerators independently of the underlying platform. To realize this scenario, the authors propose a portable, hardware-agnostic programming paradigm, which delegates platform-specific tasks to a system-level virtualization layer. This layer supports a chosen programming model and hides platform details from users much as general-purpose computers do. We introduce multithreaded programming model for reconfigurable computing based on a unified virtual-memory image for both software and hardware application parts. We also address the challenge of achieving seamless hardware-software interfacing and portability with minimal performance penalties. 相似文献
12.
A generic programmable spike-timing based circuit which forms the building block of a reconfigurable neuromorphic array is implemented in analog VLSI. An array of programmable spike time event coded circuit blocks is configured to implement functional circuit blocks of a spike time based neuromorphic model. A reconfigurable neuromorphic array chip with 10 event blocks is fabricated using Austria Microsystems m CMOS technology to demonstrate the functionality of the circuits in silicon. 相似文献
13.
Ju-Wook Jang Heonchul Park Prasanna V.K. 《IEEE transactions on pattern analysis and machine intelligence》1995,17(2):97-106
The reconfigurable mesh captures salient features from a variety of sources, including the content addressable array parallel processor, the CHiP, the polymorphic-torus network and the bus automaton. It consists of an array of processors interconnected by a reconfigurable bus system. The bus system can be used to dynamically obtain various interconnection patterns between the processors. In this paper, we present a fast algorithm for computing the histogram of an N×N image with h grey levels in O(min{√h+log*(N/h),N}) time on an N×N reconfigurable mesh assuming each PE has a constant amount of local memory. This algorithm runs on the PARBUS and MRN/LRN models. In addition, histogram modification can be performed in O(√h) time on the same model. A variant of out algorithm runs in O(min{√h+log log(N/h),N}) time on an N×N RMESH in which each PE has constant storage. This result improves the known time and memory bounds for histogramming on the RMESH model 相似文献
14.
Proshanta Saha Esam El-Araby Miaoqing Huang Mohamed Taher Sergio Lopez-Buedo Tarek El-Ghazawi Chang Shu Kris Gaj Alan Michalski Duncan Buell 《Parallel Computing》2008,34(4-5):245-260
Portable libraries of highly-optimized hardware cores can significantly reduce the development time of reconfigurable computing applications. This paper presents the tradeoffs and challenges in the design of such libraries. A set of library development guidelines is provided, which has been validated with the RCLib case study. RCLib is a set of portable libraries with over 100 cores, targeting a wide range of applications. RCLib portability has been verified in three major High-Performance reconfigurable computing architectures: SRC6, Cray XD1 and SGI RC100. Compared to full-software implementations, applications using RCLib hardware acceleration cores show speedups ranging from one to four orders of magnitude. 相似文献
15.
The Berkeley Emulation Engine 2 (BEE2) project is developing a reusable, modular, and scalable framework for designing high-end reconfigurable computers, including a processing-module building block and several programming models. Using these elements, BEE2 can provide over 10 times more computing throughput than a DSP-based system with similar power consumption and cost and over 100 times that of a microprocessor-based system. 相似文献
16.
Reconfigurable computing tries to achieve the balance between high efficiency of custom computing and flexibility of general-purpose
computing. This paper presents the implementation techniques in LEAP, a coarse-grained reconfigurable array, and proposes
a speculative execution mechanism for dynamic loop scheduling with the goal of one iteration per cycle and implementation
techniques to support decoupling synchronization between the token generator and the collector. This paper also introduces
the techniques of exploiting both data dependences of intra- and inter-iteration, with the help of two instructions for special
data reuses in the loop-carried dependences. The experimental results show that the number of memory accesses reaches on average
3% of an RISC processor simulator with no memory optimization. In a practical image matching application, LEAP architecture
achieves about 34 times of speedup in execution cycles, compared with general-purpose processors.
Supported by the National Natural Science Foundation of China (Grant No. 60633050, 60621003) and the National High Technology
Research and Development Program of China (Grant No. 2007AA01Z06) 相似文献
17.
Endri Bezati Richard Thavot Ghislain Roquier Marco Mattavelli 《Journal of Real-Time Image Processing》2014,9(1):251-262
The potential computational power of today multicore processors has drastically improved compared to the single processor architecture. Since the trend of increasing the processor frequency is almost over, the competition for increased performance has moved on the number of cores. Consequently, the fundamental feature of system designs and their associated design flows and tools need to change, so that, to support the scalable parallelism and the design portability. The same feature can be exploited to design reconfigurable hardware, such as FPGAs, which leads to rethink the mapping of sequential algorithms to HDL. The sequential programming paradigm, widely used for programming single processor systems, does not naturally provide explicit or implicit forms of scalable parallelism. Conversely, dataflow programming is an approach that naturally provides parallelism and the potential to unify SW and HDL designs on heterogeneous platforms. This study describes a dataflow-based design methodology aiming at a unified co-design and co-synthesis of heterogeneous systems. Experimental results on the implementation of a JPEG codec and a MPEG 4 SP decoder on heterogeneous platforms demonstrate the flexibility and capabilities of this design approach. 相似文献
18.
Most extant debugging aids force their users to think about errors in programs from a low-level, unit-at-a-time perspective. Such a perspective is inadequate for debugging large complex systems, particularly distributed systems. In this paper, we present a high-level approach to debugging that offers an alternative to the traditional techniques. We describe a language, edl, developed to support this high-level approach to debugging and outline a set of tools that has been constructed to effect this approach. The paper includes an example illustrating the approach and discusses a number of problems encountered while developing these debugging tools. 相似文献
19.
20.
Reconfiguring a network to counter variations in traffic is expected to greatly enhance optimal usage of network resources. But an important input to this method is the traffic fluctuations themselves. We have developed two models for this purpose to describe the time-dependent variations in traffic at a base station in a nomadic computing, wireless environment. The first model is rather simple and does not take into account details of human behavior. It takes into account the probabilities of choosing different applications. The model is also analyzed and experimented with to identify the important input parameters. The second model, a refined version of the first model, takes into account details of relevant human behavior (in the context of a wireless nomadic computing environment). Finally, we have compared the two models on the basis of their complexity and validity in different situations. 相似文献