期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Complex exponential functions: A high-precision hardware realization

《Integration, the VLSI Journal》2020

相似文献

2.

A mixed-signal array processor with early vision applications

Martin D.A. Hae-Seung Lee Masaki I. 《Solid-State Circuits, IEEE Journal of》1998,33(3):497-502

Many early vision tasks require only 6 to 8 b of precision. For these applications, a special-purpose analog circuit is often a smaller, faster, and lower power solution than a general-purpose digital processor, but the analog chips lack the programmability of digital image processors. This paper presents a programmable mixed-signal array processor which combines the programmability of a digital processor with the small area and low power of an analog circuit. Each processor cell in the array utilizes a digitally programmable analog arithmetic unit with an accuracy of 1.3%. The analog arithmetic unit utilizes a unique circuit that combines a cyclic switched-capacitor analog-to-digital converter (ADC) and digital-to-analog converter (DAC) to perform addition, subtraction, multiplication, and division, Each processor cell, fabricated in a 0.8-μm triple-metal CMOS process, operates at a speed of 0.8 MIPS, consumes 1.8 mW of power at 5 V, and uses 700 μm by 270 μm of silicon area. An array of these processor cells performed an edge detection algorithm and a subpixel resolution algorithm 相似文献

3.

A VLSI array processor for 16-point FFT 总被引：1，自引：0，他引：1

Lee Moon-Key Shin Kyung-Wook Lee Jang-Kyu 《Solid-State Circuits, IEEE Journal of》1991,26(9):1286-1292

An implementation of a two-dimensional array processor for fast Fourier transform (FFT) using a 2-μm CMOS technology is presented. The array processor, which is dedicated to 16-point FFT, implements a 4×4 mesh array of 16 processing elements (PEs) working in parallel. Design considerations in both the chip level and the PE level are examined. A layout design methodology based on bit-slice units (BSUs) results in a very simple design, easy debugging, and a regular interconnection scheme through abutment. It contains about 48,000 transistors on an area of 53.52 mm², excluding the 83-pad area, and operation is on a 15-MHz clock. The array processor performs 24.6 million complex multiplications per second, and computes a 16-point FFT in 3 μs 相似文献

4.

A fault-tolerant array processor designed for testability andself-reconfiguration

Jain A. Mandava B. Rajski J. Rumin N.C. 《Solid-State Circuits, IEEE Journal of》1991,26(5):778-788

The design of a fault-tolerant rectangular array of processing elements (PEs) is presented in which the reconfiguration is done by means of on-chip distributed logic, without the help of any external host. Spare PEs are included in every column of the array, and faulty PEs are bypassed within a column to facilitate reconfiguration in the presence of faults. Scan paths are used to enhance the testability of the array. PEs are tested locally using near-neighbor comparisons without the need of an external host. Because the interconnections between logical neighbors are short, the speed penalty for reconfiguration is very small. Any amount of redundancy can be incorporated in the array without changing the topology of the scheme or the design of the reconfiguration switches. The scheme is well suited for very large-area, high-density chips and wafer-scale integration. In order to demonstrate the capabilities of the scheme and evaluate its performance, an experimental chip consisting of a 6×4 array was designed, fabricated, and tested. Details of the design and the implementation of the chip are presented. The scheme is also analyzed for yield and area utilization for a range of array sizes and PE survival probabilities 相似文献

5.

An LSI adaptive array processor

《Solid-State Circuits, IEEE Journal of》1983,18(2):147-156

Describes an LSI adaptive array processor (AAP) for two-dimensional data processings. The AAP contains a large number of one-bit processing elements (PEs) arranged in a square array. The large degree of parallelism and control registers in each PE allow for high speed and flexible operations. High transfer capability is also obtained by a simple inter-PE connection network with hierarchical bypasses. The high applicability to various data processings is indicated by a matrix multiplication example, utilizing an algorithm similar to a systolic one. An AAP LSI composed of 8/spl times/8 PEs with powerful functions has been implemented in a 96.0 mm/SUP 2/ chip by using 2 /spl mu/m Si-gate p-well CMOS technology. A high-speed cycle time of 55 ns, low power dissipation of 1.1 W, and high packing density of 1170 transistors/mm/SUP 2/ has been achieved by a skilful manual design. Though the LSI contains as many as 111900 transistors, the design effort has only required one man-year due to cellular array regularity. This LSI is expected to realize a high-performance AAP compactly. 相似文献

6.

A baseband processor for IS-54 cellular telephony

Friedman V. Lakshmikumar K.R. Price D.L. Le T.N. Kumar J. 《Solid-State Circuits, IEEE Journal of》1996,31(5):646-655

This paper describes the design of a baseband processor for IS-54 North American cellular telephony standard. The effect of diverse circuit impairments on the error vector in digital mode and on the parasitic amplitude modulation in analog mode are analyzed. An analog offset compensation scheme, which takes advantage of the TDMA operation, is presented. The device incorporates a Manchester data decoder for data transmission in analog mode. The messages can be sent via two interfaces to the DSP or μ-processor. The architecture of the digital signal processing chain is discussed. The device is fabricated in a 0.9 μ CMOS technology with an area of 40 mm² 相似文献

7.

A polynomial rooting approach to super-resolution array design

Dowlut N. Manikas A. 《Signal Processing, IEEE Transactions on》2000,48(6):1559-1569

This paper is concerned with the design of super-resolution direction finding (DF) arrays that satisfy prespecified performance levels, such as detection-resolution thresholds and Cramer-Rao bounds on error variance. The sensor placement problem is formulated in the framework of subspace-based DF techniques and a novel polynomial rooting approach to the design problem, based on the new concept of the “sensor locator polynomial (SLP),” is proposed. This polynomial is constructed using the prespecified performance levels, and its roots yield the sensor locations of the desired array. The distinguishing feature of the proposed technique is that it hinges on the properties of the array manifold, which plays a central role in all subspace-based DF algorithms 相似文献

8.

Fault-tolerant matrix inversion on processor array

Milovanovic E.I. Milovanovic I.Z. Stojcev M.K. Jovanovic G.S. 《Electronics letters》1992,28(13):1206-1208

The authors consider the problem of fault tolerant inversion of triangular matrices based on a linear checksum approach. The iterative Shultz method adapted for parallel implementation on triangular processor arrays was used.<> 相似文献

9.

COS/MOS parallel processor array

《Solid-State Circuits, IEEE Journal of》1970,5(5):221-227

相似文献

10.

Sequential diagnosis of processor array systems

Jun Zhao Meyer F.J. Park N. Lombardi F. 《Reliability, IEEE Transactions on》2004,53(4):487-498

We examine the diagnosis of processor array systems formed as two-dimensional arrays, with boundaries, and either four or eight neighbors for each interior processor. We employ a parallel test schedule. Neighboring processors test each other, and report the results. Our diagnostic objective is to find a fault-free processor or set of processors. The system may then be sequentially diagnosed by repairing those processors tested faulty according to the identified fault-free set, or a job may be run on the identified fault-free processors. We establish an upper bound on the maximum number of faults which can be sustained without invalidating the test results under worst case conditions. We give test schedules and diagnostic algorithms which meet the upper bound as far as the highest order term. We compare these near optimal diagnostic algorithms to alternative algorithms, both new and already in the literature, and against an upper bound ideal case algorithm, which is not necessarily practically realizable. For eight-way array systems with N processors, an ideal algorithm has diagnosability 3N/sup 2/3/-2N/sup 1/2/ plus lower-order terms. No algorithm exists which can exceed this. We give an algorithm which starts with tests on diagonally connected processors, and which achieves approximately this diagnosability. So the given algorithm is optimal to within the two most significant terms of the maximum diagnosability. Similarly, for four-way array systems with N processors, no algorithm can have diagnosability exceeding 3N/sup 2/3//2/sup 1/3/-2N/sup 1/2/ plus lower-order terms. And we give an algorithm which begins with tests arranged in a zigzag pattern, one consisting of pairing nodes for tests in two different directions in two consecutive test stages; this algorithm achieves diagnosability (3/2)(5/2)/sup 1/3/N/sup 2/3/-(5/4)N/sup 1/2/ plus lower-order terms, which is about 0.85 of the upper bound due to an ideal algorithm. 相似文献

11.

Resolution of maximum entropy array processor

《Proceedings of the IEEE. Institute of Electrical and Electronics Engineers》1979,67(6):968-969

An approximate expression for the resolution of the maximum entropy array processor is derived and compared with the resolution expression for the conventional linear array processor (beamformer). 相似文献

12.

Cellular array processor CAP and applications

Mitsuo Ishii Hiroyuki Sato Morio Ikesaka Kouichi Murakami Hiroaki Ishihata 《The Journal of VLSI Signal Processing》1989,1(1):57-67

The general-purpose, highly parallel, cellular array processor (CAP) we developed features multiple-instruction stream, multiple-data stream (MIMD) processing and image display. Processor elements can number in several hundreds. The present system uses 256 processors. Each processor element consists of a general-purpose microprocessor, memory, and a special VLSI chip that performs parallel-processing-specific functions such as processor communication and synchronization. The VLSI has two 2M byte/s independent common bus interfaces for data broadcating and six 15M bit/s serial communication ports for local data communication. The chip also can process image data in real time for multiple processors. Use of the communication interfaces enables a variety of processor networks to be configured. One CAP application has been computer graphics, in which ray tracing is used to generate quality images. 相似文献

13.

Digital tracking array using off-the-shelf hardware

Gezer L. Broadston R. Jenn D. Burgstaller G. 《Antennas and Propagation Magazine, IEEE》2008,50(1):108-114

The design, development, and testing of a digital tracking array is described. The array operates at 2.4 GHz for tracking video and data from UAVs and other mobile transmitters. A monopulse tracking technique is used to keep the beam scanned to the direction of the incoming signal. The array is built entirely of commercial off-the-shelf (COTS) components. Calibration, measurement of patterns, and verification of the tracking function are also discussed. 相似文献

14.

A delay-encoding-logic array processor for dynamic-programming matching of data sequences

Ogawa M. Shibata T. 《Solid-State Circuits, IEEE Journal of》2005,40(7):1578-1582

Computationally very expensive dynamic-programming matching of data sequences has been directly implemented in a fully-parallel-architecture VLSI chip. The circuit operates as digital logic in the signal domain, while analog processing is carried out in the time domain based on the delay-encoding-logic scheme. As a result, a high-speed low-power best-match-sequence search has been established with a small chip area. The typical matching time of 80 ns with the power dissipation of 2 mW has been demonstrated with fabricated prototype chips. 相似文献

15.

CMOS image sensor with mixed-signal processor array

Graupner A. Schreiter J. Getzlaff S. Schuffny R. 《Solid-State Circuits, IEEE Journal of》2003,38(6):948-957

We present a single-chip integration of a CMOS image sensor with an embedded flexible processing array and dedicated analog-to-digital converter. The processor array is designed to perform convolution and transformation algorithms with arbitrary kernels. It has been designed to carry out the multiplication of analog image data with given digital kernel coefficients and to add up the results. The processor array is an analog implementation of a highly parallel architecture which is scalable to any desired sensor resolution while preserving video-rate operation. A prototype implementation has been realized in a 0.6-/spl mu/m CMOS technology. Switched current technique has been applied to obtain compact and robust circuits. The prototype's sensor resolution is 64 /spl times/ 128 pixels. The processor array occupies a small chip area and consumes only a small percentage of the power (250 /spl mu/W) of the whole image sensor. 相似文献

16.

Speech recognition on a distributed array processor

Simpson P. Roberts J.B.G. 《Electronics letters》1983,19(24):1018-1020

A highly parallel single-instruction multiple-data (SIMD) array signal processor is advocated as efficient for a wide range of real-time problems. We examine its performance for digital speech recognition and show that impressive throughput rates for realistic vocabulary sizes can be achieved for `time-warping? dynamic programming algorithms which currently form the basis of several commercial and research speech recognisers. 相似文献

17.

Dynamic programming implementation on array processor architectures

K. I. Diamantaras W. H. Chou S. Y. Kung 《The Journal of VLSI Signal Processing》1996,13(1):27-35

Dynamic Programming (DP) applies to many signal and image processing applications including boundary following, the Viterbi algorithm, dynamic time warping, etc. This paper presents an array processor implementation of generic dynamic programming. Our architecture is a SIMD array attached to a host computer. The processing element of the architecture is based on an ASIC design opting for maximum speed-up. By adopting a torus interconnection network, a dual buffer structure, and a multilevel pipeline, the performance of the DP chip is expected to reach the order of several GOPS. The paper discusses both the dedicated hardware design and the data flow control of the DP chip and the total array.This work was supported in part by the NATO, Scientific and Environmental Affairs Division, Collaborative Research Grant SA.5-2-05(CRG.960201)424/96/JARC-501. 相似文献

18.

通用硬件模糊控制器研究

徐德杨莹春《电子与信息学报》2002,24(9):1263-1269

提出了一种基于多值逻辑电路的模糊控制器硬件实现方案,采用规则分时进行硬件模糊推理,不同规则的推理结果合并后形成模糊输出,经模糊判决后形成精确量输出。该方案的复杂性不受规则数量的影响,执行速度不受语言变量维数的影响,该方案通过改变存储器数据可以方便地调整隶属度函数和模糊控制规则,克服了硬件模糊控制器灵活性差这一重大缺陷,该方案便于以VLSI实现。相似文献

19.

High-performance area-efficient polynomial ring processor for CRYSTALS-Kyber on FPGAs

《Integration, the VLSI Journal》2021

相似文献

20.

FIR滤波器的优化设计与硬件实现 总被引：6，自引：2，他引：6

王金明张雄伟《电视技术》2003,(3):19-21

介绍如何用FPGA器件设计和实现FIR数字滤波器，对几种实现方案进行了比较和优化，并就如何改善所设计滤波器的性能和指标进行了讨论。相似文献