首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
The discrete wavelet transform (DWT) provides a new method for signal/image analysis where high frequency components are studied with finer time resolution and low frequency components with coarser time resolution. It decomposes a signal or an image into localized contributions for multiscale analysis. In this paper, we present a parallel pipelined VLSI array architecture for 2D dyadic separable DWT. The 2D data array is partitioned into non-overlapping groups of rows. All rows in a partition are processed in parallel, and consecutive partitions are pipelined. Moreover, multiple wavelet levels are computed in the same pipeline, and multiple DWT problems can be pipelined also. The whole computation requires a single scan of the image data array. Thus, it is suitable for on-line real-time applications. For anN×N image, anm-level DWT can be computed in time units on a processor costing no more than , whereq is the partition size,p is the length of corresponding 1D DWT filters,C m andC a are the costs of a parallel multiplier and a parallel adder respectively, and a time unit is the time for a multiplication and an addition. Forq=N m, the computing time reduces to . When a large number of DWT problems are pipelined, the computing time is about per problem.  相似文献   

2.
    
In this paper we investigate -bit serial addition in the context of feed-forward linear threshold gate based networks. We show that twon-bit operands can be added in overall delay with a feed-forward network constructed with linear threshold gates and latches. The maximum weight value is and the maximum fan-in is . We also investigate the implications our scheme have to the performance and the cost under small weights and small fan-in requirements. We deduce that if the weight values are to be limited by a constantW, twon-bit operands can be added in overall delay with a feed-forward network that has the implementation cost [logW]+1, in terms of linear threshold gates, in terms of latches and a maximum fan-in of 3[logW]+1. We also prove that, if the fan-in values are to be limited by a constantF+1, twon-bit operands can be added in overall delay with a feed-forward network that has the implementation cost , in terms of linear threshold gates, in terms of latches, and a maximum weight value of . An asymptotic bound of is derived for the addition overall delay in the case that the weight values have to be linearly bounded, i.e., in the order ofO(n). The implementation cost in this case is in the order ofO(logn), in terms of linear threshold gates, and in the order ofO(log2 n), in terms of latches. The maximum fan-in is in the order ofO(logn). Finally, a partition technique, that substantially reduces the overall cost of the implementation for all the schemes in terms of delay, latches, weights, and fan-in with some few additional threshold gates, is also presented.  相似文献   

3.
Let K be a field, k and n positive integers and let matrices with coefficients in K. For any function
there exists a unique solution of the system of difference equations
defined by the matrix-k-tuple such that . The system is called finite-memory system iff for every function g with finite support the values are 0 for sufficiently big . In the case , these systems and the corresponding matrix-k-tuples have been studied in bis, fm, fmv, fv1, fv, fz. In this paper I generalize these results to an arbitrary positive integer k and to an arbitrary field K.  相似文献   

4.
The ability of a parallel algorithm to make efficient use of increasing computational resources is known as its scalability. In this paper, we develop four parallel algorithms for the 2-dimensional Discrete Wavelet Transform algorithm (2-D DWT), and derive their scalability properties on Mesh and Hypercube interconnection networks. We consider two versions of the 2-D DWT algorithm, known as the Standard (S) and Non-standard (NS) forms, mapped onto P processors under two data partitioning schemes, namely checkerboard (CP) and stripped (SP) partitioning. The two checkerboard partitioned algorithms (Non-standard form, NS-CP), and as (Standard form, S-CP); while on the store-and-forward-routed (SF-routed) Mesh and Hypercube they are scalable as (NS-CP), and as (S-CP), respectively, where M 2 is the number of elements in the input matrix, and (0,1) is a parameter relating M to the number of desired octaves J as . On the CT-routed Hypercube, scalability of the NS-form algorithms shows similar behavior as on the CT-routed Mesh. The Standard form algorithm with stripped partitioning (S-SP) is scalable on the CT-routed Hypercube as M 2 = (P 2), and it is unscalable on the CT-routed Mesh. Although asymptotically the stripped partitioned algorithm S-SP on the CT-routed Hypercube would appear to be inferior to its checkerboard counterpart S-CP, detailed analysis based on the proportionality constants of the isoefficiency function shows that S-SP is actually more efficient than S-CP over a realistic range of machine and problem sizes. A milder form of this result holds on the CT- and SF-routed Mesh, where S-SP would, asymptotically, appear to be altogether unscalable.  相似文献   

5.
Consider the class of d-dimensional causal filters characterized by a d-variate rational function analytic on the polydisk . The BIBO stability of such filters has been the subject of much research over the past two decades. In this paper we analyze the BIBO stability of such functions and prove necessary and sufficient conditions for BIBO stability of a d-dimensional filter. In particular, we prove if a d-variate filter H(z) analytic on has a Fourier expansion that converges uniformly on the closure of , then H(z) is BIBO stable. This result proves a long standing conjecture set forth by Bose in [3].  相似文献   

6.
7.
In many signal processing situations, the desired (ideal) magnitude response of the filter is a rational function: (a digital integrator). The requirements of a linear phase response and guaranteed stable performance limit the design to a finite impulse response (FIR) structure. In many applications we require the FIR filter to yield a highly accurate magnitude response for a narrow band of frequencies with maximal flatness at an arbitrary frequency 0 in the spectrum (0, ). No techniques for meeting such requirements with respect to approximation of are known in the literature. This paper suggests a design by which the linear phase magnitude response can be approximated by an FIR configuration giving a maximally flat (in the Butterworth sense) response at an arbitrary frequency 0, 0<0<*. A technique to compute exact weights for the design has also been given.  相似文献   

8.
CORDIC-based algorithms to compute cos and are proposed. The implementation requires a standard CORDIC module plus a module to compute the direction of rotation, this being the same hardware required for the extended CORDIC vectoring, recently proposed by the authors [T. Lang and E. Antelo, IEEE Transactions on Computers, vol. 47, no. 7, 1998, pp. 736–749.]. Although these functions can be obtained as a special case of this extended vectoring, the specific algorithm we propose here presents two significant improvements: (1) it uses the same datapath width as the standard CORDIC, even when t has 2n bits (to achieve a granularity of 2–n for the whole range). In contrast, the extended vectoring unit requires about 2n bits. (2) no repetitions of iterations are needed (the extended vectoring needs some repetitions). The proposed algorithm is compatible with the extended vectoring and, in contrast with previous implementations, the number of iterations and the delay of each iteration are the same as for the conventional CORDIC algorithm.  相似文献   

9.
High-computing speed and modularity have made RNS-based arithmetic processors attractive for a long time, especially in signal processing, where additions and multiplications are very frequent. The VLSI technology renewed this interest because RNS-based circuits are becoming more feasible; however, intermodular operations degradate their performance and a great effort results on this topic. In this paper, we deal with the problem of performing the basic operationX(modm), that is the remainder of the integer divisionX/m, for large values of the integerX, following an approximating and correcting approach, which guarantees the correctness of the result.We also define a structure to computeX(modm) by means of few fast VLSI binary multipliers, which is exemplified for 32-bit long numbers, obtaining a total response time lower than 200 nsec. Furthermore, such a structure is evaluated in terms of VLSI complexity and area and time figuresA=(n 2 T m 2 ) andT=(T M ) for the parameterT M in are derived. A simple positional-to-residue converter is finally presented, based on this structure; it improves some complexity results previously obtained by authors.This work has been supported by the National Program on Solid-State Electronics and Devices of the Italian National Research Council.  相似文献   

10.
Wang  S.Y.  Kung  H.T. 《Wireless Networks》2001,7(3):221-236
We propose using the TCP decoupling approach to improve a TCP connection's goodput over wireless networks. The performance improvement can be analytically shown to be proportional to , where MTU is the maximum transmission unit of participating wireless links and HP_Sz is the size of a packet containing only a TCP/IP header. For example, on a WaveLAN [32] wireless network, where MTU is 1500 bytes and HP_Sz is 40 bytes, the achieved goodput improvement is about 350%. We present experimental results demonstrating that TCP decoupling outperforms TCP reno and TCP SACK. These results confirm the analysis of performance improvement.  相似文献   

11.
In this paper the performance of soft interference cancellation in synchronous multiuser channels is investigated. Two-stage detection with decorrelator in the first stage is considered. Soft first stage tentative decisions are formed by means of a multilevel quantizer or linear clipper. The asymptotic multiuser efficiency (AME) is used as a performance measured for optimization of the nonlinearity. It is shown, for the two-user case, that soft interference cancellation by means of an optimized linear clipper achieves the AME of the maximum likelihood detector if the crosscorrelation of the signature waveforms is less than . For the K-user case, novel upper bounds on the AME for two-stage detectors with feedback interference cancellation, with both hard and soft tentative decisions, are obtained. The derived bounds are used to examine the performance of two-stage detectors in a cellular scenario.  相似文献   

12.
This paper describes the design of three high-performance op amps in a 40V BiCMOS technology. The first circuit is a low-noise op amp with MOS inputs. A thermal noise level as low as with a 1/f noise corner frequency of 100 Hz is achieved. For applications that can tolerate a lower input impedance, a more economical bipolar input low-noise op amp has been designed, yielding an even better noise performance for source impedances up to 20 k. The third circuit is an internally compensated high-gain-bandwidth (GBW=15 MHz) op amp that can drive loads from 0 to 20 pF. A fourth-order low-pass switched-capacitor filter making use of the latter op amp is discussed next. Finally the applications of this 40V BiCMOS process are illustrated.  相似文献   

13.
A fast algorithm for discrete hartley transform of arbitrary length   总被引:1,自引:0,他引:1  
DHT of length p~lq(p is odd and q is arbitrary) is turned into p~l DHTs of length qand some additional operations, while the additional operations only involves the computation ofcos-DFT and sin-DFT with length p. If the length of a DHT is p_1~(l_1)…P_N~(l_N)2~l(P_1…,P_N are oddprimes), a fast algorithm is obtained by the similar recursive technique. Therefore, the algorithmcan compute DHT of arbitrary length. The paper also Proves that operations for computingDHT of length N by the algorithm are no more than O(Nlog_2N), when the length is N=p~l,operations of the algorithm are fewer than that of other known algorithms.  相似文献   

14.
In this paper we investigate new Fourier series with respect to orthonormal families of directed cycles , which occur in the graph of a recurrent stochastic matrixP. Specifically, it is proved thatP may be approximated in a suitable Hilbert space by the Fourier series . This approach provides a proof in terms of Hilbert space of the cycle decomposition formula for finite stochastic matricesP.  相似文献   

15.
This paper is the last in a two-part sequence which studies nonlinear networks, containing capacitor-only cutsets and/or inductor-only loops, from the geometric coordinate-free point of view of the theory of differentiable manifolds. For such circuits, it is shown that (subject to certain assumptions) there is a naturally defined Lie group action of on the state space ofN, where 0 is the sum of the number of independent capacitor-only cutsets and the number of independent inductor-only loops. Circuit theoretic sufficient conditions on the reactive constitutive relations are derived for the circuit dynamics to be invariant under this Lie group action.This work was supported by the Natural Sciences and Engineering Research Council of Canada, under Grant Number A7113, and by scholarships from the Natural Sciences and Engineering Research Council of Canada and the Ontario Provincial Government.  相似文献   

16.
A 900 MHz low-power CMOS bandpassamplifier suitable for the applications of RFfront-end in wireless communication receiversis proposed and analyzed. In this design, thetemperature compensation circuit is used tostabilize the amplifier gain so that theoverall amplifier has a good temperaturestability. Moreover, the compact tunablepositive-feedback circuit is connected to theintegrated spiral inductor to generate thenegative resistance and enhance its value. The simple diode varactor circuit isadopted for center-frequency tuning. These twoimproved circuits can reduce the powerdissipation of the amplifier. An experimentalchip fabricated by 0.5 mdouble-poly-double-metal CMOS technologyoccupies a chip area of ; chip area. The measuredresults have verified the performance of thefabricated CMOS bandpass amplifier. Under a2-V supply voltage, the measured quality factoris tunable between 4.5 and 50 and the tunablefrequency range is between 845 MHz and 915 MHz. At , the measured is 20 dB whereas thenoise figure is 5.2 dB in the passband. Thegain variation is less than 4 dB in the rangeof 0–80°C. The dc powerdissipation is 35 mW. Suitable amplifier gain,low power dissipation, and good temperaturestability make the proposed bandpass amplifierquite feasible in RF front-endapplications.  相似文献   

17.
Creep behavior of eutectic Sn-Cu lead-free solder alloy   总被引:3,自引:0,他引:3  
Tensile creep behavior of precipitation-strengthened, tin-based eutectic Sn-0.7Cu alloy was investigated at three temperatures ranging from 303–393 K. The steady-state creep rates cover six orders of magnitude (10−3−10−8 s−1) under the stress range of σ/E=10−4−10−3. The initial microstructure reveals that the intermetallic compound Cu6Sn5 is finely dispersed in the matrix of β-Sn. By incorporating a threshold stress, σ th, into the analysis, the creep data of eutectic Sn-Cu at all temperatures can be fitted by a single straight line with a slope of 7 after normalizing the steady-state creep rate and the effective stress, indicating that the creep rates are controlled by the dislocation-pipe diffusion in the tin matrix. So the steady-state creep rate, , can be expressed as exp , where Qc is the activation energy for creep, G is the temperature-dependent shear modulus, b is the Burgers vector, R is the universal gas constant, T is the temperature, σ is the applied stress, A is a material-dependent constant, and , in which σ OB is the Orowan bowing stress, and kR is the relaxation factor. An erratum to this article is available at .  相似文献   

18.
Bipolar transistors are interesting for low noise front-end readout systems when high speed and low power consumption are required. This paper presents a fully integrated, low noise front-end design for the future Large Hadron Collider (LHC) experiments using the radiation hard SOI BiCMOS process. In the present prototype, the input-referred Equivalent Noise Charge (ENC) of 990 electrons (rms) for 12 pF detector capacitance with a shaping time of 25 ns and power consumption of 1.4 mW/channel has been measured. The gain of this front-end is 90 mV/MIP (Minimum Ionisation Particle: 1 fC) with non-linearity of less than 3% and linear input dynamic range is MIP. These results are obtained at room temperature and before irradiation. The measurements after irradiations by high intensity pion beam with an integrated flux of pions/cm2 are also presented in this paper.  相似文献   

19.
In this paper, we propose a reduced complexity and power efficient System-on-Chip (SoC) architecture for adaptive interference suppression in CDMA systems. The adaptive Parallel-Residue-Compensation architecture leads to significant performance gain over the conventional interference cancellation algorithms. The multi-code commonality is explored to avoid the direct Interference Cancellation (IC), which reduces the IC complexity from to . The physical meaning of the complete versus weighted IC is applied to clip the weights above a certain threshold so as to reduce the VLSI circuit activity rate. Novel scalable SoC architectures based on simple combinational logic are proposed to eliminate dedicated multipliers with at least saving in hardware resource. A Catapult C High Level Synthesis methodology is apply to explore the VLSI design space extensively and achieve at least speedup. Multi-stage Convergence-Masking-Vector combined with clock gating is proposed to reduce the VLSI dynamic power consumption by up to This paper was presented in part at IEEE ISCAS in Vancouver, Canada, May, 2004.  相似文献   

20.
An algorithm is presented to compute the variance of the output of a two-dimensional (2-D) stable auto-regressive moving-average (ARMA) process driven by a white noise bi-sequence with unity variance. Actually, the algorithm is dedicated to the evaluation of a complex integral of the form , where and G(z1,z2) = B(z1, z2) / A(z1, z2) is stable (z1,z2)-transferfunction. Like other existing methods, the proposed algorithmis based on the partial-fraction decomposition G(z1,z2)G(z 1 -1 , z 2 -1 ) = X(z1, z1) / A(z1,z2)+ X(z 1 -1 , z 2 -1 ) / A(z 1 -1 , z 2 -1 ). However,the general and systematic partial-fraction decomposition schemeof Gorecki and Popek [1] is extended to determine X(z1,z2).The key to the extension is that of bilinearly transforming thediscrete (z1, z2)-transfer function G(z1,z2)into a mixed continuous-discrete (s1, z2)-transferfunction . As a result, the partial-fraction decomposition involves only efficient DFT computations for the inversion of a matrix polynomial, and the value of I is finally determined by the residue method with finding the roots of a 1-D polynomial. The algorithm is very easy to implement and it can be extended to the covariance computation for two 2-D ARMA processes.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号