PaperID 74S201921
PaperID 74S201921
PaperID 74S201921
Abstract: With the recent rapid advances in multimedia and communication systems, real-time signal processing like audio
signal processing, video/image processing, or large-capacity data processing are increasingly being demanded. The
multiplier and multiplier-and-accumulator (MAC) are the essential elements of the digital signal processing such as filtering,
convolution, transformations and Inner products. There are different entities that one would like to optimize when designing
a VLSI circuit. These entities can often not be optimized simultaneously, only improve one entity at the expense of one or
more others The design of an efficient integrated circuit in terms of power, area, and speed simultaneously, has become a
very challenging problem. Power dissipation is recognized as a critical parameter in modern the objective of a good
multiplier is to provide a physically compact, good speed and lowpower consuming chip.
This paper proposes a new architecture of multiplier-and-accumulator (MAC) for high speed and low-power by
adopting the new SPST implementing approach. This multiplier is designed by equipping the Spurious Power Suppression
Technique (SPST) on a modified Booth encoder which is controlled by a detection unit using an AND gate. The modified
booth encoder will reduce the number of partial products generated by a factor of 2. The SPST adder will avoid the
unwanted addition and thus minimize the switching power dissipation. By combining multiplication with accumulation and
devising a low power equipped carry save adder (CSA), the performance was improved.
In this project we used Modelsim for logical verification, and further synthesizing it on Xilinx-ISE tool using target
technology and performing placing & routing operation for system verification on targeted FPGA.
87
International Journal of Research in Advent Technology, Vol.7, No.4S, April 2019
E-ISSN: 2321-9637
Available online at www.ijrat.org
(product). In elementary school, students learn to multiply In the binary number system the digits, called
by placing the multiplicand on top of the multiplier. The bits, are limited to the set [0, 1]. The result of multiplying
multiplicand is then multiplied by each digit of the any binary number by a single binary bit is either 0, or the
multiplier beginning with the rightmost, Least Significant original number. This makes forming the intermediate
Digit (LSD). Intermediate results (partial products) are partial-products simple and efficient. Summing these
placed one atop the other, offset by one digit to align digits partial-products is the time consuming task for binary
of the same weight. The final product is determined by multipliers. One logical approach is to form the partial-
summation of all the partial-products. Although most people products one at a time and sum them as they are
think of multiplication only in base 10, this technique generated. Often implemented by software on processors
applies equally to any base, including binary. Figure 1.1 that do not have a hardware multiplier, this technique
shows the data flow for the basic multiplication technique works fine, but is slow because at least one machine cycle
just described. Each black dot represents a single digit. is required to sum each additional partial-product. For
applications where this approach does not provide enough
performance, multipliers can be implemented directly in
hardware. The two main categories of binary
multiplication include signed and unsigned numbers.
Digit multiplication is a series of bit shifts and series of
bit additions, where the two numbers, the multiplicand
and the multiplier are combined into the result.
Considering the bit representation of the multiplicand x =
xn-1…..x1 x0 and the multiplier y = yn-1…..y1y0 in
order to form the product up to n shifted copies of the
multiplicand are to be added for unsigned multiplication.
The entire process consists of three steps, partial product
Figure 1.1: basic Multiplication generation, partial product reduction and final addition.
Here, we assume that MSB represent the sign of
the digit. The operation of multiplication is rather simple in
digital electronics. It has its origin from the classical
algorithm for the product of two binary numbers. This
algorithm uses addition and shift left operations to calculate
the product of two numbers. Based upon the above
procedure, we can deduce an algorithm for any kind of
multiplication which is shown in figure 1.2. We can check
at the initial stage also that whether the product will be
positive or negative or after getting the whole result, MSB
of the results tells the sign of the product.
88
International Journal of Research in Advent Technology, Vol.7, No.4S, April 2019
E-ISSN: 2321-9637
Available online at www.ijrat.org
multiplicand and an m-bit multiplier, a product with n + m role of increasing the speed to add the partial products. To
bits long and m partial products can be generated. The increase the speed of the MBA algorithm, many parallel
method shown in figure 1.3 is also called a non-Booth multiplication architectures have been researched .Among
encoding scheme. them, the architectures based on the Baugh–Wooley
algorithm (BWA) have been developed and they have
been applied to various digital filtering calculations.
The N-bit 2‟s complement binary number can be
expressedas
……..(1)
If (1) is expressed in base-4 type redundant sign digit
form in order to apply the radix-2 Booth‟s algorithm.
……………………………………..(2)
……………………(3)
Fig 1.4: Multiplication 0peration in hardware If (2) is used, multiplication can be expressed as
4. MULTIPLIER-ACCUMULATOR (MAC)
UNIT
In the majority of digital signal processing (DSP)
applications the critical operations are the multiplication and ………………………………(4)
accumulation. Real-time signal processing requires high If these equations are used, the afore-mentioned
speed and high throughput Multiplier-Accumulator (MAC) multiplication–accumulation results can be expressed as
unit that consumes low power, which is always a key to
achieve a high performance digital signal processing
system. The purpose of this work is to design and
implementation of a low power MAC unit with block
enabling technique to save power. Firstly, a 1-bit MAC unit …….(5)
is designed, with appropriate geometries that give optimized Each of the two terms on the right-hand side of
power, area and delay. The delay in the pipeline stages in (5) is calculated independently and the final result is
the MAC unit is estimated based on which a control unit is produced by adding the two results. The MAC
designed to control the data flow between the MAC blocks architecture implemented by (5) is called the standard
for low power. Similarly, the N-bit MAC unit is designed design [6].
and controlled for low power using a control logic that If radix-2 Booth encoding is used, the number of
enables the pipelined stages at appropriate time. The adder partial products, is reduced to half, resulting in the
cell designed has advantage of high operational speed, small decrease in Addition of Partial Products step. In addition,
Gate count and low power. the signed multiplication based on 2‟s complement
In general, a multiplier uses Booth‟s algorithm and numbers is also possible. Due to these reasons, most
array of full adders (FAs), or Wallace tree instead of the current used multipliers adopt the Booth encoding.
array of FA‟s., i.e., this multiplier mainly consists of the
three parts: Booth encoder, a tree to compress the partial
products such as Wallace tree, and final adder. Because
Wallace tree is to add the partial products from encoder as
parallel as possible, its operation time is proportional to,
where is the number of inputs. It uses the fact that counting
the number of 1‟s among the inputs reduces the number of
outputs into. In real implementation, many (3:2) or (7:3)
counters are used to reduce the number of outputs in each
pipeline step. The most effective way to increase the speed
of a multiplier is to reduce the number of the partial
products because multiplication precedes a series of
additions for the partial products. To reduce the number of
calculation steps for the partial products, MBA algorithm
has been applied mostly where Wallace tree has taken the
89
International Journal of Research in Advent Technology, Vol.7, No.4S, April 2019
E-ISSN: 2321-9637
Available online at www.ijrat.org
Fig 1.5: Simple Multiplier and Accumulator signed digits, -2, -1, 0, +1, +2. Each encoded digit in the
Architecture multiplier performs a certain operation on the
5. HIGH-SPEED BOOTH ENCODED multiplicand, X, as illustrated in Table 1
PARALLELMULTIPLIER DESIGN
90
International Journal of Research in Advent Technology, Vol.7, No.4S, April 2019
E-ISSN: 2321-9637
Available online at www.ijrat.org
91
International Journal of Research in Advent Technology, Vol.7, No.4S, April 2019
E-ISSN: 2321-9637
Available online at www.ijrat.org
Fig. shows the generated partial products and sign extension 7 . RESULTS AND DISCUSSION
scheme of the 8-bit modified Booth multiplier. The partial
products generated by the modified Booth algorithm are
added in parallel using the Wallace tree until the last two
rows are remained. The final multiplication results are
generated by adding the last two rows. The carry
propagation adder is usually used in this step.
8. SYNTHESIS RESULT
92
International Journal of Research in Advent Technology, Vol.7, No.4S, April 2019
E-ISSN: 2321-9637
Available online at www.ijrat.org
10. CONCLUSION
93