# Performance Comparison of Vedic Multiplier and Booth Multiplier

### Anju

Abstract -The performance of the any processor will depend upon its power and delay. The power and delay should be less in order to get a effective processor. In processors the most commonly used architecture is multiplier. If the power and delay of the multiplier is reduced then the effective processor can be generated. In this paper Vedic Multiplier and Booth Multiplier are implemented on FPGA platform and comparative analysis is done. The comparison of these Architectures is carried out to know the best architecture for multiplication w. r. t. power and delay characteristics. The designs are implemented using VHDL in Modelsim 10.1 b and synthesis is done in Xilinx 8.2i ISE.

Index Terms – Booth multiplier, Urdhva Tiryagbhyam, Vedic multiplier, Xilinx.

### I. INTRODUCTION

Multiplication is an important function in an ALU. Since, multiplication dominates the execution time of most DSP algorithms; therefore high speed multiplier is much desired [1]-[4]. Currently, multiplication time is still the dominant factor in determining the instruction cycle time of a DSP chip. With an ever-increasing quest for greater computing power on battery operated mobile devices, design emphasis has shifted from optimizing conventional delay, time, area and size to minimizing power dissipation while still maintaining the high performance [5]-[7]. The low power and high speed VLSI can be implemented with different logic styles. The three important considerations for VLSI design are power, area and delay. We have applied Urdhva Tiryabhyam sutra to binary multiplier using carry save adders.

### II. BOOTH MULTIPLIER

Booth algorithm gives a procedure for multiplying binary integers in signed 2's complement representation on. It operates on the fact that strings of 0's in the multiplier require no addition but just shifting and a string of 1's in the multiplier from bit weight  $2^k$  to weight  $2^m$  can be treated as  $2^{K+1}-2^m$ . Fig 1 shows the hardware implementation for Booth's algorithm. It consists of an n-bit adder, control logic and four registers A, B, Q and  $Q_{-1}$ . Multiplier and

Multiplicand are loaded into register Q and register B respectively Register A and  $Q_{\text{-}1}$  are initially set to 0.

The n-bit adder performs addition, inputs of adders comes from multiplicand and content of register A. In case of addition, Add/Sub line is 0, therefore,  $C_{\rm in}=0$  and multiplicand is directly applied as a second input to the n-bit adder. In case of subtraction, Add/sub line is 1, therefore  $C_{\rm in}=1$  and multiplicand is complemented and then applied to the n-bit adder. As a result, the 2's complement of the multiplicand is added to the contents of register A.

### Manuscript Received on July, 2013.

**Ms. Anju, M. Tech Scholar**, Electronics & Communication Engg., IMS Engineering College, NH – 24, Adhyatmik Nagar, Ghaziabad, U.P., India.



Fig. 1 Hardware implementation of signed binary multiplication for booth's algorithm

The control logic Scans bits  $Q_0$  and  $Q_{-1}$  one at a time and generates the control signals to perform the corresponding function. If the two bits are same (1 1 or 0 0), then all the bits of A, Q and  $Q_{-1}$  registers are shifted to right I-bit without addition or subtraction (Add/Subtract Enable = 0). If the two bits differ, then the multiplicand is added to or subtracted from the A register, depending on the status of bits. After addition or subtraction right shift occurs such that the left most bit of A  $(A_{n-1})$  is not only shifted into  $A_{n-2}$ , but also remains in  $A_{n-1}$ . This is required to preserve the sign of the number in A and Q.

The flow chart of Booth's algorithm for signed multiplication is shown in Fig 2.



Fig. 2 Flow chart of Booth's algorithm for signed multiplication

Example-1 0101(5) x 0100(4)

| Multiplicand(B) ← 0101 |      | Multiplier(Q) | Multiplier(Q) ← 0100 |              |  |
|------------------------|------|---------------|----------------------|--------------|--|
| Steps                  | A    | A Q           |                      | Opretion     |  |
|                        | 0000 | 0 1 0 0<br>G  | 20 Q                 | ,            |  |
| Step 1                 | 0000 | 0100          | ۰ م                  | Skip add/sub |  |
|                        | 0000 | 0010          | 0                    | Right shift  |  |
| Step 2                 | 0000 | 0010          | -00                  | Skip add/sub |  |
|                        | 0000 | 0001          | 6                    | Right shift  |  |
| Step 3                 | 1011 | 0001          | ~1,0                 | A≪-A-B       |  |
|                        | 1101 | 1000          | À                    | Right shift  |  |
| Step 4                 | 0010 | 1000          | ~0 1                 | A←A+B        |  |
|                        | 0001 | 0100          |                      | Right shift  |  |
| Result                 | 0001 | 0100          |                      |              |  |

Final Product 0101 x 0100 = 00010100

Here form the above example it can be proved that the addition/subtraction operation can be skipped if the successive bits in the multiplicand are same.

#### III. URDHAVA MULTIPLIER

The use of Vedic mathematics lies in the fact that it reduces the typical calculations in conventional mathematics to very simple ones. This is so because the Vedic formulae are claimed to be based on the natural principles on which the human mind works [9]-[10]. Vedic Mathematics is a methodology of arithmetic rules that allow more efficient speed implementation [11]-[12].

UrdhvaTiryakbhyam Sutra is a general multiplication formula applicable to all cases of multiplication. It means "Vertically and Crosswise" [13]-[16]. The digits on the two ends of the line are multiplied and the result is added with the previous carry. When there are more lines in one step, all the results are added to the previous carry. The least significant digit of the number thus obtained acts as one of the result digits and the rest act as the carry for the next step. Initially the carry is taken to be as zero. The line diagram for multiplication of two 4-bit numbers is as shown in Fig 3.



Fig. 3 Line diagram for multiplication of two 4-Bit Number

Now we will extend this Sutra to binary number system. For the multiplication algorithm, let us consider the multiplication of two 8 bit binary numbers  $A_7A_6A_5A_4A_3A_2A_1A_0$  and  $B_7B_6B_5B_4B_3B_2B_1B_0$ . As the result of this multiplication would be more than 8 bits, we

express it as ... $R_7R_6R_5R_4R_3R_2R_1R_0$ . As in the last case, the digits on the both sides of the line are multiplied and added with the carry from the previous step. This generates one of the bits of the result and a carry. This carry is added in the next step and hence the process goes on. If more than one lines are there in one step, all the results are added to the previous carry. In each step, least significant bit acts as the result bit and all the other bits act as carry. For example, if in

some intermediate step, we will get 011, then I will act as result bit and 01 as the carry. Thus we will get the following expressions [12]-[13].

 $R_0 = A_0 B_0$ 

 $C_1R_1 = A_0B_1 + A_1B_0$ 

 $C_2R_2=C_1+A_0B_2+A_2B_0+A_1B_1$ 

 $C_3R_3 = C_2 + A_3B_0 + A_0B_3 + A_1B_2 + A_2B_1$ 

 $C_4R_4 = C_3 + A_4B_0 + A_0B_4 + A_3B_1 + A_1B_3 + A_2B_2$ 

 $C_5R_5 = C_4 + A_5B_0 + A_0B_5 + A_4B_1 + A_1B_4 + A_3B_2 + A_2B_3$ 

 $C_6R_6 = C_5 + A_6B_0 + A_0B_6 + A_5B_1 + A_1B_5 + A_4B_2 + A_2B_4 + A_3B_3$ 

 $C_7R_7 = C_6 + A_7B_0 + A_0B_7 + A_6B_1 + A_1B_6 + A_5B_2 + A_2B_5 + A_4B_3 + A_3B_4$ 

 $C_8R_8=C_7+A_7B_1+A_1B_7+A_6B_2+A_2B_6+A_5B_3+A_3B_5+A_4B_4$ 

 $C_9R_9=C_8+A_7B_7+A_5B_7+A_6B_3+A_3B_6+A_5B_4+A_4B_5$ 

 $C_{10}R_{10} = C_9 + A_7B_3 + A_3B_7 + A_6B_4 + A_4B_6 + A_5B_5$ 

 $C_{11}R_{11}\!\!=\!\!C_{10}\!\!+\!\!A_7B_4\!\!+\!\!A_4B_7\!\!+\!\!A_6B_5\!\!+\!\!A_5B_6$ 

 $C_{12}R_{12} = C_{11} + A_7B_5 + A_5B_7 + A_6B_6$ 

 $C_{13}R_{13}=C_{12}+A_7B_6+A_6B_7$ 

C<sub>14</sub>R<sub>14</sub>=C<sub>13</sub>+A<sub>7</sub>B<sub>7</sub>

 $C_{14}R_{14}R_{13}R_{12}R_{11}R_{10}R_9R_8R_7R_6R_5R_4R_3R_2R_1R_0$  being the final product [14]. Hence this is the general mathematical formula

applicable to all cases of multiplication.

## IV. FPGA IMPLEMENTATION AND EXPERIMENTAL RESULTS

In this work, 8x8 bit Vedic multiplier and 8x8 bit Booth multiplier are designed in VHDL (very High speed Integrated Circuits Hardware Description Language).

Logic simulation was done in Modelsim 10.1 b. Table 1 & 2 and Figs 4(a) & 5(b), show the synthesis and simulation results of 8x8 bit Booth multiplier and 8x8 bit Vedic multiplier respectively.

When multiplicand, booth multiplier\_8 bit/mplicand\_in = 11111111(225decimal) and multiplier, booth\_multiplier\_8 bit/mplier\_in=111111111 (225 decimal), we get the result 16-bit output booth multiplier\_8 bit/product = 1111111100 000001 (65025.)

For 8x8 bit Vedic multiplier when input multiplicand = 10111111(191 decimal) and multiplier = 11111111 (225 decimal). Product of 1011111 (191) x 11111111 (255) = 1011111001000001 (48705)

TABLE 1: Synthesis Result of 8x8 bit Vedic multiplier

| Device utilization Summary (estimated values) |      |           |             |  |  |  |
|-----------------------------------------------|------|-----------|-------------|--|--|--|
| Logic utilization                             | used | Available | Utilization |  |  |  |
| Number of Slices                              | 174  | 192       | 90%         |  |  |  |
| Number of slice Flip Flop                     | 133  | 384       | 34%         |  |  |  |
| Number of 4 input LUTs;                       | 326  | 384       | 84%         |  |  |  |
| Number of Bounded 10 Bs;                      | 67   | 90        | 74%         |  |  |  |
| Number of GCLKs                               | 1    | 4         | 25%         |  |  |  |

TABLE 2: Synthesis Result of 8x8 bit Booth multiplier

| Device utilization Summary (estimated values) |      |           |             |  |  |  |
|-----------------------------------------------|------|-----------|-------------|--|--|--|
| Logic utilization                             | used | Available | Utilization |  |  |  |
| Number of Slices                              | 49   | 192       | 25%         |  |  |  |
| Number of slice Flip Flop                     | 48   | 384       | 12%         |  |  |  |



| Number of 4 input LUTs;                     |                    |         | 84               |     | 384          |                  | 21% |  |  |
|---------------------------------------------|--------------------|---------|------------------|-----|--------------|------------------|-----|--|--|
| Number of Bounded 10 Bs;                    |                    |         | 34               |     | 90           |                  | 37% |  |  |
| Number of GCLKs                             |                    |         | 1                |     | 4            |                  | 25% |  |  |
| // /booth_multiplier_8bit/dk                |                    |         |                  |     |              | 1                |     |  |  |
| <u>+</u> → /booth_multiplier_8bit/mplicand_ | <u>i</u> n   11111 | 111     |                  | 111 | 1111         |                  |     |  |  |
|                                             |                    | 111     |                  | 111 | 11111        |                  |     |  |  |
| /booth_multiplier_8bit/product              | 00111              | 1111000 | 000001           | 001 | 111111000000 | 001              |     |  |  |
| <u>→</u> /booth_multiplier_8bit/product_t   | temp 00111         | 1111000 | 000001           | 001 | 111111000000 | 001              |     |  |  |
| //booth_multiplier_8bit/partial_product     |                    | 1111    |                  | 001 | 11111        |                  |     |  |  |
| 7 A Month multiplier Shit/molicand          | 01111              | 1111    |                  | 011 | 11111        |                  |     |  |  |
| A. I. S. Drawler                            |                    | (8      | a)               |     |              |                  |     |  |  |
| /vedic_mult8x8/rst 1                        |                    |         |                  |     |              |                  |     |  |  |
| /vedic_mult8x8/start 1                      |                    |         |                  |     |              |                  |     |  |  |
| /vedic_mult8x8/dk 1                         |                    |         |                  |     |              |                  |     |  |  |
| +> /vedic_mult8x8/mplic 10                  | 111111             |         | 10111111         |     |              |                  |     |  |  |
|                                             | 1111111            |         | 11111111         |     |              |                  |     |  |  |
|                                             |                    | 01      | 0000000000000000 |     |              | 1011111001000001 |     |  |  |
|                                             | 1100001            |         | 11100001         |     |              |                  |     |  |  |
|                                             | 1100001            |         | 11100001         |     |              |                  |     |  |  |
| <b>1 1 1 1 1 1 1 1 1 1</b>                  | 100101             |         | 10100101         |     |              |                  |     |  |  |
|                                             |                    |         | h)               |     |              |                  |     |  |  |

Fig 4 : Simulation results of 8×8 bit Vedic and Booth Multipliers

### V. EXPERIMENTAL RESULTS

Table 3 compares the result of area, delay speed and power of Booth and Urdhava Tiryakbhyam multipliers. The result shows that Urdhava Tiryakbhyam multiplier is the best multiplier compare to Booth multiplier in terms of delay and power consumptions (Fig. 5)

**TABLE 3: Result For 8x8 Multiplier** 

| FPGA<br>Device<br>Package<br>XC2550<br>TQ144 | Area in<br>LUT's<br>in % | Delay<br>(ns) | Speed<br>(MH <sub>z</sub> ) | Memory<br>(Kb) | Power (mW) |
|----------------------------------------------|--------------------------|---------------|-----------------------------|----------------|------------|
| Booth<br>Multiplier                          | 21                       | 11.176        | 89.477                      | 156440         | 11.30      |
| Urdhava<br>Multiplier                        | 84                       | 8.460         | 118.203                     | 166744         | 9.29       |



Fig 5a: Area Comparison for (8x8) Area in LUTs (%)



Fig. 5b Delay Comparison for (8x8)



Fig. 5c Speed Comparison for (8x8)



Fig. 5d Power Comparison for (8x8)

### VI. CONCLUSION

It can be concluded that Urdhava Multiplier is superior in respect of speed, delay and power consumption. The power of Vedic Mathematics can be explored to implement high performance multiplier in different VLSI applications.

### **REFERENCES**

- R. P.Rajput and M. N. S. Swamy, "High Seed Modified Booth Encoder multiplier for signed and unsigned numbers", 14<sup>th</sup> Internaitonal Conference on Modeling and simulation, 2012 IEEE, pp. 649-654.
- R. K. Lamte and Prof. Bhasker, "Speedy Convolution using Vedic Mathematics", International Journal of Recent Trends in Engineering and Technology, Vol-05, No-01, March 2011.
- 3. J. Rao M, and S. Dubey, "A high speed and Area Efficient Booth Recoded Wallace tree Multiplier for fast Arithmetic Circuits," *Asia Pacific Conference on postgraduate Research in Microelectronics & Electronics* (PPIM EASIA) 2012.
- Ch. H. Kumar "Implementation and Analysis of Power, Area and Delay of Array Urdhava, Nikhilam Vedic Multipliers," *Internatinal Journal of Scientific and Research Publications*, Volume 3, Issue 1 January 2013, ISSN 2250 – 3153,pp.1-5.



- N. Mittal and A. Kumar, "Hardware Implementation of FFT using vertically and crosswise Algorithm",, International Journal of Computer Application (0975-8887), Volume-35- No-1, December 2011
- L. Sriraman and T.N. Prabakar." Design and Implementation of two variable Multiplier using KCM and Vedic Mathematics", Ist International Conference on Recent Advances in Information Technology, 2012 IEEE.
- H. Thapliyal, M. B. Srinivas and H. R. Arabnia, "Design And Analysis of a VLSI Based High Performance Low Power Parallel square Architecture", in Proc. Int. Conf. Also. Math. Compo. Sc., Las Vegas, June 2005, pp. 72-76.
- 8. V. C. Bhaaskaran, V.S.K. Brindha, B. Sakthikumarna, S. Kavinilavu, V. Bhaskar, B. Kanagaasabapathy M. and Sharath, B. "A novel low power and high speed Wallace tree multiplier for RISC Processer," 3<sup>rd</sup> International Conference on Electronics Computer Technology (ICECT), 2011, Vol-1, April 2010, pp. 8-10.
- 9. C. Ping-hua and Z. Juan, "High-Speed Parallel 32x32-bit Multiplier Using a Radix-16 Booth Encoder", *Third International Symposium on intelligent Information Technology Application Workshop*, 2009. IITAW 09, 21-22 Nov. 2009, pp.406-9.
- Swami Bharati Krshna Tirthaji," Vedic Mathematics". Delhi: Motilal Banarsidass Publishers, 1965, pp. 5-8.
- A. Haveliya "A Novel Design for High Speed Multiplier .for Digital Signal Processing Applications (Ancient Indian Vedic mathematics approach)" *International Journal of Technology And Engineering* System(IJTES):Jan - March 2011- Vo12 .No. 1, pp.27-31.
- H.S.Dhillon and A.Mitra "A Digital Multiplier Architecture using Urdhava Tiryakbhyam Sutra of Vedic Mathematics" IEEE Conference Proceedings, 2008.
- P/ Mehta and D. Gawali "Conventional versus Vedic mathematical method for Hardware implementation of a multiplier",2009, International Conference on Advances in Computing, Control, and Telecommunication Technologies, pp. 640-42.
- H. Thapliyal, S. Kotiyal and M.B. Srinivas, "Design and Analysis of a Novel Parallel Square and Cube Architecture Based on Ancient Indian Vedic Mathematics", Proceedings on 48th IEEE International Midwest Symp-osium on Circuits and Systems (MWSCAS 2005), Hyderabad, vol. 2, pp. 1462-65. S
- S.Akhtar, "VHDL Implementation of Fast NxN multiplier Base on Vedic Mathematics," *Jaypee Institute of Information Technology* <u>University, Noida</u>, 2011307 U.P, India, 2007 IEEE, pp. 472-75..
- P. Verma and K.K. Mehta, "Implementation of an efficient multiplier based on Vedic Mathematics using EDA Tool", *International Journal of Engineering and Advance Technology* (IJEAT) ISSN: 2249-8958, volume-1, Issue -5, June 2012, pp.75-79

### **AUTHOR PROFILE**



**Ms. Anju** is M. Tech student at IMS Engineering College, Ghaziabad, affiliated to Mahamaya Technical University.

