Efficient Hardware Implementation of Hybrid Cosine-Fourier-Wavelet Transforms On A Single FPGA
Efficient Hardware Implementation of Hybrid Cosine-Fourier-Wavelet Transforms On A Single FPGA
Efficient Hardware Implementation of Hybrid Cosine-Fourier-Wavelet Transforms On A Single FPGA
I.
INTRODUCTION
II.
2325
2
m(2n +1)
km cos
where,{m, n} = 0,1,..., N 1
N
2N
[CN ]m,n =
(1) [ F ] = ([ Pr ] )1 F~
N
N
= [ Pr ]N
j = 1,2,...., N 1
and k = 1,
[ Pr ]2 = [ I ]2
pri , j = 1, if i = 2 j , 0 j N / 2 1
pri , j = 1, if i = (2 j + 1) mod N , N / 2 j N 1, and
pr = 0, others
i, j
0
IN / 4
0
0
0
0
,N 4
IN / 4
0
0
0
IN / 4
(3)
N 1
X ( n ) = x ( m) W nm , 0 n N 1
(5)
m=0
Where W = e
[ ]
2
N
denoted by F N = W
N . Using similar approach defined
in the previous section, the general recursive form for the DFT
matrix can be represented by the following (6):
nm
I N /2
I N /2
W0
W2
W0
W0 W2
W0 W0
W W
2
0
W0 W0
W0 W2
W0
W0
W0
[ 0]
W0
W0
W0
W0
W0
W2
W0
W2
(7)
[ I4 ]
[0]
W
0
0
0
[I ] [I ]
0
4
4
0 W1 0
0
[ I 4 ] [ I 4 ]
[ 0]
0 0 W2 0
0 0 0 W3
0
r
0
0
K (i, 0) = (1)i + j . 2
Here, [ K ] =
K (i, j ) = (1)i + j .2 , when i j , j 0
ij
otherwise
0,
N 1
I 2 I N /2
0 I N /2
...
I 2 0 WN /2 I N /2
W0 W0
W0 W2
W0 W0
[ I 4 ][ 0] W0 W2
[Pr]8F
F
[ 0][ Pr ]4
[ 0]
0
I2
I
W2 N /4 I 2
0
[ I F2 ]
Pr2 N /2
I N /2
0
I2 0
(4)
... I N /4 0 K [ I N /2 C2 ]
0
K
N
N /2
2
0
I I
0 I
I
I
I
1
[ I N /4 02 D I N /4 I2 I2 ... N0/2 D I N /2 IN /2 ( Pc
N )
N /2 N /2
N /2
2
2 2
[CN ] = Pr
0
I
... I 2
PrN /2 N /4 0
I N /2
0
(2)
,N 4
Where,
I N / 4
0
=
0
[ I N /4 02
j = 0, N
0.5 ,
(6)
r3
r3
r3
r3
r3
r3
r
0
r2
0
r
0
r 2
0
r
r2
r
r 2
r
r 2
r
0
0
r
0
0
0
0
r2
0
0
0
0
0
0
0
r
0
r
0
0
r
r3
r 3
(8)
0
r 2
1
where, r =
= C4
0
2
0
0
C4
[ I 4 ][ 0]
W1
C
[Pr]8
W1
4
[0] [ Pr ]4 [ 0]
1
[ 0]
C 4 C4
C 4 C4
1 0
0
C4
C4
(9)
[ I 4 ][ 0] [ I 4 ] [ I 4 ]
W2
[Pr]8
W2
[ 0][ Pr ]4 [ I 4 ] [ I 4 ]
2326
III.
HARDWARE IMPLEMENTATION
X0
X1
X2
X3
X4
X5
X6
X7
Data_in
[7:0]
DMUX
1x2
C
o
n
t
r
o
l
CF_2
CFW_2
C_3
C
o
n
t
r
o
l
C
o
n
t
r
o
l
CFW_4
Buffer
t_select
[1:0]
CFW_6
B
u
f
f
e
r
F_6
B
u
f
f
e
r
CF_4
C
o
n
t
r
o
l
C_5
F_5
C
o
n
t
r
o
l
data_out_real
[11:0]
data_out_img
[11:0]
Controller
(a) DEMUX
(g) CF_2
(h) F_6
(i) CFW_6
(j) C_3
(b) CF_1
C1
C3
C5
C7
(c) C_5
(d) F_5
(f) CFW_2
2327
HARDWARE SYNTHESIS
LUT
(utilization)
Registers
(utilization)
Frequency
(MHz)
N/A
2073 (19%)
1476 (N/A)
118
Technology
Altera Stratix
(EP1S10FC780)
Altera Cyclone II
(EP2C35FC672)
Xilinx Virtex4
(XCV4LX15SF363)
Actel Fusion
(AFS090)
TABLE II.
Topology
Standalone
Hybrid
N/A
1941 (N/A)
1476 (4%)
107
N/A
2017 (16%)
1479 (12%)
95
4338
(5%)
N/A
N/A
30
M+A
LUT
Registers
DCT
8 + 30
1118
1094
DFT
3 + 32
1073
887
DWT
1 + 14
764
800
Total
12 + 76
2955
2781
DCT-DFTDWT
10 + 38
2073
1476
17 + 50
30
47
V.
CONCLUSIONS
M = Multiplications; these fixed coefficient multipliers have been implemented with adders and
shifters only which makes the entire architecture multiplication-free; A = Additions
2328