A Novel Method of Compressing Speech With Higher Bandwidt
A Novel Method of Compressing Speech With Higher Bandwidt
A Novel Method of Compressing Speech With Higher Bandwidt
com
BANDWIDTH EFFICIENCY
Abstract:
This paper illustrates a novel method of speech compression and
transmission. This method saves the transmission bandwidth required for the speech
signal by a considerable amount. This scheme exploits the property of low pass nature
of the speech signal. Also this method applies equally well for any signal, which is
low pass in nature, speech being the more widely used in Real Time Communication,
is highlighted here.
As per this method, the low pass signal (speech) at the transmitter is divided into set
of packets, each containing, say N number of samples. Of the N samples per packet,
only certain lesser number of samples, say αN alone are transmitted. Here α is less
than unity, so compression is achieved. The N samples per packet are subjected to a
N-Point DFT. Since low pass signals alone are considered here, the number of
significant values in the set of DFT samples is very limited. Transmitting these
significant samples alone would suffice for reliable transmission. The number of
samples, which are transmitted, is determined by the parameter α.
The parameter α is almost independent of the source of the speech signal. In other
methods of speech compression, the specific characteristics of the source such as pitch
are important for the algorithm to work.
An exact reverse process at the receiver reconstructs the samples. At the receiver, the
N-point IDFT of the received signal is performed after necessary zero padding. Zero
padding is necessary because at the transmitter of the N samples only αN samples are
transmitted, but at the receiver N samples are again needed to honestly reconstruct the
signal.
Hence this method is efficient as only a portion of the total number of samples is
transmitted thereby saving the bandwidth. Since the frequency samples are
transmitted the phase information has also to be transmitted. Here again by
exploiting the property of signals and their spectra that the PHASE INFORMATION
CAN BE EMBEDDED WITHIN THE MAGNITUDE SPECTRUM by using simple
mathematics without any heavy computations or by increasing the bandwidth.
Also the simulation result of this method shows that smaller the size of the packet the
more faithful is the reproduction of received signal that is again an advantage as the
computation time is reduced. The reduction in the computation time is due to the fact
that the transmitter has to wait until N samples are obtained before starting the
1
Speech Compression - a novel method www.seminarson.com
transmission. If N is small, the transmitter has to wait for a less duration of time and a
smaller value of N achieves a better reconstruction at the receiver.
Thus this scheme provides a more efficient method of speech compression and this
scheme is also very easy to implement with the help of available high-speed
processors.
2
Speech Compression - a novel method www.seminarson.com
A NOVEL METHOD OF
COMPRESSING SPEECH WITH
HIGHER BANDWIDTH EFFICIENCY
INTRODUCTION:
The same method has also been used in the compression of static images also.
3
Speech Compression - a novel method www.seminarson.com
We are trying to introduce another scheme that utilizes the properties of speech
signals and transmits at a lower bit rate and reconstructs the signal back with less
distortion.
Their power spectrum approaches zero for zero frequency and reaches a peak
in the neighborhood of few hundred Hertz.
The above properties of the speech signal have enabled us to devise a new method
of speech compression
4
Speech Compression - a novel method www.seminarson.com
DESCRIPTION:
5
Speech Compression - a novel method www.seminarson.com
this requires twice the actual bandwidth .This problem can be solved by
exploiting the property of real and even signals. The spectrum of the samples is real
and evenliness is artificially introduced such that their spectra are also real and even.
Thus by simple mathematics the complete phase information is embedded within the
magnitude spectrum and it is needed only to send ‘αN’ samples instead of
‘2N’samples of the spectra (Magnitude and phase).
By adopting all these procedures and embedding the phase information in the
magnitude spectrum, a MATLAB simulation has been performed to determine the
optimum value of ‘α’ and ’N’. The result of the simulation is also provided.
ALGORITHM:
Divide the speech samples into a set of packets each of size ‘N’.
Select only ‘αN’ number of samples of each packet and transmit it.
Follow a similar reverse process at the receiver to reconstruct the signal. (After
doing appropriate zero padding).
The inverse Fourier transform of the actual signal is given by its spectral
components as,
Since the phase information has to be embedded in the magnitude spectrum at the
transmitter the processed spectrum would be,
Since ‘αN’ samples of the spectrum are transmitted at the receiver the even
spectrum is formed by padding N-αN-1 zeros at the end and we have
6
Speech Compression - a novel method www.seminarson.com
[
x [n] =(1/2*N) X (0)+2* ΣX (k)cos (2*π*n*k/(2*N))] (3)
What do we require?
• We expect the value of α to be very low because to achieve maximum
reduction in the number of samples to be transmitted.
• We expect ‘N’ to be very low as it is an important factor in determining
the speed of operation of the transmitter because at the transmitter the ‘N’
samples are fed to a processor, which computes the FFT of the samples.
The time required for this operation would be O[logN].
Taking into account the above requirements and choosing a small but optimum
value of ‘α’ and ‘N’ the algorithm still gives a faithful reproduction of the signal
without any complexity both at the transmitter and at the receiver.
Hence the total number of significant frequency samples obtained with this
method is very less compared with the actual frequency spectrum samples of the
signal. This helps us to effectively reduce the number of samples to be chosen thereby
reducing the number of samples to be transmitted.
7
Speech Compression - a novel method www.seminarson.com
8
Speech Compression - a novel method www.seminarson.com
9
Speech Compression - a novel method www.seminarson.com
And, the cross correlation of the received and the actual signal is…
From the above simulations it is clear that even with lesser number of
transmitted samples the signal is reproduced faithfully.
MATLAB CODE:
10
Speech Compression - a novel method www.seminarson.com
clear all,
y=wavread('yok');%The voice signal to be transmitted
y=y';
size=100000;
packetsize=100;
y=y(1,1:size);
res=.1;
i=0;
res=.01:.01:1;
for i=1:length(res)
yhat=[];
for j=1:(size/packetsize)
[y1,f1,f2]=yokspeechprocess(y(1,round((j*(packetsize)+1):round(j*(pack
etsize))),res(i));
yhat=[yhat y1];
end
if max(yhat)~=0
yhat=yhat./max(yhat);
else
end
if max(y)~=0
y=y./max(y);
else
end
xy=mean(yhat.*y);
xx=mean(y.*y);
yy=mean(yhat.*yhat);
cof(i)=xy/sqrt(xx*yy);
end
cif=max(cof);
cof=cof/cif;
figure(1);
plot(res,cof,'r');
xlabel('TRANS.BANDWIDTH EXPRESSED AS FRACTION OF THE SAMPLING
FREQUENCY');
ylabel('CORR.COEFF. OF THE TRANS. AND RECEIVED SIGNAL--->');
title('PLOT OF TRANS.BANDWIDTH (fraction of total bandwidth) vs
CORR.COEFF.');
figure(2);
plot(xcorr(y,yhat));
title('CROSS CORRELATION OF TRANSMITTEDSIGNAL AND RECEIVED
SIGNAL');
figure(3);
plot(xcorr(yhat));
title('AUTO CORRELATION OF THE RECEIVED SIGNAL');
FUNCTION USED:
function [yhat,f1,f2]=yokspeechprocess(y,res)
s=size(y);
if s(1)>1
y=y';
else
end
y=[y fliplr(y)];
c=length(y);
y=y(1,1:c-1);
f=fft(y);
d=round(res*c/2);
11
Speech Compression - a novel method www.seminarson.com
e=(2*(.5*c-d));
f=[f(1,1:d) zeros(1,e) f(1,e+1:c-1)];
[f,err,bits]=yokquantise(f,8);
yhat=ifft(f);
f1=f(1,1:(c/2));
f2=f(1,1:((c/2)+1));
f1=real(f1);
f2=real(f2);
yhat=yhat(1,1:(c/2));
yhat=real(yhat);
ADVANTAGES:
CONCLUSION:
Given the conceptual ease of understanding and design, as well as the many
advantages listed above, we are bound to conclude that the new algorithm lends itself
to use directly. Also, the universal applicability (the same has been tried out
successfully on static images) of the same makes it furthermore appealing.
12