Ece 306L - Experiment 4: Signal Quantization
Ece 306L - Experiment 4: Signal Quantization
Ece 306L - Experiment 4: Signal Quantization
Signal Quantization
Z. Aliyazicioglu. OBJECTIVES 1. The student will be able to record and analyze an audio signal in MATLAB 2. The student will be able to change the quantization of an audio signal in MATLAB 3. The student will be able to analyze signal to noise ratio LIST OF EQUIPMENTS AND PARTS 1 2 PC with MATLAB Microphone
1 Introduction This lab presents two important concepts for working with digital signals. The first section discusses how numbers are stored in memory. Numbers may be either in fixed point or floating point notation. Integers are often represented with fixed point notation. Decimals and numbers that may take on a very large range of values would use floating point. The second issue of numeric storage is quantization. All analog signals that are processed on the computer must first be quantized. We will examine the errors that arise from this operation, and determine how different levels of quantization affect a signals quality. We will also look at two types of quantizers. The uniform quantizer is the simpler of the two. The Max quantizer, however, is optimal, as it minimizes the mean square error between the original and quantized signals. 2 Review of number representations There are two types of numbers that a computer can represent: integers and decimals. These two numbers are stored quite differently in memory. Integers (e.g. 27, 0, - 986 ) are stored in fixed point format, while decimals ( 12.34, -0.98 ) most often use floating point format [Ref: Section 7.5, Proakis and Manolakis]. Integers usually require four bytes of memory; floating point values usually require eight. The three types of fixed point formats are: Sign-magnitude Ones-complement Twos-complement 2.1 Sign-magnitude representation In all three formats, the first bit denotes the sign of the number: 0 for positive, and 1 for negative. For positive numbers, the magnitude simply follows the first bit. All three notations represent negative numbers differently.
ECE306L-4 1
Sign-magnitude notation is the simplest way to represent negative numbers. The magnitude of the negative number follows the first bit. If an integer was stored as one byte, the range of possible numbers would be -127 to 127. The value +27 would be represented as 0 0 0 1 1 0 1 1. The number -27 would represented as 1 0 0 1 1 0 1 1. 2.2 Ones-complement To represent a negative number, the complement of the bits for the positive number with the same magnitude are computed. The positive number 27 in onescomplement form would be written as 0 0 0 1 1 0 1 1, but the value -27 would be represented as 1 1 1 0 0 1 0 0. 2.3 Twos-complement The problem with both these notations is that two different values represent zero. Twoscomplement notation is a revision to ones-complement that solves this problem. To form negative numbers, the positive number is subtracted from a certain binary number. This number has a one in the most significant bit (MSB), followed by as many zeros as there are bits in an integer. If 27 was represented by an eight-bit integer, -27 would be represented as: 100000000 -00011011 =11100101 Notice that this result is one plus the ones-complement representation for -27 (modulo-2 addition). What about the second value of 0? That representation is 1 0 0 0 0 0 0 0. This value equals -128 in twos-complement notation! 100000000 -10000000 =10000000 The value represented here is -128; we know it is negative, because the result has a 1 in the MSB. Twos-complement is used because it can represent one extra negative value. More importantly, if the sum of a series of twos-complement numbers is within the range, overflows that occur during the summation will not affect the final answer! The range of an 8-bit twos complement integer is [-128,127].
ECE306L-4 2
2.4 Floating Point Floating point notation is used to represent a much wider range of numbers. The tradeoff is that the resolution is variable: it decreases as the magnitude of the number increases. In the fixed point examples above, the resolution was fixed at 1. It is possible to represent decimals with fixed point notation, but for a fixed word length any increase in resolution is matched by a decrease in the range of possible values. A floating point number, F, has two parts: a mantissa, M, and an exponent, E.
F = M * 2E
The mantissa is a signed fraction; this fraction has a power of two in the denominator. [See section 7.5.2 in Proakis and Manolakis for more details.] The exponent is a signed integer, which represents the power of two that the mantissa must be multiplied by. These signed numbers may be represented with any of the three fixed-point number formats. The IEEE has a standard for floating point numbers (IEEE 754). For a 32-bit number, the first bit is the mantissas sign. The exponent takes up the next 8 bits (1 for the sign, 7 for the quantity), and the mantissa is contained in the remaining 23 bits. The range of values for this number is (1.18 1038, 3.40 1038). To add two floating point numbers, the exponents must be the same. If the exponents are different, the mantissa is adjusted until the exponents match. If a very small number is added to a large one, the result may be the same as the large number! For instance, if 0.15600 0 230 is added to 0.62500 0 23, the second number would be converted to 0.0000 0 230 before addition. Since the mantissa only holds 23 binary digits, the decimal digits 625 would be lost in the conversion. In short, the second number is rounded down to zero. For multiplication, the two exponents are added and the mantissas multiplied. 3 Quantization 3.1 Introduction Quantization is the act of rounding off the value of a signal or quantity to certain discrete levels. For example, digital scales may round off weight to the nearest gram. Numbers need to be rounded off in order to be represented in a computer. Analog voltage signals in a control system may be rounded off to the nearest volt before they enter a digital controller. Digital images are also quantized. The gray levels in a black and white photograph must be quantized in order to store an image in a computer. The brightness of the photo at each pixel is assigned an integer value between 0 and 255 (typically), where 0 corresponds to black and 255 is white. Since an 8-bit number may represent 256 different values, such an image is called an 8-bit grayscale image. An image which is quantized to just 1 b/pel (only black and white pixels) is called a halftone image. Many printers work by placing, or not placing, a spot of colorant on the paper at each point on a raster of addressable points. To accommodate this, an image must be halftoned before it is printed. Quantization can be thought of as a function y = f(x). An example of a quantization function is plotted in Figure 1, where the x-axis is the input value, and the y-axis is the quantized output value.
ECE306L-4 3
3.2 Quantization and Compression Quantization is sometimes used for compression. As an example, suppose we have a digital image which is represented by 8 different gray levels: [0 31 63 95 159 191 223 255]. To directly store each of the image values, we need at least 8-bits for each pixel since the values range from 0 to 255. However, since the image only takes on 8 different values, we can assign a different 3-bit integer (a code) to represent each pixel: [000 001 ... 111]. Then, instead of storing the actual gray levels, we can store the 3-bit code for each pixel. A look-up table, possibly stored at the beginning of the file, would be used to decode the image. This lowers the cost of an image onsiderably: less hard drive space is needed, and less bandwidth is required to transmit the image (i.e. it downloads quicker). In practice, there are much more sophisticated methods of quantizing images which rely on quantization.
Prelab
Quantization level (resolution) is given
where N is number of level and N=2b. The ratio of the signal average power to the noise power is the signal-quantization noise ratio (SQNR) gives
Max( X ) Min( X ) N 1
and = 0.01 . a. How many bits are required in the A/D converted in each case? b. If you use 10 bits for quantization, what is the resolution? 2. Consider an analog signal x (t ) = 2 cos 2 500t . a. Determine x(n) for
Fs = 3 KHz and sketch it over x(t) for 0 t 4ms b. The discrete signal x ( n ) is quantized with a resolution =0.001 . How many bits are
required in the A/D converted? c. If the minimum required SQNR is 65 dB, what is the minimum number of bits and resolution for the signal? d. What is the quantized signal frequency for 16 bits of quantization?
ECE306L-4 4
LAB 1. Image Quantization Download the file Fig-3-1.tif. The image Fig-3-1.tif is an 8-bit grayscale image. We will investigate what happens when we quantize it to smaller numbers of bits/pixel. Load it into MATLAB and display it using the following sequence of commands. y = imread(Fig-3-1.tif); image(y); colormap(gray(256)); axis(image); The image array will initially be of type uint8, so you will need to convert the image matrix to type double before performing any computation. Use the command z=double(y) for this. There is an easy way to uniformly quantize a signal. Let
Max( X ) Min( X ) N 1
where X is the signal to be quantized, and N is the number of quantization levels. To force the data to have a uniform quantization step of , Subtract Min(X) from the data and divide the result by . Round the data to the nearest integer. Multiply the rounded data by and add Min(X) to convert the data back to its original scale. Write a MATLAB code to uniformly quantize the image to N discrete levels. Use this code to quantize the original image to 7 b/pel, 6, 5, 4, 3, 2, 1 b/pel, and observe the output images 1. Print hard copies of only the 7, 4, 2, and 1 b/pel images, as well as the original. INLAB REPORT: 1. Describe the errors that appear in the image as the number of bits is lowered? 2. Note the number of b/pel at which the image quality noticeably deteriorates. 3. Compare each of these four quantized images to the original.
ECE306L-4 5
y = audiorecorder returns a handle to an 8-kHz, 8-bit, mono audio recorder object. The audio recorder object supports methods and properties that you can use to record audio data. y = audiorecorder(Fs,nbits,channels) returns a handle to an audio recorder object using the sampling rate Fs (in Hz), the sample size of nbits, and the number of channels. Fs can be any sampling rate supported by the audio hardware. Common sampling rates are 8000, 11025, 22050, and 44000. The value of nbits must be 8 or 16 (or 24, if a 24-bit device is installed). For mono or stereo, channels must be 1 or 2, respectively. Example 1 Using a microphone, record 3.5 seconds of 44.1-kHz, 16-bit, stereo data, and then return the data to the MATLAB workspace as a double array.
ECE306L-4 6
You may need to give vector x of samples at sampling rate fs, with sample size nbits, type: >> soundsc (x,fs,nbits) soundsc() This command rescales the audio signal before playing it in order to place it within the dynamic range of the hardware. The syntax is the same as for the sound () command. Use sound() if you do not want to have it autoscaled. Writing audio Files You can write the audio vector y to a MuLaw encoded File by typing the following MATLAB function >>auwrite(y,'filename.au'); or >> auwrite(y,fs,nbits, 'filename.au') Note that you have to use the File-extension .wav or .au in order to be able to load the signal back using wav read , auread or listen it with other sound player such as Windows Media Player. (wavwrite and auwrite is a built-in MATLAB function.) Example Lets have an audio File speech.au which we want to load into Matlab. We use >> x=auread ('speech.au'); To play the audio file within Matlab, you write the following command >> sound(x); To plot the audio file, we can use the following command >> plot(x);
Quantization in MATLAB Produce a quantization index and a quantized output value [index,quants] = quantiz(sig,partition,codebook) [index,quants,distor] = quantiz(sig,partition,codebook) Description index = quantiz(sig,partition) returns the quantization levels in the real vector signal signal using the parameter partition. partition is a real vector whose entries are in strictly ascending order. If partition has length n, then index is a column vector
ECE306L-4 7
whose kth entry is 0 if sig(k) partition(1)m if partition(m) < sig(k) partition(m+1)n if partition(n) < sig(k) [index,quants] = quantiz(sig,partition,codebook) is the same as the syntax above, except that codebook prescribes a value for each partition in the quantization and quants contains the quantization of sig based on the quantization levels and prescribed values. codebook is a vector whose length exceeds the length of partition by one. quants is a row vector whose length is the same as the length of sig. quants is related to codebook and index byquants(ii) = codebook(index(ii)+1); where ii is an integer between 1 and length(sig) Example: >> partition = [-1:.2:1]; % Length 11, to represent 12 intervals >> codebook = [-1.2:.2:1]; % Length 12, one entry for each interval .. [index,quants] = quantiz(x,partition,codebook); % Quantize signal y LAB: 2. Audio Quantization Down load music.au If an audio signal is to be coded, either for compression or for digital transmission, it must undergo some form of quantization. Most often, a general technique known as vector quantization is employed for this task, but this technique must be tailored to the specific application so it will not be addressed here. In this exercise, we will observe the effect of uniformly quantizing the samples of two audio signals. Down load the audio files speech.au and music.au . Write MATLAB code to quantize each of these signals to 7, 4, 2 and 1 bits/sample. Listen to the original and quantized signals and answer the following questions: Check the sampling frequency Check the number of bits For each signal, describe the change in quality as the number of b/sample is reduced? For each signal, is there a point at which the signal quality deteriorates drastically? At what point (if any) does it become incomprehensible? Which signals quality deteriorates faster as the number of levels decreases? Do you think 4 b/sample is acceptable for telephone systems? ... 2 b/sample?
Plot the four quantized speech signals separate. Compare the plots and make your command
3. Error Analysis As we have clearly observed, quantization produces errors in a signal. The most effective methods of the analysis of the error signal turn out to be probabilistic. In order to apply these methods, however, one needs to have a clear understanding of the error signals statistical properties. For example, can we assume that the error signal is white noise? Can we assume that it is uncorrelated with the quantized signal? As you will see in this exercise, both of these are good assumptions if the
ECE306L-4 8
quantization intervals are small compared with sample-to-sample variations in the signal. If the original signal is X, and the quantized signal is Y, the error signal is defined by the following: E=YX Compute the error signal for the quantized speech for 7, 4, 2 and 1 b/sample. When the spacing, , between quantization levels is sufficiently small, a common statistical model for the error is a uniform distribution from
to 2 2
Use the command hist(E,20) to generate a 20-bin histogram for each of the four error signals. Use subplot to place the four histograms in the same figure. LAB: 1. Hand in the histogram figure. 2. How does the number of quantization levels seem to affect the shape of the distribution? 3. Explain why the error histograms you obtain might not be uniform? Next we will examine correlation properties of the error signal. First compute and plot an estimate of the autocorrelation function for each of the four error signals using the following commands: [r,lags] = xcorr(E,200,unbiased); plot(lags,r) Now compute and plot an estimate of the cross-correlation function between the quantized speech Y and each error signal E using [c,lags] = xcorr(E,Y,200,unbiased); plot(lags,c) LAB : 1. Hand in the autocorrelation and cross-correlation estimates. 2. Is the autocorrelation influenced by the number of quantization levels? Do samples in the error signal appear to be correlated with each other? 3. Does the number of levels influence the cross-correlation? 4. Signal to Noise Ratio One way to measure the quality of a quantized signal is by the Power Signal-to-Noise Ratio (PSNR). This is defined by the ratio of the power in the quantized speech to power in the noise.
PSNR =
PY PE
In this expression, the noise is the error signal E. Generally, this means that a higher PSNR implies a less noisy signal. From previous labs we know the power of a sampled signal, x(n), is defined by
ECE306L-4 9
1 L 2 Px = x ( n ) L n =1
where L is the length of x(n). Compute the PSNR for the four quantized speech signals from the previous section. In evaluating quantization (or compression) algorithms, a graph called a ratedistortion curve is often used. This curve plots signal distortion vs. bit rate. Here, we can measure the distortion by
number of quantization levels and sampling rate. For example, if the sampling rate is 8000 samples/sec, and we are using 7 bits/sample, the bit rate is 56 kilobits/sec (kbps). LAB: Assuming that the speech is sampled at 8kHz, plot the rate distortion curve using 1 PSNR as the measure of distortion. Generate this curve by computing the PSNR for 7, 6, 5,..., 1 bits/sample. Make sure the axes of the graph are in terms of distortion and bit rate. INLAB REPORT: Hand in a list of the 4 PSNR values, and the rate-distortion curve.
ECE306L-4 10