Int. J. Electron. Commun. (AEÜ) 62 (2008) 245 – 250
www.elsevier.de/aeue
Frequency-based window width optimization for S-transform
Igor Djurovića,∗ , Ervin Sejdićb , Jin Jiangb
a Department of Electrical Engineering, University of Montenegro, 81000 Podgorica, Montenegro
b Department of Electrical and Computer Engineering, The University of Western Ontario, London, Ont., Canada N6A 5B9
Received 12 September 2006; accepted 18 March 2007
Abstract
The S-transform combines properties of the short-time Fourier (STFT) and wavelet transforms. It preserves the phase
information of a signal as in the STFT, while providing the variable resolution as in the wavelet transform. However, the
S-transform suffers from poor energy concentration for some classes of signals. A modification to the existing S-transform
is proposed in this paper to enhance the energy concentration in the time–frequency domain. An improvement is achieved by
introducing an additional parameter which can be used to optimize the window width. The optimization is performed over
frequency and the proposed modification keeps the frequency marginal of the S-transform. The proposed scheme is tested on
a set of synthetic signals. The results show that the proposed algorithm produces enhanced energy concentration in comparison to the standard S-transform. Also, the results show that for various signal types the proposed algorithm achieves higher
signal concentration in comparison to other standard time–frequency transforms, such as, STFT and pseudo Wigner–Ville
distribution (PWVD). Furthermore, it is concluded by numerical study that the proposed algorithm provides more accurate
estimation of the instantaneous frequency than the standard S-transform.
䉷 2007 Elsevier GmbH. All rights reserved.
Keywords: Time–frequency transformations; S-transform; Concentration measure
1. Introduction
The time–frequency analysis of a signal depicts variation of the signal’s spectrum with time. In an ideal case,
the time–frequency representation provides only information about the frequency occurring at a given instant without
cross-information about adjacent instants [1,2]. Therefore,
the main objective of time–frequency distribution functions
is to be as close as possible to the ideal time–frequency representations, that is, to obtain time-varying spectral density
function with high resolution, and to overcome any interference if exist [3]. Hence, the energy concentration in the
time–frequency domain is one of its very important and intensively studied aspects in time–frequency analysis [4].
∗ Corresponding author. Tel.: +381 67 257 155; fax: +381 81 244 921.
E-mail addresses:
[email protected] (I. Djurović),
[email protected]
(E. Sejdić),
[email protected] (J. Jiang).
1434-8411/$ - see front matter 䉷 2007 Elsevier GmbH. All rights reserved.
doi:10.1016/j.aeue.2007.03.013
In recent years, many different algorithms have been
proposed to obtain the time–frequency representations of
a signal. The recently proposed S-transform is a combination of short-time Fourier (STFT) and wavelet transforms, since it employs a variable window length and the
Fourier kernel [5]. The advantage of S-transform is that
it preserves the phase information of signal, and also provides a variable resolution similar to the wavelet tilings. In
addition, the S-transform is a linear transform that can be
used as both an analysis and a synthesis tool, which is
not the case with some of the bilinear transforms such as
Wigner–Ville distribution. However, the S-transform suffers from poor energy concentration in some situations.
Especially poor energy concentration is achieved at the
higher frequencies (the terms low and high frequencies
are loosely used, and they describe whether the particular frequency is close to the highest analyzed frequency
or it is much lower). For example, let us assume that a
246
I. Djurović et al. / Int. J. Electron. Commun. (AEÜ) 62 (2008) 245 – 250
linearly increasing FM signal (i.e. a chirp) is being analyzed.
At the high frequency, the energy concentration around the
instantaneous frequency of the chirp is very poor, since the
S-transform provides excellent time localization at higher
frequencies, but poor frequency localization.
The major contribution of this paper is an optimization of
the window width used in the S-transform through an additional parameter in the transform, which will enhance the
energy concentration around the component’s instantaneous
frequency. The optimization is performed for each frequency
resulting in the modified S-transform with preserved frequency marginal property.
The proposed scheme has been tested on a set of synthetic
signals. The time–frequency representations obtained by the
proposed algorithm are compared with those obtained from
the standard S-transform. The results have shown that the
proposed algorithm enhances the energy localization of the
signals. Also, the results show that for some signal classes,
the proposed algorithm is capable of achieving higher concentration than other standard techniques, such as, the shorttime Fourier transform and pseudo Wigner–Ville distribution
(PWVD). Furthermore, the results show that the proposed
algorithm is more accurate than the standard S-transform
for the instantaneous frequency estimation. This can be of
significant importance in many applications where precise
energy concentration is desirable, such as communications
and time series analysis.
This paper is organized as follows. In Section 2, a review
of the standard S-transform is given along with the modified
form of the S-transform. Details of the optimization process
for the proposed algorithm are covered in Section 3. Section
4 evaluates the performance of the proposed scheme using
synthetic signals. Numerical study of the proposed transform
as the instantaneous frequency estimator is also given in this
section. Conclusions are drawn in Section 5.
2. Background theory
The standard S-transform of a signal x(t) is given by a
convolution integral as [5]
+∞
Sx (t, f ) =
x()w(t − , f )e−j2f d
=
−∞
+∞
−∞
x()
(1)
The advantage of the S-transform over the STFT is that the
standard deviation is a function of frequency, f, defined as
(f ) =
1
.
|f |
−∞
where X(f ) is the Fourier transform of the signal, x(t).
This is an important property, since it distinguishes the
S-transform from the wavelet transform [5], and also it represents a form of the frequency marginal property [6].
The disadvantage of the S-transform is the same assignment of the standard deviation for all signal components at
all frequencies. That is, is always defined as a reciprocal
of the frequency. Some signals would benefit from different value of the standard deviation for the window function.
A simple, but powerful improvement to the S-transform can
be made by defining the standard deviation of the window as
(f ) =
1
,
|f |p
resulting in the modified S-transform given as
2 2p
|f |p +∞
p
Sx (t, f ) = √
x()e−(t−) f /2 e−j2f d,
2 −∞
(4)
(5)
where the new parameter p controls the width of the window.
If an appropriate value of p can be determined for a given
signal, an improved time–frequency localization becomes
possible. It can be shown that the proposed modification of
the S-transform keeps the frequency marginal property:
+∞
1
p
Sx (t, f ) dt,
(6)
X(f ) =
W (0, f ) −∞
where W (, f ) is a Fourier transform (from t to ) of the
window function, w(t, f ).
3. Proposed optimization algorithm
2
2
1
√ e−(t−) /2(f )
(f ) 2
× e−j2f d.
frequency. It can be seen that the window is wider in the
time domain for lower, and narrower for higher frequencies.
In other words, the window provides good localization in
the frequency domain for low frequencies, while it provides
good localization in the time domain for higher frequencies.
It is clear that the time–frequency atoms for the S-transform
are arranged in the same way as for the wavelet transform.
An important property of the S-transform is that its integral over time is equal to the Fourier transform:
+∞
Sx (t, f ) dt,
(3)
X(f ) =
(2)
In other words, the window function is a function of time
and frequency. Width of the window is determined by
The proposed implementation of the S-transform provides
us with the additional parameter, p, which can be adjusted
to achieve the time–frequency representation of the signal
with improved energy concentration. It is desired to develop
a scheme that can be used to obtain the optimal value of the
parameter p for a given signal. In the time–frequency analysis, the aim of the optimization is to obtain a time–frequency
representation as close as possible to the ideal one, i.e.,
signal components most closely resemble the instantaneous
frequency.
I. Djurović et al. / Int. J. Electron. Commun. (AEÜ) 62 (2008) 245 – 250
Therefore, the optimal value of p will be found based on
the variation of the concentration measure proposed in [7].
The measure is designed to minimize the energy concentration for any time–frequency representation based on the
automatic determination of some time–frequency distribution parameter. In this case, the objective is to minimize the
energy concentration through the parameter p which controls the window width.
The optimization can be performed either with respect
to time or to frequency. To understand the outcome of
either approach, it is critical to understand the marginals
of the S-transform. The time and frequency marginals are
important quantities in time–frequency analysis since they
ensure that the spectral, temporal, and total signal energies
are accurately reflected in the time–frequency domain. They
also assure that global quantities such as mean time, mean
frequency, duration, and bandwidth are correctly given [8].
As shown by (3), the S-transform has well-defined frequency marginal. However, the S-transform does not satisfy
the time-marginal property. This implies that in a case of
a signal with a time-varying amplitude, integrating over
the frequency in the time–frequency domain would destroy
the information about the signal’s amplitude, while integration over the time preserves information on the spectral
content.1 Therefore, it is a natural choice to perform the
optimization in the frequency domain. In this way, the frequency marginal of the S-transform can be preserved, and
information about the amplitude of spectral components is
not destroyed.
For the above reasons, a scheme for finding the optimal
variation of the parameter based on frequency is given
below. The scheme provides us with the frequency-varying
parameter, p(f ), which is used in the proposed S-transform
modification. Therefore, the algorithm for p(f ) can be
summarized through the following steps:
1. For p selected from the set p ∈ (0, 1], compute
p
S-transform of the signal, Sx (t, f ) according to (5).
2. For each p and a frequency f compute
CM(f, p) = +∞
1
p
q
−∞ |Sx (t, f )|
dt
,
(7)
where q ∈ (0, 0.25].
3. Optimal value of p for the considered frequency f maximizes concentration measure CM(f, p)
popt (f ) = arg max[CM(f, p)].
p
(8)
4. Set the modified S-transform as
p
Sxa (t, f ) = Sx opt
(f )
(t, f ).
(9)
1 We developed a technique for optimization of p for each instant that
requires elaborate normalization strategy. This technique will be reported
elsewhere.
247
The algorithm specifies that the parameter p should not
exceed 1. The reason is that for p?1 the window function
becomes very narrow in the time domain, and for very high
value of p it approaches the Dirac function. Very narrow
window in the time domain is suitable only for analysis of
the Dirac function itself, or a sum of the Dirac functions.
All time–frequency representations obtained in step 1 satisfy the frequency marginal, and for a given frequency, the
representation, which provides the best localization in the
time–frequency domain, is chosen in the step 2. Parameter
q in step 2 is selected to be within
∞ q ∈p (0, 0.25]. Namely,
parameter q = 0 would produce −∞ |Sx (t, f )|q dt equal to
time interval in which component is detected for given frequency. This time-interval could be caused by actual component on this frequency but also by spread of components
from adjacent frequency bins. Then the measure CM(f, p)
would be maximal if influence of adjacent frequency bins is
p
∞
minimized (value −∞ |Sx (t, f )|q df minimal). However,
due to the potential presence of noise, sidelobes of the signal and other effects, it is not a good idea to use q = 0.
Instead, we propose to use a q in the range q ∈ (0, 0.25].
Note that using larger values of q, namely
q ≈ 1, could not
∞
p
produce appropriate results since value −∞ |Sx (t, f )|q dt
approaches toward constant (toward marginal property) for
each p. Then it would be difficult to highlight any difference
among the time–frequency representations for different p.
In an actual implementation of the algorithm, it might
be beneficial to use some of the available signal processing
tools to reduce the effects of the discrete implementation.
Such tools include zero-padding the signal or windowing
the signal before proceeding to the step 2. This is especially
critical for wideband signals, which might show some
irregular properties unless they are properly windowed [9].
A wide selection of the windows is available in the literature
and usually Gaussian or Kaiser windows are used to smooth
the edges of the finite duration signals, due to the fact that
they have a parameter to control the width of the window.
While discussing the implementation of the proposed
scheme it is important to mention that in our simulations
only one global optimum point for p or q exists and the local minimums do not exist. Therefore, a least mean square
based technique could be developed in order to diminish
computation complexity [7].
As a last remark, it should be mentioned that further
improvements to the concentration of the S-transform could
be achieved by introducing a method for the time–frequency
analysis similar to an S-method or its signal adaptive variation [10], which is developed based on the relation between
the Wigner–Ville distribution and a spectrogram.
4. Simulation results and discussion
In this section, the performance of the proposed algorithm
for the S-transform is examined using a set of synthetic test
signals. The goal is to examine how the proposed algorithm
0
-5
0.4
0.6
0.8
-2
0
1
0
0.2
Time (s)
0.4
0
0.8
50
0
0
0.2
0.4
0.6
Time (s)
(c)
(d)
50
0
0.8
0.4
0.6
0.8
1
1
0.5
50
0
1
0
0.2
0.4
0.2
0.4
0.6
100
Time (s)
Frequency (Hz)
(e)
(f)
Fig. 1. Test signal x1 (t): (a) time domain representation; (b) STFT;
p(f )
(c) PWVD; (d) standard S-transform; (e) Sx (t, f ); and (f)
optimal variation of parameter p across the frequencies, p(f ).
performs in comparison to the standard S-transform. Furthermore, a comparison with other standard time–frequency
techniques, such as the STFT and the PWVD is given. In
the actual implementation, the sampling period used is
T = 1/256 s, and the set of p values is selected by
p = {0.01n : n ∈ N and 1n100}. For the STFT and the
PWVD, a Gaussian window is used with standard deviations
equal to 0.05 and 0.1, respectively.
Let us begin the performance analysis of the algorithm
with the following signal:
x1 (t) = cos(68t − 20t 2 ) + cos(2 sin(5t) + 120t)
+ cos(168t + 28t 2 ),
(10)
where 0 t < 1, and the signal does not exist outside the
interval. By examining the time–frequency representations
of the signal, it can be noticed that the STFT is capable of
concentrating the two chirps well, but the concentration of
the sinusoidally modulated component is poor.
The representation obtained by the PWVD contains significant cross-terms. The standard S-transform suffers from
the same problem as the wavelet transform, that is, at the
low frequencies it has good frequency resolution, but as the
frequency increases, the frequency resolution deteriorates.
This is evident from Fig. 1(d). With the proposed algorithm
more uniform resolution is obtained at both low and high
frequencies as shown in Fig. 1(e). It is worthwhile to consider the behavior of p(f ), which is shown in Fig. 1(f).
It is apparent that as the frequency increases, the value of p
decreases, which means that the window is becoming wider
in the time domain. A wider window in the time domain
0.8
1
0.8
1
(b)
0
0
0.6
Time (s)
50
0.8
100
50
0
1
0
Time (s)
0.2
0.4
0.6
Time (s)
(d)
1
100
50
0
0.5
0
0
0
0.8
(c)
0
0.2
0.6
100
1
Frequency (Hz)
Time (s)
100
0
0.4
50
(a)
100
1
Amplitude
Frequency (Hz)
0.6
0.2
100
Time (s)
Frequency (Hz)
Frequency (Hz)
Frequency (Hz)
50
0.4
0
1
(b)
100
0.2
0.8
Time (s)
(a)
0
0.6
Frequency (Hz)
0.2
50
0
Amplitude
0
2
100
Frequency (Hz)
5
Amplitude
Frequency (Hz)
I. Djurović et al. / Int. J. Electron. Commun. (AEÜ) 62 (2008) 245 – 250
Amplitude
248
0.2
0.4
0.6
0.8
1
0
Time (s)
(e)
50
100
Frequency (Hz)
(f)
Fig. 2. Test signal x2 (t): (a) time domain representation; (b) STFT;
p(f )
(c) PWVD; (d) standard S-transform; (e) Sx (t, f ); and (f)
optimal variation of parameter p across the frequencies, p(f ).
provides better localization in the frequency domain. Therefore, this indeed enhances the time–frequency representation
of the signal in comparison to the standard S-transform, and
resulting in better concentrated time–frequency representation.
In the first example, the signal with linearly increasing/decreasing frequencies along with a sinusoidally modulated frequency component is examined. It is worthwhile
to examine how the algorithm behaves for a multicomponent signal with the crossing components and parabolic
instantaneous frequency:
x2 (t) = cos(144(t − 0.3)3 + 30t)
+ cos(128t − 50t 2 ),
(11)
where 0 t < 1, and the signal does not exist outside the
given interval. The signal is examined again by the STFT, the
PWVD, the standard S-transform and the proposed scheme.
Even though, the representation obtained by the PWVD,
shows that the components are well concentrated in the
time–frequency domain, the presence of the cross-terms
diminishes readability of the representation. The STFT is
capable of concentrating the chirp; however, the concentration of the parabolic component is very poor, especially, at
the higher frequencies. The standard S-transform provides
higher concentration of the parabolic component than the
STFT, but the concentration of the linear chirp is poor
at the higher frequencies. From Fig. 2(e), it is obvious
that the proposed algorithm for the S-transform is able to
diminish the spectral leakage which is occurring with the
standard S-transform. It also provides good concentration
I. Djurović et al. / Int. J. Electron. Commun. (AEÜ) 62 (2008) 245 – 250
249
algorithm yields a smaller MSE than the standard
S-transform and, therefore, is more accurate in estimating
the instantaneous frequency of the noisy signal. Similar
situation has been observed for other test signals as well.
104
Mean Square Error
103
5. Conclusions
102
101
p = 0.1
p = 0.3
p = 0.5
p = 0.7
p = 1 (standard S-transform)
Proposed Algorithm
100
0
5
10
15
SNR (dB)
Fig. 3. MSE for the instantaneous frequency estimation for
p
Sx (t, f ) based on various values of p.
of the parabolic component. By examining the behavior of
p(f ) shown in Fig. 2(f), it can be seen that the window
for the proposed algorithm becomes wider in comparison
to the window in standard S-transform, that is significant to
form more concentrated time–frequency representation of
the signal.
So far, it has been shown that the proposed algorithm for
the S-transform is capable of enhancing the energy concentration. Now, it is interesting to examine the behavior of the
proposed algorithm in the presence of noise, and how noise
affects the accuracy of the instantaneous frequency estimap
tion based on the peak values [11] of Sx (t, f ) for various
values of p. To examine such behavior the following signal
is used:
x3 (t) = A sin(100t + 4 cos(4t)),
In this paper, a window width optimized algorithm for the
S-transform is developed, that provides enhanced energy
concentration of the signal in the time–frequency domain.
The improvement is performed through an additional
parameter p, which controls the width of window used in
the S-transform. The parameter is determined through the
optimization procedure that maximizes the energy concentration of the time–frequency representation for each frequency bin. The proposed scheme is evaluated numerically
and compared with the standard S-transform by using a set
of synthetic test signals. The results have shown that the
proposed algorithm can significantly enhance the energy
localization of the signals in comparison to the standard
S-transform. Also, it is showed that the proposed algorithm is capable of achieving higher energy concentration
than other standard algorithms, such as, the STFT and the
PWVD. Furthermore, it has been numerically shown that
the proposed algorithm provides more accurate estimation of the instantaneous frequency in comparison to the
standard S-transform.
Acknowledgements
Ervin Sejdić and Jin Jiang would like to thank the Natural Sciences and Engineering Research Council of Canada
(NSERC) for financially supporting this work.
(12)
where 0 t < 1 and A=1. The signal is contaminated with an
additive white Gaussian noise, and the signal-to-noise ratio
(SNR) is given by SNR = 10 log10 (A2 /2n ) with A being a
signal’s amplitude and 2n being the variance of the noise. To
investigate the behaviour of the proposed algorithm in the
presence of noise, a range of SNR values are used, and the
mean square error (MSE) is evaluated for the instantaneous
frequency estimation of the given sinusoidally modulated
signal. The estimation is performed based on the peaks of
p
the Sx (t, f ) for various values of p, namely, p = 0.1, 0.3,
p
0.5, 0.7, 1 and adaptive value, p(f ). It is noted that Sx (t, f )
for p = 1 corresponds to the standard S-transform. The SNR
varies from 0 to 15 dB in a 1 dB step. For each SNR value,
100 realizations are used.
Fig. 3 represents the results of such analysis. The horizontal axis represents the SNR (in decibels), and the vertical
axis represents the MSE for the instantaneous frequency
estimation. The simulation results show that the proposed
References
[1] Stanković LJ. An analysis of some time–frequency and timescale distributions. Ann Telecommun 1994;49(9/10):505–17.
[2] Gröchenig K. Foundations of time–frequency analysis.
Boston: Birkhäuser; 2001.
[3] Djurović I, Stanković LJ. A virtual instrument for
time–frequency analysis. IEEE Trans Instrum Meas
1999;48(6):1086–92.
[4] Stanković LJ. Highly concentrated time–frequency
distributions: pseudo quantum signal representation. IEEE
Trans Signal Process 1997;45(3):543–51.
[5] Stockwell R, Mansinha L, Lowe R. Localization of the
complex spectrum: the S-transform. IEEE Trans Signal
Process 1996;44(4):998–1001.
[6] Cohen L. Time–frequency analysis. Englewood Cliffs, NJ:
Prentice-Hall, PTR; 1995.
[7] Stanković LJ. A measure of some time–frequency
distributions concentration. Signal Process 2001;81(3):
621–31.
250
I. Djurović et al. / Int. J. Electron. Commun. (AEÜ) 62 (2008) 245 – 250
[8] Loughlin PJ, Pitton JW, Atlas LE. Construction of positive
time–frequency distributions. IEEE Trans Signal Process
1994;42(10):2697–705.
[9] Gini F, Giannakis GB. Hybrid FM-polynomial phase signal
modeling: parameter estimation and Cramér-Rao bounds.
IEEE Trans Signal Process 1999;47(2):363–77.
[10] Stanković LJ. A method for time–frequency analysis. IEEE
Trans Signal Process 1994;42(1):225–9.
[11] Boashash B. Estimating and interpreting the instantaneous
frequency of a signal—part 1: fundamentals. Proc IEEE
1992;80(4):520–38.
Igor Djurović was born in Montenegro in 1971. He received B.S., M.S.
and Ph.D. degrees, all in electrical
engineering from the University of
Montenegro, in 1994, 1996 and 2000,
respectively. He is currently an associate
professor at the University of Montenegro. During 2002, he was on leave at the
Department of Mechanical and System
Engineering, Kyoto Institute of Technology, Japan, as a fellow of the Japan
Society for the Promotion of Science. He was on short stays and
was involved in joint research at the AIIA Laboratory, Aristotle
University, Thessaloniki, Greece; Signal Theory Group, Ruhr University, Bochum Germany; ENSIETA, Brest, France; International
Center for Signal Processing, Tampere University of Technology,
Tampere, Finland; and National Aerospace University, Kharkov,
Ukraine. He received the biannual award of the National Academy
of Science and Art of Montenegro as the young scientist with
the best achievements in the technical sciences in Montenegro
from 2000 to 2002. He published over 90 papers in the international scientific journals and conferences. He contributed to
“Time–frequency signal analysis and processing”, ed. B. Boashash,
Prentice-Hall. His current research interests include application
of virtual instruments, time–frequency analysis based methods for
signal estimation and filtering, fractional Fourier transform applications, image processing, robust estimation, motion estimation,
and digital watermarking. Dr. Djurović is author of two books and
co-author of three books.
Ervin Sejdić received the B.E.Sc. degree in electrical engineering with a
specialization in wireless communications from the University of Western
Ontario, London, Ont., Canada, in
2002, where he is currently working
toward the Ph.D. degree in electrical
engineering. His research interests include digital signal processing, general
time–frequency analysis, and biomedical signal processing. Mr.
Sejdić won prestigious research scholarships from the Natural
Sciences and Engineering Research Council of Canada in 2003
and 2005.
Jin Jiang received his B.E.Sc. from
Xi’an Jiaotong University, in Xi’an,
China in 1982, and M.E.Sc. and
Ph.D. degrees from the University of
New Brunswick, in Fredericton, New
Brunswick, Canada in 1984 and 1989,
respectively. He started his academic
career as a lecturer in Marine Institute
in St. John’s, Newfoundland in 1988,
and became as an Assistant Professor in
Lakehead University in Thunder Bay,
Ontario in July 1990. From 1991, Dr. Jiang has been with the
University of Western Ontario, London, Ontario, Canada, where
he currently is a Professor in the Department of Electrical and
Computer Engineering. From 1996 to 1997, he spent 1 year
in Yamaguchi University, in Ube, Japan as a visiting Associate
Professor on his sabbatical leave. From 2003, he has been an
NSERC/UNENE Senior Industrial Research Chair in the area of
control, instrumentation, and electrical systems in nuclear power
plants. Dr. Jiang has wide range of research interests, including
fault-tolerant control of safety-critical systems (aircraft, nuclear
power plants), advanced control of electrical power plants and
power systems (combustion, generation, transmissions, and fuel
cells), and advanced signal processing for diagnosis applications
(manufacturing processes and human hearts). He has published
extensively in the above areas including a recent book on active fault-tolerant control systems. Dr. Jiang is also a passionate
teacher, and has won several teaching awards in the last few years.
He is currently supervising a large number of graduate students.