TuC03.3
43rd IEEE Conference on Decision and Control
December 14-17, 2004
Atlantis, Paradise Island, Bahamas
On the performance limitation of Active Queue
Management (AQM)
Khushboo Shah, Stephan Bohacek and Edmond Jonckheere
Abstract— Traffic properties relevant to Active Queue Management (AQM) and the resulting pole/zero and Bode limitations are examined. ARX models of TCP are derived from data
collected in a fixed "dumbbell" simulation environment. The
pole dynamics at short time scales can be traced to the RTT,
which serves as a confirmation of the relevance of the models.
It is shown that the weighted sensitivity function is a reasonable metric for AQM performance. However, K " design
shows that the sensitivity can only be slightly reduced. The
conclusion of this work is that the goal of smoothing TCP’s
oscillations (a principle design goal of AQM) will likely not be
met in general.
I. I NTRODUCTION
One of the chief services provided by TCP is congestion
control. The goal of TCP congestion control is to share the
available bandwidth among the many flows that utilize a
common link. In order to achieve this goal, TCP dictates a
sending rate that is dependent on the past packet losses [1].
Since 1993 [2], there has been extensive research focused
on the prospect of deliberately dropping packets according
to a time-varying loss probability in order to control TCP
connections’ data rate and meet various performance objectives. Such proactive dropping of packets is known as
Active Queue Management (AQM).
Early attempts of AQM focused strictly on the development of a mapping between the filtered version of the
queue occupancy and the loss probability, without explicitly
considering the dynamics of TCP. The most popular of
these approaches is known as Random Early Drop (RED)
[2]. The problem with this approach is that the design
parameters are difficult to adjust and chaotic behavior can
result [3]. More recently, there has been a number of AQM
designs that take a more control theoretic approach and
consider dynamic models of the TCP [4], [5], [6], [7].
The major difference between this recent work and the
present paper is that we do not use a “first principles”
model of TCP. Instead, a very large number of observations
are made and a local linear model is identified from I/O
data. By local, we mean that the loss probability varies
by only a small amount. This allows the linear model
to be very accurate. (See [8] for a comparison between
linear and nonlinear models.) A second difference between
Khushboo Shah and Edmond Jonckheere are with the Department of
Electrical Engineering, University of Southern California, Los Angeles,
CA, USA. khushboo,
[email protected]
Stephan Bohacek is with the Department of Electrical and Computer Engineering, University of Delaware, Newark, DE, USA.
[email protected]
0-7803-8682-5/04/$20.00 ©2004 IEEE
much of the work on AQM and this paper is that this
paper focuses on the performance of the closed-loop from a
control theoretic perspective. Specifically, we examine the
sensitivity function as a measurement of performance.
The conclusions of this investigation bode poorly for
the prospect of AQM. In short, the non-minimum phase
zeros and the Bode limitations lead to design difficulties.
Specifically, the incorporation of a dynamic controller to
smooth the variations in packet arrivals has only a minor
impact on the variation of the packet arrivals.
The paper proceeds as follows. In Section II, the approach
used to model TCP is discussed. Section III discusses the
accuracy of the models. This model is further analyzed in
Section IV where the pole zero configuration is investigated.
In Section V, the motivation and rationale for the controller design is discussed. Section VI discusses the Bode
limitation and how it affects the controller. Section VII
investigates several controllers and shows that the control
has little impact on the performance.
II. TCP DYNAMICS IDENTIFICATION METHODOLOGY
This section describes the identification methods used
to develop a model of the dynamics of packet arrivals.
This approach to modeling is quite different from the
typical approach to AQM that uses first principles models.
However, those models suffer from a number of drawbacks
that are avoided by the approach taken here. The first
principles approach models the mean packet arrival rate,
whereas in practice, the actual arrival of packets does not
exactly follow the mean but is some stochastic process.
The approach here embraces the stochastic nature of packet
arrivals. Another serious problem with the first principles
approach to TCP modeling is that these models do not
model TCP time-out; time-out has been shown to play a
significant role in TCP [9]. The approach here will account
for time-out and any other mechanism (e.g., the dynamics
of fast retransmit/fast recovery, slow-start after a time-out,
etc.). On the other hand, it is difficult to see how such an
approach could be used in practice. However, the objective
of this work is to understand the performance limitations in
a fixed and known operating environment. This "best-case"
situation then stands as a benchmark
In order to identify models of TCP, extensive simulations
were carried out with the Network Simulator (ns) [10].
TCP traffic was modeled as long lived FTP traffic, that is,
for each source-destination pair a single TCP connection
sent data for the entire simulation over a single bottleneck
1016
7
9
10
5
6
Monitored Link
11
-26.5
-32.5
Power Spectral Density
(dB/ rad/sample)
1
Power Spectral Density
(dB/ rad/sample)
0
4
SamplePeriod = 0.05
SamplePeriod = 0.01
8
3
Destinations
Sources
2
-27
-33
-27.5
-33.5
-28
-34
-28.5
-34.5
-29
-35
-29.5
-35.5
-30
-30.50 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
Normalized Frequency (u Srad/sample)
Fig. 1. Dumbbell topology. The traffic is sent from sources to destinations.
Link 0-1, the bottleneck link, is monitored.
l=0
l=0
where % is the so-called residual noise and O denotes the
order of the model. Standard least-squares techniques were
used to estimate the coefficients dl and el . Because the
best choice of the model order, O> is generally not known
a priori, it is usually necessary in practice to postulate
several model orders. Many criteria have been proposed as
tentatively objective functions for selecting the ARX model
order. One of the best known ones is Rissanen’s Minimum
Description Length (MDL) criterion [12] which has the
form (for Gaussian disturbances, which is the case here [8]),
P GO[O] = Q ln(P VHO ) + O ln (Q ) >
SamplePeriod = 0.5
-30
Power Spectral Density
(dB/ rad/sample)
Power Spectral Density
(dB/ rad/sample)
Normalized Frequency (u Srad/sample)
SamplePeriod = 0.1
-31.5
-32
-30.5
-32.5
-31
-33
-31.5
-33.5
-32
-32.5
-34
-33
-34.5
-35 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
Normalized Frequency (u Srad/sample)
-33.50 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
Normalized Frequency (u Srad/sample)
Fig. 2. Power spectral density plots of % for sample periods T=0.01, 0.05,
0.1, and 0.5 seconds.
SamplePeriod = 1
SamplePeriod = 10
-30
Power Spectral Density
(dB/ rad/sample)
-38
-38.5
Power Spectral Density
(dB/ rad/sample)
-30.5
-39
-31
-39.5
-31.5
-40
-32
-40.5
-32.5
-41
-33
-33.5
-41.5
0
-42 0
0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
SamplePeriod = 100
SamplePeriod = 50
-47.5
Power Spectral Density
(dB/ rad/sample)
-44.5
-45
-48
-45.5
-48.5
-46
-49
-46.5
-49.5
-50
-47
-50.5
-47.5
-48
0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
Normalized Frequency (u S rad/sample)
Normalized Frequency (u S rad/sample)
Power Spectral Density
(dB/ rad/sample)
dumbbell topology, as shown in Fig 1. The nodes Vl (
l = 2> ===> 6) are set as the sources and the nodes Gl
(l = 7> ===> 11) are set as the destinations. The monitored
link, the bottleneck link, is 0 to 1. The starting time of the
flows was varied slightly randomly so that each simulation
was different. There were 4 FTP flows with a round-trip
time (RTT) of 24 msec, 4 FTP flows with a RTT of 26
msec, and 4 FTP flows with a RTT of 28 msec. Each
simulation was run for an extremely long time to ensure
that all parameters were accurately estimated. In particular,
most simulations used over 30,000 sample points. Hence,
when the sample period was 100 sec, the simulation time
was 3,000,000 sec or nearly 35 days.
In the system under investigation, the queue imposes
a loss probability on every arriving packet. This drop
probability is uniformly distributed over [0.0295, 0.0305]
and during each sample period, the drop probability is
fixed. Hence, at time step n, the loss probability s (n) is
set for the time period [nW> (n + 1)W )> where W is the
sample period. We consider sample periods from 10 msec
to nearly an hour and a half. In order to keep the scale
of the packet arrivals the same for all sample periods, we
define | (n + 1) to be the normalized packet arrivals over
the period [nW> (n + 1)W ). The normalization is done by
dividing the observed packet arrivals by the link speed and
the sample period. In this queue discipline, drops could
also occur if the queue fills. However, the queue size is
set sufficiently large and the drop probability is set large
enough, so that the queue does not fill up.
Here we consider only linear ARX models [11] for TCP
traffic ([8] shows that nonlinear models are not required).
These linear ARX models are defined as
O1
O1
X
X
| (n + 1) =
dl | (n l) +
el s (n l) + % (n) >
-36 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
0
0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
Normalized Frequency (u S rad/sample)
-51
0
0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
Normalized Frequency (u S rad/sample)
Fig. 3. Power spectral density plots of % for sample periods T=1, 10, 50,
and 100 seconds.
where Q is the length of the data record and P VHO is the
mean square error defined as
Q
1 X
P VHO :=
(| (n) |b (n))2 >
Q
n=1
where | (n) is the observed packet arrivals while |b (n) is
the packet arrivals predicted by the model from the past
O samples. The order O is selected so as to minimize the
MDL. Furthermore, we use balanced model order reduction
techniques to reduce the order in the case of higher sampling
rates for simplicity.
III. R ELIABILITY OF IDENTIFIED MODELS
Clearly the models are optimal if the residual error % is
uncorrelated and Gaussian.
The uncorrelated property was checked from the “flatness” of the power spectral density of the residual sequence
% using the Pcov method (see Figs 2 and 3). Clearly, for
W = 1=0> ===> 0=1> sec (Figs 2, 3), the power spectral density
appears fairly flat. Not surprisingly, at higher sampling rates,
W = 0=05> 0=01> sec (Fig 2), the models become less reliable
1017
S a m p le P e rio d = 0 .0 1
S a m p le P e rio d = 0 .0 1
S a m p le P e rio d = 0 .0 5
S a m p le P e rio d = 0 .0 5
2
-0 .3
-0 .2 -0 .1
0
0 .1
0 .2
D a ta
S a m p le P e rio d = 0 .1
-0 .2
-0 .1
D a ta
0
0 .1
-2 -2
Imag Axis
-0 .3
0 .2
-0 .2
-0 .1
0
D a ta
0 .1
0 .2
1
0 .8
0 .6
0 .4
0 .2
0
-0 .2
-0 .4
-0 .6
-0 .8
-1 -2
Imag Axis
-1 .5
-1
-0 .5
0
Fig. 4. Normplots of the residual error % for sample periods T=0.01,
0.05, 0.1, and 0.5 seconds.
S a m p le P e rio d = 1
Probability
Probability
0 .0 50 .1 0 .1 5
-0 .0 6-0 .0 4-0 .0 2 0
Probability
Probability
-0 .0 4-0 .0 3-0 .0 2-0 .0 1 0
0 .0 1 0 .0 2 0 .0 3 0 .0 4
D a ta
0 .0 2 0 .0 4 0 .0 6 0 .0 8
D a ta
S a m p le P e rio d = 1 0 0
D a ta
S a m p le P e rio d = 5 0 .
0 .9 9 9
0 .9 9 7
0 .9
0
.9 9
8
0 .9 5
0 .9 0
0 .7 5
0 .5 0
0 .2 5
0 .1 0
0 .0 5
0 .0
.0 1
2
0
0 .0 0 3
0 .0 0 1
1
-1 .5
-1
-0 .5
0
0 .5
-4
-3
-2
-1
R e a l A x is
S a m p le P e rio d = 0 .5
0
1
0 .2 0 .4 0 .6 0 .8 1
R e a l A x is
Fig. 6. Pole/zero configurations of full-order models for sample periods
T=0.01, 0.05, 0.1, and 0.5 seconds.
S a m p le P e rio d = 1 0
0 .9 9 9
0 .9 9 7
0
0 .9
.9 9
8
0 .9 5
0 .9 0
0 .7 5
0 .5 0
0 .2 5
0 .1 0
0 .0 5
0
0 .0
.0 2
1
0 .0 0 3
0 .0 0 1
-0 .2 5-0 .2 -0 .1 5-0 .1 -0 .0 50
0 .5
R e a l A x is
S a m p le P e rio d = 0 .1
R e a l A x is
0 .9 9 9
0 .9 9 7
0 .9
0
.9 9
8
0 .9 5
0 .9 0
0 .7 5
0 .5 0
0 .2 5
0 .1 0
0 .0 5
0
0 .0
.0 2
1
0 .0 0 3
0 .0 0 1
1
-1 .5
0 .0 50 .1 0 .1 5
D a ta
S a m p le P e rio d = 0 .5
Probability
Probability
-0 .3
1
0 .8
0 .6
0 .4
0 .2
0
-0 .2
-0 .4
-0 .6
-0 .8
-1 -1 -0 .8 -0 .6 -0 .4 -0 .2 0
0
-1
0 .9 9 9
0 .9 9 7
0 .9
0
.9 9
8
0 .9 5
0 .9 0
0 .7 5
0 .5 0
0 .2 5
0 .1 0
0 .0 5
0 .0
.0 1
2
0
0 .0 0 3
0 .0 0 1
0 .9 9 9
0 .9 9 7
0 .9
0
.9 9
8
0 .9 5
0 .9 0
0 .7 5
0 .5 0
0 .2 5
0 .1 0
0 .0 5
0 .0
.0 1
2
0
0 .0 0 3
0 .0 0 1
1 .5
1
0 .5
-0 .5
-0 .3 5-0 .3 -0 .2 5-0 .2 -0 .1 5-0 .1 -0 .0 50
0 .3
1
0 .8
0 .6
0 .4
0 .2
0
-0 .2
-0 .4
-0 .6
-0 .8
-1 -5
1 .5
Imag Axis
-0 .4
0 .9 9 9
0 .9 9 7
0 .9
0
.9 9
8
0 .9 5
0 .9 0
0 .7 5
0 .5 0
0 .2 5
0 .1 0
0 .0 5
0 .0
.0 1
2
0
0 .0 0 3
0 .0 0 1
Imag Axis
Probability
Probability
0 .9 9 9
0 .9 9 7
0 .9
0
.9 9
8
0 .9 5
0 .9 0
0 .7 5
0 .5 0
0 .2 5
0 .1 0
0 .0 5
0 .0
.0 1
2
0
0 .0 0 3
0 .0 0 1
0 .9 9 9
0 .9 9 7
0 .9
0
.9 9
8
0 .9 5
0 .9 0
0 .7 5
0 .5 0
0 .2 5
0 .1 0
0 .0 5
0 .0
.0 1
2
0
0 .0 0 3
0 .0 0 1
-0 .0 2
-05.0-0
2 .0 1
-05.0-0
1 .0 0 0
5 0 .0 005.0 1
0 .0 105.0 2
0 .0 2 5
D a ta
Fig. 5. Normplots of the residual error % for sample periods T=1, 10, 50,
and 100 seconds.
as shown by the non-flat power spectral densities. However,
even at these small sample periods, the ripple in the power
spectral density is less than 6 × 104 . For large sample
periods, W 10 sec (Fig 3), the power spectral density
exhibits ripple of less than 105 .
Regarding the Gauss property, Figs 4 and 5 show the
outcome of the MATLAB “normplot” test. If the resulting
plot is linear, then the residual sequence is Gaussian. For
intermediate sampling rates, W = 0=05> 0=1> sec the Gauss
property is obvious. For high sampling rate, W = 0=01 sec,
the Gauss property deteriorates slightly in the tail. Specifically, 99=4 % of the probability mass is well-modeled by
Gaussian distribution. This test concludes that the Gaussian
assumption is a reasonable one.
IV. POLE / ZERO CONFIGURATION
In order to gain a deeper understanding of this modeling
approach, we examine the pole/zero configuration of the
open-loop system.
Figs 6, 7 show a clear trend in the pole/zero configuration
as the sampling period decreases from 100 to 0.01 sec.
The pole/zero plots start with a pure delay (pole at zero
and a pair of nearly canceling pole/zero) for W = 100
to a much more complicated configuration with multiple
pole/zero pairs near the unit circle along with nonminimum
phase zeros for W = 0=01 sec. With a sampling interval
of W = 0=01 sec> the oscillatory poles/zeros near the unit
circle with their arguments between l = 2 and m = 3
4
correspond to oscillations of periods equal to 2W
l 0=04 A
UW W and 2W
m 0=03 UW W> respectively. At this very
low time scale, the stable poles/zeros are interlaced and
this interlaced property of the lightly damped poles/zeros
produce substantial phase variation making stabilization
difficult. In the case of small sampling period, this should
be kept in mind, however, the poles/zeros close to the unit
circle and with small phase angles could be artifacts of the
fast sampling rather than intrinsically complicated features
of the dynamics. As the time scale increases, we observe
that there is the trend of the poles/zeros inside the unit circle
going from an interlaced configuration to a configuration
where the poles/zeros move on top of each other and
cancel out. In this situation, balanced model reduction
[13], [14] would in fact do the pole/zero cancellation in
a numerically safe manner and produce a simple reduced
order model. Through the same process, balanced model
reduction would make the distinction between those lightly
damped poles/zeros that are artifacts of the fast sampling
and those that are dynamically relevant; the former would
be deleted and the latter would be kept under balanced
model reduction. But probably the most important feature
of pole/zero configuration is the transition in the structure
as revealed by a comparison between the upper left graph in
Figure 8 (W = 0=01 sec) and the lower left graph in Figure
8 (W = 0=05 sec). It should be observed that, around those
values, the sampling period W transits through the round
trip time (UW W ), which in this case is around 0.03 sec. It
appears that one should make a distinction between three
different cases depending on how the sampling interval
relates to the UW W :
1018
1) UW W ¿ 0=5 W 100 (Figs 6,7). In this situation,
the sampling interval is much greater than the UW W ,
leading to an input/output transfer function with fairly
well damped poles. There always seems to be a pole
0.5
0
-0.5
-1
-1.5
0.2 0.4 0.6 0.8 1
Real Axis
Sam plePeriod = 100
-2
-2
-1.5
-1
-0.5
0
0.5
1
1.5
Real Axis
SamplePeriod = 0.05, DropProb = 0.03
1
0.8
0.6
0.4
0.2
0
-0.2
-0.4
-0.6
-0.8
-1
-5
Imag Axis
Imag Axis
Imag Axis
Real Axis
1
0.2 0.4 0.6 0.8 1
Real Axis
Fig. 7. Pole/zero configurations of full-order models for sample periods
T=1, 10, 50, and 100 seconds.
SamplePeriod = 0.01, DropProb = 0.003
1
0.8
0.6
0.4
0.2
0
-0.2
-0.4
-0.6
-0.8
-1
-2.5
-2
1
0.8
0.6
0.4
0.2
0
-0.2
-0.4
-0.6
-0.8
-1
-2
-1.5
Imag Axis
Imag Axis
Real Axis
Sam plePeriod = 50
2
1.5
Imag Axis
0.2 0.4 0.6 0.8 1
1
0.8
0.6
0.4
0.2
0
-0.2
-0.4
-0.6
-0.8
-1
-1 -0.8 -0.6 -0.4 -0.2 0
Imag Axis
0.2 0.4 0.6 0.8 1
1
0.8
0.6
0.4
0.2
0
-0.2
-0.4
-0.6
-0.8
-1
-1 -0.8 -0.6 -0.4 -0.2 0
SamplePeriod = 0.01, DropProb = 0.03
Sam plePeriod = 10
1
0.8
0.6
0.4
0.2
0
-0.2
-0.4
-0.6
-0.8
-1 -1 -0.8 -0.6 -0.4 -0.2 0
-1.5
-1
-0.5
0
0.5
1
Real Axis
SamplePeriod = 0.1, DropProb = 0.03
Imag Axis
Sam plePeriod = 1
1
0.8
0.6
0.4
0.2
0
-0.2
-0.4
-0.6
-0.8
-1 -1 -0.8 -0.6 -0.4 -0.2 0
-4
-3
-2
Real Axis
-1
0
1
-1
-0.5
Real Axis
0
0.5
1
Fig. 8. Pole/zero configurations of reduced order models for sample
periods T=0.01 (p = 0.03) , 0.01 (p = 0.003), 0.05 (p = 0.03), and 0.1 (p
= 0.03) seconds.
at the origin or near the origin, so that the system
behaves more like a delay, meaning that the packet
arrival does not depend as much on its past history
as it depends on the drop probability. It also appears
that any attempts to reduce the model using system
balancing only produce models qualitatively different
from the full order models. Besides, reduced models
do not seem to be necessary, as the full order pole/zero
diagrams are easily to interpret.
2) 0=05 W UW W 0=1 (Figs 6 and 7). Here we are
in the transition range. The strings of pole/zero pairs
near the unit circle are somewhat misleading, because
balanced model reduction yields models with a pole
at the origin, a pole at 1, and a nonminimum phase
zero on the negative real axis; the lightly damped
poles/zeros have been deleted by the reduction of the
model (Figure 8). The irrelevance of the pole/zero
pairs was predictable from the close proximity of the
poles and the zeros, which can safely be assumed to
cancel. It follows that the packet arrival depends on
the recent past history and the drop probability.
3) W 0=01 ? UW W (Fig 6). Here we are sampling at a
rate faster than the RTT. The raw pole/zero diagram
produced by the model using the MDL is a little
difficult to interpret. For this reason, we first used
balancing to reduce its order. The best compromise
between retaining the key features of the original
model and having a model of sufficient simplicity
to lend itself to analysis is shown in Figure 8. The
reduced model retains some of the early features, that
is, a pole near the origin (at 0=2), a pole near 1,
along with a pair of nearly canceling pole/zero near
1, and a nonminimum phase zero on the negative real
axis. Since the sample rate is less than the RTT, the
dynamics of TCP can be seen by examining the poles.
As discussed in Section II, the simulations included
flows with three different round-trip times: 24, 26,
1019
and 28 msec. The reduced model has poles at these
locations as well as one more set with an oscillation
period of 33 msec and two real poles. The poles
corresponding to the round-trip times and one of the
real poles have slowest decay. The table below shows
the periods of oscillation along with the times that the
modes take to decay to 10% of their initial values.
oscillation period
decay times
26.5 msec
1.2 sec
28.5 msec
0.58 sec
oscillation period
decay times
24.4 msec
1.1 sec
real
0.62 sec
real
0.016 sec
33 msec
0.4 sec
It is likely that the burstiness of the packet arrivals
is the cause of the poles to appear with oscillation
periods equal to the round-trip times of the flows. To
see this, consider the example of a TCP flow in slowstart with congestion window equal to one. Hence,
one packet is transmitted. Upon receiving the ACK for
this packet, the congestion window is increased to two
and two packets are sent back-to-back. These packets
are then received in a back-to-back fashion by the
destination host and the ACKs for these are returned,
again back-to-back. Upon receiving these two ACKs,
the congestion window is increased to 4 and a burst
of 4 packets is transmitted. One round-trip time later,
a burst of 8 packets is sent. This periodic bursting
continues until there is a drop. When TCP is in the
congestion avoidance state, packets are still sent in
bursts every round-trip time; however, the size of the
burst only increases by one packet every round-trip
time. In either case, the presence of a connection with
a particular round-trip time leads to periodic arrival of
packets with the same period as the round-trip time.
While the periods of oscillation for some of the poles
have a clear physical interpretation, the damping is
less clear. However, it appears that a lower average
TCP Flows
AQ M
K(z)
-
drop
probability p
sea of poles and zeros, the relationship between
round-trip times and pole oscillation periods is not
apparent.
QoS
modeling error w
+
+
dynamics
packet
G(z)
arrival y
number of flows,
round-trip times
Fig. 9. Block diagram of the overall system consisting of a feedback
loop for the packet arrival rate and a feedforward structure for the Quality
of Service (QoS).
V. AQM DESIGN PRINCIPLES
After identifying the ARX model, the transmittance J(})
from s to | can be computed. More specifically,
PO1 l
%(})
l=0 el }
|(}) =
s(}) +
PO1
P
O1
} l=0 dl } l
} l=0 dl } l
|
{z
}
|
{z
}
J(})
loss probability leads to a slower decay. This was
verified by performing many simulations where the
sample rate was held constant, but the average loss
probability was reduced by a factor of ten to 0.003.
In this case, the reduced order model is similar. The
pole/zero diagram is shown in Fig 8. The frequencies
of the poles are nearly identical. However, the damping is significantly different. The table below shows
the oscillation periods along with the times to decay
to 10% of the initial values.
oscillation periods
decay times
26.0 msec
12 sec
28.0 msec
21 sec
oscillation periods
decay times
24.1 msec
1.4 sec
real
7.7 sec
real
0.7 sec
35 msec
4.3 sec
Clearly, the modes that correspond to the round-trip
times of 26 and 28 msec decay at drastically reduced
rates. This trend of slower rates of decay is consistent
with other simulations. However, this trend is not
exact. For example, in the two tables above, it is
noticed that the mode with a period of 24 msec only
has a small change in decay rate. Thus, while a direct
relationship between the rate of decay of a particular
mode and the packet loss probability experienced
by the flow that gives rise to the mode cannot be
determined, there is a strong relationship between the
loss probability of all of the flows and the rate of
decay of the entire system.
To understand the relationship between the loss probability and the decay rate, note that a flow experiencing
ten times smaller loss probability will typically send
ten times more packet between packet losses. This
leads to a greater average sending rate, but more
importantly, a longer time between losses. Intuitively,
a reasonable estimate of the future value of the congestion window is that it will increase until roughly
1/S (packet loss) packets are sent. Clearly, if the loss
probability is smaller, the effect of the specific value
of the congestion window will be noticeable for a
longer time than if the loss probability if larger.
4) W ¿ UW W . While no figures are included for this
case, it should be remarked that for very small sample
periods, the model order is very large even after
balanced model order reduction. Furthermore, in the
z(})
Recall, that s is the deviation of the loss probability from
the nominal loss probability and | is the deviation of the
number of packet arrivals during a sample period from the
nominal number of packet arrivals during a sample period.
This model accurately depicts how, given a fixed number
of flows, | will vary over time. This variation can lead to
the queue occasionally emptying and hence a decrease in
utilization1 . One possible approach to avoid empty queues
is to make the queue capacity large. For example, in the
single flow case, it is possible to show that if the capacity of
the queue is the same as delay-bandwidth product, then the
queue will never empty. The drawback is that increasing the
queue size also increases the delay. Since the time it takes
to transfer a small file is strongly dependent on the delay,
increasing the queue size can decrease the performance of
the network. Therefore, a principle objective of AQM is for
the queue to remain as unoccupied as possible, yet never
empty. This goal could be achieved if the number of packet
arrivals in a sample period is nearly constant. The more
variation in the arrival rate, the larger the variation in the
queue occupancy and hence the larger the average queue
occupancy required to avoid an empty queue.
Considering Figure 9, we see that from the control
theory perspective, AQM should minimize the impact of
a disturbance z on the output. |
There are two ways one can approach this performance
objective; either the sup-norm of the transmittance from
the disturbance to the output is minimized, or the supnorm of the transmittance from the disturbance to the
integral of the output is minimized. There two approaches
are nearly the same; in both cases the objective is to
minimize the weighted sensitivity function of the closedloop system. The only difference is the weighting function.
Specifically, if V is the transmittance from the disturbance
to the output (i.e., V = (1 + JN)1 ), and Z is the
transfer function of the weighting function, we seek to
minimize kZ Vk4 . However, it is a good practice to also
minimize W , the complementary sensitivity, where W :=
(1 + JN)1
we seek to find a controller N that
°
° JN. Thus
° ZV °
°
minimizes °
° W ° . As it turns out, the inclusion of the
4
complementary sensitivity does not influence the controller.
1 If the queue never empties, then the link is always sending packet,
hence full utilization is achieved. On the other hand, if the queue empties,
the link will be idle, less than full utilization.
1020
1
4
0
3
2
-1
1
-2
Magnitude (dB)
Next we discuss the rationale behind the selection of
different weighting functions. First, we consider the weighting function
an integrator. Thus, we seek to
° 1that includes °
° Z (}) V (}) °
}
° , where we have decomposed
minimize °
°
°
W
4
the weighting function into the integrator and Z= The
rationale behind including an integrator is that the integrator mimics the queue and therefore this control strategy
will penalize large queueing delay. Further weighting with
Z is also appropriate. To see this, compare the effect
of low frequency oscillations in the queue occupancy to
high frequency oscillations. High frequency oscillations will
appear as short-term, intermittent congestion. However, low
frequency oscillations will appear as persistent congestion
that will include persistent queueing delay and/or multiple
drops. Such persistent congestion has adverse effects on
different applications. For example, in voice-over-IP this
persistent congestion can have a significant impact on
quality, while short-lived congestion has little impact on
quality. TCP connections can suffer time-outs if congestion
is persistent. Thus, if an integrator is incorporated into the
weighting function, a low pass filter should also be included.
There is some motivation to not include the integrator,
and simply focus on minimizing the variation in the number
of packet arrivals in a sample period. To see this, consider
the case where the nominal queue occupancy is very small.
In this case, if the packet arrival rate falls below the link
speed, then the queue will empty and remain empty until
the number of arrivals in a sample period surpasses the
link-capacity. On the other hand, the integral of | will
tend towards 4. In order to force the integrator to zero,
the controller will eventually cause the number of packet
arrivals to increase beyond the link capacity. However,
since the queue was fixed at zero, this increase in data
rate will cause the queue to fill. Similarly, [4] discussed
the idea of the setting the desired packet arrival rate to
be below the link speed. In this case, since small positive
deviations of the number of packet arrivals does not affect
the queue, incorporating an integrator into the weighting
function makes even less sense.
While it would be best if the sensitivity could be reduced
over all frequencies, the Bode limitation discussed in the
next section shows that this is not possible. Thus, the
frequencies over which the sensitivity should be reduced
must be determined. Consider low frequency variation in |.
Such variations will cause the packet arrival rate to increase
beyond the link speed for extended periods of time. This
will cause large queueing delay and perhaps fill the queue
and cause uncontrolled drops. On the other hand, high
frequency variations in the number of packet arrivals will
not cause the queue to fill significantly. Thus, a low-pass
weighting function should be used to put emphasis on the
sensitivity function at low frequencies.
Next, the cut-off frequency of the weighting functions
is determined. To make this selection, we consider the
fundamental frequency of TCP. Consider a TCP flow with
0
-3
-4 0
10 -3
x 10
2.5
2
1.5
1
0.5
0
-0.5
-1
-1.5 0
10
-1
1
10
2
10
3 -2 0
10
10 -3
x 10
1.5
1
10
2
10
3
10
1
0.5
0
-0.5
-1
1
10
2
10
3-1.5 0
1
10 10
10
Frequency (rad/sec)
2
10
3
10
Fig. 10. Design of the controllers in the upper frames did not have an
integrator. Rather, the weighting function is a simple low-pass filter. The
cut-off frequency of the weighting function for the sensitivity on the left
was 6 rad/sec and was 50 rad/sec on the right. Design of the controllers
in the lower frames had an integrator. The weighting function used in the
left-hand plot was an integrator while the weighting function used in the
right-hand plot was an integrator and a low-pass filter with cut-off at 50
rad/sec.
nominal loss probability of 3% and RTT of 25 msec. It is
well known that the average
p size
s of the congestion window
of this flow is given by 3@2@ 0=03 = 7. If the TCP flow
assumes its saw-tooth pattern, then the maximum value of
the window is around 9 and the minimum is 4. The variation
between 4 and 9 takes 5 round-trip times or 125 msec. Thus
we see that the fundamental frequency of this flow is 50
udg@ sec. The damping of these deviations of | at these
frequencies could be a design goal. However, considering
how this frequency is intrinsic to TCP, it is not clear if such
frequencies could be damped. Indeed, we will see that they
cannot be.
Since a weighting function with an integrator and one
without an integrator are both justified, we examine both
situations. In both situations, the conclusion is the same,
the controller has a fairly minor impact on the performance
of the system.
Before embarking on an attempt to reduce sensitivity, we
are forewarned that the sensitivity can never be arbitrarily
reduced. The Bode limitation imposes strict bounds on what
is possible.
VI. B ODE LIMITATION
From the robust control point of view, the effect of the
modeling noise z on the feedback is given by the sensitivity
function V via
|(}) = V(})z(})>
where V(}) = (1 + J(})N(}))1 and N (}) is the transmittance of the AQM. Hence, if the packet arrivals are to be
controlled, then the impact of this noise must be reduced by
1021
minimizing the sensitivity function. In terms of the power
spectral densities || > zz of |> z, respectively, the above
becomes
l
l
)V(hl ) zz (hl )=
|| (h ) = V(h
that the sensitivity is only slightly below -1dB. Again, it
is difficult to justify the inclusion of such a controller in a
network.
The drop probability is imposed in the same time interval
over which the packet arrival is observed. Therefore, in general, there is a direct transmission and J(4) 6= 0= However,
the controller can only decide on a drop probability over
[nW> (n 1)W ] after it has observed the packet arrival on
[(n 2)W> (n 1)W ]. Therefore, the loop O(}) = N(})J(})
inevitably have some delay, so that O(4) = 0. The latter,
along with the open-loop stability of the system, yields the
Bode limitation [15]:
Z +
¡
¢
log V(hl )V(hl ) g = 0=
This work, along with [8], makes the point that one
should not hope for much performance gain at small
time-scales by using AQM schemes. Specifically, it seems
unlikely that a controller, in a fixed environment, could
actively adjust the loss probability to control the short-term
variations in the queue occupancy. The conclusion of this
work is not that AQM is not a realistic goal, but rather to
understand what objectives AQM could achieve.
Since it appears fruitless to attempt to control TCP
dynamics by varying the loss probability around some a
priori imposed static probability, the real problem would
be to find, for a given number of flows and their RTTs, the
static probability that would secure the desired mean output
|. However, changes in the environment (e.g., changes in
the number of flows and round-trip times) would require
adjusting the static loss probability. Thus, future work will
focus on schemes that quickly find the appropriate static
loss probability in a way that leads to a stable closed-loop
system. Note this stands in contrast to the original idea of
AQM where it was hoped that overflow could be minimized
by predicting the future value of the queue occupancy.
This result implies that it is not possible to construct a
controller that makes the sensitivity uniformly small; if V
is decreased at some frequencies, it must increase at others.
As a result, we see that one cannot hope to smooth the
variations in the packet arrivals at all frequencies. Next we
examine several examples to determine to what degree the
sensitivity can be reduced at any frequency. We will see
that it is difficult to get the sensitivity below -1dB. Thus,
the controller has little impact on damping disturbance and
smoothing the variations in |.
VII. K 4 CONTROLLER
We examine two examples where no integrator is incorporated into the weighting function and two examples with
an integrator. We employ K 4 control so that we are assured
that the controller is optimal with respect to minimizing
the weighted sensitivity function. The upper frames of
Figure 10 show the closed-loop sensitivity function where
no integrator is incorporated into the weighting function.
The right-hand plot in the upper frames shows the case
where the cut-off for the weighting function is slightly
above 50 udg@ sec which is the fundamental frequency of
the TCP flows considered here. In this case, the sensitivity
in the low frequencies is above -1dB, indicating rather poor
performance. By decreasing the cut-off frequency of the
weighting function, the sensitivity can be decreased at low
frequencies. However, the left-hand plot in the upper frames
of Figure 10 shows that even if the cut-off is at very low
frequencies, the sensitivity is only slightly below -3dB.
Thus, such a controller would decrease the disturbances
with periods of one second or greater (recall that the
TCP flows considered here have a fundamental period of
oscillation of around 125 msec). The controller acts to cut
the impact of the disturbance by 30 %. While a reduction by
30% is useful, it is not particularly large. This controller also
has no impact on the disturbances with period below 1 sec.
We have found significant disturbances at these moderate
frequencies.
The lower frames of Figure 10 show the sensitivity when
an integrator is included in the weighting function. We see
VIII. C ONCLUSION
R EFERENCES
[1] W. Stevens, TCP/IP Illustrated, Vol, Addison Wesley, New York,
1994.
[2] Sally Floyd and Van Jacobson, “Random early detection gateways
for congestion avoidance”, IEEE/ACM Transactions on Networking,
vol. 1, pp. 397–413, 1993.
[3] Eyad Abed and Priya Ranjan, “Nonlinear instabilities in TCP-RED”,
in INFOCOM, 2002.
[4] Srisankar Kunniyur and R. Srikant, “Analysis and design of an adaptive virtual queue (AVQ) algorithm for active queue management”,
in Proceedings of ACM SIGCOMM, 2001.
[5] S. Low, F. Paganini, and J. Doyle, “Internet congestion control: An
analytical perspective”, IEEE Control Systems Magazine, February
2002.
[6] Sanjeeva Athuraliya, Steven H. Low, and David E. Lapsley, “Random
early marking”, in QofIS, 2000.
[7] G. Vinnicombe, “On the stability of end-to-end congestion control
for the internet”, in CUED/F-INFENG/TR.398, 2000.
[8] K. Shah, S. Bohacek, and E. Jonckheere, “The predictability of data
network traffic”, in American Control Conference, 2003.
[9] Stephan Bohacek and Khushboo Shah, “TCP throughput and timeout
– steady state and time-varying dynamics”, Globecom, 2004.
[10] “Http://www.isi.edu/nsnam”.
[11] Lennart Ljung, System Identification; Theory for the User, 1987.
[12] J. Rissanen, “Modeling by shortest data description”, Automatica,
vol. 14, pp. 465–471, 1978.
[13] U. M. Al-Saggaf and G. F. Franklin, “An error bound for a
discrete reduced order model of a linear multivariable system”, IEEE
transactions on Automatic Control, vol. 32, pp. 815–819, 1987.
[14] E. Jonckheere, “Principal component analysis of flexible systemsopen-loop case”, IEEE trans. on AUTOMATIC CONTROL, 1984.
[15] Bing-Fei Wu and E. Jonckheere, “A simplified approach to bode’s
theorem for continuous-time and discrete-time systems”, IEEE Trans.
on Automatic Control, 1992.
1022