Static vs. Dynamic Timing Analysis
Static vs. Dynamic Timing Analysis
Static vs. Dynamic Timing Analysis
The major design challenges of ASIC design consist of microscopic issues and
macroscopic issues [1]. The microscopic issues are ultra-high speeds, power dissipation,
supply rail drop, growing importance of interconnect, noise, crosstalk, reliability,
manufacturability and the clock distribution. The macroscopic issues are time to market,
design complexity, high levels of abstractions, reuse, IP portability, systems on a chip and
tool interoperability.
To meet the design challenge of clock distribution, the timing analysis is performed.
Timing analysis is to estimate when the output of a given circuit gets stable. Timing
Analysis (TA) is a design automation program which provides an alternative to the
hardware debugging of timing problems. The program establishes whether all paths
within the design meet stated timing criteria, that is, that data signals arrive at storage
elements early enough valid gating but not so early as to cause premature gating. The
output of Timing Analysis includes ‘Slack” at each block to provide a measure of the
severity of any timing problem [13].
Static Timing Analysis (STA) works with timing models where as the Dynamic Timing
Analysis (DTA) works with spice models. STA has more pessimism and thus gives
maximum delay of the design. DTA overcomes this difficulty because it performs full
timing simulation. The problem associated with DTA is the computational complexity
involved in finding the input pattern(s) that produces maximum delay at the output and
hence it is slow. The static timing analyzer will report the following delays: Register to
Register delays, Setup times of all external synchronous inputs, Clock to Output delays,
Pin to Pin combinational delays. The clock to output delay is usually just reported as
simply another pin-to-pin combinational delay. Timing analysis reports are often
pessimistic since they use worst case conditions.
The wide spread use of STA can be attributed to several factors [2]:
The basic STA algorithm is linear in runtime with circuit size, allowing analysis of
designs in excess of 10 million instances.
The basic STA analysis is conservative in the sense that it will over-estimate the delay of
long paths in the circuit and under-estimate the delay of short paths in the circuit. This
makes the analysis ”safe”, guaranteeing that the design will function at least as fast as
predicted and will not suffer from hold-time violations.
The STA algorithms have become fairly mature, addressing critical timing issues such as
interconnect analysis, accurate delay modeling, false or multi-cycle paths, etc.
Delay characterization for cell libraries is clearly defined, forms an effective interface
between the foundry and the design team, and is readily available. In addition to this, the
Static Timing Analysis (STA) does not require input vectors and has a runtime that is
linear with the size of the circuit [9].
This variation accounts for deviations in the semiconductor fabrication process. Usually
process variation is treated as a percentage variation in the performance calculation.
Variations in the process parameters can be impurity concentration densities, oxide
thicknesses and diffusion depths. These are caused bye non uniform conditions during
depositions and/or during diffusions of the impurities. This introduces variations in the
sheet resistance and transistor parameters such as threshold voltage. Variations are in the
dimensions of the devices, mainly resulting from the limited resolution of the
photolithographic process. This causes (W/L) variations in MOS transistors.
The design’s supply voltage can vary from the established ideal value during day-to-day
operation. Often a complex calculation (using a shift in threshold voltages) is employed,
but a simple linear scaling factor is also used for logic-level performance calculations.
The saturation current of a cell depends on the power supply. The delay of a cell is
dependent on the saturation current. In this way, the power supply inflects the
propagation delay of a cell. Throughout a chip, the power supply is not constant and
hence the propagation delay varies in a chip. The voltage drop is due to nonzero
resistance in the supply wires. A higher voltage makes a cell faster and hence the
propagation delay is reduced. The decrease is exponential for a wide voltage range. The
self-inductance of a supply line contributes also to a voltage drop. For example, when a
transistor is switching to high, it takes a current to charge up the output load. This time
varying current (for a short period of time) causes an opposite self-induced electromotive
force. The amplitude of the voltage drop is given by .V=L*dI/dt, where L is the self
inductance and I is the current through the line.
When a chip is operating, the temperature can vary throughout the chip. This is due to the
power dissipation in the MOS-transistors. The power consumption is mainly due to
switching, short-circuit and leakage power consumption. The average switching power
dissipation (approximately given by Paverage = Cload*Vpower supply 2*fclock) is due
to the required energy to charge up the parasitic and load capacitances. The short-circuit
power dissipation is due to the finite rise and fall times. The nMOS and pMOS transistors
may conduct for a short time during switching, forming a direct current from the power
supply to the ground. The leakage power consumption is due to the nonzero reverse
leakage and sub-threshold currents. The biggest contribution to the power consumption is
the switching. The dissipated power will increase the surrounding temperature. The
electron and hole mobility depend on the temperature. The mobility (in Si) decreases with
increased temperature for temperatures above –50 °C. The temperature, when the
mobility starts to decrease, depends on the doping concentration. A starting temperature
at –50 °C is true for doping concentrations below 1019 atoms/cm3. For higher doping
concentrations, the starting temperature is higher. When the electrons and holes move
slower, then the propagation delay increases. Hence, the propagation delay increases with
increased temperature. There is also a temperature effect, which has not been considered.
The threshold voltage of a transistor depends on the temperature. A higher temperature
will decrease the threshold voltage. A lower threshold voltage means a higher current and
therefore a better delay performance. This effect depends extremely on power supply,
threshold voltage, load and input slope of a cell. There is a competition between the two
effects and generally the mobility effect wins.
On Chip Variation
On-chip variation is minor differences on different parts of the chip within one operating
condition. On-Chip variation (OCV) delays vary across a single die due to:
Variations in the manufacturing process (P)
For net delays, the on-chip variation is between 2 percent above and 4 percent
below the SDF back-annotated values.
For cell timing checks, the on-chip variation is 10 percent above the SDF values
for setup checks and 20 percent below the SDF values for hold checks.
In Prime Time, OCV derations are implemented using the following commands:
The uncertainty in the timing estimate of a design can be classified into three main
categories.
For instance, the STA tool might utilize a conservative delay noise algorithm resulting in
certain paths operating faster than expected. Environmental uncertainty and uncertainty
due to modeling and analysis errors are typically modeled using worst-case margins,
whereas uncertainty in process is generally treated statistically.
In practice, device features vary among the devices on a chip and the likelihood that all
devices have a worst-case feature is extremely small. With increasing awareness of
process variation, a number of techniques have been developed which model random
delay variations and perform STA. These can be classified into full-chip analysis and
path-based analysis approaches.
Path based STA provides statistical information on a path-by-path basis. It accounts for
intra-die process variations and hence eliminates the pessimism in deterministic timing
analysis, based on case files. It is a more accurate measure of which paths are critical
under process variability, allowing more correct optimization of the circuit. This approach
does not include the load dependence of the gate delay due to variability of fan out gates
and does not address spatial correlations of intra-die variability.
To compute the intra-die path delay component of process variability, first the sensitivity
of gate delay, output slope and input load with respect to slope, output load and device
length are computed. Finally, when considering sequential circuits, the delay variation in
the buffered clock tree must be considered.
In general, the fully correlated assumptions will under-estimate the variation in the arrival
times at the leaf nodes of the clock tree which will tend to overestimate circuit
performance.