DAC2011 Placement Project Paper
DAC2011 Placement Project Paper
DAC2011 Placement Project Paper
ABSTRACT
Existing thermal-aware 3D placement methods assume that the
temperature of 3D ICs can be optimized by properly distributing
the power dissipations, and ignoring the heat conductivity of
though-silicon-vias (TSVs). However, our study indicates that this
is not exactly correct. While considering the thermal effect of
TSVs during placement appears to be quite complicated, we are
able to prove that when the TSV area in each bin is proportional
to the lumped power consumption in that bin, together with the
bins in all the tiers directly above it, the peak temperature is
minimized. Based on this criterion, we implement a thermalaware 3D placement tool. Compared to the methods that prefer a
uniform power distribution that only results in an 8% peak
temperature reduction, our method reduces the peak temperature
by 34% on average with even slightly less wirelength overhead.
These results suggest that considering thermal effects of TSVs is
necessary and effective during the placement stage. To the best of
the authors knowledge, this is the first thermal-aware 3D
placement tool that directly takes into consideration the thermal
and area impact of TSVs.
General Terms
Algorithms, Design.
Keywords
Thermal awareness, TSV distribution, 3D placement
1. INTRODUCTION
One of the most critical challenges in 3D IC design is heat
dissipation, which has already posed serious problemseven for
2D IC designs [1]. The thermal problem is exacerbated in the 3D
ICs for two main reasons: 1) The vertically stacked multiple
layers of active devices cause a rapid increase in power density; 2)
For face-to-back tier bonding, a dielectric layer exists between
each tier to provide insulation. The thermal conductivity of the
dielectric layers is very low compared to silicon and metal. For
Permission to make digital or hard copies of all or part of this work for
personal or classroom use is granted without fee provided that copies are
not made or distributed for profit or commercial advantage and that
copies bear this notice and the full citation on the first page. To copy
otherwise, to republish, to post on servers or to redistribute to lists,
requires prior specific permission and/or a fee.
DAC'11, June 5-10, 2011, San Diego, California, USA
Copyright 2011 ACM 978-1-4503-0636-2/11/06...$10.00
2. MOTIVATION
The stack-die structure has dramatically increased power density
compared to conventional 2D ICs, and thus threatens the thermal
reliability of 3D ICs. In addition, the low thermal conductivity of
the dielectric layers in face-to-back bonding tiers prohibits the
heat from flowing vertically. Accordingly, as pointed out in [10],
TSVs are the major channels for vertical heat flow.
Such an observation results in the fundamental difference between
the thermal-aware placement for 2D ICs and for 3D ICs. In 2D
placement, by properly distributing the power dissipations across
the chip, heat can flow uniformly through the entire substrate to
the heat sink, and the temperature can be minimized [16].
However, in 3D ICs, it is the correlation between the distributions
of the TSVs and the power density that has a direct impact on the
temperature. For example, compare the two artificial placement
results with the relative power values shown in Figure 1. In Figure
1(a), the power distribution is uniform while the TSVs are
clustered in the center; while in Figure 1(b), the power
distribution is non-uniform with 2 to 8 times higher power density
in some regions than the previous case, and the TSVs are
clustered proportional to the regional power density. The
corresponding temperature maps are shown in Figure 1(c) and (d),
respectively, assuming a 4-tier 3D chip with 6W power in a
1.5mm2 area with about 1200 TSVs per tier, where the 3D
technology parameters for temperature evaluation are the same as
in Section 5. From this artificial example, we can see that the
locations of the TSVs play a very important role in the thermal
integrity of 3D ICs.
As expected, it is sub-optimal for existing thermal-aware 3D
placement to be targeted at distributing power dissipations and
neglect the thermal effect of TSVs. To improve this, a nave
approach would be to compute the optimal locations of the TSVs
that can result in the minimum temperature during each iteration
of placement. However, it will result in an optimization-in-theloop with significant runtime overhead. Since thermal-aware
placement mainly targets large designs, this method is less
practical. On the other hand, if we adjust the locations of the
TSVs after placement is done to minimize the temperature; it will
bring about significant wirelength overhead because these TSVs
are also part of the signal nets. We will address this dilemma in
the remainder of the paper.
There are many different 3D integration technologies, including
face-to-face bonding, face-to-back bonding, via-first, via-middle,
via-last. Different techniques can have totally different thermal
models. In this paper we focus on the face-to-back bonding with
via-first technology. In addition, although it is possible to insert
additional thermal TSVs [10] after placement to further suppress
the temperature, it brings in extra area overhead. In this paper we
focus on exploring the opportunities of temperature reduction by
utilizing the signal TSVs in 3D placement. Our experimental
results show that signal TSVs alone can already reduce the
(a)
(b)
(c)
(d)
Figure 1. Uniform power with clustered TSVs vs. consistent TSV
and power distribution
B
B0
T
P
t i ( ti j )
pi ( pij )
Atot ( Atotj )
ai ( aij )
M i ( M ij )
n
K
gTSV
The two constraints are also self-evident: the total TSV area in
each tier is a fixed number, and the lumped TSV area in each bin
should be non-negative. Note that we have relaxed the constraint
that the TSV area aij in each bin should be discrete. As such, the
TSV areas mentioned in the theorems and corollaries proposed
below should be rounded.
Problem (P1) is non-linear in nature. Integrating nonlinear
optimization engines in a placement tool directly would be
impractical due to the high complexity.
Before we directly tackle (P1), we resort to a simpler version of
the problem: For a one-tier 3D IC(footnote 2) with a given power
distribution, what will be the optimal locations of TSVs so that the
temperature is minimized?
In this case, each TSV is directly connected to the PCB via bumps
(ambient air). As such, (P1) can be rewritten as
min TL = B0 + i =1 gTSV ai M i
n
(P2)
B = B0 + gTSV aij M i j
(3)
i =1 j =1
n K
(P1)
(6)
= Atot
i =1
ai 0 1 i n
ai* = Atot pi
n
i =1
pi
(7)
In the interest of space, we will only outline the proof for the
Theorem. From the fact that TSVs are the major vertical heat flow
channel ( gTSV ak bk ,l where bk ,l is the inter-tier conductance
without TSVs), we can get
p g
i
TSV
a jt j = gTSV aT T
(8)
gTSV T
(5)
s.t.
pi
i =1
a 1 = pi
i =1
a = p
i
i =1
Atot
(10)
i =1
where Atotj is the total area of the TSV connecting tier j and tier
j+1, and is determined once the floorplanning is done. The
infinity norm is defined as x = max{ x1 , x2 ,", xn }. The
objective function is obtained by simply substituting (3) into (1).
s.t.
= Atotj 1 j K
i =1
aij 0 1 i n,1 j K
M iK is the stamping matrix for the lumped TSV in the last tier
connecting to the PCB.
This case is where TSVs are only used to connect the IC and the
PCB.
ai = Atot pi
(13)
i =1
Note that in the above theorem, we neglected the fact that the total
TSV area in each area is discrete, that the dielectric layer is not an
ideal thermal insulator, and that the total TSV area allocated in
each bin cannot exceed the area of that bin. In reality, the optimal
condition needs to be tailored to fit into these constraints. We can
also easily derive a corollary based on this theorem.
Corollary 1. When the TSVs are placed proportional to the power
consumption in each bin, the temperature in each bin is identical,
i.e.,
n
k
i
(15)
i =1 k =1
(14)
(16)
i =1 k =1
To summarize this section, we would like to point out that all the
theorems and corollaries are based on the assumption that TSVs
are much more effective in conducting heat than the dielectric
layer. And accordingly, we have treated the dielectric layer as an
ideal heat insulator. In reality this is not correct, and thus the
theorems are only an approximation. However, our experimental
results show that they work pretty well.
4. THERMAL-AWARE 3D PLACEMENT
Our 3D placement flow is similar to the one in [14], but in this
section we mainly focus on the 3D placement step in the TSV coplacement flow. We assume the tier assignment of each cell is
given, either by manually partitioning or automatic partitioning.
An automatic partitioning method by 3D floorplanning will be
explained in Section 4.2. The 3D placement step is called after 3D
net splitting and TSV insertion.
Db ( x, y ) = Cb
HPWL(init )
COST (init)
(17)
j
p TSV
= (1) Pi k
i =1 k =1
j
i
(18)
i =1
Under this assignment, the total negative power of the TSVs in the
bin i, tier j is
j
p ij = p TSV
Nij
(19)
Therefore, the total TSV power and the lumped cell power in the
bin i, tier j is
j
k =1
k =1
j
Nij
Pi k + p ij = Pi k + p TSV
j
= Pi k + (1) Pi k N i j
k =1
j
i =1 k =1
n
j
i
(20)
i =1
i =1 k =1
n K
j
COST ( x, y ) = Pi k ( x, y ) + p ij ( x, y )
(21)
i =1 j =1 k =1
which is a sum of squares of the total TSV power and the lumped
cell power in each bin. This quadratic penalty method is an easyto-use, common method in engineering practice to satisfy the
equality constraints. Since the existence of a solution that satisfies
both the area density constraint and the TSV distribution
constraint is not easy to determine, we only penalize the COST
function by a finite number instead of pushing it to +.
5. EXPERIMENTAL RESULTS
We implement the algorithm in C++ and run on an Intel Xeon 2.0
GHz machine with Linux. The experiments are performed on
seven open-source IP cores in the IWLS 2005 benchmarks [21].
The circuits are summarized in Table 2, where the utility rate
(Util.) is the total cell area divided by the total chip area.
We synthesize the circuits with a standard cell library for the MIT
Lincoln Lab 130nm 3D SOI technology. The target 3D
technology is a 4-tier 3D IC, with TSV size 6 m 6 m and TSV
pitch 12 m 12 m. The 3D chip temperature is measured by the
compact model in [17], assuming that the height of the silicon
layer is 300 m on the bottom tier and 25 m on the other tiers.
The placement area is set as a square with 20% to 28% whitespace
in total, and the I/O pins are placed uniformly along the
boundaries in alphabetical order. The power dissipation of each
cell is generated as follows: The circuit is partitioned into eight
parts by hMetis. Each part is assigned a random number between
0 and 1 as a relative power number. These relative numbers are
scaled to power values such that the overall power density is on
the order of magnitude of 1 W/mm2, which is the projected power
density for the high-performance chips at the 14nm generation by
ITRS [20].
#Cell
#TSV
aes_core
wb_conmax
ethernet
des_perf
vga_lcd
netcard
leon3mp
20397
25883
49332
69494
82843
478502
509793
1362
2166
3782
3678
7356
9112
14742
Power
(W)
1.31
1.87
4.46
5.28
7.04
40.37
43.86
Cell Area
(mm2)
1.31
1.87
4.46
5.28
7.04
40.37
43.86
Util.
0.80
0.80
0.78
0.77
0.80
0.72
0.73
[2] Y.-L. Chuang, P.-W. Lee, and Y.-W. Chang, Voltage-drop aware
[3]
[4]
[5]
[6]
[7]
baseline
HPWL (m)
T (C)
RT (s)
HPWL (m)
T (C)
RT (s)
HPWL (m)
T (C)
RT (s)
HPWL (m)
T (C)
RT (s)
HPWL (m)
T (C)
RT (s)
HPWL (m)
T (C)
RT (s)
HPWL (m)
T (C)
RT (s)
HPWL
T
RT
1.43
108
206
2.34
130
214
3.77
124
490
4.24
173
689
5.94
108
772
37.17
461
5121
40.10
437
5440
1.00
1.00
1.00
uniform
power
1.58
103
180
2.42
124
289
3.95
113
395
4.61
158
639
6.26
112
815
39.31
415
4620
43.42
347
4846
1.07
0.92
0.97
postprocessing
1.54
105
208
2.46
119
220
4.08
85
506
4.83
112
702
6.62
80
854
40.67
288
5439
45.38
201
6480
1.10
0.72
1.06
coplacement
1.55
101
208
2.45
108
257
3.89
87
502
4.55
103
759
6.13
79
812
40.21
194
5693
43.05
160
5152
1.06
0.66
1.06
[8]
[9]
[10]
[11]
[12]
[13]
[14]
[15]
[16]
6. CONCLUSIONS
[17]
7. ACKNOWLEDGMENTS
[19]
8. REFERENCES
[1] K. Banerjee, Thermal effects in deep submicron VLSI interconnects,
IEEE Int. Symposium on Quality Electronic Design, 2000.
[18]
http://www.itrs.net
[21] http://www.iwls.org/iwls2005/benchmarks.html