Academia.eduAcademia.edu

The independence of miss ratio on page size

1976, Journal of the ACM

AI-generated Abstract

The paper discusses the independence of miss ratio from page size, particularly in the context of the transient-free or long-term miss ratio as opposed to initial loading misses. A key finding is a bound on the expected working-set miss ratio in an independent reference model, showing that the miss ratio is approximately independent of page size under certain conditions. Additionally, the impact of initial loading misses is acknowledged as significant.

The Independence of Miss Ratio on Page Size RONALD FAGIN AND MALCOLM C. EASTON I B M Thomas J . Watson Research Center, Yorktown Heiohts, New York AnsTRXCT A theoretical justification is given to the empirmal observation that in some computing systems with a paged, 2-level storage hierarchy, long-term miss ratio is roughly independent of page size Let M I S S be the expected working+set miss ratio in the independent reference model, with expected working set size C A P pages Now form blocks, by combining the B pages with the highest probabllitms of reference into one block, the B pages with the next-highest probabilities of reference into a second block, and so on Let M I S S * be the expected working-set miss ratio when all data are moved m blocks and when the expected working set size is again C A P pages, that is, C A P / B = C blocks. It ~s proved that I M I S S -- M I S S * I < (2/C) .-}- (33/C2). Thus, ff the expected working-set size (in blocks) is sufficiently large, then the miss ratios in the blocked and unblocked cases are approximately equal This result m used to argue the approximate independence of miss ratio on page size in more realistic models of page references. KEY WORDSAND PHRASES. page raze, working set, LRU, miss ratm, storage hierarchy, independent reference model cR c~'rEGo~.tms: 40, 4.3, 6.1, 6.20 1. Introductw~+ An important parameter in the design of a paged computing system is the page size, t h a t is, the number of bytes of information transferred from one level of a storage hierarchy to another in case of a page fault. Among the factors which influence the choice of page size are the page fault rate (or "miss ratio"), the fragmentation of memory, and the access and transfer times of secondary memory devices (see Gelenbe et al. [12] for a more detailed discussion). The research for this paper was sparked by an empirical observation of Bennett [3], who examined a page reference trace 1 from the I B M Advanced Administrative System (A.A.S.) [22], a large internal I B M data management system. Bennett found no consistent relationship between miss ratio and page size--for some main (first-level) memory sizes, the miss ratio was slightly laIger for the larger page size, and for other main memory sizes, slightly smaller. In all cases the size of main memory had a vastly greater effect on miss ratio than did page size, if the page size was sufficiently large (at least 1500 bytes). The cache multiprogramming trace of Kaplan and Winder [17] and the (main memory) program address traces of Lewis and Shedler [18] and of Anacker and Wang ([16], [2]) give similar results. In the examples cited, different but similar page replacement algorithms were employed, including the working-set memory management policy [5] and the closely-related L R U ("least recently used") memory management policy ([19], [1]). At first glance, some published data seem to contradict this insensitivity of miss ratio Copyright © 1976, Association for Computing Machinery, Inc General permission to republish, but not for profit, all or part of th2s material is granted provided that ACM's copyright notice is given and that reference is made to the pubhcation, to its date of issue, and to the fact that reprinting privileges were granted by permismon of the Association for Computing Machinery. Authors' present addresses R Fagin, IBM Research Laboratory, San Jose, CA 95193, M.C. Easton, IBM Thomas J. Watson Research Center, P O. Box 218, Yorktown Heights, NY 10598 1 All requests in the trace were to the data base--there were no index references or program references. Journal of the Assocmtlon for Computing Machinery, Vol 23, No 1, January 1976, pp 128-146 The Independence of Miss Ratio on Page Size 129 to page size. A dramatic example is found in Chu and Opderbeck's paper [4], in which LRU miss ratio seems to depend very heavily on page size. Their miss ratio curves asymptotically approach a value which is simply the number of initial loading misses (which is the same as the number of pages in the program, if main memory is large enough to hold the entire program) divided by the length of the observed page reference string. Of course, in general, there are approximately twice as many initial loading misses when the page size is half as large, and so the asymptotic value of their miss ratio curve is approximately twice as big when the page size is half as large. By contrast, in this paper we distinguish between initial loading misses on the one hand, and the "transientfree," or "long-term," miss ratio on the other hand (cf. [9]). In the case of LRU, "transient-free" means that the miss ratio is measured starting at a time after main memory has filled; in the working-set ease, "transient-free" means that the miss ratio is measured starting at a time greater than T, where T is the window size. In many cases, it is possible to anMyze the performance of a storage hierarchy more accurately by considering the effects of initial loading misses and of the transient-free miss ratio separately. From here on in this paper, "miss ratio" refers to the transient-free, or long-term, miss ratio The main result of this paper ~s a bound on the effect of page size on the expected working-set miss ratio, in the independent reference model (in whmh page ~ is referenced at time t with probability p,, independent of past history). Specifically, let M I S S be the expected working-set miss ratio in this model, where the window size is chosen so that the expected working-set size is CAP pages. Now form blocks, by combining the B pages with the highest probabilitms of reference into one block, the B pages with the next-highest probabilities of reference into a second block, and so on. Let M I S S * be the expected working-set miss ratio when all data are moved in blocks, and when the window size is chosen so that the expected working-set size is again CAP pages, that is, C A P / B = C blocks. It is proven that I M I S S -- M I S S * I < 2/C + 33/C ~. (1) Thus, if C is sufficiently large, then M I S S * ~ M I S S , that is, the expected miss ratios in the blocked and unblocked eases are approximately equal In Section 2 we will show that (1) implies the approximate independence of miss ratio on page size in certain more realistic models of page references than the independent reference model. This tells us that in some eases, transient-free miss ratio is not a key factor in the selection of page size. Of course, the effect of initial loading misses, which can be considered separately, is an important factor. 2. More Reahst~c Models In this section, we will show that our result about the insensitivity of miss ratio to page size in the independent reference model implies that this insensitivity holds in some more reahstic models of page reference patterns. For convenience in exposition, we will deal in this section with LRU miss ratio, rather than with working-set miss ratio. Indeed, we will show later that our main result (1) can be interpreted as saying that LRU miss ratio is insensitive to page size in the independent reference model, if the capacity (size of main memory) in, say, bytes, is held fixed (and if, as before, pages are blocked together in order of their probabilities). The independent reference model is, in general, inadequate. Various authors, including Lewis and Shedler [18], Denning, Savage, and Spirn [6], and Easton 17] have presented models of page reference patterns (or related quantities) in which page references are the result of two components, where the first is, roughly speaking, an "independent" or "random" component, and the second is a local component, such as a "locality of reference" or "sequential" component. We will first show that approximate independence of expected LRU miss ratio holds in one such model (Easton's), and then we will gem eralize the argument to cover other such models. 130 R. F A G I N A N D M.C. EASTON We brmfly describe Easton's model. If there are n pages, then there are n + 1 parameters, r, p t , • .. , p , , all between 0 and 1. Assume that page i was referenced at time t A t time t --t- 1, a coin is flipped, which comes up heads with probabihty r, and tails with probability 1 - r. If heads comes up, then page z is rereferenced, and we say t h a t the reference to page i (at time t ~ 1) was generated during "rereference" or "sequential" mode. If tails comes up, page3 is then referenced with probability p~, for 1 _< 3 <(including the case 3 = ~), and we say t h a t the reference to page j was generated during " r a n d o m " mode. Thus, if Q,j is the probability t h a t page 3 is referenced at time t ~ 1, given t h a t page ~ was referenced at time t, then j r + (1 - r ) p , , Q" = ( ( 1 - r)p~, J = z, j ~ Easton found [7] t h a t with appropriate choice of parameters, his model gives a good fit to the L R U miss ratio curve of A.A.S., which we referred to in the Introduction. Intuitively, this model "works" because if the page size is large enough, then "locality" can be approximately captured by rereferences to the same page. We will now show t h a t in this model, L R U miss ratio is insensitive to page size. Let S := R1 • - Rm be a finite-length page reference string generated b y this model. Thus, each R, is the name of a page. Assume that the string S is long enough that the effect of initial loading misses is neghgible. For each ~, let R~* be the name of the block which contains page R , . Then S* = R:* ... R,,* is the corresponding block reference string. We will show that the L R U miss ratio (where pages are the unit of transfer) over page reference string S and with capacity CAP pages is approximately the same as the L R U miss ratio (where blocks are now the unit of transfer) over block reference string S* and with capacity C A P / B blocks (i.e. CAP pages), when there are B pages per block. Let R~ , . . - , R,~ (where i~ < ~2 < " " ~ zk) be those page references which were generated after "tails" was flipped; thus, these are the page references which were generated during " r a n d o m " mode. So S~,d = R,~ .- R~ is the substring of S which conrains only the page references generated during random mode, and Srand * . .R~* . . R,k* is the corresponding substring of S*. Let z be the number of misses if we apply the L R U page replacement algorithm to string S~,.d, with capacity CAP pages. Each reference which was generated during sequential mode is automatically a " h i t " ; hence, the number of L R U misses (with capacity CAP pages) over page reference string S is also z. Now S~,,d can be looked at as a block reference string generated under the assumption of independent block references, where each block has reference probability equal to the sum of the reference probabilities of the pages which compose it. As we have said, our main result can be interpreted as saying that under the assumption of independent references, the L R U miss ratio is approximately the same in the blocked and unblocked cases. So z is also approximately the number of misses if we apply L R U block replacement to the block reference string S~,,d * with capacity C A P / B blocks. Again, there are exactly the same number of misses over the string S* as over S~,~d. So the miss ratios in the blocked and unblocked cases are approximately the same, as claimed. This result can be generalized to some other models with an "independent" component and a "locality of reference" component. The argument we will now outline will, of course, have to be tailored to fit each particular model. Assume t h a t the page size is large enough t h a t almost all "locality" references are hits. Then the miss ratio is essentially determined b y the independent component. So once again, the insensitivity of miss ratio to page size under the assumption of independent references implies this insensitivity under more realistic assumptions. We remark t h a t for models other t h a n the independent reference model, the difference in miss ratio between the blocked and unblocked cases will not necessarily be bounded b y the right-hand side of inequality (1). In our result about the independent reference model, we have assumed t h a t pages The Independence of Miss Ratio on Page Size 131 are blocked together in order of likelihood of reference. Of course, the real situation is much more complicated. However, we can justify this assumption on several grounds. 1. In the case of certain large-data base systems, such as A.A.S., groups of sequentiMly stored records tend to have similar access properties. So in forming blocks in the usual way of grouping together sequentially stored records, those pages which are blocked together have approximately the same probabilities of being referenced; t h a t is, pages are blocked together approximately in order of probability of reference. 2. Of all possible ways of blocking pages together with B pages to a block, Yue and Wong [23] proved that in various storage applications and under various criteria of optimality, the blocking we have considered (in which pages are blocked together in order of probability of reference) is optimal. Hence, the use of this particular blocking is a natural assumption to make when one is discussing the performance of a storage hierarchy. We remark that the first author has found a counterexample to thc conjecture that expected LRU miss ratio is minimized in the independent reference model by this blocking [10]; however, this blocking seems to be near-optimal in the independent reference model. 3. Formal ~Stateme~t of Mai~t Result Let {pl, " " , p,} be a probability distribution (that is, ~ p , = 1 and each p, _> 0). Assume t h a t at each discrete time t, page z is referenced with probability p,, independent of past history. (This is the independent reference model.) The expected workzngset ntiss ratw (wzth window size T) [5] is the probability that the page referenced at time t was not one of the pages referenced over the course of the previous T (not necessarily distinct) references. Under the independent, time-invariant assumption of the independent reference model, it is clear t h a t this expected working-set miss ratio is independent of t, for t > T. Let CAP be the expected working-set size, t h a t is, the expected number of distinct pages to appear over the course of T references. Define M I S S ( C A P ) to be the corresponding expected working-set miss ratio. Thus, M I S S ( C A P ) is the expected working-set miss ratio with window size T, where the expected working-set size with window size T is CAP pages. Later on, we will discuss the close relationship between M I S S ( C A P ) and the expected L R U miss ratio with capacity CAP pages. We will now describe the blocked case. Let B (the "block size") be a positive integer which, for convenience, we assume divides n. Assume t h a t p~ > p2 >_ ' - ' _> p, ~ 0, and let B u, = ~ p(,-~)B+~, 1 _< i < n/B. Thus, ul = Pl + "'" + P n , u2 = PB*~ + "'" + P:B, etc. This corresponds to combining the B pages with the highest probabilities of reference into a block, the B pages with the next-highest probabflitms of reference into a second block, and so on. The blocked case corresponds to the independent reference model with block probabilities { u ~ , - . . , u~/BI. Define M I S S * ( C A P ) to be the expected working-set miss ratio (over the probability distribution [ u l , - • - , u~j,I ), when all d a t a are moved in blocks, and when the window size is chosen so t h a t the expected working-set size is C A P / B blocks ( C A P / B blocks contain the same number of bytes as CAP pages, and this is the quantity we hold fixed in comparing the blocked and unblocked cases.)2 Let C = CAP~B, and write M I S S and M I S S * for M I S S ( C A P ) and MISS*(CAP). The main result of this paper is IMISS- M I S S * I < 2/C + 33/C 2. (2) 2 [t may well happen that there is no integer T* such that the expected working-set stze wLth window size T* is CAP/B blocks. If so, then we interpolate, as we will see 132 a . F A G I N AND M.C, E A S T O N We can think of 2/C as the first-order error term, and 33/C ~ as the second-order error term. We will actually prove a slightly stronger statement than (2). Note that statement (2) is a distribution-free result: that is, the error terms do not depend on the values of the p, (or even on n, the number of pages). 4. Detads Abaut Main Result We. begin this section by giving an expression [5] for M I S S ( C A P ) , the expected workingset miss ratio when the expected working-set size is C A P pages. The expected workingset size, that is, the expected number of distinct pages which will be referenced over the course of T references, is S ( T ) = ~,~1 (1 - (1 - p,)r), because the probability that page z is referenced is 1 - (1 - p,)r. The expected working-set miss ratio with window p,)T is the probability that page i size T is M ( T ) = ~ p , (1 - p,)r, because p,(1 is the next page referenced and that page z did not appear in the last T references. Thus, if S ( T ) = CAP, then M I S S ( C A P ) is by definition M ( T ) = M ( S - ~ ( C A P ) ) . Note that M ( S - ~ ( x ) ) is well-defined for each real number x between 0 and ~, even if the intermediate parameter T = S-~(x) is not an integer. By this procedure, which amounts to an interpolation, we can define M I S S ( x ) = M ( S - I ( x ) ) for each x with 0 < x < , Similarly, in the blocked case, we define M I S S * ( x ) = M*(S*-~(x/B) ), for 0 < x < n, where - - n/B S*(T) = ~ (1 - (1 - u,)~), 0_< T < ~, n/B M*(T) = ~ u,(1 -- u,) ~, 0 _< T < ~. Thus, ff the expected working-set size is C A P pages (i.e. C A P / B blocks), then M I S S * ( C A P ) is the expected working-set miss ratio in the blocked case. We will now briefly discuss the relationship between M I S S ( C A P ) on the one hand, and the expected L R U miss ratio M R ( C A P ) with capacity C A P on the other hand. Denning and Schwartz [5] make the intuitive observation that M I S S ( C A P ) ~ M R ( C A P ) . In w~rious simulations of the independent reference model, we found that M I S S differs from M R by around .01, when the number n of pages is several hundred. Indeed, the first author has recently proven [11] that in a certain precise sense, M I S S converges asymptotically to M R as the number n of pages gets large, m the independent reference model. So (2) imphes that in the independent reference model, the expected LRU miss ratio is approximately independent of page size, if the size of main memory is held fixed aad if pages are blocked together in order of their probabilities. (Of course, we are assuming that C is large enough that the right-hand side of inequality (2) is small, and that n is large enough that M I S S ( C A P ) ~ M R ( C A P ) . ) 5. Proof of Main Result We will prove the following theorem. THEOREM. Let M I S S be the expected working-set muss ratw in the independent reference model, with expected working-set szze C A P pages. Let M I S S * be the expected workzng-set ~zss ratw after blocking, where B old pages form each new block, where pages are blocked together ~n order of their reference probab~ht~es, and where the expected working-set size zs again C A P pages (z.e. C = C A P / B blocks). Assume that the original number of pages zs dwzsible by B. Then I M I S S - M I S S * I < 2/C + 33/C 2. PROOF. The theorem is trivial if C A P < 2B or C A P > n; hence, we will assume that 2B < C A P < n. We will actually prove - 1 . 9 2 ( B - 1)~CAP - 6 ( B / C A P ) 2 < M I S S - M I S S * < 1.01(B - 1)~CAP + 3 3 ( B / C A P ) 2 (3) The Independence of M~ss Ratw on Page Size 133 Of course, (3) lmphes { M I S S - M I S S * < 2 B / C A P + 3 3 ( B / C A P ) ~, which is the result of the theorem. There will be six main steps in the proof. Step0. S ( T ) < T, Stcpl. 0 < ~,~1(1- Step 2. 0 < B ~$!~(1 if T ~_ 28. Step 3. - . 7 3 6 ( B - 1 ) / T < ~ = 1 p,(1 - p,)r _ ~ = / ~ u,(1 - u , / 8 ) r < .736(B - 1)IT, if T k 0 . Step 4. 0 _< ~ = n[B 1 u,(1 - u , / 8 ) r - ~ / ~ if T > 2 8 . Step 5. H(T)/H(t) if T >_ 1 - u,/B) r < B- p,)7 _ B ~ = / ~ ( 1 ~n/B - u , / B ) r - B z.~=l (I < S(T)/S(t), if __ U z ) F / B 1, if T > 0. 1) + 6B2/T, < .184(B- u,(1 - u,) 7'/8 < .271(8 - 1 ) / T + 33B2/T 2, T > t > 1, where H(x) = 1 - M(x). Step 0 folh)ws immediately from the development m [5], provided T is an integer (which will not ahvays be the case for us--hence, we must prove It directly). Step 5 says that if the expected working-set size is increased, then the proportional increase in expected working-set hit ratio is bounded by the proportional increase in expected working-set size. Thus, if the expected working-set size is doubled, then the expected hit ratio is at most doubled. We will now show that these six steps imply statement (3). Instead of using the fuactions S* and 111" of Section 2, it will be convenient to define closely related functions S . and M A , as follows. n/B S~(T) = B ~(1 n/B MB(T) = ~u,(1 - (1 - us)T/B), - u J T/B. It is easy to see that M B ( S ~ ( C A P ) ) = M*(S*-~(CAP/B)) = MISS*(CAP), expected working-set miss ratio under blocking. T w o other functions we will find convenient to use are given by H ( T ) = 1 -- M ( T ) , the HB(T) = 1 - MB(T). I t is easy to see t h a t S and SB are each monotone, and each map onto the half-closed, half-open interval [0, n). So we can find Tt and TB such that S ( T O = C A P = S R ( T , ) . Now T1 >_ 1, since S(1) = 1 < C A P = S(T1), and since S is monotone increasing. So we can apply step 0, to obtain T~ > S(T1) = C A P > 2B. Hence T, > 2B. (4) TI B Now S ( T ) = n - ~ = ~ ( 1 - p,)r, and S , ( T ) = n - B ~ ! ~ ( 1 -- u~) . So if we add together the inequalities of steps 1 and 2, with T, substituted for T, we find t h a t 0 < SB(T,) -- S(T~) < Q, (5) where Q = 1.184 (B - 1) + 6 (B~/T,). (6) Since S(T1) = CAP, statement (5) says 0 < SB(T,) - C A P < Q. (7) So So(T1) > C A P = S s ( T B ) . Since S~ is monotone increasing, Tx > T~. (8) The functions H s and S B / B have the same form as H and S, with u, substituted for p , , with n / B substituted for n, and with T / B substituted for T. So it follows from step 5 t h a t if ( T / B ) > ( t / B ) > 1, that is, if T > t > B, then H~(T)/HB(t) < (S~(T)/B)/(Sa(t)/B) = SB(T)/SB(t). (9) 134 R. F A G I N A N D M.C. EASTON Now T~ > T , ~ B : f i r s t , Ti ~" T B b y (8), and S , ( T , ) = C A P > B = S , ( B ) , s o since S , is monotone increasing, this implies that T , > B. Therefore, we can substitute Ti for T and T , for I in (9) ; then H,(T1)/H,(T~) < S~(Ti)/S,(T,) = S,(T~)/CAP. So HB(T~) _~ = _< < H~(TB)(S,(T~)/CAP) H ~ ( T , ) ( 1 --[- ((SB(T~) -- C A P ) / C A P ) ) g s ( T ~ ) ( 1 + ( Q / C A P ) ) b y (7) H~(T~) -k ( Q / C A P ) , since H ~ ( T ~ ) _~ 1 We have lust shown that H~(T~) < H , ( T ~ ) + ( Q / C A P ) . (lO) It is easy to see t h a t HB is monotone increasing. So, from (8), II~(T~) _> H ~ ( T , ) . (11) From (10) and (11), it ~mmednttely follows t h a t 0 < H , ( T i ) -- H , ( T ~ ) ~ Q/CAP, and hence -Q/CAP ~ Ms(T1) -- M~(TB) ~ O. (12) Adding together the inequalities of steps 3 and 4, with T~ substituted for T (which is all right b y (4)), we find t h a t --.736 (B - 1)/T1 < M(T1) - MB(T1) < 1.01 (B - 1)/T~ W 33 (B~/Ti'). (13) Since Ti > C A P b y step 0, statement (13) implies - . 7 3 6 (B - 1 ) / C A P < M(T1) - MB(T1) < 1.01 (B - 1 ) l e A P + 33 ( B / C A P ) ~. (14) Also, T1 ~_ C A P imphes (from statement (6)) t h a t Q _~ 1.184 (B - 1) + 6 (B~/CAP). (15) If we add together the inequalities in statements (12) and (14), and substitute for Q the right-hand side of (15), we get --1.92 (B - I ) / C A P - 6 ( B / C A P ) 2 < M ( T i ) - MB(TB) < 1.Ol ( B - 1 ) ~ C A P + 33 ( B / C A P ) ~. Since as we observed, M(T1) = M I S S ( C A P ) and MB(TB) = M I S S * ( C A P ) , this gives us statement (3), as desired. I t remains to prove steps 0-5. 6. Prehm~narzes One of our basic techniques will be the use of Schur functions ([21]; see also [20]). Assume for convenience throughout t h a t all functions considered are infinitely differentiable Definitwn. Assume t h a t u = ( a l , . - . , a,~), where al >_ a~ ~_ . . - ~_ am, and t h a t = (31, ' " , 3~), where fll >_ g2 ~ "-" ~_ tim- We say ~ > ~ if ~ - 1 c ~ . >_ ~ - l f l , , 1 < k < re, and ~ - 1 ~ , = ~ - 1 3 ~ . Definztion. A real-valued function f of m real variables x l , • • • , x~ is a Schur ]u~ctzon if for each pair i ~ 3, (x, - x,) (O/Ox,)f - (O/Ox,)f ~ O. THEOaEM (Schur). Let f be defined for x, > . . . > x,~. Then f ( a , , . . . , am) _> f(31, "'" , 3m) whenever ¢l > ~, ~ff f is a Schur functzon. Speczal Case 1 [13, p. 89]. Let ~ be a real convex function of one real variable, t h a t is. ~b((x "-b y ) / 2 ) ~ ½(4~(x) --t-~(y)). (16) The Independence of M i s s R a t w on Page Szze T h e n the function (x~, .. 135 , Xm) ---~ ~ 1 ¢ ( X , ) is a Schur function. Hence, if a )- ~, S p e c m l Case 2. Let ~ be continuous a n d concave, t h a t is, the inequality sign in (16) is reversed. T h e n - ~ is convex. Hence, ff a > ~, t h e n ~ - ~ ¢ ( a , ) < ~,2.~ ¢(fl,). An infinite sum ~ : ~ a~ of real n u m b e r s a, is alternating if 1. a, ~ 0 ~ f f a , + l ~ 0, 1 _~ ~ < ~¢. ~. la, I _> la~÷ll, 1 < ~ < ~. 3. a , - - ~ 0 as t--~. ~ . A n alternating s u m is always convergent, and its value lies between a n y two successive partial sums ( L e i b n i t z ' s test [15, p. 68]). 7. P r o o f of Step 0 We will prove t h a t S ( T ) ~ T, if T > 1. Let ~b be the functmn x --~ 1 - (1 - x) v, with d o m a i n the closed interval [0, 1]. I t is easy to see t h a t ¢ is concave. Clearly (pl , " " • , p~) ~ ( 1 / n , . • , l / n ) . So b y specml case 2 of Section 6, ± S(T) o = ¢(p,) < ~ O ( 1 / n ) Hence, we need only show t h a t n ¢ ( 1 / n ) (1 - = ,~¢(1/n). ~ T. This is equivalent to l/n)7 ~_ 1 - T/,. (17) If T ~ ~, then the right-hand side of (17) is negative, a n d (17) follows lmmedmtely. If T ~ n, then the binomial expansion of (1 - 1 / n ) r is an alternating sum, and (17) again follows. 8. P r o o f of Step 1 We will show t h a t n/B 0 < (l/B) ~ (1 - p , ) r _ ~ (1 - u J B ) r < ( B - 1)/B, (18) i f T ~ 0. Let ¢ he the function x ~ (I - x) ~, with domain the closed interval [0, I]. It is easy to see that ¢ is convex and monotone decreasing with range [0, 1]. Since 4~ is convex, it follows [13, p. 72] that ¢ ( (xl zc • • q- x B ) / B ) If we substitute xl = p(z-1)B+l , J'2 < (q~(xl) + • • -k ¢ ( x , ) ) / B . = p(~--I)B+2 , "'" , X. = (19) PuB into (19), for 1 < ~ < n / B , we find t h a t (1 u,/B) r ~ -- (l/B)((1 -- p(,_~)B+,) r + . . . + (1 -- p , , ) r ) . (20) Adding together the inequalities (20), for 1 < ~ < n / B , we obtain n/~ (1 -- u , / B ) r ~ ( l / B ) ~I (1 -- p,)V, ~I which proves the first inequality in (18). If 1 ~ x~ > . . . ~ xB > 0, t h e n since 4~ is m o n o t o n e decreasing, 4~(xl) -~ " . . -b -- ~ ( x~ zc " " " + < (B - 1)~b(x,) + 4~(x,) _ 4~(x,) - B B B 1 (4~(x~) - 4~(x,)). (21) 136 a , FAGIN AND M.C. EASTON Substituting x~ = p<,-x)~+~, x~ = p(,-~)B+2 , • "" , x~ = p,s into (21), for 1 _< i _< n / B , we obtain (1/B)(ch(p<,-~)s+~) + . . + O(P,B)) -4~(u,/B) _~ ( ( B - 1 ) / B ) ( o ( p ~ a ) -4~(P(~-a)B+0). (22) Now ¢'(p(,--1)B+I) _> ¢(P<,--1)B), SO we obtain from (22) (1/B)(~k(p(,-~)s+1) + " " + ¢#(p,.) ) --¢(u,/B) _< ( ( B - l)/B)(¢(p,,) --¢(p(,_~),)). (23) Adding together inequality (22) for ~ = 1 to inequalities (23) for 1 < z < 7~/B, the right-hand side telescopes to give (lIB) ~+(p,) - ~+(u,IB) < ( ( B -- 1)/B)(q~(p.) - ¢ ~ ( p x ) ) _< ( B - I)/B. (24) 1) + 6(Be~T), (25) This is the right-hand inequality of (18). 9. Proof of Step 2 We will show that n/B 0 < B ~(1 n/IB - u,/B) r- B ~(1 - u,) r/~ < . 1 8 4 ( B - if T > 2B. We will first demonstrate the tirst inequality. We need only show t h a t for each ~, (I - u , / B ) ~ > (1 - u,) T:~. (26) This is equivalent to showing t h a t (1 - u , / B ) B > (1 - u,). (27) I t is straightforward to check that the binomial expansion of the left-hand side of (27) is an alternating sum. So (27) follows We will now prove the second inequMity of (25). For each u, 0 _< u < 1, and each nonnegative number 0, define ao(u) = (1 - u / B ) zs°, bo(u) = (1 - u ) °, co(u) = ae - bo. (28) If ½ < u < 1, then e g u ) _< ao(u) _< (1 - 1/2B) Bo. In particular, if 0 = T / B , then er/n(u) _< (1 - 1/2B) r. (29) Assume from here on t h a t 0 < u < ½. We will write e0 for to(u), etc Let x and y be nonnegative real numbers. Then inequality (26), with B x substituted for T and u substituted fol u , , gives a~ > b,. Hence a~ + b~ < 2a~. (30) Multiplying together inequality (30) and the equality a~ -- b~ = ~, we obtain a~ay + %b~ - a~bu -- b~b~ < 2a=%. (31) Clearly, a=a~ = a=+~, b=b, = b=+~. (32) Substituting into (31) using (32), and replacing a,+~ - b=+u b y ~,+u and rearranging terms, we obtain ~=+u --< 2 a ~ + a~bu -- aub~. (33) T h e I n d e p e n d e n c e of M , s s R a t i o on P a g e S , z e 137 S u b s t i t u t i n g a. -- ~. for b. a n d a~ -- e~ for b~ in (33), we find ~+~ _< ~.a~ + ~a~. (34) We now claim t h a t if 3 is a positive integer, then % < j~,a~ -~ (35) This is obvious if :/ = 1. Assume inductively t h a t it is true for.7 = N T h e n from (34), eN+~ < e~a~ -4- ,~a~ < Ne~a~ ~ -4- ~ a ~ b y inductive assumption = N ~ a ~ ~v -4- e l a l N = (N -4- 1)e,a~~'. Hence (35) holds, b y induction, for every positive integer 3. Write T / B = j -4- r, j a positive integer and 0 < r < 1. Then ~T/B _< eja, + e~aj by _< .l~la~- lat -4- ~,aj b y (35) = 3*~a~+~-~ -4- ~a~ J since ar = a~~ and since al < 1 and 3e~a~ +r-I + era~+r-t (34) a~ = a~~ r < 1 (36) < ( T / B ) ~ l a ~ T / ' ) - I - - t- era~ T/B)-I = (T/B)e~(1 How big is E~? As sum. Hence we -- u / B ) T-B -4-4-e.(1 - u / B ) T-È observed, the binomial expansion of (1 - u / B ) B is a n alternating (1 - u / B ) A < 1 - u -4- ( B -- 1 ) u ~ / 2 B . (37) So er = a t - - b, = (1 -- u / B ) Br -- (1 -- u ) ~ _< (1 -- u + ( B - - 1 ) u 2 / 2 B ) r - (1 - u) r b y (37) = (1 - u ) ' [ ( 1 + ( B -- 1 ) u ~ / 2 B ( 1 - u ) ) ~ -- 1]. (38) Let z = ( B -- 1 ) u ~ / 2 B ( 1 -- u ) . Since 0 < u < ½, it is easy to see t h a t 0 < z < 1. Now the binomial expansion of (1 -4- z) r - 1 is an alternating s u m i f 0 < z < 1 a n d 0 < r < 1 ( t h a t is, there is an alternating s u m if we consider only the second, third, . . . terms of the binomiM expansion of (1 -4- z)~). P u t t i n g this together with (38), we o b t a i n u ) ' [ r ( B -- 1 ) u 2 / 2 B ( 1 - u)] = r ( B 1 ) u ~ / 2 " B since (1 - u ) > ½ < .54 ( ( B -- 1 ) u 2 / B ) , er _< (1 - 1)u~/2B(1 - u ) ~-~ _< r ( B - since we find b y elementary calculus t h a t the m a x i m u m of r / 2 ~, 0 < r ~ log 2) < .54. So er _< .54(B - 1 ) u 2 / B < .54 u ~. 1, is 1/(e (39) How big is ~1 ? F r o m (37), we find immediately t h a t ~ _< ( B - 1)u2/2B. (40) If we substitute into the last line of (36) using (39) a n d (40), we o b t a i n ~r/B _< u2(1 - u / B ) T - n ( ( T ( B -- 1 ) / 2 B ~) -4- .54). We are interested in obtaining an upper b o u n d for B ~ n a m e l y u~, can be greater t h a n ½. So from (29) a n d (41), (41) ~T/B(U,). A t most one u , , n/B B ~ ~r/a(u,) < B(1 - 1/2B) r n/B + ((T(B -- 1 ) / 2 B ) -4- .54B) ~ u , 2 ( 1 -- u , / B ) r-~. (42) 138 R. F A G I N A N D M.C. EASTON 0 PW Pmox I~'e B Fro. 1 How big can z..,~-I u~ (1 - us~B) ~-" be? Let ~ be the function x --, J:2(1 - x / B ) r-~, with domain [0, B]. By using elementary calculus, we find that ~ has its maximum at Pm~, = 2 B / ( T + 2 - B ) , and two points of inflection; the first such point of inflection, P*, lies strictly between 0 and P . . . . and the second point of inflection, P**, is bigger than P . . . . The second derivative of ¢ is positive between 0 and P*; negative, between P* and P**; and positive, between P** and B. So the shape of ¢ is roughly as in Figure I. Let VEC1 = {(xl, - . . , xm) :m a positive integer, ~,"21 x~ < 1, and 1 ~ xz ~_ x2 >_ • • > x ~ > 0}. Note t h a t the length of a tuple in VEC1 is arbitrary (but finite). I t is not important t h a t the x~'s are in descending order, b u t it will make the exposition simpler. Say we can find an upper bound M for { ~ ~#(x~) : x E VECz}, where x is an abbreviation ,K.-,~/s for ( x l , . ' . , x~). Then z_,,-1 u~2(1 - u , / B ) r-~ _~ M. Let VEC2 = { x : ~ x ~ < I, and Pm~ >__ xl ~_ x2 ~_ . . - > x , ~ 0}. Then s u p ~ 4 ~ ( x ~ ) :x E VEC~} = s u p l ~ ( x ~ ) : x E VEC2}, since if x E VEC~ and y, = r a i n ( x , , Pm~) for each i, then y E V E C 2 . Let v be a tuple in VEC2; then v can be written as the concatenation ~ of a tuple y (with all entries between P* and P ~ ) and a tuple z (with all entries between 0 and P*); if any entry of x is exactly P*, we place that entry in z. Assume that ~ z , = a. Let 1 be the unique nonnegative integer and 6 the unique real number, 0 < 6 < P*, such t h a t a = lP* + ~. Let a be the vector (P*, • • , P*, 6, 0 , . - . , 0) with length the same as that of z, and with 1 occurrences of P*, oDe occurrence of ~, and the rest O's. Clearly, a > z. Now ~b"(x) > 0 for each x in [0, P*]; hence [13, p. 76] ~ is convex in [0, P*]. Applying special case 1 of Section 5, we find t h a t ~ - ~ ~b(a~) > ~ _ ~ ~b(z,). We have shown that if v = y~"~is an arbitrary tuple in VEC~, then there is a tuple w = y (P*, , P*, ~) such t h a t ~ b ( v ~ ) < ~ b ( w , ) . VECa = {x : ~ In other words: let x, < 1, Pm,~ ~ zz >_ x2 ~_ "'" >_ x,,-1 >_ P*, P ~ x >_ xm >_ 0, where m is the length of x}. Then sup l ~ b ( x , ) : x ~ VEC3} = sup{ ~ ~b(x,) : x 5 VEC~}. Now let v be an arbitrary tuple in V E C s . Write v = y~'~, where each entry of y lies between P* and Pm~x, and where z is a tuple of length 1, whose entry lies between 0 and P ~ . Assume t h a t y is of length k, and t h a t ~ y~ = a. Let a be the tuple ( a / k , • • • , a / k ) of length k. Then y >- ~. Now ~ (x) ~ 0 for each x in [P*, Pmax[; hence, 4~ is concave in [P*, P ~ ] . Applying special case 2 of Section 5, we find t h a t ~ 4~(Y,) _~ ~b(a,). In other words: let VEC4 = Ix : ~ x, < 1, xz . . . . . xm-~ , where nm is the length of x}. Then s u p { ~ q , ( x , ) : x ~ VEC~} = s u p { ~ ¢ ( x , ) : x ~ VEC~}. Let v = {¢, ¢, • • • , ~, ~} be an arbitrary tuple in V E C 4 . Then ~(v,) _< ( 1 / ~ ) l ~ ( ~ ) l + ~(~) -< ( ~ ( ~ ) / ~ ) + _< ~(~) (max ~(x)/x) + (m~ ~(~)). (~3) O~x~l O~z~l The Independence of M i s s R a t w on Page Size 130 Now ¢ ( x ) / x = x(1 - x / B ) r-" Let k be arbitrary for the moment, and let ~ be the function x --, xk(1 - x / B ) T-B. We are interested in finding max0~<l ~b for k = 1 and k = 2 (and, in proving step 4, we will be interested in the case k = 3). By elementary calculus, we find that ~b has its maximum at k B / ( T + k - B), with maximum value (kB/(T + k - B ) ) k (1 - k/(T + k -- B ) ) r-B. (44) We will now estimate (1 - k / ( T + k - B ) ) 7-B. I t i s well known that (1 - l / x ) ~ ~ 1/e as x -~ o~ Also, the function x --> (1 - 1 / x ) ' is a monotone, strictly increasing function of x for x > 1, fol, the derivative of this function, evaluated at z > 1, is easily found to be = 1-- 1 . . . . . . . . . . 2X2 3x~ 4x ~ by using the Taylor expansion of log(1 - l/x) >0 So 1>(1 e- x+l = 1 1) 1 x+l = x~-I 1 -- 1 x+l )2 x x+l' (45) If we multiply (45) through by (x + 1)/x, weflnd that (1 1 x+l )" lx+l <---e x (46) So 1 T-t-k-- B = < 1 --x+l " ' 7 wherex = ( T - B)/k by (46) So the maximum value of ~b, which we found to be expression (44), is bounded by (from(47) ) (T+k B - So from (43), < B 4B ~ + e ( T -- B) e2(T -- B ) 2" Tracing back what we have demonstrated, this means that ~/B (1-- u~_}\r-" _< B + 4B ~ (49) We will now estimate the subexpression (1 - 1/2B) T of (42). It will be convenient for work later on to find a more general estimate than we need now. We will show that i f s > l a n d t > 0, then (1 -- l / s ) t < m!s'~/t '~ for each integer m ~ 0. (50) First, (1 - - l / s ) t = (1 - - l / s ) "('/') < e-'/', (51) 140 R. FAGIN AND M.C. EASTON since as we showed, (I -- I / x ) ~ P I/e. Now if z > 0, then e" > z'~/m! for each integer nn > 0, since z'~/m ! is one term of the Taylor expansion of e~. Hence e-~ < m !/z m. Applying this to the right-hand term of (51), (1 -- 1/s) t < e-~/" < m! s~/t '~, as desired. Hence (1 - 1/2B) r < 2 B / T , (52) where we let n~ = 1 in (50). Substituting into (42) using (49) and (52), we obtain ~/" 2B 2 B~,_a,TjB(u,) < - T - /T(B-+ \ 2B 1) ~( B .... 4 i f , -t- . 5 4 B ] \ e ( T B) + e ~ ( T B) 2J .54ff 2.16ff - 2B ~ + (B - 1)T + + 2 B T ( B - 1) + T 2e(T -- B) e(T -- B) e2(T - B) ~ e~(T - B) 2 2B 2 B - 1 T .54B B -< -'T- + 2--'~-- T - B + e T - B 2B T B 2.16B B B + e-'TT-BT-B -b e~ T - B T - B " (53) We will find simple upper bounds for T / ( T - B) and B / ( T - B), given that T > 2B. T- B- 'r 1+ T -1+ T -- B -< T < 1, sinceB/(T-B) B <l+2B T----~--~ _ T by (54). (54) (55) Substituting into (53) using (54) and (55), we obtain "/" B ' ~ ~T/~(u,) < 2B ~ + B-l( < B - ] 2--~--+-~ 1+ 2+~ _~_) .54B 2B + --- ] + e + ~ + ~ ~ + e~ ] , (56) where we have expanded out and replaced the term B__y__ ~ 1. ~B -by B 2 If we replace BaIT 2 on the right-hand side of (56) by B~/T, and numerically evaluate, we find n]B B ~ ~T/B(U,) < .lS4 (B -- 1) + 5.56 ( B 2 / T ) , which implies the second inequality of (25). 10. Proof of Step 3 We will show that i p,(1 - p , ) r _ ~,,l~ u,(1 - u , / B ) T < .736(B - 1 ) / T , (57) if T > 0. Let ¢ be the function x -+ x(l - x) r, with domain [0, 1]. We find by elementary calculus that ~khas its maximum at p = 1 / ( T + 1), with maximum value 1(, M A X = +------i T 1 < ~ by (46). T + 1 (58) The Independence of Miss Ratzo on Page Size 141 Further, ~ is monotone increasing between 0 and t~, and monotone decreasing between and 1. If 1 > Xl >_ . . . > x~ ~ ~, then since ~ is monotone decreasing between /, and 1, we find as in (21) of the proof of step l that ((¢(xl) + ... + ~(~,) )/B) - ¢~( ( ~ + . . . + x , ) / B ) _< ( ( B -- 1 ) / B ) (~(xB) -- ~b(xl)). (59) And, (~b(xl "-k "" -Jr x , ) / B ) -- (~k(xl) + " " -b ¢ z ( x , ) ) / B _< ¢~(~.~) - ( ( B - 1)¢~(zi) + ~ ( x , ) ) / B = ((B - 1)/B)(~b(x~) -- ~b(xl)). (60) Putting together (59) and (60), we obtain I (~(xl) + "'" + ~(x~))/B - ~((xl + .. + x ~ ) / B ) I < ((B - 1)/B)(¢(xB) -- ~ ( x , ) ) . (61) Let k be the maximal integer such that pks ~ u. T h e n l >_ px > p: >_ • • > pk, >_ u. By using (61) in an analogous way to our use of (21) in the proof of step 1, we obtain, as in (24) of step 1, that i kB (l/B) k ~b(p,) - B ,~l~(u'/B) <_ ( ( B - 1)/B)(~b(p~8) - ~b(p,)) _< ( ( B - 1)/B)~b(pkB). (62) We know t h a t p(~+~)e < u. There are now two cases to consider. Case 1. PkB+I__< tz- Assume t h a t ~ > xl _> x2 _> . . - _~ xs ~ 0. Since ~/, is monotone increasing between 0 and u, we find, by a similar argument to that used to prove (61) and (62), t h a t ] (~b(xl) + . . . + ~ , ( x B ) ) / B -- ~k((xx + .. + XB)/B) I _< ( ( B 1)/B)(¢(x~) (1/B) ~ ~,(p,) - B ~2 ~ ( u , / B ) < ((B - - ~b(x,)). 1)/B)~,(pk~+~). (63) (64) Hence, from (62) and (64), (l/B) ~,(p~) -- B ~ ~,(u,/B) _< ( ( B -- 1)/B)(g,(p~B) + ~(P~.+i)) (2(B -- 1)/B)MA_X < 2(B - 1 ) / e B T b y (58). Since 2/e < .736, this gives us (57). Case 2. p,~+l > u. We ~411 show t h a t [ (&(pEn+,) + . . . + ~b(p(~+,)~))/B -- ~b (u~+ffB)[ _< ( ( B -- 1 ) / B ) ( M A X -- ~ (p,8+~)) + ((B- 1)/B)(MAX - ~k(p(~+~)a)). (65) We will first show t h a t this is sufficient to prove (57). As in the proof of (64) of case 1, n (l/B) ~ n/B ~b(p,) - B ~ ~b(u,/B) ~_ ( ( B -- 1)/B)~b(p(~+~)n+~). (66) Since p~,+~ > ~, and since ~b is monotone decreasing between u and 1, it follows t h a t ¢,(pk~) _~ ~h(pka+l). So from (62), (l/B) ~ ~(u,/B) ,-, ~k(p,) - B ~ ,-1 < ( ( B -- 1)/B)~k(p~+~). (67) 142 lt. F A G I N A N D M.C. EASTON Since p(k+~)B < V, and since 6 is monotone increasing between 0 and v, it follows that ~b(p,k+l)B) >_ .¢(p(k+l)B+l). So (65) gives (1/B) I ¢dp~+l) + ... + ¢(p~+~) - B ¢,(u~+,/B) I < ((B -- 1 ) / B ) ( M A X - q~(pkB+~)) + ((B -- 1 ) / B ) ( M A X - ¢,(p(~+~)B+i). (68) If we add together (66), (67), and (68), and use the triangle inequality, we obtain (I/B) ~(p,) - B ~¢(u,/B) _< (2(B - 1 ) / B ) MAX, and as in the conclusion of ease 1, this gives us (57). It remains to prove (65). Assume that 1 > xl >_ . - . ~_ xB > 0. To prove (65), we must show ] ((iV(x,) + • • + ~k(x,))/BI - ~k((xl + < ((B - ... + xB)/B) I 1)/B)(MAX -~b(x,)) + ( ( B -- 1 ) / B ) ( M A X - ¢ ( x . ) ) . (69) Assume that ~ ( x , ) ~_ .~(x~). The proof ~s sinular if ~b(x~) > ~k(x~). Since ~ ( x . ) > ~b(x~), clearly ~b(x,) > ~(Xl), ~ = 1, . . . , B (~(x~) + .. + ~ ( z ~ ) ) / B - ~((Xl + ... + xD/B) < ((B - 1) M A X + ~ ( x , ) ) / B - ¢ (x,) = ((B - 1 ) / B ) ( M A X - ~(x,)) _< ((B - 1 ) / B ) (MAX -- ~ (x,)) + ( ( B -- I ) / B ) ( M A X - ¢(xB)). (70) And, ~ ( ( x l + . . . + x ~ ) / B ) - (~(xl) + . . . + ~b(x~))/B _< MAX - ((B -- 1)~(xl) + qJ(xs))/B < M A X -- ( ( B - 1)~(x~) + ~b(xB))/B + ((B - 2 ) / B ) ( M A X - ~ ( x s ) ) = ( ( B -- 1 ) / B ) ( M A X -- ¢(x~)) + ( ( B -- 1 ) / B ) ( M A X - ¢dxa)). (71) Putting together (70) and (71), we obtain (69). Remark The result of step 3 can be improved (reducing the right-hand side of (57) by less than a factor of 2), by taking advantage of more properties of ~ than that it is monotone increasing and then monotone decreasing. 11. Proof of Step 4 We will prove n/B 0 < ~ tml u , ( 1 -- u , / B ) r - ~B (1 -- u,) T'B < . 2 7 1 ( B - - 1 ) / T + 33 ( B / T ) 2 (72) tml if T > 2B. Statement (26) of the proof of step 2 says (1 -- u , / B ) r > (1 - u,) r/B. Hence u , (1 - u J B ) ~ > u , (1 - u,) T/, , and the first mequMlty • of (72)-- follows. If we adopt the notation of the proof of step 2, then we are concerned with finding .,,-1 u,e~/B(u,). As before, if ½ < u~ < 1, then i = 1 and u,er/s(u,) _< (1 -- 1/2B) T < 8 ( B / T ) 2, by (50) with m = 2. (73) If 0 < u, < ½, then from (41) of step 2, u,~r/~(u,) < u, s (1 -- u , / B ) r-B ( ( T ( B -- 1)/2B 2) "4- .54). (74) The Independence of Miss Ratio on Page Size 143 So from (73) and 74), n/B n]B u:r/~(u,) < 8 ( R / T ) 2 -q- ~ ~--1 u,3(1 -- u~/B) r-B ( ( T ( B -- 1)/2B 2) + .54). (75) z--I By exactly the same method as in step 2, where we now let ~ be the function x 13(1 - x / B ) T-', we find that ~ u , 3 ( 1 - u , / B ) r-" _< (max ( ¢ ( x ) / x ) ) + (max ~(x)) 0_~<i _< 4B2/e2(T - (76) o_o_<I B) 2 -F 27B2/e3(T - B) 3 , since (48) gives the maxima. If we substitute into (75) using (76), we obtain "/'u,~,/,(u,)< 8 (~)~ "F (--4B2 o-, \e2(T -- B)2 "F 27B3 "~(T(B - I) -'I-.54) e3(T - B) 3] \ 2B 2 B 2.16( +-~~ B )' 2 T 14.58( +--~~ B )' (77) where we expanded out and made one substitution of B for B - 1. If we now use (54) and (55) to substitute 2 B / T for B / ( T -- B) and (1 + 2 B / T ) for T / ( T - B) in (77), we obtain ./~ 2 2(B- 1) 1-t- -I-~ 1-t- + 2.16 e~T _ + + 8+~+~+8.64 + ~+~+ d J (78) where we expanded and occasionally substituted B for B -- 1. If we replace ( B / T ) 3 on the right-hand side of (71) by ( B / T ) ~ and numerically evaluate, we obtain n/B u:r/B(u,) < .271 (B -- 1 ) / T + 32.68 ( B / T ) 2, which implies the second inequality of (72). 12. Proof of Step 5 We will prove that H(T)/H(t) ifT>t> 1. We will show that H ( T ) / S ( T ) ( 1 -- ~- p,(1 -- p , ) r < S(T)/S(t) (79) _< H ( t ) / S ( t ) , that is, (1 -- (1 -- p , ) r ) _< ( 1 - - ~ p , ( 1 Assume for convenience that each p, is nonzero. - - p , ) t ) / ~ ( 1 ,-, -- (1 (80) 144 R. Let a, = I - (1 - p , ) ' B, = 1 - (1 - p~ f~, < FAGIN AND M.C. EASTON p,)T, 1 < i < n. T h e n (80) is e q u i v a l e n t to p, oq L e t a , ' = a, / 5 : °~ - l a j , = B, / Z ~ - 1 ~ , S t a t e m e n t (81) is e q u i v a l e n t to ~,. (81) 1 _< i _< ~. T h e n 5: a, ' = = 1. 5: p,~/ < 5: p,~,'. (82) N o w the function f, given by x --~ 5:~-1 p , x , , with domain ~-tuples ( x l , . . . , x,) with x~ > x2 _> . - . >_ x~, is a Schur function, since (O/Ox,)f = p , , 1 < i < n, and hence (d/Ox,)f > (O/Ox,)f iff p, > p~ iff ~ > .~ iff x, > xj . So, to prove (82), we need only show a' > ~'. (83) To p r o v e (83), we need the following lemma. I,EM~A. I f 0 < r~ < ~'~ < 1, the, (1 -- r,t)/(1 -- r~ t) > (1 -- r,~')/(1 -- r ~ ) . (84) PROOF. I t is easy to see t h a t (84) holds if r~ = 0; hence, assume rl > 0. Let f~ be the function x --~ (1 - rl~)/(1 - r2~), with domain [1, ~ ). We need only show t h a t f~ is m o n o t o n e decreasing, t h a t is, t h a t f~' < 0. A simple calculation shows t h a t f ' _< 0 iff rl ~ l o g r ~ / ( 1 - rl ~) > r2~ log r2/ (1 - r~x) for e a c h x > 1. (85) If we m u l t i p l y b o t h sides of (85) b y x, and let gl = r~", g2 = r ( , we obtain g~ log g~/(1 - g~) ~ g2 log g2/(1 - g~). (86) So we need only show t h a t (86) holds w h e n e v e r 0 g g~ _~ g2 < 1. Let sl = 1 s~ = 1 - g~. T h e n (86) becomes (1 - Sl)log(1 -- s~)/s~ ~_ (1 -- s2)log(1 -- s~)/s2. g~, (87) We need only show t h a t (87) holds w h e n e v e r 0 < s~ .~ Sl ( 1. Let f2 be the function x ~ (1 - x)log(1 - x ) / x , with domain (0, 1). W e m u s t show t h a t f~ is m o n o t o n e increasing. If we replace log(1 - x) by its T a y l o r expansion, t h e n ( f(z)=(1-~) = --(l--x)( -z-y--5-¥ .... l + x ~ + ~ x2 + ¥ + x' )/ ... ) which ~s clearly m o n o t o n e increasing. T h e l e m m a is now proved. To prove (83), we m u s t show k 5:~,' k >_ ~ , ' , i < ~ < ~. ( W e h a v e already n o t e d t h a t ~ , " - ~ a,' = ~ - ~ / ~ ' = 1.) f W e will first p r o v e a ~ ~_ /~1'. I n t h e l e m m a , i f w e l e t r ~ = 1 t h e n we get ~,/a,+~ _> ~ J ~ , + , , 1 _< ~ < n. (88) p , , r~ = 1 -- p,+~, (89) W e will now show t h a t (89) implies ~,' > ~,'. (9o) The Independence of Miss Ratio on Page S~ze For, find o-i, - • • , 0"n--1and "rl, T h e n (89) implies 145 • .,T,-lsuch t h a t a~+l = a , a , , B,+l = v,O,, 1 ~ i < n. 1 < ~ < ~. ow # Ogl = = = ,~U(~I + '" + (91) '~,,) a l / a l ( 1 -k- aa 4- ala2 "4- o'ia:aa q'- . ' . ) 1 / ( 1 + ol + o'1o2 + o'1o2o3 --P .. ) . (92) Similarly 8a' = 1/(1 + T1 "4- TiT2 • T1T2T3 -~ " ' " ). (93) T h e n (91), (92), and (93) imply (90). We will close b y showing t h a t al' -4- c~2' >_ ¢~1' "4- ~ ' . T h e other inequalities of (88) are proved very similarly. Let ~ = (al q- a~, ~3, a4, . - . , a , ) and ~ = (/31-1- t3~,/~,134, . . . , B , ) . W e w i l l first prove t h a t a,/&,+l 7> D,/D,+i (94) for e a c h / . This follows for ~ _> 2 b y (89). If i = l, then (94) says (al --k a 2 ) / a 3 _> (~1 + ~ ) / ~ 3 . (95) B u t (95) holds, since al/a~ _> Ol/f~ a n d a~/a~ ~_ t8~/~33, each b y an application of the lemma, just as in the proof of (89). Let ~ be the normalization of ~, t h a t is, ~ , = a , / ~ - i ~, • Similarly, define ~'. (94) implies ~'1 _> ~'1, (96) just as (89) implies (90). B u t (96) is equivalent to al' -4- a~' _> /~l' + B~', which was to be shown. 13. Summary We have shown t h a t the working-set miss ratio is insensitive to page size, in the indep e n d e n t (page) reference model. We argue t h a t this implies t h a t the insensitivity also holds in more realistic models. ACKNOWLEDGMENT. T h e authors wish to t h a n k A.C. M c K e l l a r for very helpful discussions about how our results relate to actual computing systems. REFERENCES (Note. References [8, 14] are not cited in the text.) 1. AHO,A Y., DENNING, P J., AND ULLMAN, J.D. Principles of optimal page replacement J. ACM 18, 1 (Jan. 1971), 80-93 2. ANACKER,W , AND WANG, C.P. Performance evaluation of computing systems with memory hmrarchies 1EEE Trans. Electrome Computers EC-16 (1967), 765-773. 3. BENNE~'T,B.T. Private communication 4. CHU, W.W., AND OPDERBECK, H Performance of replacement algorithms with different page sizes. Computer 7, 11 (Nov 1974), 14-21. 5. DENNING, P.J., AND SCHWARTZ,S.C Properties of the working-set model. Comm. ACM 16, 3 (March 1972), 191-198. ~. DENNING,P J., SAVAGE,J E., ANDSPIRN, J.R. Models for locality in program behavior Princeton U., Princeton, N.J., April 1972 EASTON, M.C. Model for interactive data base reference strings IBM J. Res. Develop. 19, 6 (1975), 550--556. EASTON, M.C , AND BENNETT, B T. On transient-free working-set statistics Res Rep. RC ~140, IBM Thomas J. Watson Research Center, Yorktown Hmghts, N.Y, Nov. 1974. 'ASTON, M.C., AND FAOIN, R. Cold-start vs. warm-start miss ratios and multiprogramming rformance. (To appear). 146 10 11. 12 13 14. 15. 16 17 18. 19 20 21 22. 23 R. FAGIN AND M.C. EASTON FAGIN,R. A counterintuitive example of computer paglng. Res. Rep. RC 5031, IBM Thomas J. Watson Research Center, Yorktown Heights, N Y , Aug. 1974; Comm ACM (to appear) FAGIN, R. Asymptotic m3ss ratios over independent references Res. Rep. RC 5415, IBM Thomas J Watson Research Center, Yorktown Heights, N Y., May 1975. GELENBE, E , TIBERIO, P , AND BOEKHOERST, J C A Page size m demand paging systems Acta Informatica $, 1 (1973), 1-23 HARDY,G.H , LITTLEWOOD, J.E , AND POLYA, G Inequahtles. Cambridge U Press, London, 1964 KING, W.F IiI Analysis of paging algorithms IFIP Conf Proc , Ljubljana, Yugoslavia, Aug 1971. KNOPP, K Infinite Seque~ces and Series. Dover, New York, 1956. KUCK,D J , AND LAWRIE, D H The use and performance of memory hierarchies. A survey. In Computer and Information Sciences II, J.T. Tou, E d , Academic Press, New York, 1969 KAPLAN, K R , AND WINDER, R.D. Cache-based computer systems. Computer 6, 3 (March 1973), 30-36 LEwis, P.A W , AND SHEDLER, G.S Emplmcally demved mmromodels for sequences of page exceptions IBM J. Res. Develop. 17, 2 (March 1973), 86-100 MATTSON, R., GECSEI, J , SLUTZ, D , AND TRAIGER, I. Evaluatmn techniques for storage hmrarchies. IBM Syst. J 9, 2 (1970), 78-117 MARSHALL,A.W, OLKIN, I , AND PROSCHAN, F. Monotomclty of ratios of means and other applications of majonzatmn. In Inequalities, D. Shmha, Ed., Academic Press, New York, 1967 SCHUR,I ~ber ein Klasse yon Mittlebildungen mit Anwendungen auf die Determinatentheorie Sitzber. Berl. Math. Ges 2~ (1923), 9-20. WIMBaOW, J H A large-scale interactive administrative system IBM Syst. J 10, 4 (1971), 260-282 YUE, P C., AND WONG, C K. On the optimality of the probability ranking scheme m storage applications. J ACM $0,4 (Oct 1973), 624-633 RECEIVED OCTOBER 1974, REVISED JULY 1975 Journal of the Amoclatlon for Computing Machinery, Vol, 23, No 1, January 1976