Paging and Page Replacement Policies
Paging and Page Replacement Policies
Paging and Page Replacement Policies
University of Mumbai
(Electronics Engineering)
By
1
Abstract
In a virtual memory environment the basic principle of program execution is
the adaptiveness of an Operating System environment to larger programs with
in the limitation of the addressable small primary memory. Page Replacement
algorithms play an important role in implementing this memory setting with an
aim to accomplish less page fault, high hit ratio and minimum overhead. Over
the years many page replacement algorithms were designed and proposed. As
the memory types and program designing approaches improved, the need for
betterment in algorithms existed as a need. This paper summarizes the tech-
niques and challenges behind the major traditional page replacement algorithms
and accounts on the various research outcomes in this area.
2
Contents
Abstract i
List of Tables ii
1 Basic Introduction 1 1
• Problem Definition . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
• Organization of Report . . . . . . . . . . . . . . . . . . . . . . . . . . 1
2 Conclusion 3
Bibliography 4
3
Chapter 1
Introduction
Main memory is an important and a very limited resource in a computer. Al-
though common amount of main memory has increased in the past decades,
so has its demand 1 . As processor and main memory get faster more rapidly
than secondary memories, the impact of swapping increases. The everyday im-
proving hardware has also underlined some bottlenecks in real life applications,
and created a need for further research. In this thesis we will first introduce
general concepts of virtual memory and paging, followed by proposed formal
models for page replacement algorithms and program behaviour. The theory
part ends with presenting well known basic page replacement algorithms, as
well as some more advanced ones, and describing their characteristics. In the
empirical part of this thesis, we will present implementation of nine algorithms
and compare their performance on generated virtual memory traces. Perfor-
mance comparison is done by using page fault rate charts and other page fault
metrics. The page fault rate chart shows the distribution of page faults. In the
real world examples Unix class of operating systems, mainly Linux, will be used
as a reference. The theory is not, however, tied to Unix, and many other oper-
ating systems implement memory management by using the same techniques.
It should also be noted, that page replacement problem is not specific for op-
erating system memory management, but is present also in various systems
using caches, such as databases and web proxies. This thesis, however, limits
its scope to the context of virtual memory management in general purpose op-
erating systems. Some algorithms are designed for databases and later adapted
for virtual memory. These are included because they contain important ideas
and provide a better view of the evolution of replacement algorithms.
4
Organization of report
Chapter 2 deals with
what is Paging and Frames?
Chapter 3 tells us the Page fault and Hit Ratio. We will also see Page Replace-
ment and different algorithms-
a)First in First Out (FIFO) b)Least Recently Used (LRU) c)Optimal Page
Replacement in chapter 4.
Chapter 2
Paging and Frames-
The operating system divides virtual address space into units called pages.
Main memory is also divided to fixed size units called page frames [16]. Each
used page can be either in secondary memory or in a page frame in main mem-
ory. Naturally neither of these memories is needed for the pages, in the virtual
address spaces of processes, that are not used. A paging algorithm is needed
to manage paging. A paging algorithm consists of three algorithms: placement
algorithm, fetch algorithm and replacement algorithm. The placement algo-
rithm is used to decide on which free page frame a page is placed. The fetch
algorithm decides on which page or pages are to be put in main memory. Fi-
nally, the page replacement algorithm decides on which page is swapped out.
Further, paging algorithms can be demand paging or prepaging. A demand
paging algorithm places a page to main memory only when it is needed, while a
prepaging algorithm attempts to guess which pages are needed next by placing
them to main memory before they are needed. In general cases, it is very dif-
ficult to make accurate guesses of page usage and demand paging is generally
accepted as a better choice. It can also be proved, that for certain constraints,
optimal paging algorithm is a demand paging algorithm. Exact constraints
and proof is given in [1]. A virtual address must be translated to corresponding
physical address before the memory can be used. As this address translation is
done with every memory reference, it is important that it is very fast. Usually
special hardware, called Memory Management Unit (MMU), is used to make
this translation. MMU uses virtualto-physical address mapping information,
5
located in operating systems page table, to make the translation. Each process
has its own virtual address space and therefore page tables are per process. If
the given virtual address is not mapped to main memory, the MMU traps the
operating system. This trap, called page fault, gives the operating system an
opportunity to bring the desired page from secondary memory to main memory,
and update to page table accordingly.
Because each process has its own virtual address space, the operating system
must keep track of all pages used by processes, location of each page, and some
per page information. When a page in main memory is referenced or written
to, it is marked accordingly. If all page frames are in use when a page fault
occurs, the operating system moves, or evicts, some page to secondary memory
to make room for the referenced page. As pointed earlier, the operating system
must also update the page table, as the MMU knows nothing about the pages
that are not in the main memory.
A page can be anonymous or file backed. Anonymous pages are used by program
code as a work area, namely heap, stack and global variables. File backed pages
are, as the name indicates, data read from a file in secondary memory. File
backed pages are not meant to be modified during execution (e.g. program
text). When operating system decides to evict a file backed page, it is not
necessary to 4 spend time writing it to secondary memory. The page already
has a valid copy of the data available, if it is needed again later. The page
frame used can just be marked as free, and used for another page. In case the
operating system decides to evict an anonymous page, it will have to check a
few things. If the page has never been in secondary memory, or it has been
modified while in main memory, it will have to be written to secondary memory
before the page frame can be reused . Such modified page is called dirty.
6
Chapter 3
Page fault and hit ratio-
A page fault typically occurs when a process references to a page that is not
marked ’present’ in main memory. Page faults can be classified in to major
page faults and minor page faults. In a minor page fault, the referenced page
is in main memory, but not yet marked ’present’, which means that there is no
disk IO required to handle to fault. Page faults are also caused by 1) reference
to address 0, 2) reference to an unmapped page, 3) user process referencing the
kernel area (remember that kernel areas are mapped to process address space!),
4) attempts to write to an read-only page and 5) attempts to execute code
from non-execute page. The page fault in case 1 is also called an invalid page
fault. Page faults in cases 2–5 are called protection faults and are, together with
the actual mappings, used to implement memory access rights. While access
rights are per page properties, they are usually maintained in larger memory
areas, segments. Read-only pages are useful in implementing copy-on-write.
Automatically growing stack is also implemented by using page faults in Linux.
When a process tries to access an address right next to the current stack limit,
the Linux kernel allocates a new page and maps it for stack usage.
Hit ratio =hits /(hits+misses) When a CPU refers to memory and find it in
cache than it is called hits. And if it is not found than it is called miss.So hit
ratio= the ratio of hits and the sum of hits and misses.
7
Chapter 4
Page replacement algorithms-
A page replacement algorithm can be either global or local. Local replacement
means that replacement candidates are searched only from pages, that belong
to the page faulted process. This effectively means that, if the working set of
8
the process does not fit to the memory reserved for it, the process will page
fault constantly. This generally leads to poor usage of memory resources. In
global page replacement all processes compete for the main memory and the
replacement algorithm automatically adapts to changing working set sizes of
running processes.
As in many algorithms, performance is the main goal in developing page re-
placement algorithms. There are many factors that affect the performance of
page replacement. As computer is implemented in physical hardware, the fea-
tures of the hardware change greatly the balance of different trade-offs. In
fact, it is the evolution of hardware that has made page replacement algorithm
research needed again . The amount of main memory has increased far more
than the speed of reading and writing disks (i.e. secondary memory). So if an
application has been swapped to a disk, the time to return it to main memory
has increased significantly because the 12 memory usage of applications has in-
creased more than the speed of disks. As a consequence, many operations users
normally do, like starting the computer, have become slower. This leads to an
important goal of minimizing disk IO and IO latency. Minimizing disk IO and
keeping IO latency low also frees disk capacity for actual application work, and
keeps the memory subsystem from disturbing latency sensitive applications, like
media players, too much.
FIFO-
9
often than necessary.
String 0 2 1 6 4 0 1 0 3 1 2
f1 0 0 0 6 6 6 1 1 1 1 1
f2 2 2 2 4 4 4 4 3 3 3
f3 1 1 1 0 0 0 0 0 2
F/H F F F F F F F H F H F
1) LRU-
Consider the string 7 0 1 2 0 3 0 4 2
The Least Recently Used (LRU) algorithm is based on generally noted
memory usage patterns of many programs. A page that is just used will
probably be used again very soon, and a page that has not been used for
a long time, will probably remain unused. LRU can be implemented by
keeping a sorted list of all pages in memory. The list is sorted by time when
the page was last used. This list is also called LRU stack . In practice
this means that on every clock tick the position of the pages, used during
that tick, must be updated. As a consequence, the implementation is very
expensive, and not practical in its pure form. Updating on every clock tick
is also an approximation, as it does not differentiate between two pages
that were referenced to during the same clock tick . ecently referenced and
ym the least recently referenced. LRU is relatively simple to implement
and it has constant space and time overhead for given memory. LRU uses
only page access recency as the base of the decision. As discussed ear-
lier, many common memory access patterns follow the principle of locality
and LRU does work well with these. Typically, workloads with strong 17
locality have the most page references with reuse distance small enough
to allow these pages to fit in the main memory . LRU is good also for
stack distances that have common stationary distribution . LRU is even
optimal when LRU stack depth distribution is assumed in page referenc-
ing . LRU adapts fast to changes in the working set and works well with
workloads with no clear large-scale access patterns . LRU is not without
problems. The LRU list, or stack, is a shared structure and must be pro-
tected from concurrent access. Especially in virtual memory, where it is
updated on every reference, it will severely limit the performance . LRU
10
also does not take page use frequency into account. If a frequently used
page has slightly larger inter reference interval (or reuse distance), than
can fit to main memory, the page will always be evicted. This also makes
LRU extremely vulnerable to scan access pattern. In general, LRU does
not work well with workloads that have clear large-scale access patterns
until the pages belonging to the pattern fit to the main memory . While
stack distances with common stationary distribution work well with LRU,
most real world programs have strong correlation among stack distances .
Non-uniform page referencing in general is also not handled well by LRU
and it also does not support programs following the phase-transition model
. When used as a global policy, program execution interaction may cause
programs pages to fall further in the LRU stack as a result of the program
itself page faulting, which is again the principle of LRU . In other words,
pages are falling undesirably low in LRU stack because the program’s vir-
tual time has stopped for the duration of the page fault. In LRU-k , the
LRU algorithm is improved by using the time of the kth most recent refer-
ence, instead of the most recent reference to page. This gives LRU a basic
scan resistance as a page must have been used twice recently in order to
be high in the LRU stack.
11
String 7 0 1 2 0 3 0 4 2 3 0
F1 7 7 7 2 2 2 2 4 4 4 0
F2 0 0 0 0 0 0 0 0 3 3
F3 1 1 1 3 3 3 2 2 2
F/H F F F F H F H F F F F
c) Optimal replacement –
Consider string 7 0 1 2 0 3 0 4 2 3 0 3 2
String 7 0 1 2 0 3 0 4 2 3 0 3 2
F1 7 7 7 2 2 2 2 2 2 2 2 2 2
F2 0 0 0 0 0 0 4 4 4 0 0 0
F3 1 1 1 3 3 3 3 3 3 3 3
F/H F F F F H F H F H H F H H
12
Challenges-
13
Conclusion
The evolution of replacement algorithms shows two clear trends. First, the
analyzing and proof of better performance has moved from mathematical
analysis to testing against real world program traces, and later even by
implementing prototypes on real operating systems. This trend shows how
difficult it is to mathematically model the memory behaviour of programs.
An important factor is also the large amount and easy availability of im-
portant programs. The other clear trend is the realization of the need for
workload adaption. As Denning noted in [5], the phase-transition model,
a good high level model for program behaviour, shows the importance of
the transition phase to the replacement algorithm. This implies that an
algorithm that handles the transition phase (i.e. adapts) better will have
better performance on real programs.
14
Bibliography
a) Engineers Garage- website
15