8086 MP
8086 MP
8086 MP
8086 Microprocessor
8086 Features
Data Registers Control Unit Address Registers Arithmetic Logic Unit Status Flags Memory
Address Bus
Index BP SP
AX
BH
BL
BX
SI
CH CL
CX
DI
DH
DL
DX
8086 Architecture
The 8086 has two parts, the Bus Interface Unit (BIU) and the Execution Unit (EU). The BIU fetches instructions, reads and writes data, and computes the 20-bit address. The EU decodes and executes the instructions using the 16-bit ALU. The BIU contains the following registers: IP - the Instruction Pointer CS - the Code Segment Register DS - the Data Segment Register SS - the Stack Segment Register ES - the Extra Segment Register The BIU fetches instructions using the CS and IP, written CS:IP, to contract the 20-bit address. Data is fetched using a segment register (usually the DS) and an effective address (EA) computed by the EU depending on the addressing mode.
The EU contains the following 16-bit registers: AX - the Accumulator BX - the Base Register CX - the Count Register DX - the Data Register SP - the Stack Pointer \ defaults to stack segment BP - the Base Pointer / SI - the Source Index Register DI - the Destination Register These are referred to as general-purpose registers, although, as seen by their names, they often have a special-purpose use for some instructions. The AX, BX, CX, and DX registers can be considers as two 8-bit registers, a High byte and a Low byte. This allows byte operations and compatibility with the previous generation of 8-bit processors, the 8080 and 8085. 8085 source code could be translated in 8086 code and assembled. The 8-bit registers are: AX --> AH,AL BX --> BH,BL CX --> CH,CL DX --> DH,DL
Registers
ES CS SS DS IP AX BX CX DX AH BH CH DH SP BP SI DI FLAGS AL BL CL DL
Extra Segment Code Segment Stack Segment Data Segment Instruction Pointer Accumulator Base Register Count Register Data Register Stack Pointer Base Pointer Source Index Register Destination Index Register
EU registers
8086/88 internal registers 16 bits (2 bytes each) AX, BX, CX and DX are two bytes wide and each byte can be accessed separately These registers are used as memory pointers.
Segment registers are used as base address for a segment in the 1 M byte of memory
Registers
Registers are in the CPU and are referred to by specific names Data registers
Hold data for an operation to be performed There are 4 data registers (AX, BX, CX, DX)
Address registers
Hold the address of an instruction or data element Segment registers (CS, DS, ES, SS) Pointer registers (SP, BP, IP) Index registers (SI, DI)
Status register
Keeps the current status of the processor On an IBM PC the status register is called the FLAGS register
Instructions execute faster if the data is in a register AX, BX, CX, DX are the data registers Low and High bytes of the data registers can be accessed separately
AH, BH, CH, DH are the high bytes AL, BL, CL, and DL are the low bytes
Data Registers are general purpose registers but they also perform special functions AX
Accumulator Register Preferred register to use in arithmetic, logic and data transfer instructions because it generates the shortest Machine Language Code Must be used in multiplication and division operations Must also be used in I/O operations
BX
Base Register Also serves as an address register
CX
Count register Used as a loop counter Used in shift and rotate operations
DX
Data register Used in multiplication and division Also used in I/O operations
Contain the offset addresses of memory locations Can also be used in arithmetic and other operations SP: Stack pointer
Used with SS to access the stack segment
The SI and the DI registers may also be used to access data stored in arrays
Are Address registers Store the memory addresses of instructions and data Memory Organization
Each byte in memory has a 20 bit address starting with 0 to 220-1 or 1 meg of addressable memory Addresses are expressed as 5 hex digits from 00000 - FFFFF Problem: But 20 bit addresses are TOO BIG to fit in 16 bit registers! Solution: Memory Segment
Block of 64K (65,536) consecutive memory bytes A segment number is a 16 bit number Segment numbers range from 0000 to FFFF Within a segment, a particular memory location is specified with an offset
Segmented Memory
Segmented memory addressing: absolute (linear) address is a combination of a 16-bit segment value added to a 16-bit offset
F0000 E0000 D0000 C0000 B0000 A0000 90000 80000 70000 60000 50000 40000 30000 20000 10000 00000 seg ofs 8000:0000
0250
8000:FFFF
one segment
8000:0250
Intel
The BIU has a dedicated adder for determining physical memory addresses
Offset Value (16 bits)
0000
Adder
Intel
Segment:Offset Address
Logical Address is specified as segment:offset Physical address is obtained by shifting the segment address 4 bits to the left and adding the offset address Thus the physical address of the logical address A4FB:4872 is
A4FB0 + 4872 A9822
Your turn . . .
What linear address corresponds to the segment/offset address 028F:0030?
Your turn . . .
What segment addresses correspond to the linear address 28F30h?
Many different segment-offset addresses can produce the linear address 28F30h. For example: 28F0:0030, 28F3:0000, 28B0:0430, . . .
CS:
0400H IP 0056H
4000H 4056H
The offset is the distance in bytes from the start of the segment. The offset is given by the IP for the Code Segment. Instructions are always fetched with using the CS register.
The physical address is also called the absolute address.
DS:
05C0 SI 0050
05C0 0050
05C50H
0FFFFFH
Data is usually fetched with respect to the DS register. The effective address (EA) is the offset. The EA depends on the addressing mode.
SS:
0A00 SP 0100
The offset is given by the SP register. The stack is always referenced with respect to the stack segment register. The stack grows toward decreasing memory locations. The SP points to the last or top item on the stack. PUSH - pre-decrement the SP POP - post-increment the SP
Flags
Overflow Direction Interrupt enable Trap 6 are status flags 3 are control flag Sign Zero
Flag Register
Conditional flags: They are set according to some results of arithmetic operation. You do not need to alter the value yourself. Control flags: Used to control some operations of the MPU. These flags are to be set by you in order to achieve some specific purposes.
Flag Bit no. 15 14 13 12 O 11 D 1 0 I 9 T 8 S 7 Z 6 5 A 4 3 P 2 1 C 0
CF (carry) Contains carry from leftmost bit following arithmetic, also contains last bit from a shift or rotate operation.
Flag Register
OF (overflow) Indicates overflow of the leftmost bit during arithmetic. DF (direction) Indicates left or right for moving or comparing string data. IF (interrupt) Indicates whether external interrupts are being processed or ignored. TF (trap) Permits operation of the processor in single step mode.
SF (sign) Contains the resulting sign of an arithmetic operation (1=negative) ZF (zero) Indicates when the result of arithmetic or a comparison is zero. (1=yes) AF (auxiliary carry) Contains carry out of bit 3 into bit 4 for specialized arithmetic. PF (parity) Indicates the number of 1 bits that result from an operation.
Addressing modes
Register and immediate modes we have already seen MOV AX,1 MOV BX,AX register immediate
DI
56 02 BX
0256
34 12
IV-1
IV-2
IV-4
Memory
Terminology Volatile - data is lost when power is turned off. Nonvolatile - retains data when powered off. Random Access - all data locations accessed in the same amount of time. Sequential Access - data accessed in varying amounts of time, e.g., tape. ROM - Read Only Memory. RAM - Random Access Memory By convention, RAM in a PC is really Read/Write Memory and ROM (EPROM) in a PC, although random access memory, is not referred to as RAM. Examples VOLATILE Static RAM Dynamic RAM
NONVOLATILE ROM, PROM, EPROM, EEPROM, FLASH Disk, tape Magnetic core, magnetic bubble
IV-6
Latch
A0 MEM*
A(11-1) D(15-8)
hi byte (odd)
47
What Is An Assembler?
Systems Level Program
translates assembly language source code to machine language
object file - contains machine instructions, initial data, and information used when loading the program listing file - contains a record of the translation process, line numbers, addresses, generated code and data, and a symbol table
Data Representation
Binary 0-1
represents the state of electronic components used in computer systems
Word - 16 Bits
Each architecture may define its own wordsize
Numbering Systems
Binary - Base 2
0, 1
Octal - Base 8
0, 1, 2, 7
Decimal - Base 10
0, 1, 2, , 9
Hexadecimal (Hex)
0, 1, , 9, A, B, , F
Decoding a Byte
Raw
01010000b
Machine Instruction
Push AX
Hex
50h
Octal
1208
Integer
80 (eighty)
Decimal
80d
BCD
50 (fifty)
Machine Language
A language of numbers, called the Processors Instruction Set
The set of basic operations a processor can perform
Each instruction is coded as a number Instructions may be one or more bytes Every number corresponds to an instruction
Assembler Directives
Operator
ax, bx
Operand[s]
;Comment
Not case sensitive Unlike other assemblers, destination operand is first mov is the mnemonic that the assembler translates into an opcode
hello
.ASM - Assembler source file prepared by programmer .OBJ - Translated source file by assembler .LST - Listing file, documents Translation process Errors, Addresses, Symbols, etc .CRF Cross reference file
Assembler Directives
end label proc far|near end of program, label is entry point begin a procedure; far, near keywords specify if procedure in different code segment (far), or same code segment (near) end of procedure set a page format for the listing file title of the listing file mark start of code segment mark start of data segment set size of stack segment
Assembler Directives
db dw dd dq dt equ Examples: db 100 dup (?) db Hello maxint equ count equ define 100 bytes, with no initial values for bytes define 5 bytes, ASCII equivalent of Hello. 32767 10 * 20 ; calculate a value (200) define byte define word (2 bytes) define double word (4 bytes) define quadword (8 bytes) define tenbytes equate, assign numeric expression to a name
Program Example
;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;; ; ; ; This is an example program. It prints the ; ; character string "Hello World" to the DOS standard output ; ; using the DOS service interrupt, function 9. ; ; ; ;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;; hellostk SEGMENT BYTE STACK 'STACK' ;Define the stack segment DB 100h DUP(?) ;Set maximum stack size to 256 bytes (100h) hellostk ENDS hellodat dos_print strng hellodat hellocod START: SEGMENT BYTE 'DATA' ;Define the data segment EQU 9 ;define a constant via EQU DB 'Hello World',13,10,'$' ;Define the character string ENDS SEGMENT BYTE 'CODE' ;Define mov ax, SEG hellodat mov ds, ax mov ah, dos_print mov dx,OFFSET strng int 21h mov ax, 4c00h int 21h ENDS END START the Code segment ;ax <-- data segment start address ;ds <-- initialize data segment register ;ah <-- 9 DOS 21h string function ;dx <-- beginning of string ;DOS service interrupt ;ax <-- 4c DOS 21h program halt function ;DOS service interrupt ; END label defines program entry
hellocod
hellocod END
.code START: mov mov mov mov int mov int ax, SEG strng ds, ax ah, dos_print dx,OFFSET strng 21h ax, 4c00h 21h ;ax <-- data segment start address ;ds <-- initialize data segment register ;ah <-- 9 DOS 21h string function ;dx <-- beginning of string ;DOS service interrupt ;ax <-- 4c DOS 21h program halt function ;DOS service interrupt
END
START
Program Statements
name operation operand(s) comment
Space or tab separates initial fields Comments begin with semicolon Most assemblers are not case sensitive
Defining Data
Numeric data values
100 - decimal 100B - binary 100H - hexadecimal '100' - ASCII "100" - ASCII
ANum refers to a byte storage location, initialized to FCh The next word has no associated name ONE and UNO refer to the same word X is an unitialized doubleword
Arrays
Any consecutive storage locations of the same size can be called an array
X DW 40CH,10B,-13,0 Y DB 'This is an array' Z DD -109236, FFFFFFFFH, -1, 100B Components of X are at X, X+2, X+4, X+8 Components of Y are at Y, Y+1, , Y+15 Components of Z are at Z, Z+4, Z+8, Z+12
DUP
Allows a sequence of storage locations to be defined or reserved Only used as an operand of a define directive
DB 40 DUP (?) DW 10h DUP (0) DB 3 dup ("ABC")
Word Storage
Word, doubleword, and quadword data are stored in reverse byte order (in memory)
Directive DW 256 DD 1234567H DQ 10 X DW 35DAh Bytes in Storage 00 01 67 45 23 01 0A 00 00 00 00 00 00 00 DA 35
Named Constants
Symbolic names associated with storage locations represent addresses Named constants are symbols created to represent specific values determined by an expression Named constants can be numeric or string Some named constants can be redefined No storage is allocated for these values
EQU Directive
name EQU expression
expression can be string or numeric Use < and > to specify a string EQU these symbols cannot be redefined later in the program sample EQU 7Fh aString EQU <1.234> message EQU <This is a message>
reg can be any nonsegment register except IP cannot be the target register MOV's between a segment register and memory or a 16-bit register are possible
When a variable is created with a define directive, it is assigned a default size attribute (byte, word, etc) You can assign a size attribute using LABEL
LoByte LABEL BYTE aWord DW 97F2h
eXCHanGe
XCHG target, source
reg, reg reg, mem mem, reg
Arithmetic Instructions
ADD dest, source SUB dest, source INC dest DEC dest NEG dest Operands must be of the same size source can be a general register, memory location, or constant dest can be a register or memory location
except operands cannot both be memory
Stack Segment
used to set aside storage for the stack Stack addresses are computed as offsets into this segment
Code Segment
contains executable instructions
Segment directives
.data .code .stack size
Memory Models
.Model memory_model
tiny: code+data <= 64K (.com program) small: code<=64K, data<=64K, one of each medium: data<=64K, one data segment compact: code<=64K, one code segment large: multiple code and data segments huge: allows individual arrays to exceed 64K flat: no segments, 32-bit addresses, protected mode only (80386 and higher)
Program Skeleton
.model small .stack 100H .data ;declarations .code main proc ;code main endp ;other procs end main Select a memory model Define the stack size Declare variables Write code
organize into procedures